Skip to content

Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning

License

Notifications You must be signed in to change notification settings

kharyal/cogment-verse

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cogment Verse

Apache 2 License Changelog

Cogment is an innovative open source AI platform designed to leverage the advent of AI to benefit humankind through human-AI collaboration developed by AI Redefined. Cogment enables AI researchers and engineers to build, train and operate AI agents in simulated or real environments shared with humans. For the full user documentation visit https://docs.cogment.ai

🚧 This repository is under construction, it propose a library of environments and agent implementations to get started with Human In the Loop Learning (HILL) and Reinforcement Learning (RL) with Cogment in minutes. Cogment Verse is designed both for practionners discovering the field as well as for experienced researchers or engineers as an framework to develop and benchmark new approaches.

Cogment verse includes environments from:

Documentation table of contents

Getting started

Setup

  1. Install docker
  2. Install docker-compose (⚠️ you'll need version > 1.29.2 for this project)
  3. Install cogment (⚠️ version >= 2.0.0 is required)
  4. Clone this repository

Copy the shared definitions

After a fresh close or whenever either the cogment.yaml or any protobuf file in the root directory is changed, you need to copy those changes to the different services source directories. This is achieved with the following.

cogment run copy

Start development auto-reload mode

Cogment verse can be started in development mode where the services restart whenever a source is edited without needing to restart the docker images. It can be started with the following

cogment run dev

🚧 In this mode, changes to the source files in the shared base_dev directory won't be reflected in the running services until you re-start cogment run dev.

To be able to use the client properly, you'll need to build it whenever something changes using

cogment run build_client

Troubleshooting

This project is using rather large libraries such as tensorflow and pytorch, because of that the build might fail if docker doesn't have access to sufficient memory.

Build production images

cogment run build

Build the GPU versions

cogment run build_gpu

Start the production images

cogment run start

Start the GPU versions

cogment run start_gpu

Start and stop runs

Once the services are running in either production or development mode, you can launch a run with the following command

RUN_PARAMS=cartpole_dqn cogment run start_run

The available RUN_PARAMS are defined in run_params.yaml. You can add new parameters as you wish as long as the environments and agents are supported.

To list ongoing runs you can run

cogment run list_runs

To terminate a given run you can run

RUN_ID=angry_gould cogment run terminate_run

Ongoing run identifiers to define RUN_ID can be retrieved by listing the ongoing runs with cogment run list_runs

Run monitoring

You can monitor ongoing run using mlflow. By default a local instance of mlflow is started by cogment-verse and is accessible at http://localhost:3000.

Human player

Some of the availabe run involve a human player, for example benchmark_lander_hill enables a human player to momentarily take control of the lunar lander to help the AI agents during the training process.

Then start the run

RUN_PARAMS=benchmark_lander_hill cogment run start_run

Access the playing interface by navigating to http://localhost:8080

The Play run

The play is a run that is used to test any agent in an environment. The run is started by

RUN_PARAMS=play cogment run start_run

It can be configured with the following parameters (to change in run_params.yaml):

play:
  implementation: play
  config:
    class_name: data_pb2.PlayRunConfig
    # Set to true to have the ability to observe the run in the web client
    observer: true
    # Number of trials to run
    trial_count: 10
    environment:
      # Reference one of the environment specs defined at the top of `run_params.yaml`
      specs: *cartpole_specs
    actors:
    # Configure the players (only the first ones are used, up to the number of required players)
    ## follows the `cogment_verse.ActorParams` datastructure
    - name: agent_1
      actor_class: agent
      # Select the implementation to use
      implementation: random
      agent_config:
        # Define the agent config here
        ## follows the `cogment_verse.AgentConfig` datastructure
        ## Make sure that the selected model is compatible with the selected implementation
        model_id: compassionate_aryabhata_model
        model_version: -1
    - name: agent_2
      actor_class: agent
      implementation: random

Debug

Resources monitoring

Cogment verse comes with prometheus, in /prometheus, and grafana, in /grafana, services to facilitate the monitoring of the cluster.

When running with the default cogment run start, the grafana dashboard can be accessed at http://localhost:3001.

Profiling

Steps

  • Add viztracer to python requirements.txt
  • Modify docker-compose override
    • Add a mount for the profile results json/html file
    • Change the entrypoint of the service viztracer --output_file /output/results.html script.py
  • Rebuild and run jobs

Acknowledgements

The subdirectories /tf_agents/cogment_verse_tf_agents/third_party and /torch_agents/cogment_verse_torch_agents/third_party contains code from third party sources

About

Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 90.2%
  • TypeScript 5.1%
  • Dockerfile 1.8%
  • Shell 1.3%
  • CSS 0.7%
  • JavaScript 0.5%
  • HTML 0.4%