Fast and simple implementation of RL algorithms, designed to run fully on GPU.
Go to file
Lukas Schneider 96dd4929c5 added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
docs added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
examples added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
licenses/dependencies Updates RSL-RL to version 2.0 (#14) 2023-11-02 00:43:02 +01:00
rsl_rl added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
tests added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
.flake8 Updates RSL-RL to version 2.0 (#14) 2023-11-02 00:43:02 +01:00
.gitignore added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
.pre-commit-config.yaml Updates RSL-RL to version 2.0 (#14) 2023-11-02 00:43:02 +01:00
CONTRIBUTORS.md added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
LICENSE Updates RSL-RL to version 2.0 (#14) 2023-11-02 00:43:02 +01:00
README.md added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00
pyproject.toml Updates RSL-RL to version 2.0 (#14) 2023-11-02 00:43:02 +01:00
setup.py added rewrite of rsl_rl for supporting additional algorithms 2023-12-12 18:32:21 +01:00

README.md

RSL RL

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

Currently, the following algorithms are implemented:

  • Distributed Distributional DDPG (D4PG)
  • Deep Deterministic Policy Gradient (DDPG)
  • Distributional PPO (DPPO)
  • Distributional Soft Actor Critic (DSAC)
  • Proximal Policy Optimization (PPO)
  • Soft Actor Critic (SAC)
  • Twin Delayed DDPG (TD3)

Maintainer: David Hoeller, Nikita Rudin
Affiliation: Robotic Systems Lab, ETH Zurich & NVIDIA
Contact: rudinn@ethz.ch

Installation

To install the package, run the following command in the root directory of the repository:

$ pip3 install -e .

Examples can be run from the examples/ directory. The example directory also include hyperparameters tuned for some gym environments. These are automatically loaded when running the example. Videos of the trained policies are periodically saved to the videos/ directory.

$ python3 examples/example.py

To run gym mujoco environments, you need a working installation of the mujoco simulator and mujoco_py.

Tests

The repository contains a set of tests to ensure that the algorithms are working as expected. To run the tests, simply execute:

$ cd tests/ && python -m unittest

Documentation

To generate documentation, run the following command in the root directory of the repository:

$ pip3 install sphinx sphinx-rtd-theme
$ sphinx-apidoc -o docs/source . ./examples
$ cd docs/ && make html

Contribution Guidelines

We use black formatter for formatting the python code. You should configure black with VSCode or you can manually format files with:

$ pip install black
$ black --line-length 120 .