Features

1. Reproducible

We provide implementations that stably reproduce the results of many influential reinforcement learning algorithms.

2. Large Scale

Supports high-performance parallel training across thousands of CPUs and multiple GPUs.

3. Reusable

Algorithms provided in the repository can be adapted directly to new tasks: you define a forward network, and the training mechanism is built automatically.
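To make the idea concrete, here is a minimal, framework-independent sketch of "define only the forward network and get training for free". Every name in it (`forward`, `make_train_step`, the learning rate and step size) is hypothetical and chosen for illustration; it does not show the framework's actual API. Gradients are approximated with central finite differences as a stand-in for the automatic differentiation a real framework would use.

```python
from typing import Callable, List, Sequence, Tuple

Params = List[float]
ForwardFn = Callable[[Params, Sequence[float]], float]


def forward(params: Params, obs: Sequence[float]) -> float:
    """User-defined forward network (hypothetical): one linear layer."""
    return sum(w * x for w, x in zip(params, obs))


def make_train_step(
    forward_fn: ForwardFn, lr: float = 0.05, eps: float = 1e-6
) -> Callable[[Params, Sequence[float], float], Tuple[Params, float]]:
    """Build a training step from the forward function alone.

    Assumption: a squared-error loss and finite-difference gradients,
    used here only so the sketch needs no external libraries.
    """

    def loss(params: Params, obs: Sequence[float], target: float) -> float:
        return (forward_fn(params, obs) - target) ** 2

    def train_step(params: Params, obs: Sequence[float], target: float):
        grads = []
        for i in range(len(params)):
            up = params[:i] + [params[i] + eps] + params[i + 1:]
            down = params[:i] + [params[i] - eps] + params[i + 1:]
            # Central difference approximation of d(loss)/d(params[i]).
            grads.append((loss(up, obs, target) - loss(down, obs, target)) / (2 * eps))
        new_params = [p - lr * g for p, g in zip(params, grads)]
        return new_params, loss(params, obs, target)

    return train_step


# The user supplies only `forward`; the update rule is generated from it.
train_step = make_train_step(forward)
params = [0.0, 0.0]
for _ in range(200):
    params, loss_value = train_step(params, [1.0, 2.0], 3.0)
```

Adapting the sketch to a new task means swapping in a different `forward` function; `make_train_step` stays unchanged.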

4. Extensible

Build new algorithms quickly by inheriting from the framework's abstract class.
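The inheritance pattern can be sketched as follows. The base class `Algorithm` and its `predict`/`learn` methods are assumptions made for illustration, not the framework's actual interface; the subclass is a simple epsilon-greedy bandit, picked only because it fits in a few lines.

```python
import random
from abc import ABC, abstractmethod


class Algorithm(ABC):
    """Hypothetical abstract base class; method names are illustrative."""

    @abstractmethod
    def predict(self, obs):
        """Choose an action for the given observation."""

    @abstractmethod
    def learn(self, obs, action, reward):
        """Update internal estimates from one transition."""


class EpsilonGreedyBandit(Algorithm):
    """A new algorithm only has to fill in the abstract methods."""

    def __init__(self, n_actions: int, epsilon: float = 0.1, seed: int = 0):
        self.values = [0.0] * n_actions   # running mean reward per action
        self.counts = [0] * n_actions     # number of times each action was tried
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def predict(self, obs=None) -> int:
        # Explore with probability epsilon, otherwise act greedily.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        return max(range(len(self.values)), key=self.values.__getitem__)

    def learn(self, obs, action: int, reward: float) -> None:
        # Incremental update of the mean reward for the chosen action.
        self.counts[action] += 1
        self.values[action] += (reward - self.values[action]) / self.counts[action]


# With epsilon=0.0 the agent is purely greedy, so the result is deterministic.
agent = EpsilonGreedyBandit(n_actions=3, epsilon=0.0)
agent.learn(None, action=1, reward=1.0)
action = agent.predict(None)
```

Because `Algorithm` declares its methods with `@abstractmethod`, instantiating it directly, or a subclass that leaves either method unimplemented, raises `TypeError`, so an incomplete algorithm fails at construction rather than partway through training.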