Update wording and links

This commit is contained in:
Antonin RAFFIN 2020-10-17 17:04:00 +02:00
parent 00f9d26d55
commit 926e488196
6 changed files with 37 additions and 25 deletions

View File

@ -44,6 +44,7 @@ Traceback (most recent call last): File ...
**System Info** **System Info**
Describe the characteristic of your environment: Describe the characteristic of your environment:
* Describe how the library was installed (pip, docker, source, ...) * Describe how the library was installed (pip, docker, source, ...)
* Stable-Baselines3 and sb3-contrib versions
* GPU models and configuration * GPU models and configuration
* Python version * Python version
* PyTorch version * PyTorch version

View File

@ -1,4 +1,4 @@
## Release 0.9.0a2 (WIP) ## Release 0.10.0a0 (WIP)
### Breaking Changes ### Breaking Changes

View File

@ -2,7 +2,7 @@
This contrib repository is designed for experimental implementations of various This contrib repository is designed for experimental implementations of various
parts of reinforcement training so that others may make use of them. This includes full parts of reinforcement training so that others may make use of them. This includes full
training algorithms, different tools (e.g. new environment wrappers, RL algorithms, different tools (e.g. new environment wrappers,
callbacks) and extending algorithms implemented in stable-baselines3. callbacks) and extending algorithms implemented in stable-baselines3.
**Before opening a pull request**, open an issue discussing the contribution. **Before opening a pull request**, open an issue discussing the contribution.
@ -10,7 +10,7 @@ Once we agree that the plan looks good, go ahead and implement it.
Contributions and review focuses on following three parts: Contributions and review focuses on following three parts:
1) **Implementation quality** 1) **Implementation quality**
- Performance of the training algorithms should match what proposed authors reported (if applicable). - Performance of the RL algorithms should match the one reported by the original authors (if applicable).
- This is ensured by including a code that replicates an experiment from the original - This is ensured by including a code that replicates an experiment from the original
paper or from an established codebase (e.g. the code from authors), as well as paper or from an established codebase (e.g. the code from authors), as well as
a test to check that implementation works on program level (does not crash). a test to check that implementation works on program level (does not crash).
@ -73,7 +73,7 @@ Along with the code, PR **must** include the following:
correctly. This this is **not** for testing e.g. training performance of a learning algorithm, and correctly. This this is **not** for testing e.g. training performance of a learning algorithm, and
should be relatively quick to run. should be relatively quick to run.
Below is a template for documentation for full training algorithms. Below is a template for documentation for full RL algorithms.
```rst ```rst
[Feature/Algorithm name] [Feature/Algorithm name]

View File

@ -5,7 +5,7 @@ pytest:
./scripts/run_tests.sh ./scripts/run_tests.sh
type: type:
pytype pytype -j auto
lint: lint:
# stop the build if there are Python syntax errors or undefined names # stop the build if there are Python syntax errors or undefined names

View File

@ -1,31 +1,38 @@
<img src="docs/\_static/img/logo.png" align="right" width="40%"/>
[![CI](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/workflows/CI/badge.svg)](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/actions) [![codestyle](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black) [![CI](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/workflows/CI/badge.svg)](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/actions) [![codestyle](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
# Stable-Baselines3 - Contrib # Stable-Baselines3 - Contrib (SB3-Contrib)
Contrib package for [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3) - Experimental code. Contrib package for [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3) - Experimental reinforcement learning (RL) code.
"sb3-contrib" for short. "sb3-contrib" for short.
A place for training algorithms and tools that are considered experimental, e.g. implementations of the latest ### What is SB3-Contrib?
publications. Goal is to keep the simplicity, documentation and style of stable-baselines3 but for less matured
implementations.
Why create this repository? Over the span of stable-baselines and stable-baselines3, the community has been eager A place for RL algorithms and tools that are considered experimental, e.g. implementations of the latest publications. Goal is to keep the simplicity, documentation and style of stable-baselines3 but for less matured implementations.
to contribute in form of better logging utilities, environment wrappers, extended support (e.g. different action spaces)
and learning algorithms. However sometimes these utilities were too niche to be considered for stable-baselines or ### Why create this repository?
proved to be too difficult to integrate well into existing code without a mess. sb3-contrib aims to fix this by
not requiring the neatest code integration with existing code and not setting limits on what is too niche: almost everything Over the span of stable-baselines and stable-baselines3, the community has been eager to contribute in form of better logging utilities, environment wrappers, extended support (e.g. different action spaces) and learning algorithms.
remotely useful goes! We hope this allows to extend the known quality of stable-baselines style and documentation beyond
the relatively small scope of utilities of the main repository. However sometimes these utilities were too niche to be considered for stable-baselines or
proved to be too difficult to integrate well into existing code without a mess. sb3-contrib aims to fix this by not requiring the neatest code integration with existing code and not setting limits on what is too niche: almost everything remotely useful goes! We hope this allows to extend the known quality of stable-baselines style and documentation beyond the relatively small scope of utilities of the main repository.
## Features ## Features
See documentation for the full list of included features. See documentation for the full list of included features.
**Training algorithms**: **RL Algorithms**:
- [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269) - [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269)
<!-- TODO: uncomment when the repo is public -->
<!-- ## Documentation
Documentation is available online: [https://sb3-contrib.readthedocs.io/](https://sb3-contrib.readthedocs.io/) -->
## Installation ## Installation
**Note:** You need the `master` version of [Stable Baselines3](https://github.com/DLR-RM/stable-baselines3/). **Note:** You need the `master` version of [Stable Baselines3](https://github.com/DLR-RM/stable-baselines3/).
@ -40,6 +47,10 @@ Install Stable Baselines3 - Contrib using pip:
pip install git+https://github.com/Stable-Baselines-Team/stable-baselines3-contrib pip install git+https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
``` ```
## How To Contribute
If you want to contribute, please read [**CONTRIBUTING.md**](./CONTRIBUTING.md) guide first.
## Citing the Project ## Citing the Project

View File

@ -6,7 +6,7 @@
Welcome to Stable Baselines3 Contrib docs! Welcome to Stable Baselines3 Contrib docs!
========================================== ==========================================
Contrib package for `Stable Baselines3 <https://github.com/DLR-RM/stable-baselines3>`_ - Experimental code. Contrib package for `Stable Baselines3 (SB3) <https://github.com/DLR-RM/stable-baselines3>`_ - Experimental code.
Github repository: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib Github repository: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
@ -64,7 +64,7 @@ To cite this project in publications:
Contributing Contributing
------------ ------------
If you want to contribute, please read `CONTRIBUTING.md <https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md>`_ first. If you want to contribute, please read `CONTRIBUTING.md <https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/CONTRIBUTING.md>`_ first.
Indices and tables Indices and tables
------------------- -------------------