stable-baselines3-contrib-sacd/docs
Sean Gillen 675304d8fa
Augmented Random Search (ARS) (#42)
* first pass at ars, replicates initial results, still needs more testing, cleanup

* add a few docs and tests, bugfixes for ARS

* debug and comment

* break out dump logs

* rollback so there are now predict workers, some refactoring

* remove callback from self, remove torch multiprocessing

* add module docs

* run formatter

* fix load and rerun formatter

* rename to less mathy variable names, rename _validate_hypers

* refactor to use evaluatate_policy, linear policy no longer uses bias or squashing

* move everything to torch, add support for discrete action spaces, bugfix for alive reward offset

* added tests, passing all of them, add support for discrete action spaces

* update documentation

* allow for reward offset when there are multiple envs

* update results again

* Reformat

* Ignore unused imports

* Renaming + Cleanup

* Experimental multiprocessing

* Cleaner multiprocessing

* Reformat

* Fixes for callback

* Fix combining stats

* 2nd way

* Make the implementation cpu only

* Fixes + POC with mp module

* POC Processes

* Cleaner aync implementation

* Remove unused arg

* Add typing

* Revert vec normalize offset hack

* Add `squash_output` parameter

* Add more tests

* Add comments

* Update doc

* Add comments

* Add more logging

* Fix TRPO issue on GPU

* Tmp fix for ARS tests on GPU

* Additional tmp fixes for ARS

* update docstrings + formatting, fix bad exceptioe string in ARSPolicy

* Add comments and docstrings

* Fix missing import

* Fix type check

* Add dosctrings

* GPU support, first attempt

* Fix test

* Add missing docstring

* Typos

* Update defaults hyperparameters

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-01-18 13:57:27 +01:00
..
_static Add TQC (#4) 2020-10-22 13:43:46 +02:00
common Add Trust Region Policy Optimization (TRPO) (#40) 2021-12-29 11:58:03 +01:00
guide Augmented Random Search (ARS) (#42) 2022-01-18 13:57:27 +01:00
images PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
misc Augmented Random Search (ARS) (#42) 2022-01-18 13:57:27 +01:00
modules Augmented Random Search (ARS) (#42) 2022-01-18 13:57:27 +01:00
Makefile Add base doc 2020-10-12 20:21:52 +02:00
README.md Review docs and update changelog 2020-10-15 02:17:36 +03:00
conda_env.yml Drop python 3.6 support (#55) 2021-12-06 12:59:53 +01:00
conf.py Add base doc 2020-10-12 20:21:52 +02:00
index.rst Augmented Random Search (ARS) (#42) 2022-01-18 13:57:27 +01:00
make.bat Add base doc 2020-10-12 20:21:52 +02:00
spelling_wordlist.txt Augmented Random Search (ARS) (#42) 2022-01-18 13:57:27 +01:00

README.md

Stable Baselines3 Contrib Documentation

This folder contains documentation for the RL baselines contribution repository.

Build the Documentation

Install Sphinx and Theme

pip install sphinx sphinx-autobuild sphinx-rtd-theme

Building the Docs

In the docs/ folder:

make html

if you want to building each time a file is changed:

sphinx-autobuild . _build/html