Antonin RAFFIN
a1b5ea67ae
Multiprocessing support for off-policy algorithms (#50)
* TQC support for multienv
* Add optional layer norm for TQC
* Add layer norm for all policies
* Revert "Add layer norm for all policies"
This reverts commit 1306c3c64eb12613464982c66cb416a3bbc66285.
* Revert "Add optional layer norm for TQC"
This reverts commit 200222e3a8878007aa6032d540ae74274a4d0788.
* Add experimental support to train off-policy algorithms with multiple envs
* Bump version
* Update version
2021-12-02 10:40:21 +01:00
Long M. Lưu (刘明龙)
fab19bdb18
Update small QR-DQN docs typo (#33)
* Update qrdqn.rst
* Update changelog.rst
* Update changelog.rst
Add my name
* Update changelog.rst
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-06-23 14:34:22 +02:00
Antonin RAFFIN
3665695d1e
Dictionary Observations (#29)
* Add TQC support for new HER version
* Add dict obs support
* Add support for dict obs
2021-05-11 13:24:31 +02:00
Toshiki Watanabe
4b4d487fdb
Fix the target calculation of QR-DQN (#18)
* Fix the target calculation of QR-DQN
* Update doc
* Update version
* Update changelog
* Update README
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-01-11 14:11:16 +01:00
Toshiki Watanabe
b30397fff5
Add QR-DQN (#13)
* Add QR-DQN (WIP)
* Update docstring
* Add quantile_huber_loss
* Fix typo
* Remove unnecessary lines
* Update variable names and comments in quantile_huber_loss
* Fix mutable arguments
* Update variable names
* Ignore import not used warnings
* Fix default parameter of optimizer in QR-DQN
* Update quantile_huber_loss to have more reasonable interface
* Update tests
* Add assertion to quantile_huber_loss
* Update variable names of quantile regression
* Update comments
* Reduce the number of quantiles during test
* Update comment
* Update quantile_huber_loss
* Fix isort
* Add QR-DQN documentation without results
* Update docs
* Fix bugs
* Update doc
* Add comments about shape
* Minor edits
* Update comments
* Add benchmark
* Doc fixes
* Update doc
* Bug fix in saving/loading + update tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-12-21 11:17:48 +01:00