Antonin RAFFIN
a1b5ea67ae
Multiprocessing support for off policy algorithms ( #50 )
...
* TQC support for multienv
* Add optional layer norm for TQC
* Add layer nprm for all policies
* Revert "Add layer nprm for all policies"
This reverts commit 1306c3c64eb12613464982c66cb416a3bbc66285.
* Revert "Add optional layer norm for TQC"
This reverts commit 200222e3a8878007aa6032d540ae74274a4d0788.
* Add experimental support to train off-policy algorithms with multiple envs
* Bump version
* Update version
2021-12-02 10:40:21 +01:00
Antonin RAFFIN
cd0a5e516f
Update citation ( #54 )
...
* Update citation
* Fixes for new SB3 version
* Fix type hint
* Additional fixes
2021-12-01 19:09:32 +01:00
Antonin RAFFIN
b1397bbb72
Release 1.3.0 ( #48 )
2021-10-23 17:21:22 +02:00
Geoff McDonald
d6c5cea644
MaskablePPO dictionary observation support ( #47 )
...
* Add dictionary observation support for ppo_mask.
* Improving naming consistency.
* Update changelog.
* Reformat and add test
* Update doc
* Update README and setup
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-10-23 17:05:37 +02:00
Scott Brownlie
b2e7126840
Train/Eval Mode Support ( #39 )
...
* switch models between train and eval mode
* update changelog
* update release in change log
* Update dependency
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-09-08 12:54:50 +02:00
Antonin RAFFIN
ae39e00c44
Release v1.1.0 ( #34 )
2021-07-02 11:38:46 +02:00
Antonin RAFFIN
2258c72215
Update to new logger ( #32 )
2021-06-14 17:25:08 +02:00
Antonin RAFFIN
08418a3cc8
Bump SB3 version ( #30 )
2021-05-12 11:46:16 +02:00
Antonin RAFFIN
3665695d1e
Dictionary Observations ( #29 )
...
* Add TQC support for new HER version
* Add dict obs support
* Add support for dict obs
2021-05-11 13:24:31 +02:00
Antonin RAFFIN
61bfdbc00a
Fix unused code ( #28 )
...
* Fix unused code
* Update changelog
* Update SB3 dependency
2021-05-05 11:42:10 +02:00
Antonin RAFFIN
81ef23d270
SB3 v1.0 ( #23 )
2021-03-17 14:32:58 +01:00
Antonin RAFFIN
9824daca44
Bug fix for QR-DQN ( #21 )
...
* Bug fix for QR-DQN
* Upgrade SB3
2021-03-06 14:54:43 +01:00
Antonin RAFFIN
7c2eb833c0
Upgrade SB3 ( #20 )
2021-02-27 19:59:21 +01:00
Antonin RAFFIN
74e60381a6
Upgrade Stable-Baselines3 ( #19 )
...
* Upgrade Stable-Baselines3
* Fix policy saving/loading
2021-02-27 18:17:22 +01:00
Toshiki Watanabe
4b4d487fdb
Fix the target calculation of QR-DQN ( #18 )
...
* Fix the target calculation of QR-DQN
* Update doc
* Update version
* Update changelog
* Update README
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-01-11 14:11:16 +01:00
Antonin RAFFIN
e9c6135f90
Update setup readme
2020-12-21 11:20:32 +01:00
Antonin RAFFIN
3598ca284a
Update requirements ( #15 )
2020-12-13 17:29:15 +01:00
Antonin RAFFIN
aac20bd1e6
Release v0.10.0
2020-10-28 15:08:07 +01:00
Antonin RAFFIN
b896b7492e
Update dependencies
2020-10-22 16:35:28 +02:00
Antonin RAFFIN
e8093965c7
Fix doc build
2020-10-22 14:46:05 +02:00
Antonin RAFFIN
7609c87e84
Cleanup TQC
2020-10-12 19:50:08 +02:00
Antonin RAFFIN
99fe824f76
Update requirements
2020-09-25 16:00:49 +02:00
Antonin RAFFIN
17c2dabc7f
Update CI
2020-09-25 12:50:52 +02:00
Antonin RAFFIN
0d9f2e229e
Add TQC and base scripts
2020-09-25 12:47:45 +02:00