Antonin RAFFIN
|
74e60381a6
|
Upgrade Stable-Baselines3 (#19)
* Upgrade Stable-Baselines3
* Fix policy saving/loading
|
2021-02-27 18:17:22 +01:00 |
Toshiki Watanabe
|
b30397fff5
|
Add QR-DQN (#13)
* Add QR-DQN(WIP)
* Update docstring
* Add quantile_huber_loss
* Fix typo
* Remove unnecessary lines
* Update variable names and comments in quantile_huber_loss
* Fix mutable arguments
* Update variable names
* Ignore import not used warnings
* Fix default parameter of optimizer in QR-DQN
* Update quantile_huber_loss to have more reasonable interface
* update tests
* Add assertion to quantile_huber_loss
* Update variable names of quantile regression
* Update comments
* Reduce the number of quantiles during test
* Update comment
* Update quantile_huber_loss
* Fix isort
* Add document of QR-DQN without results
* Update docs
* Fix bugs
* Update doc
* Add comments about shape
* Minor edits
* Update comments
* Add benchmark
* Doc fixes
* Update doc
* Bug fix in saving/loading + update tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
|
2020-12-21 11:17:48 +01:00 |
Antonin RAFFIN
|
eccdc55fdd
|
Add missing param to docstring
|
2020-12-08 18:03:56 +01:00 |
Antonin RAFFIN
|
857a087a2a
|
Update TQC to match SB3 (#14)
|
2020-12-08 15:35:50 +01:00 |
Antonin RAFFIN
|
0700c3eeb0
|
Add TQC (#4)
* Add TQC doc
* Polish code
* Update doc
* Update results
* Update doc
* Update doc
* Add note about PyBullet envs
|
2020-10-22 13:43:46 +02:00 |
Antonin RAFFIN
|
7609c87e84
|
Cleanup TQC
|
2020-10-12 19:50:08 +02:00 |
Antonin RAFFIN
|
5217a0bd73
|
Disable n-step replay
|
2020-09-25 13:18:24 +02:00 |
Antonin RAFFIN
|
0d9f2e229e
|
Add TQC and base scripts
|
2020-09-25 12:47:45 +02:00 |