stable-baselines3-contrib-sacd/sb3_contrib
Toshiki Watanabe b30397fff5
Add QR-DQN (#13)
* Add QR-DQN (WIP)

* Update docstring

* Add quantile_huber_loss

* Fix typo

* Remove unnecessary lines

* Update variable names and comments in quantile_huber_loss

* Fix mutable arguments

* Update variable names

* Ignore import not used warnings

* Fix default parameter of optimizer in QR-DQN

* Update quantile_huber_loss to have more reasonable interface

* Update tests

* Add assertion to quantile_huber_loss

* Update variable names of quantile regression

* Update comments

* Reduce the number of quantiles during test

* Update comment

* Update quantile_huber_loss

* Fix isort

* Add documentation for QR-DQN without results

* Update docs

* Fix bugs

* Update doc

* Add comments about shape

* Minor edits

* Update comments

* Add benchmark

* Doc fixes

* Update doc

* Bug fix in saving/loading + update tests

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-12-21 11:17:48 +01:00
common        Add QR-DQN (#13)            2020-12-21 11:17:48 +01:00
qrdqn         Add QR-DQN (#13)            2020-12-21 11:17:48 +01:00
tqc           Add QR-DQN (#13)            2020-12-21 11:17:48 +01:00
__init__.py   Add QR-DQN (#13)            2020-12-21 11:17:48 +01:00
py.typed      Add TQC and base scripts    2020-09-25 12:47:45 +02:00
version.txt   Update requirements (#15)   2020-12-13 17:29:15 +01:00