Commit Graph

17 Commits

Author SHA1 Message Date
Scott Brownlie b2e7126840
Train/Eval Mode Support (#39)
* switch models between train and eval mode

* update changelog

* update release in change log

* Update dependency

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-09-08 12:54:50 +02:00
Antonin RAFFIN 36eca8ee79
Fix type annotation + add python 3.9 + citation (#37) 2021-07-29 18:14:03 +02:00
Antonin RAFFIN 2258c72215
Update to new logger (#32) 2021-06-14 17:25:08 +02:00
Antonin Raffin 30cc206578 Add test for pytorch variables 2021-05-12 11:39:56 +02:00
Antonin RAFFIN 3665695d1e
Dictionary Observations (#29)
* Add TQC support for new HER version

* Add dict obs support

* Add support for dict obs
2021-05-11 13:24:31 +02:00
Antonin RAFFIN 61bfdbc00a
Fix unused code (#28)
* Fix unused code

* Update changelog

* Update SB3 dependency
2021-05-05 11:42:10 +02:00
Antonin RAFFIN 9824daca44
Bug fix for QR-DQN (#21)
* Bug fix for QR-DQN

* Upgrade SB3
2021-03-06 14:54:43 +01:00
Antonin RAFFIN 74e60381a6
Upgrade Stable-Baselines3 (#19)
* Upgrade Stable-Baselines3

* Fix policy saving/loading
2021-02-27 18:17:22 +01:00
Toshiki Watanabe b30397fff5
Add QR-DQN (#13)
* Add QR-DQN(WIP)

* Update docstring

* Add quantile_huber_loss

* Fix typo

* Remove unnecessary lines

* Update variable names and comments in quantile_huber_loss

* Fix mutable arguments

* Update variable names

* Ignore import not used warnings

* Fix default parameter of optimizer in QR-DQN

* Update quantile_huber_loss to have more reasonable interface

* update tests

* Add assertion to quantile_huber_loss

* Update variable names of quantile regression

* Update comments

* Reduce the number of quantiles during test

* Update comment

* Update quantile_huber_loss

* Fix isort

* Add document of QR-DQN without results

* Update docs

* Fix bugs

* Update doc

* Add comments about shape

* Minor edits

* Update comments

* Add benchmark

* Doc fixes

* Update doc

* Bug fix in saving/loading + update tests

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-12-21 11:17:48 +01:00
Antonin RAFFIN eccdc55fdd Add missing param to docstring 2020-12-08 18:03:56 +01:00
Antonin RAFFIN 857a087a2a
Update TQC to match SB3 (#14) 2020-12-08 15:35:50 +01:00
Antonin RAFFIN 2ce8d278cc
Fix features extractor issue (#5)
* Fix feature extractor issue

* Sync with SB3 PR
2020-10-27 14:30:35 +01:00
Antonin RAFFIN 0700c3eeb0
Add TQC (#4)
* Add TQC doc

* Polish code

* Update doc

* Update results

* Update doc

* Update doc

* Add note about PyBullet envs
2020-10-22 13:43:46 +02:00
Antonin RAFFIN 5d7b79d41a Improve coverage 2020-10-12 20:17:33 +02:00
Antonin RAFFIN 7609c87e84 Cleanup TQC 2020-10-12 19:50:08 +02:00
Antonin RAFFIN 5217a0bd73 Disable n-step replay 2020-09-25 13:18:24 +02:00
Antonin RAFFIN 0d9f2e229e Add TQC and base scripts 2020-09-25 12:47:45 +02:00