Commit Graph

151 Commits

Author SHA1 Message Date
Long M. Lưu (刘明龙) fab19bdb18
Update small QR-DQN docs typo (#33)
* Update qrdqn.rst

* Update changelog.rst

* Update changelog.rst

Add my name

* Update changelog.rst

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-06-23 14:34:22 +02:00
Antonin RAFFIN 2258c72215
Update to new logger (#32) 2021-06-14 17:25:08 +02:00
Antonin RAFFIN 08418a3cc8
Bump SB3 version (#30) 2021-05-12 11:46:16 +02:00
Antonin Raffin 30cc206578 Add test for pytorch variables 2021-05-12 11:39:56 +02:00
Antonin RAFFIN 3665695d1e
Dictionary Observations (#29)
* Add TQC support for new HER version

* Add dict obs support

* Add support for dict obs
2021-05-11 13:24:31 +02:00
Antonin RAFFIN 61bfdbc00a
Fix unused code (#28)
* Fix unused code

* Update changelog

* Update SB3 dependency
2021-05-05 11:42:10 +02:00
Antonin RAFFIN 81ef23d270
SB3 v1.0 (#23) 2021-03-17 14:32:58 +01:00
Antonin RAFFIN 9824daca44
Bug fix for QR-DQN (#21)
* Bug fix for QR-DQN

* Upgrade SB3
2021-03-06 14:54:43 +01:00
Antonin RAFFIN 7c2eb833c0
Upgrade SB3 (#20) 2021-02-27 19:59:21 +01:00
Antonin RAFFIN 74e60381a6
Upgrade Stable-Baselines3 (#19)
* Upgrade Stable-Baselines3

* Fix policy saving/loading
2021-02-27 18:17:22 +01:00
Antonin RAFFIN b15cc3dbcf Update README 2021-02-06 17:13:44 +01:00
Toshiki Watanabe 4b4d487fdb
Fix the target calculation of QR-DQN (#18)
* Fix the target calculation of QR-DQN

* Update doc

* Update version

* Update changelog

* Update README

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-01-11 14:11:16 +01:00
Antonin RAFFIN e9c6135f90 Update setup readme 2020-12-21 11:20:32 +01:00
Antonin RAFFIN ab2880c670 Version bump 2020-12-21 11:19:31 +01:00
Toshiki Watanabe b30397fff5
Add QR-DQN (#13)
* Add QR-DQN (WIP)

* Update docstring

* Add quantile_huber_loss

* Fix typo

* Remove unnecessary lines

* Update variable names and comments in quantile_huber_loss

* Fix mutable arguments

* Update variable names

* Ignore import not used warnings

* Fix default parameter of optimizer in QR-DQN

* Update quantile_huber_loss to have more reasonable interface

* Update tests

* Add assertion to quantile_huber_loss

* Update variable names of quantile regression

* Update comments

* Reduce the number of quantiles during test

* Update comment

* Update quantile_huber_loss

* Fix isort

* Add document of QR-DQN without results

* Update docs

* Fix bugs

* Update doc

* Add comments about shape

* Minor edits

* Update comments

* Add benchmark

* Doc fixes

* Update doc

* Bug fix in saving/loading + update tests

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-12-21 11:17:48 +01:00
Antonin RAFFIN 3598ca284a
Update requirements (#15) 2020-12-13 17:29:15 +01:00
Antonin RAFFIN eccdc55fdd Add missing param to docstring 2020-12-08 18:03:56 +01:00
Antonin RAFFIN 857a087a2a
Update TQC to match SB3 (#14) 2020-12-08 15:35:50 +01:00
Antonin RAFFIN 6bafcf6e88
Add TimeFeatureWrapper (#7)
* Add TimeFeatureWrapper

* Update README

* Address comments
2020-11-13 13:00:56 +02:00
Antonin RAFFIN aac20bd1e6 Release v0.10.0 2020-10-28 15:08:07 +01:00
Antonin RAFFIN 2ce8d278cc
Fix features extractor issue (#5)
* Fix feature extractor issue

* Sync with SB3 PR
2020-10-27 14:30:35 +01:00
Antonin RAFFIN 017096371c [ci skip] Update README 2020-10-22 16:52:11 +02:00
Antonin RAFFIN b896b7492e Update dependencies 2020-10-22 16:35:28 +02:00
Antonin RAFFIN e8093965c7 Fix doc build 2020-10-22 14:46:05 +02:00
Antonin RAFFIN 0700c3eeb0
Add TQC (#4)
* Add TQC doc

* Polish code

* Update doc

* Update results

* Update doc

* Update doc

* Add note about PyBullet envs
2020-10-22 13:43:46 +02:00
Antonin RAFFIN 3fe0a0c2f2
Merge pull request #1 from Stable-Baselines-Team/contrib
Add contributing guides
2020-10-17 17:10:37 +02:00
Antonin RAFFIN 72fe9a2072 Faster tests 2020-10-17 17:06:11 +02:00
Antonin RAFFIN 926e488196 Update wording and links 2020-10-17 17:04:00 +02:00
Anssi "Miffyli" Kanervisto 00f9d26d55 Spell-checking and small update to issue template 2020-10-15 02:50:16 +03:00
Anssi "Miffyli" Kanervisto 4a1695cf7b Update README 2020-10-15 02:46:29 +03:00
Anssi "Miffyli" Kanervisto 95892f31e6 Update PR template 2020-10-15 02:27:22 +03:00
Anssi "Miffyli" Kanervisto 79fcf54e1e Review docs and update changelog 2020-10-15 02:17:36 +03:00
Anssi "Miffyli" Kanervisto 8d3570ae5f Merge branch 'master' into contrib 2020-10-15 02:07:46 +03:00
Antonin RAFFIN afe7b132e4 Lint 2020-10-12 20:25:11 +02:00
Antonin RAFFIN 5033b192cb Add base doc 2020-10-12 20:21:52 +02:00
Antonin RAFFIN 5d7b79d41a Improve coverage 2020-10-12 20:17:33 +02:00
Antonin RAFFIN 7609c87e84 Cleanup TQC 2020-10-12 19:50:08 +02:00
Anssi "Miffyli" Kanervisto 31da024e96 Update info on docs. Remove changelog part of per-algo docs 2020-10-12 00:47:43 +03:00
Anssi "Miffyli" Kanervisto eed8a29705 Update docs on sharing results/code to replicate. 2020-10-11 22:32:33 +03:00
Anssi "Miffyli" Kanervisto fbe8e6536f Add note about experimental tag in sb3 issues 2020-10-11 22:09:48 +03:00
Anssi "Miffyli" Kanervisto 383ce59c85 Update info on where code should be placed 2020-10-11 22:02:45 +03:00
Anssi "Miffyli" Kanervisto 86508f0694 Replace info about tests 2020-10-06 01:48:42 +03:00
Anssi "Miffyli" Kanervisto e21ba4b2b1 Remove datatypes from example function 2020-10-06 01:44:11 +03:00
Anssi "Miffyli" Kanervisto 467ffee554 Add initial version of contributing 2020-10-05 21:15:30 +03:00
Antonin RAFFIN 99fe824f76 Update requirements 2020-09-25 16:00:49 +02:00
Antonin RAFFIN 5217a0bd73 Disable n-step replay 2020-09-25 13:18:24 +02:00
Antonin RAFFIN 672cb256b1 Update script permissions 2020-09-25 12:53:13 +02:00
Antonin RAFFIN 17c2dabc7f Update CI 2020-09-25 12:50:52 +02:00
Antonin RAFFIN 048a512dd1 Update badge 2020-09-25 12:49:01 +02:00
Antonin RAFFIN 0d9f2e229e Add TQC and base scripts 2020-09-25 12:47:45 +02:00