Antonin RAFFIN
|
7c2eb833c0
|
Upgrade SB3 (#20)
|
2021-02-27 19:59:21 +01:00 |
Antonin RAFFIN
|
74e60381a6
|
Upgrade Stable-Baselines3 (#19)
* Upgrade Stable-Baselines3
* Fix policy saving/loading
|
2021-02-27 18:17:22 +01:00 |
Antonin RAFFIN
|
b15cc3dbcf
|
Update README
|
2021-02-06 17:13:44 +01:00 |
Toshiki Watanabe
|
4b4d487fdb
|
Fix the target calculation of QR-DQN (#18)
* Fix the target calculation of QR-DQN
* Update doc
* Update version
* Update changelog
* Update README
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
|
2021-01-11 14:11:16 +01:00 |
Antonin RAFFIN
|
e9c6135f90
|
Update setup readme
|
2020-12-21 11:20:32 +01:00 |
Antonin RAFFIN
|
ab2880c670
|
Version bump
|
2020-12-21 11:19:31 +01:00 |
Toshiki Watanabe
|
b30397fff5
|
Add QR-DQN (#13)
* Add QR-DQN(WIP)
* Update docstring
* Add quantile_huber_loss
* Fix typo
* Remove unnecessary lines
* Update variable names and comments in quantile_huber_loss
* Fix mutable arguments
* Update variable names
* Ignore import not used warnings
* Fix default parameter of optimizer in QR-DQN
* Update quantile_huber_loss to have more reasonable interface
* update tests
* Add assertion to quantile_huber_loss
* Update variable names of quantile regression
* Update comments
* Reduce the number of quantiles during test
* Update comment
* Update quantile_huber_loss
* Fix isort
* Add document of QR-DQN without results
* Update docs
* Fix bugs
* Update doc
* Add comments about shape
* Minor edits
* Update comments
* Add benchmark
* Doc fixes
* Update doc
* Bug fix in saving/loading + update tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
|
2020-12-21 11:17:48 +01:00 |
Antonin RAFFIN
|
3598ca284a
|
Update requirements (#15)
|
2020-12-13 17:29:15 +01:00 |
Antonin RAFFIN
|
eccdc55fdd
|
Add missing param to docstring
|
2020-12-08 18:03:56 +01:00 |
Antonin RAFFIN
|
857a087a2a
|
Update TQC to match SB3 (#14)
|
2020-12-08 15:35:50 +01:00 |
Antonin RAFFIN
|
6bafcf6e88
|
Add TimeFeatureWrapper (#7)
* Add TimeFeatureWrapper
* Update README
* Address comments
|
2020-11-13 13:00:56 +02:00 |
Antonin RAFFIN
|
aac20bd1e6
|
Release v0.10.0
|
2020-10-28 15:08:07 +01:00 |
Antonin RAFFIN
|
2ce8d278cc
|
Fix features extractor issue (#5)
* Fix feature extractor issue
* Sync with SB3 PR
|
2020-10-27 14:30:35 +01:00 |
Antonin RAFFIN
|
017096371c
|
[ci skip] Update README
|
2020-10-22 16:52:11 +02:00 |
Antonin RAFFIN
|
b896b7492e
|
Update dependencies
|
2020-10-22 16:35:28 +02:00 |
Antonin RAFFIN
|
e8093965c7
|
Fix doc build
|
2020-10-22 14:46:05 +02:00 |
Antonin RAFFIN
|
0700c3eeb0
|
Add TQC (#4)
* Add TQC doc
* Polish code
* Update doc
* Update results
* Update doc
* Update doc
* Add note about PyBullet envs
|
2020-10-22 13:43:46 +02:00 |
Antonin RAFFIN
|
3fe0a0c2f2
|
Merge pull request #1 from Stable-Baselines-Team/contrib
Add contributing guides
|
2020-10-17 17:10:37 +02:00 |
Antonin RAFFIN
|
72fe9a2072
|
Faster tests
|
2020-10-17 17:06:11 +02:00 |
Antonin RAFFIN
|
926e488196
|
Update wording and links
|
2020-10-17 17:04:00 +02:00 |
Anssi "Miffyli" Kanervisto
|
00f9d26d55
|
Spell-checking and small update to issue template
|
2020-10-15 02:50:16 +03:00 |
Anssi "Miffyli" Kanervisto
|
4a1695cf7b
|
Update README
|
2020-10-15 02:46:29 +03:00 |
Anssi "Miffyli" Kanervisto
|
95892f31e6
|
Update PR template
|
2020-10-15 02:27:22 +03:00 |
Anssi "Miffyli" Kanervisto
|
79fcf54e1e
|
Review docs and update changelog
|
2020-10-15 02:17:36 +03:00 |
Anssi "Miffyli" Kanervisto
|
8d3570ae5f
|
Merge branch 'master' into contrib
|
2020-10-15 02:07:46 +03:00 |
Antonin RAFFIN
|
afe7b132e4
|
Lint
|
2020-10-12 20:25:11 +02:00 |
Antonin RAFFIN
|
5033b192cb
|
Add base doc
|
2020-10-12 20:21:52 +02:00 |
Antonin RAFFIN
|
5d7b79d41a
|
Improve coverage
|
2020-10-12 20:17:33 +02:00 |
Antonin RAFFIN
|
7609c87e84
|
Cleanup TQC
|
2020-10-12 19:50:08 +02:00 |
Anssi "Miffyli" Kanervisto
|
31da024e96
|
Update info on docs. Remove changelog part of per-algo docs
|
2020-10-12 00:47:43 +03:00 |
Anssi "Miffyli" Kanervisto
|
eed8a29705
|
Update docs on sharing results/code to replicate.
|
2020-10-11 22:32:33 +03:00 |
Anssi "Miffyli" Kanervisto
|
fbe8e6536f
|
Add note about experimental tag in sb3 issues
|
2020-10-11 22:09:48 +03:00 |
Anssi "Miffyli" Kanervisto
|
383ce59c85
|
Update info on where code should be placed
|
2020-10-11 22:02:45 +03:00 |
Anssi "Miffyli" Kanervisto
|
86508f0694
|
Replace info about tests
|
2020-10-06 01:48:42 +03:00 |
Anssi "Miffyli" Kanervisto
|
e21ba4b2b1
|
Remove datatypes from example function
|
2020-10-06 01:44:11 +03:00 |
Anssi "Miffyli" Kanervisto
|
467ffee554
|
Add initial version of contributing
|
2020-10-05 21:15:30 +03:00 |
Antonin RAFFIN
|
99fe824f76
|
Update requirements
|
2020-09-25 16:00:49 +02:00 |
Antonin RAFFIN
|
5217a0bd73
|
Disable n-step replay
|
2020-09-25 13:18:24 +02:00 |
Antonin RAFFIN
|
672cb256b1
|
Update script permissions
|
2020-09-25 12:53:13 +02:00 |
Antonin RAFFIN
|
17c2dabc7f
|
Update CI
|
2020-09-25 12:50:52 +02:00 |
Antonin RAFFIN
|
048a512dd1
|
Update badge
|
2020-09-25 12:49:01 +02:00 |
Antonin RAFFIN
|
0d9f2e229e
|
Add TQC and base scripts
|
2020-09-25 12:47:45 +02:00 |
Antonin RAFFIN
|
8c282a0288
|
Initial commit
|
2020-09-20 22:09:57 +02:00 |