stable-baselines3-contrib-sacd/sb3_contrib
Geoff McDonald d6c5cea644
MaskablePPO dictionary observation support (#47)
* Add dictionary observation support for ppo_mask.

* Improving naming consistency.

* Update changelog.

* Reformat and add test.

* Update doc.

* Update README and setup.

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-10-23 17:05:37 +02:00
common       MaskablePPO dictionary observation support (#47)  2021-10-23 17:05:37 +02:00
ppo_mask     MaskablePPO dictionary observation support (#47)  2021-10-23 17:05:37 +02:00
qrdqn        Train/Eval Mode Support (#39)                     2021-09-08 12:54:50 +02:00
tqc          Remove sde net arch (#44)                         2021-09-28 21:59:59 +02:00
__init__.py  PPO variant with invalid action masking (#25)     2021-09-23 14:50:10 +02:00
py.typed     Add TQC and base scripts                          2020-09-25 12:47:45 +02:00
version.txt  MaskablePPO dictionary observation support (#47)  2021-10-23 17:05:37 +02:00