stable-baselines3-contrib-sacd/sb3_contrib/common/maskable
Geoff McDonald d6c5cea644
MaskablePPO dictionary observation support (#47)
* Add dictionary observation support for ppo_mask.

* Improving naming consistency.

* Update changelog.

* Reformat and add test

* Update doc

* Update README and setup

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-10-23 17:05:37 +02:00
..
__init__.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
buffers.py MaskablePPO dictionary observation support (#47) 2021-10-23 17:05:37 +02:00
callbacks.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
distributions.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
evaluation.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
policies.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
utils.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00