stable-baselines3-contrib-sacd/sb3_contrib/common/maskable
Alex Pasquali 376d9551de
Update MaskablePPO docs (#150)
* MaskablePPO docs

Added a warning about possible crashes caused by check_env in case of invalid actions.

* Reformat with black 23

* Rephrase note on action sampling

* Fix action noise

* Update changelog

---------

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2023-02-13 14:31:49 +01:00
__init__.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00
buffers.py Update MaskablePPO docs (#150) 2023-02-13 14:31:49 +01:00
callbacks.py Update MaskablePPO docs (#150) 2023-02-13 14:31:49 +01:00
distributions.py Fix `Self` return type (#116) 2022-11-22 13:12:35 +01:00
evaluation.py Update MaskablePPO docs (#150) 2023-02-13 14:31:49 +01:00
policies.py Update MaskablePPO docs (#150) 2023-02-13 14:31:49 +01:00
utils.py PPO variant with invalid action masking (#25) 2021-09-23 14:50:10 +02:00