* Fix `sb3_contrib/common/maskable/callbacks.py` type annotations
* Fix `sb3_contrib/common/vec_env/async_eval.py` type annotations
* Fix `sb3_contrib/common/maskable/buffers.py` type hints
* Fix `sb3_contrib/common/maskable/distributions.py` type hints
* Fix `sb3_contrib/common/maskable/policies.py` type hints
* Fix `sb3_contrib/ppo_mask/ppo_mask.py` type hints
* Update changelog and fix type hints
* Update RTD config
* Switch to ruff for sorting imports
* Test for falsy values with `not` rather than `is False`
* Add `features_extractor` argument to maskable policy
* Add set_options for AsyncEval
* Doc fixes
* Add support for Gym 0.24
* Fixes for gym 0.24
* Fix for new reset signature
* Add tmp SB3 branch
* Fixes for gym 0.26
* Remove unused import
* Fix dependency
* Type annotations fixes
* Reformat
* Reformat with black 23
* Move to gymnasium
* Patch env if needed
* Fix types
* Fix CI
* Fixes for gymnasium
* Fix wrapper annotations
* Update version
* Fix type check
* Update QRDQN type hints and bug fix with multi envs
* Fix TQC type hints
* Fix TRPO type hints
* Additional fixes
* Update SB3 version
* Update issue templates and CI
---------
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
* Issue forms and pyproject.toml
* [ci skip] Fix typos
* Fix isort config
* Use secret link to download atari roms
* Fix for mypy and update config
* Upgrade SB3 and fix warnings
* Fix doc build
* Update Makefile
* Lint first
* MaskablePPO docs
Added a warning about possible crashes caused by `check_env` in case of invalid actions.
* Reformat with black 23
* Rephrase note on action sampling
* Fix action noise
* Update changelog
---------
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
* Replace `to(device)` with `device=device` and `float()` with `dtype=th.float32`
* Update changelog
* Fix type checking
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
* Modified sb3_contrib/common/maskable/policies.py
- Added support for non-shared features extractor in file sb3_contrib/common/maskable/policies.py
- Updated changelog
* Modified sb3_contrib/common/recurrent/policies.py
* Modified sb3_contrib/qrdqn/policies.py and sb3_contrib/tqc/policies.py
* Updated test_cnn.py
* Upgrade SB3 version
* Revert changes in formatting
* Remove duplicate normalize_images
* Add test for image-like inputs
* Fixes and add more tests
* Update SB3 version
* Fix ARS warnings
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
* Update contribution.md
* New loop struct to make mypy happy
* Update setup.cfg
* Update changelog
* fix squash_output = False in ARS policy
* Add with_bias parameter to ARSPolicy
* Make ARSLinearPolicy a special case of ARSPolicy
* Remove ars_policy from mypy exclude
* Update changelog
* Update SB3 version
* Fix to save ARS linear policy saved with sb3-contrib < 1.7.0
* Fix test
* Turn docstring into comment
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>