diff --git a/docs/misc/changelog.rst b/docs/misc/changelog.rst
index b4e42d8..3fb41de 100644
--- a/docs/misc/changelog.rst
+++ b/docs/misc/changelog.rst
@@ -4,14 +4,14 @@ Changelog
 ==========
 
-Release 1.3.1a9 (WIP)
+Release 1.4.0 (2021-10-23)
 -------------------------------
 
-**Add TRPO and ARS**
+**Add Trust Region Policy Optimization (TRPO) and Augmented Random Search (ARS) algorithms**
 
 Breaking Changes:
 ^^^^^^^^^^^^^^^^^
 - Dropped python 3.6 support
-- Upgraded to Stable-Baselines3 >= 1.3.1a9
+- Upgraded to Stable-Baselines3 >= 1.4.0
 - ``MaskablePPO`` was updated to match latest SB3 ``PPO`` version (timeout handling and new method for the policy object)
 
 New Features:
diff --git a/docs/modules/ars.rst b/docs/modules/ars.rst
index 7ca1141..de258c2 100644
--- a/docs/modules/ars.rst
+++ b/docs/modules/ars.rst
@@ -48,10 +48,10 @@ Can I use?
 ============= ====== ===========
 Space         Action Observation
 ============= ====== ===========
-Discrete      ✔️      ❌
+Discrete      ✔️      ✔️
 Box           ✔️      ✔️
-MultiDiscrete ❌      ❌
-MultiBinary   ❌      ❌
+MultiDiscrete ❌      ✔️
+MultiBinary   ❌      ✔️
 Dict          ❌      ❌
 ============= ====== ===========
 
diff --git a/sb3_contrib/version.txt b/sb3_contrib/version.txt
index 89cccac..88c5fb8 100644
--- a/sb3_contrib/version.txt
+++ b/sb3_contrib/version.txt
@@ -1 +1 @@
-1.3.1a9
+1.4.0
diff --git a/setup.py b/setup.py
index 8da3d1d..cb054b7 100644
--- a/setup.py
+++ b/setup.py
@@ -63,7 +63,7 @@ setup(
     packages=[package for package in find_packages() if package.startswith("sb3_contrib")],
     package_data={"sb3_contrib": ["py.typed", "version.txt"]},
     install_requires=[
-        "stable_baselines3>=1.3.1a9",
+        "stable_baselines3>=1.4.0",
     ],
     description="Contrib package of Stable Baselines3, experimental code.",
     author="Antonin Raffin",