stable-baselines3-contrib-sacd/sb3_contrib
Max Lodel fc68af8841
Fixed shared_lstm argument in CNN and MultiInput Policies for RecurrentPPO (#90)
* fixed shared_lstm parameter in CNN and MultiInput Policies

* updated tests

* changelog

* Fix FPS for recurrent PPO

* Fix import

* Update changelog

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-07-26 00:27:17 +02:00
Name | Last commit | Date
ars | Use higher resolution time_ns() and avoid division by zero (#91) | 2022-07-25 23:12:20 +02:00
common | Fixed shared_lstm argument in CNN and MultiInput Policies for RecurrentPPO (#90) | 2022-07-26 00:27:17 +02:00
ppo_mask | Use higher resolution time_ns() and avoid division by zero (#91) | 2022-07-25 23:12:20 +02:00
ppo_recurrent | Fixed shared_lstm argument in CNN and MultiInput Policies for RecurrentPPO (#90) | 2022-07-26 00:27:17 +02:00
qrdqn | Recurrent PPO (#53) | 2022-05-30 04:31:12 +02:00
tqc | Update default TQC net arch when using NatureCnn (#79) | 2022-06-18 10:53:29 +02:00
trpo | Release v1.6.0 and bug fix for TRPO (#84) | 2022-07-12 23:12:24 +02:00
__init__.py | Recurrent PPO (#53) | 2022-05-30 04:31:12 +02:00
py.typed | Add TQC and base scripts | 2020-09-25 12:47:45 +02:00
version.txt | Fixed shared_lstm argument in CNN and MultiInput Policies for RecurrentPPO (#90) | 2022-07-26 00:27:17 +02:00