stable-baselines3-contrib-sacd

History

Honglu Fan cad9034fdb Handle batch norm in target update (#99 ) * Copy running stats regardless of tau in QRDQN and TQC. See https://github.com/DLR-RM/stable-baselines3/issues/996 * Copy running stats regardless of tau in QRDQN and TQC. See https://github.com/DLR-RM/stable-baselines3/issues/996 * Copy running stats regardless of tau in QRDQN and TQC. See https://github.com/DLR-RM/stable-baselines3/issues/996 * roll back test_cnn.py		2022-08-27 12:31:00 +02:00
..
wrappers	Upgrade Gym to 0.21 (#59 )	2022-02-22 16:25:43 +01:00
test_cnn.py	Add Trust Region Policy Optimization (TRPO) (#40 )	2021-12-29 11:58:03 +01:00
test_deterministic.py	Recurrent PPO (#53 )	2022-05-30 04:31:12 +02:00
test_dict_env.py	Add Trust Region Policy Optimization (TRPO) (#40 )	2021-12-29 11:58:03 +01:00
test_distributions.py	Upgrade Gym to 0.21 (#59 )	2022-02-22 16:25:43 +01:00
test_identity.py	Release v1.6.0 and bug fix for TRPO (#84 )	2022-07-12 23:12:24 +02:00
test_invalid_actions.py	Maskable eval callback call callback fix (#93 )	2022-07-27 19:52:07 +02:00
test_lstm.py	Fixed shared_lstm argument in CNN and MultiInput Policies for RecurrentPPO (#90 )	2022-07-26 00:27:17 +02:00
test_run.py	Allow PPO to turn off advantage normalization (#61 )	2022-02-23 10:11:16 +01:00
test_save_load.py	Upgrade to python 3.7+ syntax (#69 )	2022-04-25 13:02:07 +02:00
test_train_eval_mode.py	Handle batch norm in target update (#99 )	2022-08-27 12:31:00 +02:00
test_utils.py	Upgrade Gym to 0.21 (#59 )	2022-02-22 16:25:43 +01:00