stable-baselines3-contrib-sacd/sb3_contrib
rnederstigt bfa86ce4fe
Fix masked quantities in RecurrentPPO (#78)
* Ignore masked indexes when calculating the loss functions
2022-06-13 16:00:40 +02:00
..
ars Using policy_aliases instead of register_policy (#66) 2022-04-08 21:36:23 +02:00
common Recurrent PPO (#53) 2022-05-30 04:31:12 +02:00
ppo_mask Recurrent PPO (#53) 2022-05-30 04:31:12 +02:00
ppo_recurrent Fix masked quantities in RecurrentPPO (#78) 2022-06-13 16:00:40 +02:00
qrdqn Recurrent PPO (#53) 2022-05-30 04:31:12 +02:00
tqc Upgrade to python 3.7+ syntax (#69) 2022-04-25 13:02:07 +02:00
trpo Upgrade to python 3.7+ syntax (#69) 2022-04-25 13:02:07 +02:00
__init__.py Recurrent PPO (#53) 2022-05-30 04:31:12 +02:00
py.typed Add TQC and base scripts 2020-09-25 12:47:45 +02:00
version.txt Recurrent PPO (#53) 2022-05-30 04:31:12 +02:00