Rename QRDQN logger key (#67)

2022-04-12 12:50:35 +02:00 · 2022-04-12 12:50:35 +02:00 · 812648e6cd
parent 99853265a9
commit 812648e6cd
2 changed files with 2 additions and 1 deletions
--- a/docs/misc/changelog.rst
+++ b/docs/misc/changelog.rst
@ -11,6 +11,7 @@ Breaking Changes:
 - Upgraded to Stable-Baselines3 >= 1.5.1a1
 - Changed the way policy "aliases" are handled ("MlpPolicy", "CnnPolicy", ...), removing the former
  ``register_policy`` helper, ``policy_base`` parameter and using ``policy_aliases`` static attributes instead (@Gregwar)
 - Renamed ``rollout/exploration rate`` key to ``rollout/exploration_rate`` for QRDQN (to be consistent with SB3 DQN)
 New Features:
 ^^^^^^^^^^^^^
--- a/sb3_contrib/qrdqn/qrdqn.py
+++ b/sb3_contrib/qrdqn/qrdqn.py
@ -159,7 +159,7 @@ class QRDQN(OffPolicyAlgorithm):
            polyak_update(self.quantile_net.parameters(), self.quantile_net_target.parameters(), self.tau)
        self.exploration_rate = self.exploration_schedule(self._current_progress_remaining)
-        self.logger.record("rollout/exploration rate", self.exploration_rate)
+        self.logger.record("rollout/exploration_rate", self.exploration_rate)
    def train(self, gradient_steps: int, batch_size: int = 100) -> None:
        # Switch to train mode (this affects batch norm / dropout)