Fix TRPO doc

2021-12-29 15:03:51 +01:00 · 2021-12-29 15:03:51 +01:00 · 3b007ae93b
parent 59be198da0
commit 3b007ae93b
1 changed files with 8 additions and 8 deletions
--- a/docs/guide/examples.rst
+++ b/docs/guide/examples.rst
@ -45,15 +45,15 @@ Train a PPO with invalid action masking agent on a toy environment.
  model.learn(5000)
  model.save("qrdqn_cartpole")

-  TRPO
-  ----
+TRPO
+----

-  Train a Trust Region Policy Optimization (TRPO) agent on the Pendulum environment.
+Train a Trust Region Policy Optimization (TRPO) agent on the Pendulum environment.

-  .. code-block:: python
+.. code-block:: python

-    from sb3_contrib import TRPO
+  from sb3_contrib import TRPO

-    model = TRPO("MlpPolicy", "Pendulum-v0", gamma=0.9, verbose=1)
-    model.learn(total_timesteps=100_000, log_interval=4)
-    model.save("trpo_pendulum")
+  model = TRPO("MlpPolicy", "Pendulum-v0", gamma=0.9, verbose=1)
+  model.learn(total_timesteps=100_000, log_interval=4)
+  model.save("trpo_pendulum")