Skip to content

Commit

Permalink
Fix remaining references to policy gradient.
Browse files Browse the repository at this point in the history
  • Loading branch information
Gamenot committed Jan 5, 2024
1 parent 6779803 commit e6d670b
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions docs/ecosystem/rllib.rst
Original file line number Diff line number Diff line change
Expand Up @@ -8,15 +8,15 @@ RLlib

SMARTS contains two examples using `Proximal Policy Optimization (PPO) <https://docs.ray.io/en/latest/rllib/rllib-algorithms.html#ppo>`_.

#. Policy gradient
#. Proximal policy optimization

+ script: :examples:`e12_rllib/ppo_example.py`
+ Shows the basics of using RLlib with SMARTS through :class:`~smarts.env.rllib_hiway_env.RLlibHiWayEnv`.

#. Policy gradient with population based training
#. Proximal policy optimization with population based training

+ script: :examples:`e12_rllib/ppo_pbt_example.py`
+ Combines Proximal Policy Optimization with `Population Based Training (PBT) <https://docs.ray.io/en/latest/tune/api/doc/ray.tune.schedulers.PopulationBasedTraining.html>`_ scheduling.
+ Combines `Proximal Policy Optimization (PPO)` with `Population Based Training (PBT) <https://docs.ray.io/en/latest/tune/api/doc/ray.tune.schedulers.PopulationBasedTraining.html>`_ scheduling.


Recommended reads
Expand Down

0 comments on commit e6d670b

Please sign in to comment.