Reinforcement Learning Greedy Policy

Creator: Petar Veličković (original)

A grid-world policy that greedily selects the locally best action under estimated state values, steering toward the goal while avoiding a penalty state.

Reinforcement Learning Greedy Policy

Download

Code

reinforcement-learning-greedy-policy.typ (97 lines)

reinforcement-learning-greedy-policy.tex (29 lines)