JAN MALTE LICHTENBERG · JANUARY 5, 2020
Dynamic Programming
The agent (red sphere) can navigate the gridworld environment using one of four actions at each time step.
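The setup in the caption can be sketched in a few lines of code. This is only an illustration under stated assumptions: the four actions are taken to be up, down, left, and right, the grid is 4×4, transitions are deterministic, and a move that would leave the grid keeps the agent in place. None of these details, nor the names `ACTIONS` and `step`, come from the post itself.

```python
from typing import Dict, Tuple

State = Tuple[int, int]  # (row, col) position of the agent

# Assumed action set: the four compass moves as (row, col) offsets.
ACTIONS: Dict[str, Tuple[int, int]] = {
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
}


def step(state: State, action: str, n_rows: int = 4, n_cols: int = 4) -> State:
    """Return the next cell after taking `action` from `state`.

    Assumption: a move that would leave the grid keeps the agent where it is.
    """
    d_row, d_col = ACTIONS[action]
    row, col = state[0] + d_row, state[1] + d_col
    if 0 <= row < n_rows and 0 <= col < n_cols:
        return (row, col)
    return state


if __name__ == "__main__":
    position = (0, 0)
    for action in ["right", "down", "up", "up"]:
        position = step(position, action)
        print(action, "->", position)
```

Representing actions as offsets keeps the transition function a single bounds check, which is convenient when a dynamic programming method later needs to enumerate every state and action pair.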