
RL interaction loop

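As you know by now, RL involves an agent making decisions in an environment to maximize cumulative reward. The agent is not told which actions are best; it must discover them through interaction: at each step it chooses an action, observes the resulting state and reward, and repeats until the episode ends.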
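The loop described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the exercise's solution: `LineWorld` is a hypothetical toy environment invented here so the example runs with the standard library alone, and the agent simply picks random actions. Its `reset()` and `step()` methods are written to mirror the return shapes used by Gymnasium environments (`observation, info` and `observation, reward, terminated, truncated, info`), so a real Gymnasium environment could likely be swapped in with few changes.

```python
import random


class LineWorld:
    """Hypothetical toy environment mimicking the Gymnasium reset/step interface.

    The agent starts at position 0 on a short line and earns a reward of 1.0
    when it reaches the rightmost cell. Actions: 0 = left, 1 = right.
    """

    def __init__(self, size=5, max_steps=50):
        self.size = size
        self.max_steps = max_steps
        self.position = 0
        self.steps = 0

    def reset(self):
        # Start a new episode and return the initial observation plus an info dict.
        self.position = 0
        self.steps = 0
        return self.position, {}

    def step(self, action):
        # Move one cell left or right, staying inside the line.
        move = 1 if action == 1 else -1
        self.position = min(max(self.position + move, 0), self.size - 1)
        self.steps += 1
        terminated = self.position == self.size - 1  # goal reached
        truncated = (not terminated) and self.steps >= self.max_steps  # step limit hit
        reward = 1.0 if terminated else 0.0
        return self.position, reward, terminated, truncated, {}


def run_episode(env, rng):
    """Run one episode with a random policy; return (total_reward, steps)."""
    observation, info = env.reset()
    total_reward = 0.0
    steps = 0
    done = False
    while not done:
        action = rng.choice([0, 1])  # the "agent": a random action for now
        observation, reward, terminated, truncated, info = env.step(action)
        total_reward += reward
        steps += 1
        done = terminated or truncated
    return total_reward, steps


total_reward, steps = run_episode(LineWorld(), random.Random(0))
print(f"Episode finished after {steps} steps with total reward {total_reward}")
```

A learning agent would replace the random choice with a policy that uses `observation` and is updated from `reward`; the surrounding loop stays the same.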

This exercise is part of the course

Reinforcement Learning with Gymnasium in Python

