10
Reinforcement Learning
Agents, environments, Q-learning, and policy gradients
4 lessons, 195 min total, advanced
1
RL Foundations
Markov Decision Processes, value functions, and the exploration-exploitation trade-off
2 exercises, Quiz, 45 min
2
Q-Learning & Deep Q-Networks
From Q-tables to deep neural network approximators
2 exercises, Quiz, 55 min
3
Policy Gradient Methods
REINFORCE, Actor-Critic, and Proximal Policy Optimization
2 exercises, Quiz, 50 min
4
RL Applications & Tools
Gymnasium, Stable Baselines3, RLHF, robotics, and game AI
2 exercises, Quiz, 45 min