Duke University | AIPI 590

Introduction to Modern Reinforcement Learning

Course Description

This course provides a comprehensive, hands-on introduction to reinforcement learning (RL), bridging foundational theory with state-of-the-art methods used in modern AI systems. Students will learn how agents learn from interaction through the lens of Markov Decision Processes, value functions, and policy optimization. Beginning with classical tabular methods, the course progresses through deep RL architectures (DQN, PPO), human-in-the-loop learning (RLHF), and advanced topics.

Weekly labs emphasize implementation and experimentation and four open-ended challenges integrate concepts like safety, generalization, and alignment. By the end of the semester, students will be equipped to design, train, and critically evaluate RL agents.

Schedule

WeekDateTopicLectureLabChallenges (due)
1Jan 13From AlphaZero to RLHFLecture 1Lab 1
2Jan 20Reinforcement Learning Foundations: Agent-environment loopLecture 2Lab 2
3Jan 27Reward DesignLab 3Challenge 0
4Feb 3Deep Reinforcement Learning: Value-based AgentsLecture 3Lab 4
5Feb 10Deep Reinforcement Learning: Policy Gradients and PPOLecture 4Lab 5
6Feb 17Safety, Generalization, and ExplorationLecture 5Challenge 1
7Feb 24Human in the Loop RLLecture 6Lab 6
8Mar 3RLHF PipelineLecture 7Lab 7
9Mar 17Offline RLLecture 8Challenge 2
10Mar 24Model-based RL and World ModelsLecture 9Lab 8
11Mar 31Hierarchical RLLecture 10Challenge 3
12Apr 7Inverse RL and Reward InferenceLecture 11Lab 9
13Apr 14Reinforcement Learning EngineeringLecture 12Lab 10
14Apr 21Frontiers in Aligned RLLecture 13Challenge 4

Course Content