Distributional RL
Offered By: Pascal Poupart via YouTube
Course Description
Overview
Explore distributional reinforcement learning in this 23-minute video lecture from Pascal Poupart's CS885 course at the University of Waterloo. The lecture covers the return distribution, policy evaluation, convergence, and the Bellman equation, then examines the C51 (Categorical DQN) algorithm, its advantages, and its performance on Atari games. It closes with a look at various distributional representations. Accompanying slides are available from the course website.
Syllabus
Outline
Objective
Distributional RL
Return Distribution
Policy Evaluation
Convergence
Bellman Equation
C51 (Categorical DQN)
Advantage
Atari Results
Distributional Representations
Taught by
Pascal Poupart
Related Courses
An Introduction to Functional Analysis
École Centrale Paris via Coursera
On-Ramp to AP* Calculus
Weston High School via edX
Aléatoire : une introduction aux probabilités - Partie 2
École Polytechnique via Coursera
Introduction to Stochastic Processes
Indian Institute of Technology Bombay via Swayam
Discrete Stochastic Processes
Massachusetts Institute of Technology via MIT OpenCourseWare