YoVDO

Distributional RL

Offered By: Pascal Poupart via YouTube

Tags

Bellman Equations Courses Convergence Courses

Course Description

Overview

Explore distributional reinforcement learning in this 23-minute video lecture from Pascal Poupart's CS885 course at the University of Waterloo. Delve into key concepts including return distribution, policy evaluation, convergence, and the Bellman equation. Examine the C51 (Categorical DQN) algorithm, its advantages, and its performance on Atari games. Gain insights into various distributional representations and their applications in reinforcement learning. Access accompanying slides from the course website to enhance your understanding of this advanced topic in machine learning and artificial intelligence.

Syllabus

Outline
Objective
Distributional RL
Return Distribution
Policy Evaluation
Convergence
Bellman Equation
C51 (Categorical DQN)
Advantage
Atari Results
Distributional Representations


Taught by

Pascal Poupart

Related Courses

Deep Reinforcement Learning
Nvidia Deep Learning Institute via Udacity
Reinforcement Learning
Edureka
Fundamentals of Deep Reinforcement Learning
Learn Ventures via edX
A Friendly Introduction to Deep Reinforcement Learning, Q-Networks and Policy Gradients
Serrano.Academy via YouTube
Deep Robust Reinforcement Learning and Regularization
Simons Institute via YouTube