YoVDO

Provably Efficient Reinforcement Learning with Linear Function Approximation - Chi Jin

Offered By: Institute for Advanced Study via YouTube

Tags

Reinforcement Learning Courses Q-learning Courses Multi-Armed Bandits Courses Sequential Decision Making Courses

Course Description

Overview

Explore provably efficient reinforcement learning with linear function approximation in this 28-minute lecture from the Workshop on Theory of Deep Learning. Delve into sequential decision making, sample efficiency, and value-based algorithms as Chi Jin, a Member of the School of Mathematics at the Institute for Advanced Study, presents cutting-edge research. Examine exploration techniques, including multi-armed bandits and Upper Confidence Bound (UCB), before moving beyond tabular settings to linear function approximation. Investigate linear MDPs and related work in this comprehensive overview of reinforcement learning theory and applications.

Syllabus

Intro
Sequential Decision Making
Reinforcement Learning
Sample Efficiency
Value-based Algorithms
Exploration
Multi-armed Bandits
Upper Confidence Bound (UCB)
Q-learning with UCB
Beyond Tabular Setting
Linear Function Approximation
A Natural Algorithm
Linear MDP
Related Work


Taught by

Institute for Advanced Study

Related Courses

Inside TensorFlow - TF-Agents
TensorFlow via YouTube
Exploration with Limited Memory - Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-Armed Bandits
Association for Computing Machinery (ACM) via YouTube
How Slot Machines Are Advancing the State of the Art in Computer Go AI
Churchill CompSci Talks via YouTube
CS885: Multi-Armed Bandits
Pascal Poupart via YouTube
Game Theoretic Learning and Spectrum Management - Part 2
IEEE Signal Processing Society via YouTube