On the Statistical Complexity of Reinforcement Learning
Offered By: Institute for Pure & Applied Mathematics (IPAM) via YouTube
Course Description
Overview
Syllabus
Intro
Tabular Markov decision process
Prior efforts: algorithms and sample complexity results
Minimax optimal sample complexity of tabular MDP
Adding some structure: state feature map
Representing value function using linear combination of features
Rethinking Bellman equation
Reducing Bellman equation using features
Sample complexity of RL with features
Of-Policy Policy Evaluation (OPE)
OPE with function approximation
Equivalence to plug-in estimation
Minimax-optimal batch policy evaluation
Lower Bound Analysis
Episodic Reinforcement Learning
Feature space embedding of transition kernel
Regret Analysis
Exploration with Value-Targeted Regression VTAL
Taught by
Institute for Pure & Applied Mathematics (IPAM)
Related Courses
Reinforcement LearningIndian Institute of Technology Madras via Swayam Numerical Analysis
Vidyasagar University via Swayam Reinforcement Learning Course
YouTube Numerical Methods in EXCEL/VBA PROGRAMING
Udemy Taylor Series
3Blue1Brown via YouTube