Reinforcement Learning
Offered By: Simons Institute via YouTube
Course Description
Overview
Syllabus
Intro
Birds-eye view of RL
Illustrative application: RL in personal health
General thrust
Direction: Exploiting structure in RL
Vignette: Q-learning with low rank structure
Vignette: Model-free versus model-based method
Estimate dynamics or value functions for LQR? - Linear state space model with quadratic reward function
Performance of LSTD versus model-based metho
Direction: Exploration/exploitation beyond bandi
Vignette: Q-learning with UCB
Vignette: UCB and Monte Carlo Tree Search
Direction: From worst-case to instance-optimalit
Vignette: Instance-optimality of TD learning?
Instance-optimality in policy evaluation
Direction: RL in offline settings and causal inferen
Some future directions exploiting methods from cal inferences instrumental variables propensity score, doubly robust methods, synthetic controls
Taught by
Simons Institute
Related Courses
Can Wikipedia Help Offline Reinforcement Learning - Author InterviewYannic Kilcher via YouTube Can Wikipedia Help Offline Reinforcement Learning? - Paper Explained
Yannic Kilcher via YouTube CAP6412 - Final Project Presentations - Lecture 27
University of Central Florida via YouTube Offline Reinforcement Learning and Model-Based Optimization
Simons Institute via YouTube What Are the Statistical Limits of Offline Reinforcement Learning With Function Approximation?
Simons Institute via YouTube