ADSI Summer Workshop: Algorithmic Foundations of Learning and Control - Emma Brunskill
Offered By: Paul G. Allen School via YouTube
Course Description
Overview
Syllabus
Intro
Legacy of Reinforcement Learning to Benefit People
Techniques to Minimize & Understand Data Needed to Learn to Make Good Decisions
Challenge: Covariate Shift Different Policies-- Different Actions - Different State Distributions
Quest: Batch Policy Optimization w/ Generalization Bounds
Recall: Importance Sampling for RL Batch Policy Evaluation
1st Proof of Convergence to a Local Optima for Batch Policy Gradient
Experiment Settings
HIV treatment simulator
Aim: Strong Generalization Guarantees on Policy Performance, Alternative: Guarantee Find Best in Class Policy
Example: Linear Thresholding Policies
An Advantage Decomposition
Advantage Doubly Robust (ADR) Estimator
Quest for Batch Policy Optimization with Generalization Guarantees
Taught by
Paul G. Allen School
Related Courses
Infinite Memory Transformer - Research Paper ExplainedYannic Kilcher via YouTube Rendering Games with Millions of Ray Traced Lights
Nvidia via YouTube Brooklyn Quant Experience Lecture Series - Fourier-Based Methods for Complex Insurance Products Management
New York University (NYU) via YouTube Batch Offline Reinforcement Learning - Part 1
Simons Institute via YouTube A Classical Algorithm Framework for Dequantizing Quantum Machine Learning
Simons Institute via YouTube