Off-Policy Policy Optimization
Offered By: Simons Institute via YouTube
Course Description
Overview
Explore off-policy policy optimization in reinforcement learning with Dale Schuurmans from Google Brain and the University of Alberta in this 53-minute lecture. Delve into key concepts including the RL problem, batch policy optimization, and optimization objectives. Compare supervised and reinforcement learning approaches, and examine missing data inference in the context of sequential decision making. Gain insights into the emerging challenges in deep learning as applied to reinforcement learning algorithms and policy optimization techniques.
Syllabus
Intro
The RL problem
Batch policy optimization
Optimization objectives
Supervised vs reinforcement learning
Missing data inference
Sequential decision making
Sequential RL
Taught by
Simons Institute
Related Courses
Toward Generalizable Embodied AI for Machine AutonomyBolei Zhou via YouTube What Are the Statistical Limits of Offline Reinforcement Learning With Function Approximation?
Simons Institute via YouTube Better Learning from the Past - Counterfactual - Batch RL
Simons Institute via YouTube Provably Efficient Reinforcement Learning with Linear Function Approximation - Chi Jin
Institute for Advanced Study via YouTube Divide-and-Conquer Monte Carlo Tree Search for Goal-Directed Planning - Paper Explained
Yannic Kilcher via YouTube