Faster Saddle-Point Optimization for Solving Large-Scale Markov Decision Processes
Offered By: VinAI via YouTube
Course Description
Overview
Explore a seminar on advanced optimization techniques for solving large-scale Markov decision processes. Delve into the research of Joan Bas Serrano, a PhD student focusing on theoretical reinforcement learning at Universitat Pompeu Fabra. Examine the linear programming formulation of MDPs and saddle-point optimization theory applied to average-reward Markov decision processes. Discover a novel approach to computing optimal policies using a linearly relaxed version of the saddle-point problem. Analyze the conditions necessary for convergence to the optimal policy and learn about an optimization algorithm designed for fast convergence rates independent of state space size. Gain insights into potential issues with previous work in this area and understand the implications for future research in reinforcement learning algorithms.
Syllabus
Seminar Series: Faster saddle-point optimization for solving large-scale Markov decision processes
Taught by
VinAI
Related Courses
Latent State Recovery in Reinforcement Learning - John LangfordInstitute for Advanced Study via YouTube On the Critic Function of Implicit Generative Models - Arthur Gretton
Institute for Advanced Study via YouTube Priors for Semantic Variables - Yoshua Bengio
Institute for Advanced Study via YouTube Instance-Hiding Schemes for Private Distributed Learning
Institute for Advanced Study via YouTube Learning Probability Distributions - What Can, What Can't Be Done - Shai Ben-David
Institute for Advanced Study via YouTube