Faster Saddle-Point Optimization for Solving Large-Scale Markov Decision Processes

Offered By: VinAI via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore a seminar on advanced optimization techniques for solving large-scale Markov decision processes. Delve into the research of Joan Bas Serrano, a PhD student focusing on theoretical reinforcement learning at Universitat Pompeu Fabra. Examine the linear programming formulation of MDPs and saddle-point optimization theory applied to average-reward Markov decision processes. Discover a novel approach to computing optimal policies using a linearly relaxed version of the saddle-point problem. Analyze the conditions necessary for convergence to the optimal policy and learn about an optimization algorithm designed for fast convergence rates independent of state space size. Gain insights into potential issues with previous work in this area and understand the implications for future research in reinforcement learning algorithms.

Syllabus

Seminar Series: Faster saddle-point optimization for solving large-scale Markov decision processes

Taught by

VinAI

Related Courses

Latent State Recovery in Reinforcement Learning - John Langford
Institute for Advanced Study via YouTube On the Critic Function of Implicit Generative Models - Arthur Gretton
Institute for Advanced Study via YouTube Priors for Semantic Variables - Yoshua Bengio
Institute for Advanced Study via YouTube Instance-Hiding Schemes for Private Distributed Learning
Institute for Advanced Study via YouTube Learning Probability Distributions - What Can, What Can't Be Done - Shai Ben-David
Institute for Advanced Study via YouTube