Online Learning in Markov Decision Processes - Part 1
Offered By: Simons Institute via YouTube
Course Description
Overview
Explore the fundamentals of online learning in Markov Decision Processes (MDPs) through this comprehensive lecture by Ambuj Tewari from the University of Michigan. Delve into key concepts such as online learning theory, E-Cube, R-Max, and the general U principle. Gain insights into algorithm design, notation, and MDPs. Understand optimal MDPs, Bellman equations, and Bellman's theorem. Analyze the optimal approach to online learning in MDPs. This talk, part of the Theory of Reinforcement Learning Boot Camp at the Simons Institute, provides a thorough introduction to the subject and addresses important questions in the field.
Syllabus
Introduction
Online Learning
Theory
ECube
RMax
General of U principle
Algorithm Design
Notation
MDPs
Optimal MDP
Questions
Bellman Equation
Bellman Theorem
Analysis
Optimal
Taught by
Simons Institute
Related Courses
Algorithms: Design and Analysis, Part 2Stanford University via Coursera Discrete Optimization
University of Melbourne via Coursera Conception et mise en œuvre d'algorithmes.
École Polytechnique via Coursera Computability, Complexity & Algorithms
Georgia Institute of Technology via Udacity Discrete Inference and Learning in Artificial Vision
École Centrale Paris via Coursera