Reinforcement Learning in Feature Space: Complexity and Regret
Offered By: Simons Institute via YouTube
Course Description
Overview
Syllabus
Intro
Markov decision process
What does a sample mean?
Complexity and Regret for Tabular MDP
Rethinking Bellman equation
State Feature Map
Representing value function using linear combination of features
Reducing Bellman equation using features
Sample complexity of RL with features
Learning to Control On-The-Fly
Episodic Reinforcement Learning
Hilbert space embedding of transition kernel
The MatrixRL Algorithm
Regret Analysis
From feature to kernel
MatrixRL has an equivalent kernelization
Pros and cons of using features for RL
What could be good state features?
Finding Metastable State Clusters
Example: stochastic diffusion process
Unsupervised state aggregation learning
Soft state aggregation for NYC taxi data
Example: State Trajectories of Demon Attack
Taught by
Simons Institute
Related Courses
Computability, Complexity & Algorithms (Georgia Institute of Technology via Udacity)
Decision Making in a Complex and Uncertain World (University of Groningen via FutureLearn)
L'avenir de la décision : connaître et agir en complexité (ESSEC Business School via Coursera)
Advanced Algorithms and Complexity (University of California, San Diego via Coursera)
Décision, Complexité, Risques (ENS de Lyon via France Université Numerique)