Population-Based Methods for Single- and Multi-Agent Reinforcement Learning - Lecture
Offered By: USC Information Sciences Institute via YouTube
Course Description
Overview
Syllabus
Welcome to the AI Seminar Series
Reinforcement Learning (RL)
RL basics
Deep Q-learning (DQN)
Why use a target network?
Why reduce estimation variance?
Ensemble RL methods
Ensemble RL for variance reduction
MeanQ design choices
Combining with existing techniques
Experiment results (100K interaction steps)
Obviating the target network
Comparing model size and update rate
MeanQ: variance reduction
Loss of ensemble diversity
Linear function approximation
Diversity through independent sampling
Ongoing investigation
Takeaways
Fictitious Play
What to do in large dynamical environments
PSRO convergence properties
Extensive-Form Double Oracle (XDO)
XDO: results
XDO convergence properties
Taught by
USC Information Sciences Institute
Related Courses
機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera
Обучение на размеченных данных (Learning from Labeled Data)
Moscow Institute of Physics and Technology via Coursera
Modélisez vos données avec les méthodes ensemblistes (Model Your Data with Ensemble Methods)
CentraleSupélec via OpenClassrooms
Supervised Machine Learning: Classification
IBM via Coursera
Machine Learning Under the Hood: The Technical Tips, Tricks, and Pitfalls
SAS via Coursera