Population-Based Methods for Single- and Multi-Agent Reinforcement Learning - Lecture
Offered By: USC Information Sciences Institute via YouTube
Course Description
Overview
Syllabus
Welcome to the Al Seminar Series
Reinforcement Learning (RL)
RL basics
Deep Q-learning (DQN)
Why use target network?
Why reduce estimation variance
Ensemble RL methods
Ensemble RL for variance reduction
MeanQ design choices
Combining with existing techniques
Experiment results (100K interaction steps)
Obviating the target network
Comparing model size and update rate
MeanQ: variance reduction
Loss of ensemble diversity
Linear function approximation
Diversity through independent sampling
Ongoing investigation
Takeaways
Fictitious Play
What to do in large dynamical environments
PSRO convergence properties
Extensive-Form Double Oracle (XDO)
XDO: results
XDO convergence properties
Taught by
USC Information Sciences Institute
Related Courses
Tensorflow Neural Networks using Deep Q-Learning TechniquesCoursera Project Network via Coursera Artificial Intelligence for Business + ChatGPT Prize [2024]
Udemy Artificial Intelligence A-Z 2024: Build 7 AI + LLM & ChatGPT
Udemy Deep Reinforcement Learning: Hands-on AI Tutorial in Python
Udemy Intelligence Artificielle de A à Z
Udemy