Reinforcement Learning Courses
Amazon Web Services via AWS Skill Builder Direct Preference Optimization (DPO): How It Works and How It Topped an LLM Eval Leaderboard
Snorkel AI via YouTube Unsupervised Environment Design
Cooperative AI Foundation via YouTube Self-Play in Artificial Intelligence - Lecture by Noam Brown
Cooperative AI Foundation via YouTube Concordia Contest 2024 - Basic Agent Development Tutorial
Cooperative AI Foundation via YouTube Learning to Cooperate and Compete via Self Play
Cooperative AI Foundation via YouTube RLHF: How to Learn from Human Feedback with Reinforcement Learning
Cooperative AI Foundation via YouTube Aligning AI to Everyone via Reinforcement Learning
Cooperative AI Foundation via YouTube Opponent-Shaping and Interference in General-Sum Games
Cooperative AI Foundation via YouTube How to Go Beyond Research
Cooperative AI Foundation via YouTube