YoVDO

Planning to Explore via Self-Supervised World Models - Paper Explained

Offered By: Yannic Kilcher via YouTube

Tags

Machine Learning Courses Artificial Intelligence Courses Reinforcement Learning Courses Intrinsic Motivation Courses

Course Description

Overview

Explore a groundbreaking approach to self-supervised reinforcement learning in this 35-minute video explanation. Dive into the Plan2Explore model, which enables agents to efficiently explore their environment without predefined rewards. Learn how this innovative method uses planning in a learned imaginary latent world model to seek out uncertain states, improving upon traditional intrinsic reward formulations. Discover the key components of the model, including intrinsic motivation, planning in latent space, and latent disagreement. Understand how Plan2Explore maximizes information gain and tackles challenges in reinforcement learning. Examine the experimental results and final insights provided by the presenter, Yannic Kilcher. Gain a comprehensive understanding of this novel technique that enhances sample efficiency and enables quick adaptation to multiple downstream tasks in zero or few-shot scenarios.

Syllabus

- Intro & Problem Statement
- Model
- Intrinsic Motivation
- Planning in Latent Space
- Latent Disagreement
- Maximizing Information Gain
- More problems with the model
- Experiments
- Final Comments


Taught by

Yannic Kilcher

Related Courses

Business Considerations for 5G with Edge, IoT, and AI
Linux Foundation via edX
FinTech for Finance and Business Leaders
ACCA via edX
AI-900: Microsoft Certified Azure AI Fundamentals
A Cloud Guru
AWS Certified Machine Learning - Specialty (LA)
A Cloud Guru
Azure AI Components and Services
A Cloud Guru