YoVDO

Learning to Act by Watching Unlabeled Online Videos - Paper Explained

Offered By: Yannic Kilcher via YouTube

Tags

Machine Learning Courses Reinforcement Learning Courses Fine-Tuning Courses

Course Description

Overview

Explore a comprehensive analysis of OpenAI's Video PreTraining (VPT) technique for tackling the complex challenge of creating an AI agent capable of playing Minecraft. Delve into the innovative approach of leveraging a small set of labeled contractor data to pseudo-label a vast corpus of scraped gameplay footage. Discover the model architecture, experimental results, and fine-tuning processes that led to the first Minecraft agent achieving the impressive feat of crafting a diamond pickaxe autonomously. Learn about the semi-supervised imitation learning method used to extend internet-scale pretraining to sequential decision domains, and understand how this behavioral prior demonstrates nontrivial zero-shot capabilities. Gain insights into the potential applications of this technology in robotics, video games, and computer use, as well as the hardware considerations for implementing such advanced AI systems.

Syllabus

- Intro
- How to spend money most effectively?
- Getting a large dataset with labels
- Model architecture
- Experimental results and fine-tuning
- Reinforcement Learning to the Diamond Pickaxe
- Final comments and hardware


Taught by

Yannic Kilcher

Related Courses

Amazon SageMaker JumpStart Foundations (Japanese)
Amazon Web Services via AWS Skill Builder
AWS Flash - Generative AI with Diffusion Models
Amazon Web Services via AWS Skill Builder
AWS Flash - Operationalize Generative AI Applications (FMOps/LLMOps)
Amazon Web Services via AWS Skill Builder
AWS SimuLearn: Automate Fine-Tuning of an LLM
Amazon Web Services via AWS Skill Builder
AWS SimuLearn: Fine-Tune a Base Model with RLHF
Amazon Web Services via AWS Skill Builder