Building Generalist Robotics Policies from Scratch
Offered By: Montreal Robotics via YouTube
Course Description
Overview
Syllabus
Intro: ChatGPT, Language Models and the Goals of Generalist Robotics Policies
Reading and exploring the data
Creating a Dataset
Creating a Dataset
Creating the transformer encoder
Creating image patches to tokenized
Putting together the VIT
Training the VIT
Making the GRP, starting with adding text inputs
Modifying the data for training
Converting continuous actions to discrete bins
Converting continuous actions to discrete bins
Standardizing the state inputs
Changing to use continuous actions
Standizing the action space
Adding goal images to the transformer
Adding blocked masked attention to use either goal
Scaling training
Training results across A100s
Evaluation using the SimpleEnv robotics simulator
Taught by
Montreal Robotics
Related Courses
Vision Transformers Explained + Fine-Tuning in PythonJames Briggs via YouTube ConvNeXt- A ConvNet for the 2020s - Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models
HuggingFace via YouTube Intro to Dense Vectors for NLP and Vision
James Briggs via YouTube