A Gentle Visual Intro to Transformer Models
Offered By: HuggingFace via YouTube
Course Description
Overview
Explore the inner workings of Transformer models in this 29-minute visual introduction presented by Jay Alammar from Cohere. Dive into key concepts such as encoder, decoder, and attention mechanisms. Learn about pretraining, architecture, language models, tokenization, embedding, and scaling in the context of Transformers. Gain insights from Jay's expertise, known for his popular ML blog that has helped millions understand machine learning concepts from basic to cutting-edge technologies like BERT and GPT-3.
Syllabus
Intro
Introduction
Pretraining
Architecture
Language models
Tokenization
Embedding
Language
Scaling
Questions
Taught by
Hugging Face
Related Courses
How to Build Codex SolutionsMicrosoft via YouTube Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube Building Intelligent Applications with World-Class AI
Microsoft via YouTube Stanford Seminar - Transformers in Language: The Development of GPT Models Including GPT-3
Stanford University via YouTube ChatGPT: GPT-3, GPT-4 Turbo: Unleash the Power of LLM's
Udemy