A Gentle Visual Intro to Transformer Models

Offered By: HuggingFace via YouTube

Course Description

Overview

Explore the inner workings of Transformer models in this 29-minute visual introduction presented by Jay Alammar from Cohere. Dive into key concepts such as encoder, decoder, and attention mechanisms. Learn about pretraining, architecture, language models, tokenization, embedding, and scaling in the context of Transformers. Gain insights from Jay's expertise, known for his popular ML blog that has helped millions understand machine learning concepts from basic to cutting-edge technologies like BERT and GPT-3.

Syllabus

Intro
Introduction
Pretraining
Architecture
Language models
Tokenization
Embedding
Language
Scaling
Questions

Taught by

Hugging Face

Related Courses

Sentiment Analysis with Deep Learning using BERT
Coursera Project Network via Coursera Natural Language Processing with Attention Models
DeepLearning.AI via Coursera Fine Tune BERT for Text Classification with TensorFlow
Coursera Project Network via Coursera Deploy a BERT question answering bot on Django
Coursera Project Network via Coursera Generating discrete sequences: language and music
Ural Federal University via edX