YoVDO

Stanford Seminar 2022 - Transformer Circuits, Induction Heads, In-Context Learning

Offered By: Stanford University via YouTube

Tags

Neural Networks Courses In-context Learning Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the fascinating world of mechanistic interpretability in neural networks through this Stanford seminar. Delve into the concept of neural network parameters as compiled computer programs and learn how to reverse engineer them into human-understandable algorithms. Focus on transformer language models and discover the significance of "induction head circuits" in enabling in-context learning. Examine how these circuits allow models to repeat text, translate, and mimic functions from earlier context. Understand the pivotal role of induction heads in driving sharp phase changes during the learning process, impacting loss curves and model learning trajectories. Gain insights from Chris Olah, co-founder of Anthropic and leader of their interpretability efforts, as he shares his expertise on AI safety and large model interpretation.

Syllabus

CS25 I Stanford Seminar 2022 - Transformer Circuits, Induction Heads, In-Context Learning


Taught by

Stanford Online

Tags

Related Courses

CMU Advanced NLP: How to Use Pre-Trained Models
Graham Neubig via YouTube
Pretraining Task Diversity and the Emergence of Non-Bayesian In-Context Learning for Regression
Simons Institute via YouTube
In-Context Learning: A Case Study of Simple Function Classes
Simons Institute via YouTube
AI Mastery: Ultimate Crash Course in Prompt Engineering for Large Language Models
Data Science Dojo via YouTube
New Summarization Techniques for LLM Applications - Building a Note-Taking App
Sam Witteveen via YouTube