Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained
Offered By: Yannic Kilcher via YouTube
Course Description
Overview
Explore an innovative approach to creating commonsense knowledge graphs using large language models in this 45-minute video lecture. Learn how to leverage GPT-3's capabilities through clever prompting techniques and targeted critic models to generate high-quality symbolic knowledge. Discover the process of distilling commonsense knowledge from general language models, and understand how this method outperforms traditional human-generated corpora in quantity, quality, and diversity. Gain insights into the Symbolic Knowledge Distillation framework, its application to the ATOMIC dataset, and its potential to revolutionize the training of smaller, specialized commonsense models. Follow along as the lecture covers topics such as generating events and inferences, evaluating datasets, and using critic models to filter data, ultimately leading to key findings that challenge conventional approaches in NLP and commonsense reasoning.
Syllabus
- Intro & Overview
- Sponsor: Weights & Biases
- Commonsense Knowledge Graphs
- ATOMIC dataset
- Generating the corpus from a model
- Prompting GPT-3
- Generating Events
- Generating Inferences
- Evaluating the created dataset
- Introducing the critic
- Using the critic to filter the data
- Training a student on the generated data
- Key Findings
- Comments & Conclusion
Taught by
Yannic Kilcher
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent