Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained

Offered By: Yannic Kilcher via YouTube

Course Description

Overview

Explore an innovative approach to creating commonsense knowledge graphs using large language models in this 45-minute video lecture. Learn how to leverage GPT-3's capabilities through clever prompting techniques and targeted critic models to generate high-quality symbolic knowledge. Discover the process of distilling commonsense knowledge from general language models, and understand how this method outperforms traditional human-generated corpora in quantity, quality, and diversity. Gain insights into the Symbolic Knowledge Distillation framework, its application to the ATOMIC dataset, and its potential to revolutionize the training of smaller, specialized commonsense models. Follow along as the lecture covers topics such as generating events and inferences, evaluating datasets, and using critic models to filter data, ultimately leading to key findings that challenge conventional approaches in NLP and commonsense reasoning.

Syllabus

- Intro & Overview
- Sponsor: Weights & Biases
- Commonsense Knowledge Graphs
- ATOMIC dataset
- Generating the corpus from a model
- Prompting GPT-3
- Generating Events
- Generating Inferences
- Evaluating the created dataset
- Introducing the critic
- Using the critic to filter the data
- Training a student on the generated data
- Key Findings
- Comments & Conclusion

Taught by

Yannic Kilcher

Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

Login to Continue