YoVDO

Extracting Training Data from Large Language Models - Paper Explained

Offered By: Yannic Kilcher via YouTube

Tags

Language Models Courses Artificial Intelligence Courses Data Privacy Courses

Course Description

Overview

Explore a comprehensive video analysis of a research paper that uncovers a method for extracting verbatim training data from large language models. Delve into the security and privacy implications for models like GPT-3, as the presenter breaks down the paper's findings, methodology, and results. Learn about eidetic memorization in language models, the adversary's objectives, and the two-step extraction method. Examine the analysis of main results, including the vulnerability of larger models, and consider proposed mitigation strategies. Gain insights into the ethical concerns surrounding the publication of large language models trained on private datasets and the potential risks of exposing personally identifiable information.

Syllabus

- Intro & Overview
- Personal Data Example
- Eidetic Memorization & Language Models
- Adversary's Objective & Outlier Data
- Ethical Hedging
- Two-Step Method Overview
- Perplexity Baseline
- Improvement via Perplexity Ratios
- Weights for Patterns & Weights for Memorization
- Analysis of Main Results
- Mitigation Strategies
- Conclusion & Comments


Taught by

Yannic Kilcher

Related Courses

Introduction to Data Analytics for Business
University of Colorado Boulder via Coursera
Digital and the Everyday: from codes to cloud
NPTEL via Swayam
Systems and Application Security
(ISC)² via Coursera
Protecting Health Data in the Modern Age: Getting to Grips with the GDPR
University of Groningen via FutureLearn
Teaching Impacts of Technology: Data Collection, Use, and Privacy
University of California, San Diego via Coursera