YoVDO

CMU Multilingual NLP: Active Learning

Offered By: Graham Neubig via YouTube

Tags

Natural Language Processing (NLP) Courses Active Learning Courses Multilingual Natural Language Processing Courses

Course Description

Overview

Explore active learning techniques for natural language processing in this 29-minute lecture from CMU's Multilingual NLP course. Delve into token-level and sequence-level active learning, examining uncertainty paradigms, query by committee, and various uncertainty measures. Gain insights into human effort considerations in active learning, including cost assessment and annotation reusability. Learn how to implement an effective active learning pipeline and understand its importance in multilingual NLP applications. Engage with a discussion question to reinforce your understanding of the presented concepts.

Syllabus

Intro
Types of Learning
Active Learning Pipeline
Why Active Learning?
Fundamental Ideas
Uncertainty Paradigms
Query by Committee
Sequence-level Uncertainty Measures
Training on Token Level
Token-level Representativeness Metrics
Sequence-to-sequence Uncertainty Metrics
Human Effort and Active Learning • In simulation, it's common to assess active learning based on words/sentences annotated
Considering Cost in Active Learning
Reusability of Active Learning Annotations
Discussion Question


Taught by

Graham Neubig

Related Courses

CMU Multilingual NLP - The LORELEI Project
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Speech
Graham Neubig via YouTube
Multilingual NLP 2022 - Language Contact and Change
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Typology
Graham Neubig via YouTube