CMU Multilingual NLP 2020 - Data Annotation
Offered By: Graham Neubig via YouTube
Course Description
Overview
Learn about data annotation techniques, tools, and best practices for multilingual natural language processing in this 53-minute lecture from CMU's CS11-737 course. Explore topics such as finding and managing labelers, defining annotation tasks, interactive dictionary learning, and ensuring consistency and efficiency in the labeling process. Gain insights into popular labeling toolkits like BRAT and discover strategies for building effective multilingual NLP datasets.
Syllabus
Data Annotation
Finding Labelers
Non-native labeling
What can be labeled
Defining the task
Dictionary: Interactive Learning
Spice: Lex Learner Build Your User awb Language eng Project aug 19 Logout
Checking the Labeling
Checking for consistency
Checking for efficiency
Checking for usefulness
Choose the Task
Labeling Toolkits
BRAT Labeling Tool
Taught by
Graham Neubig
Related Courses
CMU Multilingual NLP - The LORELEI ProjectGraham Neubig via YouTube CMU Multilingual NLP 2022 - Speech
Graham Neubig via YouTube Multilingual NLP 2022 - Language Contact and Change
Graham Neubig via YouTube CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT
Graham Neubig via YouTube CMU Multilingual NLP 2022 - Typology
Graham Neubig via YouTube