CMU Multilingual NLP 2020 - Data Annotation
Offered By: Graham Neubig via YouTube
Course Description
Overview
Learn about data annotation techniques, tools, and best practices for multilingual natural language processing in this 53-minute lecture from CMU's CS11-737 course. Explore topics such as finding and managing labelers, defining annotation tasks, interactive dictionary learning, and ensuring consistency and efficiency in the labeling process. Gain insights into popular labeling toolkits like BRAT and discover strategies for building effective multilingual NLP datasets.
Syllabus
Data Annotation
Finding Labelers
Non-native labeling
What can be labeled
Defining the task
Dictionary: Interactive Learning
Spice: Lex Learner Build Your User awb Language eng Project aug 19 Logout
Checking the Labeling
Checking for consistency
Checking for efficiency
Checking for usefulness
Choose the Task
Labeling Toolkits
BRAT Labeling Tool
Taught by
Graham Neubig
Related Courses
Natural Language ProcessingColumbia University via Coursera Natural Language Processing
Stanford University via Coursera Introduction to Natural Language Processing
University of Michigan via Coursera moocTLH: Nuevos retos en las tecnologĂas del lenguaje humano
Universidad de Alicante via MirĂadax Natural Language Processing
Indian Institute of Technology, Kharagpur via Swayam