CMU Multilingual NLP 2020 - Multilingual Training and Cross-Lingual Transfer
Offered By: Graham Neubig via YouTube
Course Description
Overview
Explore methods for training multilingual systems, zero-shot adaptation, and open problems in multilingual learning in this 40-minute lecture from CMU's CS11-737 "Multilingual Natural Language Processing" course. Delve into topics such as cross-lingual transfer, rapid adaptation of massive multilingual models, meta-learning for multilingual training, and improving zero-shot neural machine translation. Examine challenges like training data imbalance, underperformance of multilingual models compared to bilingual ones, and issues with one-to-many transfer. Learn about techniques including heuristic sampling of data, multilingual knowledge distillation, and adding language-specific layers. Gain insights into the complexities of supporting multiple languages and addressing the needs of underrepresented languages in NLP.
Syllabus
Many languages are left behind
Roadmap
Cross-lingual transfer
Supporting multiple languages could be tedious
Combining the two methods
Use case: covid-19 response
Rapid adaptation of massive multilingual models
Meta-learning for multilingual training
Multilingual NMT
Improve zero-shot NMT
Align multilingual representation
Zero-shot transfer for pretrained representations
Massively multilingual training
Training data highly imbalanced
Heuristic Sampling of Data
Learning to balance data
Problem: sometimes underperforms bilingual model
Multilingual Knowledge Distillation
Adding Language-specific layers
Problem: one-to-many transfer
Problem: multilingual
evaluation
Discussion question
Taught by
Graham Neubig
Related Courses
Building a unique NLP project: 1984 book vs 1984 albumCoursera Project Network via Coursera Exam Prep AI-102: Microsoft Azure AI Engineer Associate
Whizlabs via Coursera Amazon Echo Reviews Sentiment Analysis Using NLP
Coursera Project Network via Coursera Amazon Translate: Translate documents with batch translation
Coursera Project Network via Coursera Analyze Text Data with Yellowbrick
Coursera Project Network via Coursera