YoVDO

CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT

Offered By: Graham Neubig via YouTube

Tags

Natural Language Processing (NLP) Courses
Transfer Learning Courses
Data Augmentation Courses
Multilingual Natural Language Processing Courses

Course Description

Overview

Explore data-driven strategies for neural machine translation (NMT) in this 41-minute lecture by Graham Neubig. Delve into data augmentation techniques, including back translation, meta back translation, and pivoting through high-resource languages. Learn about machine translation evaluation methods, such as BLEU, BERTScore, and COMET. Examine the challenges of high- and low-resource languages, and discover approaches like transfer learning and dictionary-based augmentation. Gain insights into word alignment, word-by-word data augmentation, and reordering techniques to enhance translation quality.

Syllabus

Introduction
Machine Translation Evaluation
Manual Evaluation
Human Evaluation Shared Tasks
BLEU Scores
Brevity Penalty
BERTScore
BLEURT
COMET
BARTScore
Meta Evaluation
Data-Driven Strategies
High- and Low-Resource Languages
Data Augmentation
Back Translation
Training Schedule
Generating Translations
Iterative back translation
Meta back translation
Meta back translation issues
High-resource language augmentation
High-resource language pivoting
Monolingual data copying
Transfer learning
Dictionary-based augmentation
Word alignment
Word-by-word data augmentation
Reordering
Assignment


Taught by

Graham Neubig

Related Courses

Building a unique NLP project: 1984 book vs 1984 album
Coursera Project Network via Coursera
Exam Prep AI-102: Microsoft Azure AI Engineer Associate
Whizlabs via Coursera
Amazon Echo Reviews Sentiment Analysis Using NLP
Coursera Project Network via Coursera
Amazon Translate: Translate documents with batch translation
Coursera Project Network via Coursera
Analyze Text Data with Yellowbrick
Coursera Project Network via Coursera