YoVDO

CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT

Offered By: Graham Neubig via YouTube

Tags

Natural Language Processing (NLP) Courses Transfer Learning Courses Data Augmentation Courses Multilingual Natural Language Processing Courses

Course Description

Overview

Explore data-driven strategies for Neural Machine Translation in this 41-minute lecture by Graham Neubig. Delve into various data augmentation techniques, including back translation, meta-back translation, and pivoting for high-resource languages. Learn about machine translation evaluation methods, such as BLEU scores, BERT Score, and COMET. Examine the challenges of high and low-resource languages, and discover approaches like transfer learning and dictionary-based augmentation. Gain insights into word alignment, word-by-word data augmentation, and reordering techniques to enhance translation quality.

Syllabus

Introduction
Machine Translation Evaluation
Manual Evaluation
Human Evaluation Shared Tasks
Blue Scores
Shortness Penalty
Bert Score
BlueRT
Comet
Bart Score
Meta Evaluation
Database Strategies
High and Low Resource Languages
Data Augmentation
Back Translation
Training Schedule
Generating Translations
In iterative back translation
Metaback translation
Metaback translation issues
High resource languages augmentation
High resource languages pivoting
Monolingual data copying
Transfer learning
Dictionarybased augmentation
Word alignment
Word by word data augmentation
Reordering
Assignment


Taught by

Graham Neubig

Related Courses

Structuring Machine Learning Projects
DeepLearning.AI via Coursera
Natural Language Processing on Google Cloud
Google Cloud via Coursera
Introduction to Learning Transfer and Life Long Learning (3L)
University of California, Irvine via Coursera
Advanced Deployment Scenarios with TensorFlow
DeepLearning.AI via Coursera
Neural Style Transfer with TensorFlow
Coursera Project Network via Coursera