YoVDO

All You Need to Know on Multilingual Sentence Vectors - 1 Model, 50+ Languages

Offered By: James Briggs via YouTube

Tags

Natural Language Processing (NLP) Courses Machine Learning Courses Model Evaluation Courses

Course Description

Overview

Explore the world of multilingual sentence vectors in this comprehensive 40-minute video tutorial. Learn how to create language-agnostic vector representations of text that enable cross-lingual comparisons. Discover the challenges of multilingual models and innovative approaches to overcome them. Dive into multi-task training techniques, including mUSE, and understand the concept of multilingual knowledge distillation. Follow a visual walkthrough of the entire process, from parallel data preparation to model evaluation. Master the art of choosing and initializing student models, working with ParallelSentencesDataset, and fine-tuning with appropriate loss functions. Gain practical insights into developing your own multilingual sentence transformers and leveraging high-performing pretrained models for various applications such as semantic search and topic modeling across multiple languages.

Syllabus

Intro
Multilingual Vectors
Multi-task Training mUSE
Multilingual Knowledge Distillation
Knowledge Distillation Training
Visual Walkthrough
Parallel Data Prep
Choosing a Student Model
Initializing the Models
ParallelSentencesDataset
Loss and Fine-tuning
Model Evaluation
Outro


Taught by

James Briggs

Related Courses

Natural Language Processing
Columbia University via Coursera
Natural Language Processing
Stanford University via Coursera
Introduction to Natural Language Processing
University of Michigan via Coursera
moocTLH: Nuevos retos en las tecnologĂ­as del lenguaje humano
Universidad de Alicante via MirĂ­adax
Natural Language Processing
Indian Institute of Technology, Kharagpur via Swayam