YoVDO

CMU Multilingual NLP 2020 - Automatic Speech Recognition

Offered By: Graham Neubig via YouTube

Tags

Natural Language Processing (NLP) Courses Training Models Courses Multilingual Natural Language Processing Courses

Course Description

Overview

Learn about automatic speech recognition in this 40-minute lecture from CMU's Multilingual Natural Language Processing course. Explore pronunciation modeling, acoustic modeling, and language modeling. Dive into topics like voice dialing systems, dynamic time warping, template matching, and training acoustic models. Understand how to estimate language model costs, measure ASR success, and discuss key points in speech recognition technology. Access additional course materials through the provided class website.

Syllabus

Automatic Speech Recognition
Voice Dialing System
Matching in Frequency Domain
Dynamic Time Warping
DTW algorithm
Matching Templates
DTW issues
More reliable matching
More reliable distances
Extending template model
Training an acoustic model
Language Model Estimate cost of sequence of words in the language • Need appropriate training data
Pronunciation Model
Measuring ASR Success
How good is good?
ASR Discussion Point


Taught by

Graham Neubig

Related Courses

CMU Multilingual NLP - The LORELEI Project
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Speech
Graham Neubig via YouTube
Multilingual NLP 2022 - Language Contact and Change
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Typology
Graham Neubig via YouTube