YoVDO

MIT 6.S191 - Automatic Speech Recognition

Offered By: Alexander Amini via YouTube

Tags

Natural Language Processing (NLP) Courses Deep Learning Courses Language Models Courses Encoder-Decoder Models Courses

Course Description

Overview

Explore automatic speech recognition in this MIT 6.S191 lecture featuring Rev.com experts Miguel Jetté and Jennifer Drexler. Delve into how Rev.com combines human-in-the-loop techniques with deep learning to create a cutting-edge English speech recognition engine. Learn about word error rates, data selection, speech input processing, subword units, melscale encoding, decoder mechanisms, attention-based ASR, connectionist temporal classification, and language models. Gain insights into the latest advancements in speech recognition technology and its practical applications in the industry.

Syllabus

Intro
Rev Data
Word Error Rate
Organization Entity
Test Benchmark
Data Selection
Speech Input
Subword Units
Melscale
Encoder Decoder
Speech Recognition
AttentionBased ASR
ConnectionistTemporal Classification
Language Models
Questions


Taught by

https://www.youtube.com/@AAmini/videos

Tags

Related Courses

Natural Language Processing on Google Cloud
Google Cloud via Coursera
Introduction to T5 for Sentiment Span Extraction
Abhishek Thakur via YouTube
CMU Advanced NLP 2021 - Conditional Generation
Graham Neubig via YouTube
Neural Nets for NLP 2021 - Conditioned Generation
Graham Neubig via YouTube
Neural Nets for NLP 2020: Conditioned Generation
Graham Neubig via YouTube