MIT 6.S191 - Automatic Speech Recognition
Offered By: Alexander Amini via YouTube
Course Description
Overview
Explore automatic speech recognition in this MIT 6.S191 lecture featuring Rev.com experts Miguel Jetté and Jennifer Drexler. Delve into how Rev.com combines human-in-the-loop techniques with deep learning to create a cutting-edge English speech recognition engine. Learn about word error rates, data selection, speech input processing, subword units, melscale encoding, decoder mechanisms, attention-based ASR, connectionist temporal classification, and language models. Gain insights into the latest advancements in speech recognition technology and its practical applications in the industry.
Syllabus
Intro
Rev Data
Word Error Rate
Organization Entity
Test Benchmark
Data Selection
Speech Input
Subword Units
Melscale
Encoder Decoder
Speech Recognition
AttentionBased ASR
ConnectionistTemporal Classification
Language Models
Questions
Taught by
https://www.youtube.com/@AAmini/videos
Tags
Related Courses
Building a unique NLP project: 1984 book vs 1984 albumCoursera Project Network via Coursera Exam Prep AI-102: Microsoft Azure AI Engineer Associate
Whizlabs via Coursera Amazon Echo Reviews Sentiment Analysis Using NLP
Coursera Project Network via Coursera Amazon Translate: Translate documents with batch translation
Coursera Project Network via Coursera Analyze Text Data with Yellowbrick
Coursera Project Network via Coursera