YoVDO

Target-Speaker Methods for Speech Recognition - Overlapping Speech Solutions

Offered By: Center for Language & Speech Processing(CLSP), JHU via YouTube

Tags

Speech Recognition Courses Signal Processing Courses GPU Acceleration Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore cutting-edge techniques for tackling overlapping speech in multi-talker Automatic Speech Recognition (ASR) applications through this 52-minute talk by Desh Raj from the Center for Language & Speech Processing at Johns Hopkins University. Delve into the world of "target-speaker" methods, starting with a traditional signal processing approach and its new GPU-accelerated implementation that dramatically speeds up meeting transcription. Learn about an innovative project leveraging wake-words for on-device target-speaker ASR, resulting in significant Word Error Rate (WER) reductions. Discover how self-supervised models can be incorporated into this paradigm to further enhance speech recognition capabilities. Gain valuable insights into overcoming challenges in creating effective ASR systems for complex audio environments such as meeting transcription and smart assistants in noisy settings.

Syllabus

Target-speaker Methods for Speech Recognition – Desh Raj


Taught by

Center for Language & Speech Processing(CLSP), JHU

Related Courses

Survey of Music Technology
Georgia Institute of Technology via Coursera
Fundamentals of Electrical Engineering Laboratory
Rice University via Coursera
Critical Listening for Studio Production
Queen's University Belfast via FutureLearn
Fundamentos de Comunicaciones Ópticas
Universitat Politècnica de València via UPV [X]
Sense101x: Sense, Control, Act: Measure the Universe, Transform the World
University of Queensland via edX