YoVDO

CMU Multilingual NLP 2022 - Speech

Offered By: Graham Neubig via YouTube

Tags

Natural Language Processing (NLP) Courses Text to Speech Courses Multilingual Natural Language Processing Courses

Course Description

Overview

Explore the fundamentals of speech processing and its applications in this comprehensive lecture by Shinji Watanabe. Delve into the nature of speech, its various applications, and the importance of speech databases. Examine the hierarchical structure of speech and gain insights into key topics such as speech waveforms, coding, and the information contained within speech sounds. Learn about Automatic Speech Recognition (ASR), Text-to-Speech (TTS) synthesis, and speech translation. Discuss privacy concerns in speech technology and explore speech enhancement techniques. Investigate different types of speech variations, including speaking styles and environments, with examples of read and spontaneous speech. Discover the sources of speech data used in research and development. Access accompanying slides and references for further study.

Syllabus

Intro
What is speech???
Speech waveform
What kind of information does speech sound contain?
Speech coding
Automatic Speech Recognition (ASR)
Speech Synthesis (TTS: Text to Speech)
Speech Translation
Privacy in speech
Speech enhancement Several types of problems
Speech variations Speaking styles and environments
Read speech examples
Spontaneous speech
Where we found the speech data?


Taught by

Graham Neubig

Related Courses

CMU Multilingual NLP - The LORELEI Project
Graham Neubig via YouTube
Multilingual NLP 2022 - Language Contact and Change
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Typology
Graham Neubig via YouTube
CMU Multilingual NLP 2022 - Words, Parts of Speech, Morphology
Graham Neubig via YouTube