Generative Model-Based Text-to-Speech Synthesis
Offered By: MITCBMM via YouTube
Course Description
Overview
Syllabus
Intro
Outline
Text-to-speech as sequence-to-sequence mapping
Speech production process
Typical flow of TTS system
Speech synthesis approaches
Probabilistic formulation of TTS
Approximation (2)
Representation - Linguistic features
Representation - Acoustic features
Representation - Mapping
HMM-based generative acoustic model for TTS
Alternative acoustic model
FFNN-based acoustic model for TTS [6]
NN-based generative acoustic model for TTS
NN-based generative model for TTS
Learned features
WaveNet: A generative model for raw audio
WaveNet - Causal dilated convolution
WaveNet - Architecture
Softmax
WaveNet vs conventional audio generative models
Relax approximation
Generative model-based text-to-speech synthesis
Beyond text-to-speech synthesis
Beyond generative TTS
Taught by
MITCBMM
Related Courses
Neural Networks for Machine LearningUniversity of Toronto via Coursera Good Brain, Bad Brain: Basics
University of Birmingham via FutureLearn Statistical Learning with R
Stanford University via edX Machine Learning 1—Supervised Learning
Brown University via Udacity Fundamentals of Neuroscience, Part 2: Neurons and Networks
Harvard University via edX