YoVDO

I-Vector Representation Based on GMM and DNN for Audio Classification

Offered By: Center for Language & Speech Processing(CLSP), JHU via YouTube

Tags

Machine Learning Courses Deep Neural Networks Courses Gaussian Mixture Models Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the state-of-the-art I-vector approach for audio classification tasks in this 57-minute lecture by Najim Dehak from the Center for Language & Speech Processing at Johns Hopkins University. Delve into the modeling and capturing of variability in Gaussian Mixture Model (GMM) mean components across audio recordings. Discover recent subspace approaches like Non-negative Factor Analysis (NFA) and Subspace Multinomial Model (SMM) that focus on GMM weights variability. Learn how these techniques can be applied to model hidden layer neuron activations in deep neural networks for sequential data recognition tasks such as language and dialect recognition. Gain insights from Dehak's extensive background in artificial intelligence, pattern recognition, and speech processing, including his pivotal role in developing the i-vector framework for speaker verification.

Syllabus

I-vector representation based on GMMand DNN for audio classification - Najim Dehak (JHU)


Taught by

Center for Language & Speech Processing(CLSP), JHU

Related Courses

Cluster Analysis and Unsupervised Machine Learning in Python
Udemy
Unsupervised Learning
Serrano.Academy via YouTube
Hands on Machine Learning - Unsupervised Learning
Shashank Kalanithi via YouTube
PCA, AE, K-Means, Gaussian Mixture Model, Sparse Coding, and Intuitive VAE
Alfredo Canziani via YouTube
Small Covers for Near-zero Sets of Polynomials and Learning Latent Variable Models
IEEE via YouTube