Preparing the Speech Dataset
Offered By: Valerio Velardo - The Sound of AI via YouTube
Course Description
Overview
Learn how to pre-process a voice dataset by extracting Mel-frequency cepstral coefficients (MFCCs) and saving them in a JSON file in this 37-minute tutorial video. Explore the Speech Commands Dataset and follow along with the provided code to prepare your audio data for deep learning applications. Gain insights into dataset overview, prerequisites, data dictionary creation, and efficient storage techniques. Perfect for those interested in audio processing and machine learning for speech recognition tasks.
Syllabus
Introduction
Speech Dataset
Dataset Overview
Preparing the Dataset
Prerequisites
Data Dictionary
Loop Free
Magic
Labels
Storage
Review
Store
Outro
Taught by
Valerio Velardo - The Sound of AI
Related Courses
Introduction to Digital Sound DesignEmory University via Coursera Foundations of Wavelets and Multirate Digital Signal Processing
Indian Institute of Technology Bombay via Swayam iOS Development for Creative Entrepreneurs
University of California, Irvine via Coursera Deploying TinyML
Harvard University via edX Digital Signal Processing
École Polytechnique Fédérale de Lausanne via Coursera