Pre-Processing Audio with Different Durations
Offered By: Valerio Velardo - The Sound of AI via YouTube
Course Description
Overview
Learn how to preprocess audio clips of different durations so that every example has the same length. This tutorial uses PyTorch and torchaudio to cut waveforms that are too long and zero-pad waveforms that are too short, with code examples provided. The instructor sets a NUM_SAMPLES constant, passes it to UrbanSoundDataset, and updates the __getitem__ method so that each waveform is cut or right-padded to that length. The scripts are then run to verify that both padding and cutting work as expected. Suited to learners who want to prepare audio data for machine learning models.
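The cut and right-pad steps described above can be sketched as follows. This is a minimal illustration, not the instructor's code: the helper names (cut_if_necessary, right_pad_if_necessary, fix_length) and the NUM_SAMPLES value of 22050 are assumptions, and random tensors stand in for waveforms that would normally be loaded with torchaudio. Waveforms are assumed to have shape (channels, samples).

```python
import torch
import torch.nn.functional as F

# Assumed target length; the value used in the video may differ.
NUM_SAMPLES = 22050


def cut_if_necessary(signal: torch.Tensor, num_samples: int) -> torch.Tensor:
    """Keep only the first num_samples samples if the signal is longer."""
    if signal.shape[1] > num_samples:
        signal = signal[:, :num_samples]
    return signal


def right_pad_if_necessary(signal: torch.Tensor, num_samples: int) -> torch.Tensor:
    """Append zeros on the right if the signal is shorter than num_samples."""
    length = signal.shape[1]
    if length < num_samples:
        num_missing = num_samples - length
        # (0, num_missing): pad the last dimension with 0 values on the left
        # and num_missing zeros on the right.
        signal = F.pad(signal, (0, num_missing))
    return signal


def fix_length(signal: torch.Tensor, num_samples: int = NUM_SAMPLES) -> torch.Tensor:
    """Cut or right-pad so the result always has exactly num_samples samples."""
    signal = cut_if_necessary(signal, num_samples)
    return right_pad_if_necessary(signal, num_samples)


if __name__ == "__main__":
    long_clip = torch.randn(1, 30000)   # longer than NUM_SAMPLES, gets cut
    short_clip = torch.randn(1, 10000)  # shorter than NUM_SAMPLES, gets padded
    print(fix_length(long_clip).shape)
    print(fix_length(short_clip).shape)
```

In a dataset class, a call like fix_length(signal) would typically sit inside __getitem__, right after the waveform is loaded, so that every returned example has the same number of samples.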
Syllabus
Intro
Setting NUM_SAMPLES
Passing NUM_SAMPLES to UrbanSoundDataset
Updating __getitem__
Cutting waveform
Right padding waveform
Run script to check padding
Run script to check cutting
What's up next
Outro
Taught by
Valerio Velardo - The Sound of AI
Related Courses
Deep Learning with Python and PyTorch, IBM via edX
Introduction to Machine Learning, Duke University via Coursera
How Google does Machine Learning em Português Brasileiro, Google Cloud via Coursera
Intro to Deep Learning with PyTorch, Facebook via Udacity
Secure and Private AI, Facebook via Udacity