Custom Audio PyTorch Dataset with Torchaudio
Offered By: Valerio Velardo - The Sound of AI via YouTube
Course Description
Overview
Learn how to create a custom audio dataset using PyTorch and torchaudio in this comprehensive tutorial video. Explore the process of loading audio files from the UrbanSound8K dataset while gaining insights into basic I/O functions in torchaudio. Dive into the implementation of a custom UrbanSoundDataset class, including the constructor, __len__, and __getitem__ methods. Understand how to work with PyTorch's Dataset and DataLoader classes, navigate the UrbanSound8K dataset structure, and utilize torchaudio's backend for efficient audio file handling. Follow along as the instructor demonstrates the entire process, from initial setup to running the completed UrbanSoundDataset class, providing valuable hands-on experience in audio data preprocessing for machine learning applications.
Syllabus
Intro
Urban Sound 8K Dataset webpage
PyTorch Dataset and DataLoader classes
UrbanSoundDataset class
Implementing the constructor - part 1
Content of Urban Sound 8K
Implementing the constructor - part 2
Implementing __len__
Implementing __getitem__ - part 1
Torchaudio backend I/O
Implementing __getitem__ - part 2
Getting audio sample path
Getting label
UrbanSoundDataset class recap
Running UrbanSoundDataset
Coming up next
Taught by
Valerio Velardo - The Sound of AI
Related Courses
Deep Learning with Python and PyTorch.IBM via edX Introduction to Machine Learning
Duke University via Coursera How Google does Machine Learning em Português Brasileiro
Google Cloud via Coursera Intro to Deep Learning with PyTorch
Facebook via Udacity Secure and Private AI
Facebook via Udacity