How to Build Custom Datasets for Text in PyTorch
Offered By: Aladdin Persson via YouTube
Course Description
Overview
Learn how to build custom datasets for text processing in PyTorch with this in-depth tutorial video. Explore advanced techniques for handling text data using an image captioning dataset (Flickr8k) as an example. Discover how to implement a PyTorch Dataset for loading Flickr data, set up vocabulary and numericalization, create collate functions for batch padding, and develop a function for obtaining data loaders. Apply these principles to various text-based projects, including translation, question answering, and sentiment analysis. Follow along as the instructor demonstrates the code implementation, troubleshoots errors, and provides valuable insights for working with text data in PyTorch.
Syllabus
- Introduction
- Overview of what we're going to do
- Imports
- Setup of Pytorch Dataset for loading Flickr
- Setup of Vocabulary and Numericalization
- Creating Collate for Padding of Batch
- Function for getting data loader
- Running code & fixing couple of errors
- Ending
Taught by
Aladdin Persson
Related Courses
Deep Learning with Python and PyTorch.IBM via edX Introduction to Machine Learning
Duke University via Coursera How Google does Machine Learning em Português Brasileiro
Google Cloud via Coursera Intro to Deep Learning with PyTorch
Facebook via Udacity Secure and Private AI
Facebook via Udacity