Hugging Face Datasets - Adding Images
Offered By: James Briggs via YouTube
Course Description
Overview
Learn how to work with the Hugging Face datasets library in Python, focusing on adding images, using dataset builder scripts, the download manager, and iter_archive function. Explore best practices for Hugging Face Datasets in Python, applicable to image search, similarity search, classification, and question-answering tasks. Discover techniques for creating and compressing tar files for images, implementing dataset builder scripts, and utilizing the iterable download manager with iter_archive. Master the process of defining the _generate_examples function, adding datasets to the Hugging Face Datasets Hub, and troubleshooting common errors. Gain insights into using newly created datasets and handling larger image collections efficiently. This tutorial covers essential steps from introduction to advanced techniques, making it easier to train and fine-tune models with PyTorch and TensorFlow.
Syllabus
Intro
Creating Tar Files for Images
Compressing Images in Tar Files
Adding Dataset Builder Script
Iterable Download Manager with iter_archive
_generate_examples Function Definition
Adding to Hugging Face Datasets Hub
Fixing Errors
Using Your New Dataset
Dealing with Larger Image Datasets
Taught by
James Briggs
Related Courses
4.0 Shades of Digitalisation for the Chemical and Process IndustriesUniversity of Padova via FutureLearn A Day in the Life of a Data Engineer
Amazon Web Services via AWS Skill Builder FinTech for Finance and Business Leaders
ACCA via edX Accounting Data Analytics
University of Illinois at Urbana-Champaign via Coursera Accounting Data Analytics
Coursera