YoVDO

Hugging Face Datasets - Dataset Builder Scripts for Beginners

Offered By: James Briggs via YouTube

Tags

Hugging Face Courses Python Courses PyTorch Courses Classification Courses Semantic Search Courses Similarity Search Courses Vector Similarity Search Courses

Course Description

Overview

Learn how to work with dataset builder scripts in Hugging Face Datasets using Python. Explore the download manager, Apache Arrow datatypes, and techniques for creating compressed files. Discover how to generate examples, finish split generators, and add datasets to Hugging Face. Gain insights into using datasets for similarity search, semantic search, vector similarity search, classification, and question-answering tasks. Apply these skills to streamline the training and fine-tuning of models with PyTorch and TensorFlow.

Syllabus

Intro
Creating Compressed Files
Creating Dataset Build Script
Download Manager
Finishing Split Generator
Generate Examples Method
Add Dataset to Hugging Face
Apache Arrow Features
What's Next?


Taught by

James Briggs

Related Courses

Hugging Face on Azure - Partnership and Solutions Announcement
Microsoft via YouTube
Question Answering in Azure AI - Custom and Prebuilt Solutions - Episode 49
Microsoft via YouTube
Open Source Platforms for MLOps
Duke University via Coursera
Masked Language Modelling - Retraining BERT with Hugging Face Trainer - Coding Tutorial
rupert ai via YouTube
Masked Language Modelling with Hugging Face - Microsoft Sentence Completion - Coding Tutorial
rupert ai via YouTube