YoVDO

Fast, Flexible, and Scalable Data Loading for ML Training with Ray Data

Offered By: Anyscale via YouTube

Tags

Machine Learning Courses
TensorFlow Courses
Distributed Systems Courses
Fault Tolerance Courses
Data Preprocessing Courses

Course Description

Overview

Explore how Ray Data provides fast, flexible, and scalable data loading for machine learning training pipelines in this 31-minute conference talk. The talk compares the performance of several open-source data loaders and shows how Ray Data matches PyTorch DataLoader and tf.data in single-node performance while adding features for scale: in-memory streaming, automatic recovery from out-of-memory failures, and support for heterogeneous clusters. It presents Ray Data as offering greater speed, scale, and flexibility than other open-source data loaders, and explains how it handles increasingly complex preprocessing requirements across diverse data types. An accompanying slide deck summarizes the concepts and techniques presented.

Syllabus

Fast, Flexible, and Scalable Data Loading for ML Training with Ray Data


Taught by

Anyscale

Related Courses

MongoDB for DBAs
MongoDB University
MongoDB Advanced Deployment and Operations
MongoDB University
Building Cloud Apps with Microsoft Azure - Part 3
Microsoft via edX
Implementing Microsoft Windows Server Disks and Volumes
Microsoft via edX
Cloud Computing and Distributed Systems
Indian Institute of Technology Patna via Swayam