Streamlining Data Access for Ray with Alluxio
Offered By: Anyscale via YouTube
Course Description
Overview
Explore how Alluxio addresses the challenge of efficient and unified data access in machine learning at the Ray Summit 2024 presentation. Learn about overcoming GPU availability limitations and fragmented data issues across organizational locations as companies upgrade their data platforms and adopt advanced AI frameworks like Ray. Discover Alluxio's service that enables seamless data access for Ray from multiple sources, regardless of cloud or storage providers. Understand strategies to overcome network bottlenecks and complex authentication protocols, ensuring unimpeded GPU training across data silos and inconsistent access methods. Gain insights into building robust data infrastructures that accelerate AI innovation, presented by Haoyuan Li and Bin Fan from Alluxio in this 32-minute talk.
Syllabus
Streamlining Data Access for Ray with Alluxio | Ray Summit 2024
Taught by
Anyscale
Related Courses
Advancing GPU Analytics with RAPIDS Accelerator for Apache Spark and AlluxioDatabricks via YouTube Building Super-Contributors in Alluxio Open Source Community
Linux Foundation via YouTube Accelerating Spark Workloads in a Mesos Environment with Alluxio
Linux Foundation via YouTube Accelerating Spark Workloads in an Apache Mesos Environment with Alluxio
Linux Foundation via YouTube Data Caching for Enterprise-Grade Petabyte-Scale OLAP
USENIX via YouTube