YoVDO

Building an Instant-On Serverless Platform for Large-Scale Data Processing Using Ray

Offered By: Anyscale via YouTube

Tags

Python Courses Apache Spark Courses pandas Courses AWS Glue Courses Data Processing Courses Distributed Computing Courses Data Integration Courses Serverless Computing Courses ETL Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the development of AWS Glue for Ray, a serverless platform for large-scale data processing, in this 14-minute conference talk. Learn how AWS Glue integrated Ray.io to enable distributed Python workloads and scale data integration tasks. Discover the implementation of Ray's core APIs, distributed collection APIs, and the integration of Modin for efficient ETL operations on massive datasets. Gain insights into the innovations made in cluster management, demand-based autoscaling, and the creation of an instant-on, interactive serverless Ray platform. Understand how this solution addresses customer needs for scaling Python workloads over large datasets, utilizing ARM-based platforms and IPv6 addressing for workers.

Syllabus

Building an Instant-On Serverless Platform for Large-Scale Data Processing Using Ray


Taught by

Anyscale

Related Courses

Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Reliable Distributed Algorithms - Part 1
KTH Royal Institute of Technology via edX
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera
Réalisez des calculs distribués sur des données massives
CentraleSupélec via OpenClassrooms