YoVDO

Zeus - Uber's Highly Scalable Distributed Shuffle as a Service

Offered By: Databricks via YouTube

Tags

Distributed Systems Courses Big Data Courses Cloud Computing Courses Apache Spark Courses Data Processing Courses Scalability Courses

Course Description

Overview

Dive into the architecture and deployment of Zeus, Uber's highly scalable and distributed shuffle as a service powering all data processing at the company. Explore how this ground-up solution supports hundreds of thousands of jobs and millions of containers, shuffling petabytes of data. Learn about Zeus' paradigm-shifting approach to external shuffle, resulting in improved performance for most jobs despite remote data writing. Compare Zeus' performance with different storage systems backed by external shuffle, such as NFS and HDFS. Discover the integration of Zeus with Spark and how it contrasts with Spark's built-in sort-based shuffle mechanism. Gain insights into the future roadmap and plans for Zeus, and understand its impact on addressing hardware failures, reliability, and scalability challenges in one of the largest Spark and Hive clusters in the industry.

Syllabus

Introduction
External Shuffle Service
Deep Dive
Design Principle
Horizontal Scalable
Network Connections
Network Latency
Compression
Network IO
Connection
Asynchronous Commit
Tolerance
Local States
Spark Compatibility
Metrics
Production Quality
Data Management
Summary


Taught by

Databricks

Related Courses

Coding the Matrix: Linear Algebra through Computer Science Applications
Brown University via Coursera
كيف تفكر الآلات - مقدمة في تقنيات الحوسبة
King Fahd University of Petroleum and Minerals via Rwaq (رواق)
Datascience et Analyse situationnelle : dans les coulisses du Big Data
IONIS via IONIS
Data Lakes for Big Data
EdCast
統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco