Scaling Apache Spark on Kube to Apple Scale
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore the challenges and solutions of scaling Apache Spark on Kubernetes to Apple's massive scale in this informative conference talk. Discover which customer workloads easily ported to Apache Spark on Kubernetes and which ones faced difficulties. Learn valuable considerations and best practices for both operators and end users of Apache Spark-Kubernetes platforms. Gain insights into migrating from YARN with HDFS to Kubernetes, and understand how to effectively deploy new enhancements like shuffle tracking and graceful decommissioning. Determine when to use these features and when to avoid them. Whether you're an operator or end user, this talk will equip you with essential knowledge to optimize your Apache Spark on Kubernetes journey.
Syllabus
Scaling Apache Spark on Kube to Apple Scale - Amanda Moran & Holden Karau, Apple
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera