Automating Spark Upgrades and Migrations at Netflix
Offered By: Databricks via YouTube
Course Description
Overview
Discover how Netflix automates Apache Spark upgrades and migrations in this 41-minute conference talk. Learn about open-source tools for rewriting Spark code, techniques for testing Spark jobs in production, and methods for tracking job states. Explore the process of migrating to a containerized environment and gain insights from user experiences. Acquire valuable skills for upgrading Spark pipelines without stress and validating them using the write-audit-publish pattern. Ideal for data scientists, ML engineers, and platform engineers managing Spark infrastructure, this talk provides practical strategies for handling legacy data products and evolving AI libraries. Presented by Holden Karau and Robert Morck from Netflix, the session offers expert guidance on streamlining Spark upgrades and migrations in large-scale data environments.
Syllabus
Stranger Triumphs: Automating Spark Upgrades & Migrations at Netflix
Taught by
Databricks
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera