
Automating Spark Upgrades and Migrations at Netflix

Offered By: Databricks via YouTube

Tags

Apache Spark Courses, Data Migration Courses, Containerization Courses, Data Pipelines Courses

Course Description

Overview

This 41-minute conference talk covers how Netflix automates Apache Spark upgrades and migrations. Topics include open-source tools for rewriting Spark code, techniques for testing Spark jobs in production, methods for tracking job state, and the migration to a containerized environment, along with lessons drawn from user experiences. The talk shows how to upgrade Spark pipelines and validate the results using the write-audit-publish pattern. It is aimed at data scientists, ML engineers, and platform engineers who manage Spark infrastructure and have to handle legacy data products and evolving AI libraries. Presented by Holden Karau and Robert Morck of Netflix.
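For readers unfamiliar with the write-audit-publish pattern mentioned in the description, the following is a minimal sketch of the general idea, not the tooling presented in the talk. It assumes a hypothetical in-memory `TableStore` standing in for a real table catalog; the names `TableStore`, `write_audit_publish`, and the audit checks are illustrative only. Output is first written to a staging area, audited there, and only promoted to the published area if every check passes.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Audit = Callable[[List[Row]], bool]


class TableStore:
    """Hypothetical in-memory stand-in for a catalog with staging and published areas."""

    def __init__(self) -> None:
        self.staging: Dict[str, List[Row]] = {}
        self.published: Dict[str, List[Row]] = {}


def write_audit_publish(
    store: TableStore, table: str, rows: List[Row], audits: Dict[str, Audit]
) -> List[str]:
    """Return the names of failed audits; an empty list means the data was published."""
    # Write: land the job output in staging, where consumers cannot see it.
    store.staging[table] = list(rows)

    # Audit: run every check against the staged data.
    failures = [name for name, check in audits.items() if not check(store.staging[table])]
    if failures:
        # Leave the staged data in place for inspection; published data is untouched.
        return failures

    # Publish: promote the audited data so consumers can read it.
    store.published[table] = store.staging.pop(table)
    return []


# Example audits (assumptions for illustration).
audits: Dict[str, Audit] = {
    "non_empty": lambda rows: len(rows) > 0,
    "no_null_ids": lambda rows: all(r.get("id") is not None for r in rows),
}

store = TableStore()
good_result = write_audit_publish(store, "plays", [{"id": 1}, {"id": 2}], audits)
bad_result = write_audit_publish(store, "plays", [{"id": None}], audits)
```

In this sketch the second run fails its audit, so the previously published rows remain what consumers see. In a real deployment the staging and publish steps would typically map to features of the table format or catalog rather than to Python dictionaries.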

Syllabus

Stranger Triumphs: Automating Spark Upgrades & Migrations at Netflix


Taught by

Databricks

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera