Automating Spark Upgrades and Migrations at Netflix
Offered By: Databricks via YouTube
Course Description
Overview
In this 41-minute conference talk, Holden Karau and Robert Morck of Netflix explain how Netflix automates Apache Spark upgrades and migrations. The talk covers open-source tools for rewriting Spark code, techniques for testing Spark jobs in production, methods for tracking job state, the migration to a containerized environment, and lessons from user experience. It also shows how to validate upgraded Spark pipelines with the write-audit-publish pattern. The session is aimed at data scientists, ML engineers, and platform engineers who manage Spark infrastructure, and offers practical strategies for handling legacy data products and evolving AI libraries in large-scale data environments.
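For readers unfamiliar with the write-audit-publish pattern mentioned above, the following is a minimal sketch of the general idea, not the tooling presented in the talk. It assumes a hypothetical in-memory `Catalog` class in place of a real table catalog, and the helper names (`write_audit_publish`, `non_empty`, `no_null_ids`) are illustrative only. New output is written to a staging area, audited there, and published to consumers only if every audit passes.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Table = List[Row]


class Catalog:
    """Hypothetical in-memory stand-in for a real table catalog."""

    def __init__(self) -> None:
        self.staging: Dict[str, Table] = {}    # not visible to consumers
        self.published: Dict[str, Table] = {}  # what consumers read


def write_audit_publish(
    catalog: Catalog,
    table: str,
    rows: Table,
    audits: List[Callable[[Table], bool]],
) -> bool:
    # 1. Write: land the new output where consumers cannot see it yet.
    catalog.staging[table] = list(rows)

    # 2. Audit: run every check against the staged data only.
    passed = all(audit(catalog.staging[table]) for audit in audits)

    # 3. Publish: expose the staged data only if all audits passed.
    #    On failure the staged copy is kept for inspection.
    if passed:
        catalog.published[table] = catalog.staging.pop(table)
    return passed


def non_empty(rows: Table) -> bool:
    return len(rows) > 0


def no_null_ids(rows: Table) -> bool:
    return all(row.get("id") is not None for row in rows)


catalog = Catalog()
good = [{"id": 1, "title": "a"}, {"id": 2, "title": "b"}]
bad = [{"id": None, "title": "c"}]

ok_good = write_audit_publish(catalog, "titles", good, [non_empty, no_null_ids])
ok_bad = write_audit_publish(catalog, "titles", bad, [non_empty, no_null_ids])
```

In this sketch the second run fails its audit, so the previously published rows stay in place and the rejected rows remain in staging. In an upgrade scenario, an audit could plausibly compare the output of a job run on the new Spark version against the output of the current version before anything is published; that comparison is an assumption here, not a detail taken from the talk description.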
Syllabus
Stranger Triumphs: Automating Spark Upgrades & Migrations at Netflix
Taught by
Databricks
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en Español (Google Cloud via Coursera)
Data Analysis with Python (IBM via Coursera)
Intro to TensorFlow 日本語版 (Google Cloud via Coursera)
TensorFlow on Google Cloud - Français (Google Cloud via Coursera)
Freedom of Data with SAP Data Hub (SAP Learning)