YoVDO

Smooth Migration Practice from MapReduce to Spark at ByteDance

Offered By: The ASF via YouTube

Tags

Apache Spark Courses Big Data Courses MapReduce Courses Distributed Computing Courses Data Migration Courses Batch Processing Courses Infrastructure Engineering Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn about ByteDance's innovative approach to migrating from MapReduce to Spark in this 33-minute conference talk. Explore the challenges faced by ByteDance's big data infrastructure team as they manage 1.2 million daily Spark jobs alongside 20,000-30,000 MapReduce tasks. Discover the issues with the MapReduce engine, including low ROI for framework updates, poor adaptability to new computing scheduling frameworks, and suboptimal computing performance. Gain insights into ByteDance's smooth migration solution, which allows users to transition legacy jobs to Spark with minimal modifications, significantly reducing migration costs and improving efficiency. Understand how this approach addresses the need for additional Pipeline tools and supports various scripts not natively compatible with Spark.

Syllabus

Smooth Migration Practice From Mapreduce To Spark At Bytedance


Taught by

The ASF

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera