Smooth Migration Practice from MapReduce to Spark at ByteDance
Offered By: The ASF via YouTube
Course Description
Overview
Learn about ByteDance's innovative approach to migrating from MapReduce to Spark in this 33-minute conference talk. Explore the challenges faced by ByteDance's big data infrastructure team as they manage 1.2 million daily Spark jobs alongside 20,000-30,000 MapReduce tasks. Discover the issues with the MapReduce engine, including low ROI for framework updates, poor adaptability to new computing scheduling frameworks, and suboptimal computing performance. Gain insights into ByteDance's smooth migration solution, which allows users to transition legacy jobs to Spark with minimal modifications, significantly reducing migration costs and improving efficiency. Understand how this approach addresses the need for additional Pipeline tools and supports various scripts not natively compatible with Spark.
Syllabus
Smooth Migration Practice From Mapreduce To Spark At Bytedance
Taught by
The ASF
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera