YoVDO

Smooth Migration Practice from MapReduce to Spark at ByteDance

Offered By: The ASF via YouTube

Tags

Apache Spark Courses, Big Data Courses, MapReduce Courses, Distributed Computing Courses, Data Migration Courses, Batch Processing Courses, Infrastructure Engineering Courses

Course Description

Overview

Learn about ByteDance's approach to migrating from MapReduce to Spark in this 33-minute conference talk. Explore the challenges faced by ByteDance's big data infrastructure team, which runs 1.2 million Spark jobs per day alongside 20,000 to 30,000 MapReduce tasks. Discover the issues with the MapReduce engine, including low ROI for framework updates, poor adaptability to new computing scheduling frameworks, and suboptimal computing performance. Gain insights into ByteDance's smooth migration solution, which lets users move legacy jobs to Spark with minimal modifications, reducing migration costs and improving efficiency. Understand how this approach addresses the need for additional Pipeline tools and supports various scripts not natively compatible with Spark.

Syllabus

Smooth Migration Practice from MapReduce to Spark at ByteDance


Taught by

The ASF

Related Courses

Introduction to Windows PowerShell
Microsoft via edX
Windows PowerShell Basics
Microsoft via edX
Preparing for Google Cloud Certification: Cloud Data Engineer
Google Cloud via Coursera
Data Engineering on Google Cloud Platform en Français
Google Cloud via Coursera
Data Engineering on Google Cloud Platform en Español
Google Cloud via Coursera