YoVDO

Understanding the MapReduce Programming Model

Offered By: Pluralsight

Tags

MapReduce Courses Big Data Courses Parallel Processing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
The MapReduce programming model is the de facto standard for parallel processing of Big Data. This course introduces MapReduce, explains how data flows through a MapReduce program, and guides you through writing your first MapReduce program in Java.

Processing millions of records requires that you first understand the art of breaking down your tasks into parallel processes. The MapReduce programming model, part of the Hadoop eco-system, gives you a framework to define your solution in terms of parallel tasks, which are then combined to give you the final desired result. In this course, Understanding the MapReduce Programming Model, you'll get an introduction to the MapReduce paradigm. First, you'll learn how it helps you visualize how data flows through the map, partition, shuffle, and sort phases before it gets to the reduce phase and gives you the final result. Next, it will guide you through your very first MapReduce program in Java. Finally, you'll learn to extend the framework Mapper and Reducer classes to plug in your own logic and then run this code on your local machine without using a Hadoop cluster. By the end of this course, you will be able to break big data problems into parallel tasks to help tackle large-scale data munging operations.

Syllabus

  • Course Overview 1min
  • Introducing MapReduce 29mins
  • A "Hello World" MapReduce Job 39mins
  • Controlling Parallelism in Map and Reduce Phases 38mins

Taught by

Janani Ravi

Related Courses

Intro to Data Science
Udacity
Mining Massive Datasets
Stanford University via edX
Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera
Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera
Big Data Analytics in Healthcare
Georgia Institute of Technology via edX