Intro to Hadoop and MapReduce
Offered By: Cloudera via Udacity
Course Description
Overview
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Syllabus
- Big Data
- What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
- HDFS and MapReduce
- Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
- MapReduce code
- Write your own MapReduce code.
- MapReduce Design Patterns
- Use common patterns for MapReduce programs to analyze Udacity forum data.
Taught by
Ian Wrigley and Sarah Sproehnle
Tags
Related Courses
Processing Big Data with Hadoop in Azure HDInsightMicrosoft via edX Implementing Real-Time Analytics with Hadoop in Azure HDInsight
Microsoft via edX Hadoop Platform and Application Framework
University of California, San Diego via Coursera Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera Deploying a Hadoop Cluster
Udacity