Intro to Hadoop and MapReduce
Offered By: Cloudera via Udacity
Course Description
Overview
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Syllabus
- Big Data
- What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
- HDFS and MapReduce
- Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
- MapReduce code
- Write your own MapReduce code.
- MapReduce Design Patterns
- Use common patterns for MapReduce programs to analyze Udacity forum data.
Taught by
Ian Wrigley and Sarah Sproehnle
Tags
Related Courses
Data AnalysisJohns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Scientific Computing
University of Washington via Coursera Introduction to Data Science
University of Washington via Coursera Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera