Intro to Hadoop and MapReduce
Offered By: Cloudera via Udacity
Course Description
Overview
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Syllabus
- Big Data
- What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
- HDFS and MapReduce
- Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
- MapReduce code
- Write your own MapReduce code.
- MapReduce Design Patterns
- Use common patterns for MapReduce programs to analyze Udacity forum data.
Taught by
Ian Wrigley and Sarah Sproehnle
Tags
Related Courses
Address Business Issues with Data ScienceCertNexus via Coursera Advanced Clinical Data Science
University of Colorado System via Coursera Advanced Data Science Capstone
IBM via Coursera Advanced Data Science with IBM
IBM via Coursera Advanced Deep Learning Methods for Healthcare
University of Illinois at Urbana-Champaign via Coursera