Intro to Hadoop and MapReduce
Offered By: Cloudera via Udacity
Course Description
Overview
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Syllabus
- Big Data
- What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
- HDFS and MapReduce
- Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
- MapReduce code
- Write your own MapReduce code.
- MapReduce Design Patterns
- Use common patterns for MapReduce programs to analyze Udacity forum data.
Taught by
Ian Wrigley and Sarah Sproehnle
Tags
Related Courses
Accounting AnalyticsUniversity of Pennsylvania via Coursera AWS Certified Big Data - Specialty
A Cloud Guru Big Data Essentials
A Cloud Guru Big Data Fundamentals
A Cloud Guru Data Science Basics
A Cloud Guru