Learning Hadoop

Offered By: LinkedIn Learning

Tags

Hadoop Courses Big Data Courses Cloud Computing Courses Apache Spark Courses Dataproc Courses MapReduce Courses HDFS Courses

Course Description

Overview

Learn all the essentials of Hadoop, a key tool for processing and understanding big data.

Syllabus

Introduction

What and why Hadoop?
What you should know
Use cloud services

1. Set Up Cloud Hadoop

What is Hadoop?
Review Hadoop distributions and cloud services
Set up GCP Dataproc Metastore and VM cluster
Verify GCP Dataproc VM cluster

2. Understand Hadoop Core Components

Understand Hadoop components
Understand Java virtual machines (JVMs)
Explore Hadoop file systems: HDFS
Explore Hadoop file systems: AWS S3
Review Hadoop cluster components

3. Set Up and Verify Development Environment

Review test jobs
Review job output
Verify Hadoop web interfaces in your test environment
Verify Hadoop Spark web interfaces in your test environment
Use the Jupyter interface for Hadoop

4. Understand MapReduce

What is MapReduce?
What is MapReduce word count?
Review MapReduce word count job
Prepare for MapReduce Java coding
Review MapReduce WordCount job code

5. Tune MapReduce

Tune by physical methods
Tune a Mapper
Understanding data types
Tune a Reducer
Use MR 2.0 and 3.0
Review MR optimization examples

6. Scale Cloud Hadoop

Migrate to Cloud Hadoop
Scale VM-based Clusters
Use autoscale policies
Scale Kubernetes Spark clusters

7. Use Hive, Pig, and Spark

Understand Hive and HBase
Create and query tables with Hive
Understand Pig
Run WordCount using Pig
Review Spark architecture
Scale a Spark job to calculate Pi

Conclusion

Learn more about using Hadoop

Taught by

Lynn Langit

Related Courses

Software as a Service
University of California, Berkeley via Coursera Software Defined Networking
Georgia Institute of Technology via Coursera Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera Web-Technologien
openHPI Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique