YoVDO

Hadoop Ecosystem Essentials

Offered By: Packt via FutureLearn

Tags

Hadoop Courses Data Analysis Courses Presto Courses Data Management Courses Cluster Management Courses Spark Streaming Courses

Course Description

Overview

Learn the skills needed to succeed as a data analyst

For data analysts, Hadoop is an extremely powerful tool to help process large amounts of data and is used by successful companies such as Google and Spotify.

On this four-week course, you’ll learn how to use Hadoop to its full potential to make it easier for you to store, analyse, and scale big data.

Through step-by-step guides and exercises, you’ll gain the knowledge and practical skills to take into your role in data analytics.

Understand how to manage your Hadoop cluster

You’ll understand how to manage clusters with Yet Another Resource Negotiator (YARN), Mesos, Zookeeper, Oozie, Zeppelin, and Hue.

With this knowledge, you’ll be able to ensure high performance, workload management, security, and more.

Learn how to analyse streams of data

Next, you’ll uncover the techniques to handle and stream data in real-time using Kafka, Flume, Spark Streaming, Flink, and Storm.

This understanding will help you to react and respond quickly to any issues that may arise.

Hone your data handling skills

Finally, you’ll learn how to design real-world systems using the Hadoop ecosystem to ensure you can use your skills in practice.

By the end of the course, you’ll have the knowledge to handle large amounts of data using Hadoop.

This course is designed for anyone who wants to hone their data handling skills using Hadoop.

You’ll be shown how to use a variety of open source utilities within the Hadoop environment. We assume you’ve already installed the Hadoop environment. If you haven’t, check out Introduction to Big Data Analytics with Hadoop.


Syllabus

  • Querying data interactively in Hadoop
    • Introduction to the course
    • Apache Drill
    • Apache Phoenix
    • Presto
    • Wrap up
  • Managing your cluster in Hadoop
    • Introduction to Week 2
    • Managing resources
    • Managing clusters and tasks
    • Other technologies
    • Wrap up
  • Feeding and analysing data in Hadoop
    • Introduction to Week 3
    • Kafka
    • Apache Flume
    • Spark Streaming
    • Introducing Apache Storm
    • Flink
    • Wrap up
  • Designing real-world systems
    • Introduction to Week 4
    • Architecture design
    • Wrap up

Taught by

Astrid deRidder

Related Courses

A Cloud Guru's Elastic Certified Engineer Exam Preparation Course
A Cloud Guru
Certified Kubernetes Administrator (CKA)
A Cloud Guru
Cloud Native Certified Kubernetes Administrator (CKA) (Legacy)
A Cloud Guru
Confluent Certified Developer for Apache Kafka (CCDAK)
A Cloud Guru
EKS Basics
A Cloud Guru