Hadoop Ecosystem Essentials
Offered By: Packt via FutureLearn
Course Description
Overview
Learn the skills needed to succeed as a data analyst
For data analysts, Hadoop is an extremely powerful tool to help process large amounts of data and is used by successful companies such as Google and Spotify.
On this four-week course, you’ll learn how to use Hadoop to its full potential to make it easier for you to store, analyse, and scale big data.
Through step-by-step guides and exercises, you’ll gain the knowledge and practical skills to take into your role in data analytics.
Understand how to manage your Hadoop cluster
You’ll understand how to manage clusters with Yet Another Resource Negotiator (YARN), Mesos, ZooKeeper, Oozie, Zeppelin, and Hue.
With this knowledge, you’ll be able to ensure high performance, workload management, security, and more.
Learn how to analyse streams of data
Next, you’ll uncover the techniques to handle and stream data in real time using Kafka, Flume, Spark Streaming, Flink, and Storm.
This understanding will help you to react and respond quickly to any issues that may arise.
Hone your data handling skills
Finally, you’ll learn how to design real-world systems using the Hadoop ecosystem to ensure you can use your skills in practice.
By the end of the course, you’ll have the knowledge to handle large amounts of data using Hadoop.
This course is designed for anyone who wants to hone their data handling skills using Hadoop.
You’ll be shown how to use a variety of open-source utilities within the Hadoop environment. We assume you’ve already installed the Hadoop environment. If you haven’t, check out Introduction to Big Data Analytics with Hadoop.
Syllabus
- Querying data interactively in Hadoop
  - Introduction to the course
  - Apache Drill
  - Apache Phoenix
  - Presto
  - Wrap up
- Managing your cluster in Hadoop
  - Introduction to Week 2
  - Managing resources
  - Managing clusters and tasks
  - Other technologies
  - Wrap up
- Feeding and analysing data in Hadoop
  - Introduction to Week 3
  - Kafka
  - Apache Flume
  - Spark Streaming
  - Introducing Apache Storm
  - Flink
  - Wrap up
- Designing real-world systems
  - Introduction to Week 4
  - Architecture design
  - Wrap up
Taught by
Astrid deRidder
Related Courses
- A Cloud Guru's Elastic Certified Engineer Exam Preparation Course, A Cloud Guru
- Certified Kubernetes Administrator (CKA), A Cloud Guru
- Cloud Native Certified Kubernetes Administrator (CKA) (Legacy), A Cloud Guru
- Confluent Certified Developer for Apache Kafka (CCDAK), A Cloud Guru
- EKS Basics, A Cloud Guru