YoVDO

Apache Spark, Hadoop Project with Kafka and Python - End to End Development

Offered By: YouTube

Tags

Big Data Courses Python Courses Hadoop Courses Apache Spark Courses Apache Kafka Courses

Course Description

Overview

Dive into a comprehensive 90-minute tutorial on building an end-to-end data processing system using Apache Spark, Hadoop, and Kafka with Python. Explore the architecture and code walkthrough of each component, including Kafka Producer, Spark Structured Streaming as Kafka Consumer, and data processing with Hive and Presto. Develop a REST API using Flask and Flask-RESTPlus, and create a dashboard or web application with Python Dash. Learn how to integrate these technologies to create a robust, scalable data pipeline for real-time processing and analysis.

Syllabus

End to End Project using Spark/Hadoop | Code Walkthrough | Architecture | Part 1 | DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough | Kafka Producer | Part 2 | DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough |Spark Streaming|Part 3.1| DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough |Spark Streaming|Part 3.2| DM | DataMaking.
End to End Project using Spark/Hadoop|Code Walkthrough|Hive | Part 4 | DM | DataMaking | Data Making.
End to End Project using Spark/Hadoop | Code Walkthrough | Presto Module | Part 5 | DM | DataMaking.
Spark Project with Kafka | REST API Module using Flask, Flast-RESTPlus | Part 6 | DM | DataMaking.
Spark Project with Kafka | Dashboard Module using Python Dash | Part 7 | DM | DataMaking.


Taught by

DataMaking

Related Courses

Building ETL and Data Pipelines with Bash, Airflow and Kafka
IBM via edX
Apache Spark and Scala Certification Training
Edureka
Apache Kafka Certification Training
Edureka
ETL and Data Pipelines with Shell, Airflow and Kafka
IBM via Coursera
Creating a Streaming Data Pipeline With Apache Kafka
Google Cloud via Coursera