YoVDO

Apache Spark, Hadoop Project with Kafka and Python - End to End Development

Offered By: YouTube

Tags

Big Data Courses Python Courses Hadoop Courses Apache Spark Courses Apache Kafka Courses

Course Description

Overview

Dive into a comprehensive 90-minute tutorial on building an end-to-end data processing system using Apache Spark, Hadoop, and Kafka with Python. Explore the architecture and code walkthrough of each component, including Kafka Producer, Spark Structured Streaming as Kafka Consumer, and data processing with Hive and Presto. Develop a REST API using Flask and Flask-RESTPlus, and create a dashboard or web application with Python Dash. Learn how to integrate these technologies to create a robust, scalable data pipeline for real-time processing and analysis.

Syllabus

End to End Project using Spark/Hadoop | Code Walkthrough | Architecture | Part 1 | DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough | Kafka Producer | Part 2 | DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough |Spark Streaming|Part 3.1| DM | DataMaking.
End to End Project using Spark/Hadoop | Code Walkthrough |Spark Streaming|Part 3.2| DM | DataMaking.
End to End Project using Spark/Hadoop|Code Walkthrough|Hive | Part 4 | DM | DataMaking | Data Making.
End to End Project using Spark/Hadoop | Code Walkthrough | Presto Module | Part 5 | DM | DataMaking.
Spark Project with Kafka | REST API Module using Flask, Flast-RESTPlus | Part 6 | DM | DataMaking.
Spark Project with Kafka | Dashboard Module using Python Dash | Part 7 | DM | DataMaking.


Taught by

DataMaking

Related Courses

Intro to Hadoop and MapReduce
Cloudera via Udacity
Processing Big Data with Hadoop in Azure HDInsight
Microsoft via edX
Implementing Real-Time Analytics with Hadoop in Azure HDInsight
Microsoft via edX
Hadoop Platform and Application Framework
University of California, San Diego via Coursera
Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera