Data Streaming
Offered By: Udacity
Course Description
Overview
Learn the latest skills to process data in real-time by building fluency in modern data engineering tools, such as Apache Spark, Kafka, Spark Streaming, and Kafka Streaming.
Syllabus
- Welcome to the Data Streaming Nanodegree Program
- Data Ingestion with Kafka and Kafka Streaming
- Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.
- Streaming API Development and Documentation
- In this course you will grow your expertise in the components of streaming data systems, and build a real
time analytics application. Specifically, you will be able to identify components of Spark Streaming (architecture
and API), build a continuous application with Structured Streaming, consume and process data from Apache
Kafka with Spark Structured Streaming (including setting up and running a Spark Cluster), create a DataFrame
as an aggregation of source DataFrames, sink a composite DataFrame to Kafka, and visually inspect a data sink
for accuracy. - Career Services
Taught by
Ben Goldberg, Judit Lantos, David Drummond and Jillian Kim
Related Courses
A Day in the Life of a Data EngineerAmazon Web Services via AWS Skill Builder A Day in the Life of a Data Engineer (Indonesian)
Amazon Web Services via AWS Skill Builder A Day in the Life of a Data Engineer (Japanese)
Amazon Web Services via AWS Skill Builder A Day in the Life of a Data Engineer (Korean)
Amazon Web Services via AWS Skill Builder A Day in the Life of a Data Engineer (Simplified Chinese)
Amazon Web Services via AWS Skill Builder