YoVDO

Data Streaming

Offered By: Udacity

Tags

Big Data Courses Apache Spark Courses Data Engineering Courses Real-Time Data Processing Courses Spark Streaming Courses

Course Description

Overview

Learn the latest skills to process data in real-time by building fluency in modern data engineering tools, such as Apache Spark, Kafka, Spark Streaming, and Kafka Streaming.

Syllabus

  • Welcome to the Data Streaming Nanodegree Program
  • Data Ingestion with Kafka and Kafka Streaming
    • Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.
  • Streaming API Development and Documentation
    • In this course you will grow your expertise in the components of streaming data systems, and build a real
      time analytics application. Specifically, you will be able to identify components of Spark Streaming (architecture
      and API), build a continuous application with Structured Streaming, consume and process data from Apache
      Kafka with Spark Structured Streaming (including setting up and running a Spark Cluster), create a DataFrame
      as an aggregation of source DataFrames, sink a composite DataFrame to Kafka, and visually inspect a data sink
      for accuracy.
  • Career Services

Taught by

Ben Goldberg, Judit Lantos, David Drummond and Jillian Kim

Related Courses

Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera
Big Data for Better Performance
Open2Study
Big Data and Education
Columbia University via edX
Big Data Analytics in Healthcare
Georgia Institute of Technology via Udacity
Data Mining with Weka
University of Waikato via Independent