YoVDO

Data Streaming

Offered By: Udacity

Tags

Big Data Courses, Apache Spark Courses, Data Engineering Courses, Real-Time Data Processing Courses, Spark Streaming Courses

Course Description

Overview

Learn the latest skills to process data in real time by building fluency in modern data engineering tools such as Apache Spark, Kafka, Spark Streaming, and Kafka Streaming.

Syllabus

  • Welcome to the Data Streaming Nanodegree Program
  • Data Ingestion with Kafka and Kafka Streaming
    • Learn to use REST Proxy, Kafka Connect, KSQL, and Faust (Python stream processing), then apply them to stream public transit statuses through Kafka and its ecosystem, building a stream processing application that shows the status of trains in real time.
  • Streaming API Development and Documentation
    • In this course you will deepen your expertise in the components of streaming data systems and build a
      real-time analytics application. Specifically, you will learn to identify the components of Spark Streaming
      (architecture and API), build a continuous application with Structured Streaming, consume and process data
      from Apache Kafka with Spark Structured Streaming (including setting up and running a Spark cluster), create
      a DataFrame as an aggregation of source DataFrames, sink a composite DataFrame to Kafka, and visually inspect
      a data sink for accuracy.
  • Career Services

Taught by

Ben Goldberg, Judit Lantos, David Drummond and Jillian Kim

Related Courses

In-Memory Database Management (内存数据库管理)
openHPI
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Processing Big Data with Azure Data Lake Analytics
Microsoft via edX
Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera