Modeling Streaming Data for Processing with Apache Beam
Offered By: Pluralsight
Course Description
Overview
The Apache Beam unified model allows us to process batch as well as streaming data using the same API. Several execution backends, such as Google Cloud Dataflow, Apache Spark, and Apache Flink, are compatible with Beam.
Streaming data usually needs to be processed in real time or near real time, which means stream processing systems must be able to process data with low latency, high performance, and fault tolerance. In this course, Modeling Streaming Data for Processing with Apache Beam, you will gain the ability to work with streams and use the Beam unified model to build data-parallel pipelines. First, you will explore the similarities and differences between batch processing and stream processing. Next, you will discover the Apache Beam APIs that allow you to define pipelines that process batch as well as streaming data. Finally, you will learn how windowing operations can be applied to streaming data. When you are finished with this course, you will have a strong grasp of the models and architectures used with streaming data and be able to work with the Beam unified model to define and run transformations on input streams.
Taught by
Janani Ravi
Related Courses
Microsoft Azure Exam DP-200 - Implementing an Azure Data Solution, offered by A Cloud Guru
Microsoft Azure Exam DP-201 - Designing an Azure Data Solution, offered by A Cloud Guru
Microsoft Certified: Azure Data Engineer Associate (DP-203), offered by A Cloud Guru
Traduciendo texto con Amazon Translate, offered by Coursera Project Network via Coursera
Apprentissage automatique dans le cloud avec AWS Batch (Français) | Machine Learning in the Cloud with AWS Batch (French), offered by Amazon Web Services via AWS Skill Builder