Apache Flink: Batch Mode Data Engineering
Offered By: LinkedIn Learning
Course Description
Overview
Explore how to build batch mode data pipelines with Apache Flink, the powerful and popular stream-processing platform.
Syllabus
Introduction
- Batch mode engineering
- What is Apache Flink?
- Apache Flink features
- Architecture of Apache Flink
- Flink program structure
- Flink execution flow
- Installing Flink standalone
- Creating a Flink project
- Build a sample Flink program
- Running jobs on the cluster
- Using the Flink web interface
- Setting up the exercise files
- DataSet API concepts
- Reading a CSV File
- Using Map
- Using FlatMap
- Using filters
- Using aggregates
- Using Reduce
- Using POJO classes
- Join operations
- Using MySQL with Flink
- Using broadcast variables
- Problem definition
- Computing total score
- Printing scores for physics
- Computing average scores across subjects
- Find the top student for each subject
- Next steps
Taught by
Kumaran Ponnambalam
Related Courses
Cloud Computing Concepts: Part 2University of Illinois at Urbana-Champaign via Coursera Programming Reactive Systems
École Polytechnique Fédérale de Lausanne via edX Data Engineering on Google Cloud Platform en Français
Google Cloud via Coursera Architecting Stream Processing Solutions Using Google Cloud Pub/Sub
Pluralsight Developing Stream Processing Applications with AWS Kinesis
Pluralsight