Apache Flink: Batch Mode Data Engineering
Offered By: LinkedIn Learning
Course Description
Overview
Explore how to build batch mode data pipelines with Apache Flink, the powerful and popular stream-processing platform.
Syllabus
Introduction
- Batch mode engineering
- What is Apache Flink?
- Apache Flink features
- Architecture of Apache Flink
- Flink program structure
- Flink execution flow
- Installing Flink standalone
- Creating a Flink project
- Build a sample Flink program
- Running jobs on the cluster
- Using the Flink web interface
- Setting up the exercise files
- DataSet API concepts
- Reading a CSV File
- Using Map
- Using FlatMap
- Using filters
- Using aggregates
- Using Reduce
- Using POJO classes
- Join operations
- Using MySQL with Flink
- Using broadcast variables
- Problem definition
- Computing total score
- Printing scores for physics
- Computing average scores across subjects
- Find the top student for each subject
- Next steps
Taught by
Kumaran Ponnambalam
Related Courses
Azure Data Engineer con Databricks y Azure Data FactoryCoursera Project Network via Coursera Data Integration with Microsoft Azure Data Factory
Microsoft via Coursera Azure Data Factory : Implement SCD Type 1
Coursera Project Network via Coursera Building Resilient Streaming Systems on Google Cloud Platform 日本語版
Google Cloud via Coursera Customising your models with TensorFlow 2
Imperial College London via Coursera