YoVDO

Apache Flink: Batch Mode Data Engineering

Offered By: LinkedIn Learning

Tags

Apache Flink Courses Stream Processing Courses Data Pipelines Courses JOIN Operations Courses

Course Description

Overview

Explore how to build batch mode data pipelines with Apache Flink, the powerful and popular stream-processing platform.

Syllabus

Introduction
  • Batch mode engineering
1. Apache Flink
  • What is Apache Flink?
  • Apache Flink features
  • Architecture of Apache Flink
  • Flink program structure
  • Flink execution flow
2. Setting Up Flink
  • Installing Flink standalone
  • Creating a Flink project
  • Build a sample Flink program
  • Running jobs on the cluster
  • Using the Flink web interface
  • Setting up the exercise files
3. Dataset API
  • DataSet API concepts
  • Reading a CSV File
  • Using Map
  • Using FlatMap
  • Using filters
  • Using aggregates
  • Using Reduce
4. Advanced Capabilities
  • Using POJO classes
  • Join operations
  • Using MySQL with Flink
  • Using broadcast variables
5. Use Case Project
  • Problem definition
  • Computing total score
  • Printing scores for physics
  • Computing average scores across subjects
  • Find the top student for each subject
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Apache Kafka Deep Dive
A Cloud Guru
Microsoft Certified: Azure Data Engineer Associate (DP-203)
A Cloud Guru
Approfondimento sui concetti e gli strumenti per analizzare i dati in streaming (Italiano) | Deep Dive into Concepts and Tools for Analyzing Streaming Data (Italian)
Amazon Web Services via AWS Skill Builder
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Apache Kafka
LearnKartS via Coursera