Building an Open Source Streaming Analytics Stack with Kafka and Druid
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore how to construct a streaming analytics stack using Kafka and Druid in this 41-minute conference talk from the Linux Foundation. Learn about the challenges of batch processing systems and discover how combining Kafka and Druid can create a robust data pipeline supporting real-time and batch ingestion with flexible, low-latency queries. Delve into topics such as event handling, data delivery problems, stream processing challenges, and approximation algorithms. Gain insights into Druid's architecture and understand how this open-source technology combination can guarantee system availability, maintain data integrity, and support fast, flexible queries for deriving insights from vast quantities of data.
Syllabus
Introduction
Overview
The Problem
Events
Example
Problems
Models
Data Delivery
Data Delivery Problems
Kafka Summary
Stream Processing
Stream Processing Challenges
Stream Processing System
Challenges
Subheading Queries
Technical Overview
Approximation Algorithms
Druid Architecture
Rules of Example
Raw Data
Shuffle
Join
Joint
Conclusions
Taught by
Linux Foundation
Tags
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en EspañolGoogle Cloud via Coursera Data Analysis with Python
IBM via Coursera Intro to TensorFlow 日本語版
Google Cloud via Coursera TensorFlow on Google Cloud - Français
Google Cloud via Coursera Freedom of Data with SAP Data Hub
SAP Learning