Building an Open Source Streaming Analytics Stack with Kafka and Druid

Offered By: Linux Foundation via YouTube

Tags

Data Pipelines Courses
Data Integrity Courses
Event Handling Courses
Stream Processing Courses
Approximation Algorithms Courses
Batch Processing Courses
Real-Time Data Processing Courses

Course Description

Overview

Explore how to construct a streaming analytics stack using Kafka and Druid in this 41-minute conference talk from the Linux Foundation. Learn about the challenges of batch processing systems and discover how combining Kafka and Druid can create a robust data pipeline supporting real-time and batch ingestion with flexible, low-latency queries. Delve into topics such as event handling, data delivery problems, stream processing challenges, and approximation algorithms. Gain insights into Druid's architecture and understand how this open-source technology combination can guarantee system availability, maintain data integrity, and support fast, flexible queries for deriving insights from vast quantities of data.

Syllabus

Introduction
Overview
The Problem
Events
Example
Problems
Models
Data Delivery
Data Delivery Problems
Kafka Summary
Stream Processing
Stream Processing Challenges
Stream Processing System
Challenges
Subheading Queries
Technical Overview
Approximation Algorithms
Druid Architecture
Rules of Example
Raw Data
Shuffle
Join
Joint
Conclusions


Taught by

Linux Foundation

Related Courses

Google Cloud Big Data and Machine Learning Fundamentals en Español (Google Cloud via Coursera)
Data Analysis with Python (IBM via Coursera)
Intro to TensorFlow 日本語版 (Google Cloud via Coursera)
TensorFlow on Google Cloud - Français (Google Cloud via Coursera)
Freedom of Data with SAP Data Hub (SAP Learning)