Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source
Offered By: Linux Foundation via YouTube
Course Description
Overview
Syllabus
Intro
Presenter Bio
What makes analytic applications special?
SQL data warehouses run analytic queries
What ClickHouse is not
Merge Tree is the workhorse table engine
Merge Tree data layout
Detailed storage layout within a single part /var/lib/clickhouse/data/airline/ontime
Adding CPUs boosts parallelized execution
I/O drives ClickHouse performance
Compression and codecs reduce I/O
Effect on storage is dramatic
Materialized views restructure/reduce data
Pattern: TTLs + downsampled views
Alternative pattern: Tiered storage
More table engines for clustering!
How do distributed queries work?
Pattern: Kafka-based ingestion pipelines
Alternative ingest pattern: Kafka engine
Pattern: Grafana visualization
Pattern: Operation on Kubernetes
Wrap-up . ClickHouse meets/beats proprietary SQL data warehouses in head-to-head comparisons
Taught by
Linux Foundation
Tags
Related Courses
Crie sua página pessoal usando React e Github PagesCoursera Project Network via Coursera Introduction to RISC-V
Linux Foundation via edX C# Framework Design
LinkedIn Learning GitHub Basics Course (How To)
Treehouse Android Development from Scratch to Create Cool Apps!
Udemy