OLAP on Cassandra Data with Arrow, Flight SQL, ADBC, and DuckDB
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore innovative approaches to performing OLAP on Cassandra data using modern technologies in this 30-minute conference talk. Learn how to overcome traditional challenges of accessing Cassandra data for analytical purposes without impacting transactional workloads. Discover techniques for converting Cassandra sstable snapshots into parquet files and querying them with a cutting-edge OLAP stack. Gain insights into the future of OLAP, focusing on standards and technologies like Apache Arrow, Arrow JDBC, and ADBC for enhanced OLAP engine pluggability. Understand how to leverage Flight SQL Server to enable OLAP SQL and Ibis queries against parquet data exported from Cassandra. Acquire practical knowledge on running Flight SQL server with DuckDB and SQLite back-ends, securing Flight SQL, and deploying it in Kubernetes environments with Graviton (arm64) CPUs.
Syllabus
OLAP on Your Cassandra Data with Arrow, Flight SQL, ADBC, and DuckDB - Philip Moore, Voltron Data
Taught by
Linux Foundation
Tags
Related Courses
Machine Learning with RAPIDS - Accelerating Data Science WorkflowsNvidia via YouTube Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube