
Apache Arrow DataFusion - A Fast, Embeddable, Modular Analytic Query Engine

Offered By: CMU Database Group via YouTube

Tags

Apache Arrow Courses, SQL Courses, Rust Courses, Data Processing Courses, Distributed Computing Courses

Course Description

Overview

This seminar from the CMU Database Group's Database Building Blocks series introduces Apache Arrow DataFusion, a fast, embeddable, and modular analytic query engine, and is presented by Andrew Lamb. The talk covers the engine's architecture, its performance optimizations, and its role in modern data analytics. It explains how DataFusion uses the Apache Arrow format for efficient in-memory processing and how it can be integrated with other data systems. It also discusses real-world applications and use cases in data-intensive environments, and shows how the modular design allows the engine to be customized for specific analytical needs. The hour-long talk is aimed at database professionals, data engineers, and anyone interested in high-performance query processing.

Syllabus

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)


Taught by

CMU Database Group

Related Courses

Machine Learning with RAPIDS - Accelerating Data Science Workflows
Nvidia via YouTube
Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube
Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube
DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube
Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube