YoVDO

Speed Up Data Access with PyArrow Apache Arrow - Data is the New API

Offered By: PyCon US via YouTube

Tags

PyCon US Courses Programming Languages Courses pandas Courses NumPy Courses Data Processing Courses Apache Arrow Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the world of Apache Arrow and its Python library PyArrow in this informative talk from PyCon US. Learn about a revolutionary approach to data access that eliminates the need for APIs and serialization/deserialization cycles. Discover how Apache Arrow's language-independent columnar memory format enables zero-copy reads for lightning-fast data access across multiple programming languages. Gain insights into PyArrow's architecture, use cases, and integration with popular Python libraries like pandas and numpy. Examine interesting statistics and improvements that can help determine if PyArrow is suitable for your work. Additionally, get introduced to Apache Flight, a high-performance wire protocol designed for bulk data transfer in analytics. While not a tutorial, this talk provides valuable information to help you decide whether to explore PyArrow for your data access and analytics needs.

Syllabus

Introduction
What is Data
Data is the new API
What is Apache Arrow
Examples
Memory mapped files
Statistics


Taught by

PyCon US

Related Courses

Machine Learning with RAPIDS - Accelerating Data Science Workflows
Nvidia via YouTube
Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube
Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube
DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube
Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube