Apache Arrow and Go: A Match Made in Data
Offered By: The ASF via YouTube
Course Description
Overview
Explore the Golang implementations of Apache Arrow and Parquet in this 42-minute conference talk from ApacheCon 2022. Dive into the process of getting started with Go Arrow and Parquet libraries, learn how to create an Arrow Flight server and client over gRPC, and discover methods for integrating with other runtimes using the C Data API. Examine how Go's concurrency primitives make it an ideal choice for constructing efficient pipelines for parallel processing of large data volumes. Gain insights into the internals of the implementation, understanding how Go Arrow and Parquet libraries achieve their performance, including the benefits of SIMD. Perfect for developers looking to leverage Apache Arrow's growing standardization in data processing with Go's powerful features.
Syllabus
Apache Arrow and Go A match made in Data - Matthew Topol
Taught by
The ASF
Related Courses
Python for Data Science Tips, Tricks, & TechniquesLinkedIn Learning Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube Recent Parquet Improvements in Apache Spark - Vectorized Complex Types and Column Index Support
Databricks via YouTube Optimizing Spark SQL Jobs with Parallel and Asynchronous IO
Databricks via YouTube Degrading Performance - Understanding and Solving Small Files Syndrome
Databricks via YouTube