Ten Years of Building Open Source Standards: From Parquet to Arrow to OpenLineage
Offered By: Data Council via YouTube
Course Description
Overview
Explore the journey of building successful open source projects in the data ecosystem through this 35-minute conference talk by Julien Le Dem, Chief Architect at Astronomer and Co-Founder of Datakin. Gain insights into the ideation process and early growth of Apache Parquet columnar format, and discover how it led to the creation of Apache Arrow. Learn about the development of OpenLineage, an LFAI & Data project bringing observability to the data ecosystem. Understand the factors that contributed to the success of these projects and how they have shaped the data landscape over the past decade. Benefit from Le Dem's extensive experience in data processing tools and content platforms, including his work at Twitter, Wework, Dremio, and Yahoo.
Syllabus
Ten years of building open source standards: From Parquet to Arrow to OpenLineage | Astronomer
Taught by
Data Council
Related Courses
Machine Learning with RAPIDS - Accelerating Data Science WorkflowsNvidia via YouTube Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube