YoVDO

Ten Years of Building Open Source Standards: From Parquet to Arrow to OpenLineage

Offered By: Data Council via YouTube

Tags

Apache Arrow Courses Open Source Courses Data Lineage Courses Columnar Storage Courses Apache Parquet Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the journey of building successful open source projects in the data ecosystem through this 35-minute conference talk by Julien Le Dem, Chief Architect at Astronomer and Co-Founder of Datakin. Gain insights into the ideation process and early growth of Apache Parquet columnar format, and discover how it led to the creation of Apache Arrow. Learn about the development of OpenLineage, an LFAI & Data project bringing observability to the data ecosystem. Understand the factors that contributed to the success of these projects and how they have shaped the data landscape over the past decade. Benefit from Le Dem's extensive experience in data processing tools and content platforms, including his work at Twitter, Wework, Dremio, and Yahoo.

Syllabus

Ten years of building open source standards: From Parquet to Arrow to OpenLineage | Astronomer


Taught by

Data Council

Related Courses

Machine Learning with RAPIDS - Accelerating Data Science Workflows
Nvidia via YouTube
Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube
Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube
DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube
Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube