Ten Years of Building Open Source Standards in Data Engineering
Offered By: Data Council via YouTube
Course Description
Overview
A 38-minute conference talk in which Julien Le Dem recounts a decade of building open-source standards in data engineering: the birth of Parquet, the transition to low-latency systems, and the development of Arrow. The talk shows how community-driven collaboration has driven advances in data processing over the past ten years, and draws on Le Dem's experience contributing to major developments at companies like WeWork, Dremio, Twitter, and Yahoo.
Syllabus
Ten Years of Building Open Source Standards
Taught by
Data Council
Related Courses
Using Pandas and Dask to Work with Large Columnar Datasets in Apache Parquet
EuroPython Conference via YouTube
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube
Building InfluxDB 3.0 with Apache Arrow, DataFusion, Flight and Parquet
Data Council via YouTube
Time Series Analytics with Apache Arrow, Pandas, and Parquet - A 101 Introduction
Data Council via YouTube
Ten Years of Building Open Source Standards: From Parquet to Arrow to OpenLineage
Data Council via YouTube