Building a Unified Feature Platform with DuckDB and Arrow

Offered By: Data Council via YouTube

Tags

DuckDB Courses
Data Science Courses
Machine Learning Courses
Apache Spark Courses
Data Transformation Courses
Distributed Computing Courses
Feature Engineering Courses
Apache Arrow Courses
Tecton Courses

Course Description

Overview

Watch a technical talk by Mike Eastham, Chief Architect at Tecton, on the development of Tecton's Feature Platform for Machine Learning. Learn how DuckDB and Arrow are used to address data transformation challenges and provide a faster, more integrated local development experience. Topics include strategies for scaling datasets without a distributed query engine, techniques for implementing DuckDB extensions, and interoperability with Delta Lake outside the Spark ecosystem. The talk also compares performance with Spark and explains how Tecton uses cloud resources to handle large datasets while keeping a good developer experience on a laptop. This 36-minute session was presented at Data Council.

Syllabus

Building a Unified Feature Platform with DuckDB and Arrow


Taught by

Data Council

Related Courses

Machine Learning with RAPIDS - Accelerating Data Science Workflows
Nvidia via YouTube
Streaming Featurization with Ibis, Substrait and Apache Arrow
Open Data Science via YouTube
Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube
DataFusion and Apache Arrow: Supercharging Data Analytics with a Rust-Based Query Engine
Databricks via YouTube
Cloud Fetch: High-Bandwidth Connectivity for BI Tools - Databricks
Databricks via YouTube