Building a Unified Feature Platform with DuckDB and Arrow
Offered By: Data Council via YouTube
Course Description
Overview
Dive into a technical talk by Mike Eastham, Chief Architect at Tecton, exploring the development of Tecton's innovative Feature Platform for Machine Learning. Learn how DuckDB and Arrow are leveraged to address data transformation challenges and provide a faster, more integrated local development experience. Discover effective strategies for scaling datasets without a distributed query engine, techniques for implementing DuckDB extensions, and insights on creating interoperability with DeltaLake outside the Spark ecosystem. Compare performance with Spark and understand how Tecton utilizes cloud resources to handle large datasets while maintaining an optimal laptop developer experience. Gain valuable knowledge on building cutting-edge data and AI systems from this 36-minute session presented at Data Council.
Syllabus
Building a Unified Feature Platform with DuckDB and Arrow
Taught by
Data Council
Related Courses
DuckDB - High-Performance SQL Queries on Pandas Dataframe - PythonSamuel Chan via YouTube New Feature - What the Fibers Extension Can Do for You
International PHP Conference via YouTube Getting Started with PHP-FFI - Introduction to Foreign Function Interface
International PHP Conference via YouTube DuckDB - Bringing Analytical SQL Directly to Your Python Shell
EuroPython Conference via YouTube OLAP on Cassandra Data with Arrow, Flight SQL, ADBC, and DuckDB
Linux Foundation via YouTube