Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Columnar Format
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the cutting-edge Lance columnar format designed to revolutionize data infrastructure for AI workloads. Delve into the limitations of existing formats like Parquet and ORC for ML/AI tasks, and discover how Lance offers superior performance for vector search, deep learning, model evaluations, and exploratory data analysis of unstructured data. Learn about the motivations behind this new open-source format, its innovative design principles, and how it optimizes for modern storage options. Gain insights into implementing Lance for AI workloads, achieving significant performance improvements with minimal effort, and preparing your data lakehouse infrastructure for the ubiquitous adoption of AI across enterprises.
Syllabus
Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Col... Chang She
Taught by
Linux Foundation
Tags
Related Courses
Macroeconometric ForecastingInternational Monetary Fund via edX Machine Learning With Big Data
University of California, San Diego via Coursera Data Science at Scale - Capstone Project
University of Washington via Coursera Structural Equation Model and its Applications | 结构方程模型及其应用 (粤语)
The Chinese University of Hong Kong via Coursera Data Science in Action - Building a Predictive Churn Model
SAP Learning