Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Columnar Format
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the cutting-edge Lance columnar format designed to revolutionize data infrastructure for AI workloads. Delve into the limitations of existing formats like Parquet and ORC for ML/AI tasks, and discover how Lance offers superior performance for vector search, deep learning, model evaluations, and exploratory data analysis of unstructured data. Learn about the motivations behind this new open-source format, its innovative design principles, and how it optimizes for modern storage options. Gain insights into implementing Lance for AI workloads, achieving significant performance improvements with minimal effort, and preparing your data lakehouse infrastructure for the ubiquitous adoption of AI across enterprises.
Syllabus
Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Col... Chang She
Taught by
Linux Foundation
Tags
Related Courses
Neural Networks for Machine LearningUniversity of Toronto via Coursera 機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera Machine Learning Capstone: An Intelligent Application with Deep Learning
University of Washington via Coursera Прикладные задачи анализа данных
Moscow Institute of Physics and Technology via Coursera Leading Ambitious Teaching and Learning
Microsoft via edX