Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Columnar Format
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore Lance, a columnar format designed for AI workloads. Examine where existing formats such as Parquet and ORC fall short for ML/AI tasks, and see how Lance targets better performance for vector search, deep learning, model evaluation, and exploratory data analysis of unstructured data. Learn about the motivations behind this open-source format, its design principles, and how it is optimized for modern storage options. Gain insight into adopting Lance for AI workloads, improving performance with minimal effort, and preparing your data lakehouse infrastructure for widespread enterprise adoption of AI.
Syllabus
Beyond Parquet and ORC: Upgrading Data Infrastructure for Multi-modal AI with Lance Col... Chang She
Taught by
Linux Foundation
Related Courses
Google Cloud Platform for Machine Learning Essential Training (LinkedIn Learning)
Answer Complex Questions From an Arbitrarily Large Set of Documents With Vector Search and GPT-3 (David Shapiro ~ AI via YouTube)
Advanced Sentiment Analysis with NLP Transformers and Vector Search (James Briggs via YouTube)
GUI-Based Few Shot Classification Model Trainer - Demo (James Briggs via YouTube)
Spotify's Podcast Search Explained (James Briggs via YouTube)