Getting Started with Delta Lake
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the fundamentals of Delta Lake in this 38-minute sponsored tutorial presented by Guenia Izquierdo Delgado and Sajith Appukuttan from Databricks. Learn about the open-source storage framework that enables Lakehouse architecture creation using various compute engines like Spark, PrestoDB, Flink, Trino, and Hive. Discover how Delta Lake addresses modern data engineering requirements and challenges, focusing on data reliability and optimized query performance for big data use cases. Through presentations, hands-on code examples, and notebooks, gain insights into batch and streaming data ingestion, fast interactive queries, and machine learning applications. Understand key data reliability challenges, learn how Delta Lake improves data lakes at scale, and explore its role within the wider open-source ecosystem for framework and tool developers. By the end of the tutorial, acquire knowledge on creating a Lakehouse architecture using Delta Lake and its potential benefits for your organization. To participate fully, ensure you have Docker engine installed on your computer.
Syllabus
Sponsored Session: Getting Started with Delta Lake - Guenia Izquierdo Delgado & Sajith Appukuttan
Taught by
Linux Foundation
Tags
Related Courses
Delta Lake 2.0 Overview - New Features and Community CollaborationsDatabricks via YouTube Why Lakehouse Architecture Now - Exploring Enterprise Data Warehouse Failures and the Need for Lakehouse Paradigm
Databricks via YouTube Scaling Climate Data for FinTech with an Open Source Data Mesh
Linux Foundation via YouTube ETL - Extract Trino Load: A Case for Trino as a Batch Processing Engine
Linux Foundation via YouTube Consuming Legend Data Models and Services Using BI, Python/ML and Other Tools
Linux Foundation via YouTube