Delta Lake Streaming - Internals and Query Progress Logs

Offered By: Databricks via YouTube

Tags

Delta Lake Courses
Data Lakes Courses
Real-Time Data Processing Courses
Data Pipelines Courses

Course Description

Overview

This 29-minute technical talk from Databricks covers the internals of Structured Streaming with Delta Lake. It walks through how the two integrate for real-time data processing and describes the functional components of Structured Streaming when a Delta table is the source, including Query Progress Logs (QPL) and why they matter in production environments. Learn how to track a streaming job's progress and map it back to the source Delta table using QPL. Examine the contents of the checkpoint directory and their significance for Delta streams. The talk also explains why combining Delta Lake with streaming is becoming increasingly popular among users building curated data lakes and end-to-end data pipelines.
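
To give a rough idea of what mapping a Query Progress Log entry to a source Delta table can look like, here is a minimal Python sketch. It is not taken from the talk. It assumes a progress event shaped like Spark's StreamingQueryProgress JSON, with Delta source offsets that carry `reservoirVersion` (the table version) and `index` (the file position within that version). The sample event, the table path, and the helper name `delta_offsets` are made up for illustration.

```python
import json

# Illustrative progress event. Every value here is made up for the example;
# the field names are assumed to follow Spark's StreamingQueryProgress JSON
# and the Delta source offset layout (reservoirVersion, index).
SAMPLE_PROGRESS = """
{
  "batchId": 3,
  "numInputRows": 120,
  "sources": [
    {
      "description": "DeltaSource[file:/tmp/delta/events]",
      "startOffset": {
        "sourceVersion": 1,
        "reservoirId": "hypothetical-table-id",
        "reservoirVersion": 4,
        "index": -1,
        "isStartingVersion": false
      },
      "endOffset": {
        "sourceVersion": 1,
        "reservoirId": "hypothetical-table-id",
        "reservoirVersion": 5,
        "index": 2,
        "isStartingVersion": false
      },
      "numInputRows": 120
    }
  ]
}
"""


def delta_offsets(progress_json):
    """Return one summary dict per Delta source in a progress event."""
    progress = json.loads(progress_json)
    summaries = []
    for source in progress.get("sources", []):
        description = source.get("description", "")
        if not description.startswith("DeltaSource["):
            continue  # skip Kafka, file, and other non-Delta sources
        start = source.get("startOffset") or {}  # may be null on the first batch
        end = source.get("endOffset") or {}
        summaries.append({
            "batch_id": progress.get("batchId"),
            "source": description,
            "start_version": start.get("reservoirVersion"),
            "start_index": start.get("index"),
            "end_version": end.get("reservoirVersion"),
            "end_index": end.get("index"),
            "rows": source.get("numInputRows"),
        })
    return summaries


for summary in delta_offsets(SAMPLE_PROGRESS):
    print(summary)
```

In a running PySpark job, the dictionary returned by `query.lastProgress` could be serialized with `json.dumps` and passed to the same helper. The reported versions can then be compared with the output of `DESCRIBE HISTORY` on the source table.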

Syllabus

Introduction
Sample Stream
Internals
Query Progress Log
Example Stream
Source Table
Source Table History
Query Progress
Streaming Checkpoint


Taught by

Databricks

Related Courses

Distributed Computing with Spark SQL
University of California, Davis via Coursera
Apache Spark (TM) SQL for Data Analysts
Databricks via Coursera
Building Your First ETL Pipeline Using Azure Databricks
Pluralsight
Implement a data lakehouse analytics solution with Azure Databricks
Microsoft via Microsoft Learn
Perform data science with Azure Databricks
Microsoft via Microsoft Learn