Delta Lake Streaming - Internals and Query Progress Logs
Offered By: Databricks via YouTube
Course Description
Overview
Dive into the internals of Structured Streaming with Delta Lake in this 29-minute technical talk from Databricks. See how Delta Lake and Structured Streaming work together for real-time data processing. Understand the functional components of a Structured Streaming job that uses Delta as a source, including Query Progress Logs (QPL) and why they matter in production environments. Learn how to track streaming job progress and map it back to the source Delta table using QPL. Examine the contents of the checkpoint directory and their significance for Delta streams. Find out why the combination of Delta Lake and streaming is increasingly popular among users building curated data lakes and end-to-end data pipelines.
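As a rough illustration of mapping a progress log entry back to the source Delta table, the sketch below parses one progress event and pulls out the table versions it covers. It is not taken from the talk. The sample event is hypothetical and trimmed, its values are invented, and the offset field names (reservoirVersion, index, isStartingVersion) reflect the Delta source offset layout as commonly seen in progress output, so check them against your own logs. The helper name delta_offsets is made up for this example.

```python
import json

# Hypothetical, trimmed progress event. All values are invented for illustration;
# the offset field names are an assumption to verify against real progress logs.
SAMPLE_PROGRESS = """
{
  "batchId": 7,
  "numInputRows": 120,
  "sources": [
    {
      "description": "DeltaSource[/tmp/example/source_table]",
      "startOffset": {"reservoirVersion": 4, "index": 2, "isStartingVersion": false},
      "endOffset": {"reservoirVersion": 5, "index": 0, "isStartingVersion": false},
      "numInputRows": 120
    }
  ]
}
"""


def delta_offsets(progress_json):
    """Return one summary dict per source found in a progress event (hypothetical helper)."""
    progress = json.loads(progress_json)
    rows = []
    for source in progress.get("sources", []):
        # startOffset can be null on the first micro-batch, so fall back to an empty dict.
        start = source.get("startOffset") or {}
        end = source.get("endOffset") or {}
        rows.append({
            "batch_id": progress.get("batchId"),
            "source": source.get("description"),
            "start_version": start.get("reservoirVersion"),
            "start_index": start.get("index"),
            "end_version": end.get("reservoirVersion"),
            "end_index": end.get("index"),
            "num_input_rows": source.get("numInputRows"),
        })
    return rows


summary = delta_offsets(SAMPLE_PROGRESS)
for row in summary:
    print(row)
```

The start and end versions can then be compared with the version column of the source table's history to see which commits a given micro-batch read.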
Syllabus
Introduction
Sample Stream
Internals
Query Progress Log
Example Stream
Source Table
Source Table History
Query Progress
Streaming Checkpoint
Taught by
Databricks
Related Courses
Distributed Computing with Spark SQL (University of California, Davis via Coursera)
Apache Spark (TM) SQL for Data Analysts (Databricks via Coursera)
Building Your First ETL Pipeline Using Azure Databricks (Pluralsight)
Implement a data lakehouse analytics solution with Azure Databricks (Microsoft via Microsoft Learn)
Perform data science with Azure Databricks (Microsoft via Microsoft Learn)