Efficient Near Real-Time Event Ingestion using DLT - Insights and Lessons
Offered By: Databricks via YouTube
Course Description
Overview
Explore Nextdoor's journey from hourly batch event ingestion to a near-real-time streaming solution using Delta Live Tables (DLT) in this 18-minute conference talk. Discover how this transformation enables internal users to query events promptly for analysis, monitoring, and real-time aggregations while reducing compute costs. Gain insights into leveraging file notification over directory listing, implementing effective monitoring techniques, and resolving conflicts between streaming and batch pipelines. Learn about using custom Spark metrics to determine optimal data consumption points and understand how to leverage schema evolution for evolving event schemas within DLT. Presented by Kavin Palanisamy, Software Engineer at Nextdoor's Data Platform team, this talk offers valuable lessons and practical knowledge for data engineers and analysts working with real-time data ingestion and processing.
Syllabus
Efficient Near Real-Time Event Ingestion using DLT: Insights and Lessons
Taught by
Databricks
Related Courses
Understanding China, 1700-2000: A Data Analytic Approach, Part 1The Hong Kong University of Science and Technology via Coursera The Analytics Edge
Massachusetts Institute of Technology via edX 大数据与信息传播 Big Data and Information Dissemination
Fudan University via Coursera The Future of Fashion
Marist College via Independent The Mobile Consumer
Marist College via Independent