Power Up Your Lakehouse with Git Semantics and Delta Lake
Offered By: Databricks via YouTube
Course Description
Overview
Explore a 23-minute conference talk that delves into enhancing lakehouse architecture with Git semantics and Delta Lake. Learn how to overcome data versioning challenges in DataOps, including writing, auditing, and publishing changes, rolling back to consistent states, creating reproducible workloads, and building economical dev/test environments. Discover how the combination of Delta Lake and lakeFS can apply Git-like semantics to improve time travel capabilities in lakehouses. Understand how Delta Lake provides linear history through table snapshots, while lakeFS adds branching and merging functionalities, resulting in enhanced data quality and operational economics. Gain insights from Oz Katz, CTO and Co-creator of lakeFS, on implementing these tools for improved data management practices.
Syllabus
Power Up Your Lakehouse with Git Semantics & Delta Lake
Taught by
Databricks
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera