Power Up Your Lakehouse with Git Semantics and Delta Lake
Offered By: Databricks via YouTube
Course Description
Overview
Explore a 23-minute conference talk on enhancing lakehouse architecture with Git semantics and Delta Lake. Learn how to overcome data versioning challenges in DataOps, including writing, auditing, and publishing changes, rolling back to consistent states, creating reproducible workloads, and building economical dev/test environments. Discover how combining Delta Lake and lakeFS applies Git-like semantics to improve time travel capabilities in lakehouses. Understand how Delta Lake provides a linear history through table snapshots, while lakeFS adds branching and merging, resulting in better data quality and operational economics. Gain insights from Oz Katz, CTO and Co-creator of lakeFS, on implementing these tools for improved data management practices.
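The difference between a linear snapshot history and Git-like branching can be illustrated with a small in-memory toy model. This is only a sketch of the idea and does not use the Delta Lake or lakeFS APIs: the names `Commit` and `ToyRepo`, the `orders` key, and the fast-forward-only merge are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Commit:
    """An immutable snapshot of the data plus a pointer to its parent."""
    data: dict
    parent: "Commit | None" = None


class ToyRepo:
    """Hypothetical toy model: branches are just named pointers to commits."""

    def __init__(self, initial):
        self.branches = {"main": Commit(dict(initial))}

    def branch(self, name, source="main"):
        # Creating a branch copies a pointer, not the data.
        self.branches[name] = self.branches[source]

    def commit(self, branch, changes):
        # Each commit extends that branch's own linear history.
        head = self.branches[branch]
        self.branches[branch] = Commit({**head.data, **changes}, parent=head)

    def merge(self, source, target="main"):
        # Fast-forward only: the target head must be an ancestor of the source.
        node = self.branches[source]
        while node is not None and node is not self.branches[target]:
            node = node.parent
        if node is None:
            raise ValueError("target has diverged; this toy only fast-forwards")
        self.branches[target] = self.branches[source]

    def rollback(self, branch):
        # Move the branch pointer back to the previous consistent state.
        head = self.branches[branch]
        if head.parent is None:
            raise ValueError("no earlier state to roll back to")
        self.branches[branch] = head.parent


# Write on an isolated branch, audit it, then publish by merging.
repo = ToyRepo({"orders": 100})
repo.branch("etl-run")
repo.commit("etl-run", {"orders": 120})
main_before_merge = repo.branches["main"].data   # main is untouched so far

repo.merge("etl-run")
main_after_merge = repo.branches["main"].data    # the change is now published

repo.rollback("main")
main_after_rollback = repo.branches["main"].data  # back to the prior state
```

In this sketch, changes stay invisible to `main` until the merge, and a rollback is a pointer move rather than a data rewrite, which is the intuition behind cheap dev/test environments and consistent rollbacks mentioned in the overview.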
Syllabus
Power Up Your Lakehouse with Git Semantics & Delta Lake
Taught by
Databricks
Related Courses
Multi-Table Transactions with LakeFS and Delta Lake - Tech Talk
Databricks via YouTube
CI/CD for Data - Building Dev/Test Data Environments with Open Source Stacks
CNCF [Cloud Native Computing Foundation] via YouTube
Building Reproducible ML Processes with an Open Source Stack
Linux Foundation via YouTube
Version Control for Lakehouse Architecture - Essential Practices and Benefits
Databricks via YouTube
Developing Data Pipelines with Branch Deployments - A New Approach
Databricks via YouTube