Power Up Your Lakehouse with Git Semantics and Delta Lake
Offered By: Databricks via YouTube
Course Description
Overview
Explore a 23-minute conference talk that delves into enhancing lakehouse architecture with Git semantics and Delta Lake. Learn how to overcome data versioning challenges in DataOps, including writing, auditing, and publishing changes, rolling back to consistent states, creating reproducible workloads, and building economical dev/test environments. Discover how the combination of Delta Lake and lakeFS can apply Git-like semantics to improve time travel capabilities in lakehouses. Understand how Delta Lake provides linear history through table snapshots, while lakeFS adds branching and merging functionalities, resulting in enhanced data quality and operational economics. Gain insights from Oz Katz, CTO and Co-creator of lakeFS, on implementing these tools for improved data management practices.
Syllabus
Power Up Your Lakehouse with Git Semantics & Delta Lake
Taught by
Databricks
Related Courses
Distributed Computing with Spark SQLUniversity of California, Davis via Coursera Apache Spark (TM) SQL for Data Analysts
Databricks via Coursera Building Your First ETL Pipeline Using Azure Databricks
Pluralsight Implement a data lakehouse analytics solution with Azure Databricks
Microsoft via Microsoft Learn Perform data science with Azure Databricks
Microsoft via Microsoft Learn