YoVDO

Power Up Your Lakehouse with Git Semantics and Delta Lake

Offered By: Databricks via YouTube

Tags

Delta Lake Courses, Data Engineering Courses, Time Travel Courses, DataOps Courses, LakeFS Courses

Course Description

Overview

This 23-minute conference talk shows how Git semantics and Delta Lake can strengthen a lakehouse architecture. It covers the data versioning challenges that come up in DataOps: writing, auditing, and publishing changes; rolling back to a consistent state; creating reproducible workloads; and building economical dev/test environments. Delta Lake provides a linear history through table snapshots, and lakeFS adds branching and merging on top of it. Combined, the two apply Git-like semantics to time travel in the lakehouse, which the talk presents as improving both data quality and operational economics. The speaker is Oz Katz, CTO and co-creator of lakeFS, who explains how to put these tools to work for better data management.
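To make the distinction between linear snapshot history and branching concrete, here is a minimal in-memory sketch in Python. It is a toy model only: the `ToyRepo` class and all of its methods are hypothetical names invented for illustration, and it does not use or reproduce the Delta Lake or lakeFS APIs. The merge shown is a deliberately simplified "publish the branch's latest state" step, with no conflict handling.

```python
from dataclasses import dataclass, field


@dataclass
class ToyRepo:
    """Hypothetical in-memory model; not the Delta Lake or lakeFS API."""

    # Each branch maps to a linear list of snapshots (version 0, 1, 2, ...).
    branches: dict = field(default_factory=lambda: {"main": [{}]})

    def commit(self, branch, changes):
        # A commit appends a new full snapshot; older snapshots stay readable.
        snapshot = {**self.branches[branch][-1], **changes}
        self.branches[branch].append(snapshot)

    def read(self, branch, version=-1):
        # "Time travel": read any earlier snapshot by its version number.
        return self.branches[branch][version]

    def create_branch(self, name, source="main"):
        # A branch starts from the source's history without touching it.
        self.branches[name] = list(self.branches[source])

    def merge(self, source, target):
        # Simplified merge: publish the source's latest state onto the target.
        self.commit(target, self.branches[source][-1])


repo = ToyRepo()
repo.commit("main", {"orders": "v1"})
repo.create_branch("dev")
repo.commit("dev", {"orders": "v2"})        # isolated write on the branch
before_merge = repo.read("main")            # main is unchanged so far
repo.merge("dev", "main")                   # publish after auditing
after_merge = repo.read("main")
rolled_back = repo.read("main", version=1)  # earlier consistent state
```

In this sketch, writes on `dev` stay invisible to readers of `main` until the merge, and every earlier version of `main` remains readable afterwards, which mirrors the write, audit, publish, and rollback workflow the description mentions.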

Syllabus

Power Up Your Lakehouse with Git Semantics & Delta Lake


Taught by

Databricks

Related Courses

Multi-Table Transactions with LakeFS and Delta Lake - Tech Talk
Databricks via YouTube
CI/CD for Data - Building Dev/Test Data Environments with Open Source Stacks
CNCF [Cloud Native Computing Foundation] via YouTube
Building Reproducible ML Processes with an Open Source Stack
Linux Foundation via YouTube
Version Control for Lakehouse Architecture - Essential Practices and Benefits
Databricks via YouTube
Developing Data Pipelines with Branch Deployments - A New Approach
Databricks via YouTube