YoVDO

Git for Data: Managing Data like Code with lakeFS

Offered By: Confluent via YouTube

Tags

Data Management Courses Git Courses Apache Kafka Courses Data Engineering Courses Merkle Tree Courses LakeFS Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how lakeFS, an open-source data version control tool, transforms object storage into Git-like repositories for managing and testing data like code. Learn about the core benefits of lakeFS, including better data management, reproducibility, and historical data reprocessing. Discover how it differs from Git, utilizes Merkle Trees, and integrates with Apache Kafka. Gain insights into various use cases, industries that benefit from lakeFS, and how it addresses common data problems. Understand the potential of applying code-like workflows to data management, enabling teams to handle petabytes of information efficiently and recover from accidental data deletions or corruptions.

Syllabus

- Intro
- What is lakeFS?
- lakeFS vs. Git
- What is a Merkle Tree?
- What are some lakeFS use-cases?
- What data problems does lakeFS test?
- What types of customers or industries use lakeFS?
- lakeFS and Apache Kafka
- It's a wrap!


Taught by

Confluent

Related Courses

Multi-Table Transactions with LakeFS and Delta Lake - Tech Talk
Databricks via YouTube
CI/CD for Data - Building Dev/Test Data Environments with Open Source Stacks
CNCF [Cloud Native Computing Foundation] via YouTube
Building Reproducible ML Processes with an Open Source Stack
Linux Foundation via YouTube
Power Up Your Lakehouse with Git Semantics and Delta Lake
Databricks via YouTube
Version Control for Lakehouse Architecture - Essential Practices and Benefits
Databricks via YouTube