Git for Data: Managing Data like Code with lakeFS
Offered By: Confluent via YouTube
Course Description
Overview
Explore how lakeFS, an open-source data version control tool, transforms object storage into Git-like repositories for managing and testing data like code. Learn about the core benefits of lakeFS, including better data management, reproducibility, and historical data reprocessing. Discover how it differs from Git, utilizes Merkle Trees, and integrates with Apache Kafka. Gain insights into various use cases, industries that benefit from lakeFS, and how it addresses common data problems. Understand the potential of applying code-like workflows to data management, enabling teams to handle petabytes of information efficiently and recover from accidental data deletions or corruptions.
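For readers new to the Merkle tree idea mentioned above, here is a minimal sketch in Python. It illustrates the general technique only and is not lakeFS's actual implementation: the function names, the sample paths, and the pairwise SHA-256 hashing scheme are all assumptions chosen for illustration. Each object is hashed, the hashes are combined pairwise up to a single root, and any change to any object produces a different root.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def merkle_root(objects: dict[str, bytes]) -> str:
    """Compute a Merkle root over a mapping of path -> content.

    Hypothetical helper for illustration; not part of any lakeFS API.
    """
    # Leaf hashes, ordered by path so the result is deterministic.
    level = [
        sha256_hex(path.encode() + b"\0" + content)
        for path, content in sorted(objects.items())
    ]
    if not level:
        return sha256_hex(b"")
    # Combine hashes pairwise until only the root remains.
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # duplicate the last hash on odd levels
        level = [
            sha256_hex((level[i] + level[i + 1]).encode())
            for i in range(0, len(level), 2)
        ]
    return level[0]


# Two "versions" of a tiny dataset (made-up paths and contents).
v1 = {
    "events/2024-01-01.parquet": b"day one",
    "events/2024-01-02.parquet": b"day two",
}
v2 = {**v1, "events/2024-01-02.parquet": b"day two, corrected"}

print(merkle_root(v1) == merkle_root(dict(v1)))  # same content, same root
print(merkle_root(v1) == merkle_root(v2))        # one object changed, root differs
```

Because the root summarizes every object beneath it, two versions of a dataset can be compared by comparing a single hash, which is what makes Git-like commits and diffs over large object stores practical.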
Syllabus
- Intro
- What is lakeFS?
- lakeFS vs. Git
- What is a Merkle Tree?
- What are some lakeFS use-cases?
- What data problems does lakeFS solve?
- What types of customers or industries use lakeFS?
- lakeFS and Apache Kafka
- It's a wrap!
Taught by
Confluent
Related Courses
- Digital Data and Services, in the Cloud and Elsewhere (Données et services numériques, dans le nuage et ailleurs), offered by Certificat informatique et internet via France Université Numérique
- Introduction to Digital Curation, offered by University College London via Independent
- Advanced Excel (Excel Avanzado), offered by Miríadax
- SAP Business Warehouse powered by SAP HANA, offered by SAP Learning
- Programming Mobile Applications for Android Handheld Systems: Part 2, offered by University of Maryland, College Park via Coursera