YoVDO

Data Mesh in Practice - Beyond the Data Lake for Fashion E-Commerce

Offered By: Databricks via YouTube

Tags

Data Mesh Courses Data Lakes Courses Data Management Courses Fashion Industry Courses Delta Lake Courses

Course Description

Overview

Explore how Zalando, Europe's leading online fashion platform, transitioned from a centralized Data Lake to a distributed Data Mesh architecture in this 30-minute conference talk. Learn about the challenges of the Data Lake paradigm, including unclear responsibilities, lack of data ownership, and poor data availability. Discover how Zalando addressed these issues by implementing a decentralized approach that empowers data owners with domain knowledge. Delve into the concept of Data Products, which go beyond simple file sharing to ensure quality and acknowledge ownership. Follow Zalando's journey as they built their Data Mesh using Spark and Delta Lake, and gain insights into their ongoing efforts to simplify data product creation through templating. Understand the benefits of this architectural shift in terms of scalability, accessibility, and data democratization for large-scale organizations.

Syllabus

Intro
Introductions
Arifs background
Agenda
Data Sources
Data Pipeline
The bottleneck
The pain points
What is Data Mesh
Product Thinking
Data Product Manager
Bring Your Own Bucket
Central Provision of Infrastructure
Decentral Ownership
Data Mesh Adoption
Whats Next
Closing


Taught by

Databricks

Related Courses

Data Lakes for Big Data
EdCast
Distributed Computing with Spark SQL
University of California, Davis via Coursera
Modernizing Data Lakes and Data Warehouses with Google Cloud
Google Cloud via Coursera
Data Engineering with AWS
Udacity
Preparing for Google Cloud Certification: Cloud Data Engineer
Google Cloud via Coursera