Building a Lakehouse Architecture for Business Data Empowerment
Offered By: Databricks via YouTube
Course Description
Overview
Explore a 36-minute conference talk on building a data platform using the LakeHouse Architecture. Learn how Wehkamp created a uniform system to provide reliable, timely, and GDPR-compliant data access across their company. Discover the three-level data curation approach - bronze, silver, and gold - and how it enables data democratization while maintaining privacy. Gain insights into the use of open-source technologies, pseudonymization of PII fields, and the development of a custom library for efficient data source ingestion. Understand the implementation of key components such as ACID transactions, Structured Stream processing, Slack alerting, data quality checks, and CI/CD. Hear about the platform's positive impact on various teams and its role in modernizing Wehkamp's data infrastructure.
Syllabus
Intro
Who are we
Agenda
Where did we start
Requirements
Vint Ingest
Alerting
Our Journey
Whats Next
Questions
Taught by
Databricks
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera