YoVDO

Building an Open Data Lakehouse Using Apache Spark, Apache Iceberg and Dremio

Offered By: SQLBits via YouTube

Tags

Apache Spark Courses, Big Data Courses, SQL Courses, Data Governance Courses, Data Engineering Courses, Data Security Courses, Open Source Courses, Apache Iceberg Courses

Course Description

Overview

Explore the evolution of the big data landscape and the concept of data lakehouses in this 45-minute SQLBits conference talk. Delve into the development of Open Data Lakehouses, focusing on their implementation, security, governance, and enhancement using Dremio. Learn about key technologies such as Apache Spark and Apache Iceberg, and their roles in building modern data architectures. Gain insights from speaker Mike Flower on topics including SQL 2019, data lake management, database engines, big data analytics, optimization techniques, and platform-agnostic approaches. Discover how these concepts apply to big data and data engineering, data security and governance, and developer-oriented practices in the evolving data landscape.

Syllabus

Building an Open Data Lakehouse using Apache Spark, Apache Iceberg and Dremio


Taught by

SQLBits

Related Courses

Building Modern Data Streaming Apps with Open Source
Linux Foundation via YouTube
How to Stabilize a GenAI-First Modern Data LakeHouse - Provisioning 20,000 Ephemeral Data Lakes per Year
CNCF [Cloud Native Computing Foundation] via YouTube
Data Storage and Queries
DeepLearning.AI via Coursera
Delivering Portability to Open Data Lakes with Delta Lake UniForm
Databricks via YouTube
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube