YoVDO

The Making of an Exabyte-scale Data Lakehouse Using Apache Ozone

Offered By: The ASF via YouTube

Tags

Hadoop Courses
Apache Spark Courses
Apache NiFi Courses
Apache Hive Courses
Apache Ozone Courses
Apache Iceberg Courses

Course Description

Overview

Explore the creation of an exabyte-scale Data Lakehouse in this 41-minute conference talk from The ASF. Discover how the Apache Ozone object store scales to exabytes of data, supporting high-performance queries while reducing costs and carbon footprint. Learn how the Ozone community collaborates with key stakeholders, including the Hive, Impala, Spark, NiFi, and Iceberg communities, to ensure optimal integration, and about recent integration work aimed at providing a cohesive Data Lakehouse experience on the Ozone platform. Speakers Saketa Chalamchala, a Sr. Software Engineer at Cloudera, and Siddharth Wagle cover an Ozone overview, Hadoop architecture, Ozone building blocks, metadata duality, Lakehouse sizing, and data migration, and conclude with a demonstration of the technology in practice.

Syllabus

Introduction
Agenda
Requirements
Ozone
Ozone overview
Ozone differentiators
Hadoop Architecture
High-level component architecture
Ozone building blocks
What does Ozone do
Metadata
Duality
Lakehouse sizing
Data migration
Demo


Taught by

The ASF

Related Courses

Amazon EMR Getting Started (Indonesian)
Amazon Web Services via AWS Skill Builder
Analisar e preparar dados com o Amazon SageMaker Data Wrangler e o Amazon EMR (Português (Brasil)) | Lab - Analyze and Prepare Data with Amazon SageMaker Data Wrangler and Amazon EMR (Portuguese (Brazil))
Amazon Web Services via AWS Skill Builder
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Managing Big Data in Clusters and Cloud Storage
Cloudera via Coursera
Analyzing Big Data with SQL
Cloudera via Coursera