Embracing Big Data Workloads in Cloud-Native Environments with Data Locality
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore data locality in cloud-native environments for big data workloads in this 45-minute conference talk. Learn how Kubernetes schedules workloads based on CPU and memory resources, and discover the challenges of applying this approach to stateful big data workloads. Compare data locality support across mainstream container-attached storage solutions for Kubernetes. Dive into the network topology support provided by Apache Hadoop Ozone and its use as locality-aware container-attached storage via the Ozone CSI plugin. Watch a demonstration that uses Spark on Kubernetes to show the advantages of data-locality-aware scheduling with Apache Hadoop Ozone. Gain insights into the evolution of big data, locality concepts in big data processing, Hadoop HDFS locality, and the implementation of Apache HDFS and Ozone in Kubernetes environments.
Syllabus
Intro
Outline
Big Data Evolution in ...
Locality in Big Data
Hadoop HDFS Locality
Apache HDFS in Kubernetes
Apache Ozone Overview
Apache Ozone Storage Container
Apache Ozone Topology
Apache Ozone in Kubernetes
Apache Ozone in Kubernetes Road Map
Taught by
Linux Foundation
Related Courses
HDFS CSI Plugin: Speeding Up Kubernetes in On-Premises Big Data Clusters
Linux Foundation via YouTube
Best Practice for Cloud Native Storage in Finance Industry
CNCF [Cloud Native Computing Foundation] via YouTube
Elevating Scalable Object Storage: A Comprehensive Exploration of Ozone's Trailblazing Capabilities
The ASF via YouTube
Apache Ratis - A High Performance Raft Library
The ASF via YouTube
Iceberg Replication - Enterprise-Grade Solution for Apache Iceberg Tables
The ASF via YouTube