Embracing Big Data Workloads in Cloud-Native Environments with Data Locality
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore data locality in cloud-native environments for big data workloads in this 45-minute conference talk. Learn how Kubernetes schedules workloads based on CPU and memory resources, and discover the challenges of applying this approach to stateful big data workloads. Compare data locality support across mainstream container-attached storage solutions for Kubernetes. Dive into the network topology support provided by Apache Hadoop Ozone and its application as a locality-aware container-attached storage via Ozone CSI plugin. Witness a demonstration using Spark on Kubernetes to showcase the advantages of data locality-aware scheduling with Apache Hadoop Ozone. Gain insights into the evolution of big data, locality concepts in big data processing, Hadoop HDFS locality, and the implementation of Apache HDFS and Ozone in Kubernetes environments.
Syllabus
Intro
Outline
Big Data Evolution in ...
Locality in Big Data
Hadoop HDFS Locality
Apache HDFS in Kubernetes
Apache Ozone Overview
Apache Ozone Storage Container
Apache Ozone Topology
Apache Ozone in Kubernetes
Apache Ozone in Rubernetes Road Map
Taught by
Linux Foundation
Tags
Related Courses
Advanced Operating SystemsGeorgia Institute of Technology via Udacity High Performance Computing
Georgia Institute of Technology via Udacity GT - Refresher - Advanced OS
Georgia Institute of Technology via Udacity Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX CS125x: Advanced Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX