YoVDO

Embracing Big Data Workloads in Cloud-Native Environments with Data Locality

Offered By: Linux Foundation via YouTube

Tags

Big Data Courses, Cloud Computing Courses, Kubernetes Courses, Distributed Systems Courses, Data Processing Courses, Apache Ozone Courses

Course Description

Overview

Explore data locality in cloud-native environments for big data workloads in this 45-minute conference talk. Learn how Kubernetes schedules workloads based on CPU and memory resources, and discover the challenges of applying this approach to stateful big data workloads. Compare data locality support across mainstream container-attached storage solutions for Kubernetes. Dive into the network topology support provided by Apache Hadoop Ozone and its application as locality-aware container-attached storage via the Ozone CSI plugin. Watch a demonstration that uses Spark on Kubernetes to show the advantages of data locality-aware scheduling with Apache Hadoop Ozone. Gain insights into the evolution of big data, locality concepts in big data processing, Hadoop HDFS locality, and running Apache HDFS and Ozone in Kubernetes environments.

Syllabus

Intro
Outline
Big Data Evolution in ...
Locality in Big Data
Hadoop HDFS Locality
Apache HDFS in Kubernetes
Apache Ozone Overview
Apache Ozone Storage Container
Apache Ozone Topology
Apache Ozone in Kubernetes
Apache Ozone in Kubernetes Road Map


Taught by

Linux Foundation

Related Courses

Advanced Operating Systems
Georgia Institute of Technology via Udacity
High Performance Computing
Georgia Institute of Technology via Udacity
GT - Refresher - Advanced OS
Georgia Institute of Technology via Udacity
Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX
CS125x: Advanced Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX