YoVDO

Accelerating Spark Workloads in a Mesos Environment with Alluxio

Offered By: Linux Foundation via YouTube

Tags

Apache Spark Courses, Big Data Courses, Data Analysis Courses, Cloud Computing Courses, Data Processing Courses, ETL Courses, Alluxio Courses

Course Description

Overview

Explore how to optimize Apache Spark workloads in a Mesos environment using Alluxio in this conference talk. Learn about the challenges of processing data spread across disparate cloud and on-premises storage systems, and see how Alluxio, a memory-speed virtual distributed storage system, can be deployed on Mesos to provide a unified namespace that connects compute frameworks to those storage systems. Understand the architecture of Mesos, Spark, and Alluxio, and how integrating them can improve performance for enterprises by enabling memory-speed data access, eliminating ETL and data duplication, and supporting new workloads across all data sources. Gene Pang, a PMC member and maintainer of the Alluxio open source project, shares his experience designing an efficient architecture for large-scale data processing and analysis.

Syllabus

Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.


Taught by

Linux Foundation

Related Courses

Building Batch Data Pipelines on GCP auf Deutsch
Google Cloud via Coursera
Building Batch Data Pipelines on GCP en Français
Google Cloud via Coursera
Mastering Azure Data Factory: From Basics to Advanced Level
Udemy
Data Science de A a Z - Extração e Exibição dos Dados
Udemy
Building Batch Data Processing Solutions in Microsoft Azure
Pluralsight