Accelerating Spark Workloads in a Mesos Environment with Alluxio
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore how to optimize Apache Spark workloads in a Mesos environment using Alluxio in this conference talk. Learn about the challenges of processing data stored across disparate cloud and on-premises storage systems, and discover how Alluxio, a memory-speed virtual distributed storage system, can be deployed on Mesos to create a unified namespace that connects compute frameworks to various storage systems. Understand the architecture of Mesos, Spark, and Alluxio, and how integrating them can improve performance for enterprises by enabling memory-speed data access, eliminating ETL and data duplication, and supporting new workloads across all data sources. Gain insights from Gene Pang, a PMC member and maintainer of the Alluxio open source project, as he shares his expertise on building an efficient architecture for large-scale data processing and analysis.
Syllabus
Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.
Taught by
Linux Foundation
Related Courses
Building Batch Data Pipelines on GCP auf Deutsch
Google Cloud via Coursera
Building Batch Data Pipelines on GCP en Français
Google Cloud via Coursera
Mastering Azure Data Factory: From Basics to Advanced Level
Udemy
Data Science de A a Z - Extraçao e Exibição dos Dados
Udemy
Building Batch Data Processing Solutions in Microsoft Azure
Pluralsight