Accelerating Spark Workloads in a Mesos Environment with Alluxio
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore how to optimize Apache Spark workloads in a Mesos environment using Alluxio in this informative conference talk. Learn about the challenges of processing data stored in disparate cloud and on-premise storage systems, and discover how Alluxio, a memory-speed virtual distributed storage system, can be deployed on Mesos to create a unified namespace for connecting compute frameworks to various storage systems. Understand the architecture of Mesos, Spark, and Alluxio, and how their integration can achieve optimal performance for enterprises by enabling memory-speed data access, eliminating ETL and data duplication, and facilitating new workloads across all data sources. Gain insights from Gene Pang, a PMC and maintainer of the Alluxio open source project, as he shares his expertise on creating an efficient architecture for large-scale data processing and analysis.
Syllabus
Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.
Taught by
Linux Foundation
Tags
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera