YoVDO

Accelerating Spark Workloads in a Mesos Environment with Alluxio

Offered By: Linux Foundation via YouTube

Tags

Apache Spark Courses Big Data Courses Data Analysis Courses Cloud Computing Courses Data Processing Courses ETL Courses Alluxio Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how to optimize Apache Spark workloads in a Mesos environment using Alluxio in this informative conference talk. Learn about the challenges of processing data stored in disparate cloud and on-premise storage systems, and discover how Alluxio, a memory-speed virtual distributed storage system, can be deployed on Mesos to create a unified namespace for connecting compute frameworks to various storage systems. Understand the architecture of Mesos, Spark, and Alluxio, and how their integration can achieve optimal performance for enterprises by enabling memory-speed data access, eliminating ETL and data duplication, and facilitating new workloads across all data sources. Gain insights from Gene Pang, a PMC and maintainer of the Alluxio open source project, as he shares his expertise on creating an efficient architecture for large-scale data processing and analysis.

Syllabus

Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.


Taught by

Linux Foundation

Tags

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera