Accelerating Spark Workloads in an Apache Mesos Environment with Alluxio
Offered By: Linux Foundation via YouTube
Course Description
Overview
Learn how to optimize Apache Spark workloads in an Apache Mesos environment using Alluxio in this conference talk. Discover the challenges of processing data stored across multiple cloud and on-premises storage systems, and explore how Alluxio, an open-source, memory-speed virtual distributed storage system, can address them. Understand the architectures of Mesos, Spark, and Alluxio, and how integrating them can create an optimal enterprise architecture. Gain insights into eliminating ETL pain points, reducing data duplication, and enabling new workloads across all data sources. Explore the benefits of Alluxio's unified namespace and its ability to connect compute frameworks such as Apache Spark to multiple storage systems while providing memory-speed data access.
Syllabus
Accelerating Spark Workloads in an Apache Mesos Environment with Alluxio
Taught by
Linux Foundation
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data Engineering (University of California, Berkeley via edX)
Big Data Analytics (University of Adelaide via edX)
Big Data Essentials: HDFS, MapReduce and Spark RDD (Yandex via Coursera)
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames (Yandex via Coursera)
Introduction to Apache Spark and AWS (University of London International Programmes via Coursera)