Flare - Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data
Offered By: Strange Loop Conference via YouTube
Course Description
Overview
Explore a conference talk that delves into Flare, an accelerator module designed to optimize Apache Spark for scale-up architectures and medium-size data processing. Learn how Flare addresses the performance limitations of Spark in scenarios where data can fit into the main memory of a single powerful server. Discover how this innovative approach incorporates code generation techniques inspired by main-memory database systems to deliver significant speedups for a wide range of applications. Gain insights into the challenges of processing medium-size workloads that require high performance, frequent execution on changing data, or integration with external libraries and systems like TensorFlow. Understand the potential benefits of Flare in enhancing Spark's capabilities for users who value its familiar interface and tooling but need improved performance for computationally intensive tasks on scale-up architectures.
Syllabus
"Flare: Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data" by Gregory Essertel
Taught by
Strange Loop Conference
Tags
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera