YoVDO

Flare - Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data

Offered By: Strange Loop Conference via YouTube

Tags

Strange Loop Conference Courses Data Science Courses Apache Spark Courses Data Engineering Courses

Course Description

Overview

Explore a conference talk that delves into Flare, an accelerator module designed to optimize Apache Spark for scale-up architectures and medium-size data processing. Learn how Flare addresses the performance limitations of Spark in scenarios where data can fit into the main memory of a single powerful server. Discover how this innovative approach incorporates code generation techniques inspired by main-memory database systems to deliver significant speedups for a wide range of applications. Gain insights into the challenges of processing medium-size workloads that require high performance, frequent execution on changing data, or integration with external libraries and systems like TensorFlow. Understand the potential benefits of Flare in enhancing Spark's capabilities for users who value its familiar interface and tooling but need improved performance for computationally intensive tasks on scale-up architectures.

Syllabus

"Flare: Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data" by Gregory Essertel


Taught by

Strange Loop Conference

Tags

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera