Turbocharging Spark with Samsung SmartSSD Computational Storage Devices
Offered By: Databricks via YouTube
Course Description
Overview
Discover how to significantly boost Apache Spark performance using Samsung SmartSSDĀ® Computational Storage Devices (CSDs) powered by Xilinx FPGAs in this 30-minute video presentation by Databricks. Learn about the innovative approach developed by Samsung, Xilinx, and Bigstream to seamlessly accelerate the Spark platform, resulting in impressive 2x-8x performance gains without requiring any user code changes. Explore the technical aspects of Bigstream middleware's acceleration method, examine performance results using the TPC-DS benchmark suite, and understand the potential Total Cost of Ownership (TCO) savings for Spark users. Gain insights into the industry surrounding computational storage, SmartSSD use cases, acceleration techniques, and limitations of Spark. Delve into the inner workings of SmartSSDs, their acceleration capabilities, and the partnership ecosystem supporting this technology.
Syllabus
Introduction
Industry around Computational Storage
SmartSSD Use Cases
Acceleration
Limitations of Spark
SmartSSD
Results
How it works
BigStream
SmartSSDs
SmartSSD acceleration
SmartSSD partners
Technical spec
Taught by
Databricks
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera