ACCL+: FPGA-Based Collective Engine for Distributed Applications
Offered By: USENIX via YouTube
Course Description
Overview
Explore a 15-minute conference talk from USENIX OSDI '24 that introduces ACCL+, an innovative FPGA-based collective communication library. Discover how this open-source solution enhances the development of distributed applications using FPGAs in cloud deployments. Learn about ACCL+'s portability across platforms, support for various networking protocols, and its ability to enable direct FPGA-to-FPGA collective communication. Understand its dual functionality as a collective offload engine for CPU applications and a component in FPGA-based distributed systems. Examine the performance evaluation results comparing ACCL+ to software MPI over RDMA on an FPGA cluster with 100 Gb/s networking. Gain insights into practical use cases, including CPU-based vector-matrix multiplication and FPGA-based distributed deep-learning recommendation inference.
Syllabus
OSDI '24 - ACCL+: an FPGA-Based Collective Engine for Distributed Applications
Taught by
USENIX
Related Courses
Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Reliable Distributed Algorithms - Part 1
KTH Royal Institute of Technology via edX
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera
Réalisez des calculs distribués sur des données massives
CentraleSupélec via OpenClassrooms