YoVDO

MSRL - Distributed Reinforcement Learning with Dataflow Fragments

Offered By: USENIX via YouTube

Tags

USENIX Annual Technical Conference Courses Parallel Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a conference talk from USENIX ATC '23 that introduces MSRL, a novel distributed reinforcement learning (RL) training system. Discover how MSRL utilizes the concept of fragmented dataflow graphs (FDGs) to execute RL algorithms flexibly on GPU clusters. Learn about the challenges in current RL systems and how MSRL addresses them by decoupling algorithm definition from distributed execution strategies. Understand the benefits of FDGs in handling diverse RL algorithms, allowing fragments to execute on different devices through various low-level dataflow implementations. Gain insights into how MSRL's distribution policy enables efficient mapping of fragments to devices without altering the RL algorithm implementation. Examine the experimental results demonstrating MSRL's ability to expose trade-offs between execution strategies while outperforming existing RL systems with fixed strategies.

Syllabus

USENIX ATC '23 - MSRL: Distributed Reinforcement Learning with Dataflow Fragments


Taught by

USENIX

Related Courses

Amazon DynamoDB - A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service
USENIX via YouTube
Faasm - Lightweight Isolation for Efficient Stateful Serverless Computing
USENIX via YouTube
AC-Key - Adaptive Caching for LSM-based Key-Value Stores
USENIX via YouTube
The Future of the Past - Challenges in Archival Storage
USENIX via YouTube
A Decentralized Blockchain with High Throughput and Fast Confirmation
USENIX via YouTube