SHEPHERD - Serving DNNs in the Wild
Offered By: USENIX via YouTube
Course Description
Overview
Explore a groundbreaking model serving system called SHEPHERD in this 15-minute conference talk from NSDI '23. Discover how SHEPHERD tackles the challenges of scalability, high system goodput, and maximum resource utilization across compute units for inference requests in interactive web services. Learn about its innovative two-level design that separates planning and serving modules, leveraging request stream aggregation for improved predictability and resource utilization. Understand the novel online algorithm employed by SHEPHERD for guaranteed goodput under unpredictable workloads, utilizing preemptions and model-specific batching properties. Gain insights into the system's performance, which achieves up to 18.1X higher goodput and 1.8X better utilization compared to prior state-of-the-art solutions, while scaling to hundreds of workers.
Syllabus
NSDI '23 - SHEPHERD: Serving DNNs in the Wild
Taught by
USENIX
Related Courses
Financial Sustainability: The Numbers side of Social Enterprise+Acumen via NovoEd Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera Developing Repeatable ModelsĀ® to Scale Your Impact
+Acumen via Independent Managing Microsoft Windows Server Active Directory Domain Services
Microsoft via edX Introduction aux conteneurs
Microsoft Virtual Academy via OpenClassrooms