Hyrax - Fail-in-Place Server Operation in Cloud Platforms
Offered By: USENIX via YouTube
Course Description
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a groundbreaking approach to handling server hardware failures in cloud platforms through this 15-minute conference talk from OSDI '23. Delve into Hyrax, an innovative datacenter stack that allows compute servers with failed components to continue hosting VMs while masking degraded capacity and performance. Discover how this fail-in-place operation model addresses the unsustainability of traditional all-or-nothing approaches and aligns with emerging technology trends. Learn about the novel model for changes in memory interleaving when deactivating faulty memory modules, a key enabler of Hyrax. Examine experimental results from cloud production servers demonstrating Hyrax's ability to overcome common hardware failures without impacting peak VM performance. Analyze large-scale simulation findings using production traces, revealing how Hyrax reduces server repair requirements by 50-60% while maintaining VM scheduling efficiency.
Syllabus
OSDI '23 - Hyrax: Fail-in-Place Server Operation in Cloud Platforms
Taught by
USENIX
Related Courses
Accessing your AWS EC2 serversCoursera Project Network via Coursera LXC/LXD Deep Dive
A Cloud Guru Nagios Certified Professional Prep Course
A Cloud Guru Building a Media Sharing Website - Part 1: Media Upload (Japanese)
Amazon Web Services via AWS Skill Builder AWS Cloud Practitioner
City College of San Francisco via California Community Colleges System