Finding the Needle in the Haystack - Predicting Storage Device Failures in Data Centers
Offered By: USENIX via YouTube
Course Description
Overview
Explore a practical approach to predicting storage device failures in data centers through this 31-minute conference talk from SREcon23 Asia/Pacific. Delve into the challenges faced by Site Reliability Engineers in managing and monitoring vast numbers of storage devices, and learn about a multi-phase proactive sampling-based system designed to address these issues. Witness a live demonstration of the system implemented in a multi-tiered cloud storage pool, and gain insights into innovative techniques for improving accuracy, performance, and cost-effectiveness in failure prediction. Discover how this research can be applied to solve real-world challenges in data center management and storage device reliability.
Syllabus
SREcon23 Asia/Pacific - Finding the Needle in the Haystack: Predicting Storage Device Failures in...
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes Clusters
USENIX via YouTube
SRE and ML - Why It Matters
USENIX via YouTube
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube
Tracing Bare Metal with OpenTelemetry
USENIX via YouTube
Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube