Tracing Bare Metal with OpenTelemetry
Offered By: USENIX via YouTube
Course Description
Overview
Explore how Equinix Metal implemented OpenTelemetry tracing for bare metal provisioning in this 32-minute conference talk from SREcon22 Americas. Learn from Principal Engineer Amy Tobey and SRE Shelby Spees as they share their journey from frequent, long-lasting incidents to improved reliability through distributed tracing. Discover how the team created on-ramps for engineers across the organization to instrument their own code, facilitating knowledge transfer and empowering both veterans and newcomers to debug issues more efficiently. Gain insights into system issues identified through tracing and major reliability wins achieved. Understand the challenges and solutions in implementing OpenTelemetry across a globally-distributed, multidisciplinary team managing two dozen software services deployed on 70+ Kubernetes clusters across six continents.
Syllabus
Introduction
Turning on OpenTelemetry
What is the problem
Setting up OpenTelemetry
Testing
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes ClustersUSENIX via YouTube SRE and ML - Why It Matters
USENIX via YouTube Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube DO, RE, Me - Measuring the Effectiveness of Site Reliability Engineering
USENIX via YouTube