Jupyter as Incident Response Tool
Offered By: USENIX via YouTube
Course Description
Overview
Explore how Jupyter can be utilized as an effective incident response tool in this 20-minute conference talk from SREcon20 Americas. Learn to leverage Jupyter's dynamic exploration capabilities and result-sharing features for Site Reliability Engineering. Follow along as the speaker demonstrates triaging and remediating a simulated cache slowdown incident affecting site performance. Discover post-incident best practices for proper documentation and preparation for incident retrospectives. Gain insights into Jupyter notebooks, kernels, data analysis techniques, and practical applications of tools like Boto3 for efficient problem-solving in SRE contexts.
Syllabus
Intro
Acknowledgement of Country
Jupyter Notebooks
Jupyter Kernels
Data Science Origins
Imaginary Stack
Symptom
Spoiler
Query Data
Analyze Data
Boto3
We Know The Culprit
Connect to Host
Confirm the Problem
Fix the Problem
Post-Incident
Final Thoughts
Taught by
USENIX
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX