Is Big Data Performance Reproducible in Modern Cloud Networks?
Offered By: USENIX via YouTube
Course Description
Overview
Explore a 21-minute conference talk from USENIX NSDI '20 that delves into the challenges of reproducing big data performance in modern cloud networks. Examine the impact of network variability on cloud-based big-data workloads through extensive data collection from commercial and research clouds. Discover how quality-of-service mechanisms and service provider policies can exacerbate variability issues. Learn about the significant slowdowns and lack of predictability in big-data workloads, even when using state-of-the-art experimentation techniques. Gain valuable insights into reducing performance volatility and improving experiment repeatability in cloud environments. Understand the importance of considering variability in cloud performance evaluations and the need for running more experiments to achieve reliable results.
Syllabus
Intro
Big data infrastructure in the cloud
How do we assess performance in the cloud ?
Cloud performance is variable
Variability is disconsidered in performance evaluations
Experiment Design (1) - Measuring the Cloud
Main Findings - Modern Cloud Networks
Experiment Design (2) -- Reproducing App Performance
How to run repeatable experiments?
TL;DR: run more experiments
Taught by
USENIX
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX