YoVDO

Is Big Data Performance Reproducible in Modern Cloud Networks?

Offered By: USENIX via YouTube

Tags

USENIX Symposium on Networked Systems Design and Implementation (NSDI) Courses Big Data Courses Data Analysis Courses Cloud Computing Courses

Course Description

Overview

Explore a 21-minute conference talk from USENIX NSDI '20 that delves into the challenges of reproducing big data performance in modern cloud networks. Examine the impact of network variability on cloud-based big-data workloads through extensive data collection from commercial and research clouds. Discover how quality-of-service mechanisms and service provider policies can exacerbate variability issues. Learn about the significant slowdowns and lack of predictability in big-data workloads, even when using state-of-the-art experimentation techniques. Gain valuable insights into reducing performance volatility and improving experiment repeatability in cloud environments. Understand the importance of considering variability in cloud performance evaluations and the need for running more experiments to achieve reliable results.

Syllabus

Intro
Big data infrastructure in the cloud
How do we assess performance in the cloud ?
Cloud performance is variable
Variability is disconsidered in performance evaluations
Experiment Design (1) - Measuring the Cloud
Main Findings - Modern Cloud Networks
Experiment Design (2) -- Reproducing App Performance
How to run repeatable experiments?
TL;DR: run more experiments


Taught by

USENIX

Related Courses

Social Network Analysis
University of Michigan via Coursera
Intro to Algorithms
Udacity
Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX