Refining Systems Data without Losing Fidelity
Offered By: USENIX via YouTube
Course Description
Overview
Syllabus
Intro
Complex systems are hard to manage.
User experiences.
User experiences marbles.
without breaking the bank?
Three strategies for taming the spew.
Reduce. Reuse. Recycle.
Store less data.
Stop writing read-never data.
First, structure your data.
One event per transaction.
Often, trimming isn't enough.
Sample your data.
Statistics to the rescue!
Count 1/N events.
Count traces together.
Don't be afraid of sample rates.
Don't believe me? Ask a data scientist.
Aggregate data.
Aggregation destroys cardinality.
Temporal correlation is weak.
Math on quantiles is misleading.
Aggregation is a last resort.
How can sampling be cheap enough?
Systems scale with load.
Reconcile using the sample rate.
How can we save the relevant events?
Normalize per-key.
Different key, different probability.
Retain errors & slow queries.
Metrics and events can be friends!
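Several syllabus items ("Count 1/N events", "Different key, different probability", "Retain errors & slow queries", "Reconcile using the sample rate") describe one sampling workflow. The Python sketch below illustrates that idea and is not the speaker's implementation. The field names (`endpoint`, `duration_ms`, `error`, `sample_rate`), the `slow_ms` threshold, and the per-key counter are assumptions made for the example. The counter keeps every Nth event so the result is reproducible; a production sampler would more likely keep each event at random with probability 1/N.

```python
from collections import defaultdict


def sample_events(events, rate_for_key, slow_ms=1000):
    """Keep 1 in N events per key, but always keep errors and slow requests.

    Every kept event records the rate it was sampled at, so totals can be
    estimated later from the sample alone.
    """
    seen = defaultdict(int)
    kept = []
    for event in events:
        # Errors and slow requests are rare and interesting: never drop them.
        if event["error"] or event["duration_ms"] >= slow_ms:
            kept.append({**event, "sample_rate": 1})
            continue
        key = event["endpoint"]
        rate = rate_for_key.get(key, 1)  # unknown keys are kept in full
        seen[key] += 1
        if seen[key] % rate == 0:  # deterministic stand-in for random 1/N
            kept.append({**event, "sample_rate": rate})
    return kept


def estimated_count(kept):
    """Reconcile: each kept event stands in for `sample_rate` originals."""
    return sum(event["sample_rate"] for event in kept)


# Hypothetical traffic: one busy endpoint, one quiet one, and a few errors.
events = (
    [{"endpoint": "/home", "duration_ms": 20, "error": False}] * 1000
    + [{"endpoint": "/checkout", "duration_ms": 80, "error": False}] * 10
    + [{"endpoint": "/home", "duration_ms": 30, "error": True}] * 5
)
kept = sample_events(events, {"/home": 100, "/checkout": 1})
```

Here the busy key is sampled heavily while the quiet key and all errors are stored in full, and summing the stored sample rates gives an estimate of the original event count.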
Taught by
USENIX
Related Courses
Données et services numériques, dans le nuage et ailleurs (Certificat informatique et internet via France Université Numerique)
Introduction to Digital Curation (University College London via Independent)
Excel Avanzado (Miríadax)
SAP Business Warehouse powered by SAP HANA (SAP Learning)
Programming Mobile Applications for Android Handheld Systems: Part 2 (University of Maryland, College Park via Coursera)