YoVDO

μSlope - High Compression and Fast Search on Semi-Structured Logs

Offered By: USENIX via YouTube

Tags

Log Management Courses Big Data Courses Data Storage Courses JSON Courses

Course Description

Overview

This 16-minute conference talk from OSDI '24 presents μSlope, a system for compressing and searching semi-structured log data. Internet-scale services produce massive amounts of such logs; companies like Uber generate over 10PB of log data daily. μSlope losslessly compresses semi-structured formats like JSON while enabling fast search without full decompression. The talk covers the techniques it uses: concisely representing schema structures, "structurizing" semi-structured data, and grouping records with similar schemas into well-structured tables. The compression ratios achieved range from 21.9:1 to 186.8:1, surpassing existing semi-structured data management systems and outperforming Zstandard, and search is up to 5.77 times faster than other solutions.
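To make the idea of grouping records with similar schemas into tables more concrete, here is a minimal Python sketch. It is not μSlope's implementation, and the function names (`schema_of`, `flatten`, `group_by_schema`) are hypothetical. It assumes a record's schema can be summarized as its sorted set of leaf paths and value types, treats arrays as opaque leaf values, and assumes keys contain no dots.

```python
import json


def schema_of(value, prefix=""):
    """Return a sorted tuple of (path, type name) pairs for the leaves of a JSON value."""
    if isinstance(value, dict):
        leaves = []
        for key in sorted(value):
            path = f"{prefix}.{key}" if prefix else key
            leaves.extend(schema_of(value[key], path))
        return tuple(leaves)
    # Non-object values (including lists) are treated as leaves in this sketch.
    return ((prefix, type(value).__name__),)


def flatten(value, prefix=""):
    """Map each leaf path of a JSON value to the value stored there."""
    if isinstance(value, dict):
        out = {}
        for key, child in value.items():
            path = f"{prefix}.{key}" if prefix else key
            out.update(flatten(child, path))
        return out
    return {prefix: value}


def group_by_schema(lines):
    """Group JSON log lines into one column-oriented table per distinct schema."""
    tables = {}
    for line in lines:
        record = json.loads(line)
        schema = schema_of(record)
        # One list per column; records with the same schema share a table.
        columns = tables.setdefault(schema, {path: [] for path, _ in schema})
        for path, leaf in flatten(record).items():
            columns[path].append(leaf)
    return tables


if __name__ == "__main__":
    logs = [
        '{"ts": 1, "level": "info", "msg": "a"}',
        '{"ts": 2, "level": "warn", "msg": "b"}',
        '{"ts": 3, "user": {"id": 7}}',
    ]
    for schema, columns in group_by_schema(logs).items():
        print(schema, columns)
```

Once records sharing a schema sit in the same table, each column holds values of one type, which is the kind of layout that column-wise compression and column-restricted search can take advantage of.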

Syllabus

OSDI '24 - μSlope: High Compression and Fast Search on Semi-Structured Logs


Taught by

USENIX

Related Courses

Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera
Big Data for Better Performance
Open2Study
Big Data and Education
Columbia University via edX
Big Data Analytics in Healthcare
Georgia Institute of Technology via Udacity
Data Mining with Weka
University of Waikato via Independent