YoVDO

One Billion Row Challenge: Optimizing Java for Massive Data Processing

Offered By: Devoxx via YouTube

Tags

Java Courses Data Aggregation Courses Parallelization Courses SIMD Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Dive into the fascinating world of the "One Billion Row Challenge" that captivated the Java community in this 49-minute Devoxx conference talk. Explore the intricacies of aggregating temperature values from a massive 13 GB CSV file containing one billion rows, grouped by weather station names. Discover the innovative techniques employed by the fastest solutions to process this enormous dataset in under two seconds. Learn about advanced optimization strategies, including parallelization, efficient memory access, optimized parsing routines using SIMD and SWAR, and custom map implementations. Gain insights into the personal experiences and valuable lessons learned by the speakers while organizing and participating in this community-driven challenge. Enhance your understanding of high-performance Java programming and data processing techniques through this engaging presentation.

Syllabus

1BRC – Nerd Sniping the Java Community by Roy van Rijn, Gunnar Morling


Taught by

Devoxx

Related Courses

Stanford Seminar - MIPS Open, Wave Computing
Stanford University via YouTube
Loop Analysis and Vectorization in Julia - JuliaCon 2020
The Julia Programming Language via YouTube
Intrinsic Functions and Vector Processing Extensions for SIMD Parallel Operations in C++
javidx9 via YouTube
Intrinsics - Low-Level Engine Development with Burst - Unite Copenhagen
Unity via YouTube
Aggregating Ticks to Manage Scale in Sea of Thieves - Unreal Fest Europe 2019 - Unreal Engine
Unreal Engine via YouTube