One Billion Row Challenge: Optimizing Java for Massive Data Processing
Offered By: Devoxx via YouTube
Course Description
Overview
Dive into the fascinating world of the "One Billion Row Challenge" that captivated the Java community in this 49-minute Devoxx conference talk. Explore the intricacies of aggregating temperature values from a massive 13 GB CSV file containing one billion rows, grouped by weather station names. Discover the innovative techniques employed by the fastest solutions to process this enormous dataset in under two seconds. Learn about advanced optimization strategies, including parallelization, efficient memory access, optimized parsing routines using SIMD and SWAR, and custom map implementations. Gain insights into the personal experiences and valuable lessons learned by the speakers while organizing and participating in this community-driven challenge. Enhance your understanding of high-performance Java programming and data processing techniques through this engaging presentation.
Syllabus
1BRC – Nerd Sniping the Java Community by Roy van Rijn, Gunnar Morling
Taught by
Devoxx
Related Courses
Algorithms, Part IPrinceton University via Coursera Introduction à la programmation orientée objet (en Java)
École Polytechnique Fédérale de Lausanne via Coursera MongoDB for Java Developers
MongoDB University Initiation à la programmation (en Java)
École Polytechnique Fédérale de Lausanne via Coursera Intro to Java Programming
San Jose State University via Udacity