Scala and the JVM as a Big Data Platform - Lessons from Apache Spark
Offered By: Scala Days Conferences via YouTube
Course Description
Overview
Explore the advantages and challenges of using Scala and the JVM for Big Data applications, with a focus on Apache Spark. Learn how Scala's features, such as its pragmatic balance of object-oriented and functional programming, interpreter mode, rich Collections library, and pattern matching, contribute to its effectiveness in Big Data processing. Discover the strengths of the JVM as a scalable computing platform and understand its limitations in high-performance data crunching. Examine the Tungsten project's efforts to optimize Spark's performance through custom data layouts, manual memory management, and code generation. Gain insights into the future improvements that could enhance both Scala and the JVM for Big Data applications.
Syllabus
Scala and the JVM as a Big Data Platform Lessons from Apache Spark by Dean Wampler
Taught by
Scala Days Conferences
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera