Deep Dive into New Features of Apache Spark 3.1
Offered By: Databricks via YouTube
Course Description
Overview
Syllabus
Intro
ANSI SQL Compliance
Fail Earlier for Invalid Data
Forbid Confusing CAST
ANSI Mode GA in Spark 3.2
Unified CREATE TABLE SQL Syntax
CHAR/VARCHAR Support
More ANSI Features Coming in Spark 3.2!
Node Decommissioning
Summary
SQL Performance
Shuffle Hash Join Improvement
Partition Pruning Improvement
Predicate Pushdown Improvement
Reduce Query Compiling Latency (3.2)
Stream-stream Join
State Store for Structured Streaming
RocksDB State Store
Add Type Hints (PEP 484) to PySpark!
Static Error Detection
Python Dependency Management
Visualization and Plotting
Usability Enhancements
New Utility Functions for Unix Time
New Utility Functions for Time Zone
EXPLAIN FORMATTED
Ignore Hints
Documentation and Environments
New Doc for PySpark
Deprecations and Removals
Taught by
Databricks
Related Courses
Introduction to Databases
Meta via Coursera
Web Development
Udacity
Introduction to Data Science
University of Washington via Coursera
Datenmanagement mit SQL
openHPI
Sabermetrics 101: Introduction to Baseball Analytics
Boston University via edX