Deep Dive into New Features of Apache Spark 3.1
Offered By: Databricks via YouTube
Course Description
Overview
Syllabus
Intro
ANSI SOL Compliance
Fail Earlier for Invalid Data
Forbid Confusing CAST
ANSI Mode GA in Spark 3.2
Unified CREATE TABLE SOL Syntax
CHAR/VARCHAR Support
More ANSI Features Coming in Spark 3.2!
Node Decommissioning
Summary
SOL Performance
Shuffle Hash Join Improvement
Partition Pruning Improvement
Predicate Pushdown Improvement
Reduce Query Compiling Latency (3.2)
Stream-stream Join
State Store for Structured Streaming
Rocks DB State Store
Add the type hints PEP 484 to PySpark!
Static Error Detection
Python Dependency Management
Visualization and Plotting
Usability Enhancements
New Utility Functions for Unix Time
New Utility Functions for Time Zone
EXPLAIN FORMMATTED
Ignore Hints
Documentation and Environments
New Doc for PySpark
Deprecations and Removals
Taught by
Databricks
Related Courses
Coding the Matrix: Linear Algebra through Computer Science ApplicationsBrown University via Coursera كيف تفكر الآلات - مقدمة في تقنيات الحوسبة
King Fahd University of Petroleum and Minerals via Rwaq (رواق) Datascience et Analyse situationnelle : dans les coulisses du Big Data
IONIS via IONIS Data Lakes for Big Data
EdCast 統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco