State Reader API: The New "Statestore" Data Source for Structured Streaming
Offered By: Databricks via YouTube
Course Description
Overview
Explore the new State Reader API, a capability introduced by Databricks for accessing and analyzing Structured Streaming's internal state data. Learn how this API differs from traditional Spark data formats and how it supports developing, debugging, and troubleshooting stateful Structured Streaming workloads. Dive into stateful operator basics, understand common challenges with state data, and discover how the State Reader API, set to be included in Apache Spark™ 4.0.0, addresses these issues. This 16-minute talk, presented by Craig Lukasik, Sr. SSA at Databricks, is aimed at data engineers and developers working with Apache Spark and Structured Streaming.
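As a rough illustration of what reading state looks like, here is a minimal sketch that is not taken from the talk. It assumes the "statestore" and "state-metadata" format names and the operatorId, batchId, and storeName option keys as described in the Apache Spark state data source documentation. The helper names (state_reader_options, read_state, read_state_metadata) are hypothetical, and the SparkSession is passed in by the caller.

```python
from typing import Any, Dict, Optional


def state_reader_options(
    operator_id: Optional[int] = None,
    batch_id: Optional[int] = None,
    store_name: Optional[str] = None,
) -> Dict[str, str]:
    """Build reader options, leaving out anything the caller did not set.

    The option keys are assumed from the Spark state data source docs.
    """
    candidates = {
        "operatorId": operator_id,
        "batchId": batch_id,
        "storeName": store_name,
    }
    return {key: str(value) for key, value in candidates.items() if value is not None}


def read_state(spark: Any, checkpoint_location: str, **selectors: Any) -> Any:
    """Load the key/value state rows of a streaming query's checkpoint."""
    reader = spark.read.format("statestore")
    for key, value in state_reader_options(**selectors).items():
        reader = reader.option(key, value)
    return reader.load(checkpoint_location)


def read_state_metadata(spark: Any, checkpoint_location: str) -> Any:
    """Load the metadata view that lists the stateful operators in a checkpoint."""
    return spark.read.format("state-metadata").load(checkpoint_location)
```

With a real SparkSession and the checkpoint directory of an existing stateful query, a call such as read_state(spark, checkpoint_path, operator_id=0) would return a DataFrame of that operator's state. The metadata helper is a reasonable first step for finding out which operator IDs exist.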
Syllabus
State Reader API: the New "Statestore" Data Source
Taught by
Databricks
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera