Managing Millions of Tests Using Databricks - Automated Monitoring and Reporting System
Offered By: Databricks via YouTube
Course Description
Overview
Explore the challenges and solutions of managing millions of daily tests for Databricks Runtime in this 25-minute conference talk. Dive into the automated test monitoring and reporting system built using Databricks, learning how to ingest data from various sources like CI systems and Bazel build metadata into Delta. Discover techniques for analyzing test results, reporting failures to owners through Jira, and creating effective quality tracking reports. Gain insights into the deep technical stack, wide surface area, and guiding principles behind Databricks' testing approach. Learn about establishing test results and owners tables, building data pipelines, and implementing developer-friendly failure reporting. Understand how to connect problems with the right owners and use appropriate tools to solve complex testing challenges in large-scale data engineering and machine learning environments.
Syllabus
Intro
Deep technical stack
Wide surface area
Testing, testing, testing
Guiding principles
What is the actual problem?
Building data pipelines
Use the right tools for solving the problem
Establishing test results tables
Establishing test owners table
Reporting test failures to Jira
Test reporting pipeline
Connecting the problem with the right owner
Developer-friendly failure reporting
Taught by
Databricks
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en EspañolGoogle Cloud via Coursera Data Analysis with Python
IBM via Coursera Intro to TensorFlow 日本語版
Google Cloud via Coursera TensorFlow on Google Cloud - Français
Google Cloud via Coursera Freedom of Data with SAP Data Hub
SAP Learning