Testing Data Pipelines - Techniques for Ensuring Data Flow Integrity
Offered By: PyCon US via YouTube
Course Description
Overview
Explore effective strategies for testing data pipelines in this informative PyCon US talk. Learn how to ensure smooth data flow and quickly identify and resolve issues in your pipelines. Discover toolkit-agnostic techniques applicable beyond Airflow, including unit testing for individual components, integration testing for the entire pipeline, and end-to-end testing for accurate data output. Gain insights into unique methods such as data snapshot testing and online and offline data quality checks. Apply software application testing principles to data pipeline development and maintenance. Access the presentation slides for a comprehensive overview of the concepts discussed in this 25-minute talk.
Syllabus
Talks - Amitosh Swain: Testing Data Pipelines
Taught by
PyCon US
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en EspañolGoogle Cloud via Coursera Data Analysis with Python
IBM via Coursera Intro to TensorFlow 日本語版
Google Cloud via Coursera TensorFlow on Google Cloud - Français
Google Cloud via Coursera Freedom of Data with SAP Data Hub
SAP Learning