YoVDO

Code Once, Use Often - Declarative Data Pipelines

Offered By: Databricks via YouTube

Tags

Data Pipelines Courses Data Engineering Courses Code Reusability Courses ETL Courses Declarative Programming Courses

Course Description

Overview

Explore declarative data pipelines in this 28-minute conference talk featuring Anthony Awuley and Carter Kilgour from Flashfood. Learn about food waste reduction efforts, Flashfood's data challenges, and the evolution of their data pipeline solutions. Discover the benefits of declarative approaches, including code reusability and automation. Gain insights into tools like airflow-declarative, SyncTable, and custom operators for extract and transform processes. Understand key lessons learned, future challenges, and the potential of Spark YAML. Conclude with a discussion on Keillor's Principles and the importance of feedback in data engineering.

Syllabus

Intro
Agenda
Food Waste
Flashfood Data
Problem Definition
Attempt 1
The Declarative Data Pipeline
Attempt 3
The right amount of automation
Why configs?
airflow-declarative
SyncTablelob
Custom Operator
Extract
Transform
Summary
Lessons Learned
Challenges ahead
Spark YAML
Keillor's Principles
Feedback


Taught by

Databricks

Related Courses

Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Data Analysis with Python
IBM via Coursera
Intro to TensorFlow 日本語版
Google Cloud via Coursera
TensorFlow on Google Cloud - Français
Google Cloud via Coursera
Freedom of Data with SAP Data Hub
SAP Learning