YoVDO

Modern Data Orchestration: Best Practices and Real-World Use Cases

Offered By: The ASF via YouTube

Tags

Apache Airflow Courses Apache Spark Courses dbt (Data build tool) Courses Data Pipelines Courses Data Lineage Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore advanced techniques and best practices for elevating your data pipeline game in this practical talk. Dive into real-world use cases, examining patterns for data pipelines using Airflow with Spark, DBT, and Polars. Learn strategies to avoid dependencies management in Airflow and reuse DAG templates across your organization. Delve into fundamental concepts of data pipelines, including data lineage, observability, metadata, quality, and auditing, and discover how to integrate these elements effectively. Master the art of writing clean code for data pipelines using the Factory Design Pattern with spark-submit, Airflow, and KubernatesPodOperator. Gain insights into Airflow alternatives like Dagster and Mage for your data architecture. Led by Riccardo Amadio, a Senior Data Engineer at Agile Lab, this 26-minute presentation offers a no-nonsense approach to modern data orchestration.

Syllabus

Modern Data Orchestrators


Taught by

The ASF

Related Courses

Introduction to Airflow in Python
DataCamp
Building Data Engineering Pipelines in Python
DataCamp
The Complete Hands-On Introduction to Apache Airflow
Udemy
Apache Airflow: The Hands-On Guide
Udemy
ETL and Data Pipelines with Shell, Airflow and Kafka
IBM via Coursera