Data Pipelines with Celery: Modular, Signal-Driven and Manageable

Offered By: EuroPython Conference via YouTube

Tags

Celery Courses, Python Courses, Redis Courses, Distributed Systems Courses, Fault Tolerance Courses, Batch Processing Courses, Modular Programming Courses, Data Pipelines Courses

Course Description

Overview

This 25-minute conference talk from EuroPython 2024 presents an approach to designing and implementing data pipelines with Celery, Redis, and signal-based triggering. It shows how splitting a pipeline into smaller components improves fault tolerance and modularity and simplifies testing and debugging. Redis serves as the data store, and Celery's signals are used to build self-triggering pipelines that process data in batches while staying within API rate limits and system resource constraints. The talk compares this approach with traditional periodic tasks and explains how it can increase data throughput and completeness. It also covers secondary benefits, such as result persistence and reporting, that support data analysis and optimization in budget-sensitive environments.
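The self-triggering idea in the overview can be illustrated with a small sketch. This is not code from the talk and it does not use Celery or Redis: the `Signal` class is a hypothetical stand-in for a Celery signal, the `deque` objects stand in for a Redis-backed data store and the broker queue, and `batch_size` stands in for a limit derived from an API rate limit. It only shows the control flow, in which finishing one batch emits a signal whose receiver queues the next batch, rather than a timer starting work periodically.

```python
from collections import deque


class Signal:
    """Hypothetical stand-in for a signal object: receivers connect, senders send."""

    def __init__(self):
        self._receivers = []

    def connect(self, receiver):
        self._receivers.append(receiver)
        return receiver

    def send(self, **kwargs):
        for receiver in self._receivers:
            receiver(**kwargs)


def run_pipeline(item_ids, batch_size):
    """Process all items in batches; each finished batch triggers the next one."""
    pending = deque(item_ids)   # assumption: stands in for a Redis list of work items
    results = {}                # assumption: stands in for persisted results
    work_queue = deque()        # assumption: stands in for the task broker queue
    batch_done = Signal()

    def process_batch():
        # Take at most batch_size items so one run stays within the assumed limit.
        count = min(batch_size, len(pending))
        batch = [pending.popleft() for _ in range(count)]
        for item in batch:
            results[item] = item * item  # placeholder for the real API call
        batch_done.send(processed=len(batch))

    def schedule_next(processed, **_):
        # Self-triggering step: queue another batch only while work remains.
        if pending:
            work_queue.append(process_batch)

    batch_done.connect(schedule_next)

    work_queue.append(process_batch)  # the single initial trigger
    batches = 0
    while work_queue:                 # plays the role of a worker loop
        task = work_queue.popleft()
        task()
        batches += 1
    return results, batches
```

Calling `run_pipeline(range(1, 11), batch_size=4)` drains the ten items in three batches and then stops on its own, because the receiver queues nothing once the pending list is empty. In a real deployment the talk's approach would rely on Celery's own signals and a Redis store; the names above are illustrative only.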

Syllabus

Data pipelines with Celery: modular, signal-driven and manageable — Marin Aglić Čuvić


Taught by

EuroPython Conference

Related Courses

Advanced Operating Systems (Georgia Institute of Technology via Udacity)
High Performance Computing (Georgia Institute of Technology via Udacity)
GT - Refresher - Advanced OS (Georgia Institute of Technology via Udacity)
Distributed Machine Learning with Apache Spark (University of California, Berkeley via edX)
CS125x: Advanced Distributed Machine Learning with Apache Spark (University of California, Berkeley via edX)