Building Data Workflows with Luigi and Kubernetes
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore how to build complex data pipelines in Python using Luigi and Kubernetes in this EuroPython 2019 conference talk. Learn about Luigi's problem-solving capabilities for batch job management, including dependency resolution, workflow management, visualization, and failure handling. Discover techniques for packaging Luigi pipelines as Docker images for easier testing and deployment. Gain insights into deploying pipelines on Kubernetes clusters for scalable Big Data processing and cost-effective infrastructure management. Get tips and tricks for optimizing Luigi Scheduler's performance with Kubernetes batch execution. Benefit from a demo project and practical advice tailored for data scientists, data engineers, BI developers, and software developers working with batch jobs and Big Data.
Syllabus
Intro
About Nar
Luigi Implementation
Kubernetes Implementation
Setup
Kubernetes
Questions
Different Kubernetes
Why Luigi
Configuring Luigi
Wrapping up
Taught by
EuroPython Conference
Related Courses
Artificial Intelligence for RoboticsStanford University via Udacity Intro to Computer Science
University of Virginia via Udacity Design of Computer Programs
Stanford University via Udacity Web Development
Udacity Programming Languages
University of Virginia via Udacity