YoVDO

Mastering a Data Pipeline with Python - 6 Years of Learned Lessons from Mistakes

Offered By: EuroPython Conference via YouTube

Tags

EuroPython Courses, Python Courses, PySpark Courses, Data Storage Courses, Data Transformation Courses, Data Acquisition Courses, Data Pipelines Courses, Data Ingestion Courses

Course Description

Overview

This EuroPython 2020 talk by Robson Junior covers building data pipelines with Python, drawing on six years of experience and the lessons learned from mistakes made while building reliable pipelines and managing large volumes of data. It treats Python as the core technology for pipeline development and walks through the main pieces of a pipeline: data acquisition, ingestion, transformation, storage, workflow management, and serving. The talk compares PySpark with Dask and Pandas, discusses the role of Airflow in workflow management, and looks at Apache Arrow as a newer approach to data processing. It also covers best practices and how to anticipate and address common problems in pipeline development.

Syllabus

Robson Junior - Mastering a data pipeline with Python: 6 years of learned lessons from mistakes


Taught by

EuroPython Conference

Related Courses

A Brief History of Data Storage
EuroPython Conference via YouTube
Breaking the Stereotype - Evolution & Persistence of Gender Bias in Tech
EuroPython Conference via YouTube
We Can Get More from Spatial, GIS, and Public Domain Datasets
EuroPython Conference via YouTube
Using NLP to Detect Knots in Protein Structures
EuroPython Conference via YouTube
The Challenges of Doing Infra-As-Code Without "The Cloud"
EuroPython Conference via YouTube