Data Pipelines à La Mode
Offered By: GOTO Conferences via YouTube
Course Description
Overview
This conference talk from YOW! 2019 examines three recurring problems in data processing systems: reproducing earlier results, versioning the code that produced them, and handling changes to an algorithm. It introduces "pure" data pipelines and shows how techniques from distributed build systems can improve traceability, preserve previous results, and make workflows more efficient. The ideas are illustrated with concrete examples in several languages and distributed computation frameworks. The talk is aimed at data scientists, analytics professionals, and anyone who designs or maintains data processing systems.
Syllabus
Data Pipelines à La Mode • Tommy Hall • YOW! 2019
Taught by
GOTO Conferences
Related Courses
内存数据库管理
openHPI
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Processing Big Data with Azure Data Lake Analytics
Microsoft via edX
Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera