Automating Data Pipelines with Python and GitHub Actions - Code Walkthrough
Offered By: Shaw Talebi via YouTube
Course Description
Overview
Learn how to automate data pipelines using Python and GitHub Actions in this comprehensive video tutorial. Explore two methods for automation: using orchestration tools and combining Python with triggers. Dive into a practical example of automating an ETL (Extract, Transform, Load) pipeline, covering the entire process from creating the Python script to setting up a GitHub repository and configuring GitHub Actions. Discover how to create workflow YAML files, add repository secrets, and commit changes. Gain insights into building a full-stack data science project, with additional resources provided for further learning and implementation.
Syllabus
Intro -
Motivation -
2 Ways to Automate -
Way 1: Orchestration Tool -
Way 2: Python + Triggers -
GitHub Actions -
Example Code: Automating ETL Pipeline -
1 Create ETL Python Script -
2 Create GitHub Repo -
3 Create Workflow .yml File -
4 Add Repo Secrets -
5 Commit and Push -
Final ML App -
Taught by
Shaw Talebi
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera