ETL in Python and SQL
Offered By: LinkedIn Learning
Course Description
Overview
Gain the knowledge you need to build data pipelines in a data-driven world.
Syllabus
Introduction
- Create an ETL in Python and SQL
- Tools used in this course
- What are ETLs and how do you create them?
- ETL process overview
- Exploring your data with pandas (Python) and SQL
- Understanding your data
- Challenge: Reading data using Python
- Solution: Reading data using Python
- Loading data from different sources
- Extracting your data
- Cleaning, preprocessing data, and data formatting
- Standardization, handling duplicates, and missing values
- Challenge: Extract and transform data using pandas
- Solution: Extract and transform data using pandas
- Introduction to data warehouses and data lakes
- Loading data into relational databases
- Data quality checks and validation with SQL
- Challenge: Transform the data and remove duplicates and nulls
- Solution: Transform the data and remove duplicates and nulls
- Querying your data with SQL
- Scheduling ETL jobs with Airflow: Part 1
- Scheduling ETL jobs with Airflow: Part 2
- Challenge: Load the data into a database and automate
- Solution: Load the data into a database and automate
- Expand your knowledge of ETLs
Taught by
Jennifer Ebe
Related Courses
Building Batch Data Pipelines on GCP auf DeutschGoogle Cloud via Coursera Building Batch Data Pipelines on GCP en Français
Google Cloud via Coursera Mastering Azure Data Factory: From Basics to Advanced Level
Udemy Data Science de A a Z - Extraçao e Exibição dos Dados
Udemy Building Batch Data Processing Solutions in Microsoft Azure
Pluralsight