Data Engineering Foundations
Offered By: LinkedIn Learning
Course Description
Overview
Learn the key facets of data engineering, from its place in the data science realm to the specific tasks and skills every data engineer should possess.
Syllabus
Introduction
- What is data engineering?
- Introduction to data engineering
- Data engineer vs. data scientist
- Essential tools for data engineering
- Intro to databases and their types
- Understanding database schema
- Distributed computing
- MapReduce and Hadoop
- Hive
- Spark
- Airflow
- Sources of data extraction
- Data extraction from a PostgreSQL database
- Challenge: Data extraction
- Solution: Data extraction
- Transforming data
- Challenge: Transforming data
- Solution: Transforming data
- Loading data into a DB
- Challenge: Loading data
- Solution: Loading data
- Scheduling ETL pipeline using Airflow
- Next steps
Taught by
Harshit Tyagi
Related Courses
- Building Batch Data Pipelines on GCP auf Deutsch (Google Cloud via Coursera)
- Building Batch Data Pipelines on GCP en Français (Google Cloud via Coursera)
- Mastering Azure Data Factory: From Basics to Advanced Level (Udemy)
- Data Science de A a Z - Extraçao e Exibição dos Dados (Udemy)
- Building Batch Data Processing Solutions in Microsoft Azure (Pluralsight)