YoVDO

Data Engineering Foundations

Offered By: LinkedIn Learning

Tags

Databases Courses Hadoop Courses Apache Spark Courses Apache Airflow Courses PostgreSQL Courses Data Extraction Courses MapReduce Courses Data Engineering Courses ETL Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn the key facets of data engineering, from its place in the data science realm, to the specific tasks and skills every data engineer should possess.

Syllabus

Introduction
  • What is data engineering?
1. Introduction to Data Engineering
  • Introduction to data engineering
  • Data engineer vs. data scientist
  • Essential tools for data engineering
2. Databases and Dataframes
  • Intro to databases and their types
  • Understanding database schema
  • Distributive computing
3. Data Engineering Tools
  • MapReduce and Hadoop
  • Hive
  • Spark
  • Airflow
4. ETL Pipelines
  • Sources of data extraction
  • Data extraction from a PostgreSQL database
  • Challenge: Data extraction
  • Solution: Data extraction
  • Transforming data
  • Challenge: Transforming data
  • Solution: Transforming data
  • Loading data into a DB
  • Challenge: Loading data
  • Solution: Loading data
  • Scheduling ETL pipeline using Airflow
Conclusion
  • Next steps

Taught by

Harshit Tyagi

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera