YoVDO

Data Engineering Foundations

Offered By: LinkedIn Learning

Tags

Databases Courses Hadoop Courses Apache Spark Courses Apache Airflow Courses PostgreSQL Courses Data Extraction Courses MapReduce Courses Data Engineering Courses ETL Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn the key facets of data engineering, from its place in the data science realm, to the specific tasks and skills every data engineer should possess.

Syllabus

Introduction
  • What is data engineering?
1. Introduction to Data Engineering
  • Introduction to data engineering
  • Data engineer vs. data scientist
  • Essential tools for data engineering
2. Databases and Dataframes
  • Intro to databases and their types
  • Understanding database schema
  • Distributive computing
3. Data Engineering Tools
  • MapReduce and Hadoop
  • Hive
  • Spark
  • Airflow
4. ETL Pipelines
  • Sources of data extraction
  • Data extraction from a PostgreSQL database
  • Challenge: Data extraction
  • Solution: Data extraction
  • Transforming data
  • Challenge: Transforming data
  • Solution: Transforming data
  • Loading data into a DB
  • Challenge: Loading data
  • Solution: Loading data
  • Scheduling ETL pipeline using Airflow
Conclusion
  • Next steps

Taught by

Harshit Tyagi

Related Courses

Intro to Hadoop and MapReduce
Cloudera via Udacity
Processing Big Data with Hadoop in Azure HDInsight
Microsoft via edX
Implementing Real-Time Analytics with Hadoop in Azure HDInsight
Microsoft via edX
Hadoop Platform and Application Framework
University of California, San Diego via Coursera
Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera