YoVDO

ETL Processing on Google Cloud Using Dataflow and BigQuery

Offered By: Google via Google Cloud Skills Boost

Tags

BigQuery Courses Data Warehousing Courses Google Cloud Platform (GCP) Courses Dataflow Courses Apache Beam Courses Data Transformation Courses Data Engineering Courses Cloud Storage Courses Data Pipelines Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
In this lab you will build several Data Pipelines that will ingest data from a publicly available dataset into BigQuery.

Syllabus

  • GSP290
  • Overview
  • Setup
  • Task 1. Ensure that the Dataflow API is successfully enabled
  • Task 2. Download the starter code
  • Task 3. Create Cloud Storage Bucket
  • Task 4. Copy files to your bucket
  • Task 5. Create the BigQuery dataset
  • Task 6. Build a Dataflow pipeline
  • Task 7. Data ingestion
  • Task 8. Review pipeline python code
  • Task 9. Run the Apache Beam pipeline
  • Task 10. Data transformation
  • Task 11. Run the Apache Beam pipeline
  • Task 12. Data enrichment
  • Task 13. Review pipeline python code
  • Task 14. Run the Apache Beam pipeline
  • Task 15. Data lake to Mart
  • Task 16. Review pipeline python code
  • Task 17. Run the Apache Beam Pipeline
  • Test your understanding
  • Congratulations!

Tags

Related Courses

Google Cloud Fundamentals: Core Infrastructure
Google via Coursera
Google Cloud Big Data and Machine Learning Fundamentals
Google Cloud via Coursera
Serverless Data Analysis with Google BigQuery and Cloud Dataflow en Français
Google Cloud via Coursera
Essential Google Cloud Infrastructure: Foundation
Google Cloud via Coursera
Elastic Google Cloud Infrastructure: Scaling and Automation
Google Cloud via Coursera