ETL Processing on Google Cloud Using Dataflow and BigQuery
Offered By: Google via Google Cloud Skills Boost
Course Description
Overview
In this lab you will build several Data Pipelines that will ingest data from a publicly available dataset into BigQuery.
Syllabus
- GSP290
- Overview
- Setup
- Task 1. Ensure that the Dataflow API is successfully enabled
- Task 2. Download the starter code
- Task 3. Create Cloud Storage Bucket
- Task 4. Copy files to your bucket
- Task 5. Create the BigQuery dataset
- Task 6. Build a Dataflow pipeline
- Task 7. Data ingestion
- Task 8. Review pipeline python code
- Task 9. Run the Apache Beam pipeline
- Task 10. Data transformation
- Task 11. Run the Apache Beam pipeline
- Task 12. Data enrichment
- Task 13. Review pipeline python code
- Task 14. Run the Apache Beam pipeline
- Task 15. Data lake to Mart
- Task 16. Review pipeline python code
- Task 17. Run the Apache Beam Pipeline
- Test your understanding
- Congratulations!
Tags
Related Courses
SAP Business Warehouse powered by SAP HANASAP Learning Relational Database Support for Data Warehouses
University of Colorado System via Coursera Data Warehouse Concepts, Design, and Data Integration
University of Colorado System via Coursera Business Intelligence Concepts, Tools, and Applications
University of Colorado System via Coursera Design and Build a Data Warehouse for Business Intelligence Implementation
University of Colorado System via Coursera