Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform
Offered By: Google Cloud via Coursera
Course Description
Overview
This 1-week, accelerated course builds upon previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from their compute clusters and integrate Google’s machine learning capabilities into their analytics programs.
In the hands-on labs, you will create and manage Dataproc Clusters using the Web Console and the CLI, and use cluster to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you integrate the machine learning APIs into your data analysis.
Pre-requisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python
In the hands-on labs, you will create and manage Dataproc Clusters using the Web Console and the CLI, and use cluster to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you integrate the machine learning APIs into your data analysis.
Pre-requisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python
Syllabus
Module 1: Introduction to Cloud Dataproc
Module 2: Running Dataproc jobs
Module 3: Leveraging GCP
Module 4: Analyzing Unstructured Data
Module 2: Running Dataproc jobs
Module 3: Leveraging GCP
Module 4: Analyzing Unstructured Data
Taught by
Google Cloud Training
Tags
Related Courses
Software as a ServiceUniversity of California, Berkeley via Coursera Software Defined Networking
Georgia Institute of Technology via Coursera Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera Web-Technologien
openHPI Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique