Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform

Offered By: Google Cloud via Coursera

Tags

Dataproc Courses Big Data Courses Cloud Computing Courses

Course Description

Overview

This 1-week, accelerated course builds upon previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from your compute clusters and integrate Google’s machine learning capabilities into your analytics programs.

In the hands-on labs, you will create and manage Dataproc clusters using the Web Console and the CLI, and use a cluster to run Spark and Pig jobs. You will then create IPython notebooks that integrate with BigQuery and storage and use Spark. Finally, you will integrate the machine learning APIs into your data analysis.

Pre-requisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python

Syllabus

Module 1: Introduction to Cloud Dataproc

Module 2: Running Dataproc jobs

Module 3: Leveraging GCP

Module 4: Analyzing Unstructured Data


Taught by

Google Cloud Training

Related Courses

Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform auf Deutsch
Google Cloud via Coursera
Leveraging Unstructured Data with Cloud Dataproc on Google Cloud em Português Brasileiro
Google Cloud via Coursera
Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform en Español
Google Cloud via Coursera
Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform 日本語版
Google Cloud via Coursera
Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform en Français
Google Cloud via Coursera