YoVDO

Machine Learning with Spark on Google Cloud Dataproc

Offered By: Google via Google Cloud Skills Boost

Tags

Dataproc Courses Big Data Courses Machine Learning Courses Apache Spark Courses Data Cleaning Courses Logistic Regression Courses Model Evaluation Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
In this lab you will learn how to implement logistic regression using a machine learning library for Apache Spark running on a Google Cloud Dataproc cluster to develop a model for data from a multivariable dataset

Syllabus

  • GSP271
  • Overview
  • Setup and requirements
  • Task 1. Create a Dataproc cluster
  • Task 2. Set up bucket and start pyspark session
  • Task 3. Read and clean up dataset
  • Task 4. Develop a logistic regression model
  • Task 5. Save and restore a logistic regression model
  • Task 6. Predict with the logistic regression model
  • Task 7. Examine model behavior
  • Task 8. Evaluate the model
  • Congratulations!

Tags

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera