Spark MLlib

Offered By: IBM via Cognitive Class

Tags

Apache Spark Courses, Statistics & Probability Courses, Machine Learning Courses, Linear Algebra Courses, Classification Courses, Collaborative Filtering Courses, Clustering Courses

Course Description

Overview

Spark provides a machine learning library known as MLlib. Spark MLlib offers algorithms for classification, regression, clustering, and collaborative filtering. It also provides tools for featurization, pipelines, and persistence, along with utilities for linear algebra, statistics, and data handling. This course starts you on your journey by walking you through parts of the library and showing how to use them.

Syllabus

  • Module 1 - Spark MLlib Datatypes
    1. Understand the difference between Dense and Sparse Data Types, and how they apply to LabeledPoints and matrices.
    2. Understand how to create and use the different matrices that are available in Spark MLlib.
  • Module 2 - Review of Algorithms
    1. Have a general understanding of each of the algorithms that will be discussed in the course and how they work.
    2. Learn how to instantiate simple regression and classification models, including Linear Regression, Support Vector Machines, and Logistic Regression.
  • Module 3 - Spark MLlib Decision Trees and Random Forests
    1. Learn about the different input parameters used to create Decision Trees and Random Forests. 
    2. Understand the effects of tuning specific parameters for Decision Trees and Random Forests. 
  • Module 4 - Spark MLlib Clustering
    1. Learn about the parameters involved in creating K-Means Clustering models and Gaussian Mixture Clustering models.
    2. Describe the outputs and uses of the functions available for each clustering model.
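
As a rough illustration of the dense versus sparse distinction in Module 1, the sketch below uses plain Python rather than Spark itself. The `SparseVec` class and `to_sparse` helper are hypothetical names invented for this example; they only mirror the (size, indices, values) layout that MLlib's `Vectors.sparse` accepts, and the tuple at the end merely stands in for a labeled point.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SparseVec:
    """Hypothetical stand-in for a sparse vector (not the MLlib class)."""
    size: int             # total length, including the zero entries
    indices: List[int]    # positions of the nonzero entries, ascending
    values: List[float]   # nonzero entries, aligned with indices

    def to_dense(self) -> List[float]:
        # Start from all zeros and fill in the stored entries.
        dense = [0.0] * self.size
        for i, v in zip(self.indices, self.values):
            dense[i] = v
        return dense


def to_sparse(dense: List[float]) -> SparseVec:
    """Keep only the nonzero entries of a dense list."""
    pairs = [(i, v) for i, v in enumerate(dense) if v != 0.0]
    return SparseVec(len(dense), [i for i, _ in pairs], [v for _, v in pairs])


dense = [1.0, 0.0, 0.0, 3.0]   # dense form stores every entry
sparse = to_sparse(dense)      # sparse form stores two of the four entries

# A labeled point pairs a label with a feature vector; either form works.
label = 1.0
labeled_point = (label, sparse)
```

The sparse form pays off when most entries are zero, since storage grows with the number of nonzero values rather than with the vector length.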

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Statistics One
Princeton University via Coursera
Intro to Statistics
Stanford University via Udacity
Passion Driven Statistics
Wesleyan University via Coursera