YoVDO

Spark for Machine Learning & AI

Offered By: LinkedIn Learning

Tags

Apache Spark Courses Machine Learning Courses Classification Courses Clustering Courses Recommendation Systems Courses K-means Courses Data Preprocessing Courses DataFrames Courses MLlib Courses

Course Description

Overview

Discover the powerful Apache Spark platform for machine learning. Learn about preprocessing data, applying algorithms to a variety of machine learning problems, and more.

Syllabus

Introduction
  • Welcome
1. Introduction to Spark and MLlib
  • Introduction to Spark
  • Steps in the machine learning process
  • Install Spark
  • Organizing data in DataFrames
  • Components of Spark MLlib
2. Data Preparation and Transformation
  • Introduction to preprocessing
  • Normalize numeric data
  • Standardize numeric data
  • Bucketize numeric data
  • Tokenize text data
  • TF-IDF
  • Summary of preprocessing
3. Clustering
  • Introduction to clustering
  • K-means clustering
  • Hierarchical clustering
  • Summary of clustering techniques
4. Classification
  • Introduction to classification
  • Preprocessing the Iris data set
  • Naive Bayes classification
  • Multilayer perceptron classification
  • Decision trees classification
  • Summary of classification algorithms
5. Regression
  • Introduction to regresssion
  • Preprocessing regression data
  • Linear regression
  • Decision tree regression
  • Gradient-boosted tree regression
  • Summary of regression algorithms
6. Recommendations
  • Understand recommendation systems
  • Collaborative filtering
Conclusion
  • Tips for using Spark MLlib

Taught by

Dan Sullivan

Related Courses

Genomic Data Science and Clustering (Bioinformatics V)
University of California, San Diego via Coursera
用Python玩转数据 Data Processing Using Python
Nanjing University via Coursera
Data Mining Project
University of Illinois at Urbana-Champaign via Coursera
Advanced Business Analytics Capstone
University of Colorado Boulder via Coursera
Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX