YoVDO

Spark for Machine Learning & AI

Offered By: LinkedIn Learning

Tags

Apache Spark Courses Machine Learning Courses Classification Courses Clustering Courses Recommendation Systems Courses K-means Courses Data Preprocessing Courses DataFrames Courses MLlib Courses

Course Description

Overview

Discover the powerful Apache Spark platform for machine learning. Learn about preprocessing data, applying algorithms to a variety of machine learning problems, and more.

Syllabus

Introduction
  • Welcome
1. Introduction to Spark and MLlib
  • Introduction to Spark
  • Steps in the machine learning process
  • Install Spark
  • Organizing data in DataFrames
  • Components of Spark MLlib
2. Data Preparation and Transformation
  • Introduction to preprocessing
  • Normalize numeric data
  • Standardize numeric data
  • Bucketize numeric data
  • Tokenize text data
  • TF-IDF
  • Summary of preprocessing
3. Clustering
  • Introduction to clustering
  • K-means clustering
  • Hierarchical clustering
  • Summary of clustering techniques
4. Classification
  • Introduction to classification
  • Preprocessing the Iris data set
  • Naive Bayes classification
  • Multilayer perceptron classification
  • Decision trees classification
  • Summary of classification algorithms
5. Regression
  • Introduction to regresssion
  • Preprocessing regression data
  • Linear regression
  • Decision tree regression
  • Gradient-boosted tree regression
  • Summary of regression algorithms
6. Recommendations
  • Understand recommendation systems
  • Collaborative filtering
Conclusion
  • Tips for using Spark MLlib

Taught by

Dan Sullivan

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera