Spark for Machine Learning & AI
Offered By: LinkedIn Learning
Course Description
Overview
Discover the powerful Apache Spark platform for machine learning. Learn about preprocessing data, applying algorithms to a variety of machine learning problems, and more.
Syllabus
Introduction
- Welcome
- Introduction to Spark
- Steps in the machine learning process
- Install Spark
- Organizing data in DataFrames
- Components of Spark MLlib
- Introduction to preprocessing
- Normalize numeric data
- Standardize numeric data
- Bucketize numeric data
- Tokenize text data
- TF-IDF
- Summary of preprocessing
- Introduction to clustering
- K-means clustering
- Hierarchical clustering
- Summary of clustering techniques
- Introduction to classification
- Preprocessing the Iris data set
- Naive Bayes classification
- Multilayer perceptron classification
- Decision trees classification
- Summary of classification algorithms
- Introduction to regresssion
- Preprocessing regression data
- Linear regression
- Decision tree regression
- Gradient-boosted tree regression
- Summary of regression algorithms
- Understand recommendation systems
- Collaborative filtering
- Tips for using Spark MLlib
Taught by
Dan Sullivan
Related Courses
Mining Massive DatasetsStanford University via edX Nearest Neighbor Collaborative Filtering
University of Minnesota via Coursera Practical Deep Learning For Coders
fast.ai via Independent Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX ความรู้พื้นฐานเกี่ยวกับบิ๊กดาตา | Big Data Concept
Sukhothai Thammathirat Open University via ThaiMOOC