Spark for Machine Learning & AI
Offered By: LinkedIn Learning
Course Description
Overview
Discover the powerful Apache Spark platform for machine learning. Learn about preprocessing data, applying algorithms to a variety of machine learning problems, and more.
Syllabus
Introduction
- Welcome
- Introduction to Spark
- Steps in the machine learning process
- Install Spark
- Organizing data in DataFrames
- Components of Spark MLlib
- Introduction to preprocessing
- Normalize numeric data
- Standardize numeric data
- Bucketize numeric data
- Tokenize text data
- TF-IDF
- Summary of preprocessing
- Introduction to clustering
- K-means clustering
- Hierarchical clustering
- Summary of clustering techniques
- Introduction to classification
- Preprocessing the Iris data set
- Naive Bayes classification
- Multilayer perceptron classification
- Decision trees classification
- Summary of classification algorithms
- Introduction to regresssion
- Preprocessing regression data
- Linear regression
- Decision tree regression
- Gradient-boosted tree regression
- Summary of regression algorithms
- Understand recommendation systems
- Collaborative filtering
- Tips for using Spark MLlib
Taught by
Dan Sullivan
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera