YoVDO

CS125x: Advanced Distributed Machine Learning with Apache Spark

Offered By: University of California, Berkeley via edX

Tags

Machine Learning Courses Artificial Intelligence Courses Distributed Systems Courses

Course Description

Overview

Building on the core ideas presented in Distributed Machine Learning with Spark, this course covers advanced topics for training and deploying large-scale learning pipelines. You will study state-of-the-art distributed algorithms for collaborative filtering, ensemble methods (e.g., random forests), clustering and topic modeling, with a focus on model parallelism and the crucial tradeoffs between computation and communication.

After completing this course, you will have a thorough understanding of the statistical and algorithmic principles required to develop and deploy distributed machine learning pipelines. You will further have the expertise to write efficient and scalable code in Spark, using MLlib and the spark.ml package in particular.

Taught by

Ameet Talwalkar and Jon Bates

Tags

Related Courses

Advanced Operating Systems
Georgia Institute of Technology via Udacity
High Performance Computing
Georgia Institute of Technology via Udacity
GT - Refresher - Advanced OS
Georgia Institute of Technology via Udacity
Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX
Reliable Distributed Algorithms - Part 2
KTH Royal Institute of Technology via edX