Scaling Machine Learning with Spark
Offered By: GOTO Conferences via YouTube
Course Description
Overview
Explore the intricacies of building scalable machine learning solutions with Apache Spark in this insightful GOTO Book Club interview featuring Adi Polak and Holden Karau. Delve into the world of distributed computing and machine learning as these experts discuss their book "Scaling Machine Learning with Spark." Learn about managing the ML lifecycle with MLflow, data preprocessing with Spark, feature engineering, model training with MLlib, and building pipelines. Discover how to combine Spark's power with deep learning, work with distributed TensorFlow, and scale machine learning using PyTorch. Gain valuable insights into when and why to use various technologies in the Apache Spark ecosystem, including Spark MLlib, MLflow, TensorFlow, and PyTorch. Perfect for data scientists and ML practitioners looking to enhance their understanding of end-to-end distributed ML workflows.
Syllabus
Scaling Machine Learning with Spark • Adi Polak & Holden Karau
Taught by
GOTO Conferences
Related Courses
Genomic Data Science and Clustering (Bioinformatics V)University of California, San Diego via Coursera 用Python玩转数据 Data Processing Using Python
Nanjing University via Coursera Data Mining Project
University of Illinois at Urbana-Champaign via Coursera Advanced Business Analytics Capstone
University of Colorado Boulder via Coursera Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX