How to Reduce ML Computing Costs: Building Efficient Multi-Cloud Infrastructure
Offered By: MLOps World: Machine Learning in Production via YouTube
Course Description
Overview
Discover how to significantly reduce machine learning computing costs in this 38-minute conference talk from MLOps World. Learn from Jaeman An, Founder & CEO of VESSL AI, as he shares insights on building time- and cost-effective ML infrastructure. Explore hybrid cloud architectures using Terraform and Kubernetes, cost optimization techniques with spot instances and fractional GPUs, and solutions to common multi-environment challenges. Gain practical knowledge on dataset mounting, network performance optimization, and server monitoring. Follow a step-by-step guide to implement these strategies and potentially achieve over 80% cost savings on ML projects.
Syllabus
Intro
From notebooks to training jobs
Experiment dashboard
Cluster Dashboard
Multi-Cloud ML Infrastructure
Build hybrid cluster with Kubernetes & Terraform
Terraforming AWS Infrastructure
Test AWS Infrastructure
Terraforming GCP Infrastructure
Cloud Troubleshooting
Dataset Mounting
Cluster Management & Monitoring
Common Interface
Fractional GPUs
multicluster-scheduler
reCap: Step-by-Step Guide
Taught by
MLOps World: Machine Learning in Production
Related Courses
Cybersecurity Policy for Water and Electricity InfrastructuresUniversity of Colorado System via Coursera Continuous Delivery & DevOps
University of Virginia via Coursera Preparing for your Professional Cloud Architect Journey
Google Cloud via Coursera Infrastructure Planning and Managements
Indian Institute of Technology Madras via Swayam Public Library Management
University of Michigan via edX