YoVDO

DevOps to MLOps: Scaling ML Models to 2 Million+ Requests per Day

Offered By: Conf42 via YouTube

Tags

MLOps Courses Machine Learning Courses DevOps Courses Cloud Computing Courses Microservices Courses Scaling Courses API Development Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the journey from DevOps to MLOps in this conference talk that delves into scaling machine learning models to handle over 2 million requests per day. Learn about the key steps in MLOps, including model development, deployment, and monitoring. Discover a real-world case study on eKYC SaaS APIs, examining the cloud-agnostic architecture and scaling strategies employed. Gain insights into eliminating single points of failure, capacity planning, and cost optimization through autoscaling. Analyze production issues related to GPU utilization and high latency, and extract valuable lessons for implementing MLOps at scale. Enhance your understanding of the challenges and solutions in transitioning from traditional DevOps to MLOps for large-scale machine learning applications.

Syllabus

intro
preamble
chinmay naik
agenda
what is mlops
mlops steps
simpelst mlops flow
production work ahead
case study - ekyc saas apis
ml model apis
architecture
ekyc saas apis - requirements
cloud agnostic architecture
why cloud agnostic?
scaling journey
eliminate single points of failure
capacity planning
cost optimization and autoscaling
production issue 1 - gpu utilization in nomad
production issue 2 - high latency issue
lessons
keep learning


Taught by

Conf42

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent