Running Multiple Models on the Same GPU on Spot Instances
Offered By: MLOps World: Machine Learning in Production via YouTube
Course Description
Overview
Discover cost-effective strategies for running machine learning inference in the cloud through this 33-minute conference talk from MLOps World: Machine Learning in Production. Explore GPU fractionalization and the use of Spot instances as presented by Oscar Rovira, Co-founder of Mystic AI. Learn about the benefits and limitations of GPU fractionalization, as well as the value and potential challenges of utilizing Spot instances. Gain insights into how combining these approaches can significantly increase throughput and reduce costs for your GenAI applications, with practical examples provided to illustrate these optimization techniques.
Syllabus
Running Multiple Models on the Same GPU, on Spot Instances
Taught by
MLOps World: Machine Learning in Production
Related Courses
Software as a ServiceUniversity of California, Berkeley via Coursera Software Defined Networking
Georgia Institute of Technology via Coursera Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera Web-Technologien
openHPI Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique