Accelerating High-Performance Machine Learning at Scale in Kubernetes
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore a hands-on guide for productionizing optimized machine learning models in cloud native ecosystems using production-ready open source frameworks in this 36-minute conference talk from KubeCon + CloudNativeCon North America 2022. Dive into a practical use case deploying the GPT-2 NLP model in Kubernetes using ONNX Runtime from the Seldon Core Triton server. Learn how to create a scalable production NLP microservice for intelligent text generation applications. Discover key challenges in the MLOps space and understand how various tools interoperate throughout the production machine learning lifecycle. Gain insights from industry experts Alejandro Saucedo and Elena Neroslavskaya on accelerating high-performance machine learning at scale in Kubernetes environments.
Syllabus
Accelerating High-Performance Machine Learning at Scale i... Alejandro Saucedo & Elena Neroslavskaya
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Learning Machine Learning with .NET, PyTorch and the ONNX RuntimeMicrosoft via YouTube Using Apache OpenNLP with OpenSearch K-NN Vector Search
Linux Foundation via YouTube LLMs Fine Tuning and Inferencing Using ONNX Runtime - Workshop
Linux Foundation via YouTube Real-Time Inference of Neural Networks: A Guide for DSP Engineers
ADC - Audio Developer Conference via YouTube MLOPS: Inferencia ViVIT ONNX Model en Azure ML Managed EndPoint (AKS)
The Machine Learning Engineer via YouTube