YoVDO

Accelerating High-Performance Machine Learning at Scale in Kubernetes

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Machine Learning Courses Kubernetes Courses MLOps Courses GPT-2 Courses Model Optimization Courses ONNX Runtime Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a hands-on guide for productionizing optimized machine learning models in cloud native ecosystems using production-ready open source frameworks in this 36-minute conference talk from KubeCon + CloudNativeCon North America 2022. Dive into a practical use case deploying the GPT-2 NLP model in Kubernetes using ONNX Runtime from the Seldon Core Triton server. Learn how to create a scalable production NLP microservice for intelligent text generation applications. Discover key challenges in the MLOps space and understand how various tools interoperate throughout the production machine learning lifecycle. Gain insights from industry experts Alejandro Saucedo and Elena Neroslavskaya on accelerating high-performance machine learning at scale in Kubernetes environments.

Syllabus

Accelerating High-Performance Machine Learning at Scale i... Alejandro Saucedo & Elena Neroslavskaya


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Generating New Recipes using GPT-2
Coursera Project Network via Coursera
Deep Learning NLP: Training GPT-2 from scratch
Coursera Project Network via Coursera
Artificial Creativity
Parsons School of Design via Coursera
Coding Train Late Night - GPT-2, Hue Lights, Discord Bot
Coding Train via YouTube
Coding Train Late Night - Fetch, GPT-2 and RunwayML
Coding Train via YouTube