Taming the Beast - Managing the Day 2 Operational Complexity of Kubeflow
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore strategies for managing the operational complexity of Kubeflow in this 25-minute conference talk from KubeCon + CloudNativeCon North America 2021. Gain insights into deploying, configuring, and maintaining Kubeflow, a machine learning toolkit for Kubernetes. Learn tips for navigating the platform's many components, including notebooks, service meshes, and pipelines. Discover lessons from experienced practitioners on managing Kubeflow deployments and contributing to upstream development. Delve into topics such as deployment options, operators, day 2 operations, component updates, security measures, integration with Istio, upgrades, external databases, and troubleshooting techniques. Equip yourself with practical knowledge to effectively tame the operational challenges of Kubeflow and optimize your machine learning workflows on Kubernetes.
Syllabus
Intro
Overview
Challenges
Taming the Beast
Kubeflow Deployment
Kubeflow Manifests Restructure
How This Can Help
Another Deployment Option
Operators
Kubeflow Operator
Day 2
Kubeflow Component Update
Stale Webhooks
Securing Your Deployment
Integration with Different Istio
Updates / Upgrades
External DB
Kubernetes Version Update
Troubleshoot Platform
Monitoring
The Future
References
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Building Geospatial Apps on Postgres, PostGIS, & Citus at Large ScaleMicrosoft via YouTube Unlocking the Power of ML for Your JavaScript Applications with TensorFlow.js
TensorFlow via YouTube Managing the Reactive World with RxJava - Jake Wharton
ChariotSolutions via YouTube What's New in Grails 2.0
ChariotSolutions via YouTube Performance Analysis of Apache Spark and Presto in Cloud Environments
Databricks via YouTube