Deployment - FSDL 2022
Offered By: The Full Stack via YouTube
Course Description
Overview
Syllabus
Overview
First, deploy a prototype with Gradio or Streamlit
Model-in-server architecture
Model-in-database architecture
Model-as-a-service architecture
REST APIs for model services
Dependency management for model services
Containerization for model services with Docker
Performance optimization: to GPU or not to GPU?
Optimization for CPUs: distillation, quantization, and caching
Optimization for GPUs: batching and GPU sharing
Libraries for model serving on GPUs
Horizontal scaling
Horizontal scaling with container orchestration (k8s)
Horizontal scaling with serverless services
Rollouts: shadows and canaries
Managed options for model serving (AWS SageMaker)
Takeaways on model services
Moving to edge
Frameworks for edge deployment
Making efficient models for the edge
Mindsets and takeaways for edge deployment
Takeaways for deploying ML models
Taught by
The Full Stack
Related Courses
Neural Networks for Machine Learning - University of Toronto via Coursera
機器學習技法 (Machine Learning Techniques) - National Taiwan University via Coursera
Machine Learning Capstone: An Intelligent Application with Deep Learning - University of Washington via Coursera
Прикладные задачи анализа данных (Applied Problems in Data Analysis) - Moscow Institute of Physics and Technology via Coursera
Leading Ambitious Teaching and Learning - Microsoft via edX