Deployment - FSDL 2022
Offered By: The Full Stack via YouTube
Course Description
Overview
Syllabus
Overview
First, deploy a prototype with gradio or streamlit
Model-in-server architecture
Model-in-database architecture
Model-as-a-service architecture
REST APIs for model services
Dependency management for model services
Containerization for model services with Docker
Performance optimization: to GPU or not to GPU?
Optimization for CPUs: distillation, quantization, and caching
Optimization for GPUs: Batching and GPU sharing
Libraries for model serving on GPUs
Horizontal scaling
Horizontal scaling with container orchestration k8s
Horizontal scaling with serverless services
Rollouts: shadows and canaries
Managed options for model serving AWS Sagemaker
Takeaways on model services
Moving to edge
Frameworks for edge deployment
Making efficient models for the edge
Mindsets and takeaways for edge deployment
Takeways for deploying ML models
Taught by
The Full Stack
Related Courses
Building Scalable Applications with .NET CoreEDUCBA via Coursera Cloud computing en AWS: Ejecuta un sitio web en EC2
Coursera Project Network via Coursera Como manter a alta disponibilidade com o Auto Scaling (Português) | Maintaining High Availability with Auto Scaling (Portuguese)
Amazon Web Services via AWS Skill Builder Introduction to Amazon CloudFront (Portuguese)
Amazon Web Services via AWS Skill Builder Architecting Big Data Applications: Batch Mode Application Engineering
LinkedIn Learning