YoVDO

Advanced LLMOps: Deploying and Managing LLMs in Production

Offered By: LinkedIn Learning

Tags

Vector Databases Courses API Deployment Courses LLMOps Courses Retrieval Augmented Generation (RAG) Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn advanced techniques and best practices for deploying and monitoring large language models in production environments.

Syllabus

Introduction
  • Deploying LLMs for production
  • Working in Google Colab
1. Deployment Options for LLMs
  • Overview of deployment options
  • Deploying via APIs
  • Using fine-tuned models for deployment
  • Custom models: Building and deployment
2. Handling API Limitations
  • Understanding API limitations
  • Strategies to handle endpoint uptime limitations
  • Mitigating latency issues in LLM deployment
  • Challenge: API limitations for LLM deployment
  • Solution: API limitations for LLM deployment
3. Deployment Architecture
  • Vector databases for LLM deployment
  • Agents in LLM deployment
  • Chains in LLM deployment
  • Challenge: Deploy a simple RAG application using an API
  • Solution: Deploying a simple RAG application using an API
4. Monitoring LLM Performance
  • Introduction to LLM performance monitoring
  • Addressing hallucinations in LLMs
5. Advanced Deployment Techniques
  • Prompt management for LLM deployment
  • Evaluating LLMs in production
  • Challenge: Evaluating LLM systems
  • Solution: Evaluating LLM systems
6. Security and Cost Considerations
  • Security considerations for LLMs in production
  • Balancing costs and performance in LLM deployment
  • Strategies for cost-effective LLM deployment
  • Challenge: Estimating costs of an LLM API
  • Solution: Estimating costs of an LLM API
Conclusion
  • Next steps

Taught by

Soham Chatterjee and Archana Vaidheeswaran

Related Courses

Better Llama with Retrieval Augmented Generation - RAG
James Briggs via YouTube
Live Code Review - Pinecone Vercel Starter Template and Retrieval Augmented Generation
Pinecone via YouTube
Nvidia's NeMo Guardrails - Full Walkthrough for Chatbots - AI
James Briggs via YouTube
Hugging Face LLMs with SageMaker - RAG with Pinecone
James Briggs via YouTube
Supercharge Your LLM Applications with RAG
Data Science Dojo via YouTube