Deploy RAG/AI App to AWS Cloud - Step-by-Step Tutorial
Offered By: pixegami via YouTube
Course Description
Overview
This advanced, step-by-step tutorial shows how to deploy a Python RAG/AI project to the AWS cloud as a scalable public API hosted on AWS Lambda. It recaps RAG concepts, walks through the project architecture, and adds FastAPI to the project. It then covers building a Docker image, deployment hacks, and local testing with Docker, followed by building the AWS infrastructure with CDK and creating an asynchronous API. A GitHub repository is provided for code reference, and related videos cover RAG basics, FastAPI, AWS fundamentals, and Docker on Lambda.
Syllabus
- Introduction
- RAG Recap
- Project Architecture
- Adding FastAPI
- Building a Docker Image
- Deployment Hacks
- Local Testing With Docker
- Build AWS Infrastructure with CDK
- Creating an Async API
- Wrapping Up
Taught by
pixegami
Related Courses
- Cloud Computing Applications, Part 1: Cloud Systems and Infrastructure (University of Illinois at Urbana-Champaign via Coursera)
- Introduction to Cloud Infrastructure Technologies (Linux Foundation via edX)
- Introduction aux conteneurs (Microsoft Virtual Academy via OpenClassrooms)
- The Docker for DevOps course: From development to production (Udemy)
- Windows Server 2016: Virtualization (Microsoft via edX)