Deploy RAG/AI App to AWS Cloud - Step-by-Step Tutorial
Offered By: pixegami via YouTube
Course Description
Overview
Embark on an advanced step-by-step tutorial to deploy a Python RAG/AI project to the AWS cloud, transforming it into a public API hosted on AWS Lambda for scalability and high performance. Dive into RAG concepts, explore project architecture, and integrate FastAPI. Master Docker image building, implement deployment hacks, and conduct local testing. Learn to construct AWS infrastructure using CDK and create an asynchronous API. Access the provided GitHub repository for code references and explore related videos covering RAG basics, FastAPI, AWS fundamentals, and Docker on Lambda to enhance your understanding of cloud deployment strategies for AI applications.
Syllabus
- Introduction
- RAG Recap
- Project Architecture
- Adding FastAPI
- Building a Docker Image
- Deployment Hacks
- Local Testing With Docker
- Build AWS Infrastructure with CDK
- Creating an Async API
- Wrapping Up
Taught by
pixegami
Related Courses
Artificial Intelligence for RoboticsStanford University via Udacity Intro to Computer Science
University of Virginia via Udacity Design of Computer Programs
Stanford University via Udacity Web Development
Udacity Programming Languages
University of Virginia via Udacity