Improving Complex RAG Systems and Achieving No-Regret, Lightning-Fast Deployment Iterations of LLMs
Offered By: Data Science Festival via YouTube
Course Description
Overview
Explore advanced techniques for enhancing complex Retrieval Augmented Generation (RAG) systems and implementing rapid, risk-free deployment iterations of Large Language Models (LLMs) in this 33-minute talk from the Data Science Festival. Delve into strategies to overcome LLM limitations such as outdated training data, restricted context windows, latency, and API rate limits. Learn how to leverage AWS lambda aliases for deploying "shadow" versions and feature flagging beta releases, enabling safe and swift iterations of LLM-based applications. Discover methods to evaluate system performance in production using unique data gathered from these deployment techniques. Gain insights applicable to ML/software engineers and data scientists, with a focus on practical coding experience in LLMs, RAG, and AWS lambdas or equivalent technologies.
Syllabus
Improving complex RAG systems and achieving no regret lightning fast deployment iterations of LLMs
Taught by
Data Science Festival
Related Courses
Data AnalysisJohns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Scientific Computing
University of Washington via Coursera Introduction to Data Science
University of Washington via Coursera Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera