YoVDO

RAG Has Been Oversimplified - Exploring Complexities in Retrieval Augmented Generation

Offered By: MLOps.community via YouTube

Tags

Retrieval Augmented Generation (RAG) Courses, MLOps Courses, Prompt Engineering Courses, Vector Databases Courses, Embeddings Courses, Similarity Search Courses, Multimodal AI Courses

Course Description

Overview

Explore the complexities of Retrieval Augmented Generation (RAG) in this 49-minute MLOps podcast episode featuring Yujian Tang, Developer Advocate at Zilliz. Delve into the nuanced challenges developers face when implementing RAG, moving beyond industry oversimplifications. Learn about storing embeddings in vector databases, where the line between large and small language models is drawn, and how QA bots work behind the scenes. Discover critical components of the RAG stack, including citation building, the difference between context and relevance, and similarity search. Examine RAG optimization techniques, discuss scenarios where RAG may not be suitable, and explore multimodal RAG applications. Gain insights into fashion app development and video citation methods while understanding the trade-offs in LLM interactions.

Syllabus

Yujian's preferred coffee
Takeaways
Please like, share, and subscribe to our MLOps channels!
The hero of the LLM space
Embeddings into Vector databases
What is large and what is small LLM consensus
QA Bot behind the scenes
Fun fact getting more context
RAGs eliminate the ability of LLMs to hallucinate
Critical part of the RAG stack
Building citations
Difference between context and relevance
Missing prompt tooling
Similarity search
RAG Optimization
Interacting with LLMs and tradeoffs
RAGs not suited for
Fashion App
Multimodal RAGs vs LLM RAGs
Multimodal use cases
Video citations
Wrap up


Taught by

MLOps.community

Related Courses

Better Llama with Retrieval Augmented Generation - RAG
James Briggs via YouTube
Live Code Review - Pinecone Vercel Starter Template and Retrieval Augmented Generation
Pinecone via YouTube
Nvidia's NeMo Guardrails - Full Walkthrough for Chatbots - AI
James Briggs via YouTube
Hugging Face LLMs with SageMaker - RAG with Pinecone
James Briggs via YouTube
Supercharge Your LLM Applications with RAG
Data Science Dojo via YouTube