YoVDO

The Real End-to-End RAG Stack - Lecture 217

Offered By: MLOps.community via YouTube

Tags

Retrieval Augmented Generation Courses MLOps Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Dive into a comprehensive podcast episode featuring Sam Bean, Software Engineer in Applied AI at Rewind.ai, discussing the intricacies of the real end-to-end Retrieval-Augmented Generation (RAG) stack. Explore the complexities of running retrieval and inference infrastructure for Language Model (LLM) applications, the concept of the RAG flywheel, methods for maintaining response quality, and techniques for pruning and deduplicating documents. Gain insights into Sam's experience in training, evaluating, and deploying production-grade inference solutions for language models, as well as his background in personalization algorithms. Learn about cutting-edge concepts like REinforced Self Training (REST) and its integration with REACT. Discover practical approaches to search challenges, expensive evaluation processes, and the importance of data quality in machine learning operations. Delve into discussions on multimodal RAG, content-focused approaches, and the implementation of DSPy in production environments. This episode offers valuable knowledge for professionals and enthusiasts in the fields of artificial intelligence, machine learning, and natural language processing.

Syllabus

[] Sam's preferred coffee
[] Takeaways
[] A competitive coding pinball player
[] Sam's MLOps journey
[] Search Challenges with ML
[] Expensive evaluation
[] Labeling Parties Boost Data Quality
[] Zeno's Paradox of Motion
[] Sam's job at Rewind AI
[] Multimodal RAG
[30:59 - ] Zilliz Ad
[] University of Prague paper leak
[] Signals behind the scenes
[] Content Over Metadata Approach
[] Optionality around evaluation and search
[] Incremental Robustness Building
[] Solid Foundations for Success
[] Production RAGs
[] Thoughts on DSPy
[] Using DSPy in Production
[] Wrap up


Taught by

MLOps.community

Related Courses

Machine Learning Operations (MLOps): Getting Started
Google Cloud via Coursera
Проектирование и реализация систем машинного обучения
Higher School of Economics via Coursera
Demystifying Machine Learning Operations (MLOps)
Pluralsight
Machine Learning Engineer with Microsoft Azure
Microsoft via Udacity
Machine Learning Engineering for Production (MLOps)
DeepLearning.AI via Coursera