Expanding RAG with Multimodal Capabilities - Haystack US 2024
Offered By: OpenSource Connections via YouTube
Course Description
Overview
Explore the expansion of Retrieval-Augmented Generation (RAG) workflows to incorporate multimodal capabilities in this conference talk from Haystack US 2024. Delve into the challenges of traditional RAG systems that primarily focus on text-based retrieval, and discover how to leverage Language Models (LLMs) and multimodal embeddings to enhance both retrieval and generation processes. Witness a live demonstration showcasing the processing of PDF documents in a vector database, extracting content from images, tables, and text. Learn how multimodal search can be employed in the retriever and how LLMs can enrich the final response. Gain insights from search specialist Praveen Mohan Prasad and solutions architect Hajer Bouafif on implementing and operationalizing strategies to improve search experiences using Machine Learning and building large-scale Machine Learning search solutions.
Syllabus
Haystack US 2024 - Praveen Mohan Prasad & Hajer Bouafif: Expanding RAG with multimodal capabilities
Taught by
OpenSource Connections
Related Courses
Microsoft Bot Framework and Conversation as a PlatformMicrosoft via edX Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube Improving Customer Experiences with Speech to Text and Text to Speech
Microsoft via YouTube Stanford Seminar - Deep Learning in Speech Recognition
Stanford University via YouTube Select Topics in Python: Natural Language Processing
Codio via Coursera