YoVDO

Expanding RAG with Multimodal Capabilities - Haystack US 2024

Offered By: OpenSource Connections via YouTube

Tags

Retrieval Augmented Generation (RAG) Courses Machine Learning Courses Vector Databases Courses Image Processing Courses Unstructured Data Courses Language Models Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the expansion of Retrieval-Augmented Generation (RAG) workflows to incorporate multimodal capabilities in this conference talk from Haystack US 2024. Delve into the challenges of traditional RAG systems that primarily focus on text-based retrieval, and discover how to leverage Language Models (LLMs) and multimodal embeddings to enhance both retrieval and generation processes. Witness a live demonstration showcasing the processing of PDF documents in a vector database, extracting content from images, tables, and text. Learn how multimodal search can be employed in the retriever and how LLMs can enrich the final response. Gain insights from search specialist Praveen Mohan Prasad and solutions architect Hajer Bouafif on implementing and operationalizing strategies to improve search experiences using Machine Learning and building large-scale Machine Learning search solutions.

Syllabus

Haystack US 2024 - Praveen Mohan Prasad & Hajer Bouafif: Expanding RAG with multimodal capabilities


Taught by

OpenSource Connections

Related Courses

Microsoft Bot Framework and Conversation as a Platform
Microsoft via edX
Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube
Improving Customer Experiences with Speech to Text and Text to Speech
Microsoft via YouTube
Stanford Seminar - Deep Learning in Speech Recognition
Stanford University via YouTube
Select Topics in Python: Natural Language Processing
Codio via Coursera