YoVDO

Vision Language Models and PDFs: What You See Is What You Search - Haystack EU 2024

Offered By: OpenSource Connections via YouTube

Tags

Information Retrieval Courses Vision-Language Models Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a groundbreaking approach to information extraction from complex PDF documents in this conference talk from Haystack EU 2024. Discover how Vision Language Models (VLMs) are revolutionizing the traditional multi-step process of text extraction, OCR, layout analysis, chunking, and embedding. Learn about ColPali, a new retrieval model that efficiently embeds entire PDF pages, including text, figures, and charts, resulting in improved retrieval quality and a simplified extraction and indexing process. Gain insights into representing ColPali in Vespa and its superior performance on the Visual Document Retrieval (ViDoRe) Benchmark. Benefit from the expertise of Jo Kristian, Chief Scientist at Vespa.ai, as he shares his two decades of experience in building and deploying search and recommender systems.

Syllabus

Haystack EU 2024 - Jo Kristian Bergum:What You See Is What You Search: Vision Language Models & PDFs


Taught by

OpenSource Connections

Related Courses

Amazon Kendra Getting Started (Japanese)
Amazon Web Services via AWS Skill Builder
Amazon Q Business Getting Started (Simplified Chinese)
Amazon Web Services via AWS Skill Builder
AWS Flash - A Hands-On Look at Amazon Q Business Expert
Amazon Web Services via AWS Skill Builder
AWS SimuLearn: Documents Indexing and Search
Amazon Web Services via AWS Skill Builder
AWS SimuLearn: Extract Text from Docs
Amazon Web Services via AWS Skill Builder