LayoutLM - Pre-training of Text and Layout for Document Image Understanding

Offered By: BIMSA via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore a 48-minute conference talk by Yiheng Xu from BIMSA on LayoutLM, a groundbreaking pre-training model for document image understanding. Discover how LayoutLM innovatively combines text and layout information from scanned documents, addressing a crucial gap in traditional NLP pre-training techniques. Learn about the model's unique approach to jointly processing textual content and spatial layout, enhancing its effectiveness in tasks like information extraction from scanned documents. Gain insights into how LayoutLM incorporates visual features to further enrich its understanding of document structure. Understand the significance of this pioneering framework that, for the first time, integrates text and layout learning for document-level pre-training, potentially revolutionizing various real-world document processing applications.

Syllabus

Yiheng Xu: LayoutLM: Pre-training of Text and Layout for Document Image Understanding #ICBS2024

Taught by

BIMSA

LayoutLM - Pre-training of Text and Layout for Document Image Understanding

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

LayoutLM - Pre-training of Text and Layout for Document Image Understanding

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

Login to Continue