YoVDO

LayoutLM - Pre-training of Text and Layout for Document Image Understanding

Offered By: BIMSA via YouTube

Tags

Computer Vision Courses Artificial Intelligence Courses Machine Learning Courses Optical Character Recognition Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 48-minute conference talk by Yiheng Xu from BIMSA on LayoutLM, a groundbreaking pre-training model for document image understanding. Discover how LayoutLM innovatively combines text and layout information from scanned documents, addressing a crucial gap in traditional NLP pre-training techniques. Learn about the model's unique approach to jointly processing textual content and spatial layout, enhancing its effectiveness in tasks like information extraction from scanned documents. Gain insights into how LayoutLM incorporates visual features to further enrich its understanding of document structure. Understand the significance of this pioneering framework that, for the first time, integrates text and layout learning for document-level pre-training, potentially revolutionizing various real-world document processing applications.

Syllabus

Yiheng Xu: LayoutLM: Pre-training of Text and Layout for Document Image Understanding #ICBS2024


Taught by

BIMSA

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent