Neural-Symbolic VQA - Disentangling Reasoning from Vision and Language Understanding

Offered By: University of Central Florida via YouTube

Course Description

Overview

Explore the innovative approach to Visual Question Answering (VQA) that disentangles reasoning from vision and language understanding in this 27-minute lecture from the University of Central Florida. Delve into the task breakdown, architecture overview, and key components such as question parsing and program execution. Examine quantitative results on CLEVR and CLEVR-Humans datasets, and discover how this neural-symbolic method extends to new scenes like Minecraft. Gain insights into the future of AI systems that can effectively combine reasoning with visual and linguistic comprehension.

Syllabus

Intro
Visual Question Answering
Task Breakdown
Architecture Overview
Question Parsing
Program Execution
Training
Quantitative results on CLEVR
CLEVR-Humans & Results
New Scenes: Minecraft
Summary

Taught by

UCF CRCV

Related Courses

LearnToMod For Educators
University of California, San Diego via Coursera Minecraft, Coding and Teaching
University of California, San Diego via edX Complete YouTube Gaming Course: Attract 500,000 Subs in 2021
Udemy Modding By Kaupenjoe: Minecraft Modding for beginners (Version 1-16-X)
Skillshare CVE-2021-44228 - Log4j - Minecraft Vulnerable and So Much More
John Hammond via YouTube

Neural-Symbolic VQA - Disentangling Reasoning from Vision and Language Understanding

Tags

Course Description

Overview

Syllabus

Taught by

Tags

Related Courses

Neural-Symbolic VQA - Disentangling Reasoning from Vision and Language Understanding

Tags

Course Description

Overview

Syllabus

Taught by

Tags

Related Courses

Login to Continue