Video Content Understanding Using Text
Offered By: University of Central Florida via YouTube
Course Description
Overview
Syllabus
Intro
Motivation
Challenges
Algorithm
Training
Video Representation
Scoring function
Optimization - Updating Rules
Exemplar queries
Test on Unseen Queries
Qualitative results
Sentence Encoder
Spatial Attention Network • Which regions of the frames to look?
Temporal Attention Model
Inference Module
Experiments
Limitations
What is an Inaccuracy?
Formulation
Detection By Reconstruction
Visual Features
Inaccuracy Detection
Correction
Last two chapters
How about the opposite problem?
Problem Definition
Proposed Approach - Generator Block Diagram
Text Encoding
Start and End Distributions
Latent Path Construction
Conditional BatchNormalization (CBN)
Frame Generation
UpPooling Block Details
Proposed Approach - Discriminator
Loss Function - Generator
Hinge GAN-Loss on Discriminator
Evaluation Metrics
A2D Quantitative Results
A2D Results
Robotic Results
Dissertation Summary
Future Work
Taught by
UCF CRCV
Tags
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Computational Photography
Georgia Institute of Technology via Coursera Einführung in Computer Vision
Technische Universität München (Technical University of Munich) via Coursera Introduction to Computer Vision
Georgia Institute of Technology via Udacity