Analysis of Large-Scale Visual Recognition - Bay Area Vision Meeting
Offered By: Meta via YouTube
Course Description
Overview
Explore the evolution and challenges of large-scale visual recognition in this 40-minute Meta conference talk from the Bay Area Vision Meeting. Dive into the IMAGENET Large Scale Visual Recognition Challenge (ILSVRC) from 2010-2012, examining its diverse object classes and two main tasks: Classification and Classification + Localization. Analyze the performance of various systems, including SuperVision (SV) and VGG, across different scales and object types. Investigate the impact of clutter, texture, and scale on recognition accuracy. Conclude with insights into the ILSVRC 2013 large-scale object detection task, gaining a comprehensive understanding of the field's progress and future directions.
Syllabus
Intro
Large-scale recognition
IMAGENET Large Scale Visual Recognition Challenge (ILSVRC) 2010-2012
Variety of object classes in ILSVRC
ILSVRC Task 1: Classification
ILSVRC Task 2: Classification + Localization
ILSVRC (2012)
Chance Performance of Localization
Level of clutter
SuperVision (SV)
Difference in accuracy: SV versus VGG
Cumulative accuracy across scales Classification only
Textured objects (ILSVRC-500)
Textured objects (416 classes)
Localizing textured objects (416 classes, same average object scale at each level of texture)
ILSVRC 2013 with large-scale object detection
Taught by
Meta Developers
Related Courses
Computer Vision: The FundamentalsUniversity of California, Berkeley via Coursera Detección de objetos
Universitat Autònoma de Barcelona (Autonomous University of Barcelona) via Coursera Deep Learning Summer School
Independent Deep Learning in Computer Vision
Higher School of Economics via Coursera Computer Vision and Image Analysis
Microsoft via edX