YoVDO

Introduction to Computer Vision

Offered By: Georgia Institute of Technology via Udacity

Tags

Computer Vision Courses, Image Processing Courses, Feature Detection Courses

Course Description

Overview

This course provides an introduction to computer vision, including the fundamentals of image formation, camera imaging geometry, feature detection and matching, multiview geometry (including stereo), motion estimation and tracking, and classification. We’ll develop basic methods for applications that include finding known models in images, depth recovery from stereo, camera calibration, image stabilization, automated alignment (e.g., panoramas), tracking, and action recognition. We focus less on the machine learning aspect of computer vision, since that is essentially classification theory and is best learned in a machine learning course.

The focus of the course is to develop the intuitions and mathematics of the methods in lecture, and then to learn about the difference between theory and practice in the problem sets. All algorithms work perfectly in the slides. But remember what Yogi Berra said: "In theory there is no difference between theory and practice. In practice there is." (Einstein said something similar, but who knows more about real life?) In this course you do not, for the most part, apply high-level library functions; instead you use low- to mid-level algorithms to analyze images and extract structural information.
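As a rough illustration of what working at a low to mid level might look like, the sketch below applies a linear filter (a topic listed in the syllabus) with explicit loops instead of calling a ready-made library filter routine. It is not taken from the course materials: the function name `correlate2d_naive`, the zero-padding choice, the odd-sized kernel assumption, and the 3x3 box kernel are all illustrative assumptions.

```python
import numpy as np


def correlate2d_naive(image, kernel):
    """Cross-correlate a 2D grayscale image with a small odd-sized kernel.

    Hypothetical helper for illustration only. Borders are handled by
    zero padding, so the output has the same shape as the input.
    """
    image = np.asarray(image, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2

    # Surround the image with zeros so every pixel has a full neighborhood.
    padded = np.pad(image, ((ph, ph), (pw, pw)), mode="constant")

    out = np.zeros_like(image)
    for r in range(image.shape[0]):
        for c in range(image.shape[1]):
            # Weighted sum of the neighborhood centered on pixel (r, c).
            window = padded[r:r + kh, c:c + kw]
            out[r, c] = np.sum(window * kernel)
    return out


# Assumed example input: a single bright pixel smoothed by a 3x3 box filter.
box = np.ones((3, 3)) / 9.0
impulse = np.zeros((5, 5))
impulse[2, 2] = 1.0
smoothed = correlate2d_naive(impulse, box)
```

Filtering an impulse like this simply stamps a copy of the kernel around the bright pixel, which makes it a convenient sanity check before trying the same loop on a real image.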


Syllabus

  • Introduction
    • Introduction
  • Image Processing for Computer Vision
    • Linear image processing, Model fitting, Frequency domain analysis
  • Camera Models and Views
    • Camera models, Stereo geometry, Camera calibration, Multiple views
  • Image Features
    • Feature detection, Feature descriptors, Model fitting
  • Lighting
    • Photometry, Lightness, Shape from shading
  • Image Motion
    • Overview, Optical flow
  • Tracking
    • Introduction to tracking, Parametric models, Non-parametric models, Tracking considerations
  • Classification and Recognition
    • Introduction to recognition, Classification: Generative models, Classification: Discriminative models, Action recognition
  • Useful Methods
    • Color spaces and segmentation, Binary morphology, 3D perception
  • Human Visual System
    • The retina, Vision in the brain

Taught by

Irfan Essa and Aaron Bobick

Related Courses

2D image processing
Higher School of Economics via Coursera
3D Reconstruction - Multiple Viewpoints
Columbia University via Coursera
3D Reconstruction - Single Viewpoint
Columbia University via Coursera
Post Graduate Certificate in Advanced Machine Learning & AI
Indian Institute of Technology Roorkee via Coursera
Advanced Computer Vision with TensorFlow
DeepLearning.AI via Coursera