YoVDO

Towards Machines that Perceive and Communicate

Offered By: MITCBMM via YouTube

Tags

Semantic Segmentation Courses Object Detection Courses Pose Estimation Courses

Course Description

Overview

Explore cutting-edge research in visual scene understanding and grounded language comprehension in this 1-hour 22-minute talk by Kevin Murphy from Google Research. Delve into topics such as semantic segmentation, object detection, instance segmentation, and person detection/pose estimation, including award-winning systems like DeepLab and entries in the COCO'16 competition. Discover work on visually grounded referring expressions, discriminative image captioning, and generative models of visual imagination. Learn how these components can be integrated to create systems that better comprehend images and words, advancing the field of AI and machine learning. Gain insights from Murphy's extensive experience in computer science, statistics, and machine learning, spanning academia and industry.

Syllabus

Intro
Vail
Agenda
Vision and Language
Deep Understanding
Image Classification
Labeled Data
Semantic Segmentation
Classification Problems
Standard Metrics
Urban Data
Object Detection
Universe En
Jonathan
Demo
Results


Taught by

MITCBMM

Related Courses

Mastering Image Segmentation with PyTorch
Packt via Coursera
Semantic Segmentation Explained (Traditional Chinese)
Amazon Web Services via AWS Skill Builder
Computer Vision For iOS Developers Course
Udemy
Pretrained CNN Features for Semantic Segmentation Using Random Forest
DigitalSreeni via YouTube
Attacking Optical Flow
Andreas Geiger via YouTube