Convergence of Vision and Language in AI - Recent Developments and Projects
Offered By: Aleksa Gordić - The AI Epiphany via YouTube
Course Description
Overview
Explore the convergence of vision and language in artificial intelligence through this 55-minute talk featuring Lucas Beyer from Google DeepMind. Delve into Beyer's personal journey, understand the motivations behind integrating vision and language, and learn how language serves as an API for vision. Discover the concept of LiT tuning, examine the convergence of architectures in AI, and gain insights into PaLI, a vision-language model. Engage with cutting-edge research and projects in the field of AI, including Vision Transformers (ViT) and other innovative approaches to combining visual and linguistic information processing.
Syllabus
Lucas's story
Motivation
Language as API for vision
LiT tuning
Convergence of architectures
PaLI vision language model
Taught by
Aleksa Gordić - The AI Epiphany
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Computational Photography
Georgia Institute of Technology via Coursera Einführung in Computer Vision
Technische Universität München (Technical University of Munich) via Coursera Introduction to Computer Vision
Georgia Institute of Technology via Udacity