Transforming Computer Vision with LLMs - Key Projects and Insights
Offered By: Data Science Dojo via YouTube
Course Description
Overview
Explore the revolutionary impact of large language models (LLMs) on computer vision in this 44-minute talk. Gain insights into how text-only LLMs are achieving remarkable success in visual understanding through prompting and tool use. Discover key LLM-centered projects transforming the field, including VisProg, ViperGPT, VoxelGPT, and HuggingGPT. Learn about the challenges and lessons from building VoxelGPT, and acquire practitioner's insights into domain-specific prompt engineering. Delve into the future prospects of LLMs in computer vision. Suitable for researchers and practitioners interested in computer vision, generative AI, LLMs, and machine learning.
Syllabus
Transforming Computer Vision with LLMs
Taught by
Data Science Dojo
Related Courses
Building and Managing Superior SkillsState University of New York via Coursera ChatGPT et IA : mode d'emploi pour managers et RH
CNAM via France Université Numerique Digital Skills: Artificial Intelligence
Accenture via FutureLearn AI Foundations for Everyone
IBM via Coursera Design a Feminist Chatbot
Institute of Coding via FutureLearn