Intro to Multi-Modal ML with OpenAI's CLIP
Offered By: James Briggs via YouTube
Course Description
Overview
Explore OpenAI's CLIP, a multi-modal model capable of understanding relationships between text and images, in this 23-minute tutorial. Learn how to use CLIP via the Hugging Face library to create text and image embeddings, perform text-image similarity searches, and explore alternative image and text search methods. Gain practical insights into multi-modal machine learning and discover the power of CLIP in bridging the gap between textual and visual data processing.
Syllabus
Intro
What is CLIP?
Getting started
Creating text embeddings
Creating image embeddings
Embedding a lot of images
Text-image similarity search
Alternative image and text search
Taught by
James Briggs
Related Courses
TensorFlow for NLP: Text Embedding and ClassificationCoursera Project Network via Coursera Google Sites Essential Training
LinkedIn Learning 2024 Advanced Machine Learning and Deep Learning Projects
Udemy OpenAI Python API Bootcamp: Learn to use AI, GPT, and more!
Udemy Prompt Engineering - Understanding Large Language Models with ChatGPT
Prodramp via YouTube