Multimodal Conversational Interfaces with GPT and Vision AI
Offered By: Microsoft via YouTube
Course Description
Overview
Discover the groundbreaking GPT-4 Visual model from OpenAI, introducing multimodal input and output capabilities in this 41-minute conference talk. Explore the integration of GPT-4 Visual into Azure Cognitive Search and its enhancement with vision embeddings, revolutionizing AI-driven information retrieval. Learn how images and videos can now prompt or supplement prompts to large language models like GPT-4. Gain insights into new multimodal models for Azure AI Content Safety, part of Microsoft's Responsible AI product suite. Presented by a panel of experts including Fisayo Feyisetan, Theodoros Lappas, and Thomas Soemo, this session from Microsoft Ignite 2023 offers valuable resources and information on transforming conversational interfaces with cutting-edge AI technology.
Syllabus
Multimodal Conversational Interfaces with GPT and Vision AI | BRK205
Taught by
Microsoft Ignite
Tags
Related Courses
Generative AI, from GANs to CLIP, with Python and PytorchUdemy ODSC East 2022 Keynote by Luis Vargas, Ph.D. - The Big Wave of AI at Scale
Open Data Science via YouTube Comparing AI Image Caption Models: GIT, BLIP, and ViT+GPT2
1littlecoder via YouTube In Conversation with the Godfather of AI
Collision Conference via YouTube LLaVA: The New Open Access Multimodal AI Model
1littlecoder via YouTube