Introduction to Multimodal Prompting for Generative AI
Offered By: LinkedIn Learning
Course Description
Overview
Learn how to work with modern AI systems that accept and generate multiple modalities, including text, images, audio, and video.
Syllabus
Introduction
- GenAI with multimodal prompts
- What is multimodality?
- Visual modality
- Textual and auditory modality
- GPT-4 and 4o
- Text to image in GPT-4
- GPT-4 API with various input types
- Challenge: Drawing to code
- Solution: Drawing to code
- What is Gemini?
- Images in Gemini
- Gemini video inputs
- Challenge: Video narration
- Solution: Video narration
- Audio in generative AI
- Prompt and audio
- Generating music
- Challenge: Soundtrack creation
- Solution: Soundtrack creation
- Next steps
Taught by
Ronnie Sheer
Related Courses
- Learn Google Bard and Gemini (Udemy)
- Gemini and the Future of Generative AI Tools - Interview with Simon Tokumine (TensorFlow via YouTube)
- Gemini and GPT Sales Agents with RAG - Comparison and Implementation (echohive via YouTube)
- Building a Streamlit Interface for Unified Chat with Multiple LLMs (echohive via YouTube)
- Gemini 1.5 Pro for Code - Building LLM Agents with CrewAI (Sam Witteveen via YouTube)