Introduction to Multimodal Prompting for Generative AI
Offered By: LinkedIn Learning
Course Description
Overview
Learn how to work with modern AI systems that accept and generate multiple modalities, including text, images, audio, and video.
Syllabus
Introduction
- GenAI with multimodal prompts
- What is multimodality?
- Visual modality
- Textual and auditory modality
- GPT-4 and 4o
- Text to image in GPT-4
- GPT-4 API with various input types
- Challenge: Drawing to code
- Solution: Drawing to code
- What is Gemini?
- Images in Gemini
- Gemini video inputs
- Challenge: Video narration
- Solution: Video narration
- Audio in generative AI
- Prompt and audio
- Generating music
- Challenge: Soundtrack creation
- Solution: Soundtrack creation
- Next steps
Taught by
Ronnie Sheer
Related Courses
- Introduction to Artificial Intelligence, offered by Stanford University via Udacity
- Computer Vision: The Fundamentals, offered by University of California, Berkeley via Coursera
- Computational Photography, offered by Georgia Institute of Technology via Coursera
- Einführung in Computer Vision, offered by Technische Universität München (Technical University of Munich) via Coursera
- Introduction to Computer Vision, offered by Georgia Institute of Technology via Udacity