Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
Offered By: Google via Google Cloud Skills Boost
Course Description
Overview
Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond the video using multimodality with Gemini; building metadata of documents containing text and images, getting all relevant text chunks, and printing citations by using Multimodal Retrieval Augmented Generation (RAG) with Gemini. A skill badge is an exclusive digital badge issued by Google Cloud in recognition of your proficiency with Google Cloud products and services and tests your ability to apply your knowledge in an interactive hands-on environment. Complete this skill badge course and the final assessment challenge lab to receive a skill badge that you can share with your network.
Syllabus
- Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
- Multimodality with Gemini
- Using Gemini for Multimodal Retail Recommendations
- Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
- Inspect Rich Documents with Gemini Multimodality and Multimodal RAG: Challenge Lab
- Your Next Steps
- Course Badge
Tags
Related Courses
Learn Google Bard and GeminiUdemy Gemini and the Future of Generative AI Tools - Interview with Simon Tokumine
TensorFlow via YouTube Gemini and GPT Sales Agents with RAG - Comparison and Implementation
echohive via YouTube Building a Streamlit Interface for Unified Chat with Multiple LLMs
echohive via YouTube Gemini 1.5 Pro for Code - Building LLM Agents with CrewAI
Sam Witteveen via YouTube