YoVDO

Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

Offered By: Google via Google Cloud Skills Boost

Tags

Gemini Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond the video using multimodality with Gemini; building metadata of documents containing text and images, getting all relevant text chunks, and printing citations by using Multimodal Retrieval Augmented Generation (RAG) with Gemini. A skill badge is an exclusive digital badge issued by Google Cloud in recognition of your proficiency with Google Cloud products and services and tests your ability to apply your knowledge in an interactive hands-on environment. Complete this skill badge course and the final assessment challenge lab to receive a skill badge that you can share with your network.

Syllabus

  • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
    • Multimodality with Gemini
    • Using Gemini for Multimodal Retail Recommendations
    • Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
    • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG: Challenge Lab
  • Your Next Steps
    • Course Badge

Tags

Related Courses

Learn Google Bard and Gemini
Udemy
Gemini and the Future of Generative AI Tools - Interview with Simon Tokumine
TensorFlow via YouTube
Gemini and GPT Sales Agents with RAG - Comparison and Implementation
echohive via YouTube
Building a Streamlit Interface for Unified Chat with Multiple LLMs
echohive via YouTube
Gemini 1.5 Pro for Code - Building LLM Agents with CrewAI
Sam Witteveen via YouTube