Build an Image Captioning Tool for Visually Impaired Users with Gemini
Offered By: LinkedIn Learning
Course Description
Overview
Find out how artificial intelligence can help you make better web experiences for visually impaired users.
Syllabus
Introduction
- Image captioning with AI
- What you should know
- Who this course is for
- Understanding Gemini models
- Gemini pricing
- Signing up for an Google AI Studio account
- Getting your API key
- Cloning the seed project
- Project code walkthrough
- Adding the image upload functionality
- Adding the prompt functionality
- Writing the caption display
- Building out the Express.js API
- Configuring the Generative AI SDK
- Adding routes
- Setting up file upload functionality
- Writing the prompt request and response
- Connecting the frontend to the API
- Adding a progress indicator
- Using the Web Speech API to read captions
- Next steps
Taught by
Fikayo Adepoju
Related Courses
Deep Learning For Visual ComputingIndian Institute of Technology, Kharagpur via Swayam Literacy Essentials: Core Concepts Generative Adversarial Network
Pluralsight Machine Learning & Deep Learning Projects
The AI University via YouTube Implement Image Captioning with Recurrent Neural Networks
Pluralsight VirTex- Learning Visual Representations from Textual Annotations
Yannic Kilcher via YouTube