Build an Image Captioning Tool for Visually Impaired Users with Gemini
Offered By: LinkedIn Learning
Course Description
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Find out how artificial intelligence can help you make better web experiences for visually impaired users.
Syllabus
Introduction
- Image captioning with AI
- What you should know
- Who this course is for
- Understanding Gemini models
- Gemini pricing
- Signing up for an Google AI Studio account
- Getting your API key
- Cloning the seed project
- Project code walkthrough
- Adding the image upload functionality
- Adding the prompt functionality
- Writing the caption display
- Building out the Express.js API
- Configuring the Generative AI SDK
- Adding routes
- Setting up file upload functionality
- Writing the prompt request and response
- Connecting the frontend to the API
- Adding a progress indicator
- Using the Web Speech API to read captions
- Next steps
Taught by
Fikayo Adepoju
Related Courses
Accessible Landing Page Solutions in XDCoursera Project Network via Coursera Accessible Docs
Cabrillo College via California Community Colleges System Accessible Docs
Cabrillo College via California Community Colleges System Accessible Media
Cabrillo College via California Community Colleges System Web Production I
City College of San Francisco via California Community Colleges System