Build an Image Captioning Tool for Visually Impaired Users with Gemini
Offered By: LinkedIn Learning
Course Description
Overview
Find out how artificial intelligence can help you make better web experiences for visually impaired users.
Syllabus
Introduction
- Image captioning with AI
- What you should know
- Who this course is for
- Understanding Gemini models
- Gemini pricing
- Signing up for an Google AI Studio account
- Getting your API key
- Cloning the seed project
- Project code walkthrough
- Adding the image upload functionality
- Adding the prompt functionality
- Writing the caption display
- Building out the Express.js API
- Configuring the Generative AI SDK
- Adding routes
- Setting up file upload functionality
- Writing the prompt request and response
- Connecting the frontend to the API
- Adding a progress indicator
- Using the Web Speech API to read captions
- Next steps
Taught by
Fikayo Adepoju
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Artificial Intelligence for Robotics
Stanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent