Build an Image Captioning Tool for Visually Impaired Users with Gemini

Offered By: LinkedIn Learning

Tags

Artificial Intelligence Courses Web Accessibility Courses Backend Development Courses Image Captioning Courses Gemini Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Find out how artificial intelligence can help you make better web experiences for visually impaired users.

Syllabus

Introduction

Image captioning with AI
What you should know
Who this course is for

1. Setting Up Access to Gemini API

Understanding Gemini models
Gemini pricing
Signing up for an Google AI Studio account
Getting your API key

2. Building the Interface

Cloning the seed project
Project code walkthrough
Adding the image upload functionality
Adding the prompt functionality
Writing the caption display

3. Building the Backend: Connecting to Gemini

Building out the Express.js API
Configuring the Generative AI SDK
Adding routes
Setting up file upload functionality
Writing the prompt request and response

4. Bringing It All Together

Connecting the frontend to the API
Adding a progress indicator
Using the Web Speech API to read captions

Conclusion

Next steps

Taught by

Fikayo Adepoju

Related Courses

Designing RESTful APIs
Udacity Introduction to NodeJS
Microsoft via edX Exploring GraphQL: A Query Language for APIs
Linux Foundation via edX Build a Google Firebase Web Application
Coursera Project Network via Coursera Build a Twitter Clone Backend
Coursera Project Network via Coursera