YoVDO

Build an Image Captioning Tool for Visually Impaired Users with Gemini

Offered By: LinkedIn Learning

Tags

Artificial Intelligence Courses
Web Accessibility Courses
Backend Development Courses
Image Captioning Courses
Gemini Courses

Course Description

Overview

Find out how artificial intelligence can help you make better web experiences for visually impaired users.

Syllabus

Introduction
  • Image captioning with AI
  • What you should know
  • Who this course is for
1. Setting Up Access to Gemini API
  • Understanding Gemini models
  • Gemini pricing
  • Signing up for a Google AI Studio account
  • Getting your API key
2. Building the Interface
  • Cloning the seed project
  • Project code walkthrough
  • Adding the image upload functionality
  • Adding the prompt functionality
  • Writing the caption display
3. Building the Backend: Connecting to Gemini
  • Building out the Express.js API
  • Configuring the Generative AI SDK
  • Adding routes
  • Setting up file upload functionality
  • Writing the prompt request and response
4. Bringing It All Together
  • Connecting the frontend to the API
  • Adding a progress indicator
  • Using the Web Speech API to read captions
Conclusion
  • Next steps

Taught by

Fikayo Adepoju

Related Courses

Accessible Landing Page Solutions in XD
Coursera Project Network via Coursera
Accessible Docs
Cabrillo College via California Community Colleges System
Accessible Media
Cabrillo College via California Community Colleges System
Web Production I
City College of San Francisco via California Community Colleges System