YoVDO

Build an Image Captioning Tool for Visually Impaired Users with Gemini

Offered By: LinkedIn Learning

Tags

Artificial Intelligence Courses Web Accessibility Courses Backend Development Courses Image Captioning Courses Gemini Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Find out how artificial intelligence can help you make better web experiences for visually impaired users.

Syllabus

Introduction
  • Image captioning with AI
  • What you should know
  • Who this course is for
1. Setting Up Access to Gemini API
  • Understanding Gemini models
  • Gemini pricing
  • Signing up for an Google AI Studio account
  • Getting your API key
2. Building the Interface
  • Cloning the seed project
  • Project code walkthrough
  • Adding the image upload functionality
  • Adding the prompt functionality
  • Writing the caption display
3. Building the Backend: Connecting to Gemini
  • Building out the Express.js API
  • Configuring the Generative AI SDK
  • Adding routes
  • Setting up file upload functionality
  • Writing the prompt request and response
4. Bringing It All Together
  • Connecting the frontend to the API
  • Adding a progress indicator
  • Using the Web Speech API to read captions
Conclusion
  • Next steps

Taught by

Fikayo Adepoju

Related Courses

Designing RESTful APIs
Udacity
Introduction to NodeJS
Microsoft via edX
Exploring GraphQL: A Query Language for APIs
Linux Foundation via edX
Build a Google Firebase Web Application
Coursera Project Network via Coursera
Build a Twitter Clone Backend
Coursera Project Network via Coursera