YoVDO

Build an Image Captioning Tool for Visually Impaired Users with Gemini

Offered By: LinkedIn Learning

Tags

Artificial Intelligence Courses
Web Accessibility Courses
Backend Development Courses
Image Captioning Courses
Gemini Courses

Course Description

Overview

Find out how artificial intelligence can help you make better web experiences for visually impaired users.

Syllabus

Introduction
  • Image captioning with AI
  • What you should know
  • Who this course is for
1. Setting Up Access to Gemini API
  • Understanding Gemini models
  • Gemini pricing
  • Signing up for a Google AI Studio account
  • Getting your API key
2. Building the Interface
  • Cloning the seed project
  • Project code walkthrough
  • Adding the image upload functionality
  • Adding the prompt functionality
  • Writing the caption display
3. Building the Backend: Connecting to Gemini
  • Building out the Express.js API
  • Configuring the Generative AI SDK
  • Adding routes
  • Setting up file upload functionality
  • Writing the prompt request and response
4. Bringing It All Together
  • Connecting the frontend to the API
  • Adding a progress indicator
  • Using the Web Speech API to read captions
Conclusion
  • Next steps

Taught by

Fikayo Adepoju

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Artificial Intelligence for Robotics
Stanford University via Udacity
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent