YoVDO

VigNET - An Intelligent Camera App That Assists You

Offered By: PyCon US via YouTube

Tags

PyCon US Courses Deep Learning Courses Python Courses Cloud Deployment Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the development of an intelligent camera application designed to assist visually impaired individuals in understanding their surroundings through a 28-minute talk at PyCon US. Learn about the Visual Question Answering (VQA) app that utilizes deep learning and the Vision Language Transformer (ViLT) model to provide rapid responses to image-related queries. Discover the advantages of ViLT over traditional vision language pre-trained models, best practices for modularizing the application, and steps to deploy this deep learning-based solution on Google Cloud Platform. Gain insights into how built-in Python libraries facilitate the implementation and deployment of complex models like ViLT. Access the open-source code and follow a walkthrough to build your own visual question answering application, complete with speech-to-text and text-to-speech capabilities for enhanced accessibility.

Syllabus

Talk - Padmaja Bhagwat/Manisha R: VigNET: An intelligent camera app that assists you...


Taught by

PyCon US

Related Courses

Programming Cloud Services for Android Handheld Systems
Vanderbilt University via Coursera
SAP S/4HANA in a Nutshell
SAP Learning
Transformation to Hybrid Landscapes
SAP Learning
Ruby on Rails: An Introduction
Johns Hopkins University via Coursera
Capstone: Photo Tourist Web Application
Johns Hopkins University via Coursera