VigNET - An Intelligent Camera App That Assists You
Offered By: PyCon US via YouTube
Course Description
Overview
Explore the development of an intelligent camera application designed to assist visually impaired individuals in understanding their surroundings through a 28-minute talk at PyCon US. Learn about the Visual Question Answering (VQA) app that utilizes deep learning and the Vision Language Transformer (ViLT) model to provide rapid responses to image-related queries. Discover the advantages of ViLT over traditional vision language pre-trained models, best practices for modularizing the application, and steps to deploy this deep learning-based solution on Google Cloud Platform. Gain insights into how built-in Python libraries facilitate the implementation and deployment of complex models like ViLT. Access the open-source code and follow a walkthrough to build your own visual question answering application, complete with speech-to-text and text-to-speech capabilities for enhanced accessibility.
Syllabus
Talk - Padmaja Bhagwat/Manisha R: VigNET: An intelligent camera app that assists you...
Taught by
PyCon US
Related Courses
Programming Cloud Services for Android Handheld SystemsVanderbilt University via Coursera SAP S/4HANA in a Nutshell
SAP Learning Transformation to Hybrid Landscapes
SAP Learning Ruby on Rails: An Introduction
Johns Hopkins University via Coursera Capstone: Photo Tourist Web Application
Johns Hopkins University via Coursera