VigNET - An Intelligent Camera App That Assists You
Offered By: PyCon US via YouTube
Course Description
Overview
Explore the development of an intelligent camera application designed to assist visually impaired individuals in understanding their surroundings through a 28-minute talk at PyCon US. Learn about the Visual Question Answering (VQA) app that utilizes deep learning and the Vision Language Transformer (ViLT) model to provide rapid responses to image-related queries. Discover the advantages of ViLT over traditional vision language pre-trained models, best practices for modularizing the application, and steps to deploy this deep learning-based solution on Google Cloud Platform. Gain insights into how built-in Python libraries facilitate the implementation and deployment of complex models like ViLT. Access the open-source code and follow a walkthrough to build your own visual question answering application, complete with speech-to-text and text-to-speech capabilities for enhanced accessibility.
Syllabus
Talk - Padmaja Bhagwat/Manisha R: VigNET: An intelligent camera app that assists you...
Taught by
PyCon US
Related Courses
Intro to Python for Brand New ProgrammersPyCon US via YouTube Comprehending Comprehensions
PyCon US via YouTube Data Analysis with SQLite and Python
PyCon US via YouTube Build a Production Ready GraphQL API Using Python
PyCon US via YouTube Web Development With A Python-backed Frontend - Featuring HTMX and Tailwind
PyCon US via YouTube