YoVDO

VigNET - An Intelligent Camera App That Assists You

Offered By: PyCon US via YouTube

Tags

PyCon US Courses Deep Learning Courses Python Courses Cloud Deployment Courses

Course Description

Overview

In this 28-minute PyCon US talk, explore the development of an intelligent camera application designed to help visually impaired people understand their surroundings. Learn about the Visual Question Answering (VQA) app, which uses deep learning and the Vision-and-Language Transformer (ViLT) model to answer questions about an image quickly. Discover the advantages of ViLT over traditional vision-language pre-trained models, best practices for modularizing the application, and the steps to deploy this deep learning-based solution on Google Cloud Platform. Gain insight into how built-in Python libraries simplify the implementation and deployment of complex models like ViLT. Access the open-source code and follow a walkthrough to build your own visual question answering application, complete with speech-to-text and text-to-speech capabilities for accessibility.
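As a rough illustration of the modular design the description mentions, the three stages (speech-to-text, visual question answering, text-to-speech) can be kept behind small interchangeable interfaces. The sketch below is not taken from the VigNET code base: the class name, the function names, and the stub components are all hypothetical, and the stubs stand in for real speech and ViLT components so the example runs without any model.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical interfaces; none of these names come from the VigNET project.
SpeechToText = Callable[[bytes], str]     # spoken question -> question text
VisualQA = Callable[[bytes, str], str]    # (image, question) -> answer text
TextToSpeech = Callable[[str], bytes]     # answer text -> spoken answer


@dataclass
class AssistantPipeline:
    """Chains the three stages so each one can be swapped independently."""
    transcribe: SpeechToText
    answer: VisualQA
    speak: TextToSpeech

    def run(self, image: bytes, question_audio: bytes) -> bytes:
        question = self.transcribe(question_audio)
        reply = self.answer(image, question)
        return self.speak(reply)


# Stand-in components (assumptions for illustration only).
def fake_transcribe(audio: bytes) -> str:
    # Pretends the audio bytes already contain the question as UTF-8 text.
    return audio.decode("utf-8")


def fake_answer(image: bytes, question: str) -> str:
    # A real component would pass the image and question to a VQA model.
    return f"stub answer to: {question}"


def fake_speak(text: str) -> bytes:
    # Pretends to synthesize speech by encoding the text as bytes.
    return text.encode("utf-8")


pipeline = AssistantPipeline(fake_transcribe, fake_answer, fake_speak)
spoken_reply = pipeline.run(b"<image bytes>", b"what is in front of me?")
```

With this layout, replacing `fake_answer` with a function that wraps a ViLT checkpoint would not require changes to the speech components, which is one way to read the talk's emphasis on modularization.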

Syllabus

Talk - Padmaja Bhagwat/Manisha R: VigNET: An intelligent camera app that assists you...


Taught by

PyCon US

Related Courses

Intro to Python for Brand New Programmers
PyCon US via YouTube
Comprehending Comprehensions
PyCon US via YouTube
Data Analysis with SQLite and Python
PyCon US via YouTube
Build a Production Ready GraphQL API Using Python
PyCon US via YouTube
Web Development With A Python-backed Frontend - Featuring HTMX and Tailwind
PyCon US via YouTube