YoVDO

DIY OpenAI Vision API App with Speech Recognition - Python, OpenAI, Google Speech Services

Offered By: Eli the Computer Guy via YouTube

Tags

Computer Vision Courses Python Courses Image Analysis Courses Speech Recognition Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to build an OpenAI Vision API application with speech recognition capabilities using Python, OpenAI, and Google Speech Services in this comprehensive 41-minute tutorial. Explore system architecture, automatic item identification, and full voice communication with a computer vision system. Gain practical insights into code implementation, including handling Pyaudio challenges. Follow along with detailed code explanations and demonstrations to create your own AI-powered vision and speech application.

Syllabus

Introduction
Demonstration
System Architecture
WARNING - Pyaudio is a pain
Automatic Item Identification Script - Code Explaination
Ask Computer About an Item - Code Explanation
Full Voice Communication with a Computer Vision System - Code Explanation
Final Thoughts


Taught by

Eli the Computer Guy

Related Courses

Writing II: Rhetorical Composing
Ohio State University via Coursera
Introducción a la visión por computador: desarrollo de aplicaciones con OpenCV.
Universidad Carlos iii de Madrid via edX
Earth Imagery at Work
Esri via Independent
Introduction to Artificial Intelligence (AI)
Microsoft via edX
Image Analysis Methods for Biologists
The University of Nottingham via FutureLearn