YoVDO

OpenAI Embeddings with Voice Cloning - Eleven Labs API, ChatGPT API, Whisper API

Offered By: Part Time Larry via YouTube

Tags

Natural Language Processing (NLP) Courses ChatGPT Courses Gradio Courses Voice Cloning Courses

Course Description

Overview

Learn how to build a sophisticated question-answering voice assistant with realistic voice responses in this comprehensive tutorial. Explore the integration of OpenAI Embeddings, ChatGPT API, Whisper API, and Eleven Labs API to create a powerful AI-driven assistant. Discover techniques for voice cloning, natural language processing, and user interface development using Gradio. Follow along as the instructor demonstrates how to construct a Q&A corpus, implement vector embeddings, and utilize cosine similarity for accurate answer retrieval. Gain insights into incorporating AI-generated avatars, handling microphone input, and optimizing voice synthesis settings. By the end of this tutorial, you'll have the knowledge to create your own advanced voice assistant with customizable voices and intelligent responses.

Syllabus

Project Description: Q&A + Voice Cloning
The movie “Her” and the Idea of Smarter Assistants
Voice Sampling
Demo Voice #1 Samantha Voice
Demo Voice #2 Jay-Z Voice, Rhyming Responses
Hip Hop Music and Sampling Analogy
Hip Hop Production, Rick Rubin, Taste and Technical Ability Clip
Recap of OpenAI For Finance Series So Far, Prerequisites
Building a Q&A Corpus, Vector Embeddings, Cosine Similarity Review
Building a User Interface with Gradio, Starter Code from Video #9
Voice Cloning with Eleven Labs API
Python Code Walkthrough - config.py constants, voice ID, custom prompts
Eleven Labs API - Example Request and Response Payloads
Avatars and AI Art Generation with Midjourney, Nvidia Stock Win
Python Code Walkthrough - advisor.py, requirements.txt
Gradio User Interface Development, Microphone Input, Avatar Display
UI Launch, Debugging Mode, Sharing Your App, Mobile Devices
Transcribe Function, OpenAI Whisper API
Incorporating Word Embeddings, Question Vector, Cosine Similarity, Answers
ChatGPT API, Conversation History, Stuffing the Prompt with Context
Eleven Labs API Request with Python, Text to Speech, Voice Synthesis Settings
Outputting Binary Response / MP3 to Audio Output
Final Words of Advice from Jay-Z


Taught by

Part Time Larry

Related Courses

Natural Language Processing
Columbia University via Coursera
Natural Language Processing
Stanford University via Coursera
Introduction to Natural Language Processing
University of Michigan via Coursera
moocTLH: Nuevos retos en las tecnologías del lenguaje humano
Universidad de Alicante via Miríadax
Natural Language Processing
Indian Institute of Technology, Kharagpur via Swayam