OpenAI Embeddings with Voice Cloning - Eleven Labs API, ChatGPT API, Whisper API
Offered By: Part Time Larry via YouTube
Course Description
Overview
Syllabus
Project Description: Q&A + Voice Cloning
The movie “Her” and the Idea of Smarter Assistants
Voice Sampling
Demo Voice #1 Samantha Voice
Demo Voice #2 Jay-Z Voice, Rhyming Responses
Hip Hop Music and Sampling Analogy
Hip Hop Production, Rick Rubin, Taste and Technical Ability Clip
Recap of OpenAI For Finance Series So Far, Prerequisites
Building a Q&A Corpus, Vector Embeddings, Cosine Similarity Review
Building a User Interface with Gradio, Starter Code from Video #9
Voice Cloning with Eleven Labs API
Python Code Walkthrough - config.py constants, voice ID, custom prompts
Eleven Labs API - Example Request and Response Payloads
Avatars and AI Art Generation with Midjourney, Nvidia Stock Win
Python Code Walkthrough - advisor.py, requirements.txt
Gradio User Interface Development, Microphone Input, Avatar Display
UI Launch, Debugging Mode, Sharing Your App, Mobile Devices
Transcribe Function, OpenAI Whisper API
Incorporating Word Embeddings, Question Vector, Cosine Similarity, Answers
ChatGPT API, Conversation History, Stuffing the Prompt with Context
Eleven Labs API Request with Python, Text to Speech, Voice Synthesis Settings
Outputting Binary Response / MP3 to Audio Output
Final Words of Advice from Jay-Z
Taught by
Part Time Larry
Related Courses
Natural Language ProcessingColumbia University via Coursera Natural Language Processing
Stanford University via Coursera Introduction to Natural Language Processing
University of Michigan via Coursera moocTLH: Nuevos retos en las tecnologías del lenguaje humano
Universidad de Alicante via Miríadax Natural Language Processing
Indian Institute of Technology, Kharagpur via Swayam