Creating Voiceovers with OpenAI's Text-to-Speech and Vision Models
Offered By: Ian Wootten via YouTube
Course Description
Overview
Explore the latest advancements in OpenAI's text-to-speech (TTS) and GPT-4V models in this 15-minute video tutorial. Discover innovative applications of these technologies, including generating image descriptions and creating audio content. Learn how to produce voiceovers for images and videos using a combination of TTS and GPT-4V. Follow along as the presenter demonstrates practical examples and showcases novel ways developers have been utilizing these powerful tools. Gain insights into the potential of AI-driven content creation and enhance your understanding of cutting-edge language and vision models.
Syllabus
Intro
Using TTS to create audio
Using GPT4V to describe images
Using TTS & GPT4V for Video voiceovers
Conclusion
Taught by
Ian Wootten
Related Courses
Building Document Intelligence Applications with Azure Applied AI and Azure Cognitive ServicesMicrosoft via YouTube Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube AI Show - Ignite Recap: Arc-Enabled ML, Language Services, and OpenAI
Microsoft via YouTube Building Intelligent Applications with World-Class AI
Microsoft via YouTube Build an AI Image Generator with OpenAI & Node.js
Traversy Media via YouTube