Running Speech-to-Speech Models on Mac or GPU
Offered By: Trelis Research via YouTube
Course Description
Overview
Learn how to run speech-to-speech AI models on Mac or GPU in this comprehensive 37-minute tutorial. Explore the process of building models like GPT-4o, dive into the Llama 3 Speech-to-Speech Model, and utilize HuggingFace's Speech-to-Speech repository. Follow step-by-step instructions for running these models on your Mac and remote GPU (CUDA) environments. Discover techniques to reduce latency using UDP ports instead of TCP. Access valuable resources, including GitHub repositories, slides, and research papers, to further enhance your understanding of speech-to-speech AI technology.
Syllabus
Introduction to Speech to Speech AI Models like GPT-4o
Video Overview
How to build speech-to-speech models like GPT-4o
Llama 3 Speech-to-Speech Model
HuggingFace Speech-to-Speech
Running speech to speech on your Mac
Running speech-to-speech on a remote GPU CUDA
Reducing latency with UDP ports instead of TCP
Video Resources
Taught by
Trelis Research
Related Courses
High Performance ComputingGeorgia Institute of Technology via Udacity Fundamentals of Accelerated Computing with CUDA C/C++
Nvidia via Independent High Performance Computing for Scientists and Engineers
Indian Institute of Technology, Kharagpur via Swayam CUDA programming Masterclass with C++
Udemy Neural Network Programming - Deep Learning with PyTorch
YouTube