Running Speech-to-Speech Models on Mac or GPU

Offered By: Trelis Research via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Learn how to run speech-to-speech AI models on Mac or GPU in this comprehensive 37-minute tutorial. Explore the process of building models like GPT-4o, dive into the Llama 3 Speech-to-Speech Model, and utilize HuggingFace's Speech-to-Speech repository. Follow step-by-step instructions for running these models on your Mac and remote GPU (CUDA) environments. Discover techniques to reduce latency using UDP ports instead of TCP. Access valuable resources, including GitHub repositories, slides, and research papers, to further enhance your understanding of speech-to-speech AI technology.

Syllabus

Introduction to Speech to Speech AI Models like GPT-4o
Video Overview
How to build speech-to-speech models like GPT-4o
Llama 3 Speech-to-Speech Model
HuggingFace Speech-to-Speech
Running speech to speech on your Mac
Running speech-to-speech on a remote GPU CUDA
Reducing latency with UDP ports instead of TCP
Video Resources

Taught by

Trelis Research

Running Speech-to-Speech Models on Mac or GPU

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

Running Speech-to-Speech Models on Mac or GPU

Tags

Course Description

Overview

Syllabus

Taught by

Related Courses

Login to Continue