Real-Time Live Speech-to-Text - Streaming ASR Gradio App with Hugging Face Tutorial
Offered By: 1littlecoder via YouTube
Course Description
Overview
Build a real-time automatic speech recognition system using Facebook's Wav2Vec2 deep learning model in this applied NLP tutorial. Learn to use the Hugging Face Transformers Pipeline for audio-to-text conversion and create a Python web app with Gradio for live audio transcription. Explore pipeline setup, the UI, interface components, and state management. Access the provided Colab notebook for hands-on practice and discover related resources, including a guide on deploying Gradio ML apps on Hugging Face Spaces and a detailed blog post on real-time speech recognition. Enhance your NLP skills with additional tutorials, such as YouTube video transcript summarization using Hugging Face Transformers.
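The flow described above (an ASR pipeline, a Gradio interface, and state carried between audio chunks) could look roughly like the sketch below. It is not the code from the video or the Colab notebook. The checkpoint name `facebook/wav2vec2-base-960h` and the helper names `transcribe_chunk` and `build_app` are assumptions chosen for illustration, and the `gr.Audio` arguments follow Gradio 4.x (Gradio 3.x used `source="microphone"` instead of `sources=[...]`).

```python
def transcribe_chunk(audio_path, state, recognizer):
    """Transcribe one audio chunk and append it to the running transcript.

    `recognizer` is any callable that takes a file path and returns a dict
    with a "text" key, which is what a Transformers ASR pipeline returns.
    """
    state = state or ""              # Gradio passes None as the initial state
    if audio_path is None:           # nothing recorded yet
        return state, state
    text = recognizer(audio_path)["text"]
    state = (state + " " + text).strip()
    return state, state              # (value for the textbox, new state)


def build_app(model_name="facebook/wav2vec2-base-960h"):
    """Wire the pipeline into a live Gradio interface (hypothetical helper)."""
    # Imported here so transcribe_chunk can be tried without these packages.
    import gradio as gr
    from transformers import pipeline

    # Assumed checkpoint: a public Wav2Vec2 model fine-tuned for English ASR.
    asr = pipeline("automatic-speech-recognition", model=model_name)

    def step(audio_path, state):
        return transcribe_chunk(audio_path, state, asr)

    return gr.Interface(
        fn=step,
        inputs=[
            gr.Audio(sources=["microphone"], type="filepath", streaming=True),
            "state",
        ],
        outputs=["textbox", "state"],
        live=True,
    )


# To start the app locally:
# build_app().launch()
```

The `"state"` input and output pair is what lets each new chunk extend the earlier transcript instead of replacing it. Passing the recognizer in as an argument keeps the accumulation logic separate from the model, so it can be checked with a stand-in function before downloading any weights.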
Syllabus
Introduction
Pipeline
UI
Interface Components
State
Taught by
1littlecoder
Related Courses
Neural Networks for Machine Learning - University of Toronto via Coursera
機器學習技法 (Machine Learning Techniques) - National Taiwan University via Coursera
Machine Learning Capstone: An Intelligent Application with Deep Learning - University of Washington via Coursera
Прикладные задачи анализа данных (Applied Problems in Data Analysis) - Moscow Institute of Physics and Technology via Coursera
Leading Ambitious Teaching and Learning - Microsoft via edX