YoVDO

Whisper Paper Explained - Robust Speech Recognition via Large-Scale Weak Supervision

Offered By: Aladdin Persson via YouTube

Tags

Natural Language Processing (NLP) Courses, Machine Learning Courses, Deep Learning Courses, Speech Recognition Courses

Course Description

Overview

Follow a video walkthrough of the Whisper paper, "Robust Speech Recognition via Large-Scale Weak Supervision", which achieved state-of-the-art results in speech recognition and was released with open-source code and weights. The walkthrough covers the dataset collection process, the model approach, the experiments, and the evaluation methods, then turns to the challenges of long-form transcription and the impact of model and dataset scaling on performance. The presenter breaks down complex concepts section by section and provides timestamps for navigating key topics such as the abstract, introduction, model architecture, and experimental results.

Syllabus

- Introduction
- Abstract
- Introduction
- Dataset collection and processing
- Model approach
- Figure of model
- Experiments and Evaluation
- Long form transcription, messy :/
- Model and Dataset scaling
- Long form transcription cont, messy :/
- Ending


Taught by

Aladdin Persson

Related Courses

Natural Language Processing
Columbia University via Coursera
Natural Language Processing
Stanford University via Coursera
Introduction to Natural Language Processing
University of Michigan via Coursera
moocTLH: Nuevos retos en las tecnologías del lenguaje humano
Universidad de Alicante via Miríadax
Natural Language Processing
Indian Institute of Technology, Kharagpur via Swayam