
RoBERTa - A Robustly Optimized BERT Pretraining Approach

Offered By: Yannic Kilcher via YouTube

Tags

Machine Learning Courses
Hyperparameter Optimization Courses
Model Optimization Courses

Course Description

Overview

Explore a comprehensive analysis of the RoBERTa model in this video lecture. Delve into a replication study of BERT pretraining that measures the impact of key hyperparameters and training data size, and that challenges the source of recent improvements in language model pretraining. Learn how careful hyperparameter tuning and a larger training corpus can significantly improve model performance. Discover how the original BERT architecture, trained with these adjustments, can match or exceed the performance of models published after it. Examine the state-of-the-art results achieved on the GLUE, RACE, and SQuAD benchmarks. Gain insight into previously overlooked design choices and the questions they raise about recently reported advances in natural language processing. Understand the implications of this research for future model development and evaluation in language model pretraining.
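
To ground the lecture's subject, the sketch below loads the pretrained RoBERTa checkpoint with the Hugging Face transformers library and encodes a sentence. This is an illustrative setup only, assuming transformers and PyTorch are installed; it is not part of the lecture itself.

# A minimal sketch: load pretrained RoBERTa and encode one sentence.
# Assumes `pip install transformers torch`; illustrative, not from the lecture.
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModel.from_pretrained("roberta-base")

# Per the RoBERTa paper, this checkpoint was pretrained with dynamic
# masking, no next-sentence-prediction objective, larger batches, and
# more training data than the original BERT.
inputs = tokenizer("RoBERTa is a robustly optimized BERT.", return_tensors="pt")
outputs = model(**inputs)

# Contextual token embeddings: (batch, sequence_length, hidden_size=768)
print(outputs.last_hidden_state.shape)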

Syllabus

RoBERTa: A Robustly Optimized BERT Pretraining Approach


Taught by

Yannic Kilcher

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent