Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
Offered By: Toronto Machine Learning Series (TMLS) via YouTube
Course Description
Overview
Explore the innovative CheckList methodology for testing NLP models in this 35-minute talk by Marco TĂșlio Ribeiro, Senior Researcher at Microsoft Research. Discover a task-agnostic approach inspired by software engineering principles that goes beyond traditional accuracy metrics. Learn about intriguing bugs uncovered in both commercial and research models, including those from tech giants like Microsoft, Amazon, and Google, as well as popular models like BERT and RoBERTA. Gain insights into the effectiveness of CheckList through case studies and user feedback from researchers and engineers. Understand how this powerful tool can enhance the testing and debugging process for NLP models, benefiting both practitioners and researchers in the field.
Syllabus
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
Taught by
Toronto Machine Learning Series (TMLS)
Related Courses
Multi-Label Classification on Unhealthy Comments - Finetuning RoBERTa with PyTorch - Coding Tutorialrupert ai via YouTube Hugging Face Transformers - The Basics - Practical Coding Guides - NLP Models (BERT/RoBERTa)
rupert ai via YouTube Programming Language of the Future: AI in Your Native Language
Linux Foundation via YouTube Pre-training and Pre-trained Models in Advanced NLP - Lecture 5
Graham Neubig via YouTube Fine-tuning LLMs Without Maxing Out Your GPU - LoRA for Parameter-Efficient Training
Data Centric via YouTube