Evaluation Techniques for Large Language Models
Offered By: MLOps World: Machine Learning in Production via YouTube
Course Description
Overview
Explore practical tools and best practices for evaluating and choosing Large Language Models (LLMs) in this comprehensive tutorial presented by Rajiv Shah, Machine Learning Engineer at Hugging Face. Gain insights into the capabilities of LLMs compared to traditional ML models and learn various evaluation techniques, including evaluation suites, head-to-head competition approaches, and using LLMs to evaluate other LLMs. Delve into the subtle factors affecting evaluation, such as the role of prompts, tokenization, and requirements for factual accuracy. Examine model bias and ethical considerations through working examples. Acquire an in-depth understanding of LLM evaluation tradeoffs and methods, with reusable code provided in Jupyter Notebooks for each technique discussed.
Syllabus
Evaluation Techniques for Large Language Models
Taught by
MLOps World: Machine Learning in Production
Related Courses
The Laws of Digital Data, Content and Artificial Intelligence (AI)University of Law via FutureLearn Artificial Intelligence Privacy and Convenience
LearnQuest via Coursera AI Accountability Essential Training
LinkedIn Learning Artificial Intelligence: an Overview
Politecnico di Milano via Coursera Building Responsible AI - Best Practices Across the Product Development Lifecycle
Data Science Dojo via YouTube