Fine-tuning Llama 3 on Wikipedia Datasets for Low-Resource Languages
Offered By: Trelis Research via YouTube
Course Description
Overview
Explore the process of fine-tuning Llama 3 for low-resource languages using Wikipedia datasets in this comprehensive 44-minute tutorial. Learn how to create a HuggingFace dataset using WikiExtractor, set up Llama 3 fine-tuning with LoRA, and implement dataset blending to prevent catastrophic forgetting. Dive into trainer setup, parameter selection, and loss inspection. Gain insights into learning rates and annealing, along with additional tips for improving your fine-tuning results. Access the provided resources, including slides, dataset links, and code repositories, to enhance your learning experience.
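As a rough illustration of the dataset-creation step, the following is a minimal sketch, assuming WikiExtractor was run with its --json flag so each output shard is line-delimited JSON with a "text" field. The directory layout, length filter, and Hub repository id are placeholders, not the values used in the video.

    # Minimal sketch: turn WikiExtractor --json output into a HuggingFace dataset.
    # Paths, the length filter, and the Hub repo id are placeholders.
    from datasets import load_dataset

    # WikiExtractor writes shards such as extracted/AA/wiki_00, extracted/AB/wiki_01, ...
    ds = load_dataset("json", data_files="extracted/**/wiki_*", split="train")

    # Drop stubs and near-empty pages before training.
    ds = ds.filter(lambda row: len(row["text"]) > 200)

    # Requires `huggingface-cli login`; the repo id below is a placeholder.
    ds.push_to_hub("your-username/wikipedia-low-resource")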
Syllabus
Fine-tuning Llama 3 for a low resource language
Overview of Wikipedia Dataset and Loss Curves
Video overview
HuggingFace Dataset creation with WikiExtractor
Llama 3 fine-tuning setup, including LoRA (see the setup sketch after this syllabus)
Dataset blending to avoid catastrophic forgetting
Trainer setup and parameter selection
Inspection of losses and results
Learning Rates and Annealing
Further tips and improvements
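The syllabus items on LoRA setup, dataset blending, and trainer parameters map roughly onto a configuration like the sketch below. This is not the video's code: the model id, the blending source (wikitext-103 standing in for general-purpose English data), the 80/20 mixing ratio, and every hyperparameter are illustrative assumptions.

    # Minimal sketch of LoRA fine-tuning with dataset blending; all names and
    # hyperparameters are assumptions, not the values from the video.
    import torch
    from datasets import load_dataset, interleave_datasets
    from peft import LoraConfig, get_peft_model
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)

    model_id = "meta-llama/Meta-Llama-3-8B"  # gated model; requires HF access approval
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    tokenizer.pad_token = tokenizer.eos_token
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

    # Attach LoRA adapters to the attention projections (a common default choice).
    lora_cfg = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM",
                          target_modules=["q_proj", "k_proj", "v_proj", "o_proj"])
    model = get_peft_model(model, lora_cfg)

    # Dataset blending: mix the new-language Wikipedia data with general English
    # text so the model keeps its original abilities (avoiding catastrophic forgetting).
    wiki = load_dataset("your-username/wikipedia-low-resource", split="train")
    english = load_dataset("wikitext", "wikitext-103-raw-v1", split="train")
    wiki = wiki.select_columns(["text"])          # both sources need matching columns
    english = english.select_columns(["text"]).filter(lambda r: len(r["text"]) > 0)
    blended = interleave_datasets([wiki, english], probabilities=[0.8, 0.2], seed=42)

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, max_length=1024)

    tokenized = blended.map(tokenize, batched=True, remove_columns=["text"])

    args = TrainingArguments(
        output_dir="llama3-low-resource-lora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
        learning_rate=1e-4,              # typical LoRA range; tune per run
        lr_scheduler_type="cosine",      # annealing schedule
        warmup_ratio=0.03,
        num_train_epochs=1,
        logging_steps=10,
        bf16=True,
    )

    trainer = Trainer(model=model, args=args, train_dataset=tokenized,
                      data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False))
    trainer.train()

The cosine scheduler here stands in for the annealing discussed in the video; the blending probabilities and learning rate are starting points to tune against your own loss curves.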
Taught by
Trelis Research
Related Courses
Active Dendrites Avoid Catastrophic Forgetting - Interview With the Authors (Yannic Kilcher via YouTube)
Avoiding Catastrophe - Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Yannic Kilcher via YouTube)
Supermasks in Superposition - Paper Explained (Yannic Kilcher via YouTube)
What Kind of AI Can Help Manufacturing Adapt to a Pandemic (Open Data Science via YouTube)
Rethinking Architecture Design for Data Heterogeneity in FL - Liangqiong Qu (Stanford University via YouTube)