Fine-Tuning Llama 3 on a Custom Dataset for RAG Q&A - Training LLM on a Single GPU
Offered By: Venelin Valkov via YouTube
Course Description
Overview
Learn how to fine-tune Llama 3 on a custom dataset for a RAG Q&A use case using a single GPU in this comprehensive 33-minute tutorial. Explore the benefits of fine-tuning, understand the process overview, and dive into practical steps including dataset preparation, model loading, custom dataset creation, and LoRA setup. Follow along with Google Colab setup, establish a baseline, train the model, and evaluate its performance against the base model. Gain insights into pushing the fine-tuned model to the HuggingFace hub and discover how even smaller models can outperform larger ones when properly fine-tuned for specific tasks.
Syllabus
- Why fine-tuning?
- Text tutorial on MLExpert.io
- Fine-tuning process overview
- Dataset
- Lllama 3 8B Instruct
- Google Colab Setup
- Loading model and tokenizer
- Create custom dataset
- Establish baseline
- Training on completions
- LoRA setup
- Training
- Load model and push to HuggingFace hub
- Evaluation comparing vs the base model
- Conclusion
Taught by
Venelin Valkov
Related Courses
How to Do Stable Diffusion LORA Training by Using Web UI on Different ModelsSoftware Engineering Courses - SE Courses via YouTube MicroPython & WiFi
Kevin McAleer via YouTube Building a Wireless Community Sensor Network with LoRa
Hackaday via YouTube ComfyUI - Node Based Stable Diffusion UI
Olivio Sarikas via YouTube AI Masterclass for Everyone - Stable Diffusion, ControlNet, Depth Map, LORA, and VR
Hugh Hou via YouTube