Mistral 7B: Architecture, Evaluation, and Advanced Techniques
Offered By: Trelis Research via YouTube
Course Description
Overview
Explore the capabilities and architecture of Mistral 7B in this 19-minute video tutorial. Dive into the model's design, setup process on Runpod, and comprehensive evaluation through various tests including random sequence reversal, code generation, passkey retrieval, and fine-tuning. Gain insights into the model's performance and understand advanced concepts like Grouped Query Attention and Sliding Window Attention. Access additional resources including a comparison notebook, Runpod setup guide, and a supervised fine-tuning tutorial to enhance your understanding of this powerful language model.
Syllabus
Intro
Video Overview
Mistral 7B architecture and design
Runpod setup
Mistral 7B Evaluation
Test 1: Random sequence reversal
Test 2: Code generation
Test 3: Passkey retrieval
Test 4: Fine-tuning
Evaluation Summary
EXTRA: Grouped Query Attention
EXTRA: Sliding Window Attention
Taught by
Trelis Research
Related Courses
TensorFlow: Working with NLPLinkedIn Learning Introduction to Video Editing - Video Editing Tutorials
Great Learning via YouTube HuggingFace Crash Course - Sentiment Analysis, Model Hub, Fine Tuning
Python Engineer via YouTube GPT3 and Finetuning the Core Objective Functions - A Deep Dive
David Shapiro ~ AI via YouTube How to Build a Q&A AI in Python - Open-Domain Question-Answering
James Briggs via YouTube