Mistral 7B: Architecture, Evaluation, and Advanced Techniques
Offered By: Trelis Research via YouTube
Course Description
Overview
This 19-minute video tutorial covers the architecture and capabilities of Mistral 7B. It walks through the model's design, setup on Runpod, and an evaluation across four tests: random sequence reversal, code generation, passkey retrieval, and fine-tuning. Two extra segments explain Grouped Query Attention and Sliding Window Attention. Additional resources include a comparison notebook, a Runpod setup guide, and a supervised fine-tuning tutorial.
Syllabus
Intro
Video Overview
Mistral 7B architecture and design
Runpod setup
Mistral 7B Evaluation
Test 1: Random sequence reversal
Test 2: Code generation
Test 3: Passkey retrieval
Test 4: Fine-tuning
Evaluation Summary
EXTRA: Grouped Query Attention
EXTRA: Sliding Window Attention
Taught by
Trelis Research
Related Courses
Introduction to Artificial Intelligence
Stanford University via Udacity
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Artificial Intelligence for Robotics
Stanford University via Udacity
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent