YoVDO

Model Compression Courses

Distillation, Quantization, and Pruning in Advanced NLP - Lecture 11
Graham Neubig via YouTube
GUITAR: Gradient Pruning toward Fast Neural Ranking - Efficiency for Search
Association for Computing Machinery (ACM) via YouTube
Leaner, Greener and Faster PyTorch Inference with Quantization
MLOps World: Machine Learning in Production via YouTube
A Gentle Introduction to Sparsity with a Concrete Example
MLOps World: Machine Learning in Production via YouTube
TinyEngine and Parallel Processing - EfficientML.ai Lecture 11
MIT HAN Lab via YouTube
MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube
MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube
Self-Improving Teacher Cultivates Better Student: Distillation Calibration for Multimodal Large Language Models - Lecture 3.3
Association for Computing Machinery (ACM) via YouTube
< Prev Page 11