Model Compression Courses

Distillation, Quantization, and Pruning in Advanced NLP - Lecture 11
Graham Neubig via YouTube GUITAR: Gradient Pruning toward Fast Neural Ranking - Efficiency for Search
Association for Computing Machinery (ACM) via YouTube Leaner, Greener and Faster PyTorch Inference with Quantization
MLOps World: Machine Learning in Production via YouTube A Gentle Introduction to Sparsity with a Concrete Example
MLOps World: Machine Learning in Production via YouTube TinyEngine and Parallel Processing - EfficientML.ai Lecture 11
MIT HAN Lab via YouTube MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube Self-Improving Teacher Cultivates Better Student: Distillation Calibration for Multimodal Large Language Models - Lecture 3.3
Association for Computing Machinery (ACM) via YouTube

< Prev Page 11