Model Compression Courses
Graham Neubig via YouTube

GUITAR: Gradient Pruning toward Fast Neural Ranking - Efficiency for Search
Association for Computing Machinery (ACM) via YouTube

Leaner, Greener and Faster PyTorch Inference with Quantization
MLOps World: Machine Learning in Production via YouTube

A Gentle Introduction to Sparsity with a Concrete Example
MLOps World: Machine Learning in Production via YouTube

TinyEngine and Parallel Processing - EfficientML.ai Lecture 11
MIT HAN Lab via YouTube

MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube

Self-Improving Teacher Cultivates Better Student: Distillation Calibration for Multimodal Large Language Models - Lecture 3.3
Association for Computing Machinery (ACM) via YouTube