Quantization Courses
Graham Neubig via YouTube LLMOps: OpenVino Toolkit Quantization 4int LLama 3.2 3B and Inference on CPU
The Machine Learning Engineer via YouTube LLMOps: OpenVino Toolkit para Quantizar LLama 3.2 3B a 4int e Inferencia en CPU
The Machine Learning Engineer via YouTube Leaner, Greener and Faster PyTorch Inference with Quantization
MLOps World: Machine Learning in Production via YouTube Quantized Optimal Transport Reward-based Reinforcement Learning Approach to Detoxify Query Auto-Completion - Lecture 1
Association for Computing Machinery (ACM) via YouTube MCUNet and TinyML - Lecture 10
MIT HAN Lab via YouTube Databricks' vLLM Optimization for Cost-Effective LLM Inference - Ray Summit 2024
Anyscale via YouTube