Quantization Courses
Institut des Hautes Etudes Scientifiques (IHES) via YouTube
How to Pick a GPU and Inference Engine for Large Language Models (Trelis Research via YouTube)
IDEFICS 2 API Endpoint, vLLM vs TGI, and General Fine-tuning Tips (Trelis Research via YouTube)
Multi-GPU Fine-tuning with DDP and FSDP (Trelis Research via YouTube)
Pushing Models and Adapters to HuggingFace (Trelis Research via YouTube)
The Best Tiny Language Models - Performance, Fine-tuning, and Function-calling (Trelis Research via YouTube)
Mixtral Fine-Tuning and Inference - Advanced Guide (Trelis Research via YouTube)
Serve a Custom LLM for Over 100 Customers - GPU Selection, Quantization, and API Setup (Trelis Research via YouTube)
How to Quantize a Large Language Model with GGUF or AWQ (Trelis Research via YouTube)
Double Inference Speed with AWQ Quantization (Trelis Research via YouTube)