LLaMA (Large Language Model Meta AI) Courses
USENIX via YouTube Quant-LLM: Accelerating Large Language Model Serving via FP6-Centric Algorithm-System Co-Design
USENIX via YouTube LongWriter: Generating Extended Text with Large Language Models
Sam Witteveen via YouTube RAG with Llama 3.1 for Google Trends Data Scraping and Summarization - Streamlit Web App
Machine Learning With Hamza via YouTube WordLlama: Fast Lightweight NLP Toolkit Based on LLama Embeddings
1littlecoder via YouTube Deploying LLM Workloads on Kubernetes Using WasmEdge and Kuasar
CNCF [Cloud Native Computing Foundation] via YouTube Llama - Scaling Up LLMs in an Open Ecosystem
Anyscale via YouTube How to Pick a GPU and Inference Engine for Large Language Models
Trelis Research via YouTube Function Calling in Language Models - Everything You Need to Know
Trelis Research via YouTube LLMOps: OpenVino Toolkit Quantize to 4int LLama 3.1 8B Inference on CPU
The Machine Learning Engineer via YouTube