Qwen 2 LLM - Overview and Development with Junyang Lin
Offered By: Aleksa Gordić - The AI Epiphany via YouTube
Course Description
Overview
Dive into an in-depth video interview with Junyang Lin, author of the recently announced Qwen 2 LLM. Explore the development process, key features, and innovations behind this large language model. Learn about the Qwen team at Alibaba, the model's architecture, tokenizer, training methodology, and pretraining data. Gain insights into the extended context length capabilities, post-training techniques, benchmarks, safety considerations, and future directions for Qwen 2. Discover how this LLM compares to others in the field and understand its potential impact on natural language processing applications.
Syllabus
00:00:00 - Intro
00:00:32 - Hyperstack GPUs (sponsored)
00:02:13 - Junyang & Qwen team at Alibaba
00:07:05 - Qwen2
00:13:05 - Tokenizer
00:15:55 - Training Qwen2
00:23:30 - Pretraining data
00:30:50 - Model Architecture changes?
00:37:35 - Context length
00:50:00 - Post-training
00:57:50 - Benchmarks/safety
00:59:20 - Future work
Taught by
Aleksa Gordić - The AI Epiphany
Related Courses
TensorFlow: Working with NLP (LinkedIn Learning)
Introduction to Video Editing - Video Editing Tutorials (Great Learning via YouTube)
HuggingFace Crash Course - Sentiment Analysis, Model Hub, Fine Tuning (Python Engineer via YouTube)
GPT3 and Finetuning the Core Objective Functions - A Deep Dive (David Shapiro ~ AI via YouTube)
How to Build a Q&A AI in Python - Open-Domain Question-Answering (James Briggs via YouTube)