Qwen 2 LLM - Overview and Development with Junyang Lin
Offered By: Aleksa Gordić - The AI Epiphany via YouTube
Course Description
Overview
Dive into an in-depth video interview with Junyang Lin, author of the recently announced Qwen 2 LLM. Explore the development process, key features, and innovations behind this large language model. Learn about the Qwen team at Alibaba, the model's architecture, tokenizer, training methodology, and pretraining data. Gain insights into the extended context length capabilities, post-training techniques, benchmarks, safety considerations, and future directions for Qwen 2. Discover how this LLM compares to others in the field and understand its potential impact on natural language processing applications.
Syllabus
00:00:00 - Intro
00:00:32 - Hyperstack GPUs sponsored
00:02:13 - Junyang & Qwen team at Alibaba
00:07:05 - Qwen2
00:13:05 - Tokenizer
00:15:55 - Training Qwen2
00:23:30 - Pretraining data
00:30:50 - Model Architecture changes?
00:37:35 - Context length
00:50:00 - Post-training
00:57:50 - Benchmarks/safety
00:59:20 - Future work
Taught by
Aleksa Gordić - The AI Epiphany
Related Courses
Investment Strategies and Portfolio AnalysisRice University via Coursera Advanced R Programming
Johns Hopkins University via Coursera Supply Chain Analytics
Rutgers University via Coursera Технологическое предпринимательство
Moscow Institute of Physics and Technology via Coursera Learn How To Code: Google's Go (golang) Programming Language
Udemy