Qwen 2 LLM - Overview and Development with Junyang Lin
Offered By: Aleksa Gordić - The AI Epiphany via YouTube
Course Description
Overview
Dive into an in-depth video interview with Junyang Lin, one of the authors of the recently announced Qwen 2 LLM. Explore the development process, key features, and innovations behind this large language model. Learn about the Qwen team at Alibaba, the model's architecture, tokenizer, training methodology, and pretraining data. Gain insights into the extended context length capabilities, post-training techniques, benchmarks, safety considerations, and future directions for Qwen 2. Discover how this LLM compares to others in the field and understand its potential impact on natural language processing applications.
Syllabus
00:00:00 - Intro
00:00:32 - Hyperstack GPUs (sponsored)
00:02:13 - Junyang & Qwen team at Alibaba
00:07:05 - Qwen2
00:13:05 - Tokenizer
00:15:55 - Training Qwen2
00:23:30 - Pretraining data
00:30:50 - Model Architecture changes?
00:37:35 - Context length
00:50:00 - Post-training
00:57:50 - Benchmarks/safety
00:59:20 - Future work
Taught by
Aleksa Gordić - The AI Epiphany
Related Courses
Alibaba - How To Succeed At Importing Products (Udemy)
eCommerce Website: Shopify, Dropshipping, Amazon and more. (Udemy)
Complete Alibaba Dropshipping Business: From Zero To Hero (Udemy)
Alibaba - The Complete Guide to the Import Business (Udemy)
Alibaba The Complete Guide to Import from Alibaba to Amazon (Udemy)