CS480-680 Lecture 19 - Attention and Transformer Networks
Offered By: Pascal Poupart via YouTube
Course Description
Overview
Explore the fundamental concepts of attention mechanisms and transformer networks in this comprehensive lecture. Delve into topics such as attention neural networks, kernel similarity, and machine translation. Gain insights into the architecture of transformer networks, including multihead attention and mask multihead attention. Examine the role of recurrence and normalization in these advanced deep learning models. Enhance your understanding of cutting-edge natural language processing techniques and their applications in various domains.
Syllabus
Intro
Attention
Attention Neural Networks
Kernel Similarity
Machine Translation
Transformer Networks
Multihead Attention
Mask Multihead Attention
Recurrence
Normalization
Taught by
Pascal Poupart
Related Courses
Neural Networks for Machine LearningUniversity of Toronto via Coursera Good Brain, Bad Brain: Basics
University of Birmingham via FutureLearn Statistical Learning with R
Stanford University via edX Machine Learning 1—Supervised Learning
Brown University via Udacity Fundamentals of Neuroscience, Part 2: Neurons and Networks
Harvard University via edX