Tutorial on Vision Transformers - Tutorial 3
Offered By: MICDE University of Michigan via YouTube
Course Description
Overview
This 49-minute tutorial, presented by Bharath Ramsundar and Amal Sebastian at MICDE University of Michigan, covers the application of transformer architectures to computer vision tasks: their structure, how they function, and their advantages over traditional convolutional neural networks. It introduces the key components of Vision Transformers, including self-attention mechanisms and positional encodings, and explains how these elements contribute to performance in image recognition and classification tasks. It also covers practical implementation techniques, best practices, and potential challenges when working with Vision Transformers in your own computer vision projects.
Syllabus
Bharath Ramsundar & Amal Sebastian: Tutorial on Vision Transformers (Tutorial 3)
Taught by
MICDE University of Michigan
Related Courses
Advanced PyTorch Techniques and Applications (Packt via Coursera)
Preprocessing Unstructured Data for LLM Applications (DeepLearning.AI via Coursera)
Automatic Image Captioning with Vision Transformer and GPT-2 (Eran Feit via YouTube)
Image Captioning Python App with ViT and GPT2 Using Hugging Face Models - Applied Deep Learning (1littlecoder via YouTube)
Rethinking and Improving Relative Position Encoding for Vision Transformer - Lecture 23 (University of Central Florida via YouTube)