ConvNeXt- A ConvNet for the 2020s - Paper Explained
Offered By: Aleksa Gordić - The AI Epiphany via YouTube
Course Description
Overview
Explore a comprehensive analysis of the "A ConvNet for the 2020s" paper in this 40-minute video lecture. Delve into the convergence of transformers and CNNs, understand the main diagram and its corrections, and recap the Swin transformer. Learn about modernizing ResNets, dive deeper into stage ratios and miscellaneous topics like inverted bottlenecks and depthwise convolutions. Examine the results in classification, object detection, and segmentation tasks. Gain insights into how ConvNets outperform vision transformers in big data regimes without attention layers, demonstrating the enduring relevance of convolutional priors in computer vision.
Syllabus
Intro - convergence of transformers and CNNs
Main diagram explained
Main diagram corrections
Swin transformer recap
Modernizing ResNets
Diving deeper: stage ratio
Diving deeper: misc inverted bottleneck, depthwise conv...
Results classification, object detection, segmentation
RIP DanNet
Summary and outro
Taught by
Aleksa Gordić - The AI Epiphany
Related Courses
Computer Vision: The FundamentalsUniversity of California, Berkeley via Coursera Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital
Duke University via Coursera Fundamentals of Digital Image and Video Processing
Northwestern University via Coursera 医学图像处理技术 Medical Image Analysis
Shanghai Jiao Tong University via Coursera Image Processing and Analysis for Life Scientists
École Polytechnique Fédérale de Lausanne via edX