Pytorch Image Captioning Tutorial
Offered By: Aladdin Persson via YouTube
Course Description
Overview
Learn how to build an image captioning system from scratch in this 36-minute tutorial. Explore the Flickr8k dataset and implement a model using PyTorch. Gain insights into combining convolutional neural networks (CNNs) and recurrent neural networks (RNNs) for image captioning tasks. Follow along with code implementation, training setup, and error fixing. Discover potential improvements like using larger models, extended training, and incorporating attention mechanisms. Conclude with a brief evaluation of the implemented system.
Syllabus
- Introduction
- Explanation of Image Captioning
- Overview of the code
- Implementation of CNN and RNN
- Setting up the training
- Fixing errors
- Small evaluation and ending
Taught by
Aladdin Persson
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Computational Photography
Georgia Institute of Technology via Coursera Einführung in Computer Vision
Technische Universität München (Technical University of Munich) via Coursera Introduction to Computer Vision
Georgia Institute of Technology via Udacity