Introduction to Attention-Based Neural Networks

Offered By: LinkedIn Learning

Tags

Long short-term memory (LSTM) Courses
Image Captioning Courses
Language Models Courses
Sequence to Sequence Models Courses

Course Description

Overview

Learn what attention-based models are, how they work, and what they can do for recurrent neural networks.

Syllabus

Introduction

Prerequisites
What are attention-based models?
Attention in language generation and translation models

1. Recurrent Neural Networks to Learn Sequential Data

Feed forward networks and their limitations
Recurrent neural networks for sequential data
The need for long memory cells
LSTM and GRU cells
Types of RNNs

2. Encoder-Decoder Networks for Language Models

Language generation models
Sequence to sequence models for language translation

3. Attention-Based Neural Networks

The role of attention in sequence to sequence models
Attention mechanism in sequence to sequence models
Alignment weights in attention models
Bahdanau attention
Attention models for image captioning
Encoder decoder structure for image captioning

4. Image Captioning Model without Attention

Setting up Colab and Google Drive
Loading in the Flickr8k dataset
Constructing the vocabulary
Setting up the dataset class
Implementing utility functions for training data
Building the encoder CNN
Building the decoder RNN
Setting up the sequence to sequence model
Training the image captioning model

5. Image Captioning Model Using Attention

Loading the dataset and setting up utility functions
The encoder CNN generating unrolled feature maps
Implementing Bahdanau attention
The decoder RNN using attention
Generating captions using attention
Training the attention-based image captioning model
Visualizing the model's attention

Conclusion

Summary and next steps

Taught by

Janani Ravi

Related Courses

Simple Recurrent Neural Network with Keras (Coursera Project Network via Coursera)
Deep Learning: Advanced Natural Language Processing and RNNs (Udemy)
Machine Translation (Great Learning via YouTube)
Tensorflow 2.0 | Recurrent Neural Networks, LSTMs, GRUs (Udemy)
Pytorch Transformers for Machine Translation (Aladdin Persson via YouTube)