YoVDO

Neural Nets for NLP - Recurrent Networks for Sentence or Language Modeling

Offered By: Graham Neubig via YouTube

Tags

Neural Networks Courses Natural Language Processing (NLP) Courses Long short-term memory (LSTM) Courses

Course Description

Overview

Explore recurrent neural networks for sentence and language modeling in this comprehensive lecture from CMU's Neural Networks for NLP course. Dive into the structure and capabilities of RNNs, understand the vanishing gradient problem and how LSTMs address it, and examine the strengths and weaknesses of recurrence in sentence modeling. Learn about pre-training techniques for RNNs and gain insights into handling long sequences and long-distance dependencies in language processing. Discover practical applications like language modeling and sentence representation through detailed examples and explanations.

Syllabus

Intro
Why Model Sentence Pairs?
Siamese Network (Bromley et al. 1993)
Convolutional Matching Model (Hu et al. 2014) • Concatenate sentences into a 30 tensor and perform convolution
Convolutional Features + Matrix-based Pooling in and Schutze 2015
NLP and Sequential Data
Long-distance Dependencies in Language
Can be Complicated!
Recurrent Neural Networks (Elman 1990)
Unrolling in Time • What does processing a sequence look like?
What Can RNNs Do?
Representing Sentences
e.g. Language Modeling
RNNLM Example: Loss Calculation and State Update
Vanishing Gradient • Gradients decrease as they get pushed back
LSTM Structure
What can LSTMs Learn? (2) (Shi et al. 2016, Radford et al. 2017) Count length of sentence
Handling Long Sequences


Taught by

Graham Neubig

Related Courses

Reinforcement Learning for Trading Strategies
New York Institute of Finance via Coursera
Natural Language Processing with Sequence Models
DeepLearning.AI via Coursera
Fake News Detection with Machine Learning
Coursera Project Network via Coursera
English/French Translator: Long Short Term Memory Networks
Coursera Project Network via Coursera
Text Classification Using Word2Vec and LSTM on Keras
Coursera Project Network via Coursera