Advanced Computer Vision and Deep Learning
Offered By: Udacity
Course Description
Overview
Learn to apply deep learning architectures to computer vision tasks. Discover how to combine CNNs and RNNs to build an automatic image captioning application.
Syllabus
- Advanced CNN Architectures
- Learn about advances in CNN architectures and see how region-based CNNs, like Faster R-CNN, have allowed for fast, localized object recognition in images.
- YOLO
- Learn about the YOLO (You Only Look Once) multi-object detection model and work with a YOLO implementation.
- RNNs
- Explore how memory can be incorporated into a deep learning model using recurrent neural networks (RNNs). Learn how RNNs can learn from and generate ordered sequences of data.
- Long Short-Term Memory Networks (LSTMs)
- Luis explains Long Short-Term Memory networks (LSTMs) and similar architectures, which have the benefit of preserving long-term memory.
- Hyperparameters
- Learn about the hyperparameters used to define and train deep learning models. We'll discuss starting values and intuitions for tuning each one.
- Optional: Attention Mechanisms
- Attention is one of the most important recent innovations in deep learning. In this section, you'll learn how attention models work and go over a basic code implementation.
- Image Captioning
- Learn how to combine CNNs and RNNs to build a complex, automatic image captioning model.
- Project: Image Captioning
- Train a CNN-RNN model to predict captions for a given image. Your main task will be to implement an effective RNN decoder for a CNN encoder.
Taught by
Cezanne Camacho (nd891), Luis Serrano, Jay Alammar (nd892), Ortal Arel (nd101), and Kelvin Lwin
Related Courses
- Reinforcement Learning for Trading Strategies (New York Institute of Finance via Coursera)
- Natural Language Processing with Sequence Models (DeepLearning.AI via Coursera)
- Fake News Detection with Machine Learning (Coursera Project Network via Coursera)
- English/French Translator: Long Short Term Memory Networks (Coursera Project Network via Coursera)
- Text Classification Using Word2Vec and LSTM on Keras (Coursera Project Network via Coursera)