Applied Deep Learning in Speech Recognition and Collaborative Filtering - Session 5
Offered By: Nvidia via YouTube
Course Description
Overview
Explore cutting-edge applications of deep learning in speech recognition and collaborative filtering systems in this session from the NVIDIA AI Tech Workshop at NeurIPS Expo 2018. Delve into the intricacies of building speech recognition models using synthetic speech and optimizing neural collaborative filtering systems. Gain insights into semantic map filling, problem definition, and architecture overview. Examine various modules including wear module, neural networks, and auto encoders. Discover techniques for handling mode collapse and processing real images. Learn about complex image understanding, graph-based image representation, and structure prediction. Investigate graph permutation invariant architectures and their applications in synthetic examples and Visual Genome datasets. Enhance your knowledge of applied deep learning techniques presented by NVIDIA Research experts in this comprehensive 39-minute session.
Syllabus
Intro
Semantic Map Filling
Problem Definition
Architecture Overview
Wear Module
Neural Network
Mode Collapse
Auto Encoder
Real Images
Animation
Neural Network Results
Copy and Paste
Understanding complex images
Images as a graph
Good enough graph
Structure prediction
Architecture
Graph permutation invariant
Synthetic example
Visual Genome
Visual Genome Example
Summary
Taught by
NVIDIA Developer
Tags
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Computational Photography
Georgia Institute of Technology via Coursera Digital Signal Processing
École Polytechnique Fédérale de Lausanne via Coursera Creative, Serious and Playful Science of Android Apps
University of Illinois at Urbana-Champaign via Coursera