Understanding, Interpreting and Designing Neural Network Models Through Tensor Representations
Offered By: Institute for Pure & Applied Mathematics (IPAM) via YouTube
Course Description
Overview
Explore a comprehensive lecture on leveraging tensor representations to understand, interpret, and design neural network models. Delve into the challenges of modern deep neural networks and learn how spectral methods using tensor decompositions can provide provable performance guarantees. Discover techniques for designing deep neural network architectures that ensure interpretability, expressive power, generalization, and robustness before training begins. Examine the use of spectral methods to create "desirable" deep model functions and guarantee optimal outcomes post-training. Investigate compression techniques, CP layers, and low-rankedness concepts. Analyze generalization error bounds and performance evaluations. Gain insights into interpreting transformers through tensor diagrams, exploring single and multi-hat self-attention mechanisms. Conclude with an overview of improved expressive power and tensor representation for robust learning, providing a comprehensive understanding of advanced neural network design and analysis techniques.
Syllabus
Introduction
Challenges in Neural Networks
Robustness of Neural Networks
Outline
Conceptual Challenge
Computational Challenge
Goal
Background Knowledge
Compression Techniques
Compression Methods
CP Layer
Low Rankedness
Reshaping
Generalization Error Bound
Performance
Evaluation
Interpreting transformers
Operations in tensor diagrams
Benefits of tensor diagrams
Single Hat SelfAttention
Multi Hat SelfAttention
Multi Hat Modes
Recap
Improved expressive power
Tensor representation for robust learning
Results
Summary
Taught by
Institute for Pure & Applied Mathematics (IPAM)
Related Courses
Neural Networks for Machine LearningUniversity of Toronto via Coursera 機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera Machine Learning Capstone: An Intelligent Application with Deep Learning
University of Washington via Coursera Прикладные задачи анализа данных
Moscow Institute of Physics and Technology via Coursera Leading Ambitious Teaching and Learning
Microsoft via edX