SGD and Weight Decay Secretly Compress Your Neural Network

Offered By: MITCBMM via YouTube

Course Description

Overview

Explore how Stochastic Gradient Descent (SGD) and weight decay implicitly compress neural networks in this 55-minute conference talk by Tomer Galanti from MIT. The talk examines the mechanisms behind this hidden compression effect and what they imply for the efficiency and performance of deep learning models.

Syllabus

SGD and Weight Decay Secretly Compress Your Neural Network

Taught by

MITCBMM

Related Courses

Neural Networks for Machine Learning
University of Toronto via Coursera

機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera

Machine Learning Capstone: An Intelligent Application with Deep Learning
University of Washington via Coursera

Прикладные задачи анализа данных (Applied Problems in Data Analysis)
Moscow Institute of Physics and Technology via Coursera

Leading Ambitious Teaching and Learning
Microsoft via edX
