YoVDO

Hessian-Aware Stochastic Differential Equation Modeling of SGD

Offered By: Erwin Schrödinger International Institute for Mathematics and Physics (ESI) via YouTube

Tags

Stochastic Gradient Descent Courses Machine Learning Courses Stochastic Differential Equation Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 29-minute conference talk from the "One World Optimization Seminar in Vienna" workshop held at the Erwin Schrödinger International Institute for Mathematics and Physics (ESI) in June 2024. Delve into the development of a novel Stochastic Differential Equation (SDE) model for Stochastic Gradient Descent (SGD) that incorporates Hessian information. Learn about the Hessian-Aware Stochastic Modified Equation (HA-SME) and its advantages over existing SDE models in capturing SGD's escaping behaviors. Discover how HA-SME achieves improved approximation error guarantees and reduced dependence on smoothness parameters. Understand the significance of HA-SME as the first SDE model to exactly recover SGD dynamics for quadratic objectives under certain conditions.

Syllabus

Niao He - A Hessian-Aware Stochastic Differential Equation Modelling of SGD


Taught by

Erwin Schrödinger International Institute for Mathematics and Physics (ESI)

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent