Understanding Generalization from Pre-training Loss to Downstream Tasks
Offered By: Simons Institute via YouTube
Course Description
Overview
Explore how pre-trained models generalize to downstream tasks in this lecture by Tengyu Ma of Stanford University. Delve into the role of pre-training losses in extracting meaningful structural information from unlabeled data, with a focus on the infinite-data regime. Examine how contrastive losses produce embeddings that capture the manifold distance between raw data points and the graph distance on the positive-pair graph. Investigate how directions in the embedding space relate to the cluster structure of the positive-pair graph. Discover recent results that incorporate architectural inductive bias and demonstrate the implicit bias of optimizers in pre-training. Gain insight into the theoretical frameworks and empirical evidence supporting these concepts, shedding light on the behavior of practical pre-trained models in AI and machine learning.
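To make the idea of a contrastive loss over positive pairs concrete, below is a minimal, illustrative PyTorch sketch of a spectral-style contrastive loss: it rewards large inner products between embeddings of positive pairs and penalizes large inner products between unrelated in-batch examples. The function name, the in-batch approximation of the expectations, and the encoder interface are assumptions for illustration, not necessarily the exact formulation analyzed in the lecture.

```python
# Illustrative sketch only: a spectral-style contrastive loss on positive pairs.
# Assumes a user-supplied encoder f that maps augmented views to embeddings.
import torch

def spectral_contrastive_loss(z1: torch.Tensor, z2: torch.Tensor) -> torch.Tensor:
    """z1, z2: (batch, dim) embeddings of the two views in each positive pair.

    Approximates  -2 * E[f(x)^T f(x+)] + E[(f(x)^T f(x'))^2],
    where (x, x+) are positive pairs and (x, x') are unrelated examples,
    using matched and mismatched rows within a batch.
    """
    batch, _ = z1.shape
    # Attract positive pairs: inner products of matched rows.
    pos = (z1 * z2).sum(dim=1).mean()
    # Repel unrelated pairs: squared inner products across the batch,
    # excluding the matched (diagonal) entries.
    gram = z1 @ z2.T
    off_diag = gram - torch.diag(torch.diagonal(gram))
    neg = (off_diag ** 2).sum() / (batch * (batch - 1))
    return -2 * pos + neg

# Hypothetical usage with an encoder f and two augmented views x1, x2:
#   z1, z2 = f(x1), f(x2)
#   loss = spectral_contrastive_loss(z1, z2)
```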
Syllabus
Understanding Generalization from Pre-training Loss to Downstream Tasks
Taught by
Simons Institute
Related Courses
Stanford Seminar - Audio Research: Transformers for Applications in Audio, Speech and Music (Stanford University via YouTube)
How to Represent Part-Whole Hierarchies in a Neural Network - Geoff Hinton's Paper Explained (Yannic Kilcher via YouTube)
OpenAI CLIP - Connecting Text and Images - Paper Explained (Aleksa Gordić - The AI Epiphany via YouTube)
Learning Compact Representation with Less Labeled Data from Sensors (tinyML via YouTube)
Human Activity Recognition - Learning with Less Labels and Privacy Preservation (University of Central Florida via YouTube)