YoVDO

Big Data Is Low Rank - Madeleine Udell, Cornell University

Offered By: Alan Turing Institute via YouTube

Tags

Principal Components Analysis Courses Computer Science Courses Statistics & Probability Courses Machine Learning Courses Implicit Regularization Courses

Course Description

Overview

Explore the intersection of statistics and computer science in machine learning through this 46-minute lecture by Madeleine Udell from Cornell University. Delve into the concept of low-rank structures in big data, covering topics such as principal components analysis, generalized low-rank models, and their applications in various fields. Learn about fitting GLRMs using alternating minimization, imputing missing and heterogeneous data, and validating models. Discover how low-rank models can be applied to dimensionality reduction, autoML, and experiment design for timely model selection. Examine real-world examples, including hospitalizations and political data tables, to understand the practical implications of low-rank structures. Gain insights into latent variable models and their approximate low-rank nature, bridging the gap between statistical theory and computational efficiency in the era of Big Data.

Syllabus

Intro
Data table: politics
Principal components analysis
Generalized low rank model
Fitting GLRMs with alternating minimization
Impute missing data
Impute heterogeneous data
Example: Julia implementation
Validate model
Hospitalizations are low rank
Low rank models for dimensionality reduction
Low rank autoML
Low rank fit correctly identifies best algorithm type
Experiment design for timely model selection
Latent variable models examples
Approximate low rank
Rank of nice latent variable models?


Taught by

Alan Turing Institute

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Statistics One
Princeton University via Coursera
Intro to Statistics
Stanford University via Udacity
Passion Driven Statistics
Wesleyan University via Coursera