YoVDO

Applying Second-Order Pruning Algorithms for SOTA Model Compression

Offered By: Neural Magic via YouTube

Tags

Model Compression Courses Machine Learning Courses Neural Networks Courses Image Classification Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the cutting-edge world of second-order pruning algorithms for state-of-the-art model compression in this 42-minute video presentation by Eldar Kurtić, Research Consultant at Neural Magic. Dive into the research, production results, and intuition behind these powerful techniques that enable higher sparsity while maintaining accuracy. Learn how to achieve significant model size reduction, lower latency, and higher throughput by removing weights that least affect the loss function. Discover real-world examples, such as pruning a ResNet-50 image classification model by 95% while retaining 99% of its baseline accuracy, resulting in a dramatic file size reduction from 90.8MB to 9.3MB. Follow along as the speaker guides you through the Optimal Brain Damage Framework, OBS Framework, and Efficient Second-Order Approximation. Gain insights into applications in Classification, Natural Language Processing, and Deep Space Deployment. Understand the intuition behind weight updates and explore open-source implementations. Conclude with a Q&A session to address any questions about applying these advanced pruning algorithms to your machine learning projects.

Syllabus

Introduction
Agenda
What is Pruning
Classification
Observations
Timeline
Optimal Brain Damage Framework
OBS Framework
Efficient SecondOrder Approximation
Results
Natural Language Processing
Deep Space Deployment
Next Step Up Results
Intuition
Weight Update
Open Source
QA Session


Taught by

Neural Magic

Related Courses

TensorFlow Lite for Edge Devices - Tutorial
freeCodeCamp
Few-Shot Learning in Production
HuggingFace via YouTube
TinyML Talks Germany - Neural Network Framework Using Emerging Technologies for Screening Diabetic
tinyML via YouTube
TinyML for All: Full-stack Optimization for Diverse Edge AI Platforms
tinyML via YouTube
TinyML Talks - Software-Hardware Co-design for Tiny AI Systems
tinyML via YouTube