SWIS: Shared Weight Bit Sparsity for Efficient Neural Network Acceleration
Offered By: tinyML via YouTube
Course Description
Overview
Explore the SWIS – Shared Weight bIt Sparsity framework for efficient neural network acceleration in this 20-minute conference talk from the tinyML Research Symposium 2021. Delve into the quantization technique that improves performance and storage compression through offline weight decomposition and scheduling algorithms. Learn how SWIS achieves significant accuracy improvements when quantizing MobileNet-v2, and discover its potential for up to 6X speedup and 1.8X energy improvement over state-of-the-art bit-serial architectures. Follow the presentation as it covers the introduction, base sparsity, quantization error, base serial multiplier, SWIS architecture, computation animation, scheduling, retraining, and concludes with a Q&A session.
Syllabus
Introduction
Why we need SWIS
Base Sparsity
Quantization Error
Base Serial Multiplier
SWIS Architecture
SWIS Computation Animation
SWIS Scheduling
SWIS Retraining
Questions
Sponsors
Taught by
tinyML
Related Courses
Bayes Classifier on DataprocGoogle via Google Cloud Skills Boost Llama for Python Programmers
University of Michigan via Coursera Quantization Fundamentals with Hugging Face
DeepLearning.AI via Coursera Quantization in Depth
DeepLearning.AI via Coursera Working with Llama 3
DataCamp