AI Model Efficiency Toolkit (AIMET) - Lecture 25

Offered By: MIT HAN Lab via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore the world of AI model efficiency in this guest lecture from Qualcomm AI Research. Dive into the challenges of deploying neural networks on mobile and IoT devices, and discover cutting-edge solutions for efficient machine learning. Learn about Qualcomm's core technologies, adaptive rounding, autocon, and transformer quantization. Gain insights into image model zoos, conditional compute, and the Qualcomm Snapdragon 8 Gen 2. Witness a live demo showcasing the practical applications of these efficiency techniques. Perfect for those interested in TinyML, efficient deep learning computing, and the future of AI on resource-constrained devices.

Syllabus

Introduction
Challenges
Qualcomm AI Research
Qualcomm Core Technologies
Layers of Interest
Inference
Features
Adaptive Rounding
Autocon
Training
Image Model Zoo
Transformer Quantization
Snapdragon Gen 2
GitHub
Qualcomm Snapdragon 8
Conditional Compute
Demo

Taught by

MIT HAN Lab

Related Courses

Machine Learning Modeling Pipelines in Production
DeepLearning.AI via Coursera MLOps for Scaling TinyML
Harvard University via edX Parameter Prediction for Unseen Deep Architectures - With First Author Boris Knyazev
Yannic Kilcher via YouTube SpineNet - Learning Scale-Permuted Backbone for Recognition and Localization
Yannic Kilcher via YouTube Synthetic Petri Dish - A Novel Surrogate Model for Rapid Architecture Search
Yannic Kilcher via YouTube