Quantization in Neural Networks - Lecture 5

Offered By: MIT HAN Lab via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Dive into the world of neural network quantization in this comprehensive lecture from MIT's TinyML and Efficient Deep Learning Computing course. Explore numeric data types in modern computing systems and gain insights into K-means-based quantization and linear quantization techniques. Learn how to optimize deep learning models for resource-constrained devices, enabling powerful AI applications on mobile and IoT platforms. Discover strategies for efficient inference, including model compression, pruning, and neural architecture search. Gain hands-on experience implementing deep learning applications on microcontrollers, mobile phones, and quantum machines through an open-ended design project focused on mobile AI.

Syllabus

Lecture 05 - Quantization (Part I) | MIT 6.S965

Taught by

MIT HAN Lab

Related Courses

Digital Signal Processing
École Polytechnique Fédérale de Lausanne via Coursera Principles of Communication Systems - I
Indian Institute of Technology Kanpur via Swayam Digital Signal Processing 2: Filtering
École Polytechnique Fédérale de Lausanne via Coursera Digital Signal Processing 3: Analog vs Digital
École Polytechnique Fédérale de Lausanne via Coursera Digital Signal Processing 4: Applications
École Polytechnique Fédérale de Lausanne via Coursera