YoVDO

Introduction to Optimization Algorithms to Compress Neural Networks

Offered By: tinyML via YouTube

Tags

Deep Learning Courses Embedded Systems Courses Quantization Courses Optimization Algorithms Courses

Course Description

Overview

Explore optimization algorithms for compressing neural networks in this tinyML Talks webcast. Dive into the challenges of deploying advanced networks on resource-constrained systems and learn about various compression techniques. Discover the functionality of common compression algorithms, including pruning, quantization, and knowledge distillation. Examine the pros and cons of different pruning techniques, and explore concepts such as lowend factorization, fast convolutional networks, and selective attention networks. Gain insights into general use cases and the process of pruning whole channels. This comprehensive talk, presented by Marcus Rüb from the Hahn-Schickard Research Institute, provides a valuable introduction to making neural networks more efficient for embedded devices and mobile applications.

Syllabus

Introduction
Local Meetup
What is tinyML
Quantization
Knowledge distillation
Pruning
Pruning techniques
Pros and cons
Lowend factorization
Fast convolutional
Selective attention network
Summary
Questions
General use cases
Pruning whole channels
Conclusion


Taught by

tinyML

Related Courses

Digital Signal Processing
École Polytechnique Fédérale de Lausanne via Coursera
Principles of Communication Systems - I
Indian Institute of Technology Kanpur via Swayam
Digital Signal Processing 2: Filtering
École Polytechnique Fédérale de Lausanne via Coursera
Digital Signal Processing 3: Analog vs Digital
École Polytechnique Fédérale de Lausanne via Coursera
Digital Signal Processing 4: Applications
École Polytechnique Fédérale de Lausanne via Coursera