YoVDO

Delta Keyword Transformer: Bringing Transformers to the Edge Through Dynamically Pruned Multi-Head Self-Attention

Offered By: tinyML via YouTube

Tags

Transformers Courses Machine Learning Courses Edge Computing Courses Model Optimization Courses Matrix Multiplication Courses

Course Description

Overview

Explore the cutting-edge developments in bringing Transformers to edge devices through a 21-minute conference talk from the tinyML Research Symposium 2022. Delve into the innovative Delta Keyword Transformer, presented by Zuzana Jelčicoová, an Industrial PhD student at Oticon. Learn about dynamically pruned multi-head self-attention and its applications in edge computing. Gain insights into the Keyword Transformer (KWT) model analysis, the Delta algorithm, and its implementations in regular and delta matrix multiplication, as well as softmax operations. Discover the results and implications of this groundbreaking research, concluding with a glimpse into EDGE IMPULSE technology.

Syllabus

Intro
Overview Transformers
Previous work
Keyword Transformer (KWT)
KWT-Model analysis
Delta algorithm
Delta-regular matrix multiplication
Delta-delta matrix multiplication
Delta for softmax
Delta Keyword Transformer
Results 1
Conclusion
Premier: EDGE IMPULSE


Taught by

tinyML

Related Courses

Fog Networks and the Internet of Things
Princeton University via Coursera
AWS IoT: Developing and Deploying an Internet of Things
Amazon Web Services via edX
Business Considerations for 5G with Edge, IoT, and AI
Linux Foundation via edX
5G Strategy for Business Leaders
Linux Foundation via edX
Intel® Edge AI Fundamentals with OpenVINO™
Intel via Udacity