Delta Keyword Transformer: Bringing Transformers to the Edge Through Dynamically Pruned Multi-Head Self-Attention
Offered By: tinyML via YouTube
Course Description
Overview
Explore the cutting-edge developments in bringing Transformers to edge devices through a 21-minute conference talk from the tinyML Research Symposium 2022. Delve into the innovative Delta Keyword Transformer, presented by Zuzana Jelčicoová, an Industrial PhD student at Oticon. Learn about dynamically pruned multi-head self-attention and its applications in edge computing. Gain insights into the Keyword Transformer (KWT) model analysis, the Delta algorithm, and its implementations in regular and delta matrix multiplication, as well as softmax operations. Discover the results and implications of this groundbreaking research, concluding with a glimpse into EDGE IMPULSE technology.
Syllabus
Intro
Overview Transformers
Previous work
Keyword Transformer (KWT)
KWT-Model analysis
Delta algorithm
Delta-regular matrix multiplication
Delta-delta matrix multiplication
Delta for softmax
Delta Keyword Transformer
Results 1
Conclusion
Premier: EDGE IMPULSE
Taught by
tinyML
Related Courses
Fog Networks and the Internet of ThingsPrinceton University via Coursera AWS IoT: Developing and Deploying an Internet of Things
Amazon Web Services via edX Business Considerations for 5G with Edge, IoT, and AI
Linux Foundation via edX 5G Strategy for Business Leaders
Linux Foundation via edX Intel® Edge AI Fundamentals with OpenVINO™
Intel via Udacity