Deploying Transformers at Scale - Addressing Challenges and Increasing Performance
Offered By: Toronto Machine Learning Series (TMLS) via YouTube
Course Description
Overview
Explore the challenges and solutions for deploying Transformer networks at scale in this insightful talk from the Toronto Machine Learning Series. Delve into the world of Natural Language Processing (NLP) as Pieter Luitjens, CTO of Private AI, shares his expertise on optimizing Transformer networks for resource-constrained environments. Learn how to improve latency and throughput for processing large amounts of data, particularly in de-identification tasks. Discover strategies for addressing the unique challenges posed by the size of Transformer networks, including achieving acceptable latency and unit economics. Gain valuable insights on how to effectively process terabytes of data in a cost-effective manner, drawing from real-world experiences in deploying deep learning algorithms in multi-billion dollar industrial projects.
Syllabus
Deploying Transformers at Scale Addressing Challenges and Increasing Performance
Taught by
Toronto Machine Learning Series (TMLS)
Related Courses
Linear CircuitsGeorgia Institute of Technology via Coursera مقدمة في هندسة الطاقة والقوى
King Abdulaziz University via Rwaq (رواق) Magnetic Materials and Devices
Massachusetts Institute of Technology via edX Linear Circuits 2: AC Analysis
Georgia Institute of Technology via Coursera Transmisión de energía eléctrica
Tecnológico de Monterrey via edX