Deploying Transformers at Scale - Addressing Challenges and Increasing Performance
Offered By: Toronto Machine Learning Series (TMLS) via YouTube
Course Description
Overview
Explore the challenges and solutions for deploying Transformer networks at scale in this insightful talk from the Toronto Machine Learning Series. Delve into the world of Natural Language Processing (NLP) as Pieter Luitjens, CTO of Private AI, shares his expertise on optimizing Transformer networks for resource-constrained environments. Learn how to improve latency and throughput for processing large amounts of data, particularly in de-identification tasks. Discover strategies for addressing the unique challenges posed by the size of Transformer networks, including achieving acceptable latency and unit economics. Gain valuable insights on how to effectively process terabytes of data in a cost-effective manner, drawing from real-world experiences in deploying deep learning algorithms in multi-billion dollar industrial projects.
Syllabus
Deploying Transformers at Scale Addressing Challenges and Increasing Performance
Taught by
Toronto Machine Learning Series (TMLS)
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent