Deploying Transformers at Scale - Addressing Challenges and Increasing Performance
Offered By: Toronto Machine Learning Series (TMLS) via YouTube
Course Description
Overview
Explore the challenges and solutions for deploying Transformer networks at scale in this insightful talk from the Toronto Machine Learning Series. Delve into the world of Natural Language Processing (NLP) as Pieter Luitjens, CTO of Private AI, shares his expertise on optimizing Transformer networks for resource-constrained environments. Learn how to improve latency and throughput for processing large amounts of data, particularly in de-identification tasks. Discover strategies for addressing the unique challenges posed by the size of Transformer networks, including achieving acceptable latency and unit economics. Gain valuable insights on how to effectively process terabytes of data in a cost-effective manner, drawing from real-world experiences in deploying deep learning algorithms in multi-billion dollar industrial projects.
Syllabus
Deploying Transformers at Scale Addressing Challenges and Increasing Performance
Taught by
Toronto Machine Learning Series (TMLS)
Related Courses
Coding the Matrix: Linear Algebra through Computer Science ApplicationsBrown University via Coursera كيف تفكر الآلات - مقدمة في تقنيات الحوسبة
King Fahd University of Petroleum and Minerals via Rwaq (رواق) Datascience et Analyse situationnelle : dans les coulisses du Big Data
IONIS via IONIS Data Lakes for Big Data
EdCast 統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco