YoVDO

Accelerating Transformers with Hugging Face Optimum and Infinity

Offered By: MLOps World: Machine Learning in Production via YouTube

Tags

Hugging Face Courses Machine Learning Courses Transformers Courses Model Optimization Courses Containerization Courses Hardware Acceleration Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the acceleration of Transformer models in this comprehensive talk from MLOps World: Machine Learning in Production. Discover Hugging Face's efforts to enhance prediction speed for Transformer models, focusing on two key tools: Optimum and Infinity. Learn about Optimum, an open-source library designed to optimize Transformer training and deployment on specific hardware. Gain insights into Infinity, a containerized solution that achieves millisecond-scale latencies in production environments. Benefit from the expertise of Lewis Tunstall and Philipp Schmid, Machine Learning Engineers at Hugging Face, as they discuss strategies for balancing model accuracy with performance requirements in real-world NLP applications. Understand how these tools can help overcome challenges related to model size and speed, enabling more efficient implementation of state-of-the-art NLP models in various business contexts.

Syllabus

Accelerating Transformers with Hugging Face Optimum and Infinity


Taught by

MLOps World: Machine Learning in Production

Related Courses

3D-печать для всех и каждого
Tomsk State University via Coursera
Developing a Multidimensional Data Model
Microsoft via edX
Launching into Machine Learning 日本語版
Google Cloud via Coursera
Art and Science of Machine Learning 日本語版
Google Cloud via Coursera
Launching into Machine Learning auf Deutsch
Google Cloud via Coursera