Deploying Deep Learning Models for Inference at Production Scale
Offered By: Applied Singularity via YouTube
Course Description
Overview
Explore a comprehensive session from NVIDIA Discovery Bengaluru focused on deploying AI models at production scale. Learn about two key NVIDIA resources: TensorRT, an SDK that optimizes trained neural network models and accelerates inference across GPU platforms, and Triton Inference Server, open-source software that provides a standardized inference serving platform across infrastructures. Access the accompanying PowerPoint presentation for detailed insights. To keep up with the latest advancements in AI, machine learning, deep learning, and generative AI, join the Applied Singularity Meetup group and download their free mobile app, available on iOS and Android.
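To ground the workflow the session covers, the sketch below compiles an ONNX model into a serialized TensorRT engine using the TensorRT Python API. This is a minimal sketch, assuming the TensorRT 8.x API; the paths model.onnx and model.plan are placeholders, not files from the session.

```python
# Minimal sketch: compile an ONNX model into a TensorRT engine.
# Assumes the TensorRT 8.x Python API; "model.onnx" / "model.plan" are placeholder paths.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
# Explicit-batch networks are required when parsing ONNX models.
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("Failed to parse the ONNX model")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)  # enable FP16 kernels where supported

# Build and save the serialized engine ("plan") for deployment.
engine_bytes = builder.build_serialized_network(network, config)
with open("model.plan", "wb") as f:
    f.write(engine_bytes)
```

Once a model is served by Triton Inference Server, clients can reach it over HTTP or gRPC through the official tritonclient package. The sketch below is illustrative only: the model name resnet50 and the tensor names input and output are hypothetical and would have to match the deployed model's configuration.

```python
# Minimal sketch: send an inference request to a running Triton server.
# The model and tensor names below are hypothetical placeholders.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Prepare a dummy batch matching the model's expected input shape.
infer_input = httpclient.InferInput("input", [1, 3, 224, 224], "FP32")
infer_input.set_data_from_numpy(
    np.random.rand(1, 3, 224, 224).astype(np.float32)
)
requested_output = httpclient.InferRequestedOutput("output")

result = client.infer(
    model_name="resnet50",
    inputs=[infer_input],
    outputs=[requested_output],
)
print(result.as_numpy("output").shape)
```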
Syllabus
Deploying Deep Learning Models for Inference at Production Scale - at NVIDIA
Taught by
Applied Singularity
Related Courses
Developing a Tabular Data Model (Microsoft via edX)
Data Science in Action - Building a Predictive Churn Model (SAP Learning)
Serverless Machine Learning with Tensorflow on Google Cloud Platform 日本語版 (Google Cloud via Coursera)
Intro to TensorFlow em Português Brasileiro (Google Cloud via Coursera)
Serverless Machine Learning con TensorFlow en GCP (Google Cloud via Coursera)