YoVDO

Profiling Deep Learning Applications with NVIDIA Tools Extension (NVTX)

Offered By: Nvidia via YouTube

Tags

GPU Computing Courses Memory Management Courses

Course Description

Overview

Discover how to leverage the NVIDIA Tools Extension Library (NVTX) to enhance deep learning application profiling in this 24-minute webinar from Nvidia. Explore techniques for annotating code to enrich profiler timelines, gaining insights into network performance optimization. Learn about the Nsight product family, NVTX syntax, and best practices for memory pinning and asynchronous data copy. Dive into practical examples of backward propagation with and without Automatic Mixed Precision (AMP), and understand how to identify the right performance knobs to tune your deep learning applications effectively.

Syllabus

Intro
A TYPICAL SCENARIO
HOW TO SPEED-UP NETWORK
WHAT'S GOING ON?
NSIGHT PRODUCT FAMILY
NSIGHT SYSTEMS Overview
NVIDIA TOOLS EXTENSION (NVTX)
ADDING NVTX START AND END OPERATIONS
NSIGHT SYSTEMS PROFILE
SIMPLE TRAINING PROGRAM
NVTX SYNTAX Backward propagation with or without using AMP
2ND PROFILING SESSION
FINDING THE RIGHT KNOB
BEST PRACTICES (4)
MEMORY PINNING
ASYNCHRONOUS DATA COPY


Taught by

NVIDIA Developer

Tags

Related Courses

Heterogeneous Parallel Programming
University of Illinois at Urbana-Champaign via Coursera
Advanced Operating Systems
Georgia Institute of Technology via Udacity
計算機程式設計 (Computer Programming)
National Taiwan University via Coursera
Introduction to Operating Systems
Georgia Institute of Technology via Udacity
Android Performance
Google via Udacity