YoVDO

Profiling Deep Learning Applications with NVIDIA Tools Extension (NVTX)

Offered By: Nvidia via YouTube

Tags

GPU Computing Courses Memory Management Courses

Course Description

Overview

Discover how to leverage the NVIDIA Tools Extension Library (NVTX) to enhance deep learning application profiling in this 24-minute webinar from Nvidia. Explore techniques for annotating code to enrich profiler timelines, gaining insights into network performance optimization. Learn about the Nsight product family, NVTX syntax, and best practices for memory pinning and asynchronous data copy. Dive into practical examples of backward propagation with and without Automatic Mixed Precision (AMP), and understand how to identify the right performance knobs to tune your deep learning applications effectively.

Syllabus

Intro
A TYPICAL SCENARIO
HOW TO SPEED-UP NETWORK
WHAT'S GOING ON?
NSIGHT PRODUCT FAMILY
NSIGHT SYSTEMS Overview
NVIDIA TOOLS EXTENSION (NVTX)
ADDING NVTX START AND END OPERATIONS
NSIGHT SYSTEMS PROFILE
SIMPLE TRAINING PROGRAM
NVTX SYNTAX Backward propagation with or without using AMP
2ND PROFILING SESSION
FINDING THE RIGHT KNOB
BEST PRACTICES (4)
MEMORY PINNING
ASYNCHRONOUS DATA COPY


Taught by

NVIDIA Developer

Tags

Related Courses

Моделирование биологических молекул на GPU (Biomolecular modeling on GPU)
Moscow Institute of Physics and Technology via Coursera
Practical Deep Learning For Coders
fast.ai via Independent
GPU Architectures And Programming
Indian Institute of Technology, Kharagpur via Swayam
Perform Real-Time Object Detection with YOLOv3
Coursera Project Network via Coursera
Getting Started with PyTorch
Coursera Project Network via Coursera