CUTLASS: A CUDA C++ Template Library for Accelerating Deep Learning Computations
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore CUTLASS, an open-source CUDA C++ template library designed to accelerate deep learning computations on NVIDIA GPUs. Dive into the core concepts of GPU computing for machine learning and AI applications, focusing on optimizing linear algebra operations like matrix multiplication and convolutions. Learn how CUTLASS has been instrumental since 2017 in helping developers create high-performance CUDA kernels across various NVIDIA GPU architectures. Gain insights into Tensor Core programming and discover how to leverage CUTLASS's modular abstractions and building blocks to develop custom CUDA C++ kernels that maximize performance for deep learning tasks. Acquire actionable knowledge to push the limits of GPU performance in AI applications like ChatGPT and Github Copilot.
Syllabus
CUTLASS: A CUDA C++ Template Library for Accelerating Deep Learning... Aniket Shivam & Vijay Thakkar
Taught by
Linux Foundation
Tags
Related Courses
Computer GraphicsUniversity of California, San Diego via edX Intro to Parallel Programming
Nvidia via Udacity Initiation à la programmation (en C++)
École Polytechnique Fédérale de Lausanne via Coursera C++ For C Programmers, Part A
University of California, Santa Cruz via Coursera Introduction à la programmation orientée objet (en C++)
École Polytechnique Fédérale de Lausanne via Coursera