YoVDO

CUTLASS: A CUDA C++ Template Library for Accelerating Deep Learning Computations

Offered By: Linux Foundation via YouTube

Tags

CUDA Courses Deep Learning Courses C++ Courses Linear Algebra Courses GPU Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore CUTLASS, an open-source CUDA C++ template library designed to accelerate deep learning computations on NVIDIA GPUs. Dive into the core concepts of GPU computing for machine learning and AI applications, focusing on optimizing linear algebra operations like matrix multiplication and convolutions. Learn how CUTLASS has been instrumental since 2017 in helping developers create high-performance CUDA kernels across various NVIDIA GPU architectures. Gain insights into Tensor Core programming and discover how to leverage CUTLASS's modular abstractions and building blocks to develop custom CUDA C++ kernels that maximize performance for deep learning tasks. Acquire actionable knowledge to push the limits of GPU performance in AI applications like ChatGPT and Github Copilot.

Syllabus

CUTLASS: A CUDA C++ Template Library for Accelerating Deep Learning... Aniket Shivam & Vijay Thakkar


Taught by

Linux Foundation

Tags

Related Courses

Coding the Matrix: Linear Algebra through Computer Science Applications
Brown University via Coursera
Mathematical Methods for Quantitative Finance
University of Washington via Coursera
Introduction à la théorie de Galois
École normale supérieure via Coursera
Linear Algebra - Foundations to Frontiers
The University of Texas at Austin via edX
Massively Multivariable Open Online Calculus Course
Ohio State University via Coursera