GPU Programming Basics
Learn CUDA and Triton, and write efficient GPU kernels
Overview
Before diving into advanced optimizations like Flash Attention, we need the fundamentals of GPU programming. This module starts from scratch: it covers how GPUs work and how to write efficient kernels with Triton.
This module is a prerequisite for the Systems track. We recommend completing it before Flash Attention.