SystemsFlashAttention
Block Pointers and Multi-Dim Support
PremiumScale from single sequence to Batch/Head parallelism and simplify pointer math with block pointers.
Companion CodeLog in to continue reading
This is premium content. Please log in to access the full article.
CookLLM Docs