
ThunderKittens from Together AI: a new language for efficient GPU kernels

Source: Together AI Blog. Collage: Hamidun News.

Together AI released ThunderKittens, a compact programming language, embedded in CUDA, for writing optimized GPU kernels. On the H100 GPU, an attention kernel written in it runs noticeably faster than the standard FlashAttention-2 implementation. The interface resembles PyTorch, so ML engineers can pick it up quickly. The authors say openly that it is an experimental project. The code is fully open source and is already integrated with NanoGPT, so developers can try it in model training.
