Together AI Blog→ original

Together AI: how kernel optimizations close the gap between models and GPUs

Together AI’s team adapted CUDA kernels for the new Blackwell GPUs in one week — work NVIDIA had spent a year on. All thanks to FlashAttention (2022) and Thunde

Together AI: how kernel optimizations close the gap between models and GPUs
Source: Together AI Blog. Collage: Hamidun News.
◐ Listen to article

Together AI’s team adapted CUDA kernels for the new Blackwell GPUs in one week — work NVIDIA had spent a year on. All thanks to FlashAttention (2022) and ThunderKittens. This closes the gap between model mathematics and real hardware power.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.
What do you think?
Loading comments…