million-token-prefill

@rohanpaul_ai: New Alibaba + Nanjing Univ paper claims million-token prefill can be sped up 9.36X (compared against FlashAttention-2) …

X AI KOLs Timeline · 2026-05-24

A new paper from Alibaba and Nanjing University introduces RTPurbo, a method the authors claim speeds up million-token prefill by up to 9.36x over FlashAttention-2. It applies full attention only where needed and does not require retraining the model.
