dirichlet-kernels

Blurry Window Attention

arXiv cs.LG · 2026-06-10

Introduces Blurry Window Attention (BLA), a novel attention method with bounded-memory control that reconstructs a blurry KV history via Dirichlet kernel interpolation, achieving 8x the state efficiency of Sliding Window Attention on the Multi-Query Associative Recall task.
