memory-modeling

Tag

Cards List
#memory-modeling

Dynamic Linear Attention

arXiv cs.CL · 6d ago Cached

This paper proposes DLA, a dynamic memory modeling framework for multi-state linear attention that adaptively merges states based on token information variation and maintains a fixed-size state cache, enabling better long-context representation without the quadratic complexity of standard attention.

0 favorites 0 likes
#memory-modeling

Dynamic Linear Attention

Hugging Face Daily Papers · 2026-06-09 Cached

DLA introduces adaptive state merging and capacity-bounded memory modeling for multi-state linear attention, improving long-context LLM performance.

0 favorites 0 likes
← Back to home

Submit Feedback