gated-delta-net-2

Tag

Cards List
#gated-delta-net-2

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Hugging Face Daily Papers · 2026-05-21 Cached

Gated DeltaNet-2 introduces separate erase and write gates for linear attention, achieving superior performance in long-context language modeling and retrieval tasks.

0 favorites 0 likes
← Back to home

Submit Feedback