attention-entropy

Neural Activation Patterns Across Language Model Architectures: A Comprehensive Analysis of Cognitive Task Performance

arXiv cs.CL · 2026-05-18

This paper analyzes neural activation patterns across six LLM architectures on cognitive tasks, revealing differences in attention entropy and sparsity between encoder and decoder models.

High-Fidelity KV Cache Summarization Using Entropy and Low-Rank Reconstruction

Hacker News Top · 2026-04-19

Proposes an SRC pipeline that summarizes the KV cache through entropy-based selection and low-rank reconstruction rather than pruning tokens, reducing VRAM usage for million-token LLM contexts while avoiding catastrophic attention errors.
