From brute-force graph traversal to Cognitive Attention: an architectural redesign
Summary
The author redesigned the IONS protocol from brute-force graph traversal to a Cognitive Attention Architecture that progressively routes queries through relevant slices of the network, separating path confidence, relevance, and utility.
Similar Articles
The Routing and Filtering Structure of Attention
The paper decomposes the attention interaction matrix into routing (skew-symmetric) and filtering (symmetric) components, introducing S-D attention to disentangle them. It reveals a spectral cascade in routing that predicts where attention can be simplified, achieving significant parameter reduction with minimal perplexity loss.
Learning to Skip Blocks: Self-Discovered Ultrametric Routing for Hardware-Accelerated Sparse Attention
This paper introduces Dynamic Ultrametric Attention, a framework where Transformers learn per-head block-sparse routing topologies during training, which are then offloaded to a custom Triton block-sparse kernel at inference time, achieving up to 28x speedup and 98.4% memory reduction over dense attention.
Rethinking the Role of Efficient Attention in Hybrid Architectures
This paper systematically analyzes the role of efficient attention modules in hybrid language model architectures, finding that different designs converge in long-context performance under sufficient training, and that long-range retrieval is primarily carried by full attention while efficient attention shapes the optimization trajectory, revealing a 'Large-Window Laziness' phenomenon.
Functional Attention: From Pairwise Affinities to Functional Correspondences
Functional Attention is a novel attention mechanism that reinterprets attention as a functional correspondence between adaptive bases, replacing softmax affinities with structured linear operators inspired by geometric functional maps. The method achieves state-of-the-art performance on operator learning tasks including PDE solving and 3D segmentation while remaining resolution-invariant.
hia-gat: A Heterogeneous Interaction-Aware Graph Attention Network For Frame-Level Traffic Conflict Risk Prediction On Freeways
This paper proposes HIA-GAT, a dual-stream heterogeneous graph attention network that integrates longitudinal and lateral vehicle interactions with a conflict-type-aware gating mechanism for frame-level traffic conflict risk prediction on freeways. Experiments on NGSIM datasets show improved risk-ranking performance, particularly for lateral conflicts, and provide interpretable per-vehicle conflict attribution.