sequence-models

#sequence-models

The Dark Regulome: Disentangling Predictability from Regulation in Genomic Foundation Models

arXiv cs.CL · 3d ago

This paper introduces a residualization-and-permutation diagnostic to separate predictability-driven from regulation-driven variance in regulatory importance scores from genomic foundation models, applied to dark genome elements at glioma-relevant loci.

SHARP: Sleep-based Hierarchical Accelerated Replay for Long Range Non-Stationary Temporal Pattern Recognition

arXiv cs.AI · 2026-06-02

SHARP introduces a bio-inspired framework that separates memory accumulation from pattern recognition, using accelerated replay during offline sleep phases to learn long-range non-stationary temporal patterns in streaming settings. It improves context retention on text8 and PG-19 while maintaining computational efficiency.

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

arXiv cs.CL · 2026-05-27

This paper formalizes the sufficiency gap in next-token prediction, demonstrating that even ideal sequence models can become overconfident when textual prefixes are not sufficient statistics for latent circumstances. It proposes an external observer mechanism to reduce but not eliminate this gap.

Conditional Attribute Estimation with Autoregressive Sequence Models

arXiv cs.AI · 2026-05-15

This paper introduces Conditional Attribute Transformers, a method that jointly estimates next-token probabilities and conditional attribute values, enabling credit assignment, counterfactual analysis, and steerable generation in a single forward pass.

Toeplitz MLP Mixers are Low Complexity, Information-Rich Sequence Models

arXiv cs.LG · 2026-05-11

This paper introduces Toeplitz MLP Mixers (TMM), a novel architecture that replaces attention with Toeplitz matrix multiplication to achieve lower computational complexity while maintaining high information retention and training efficiency.
