state-space-model

#state-space-model

DTVEM-RE: A Hierarchical Random-Effects Extension of the Differential Time-Varying Effect Model for Person-Specific Multi-Lag Estimation in Intensive Longitudinal Data

arXiv cs.LG ↗ · 13h ago Cached

This paper presents DTVEM-RE, a hierarchical random-effects extension of the Differential Time-Varying Effect Model that estimates person-specific multi-lag coefficients via Hamiltonian Monte Carlo in Stan. It addresses a limitation of the original DTVEM, which assumed a single group-level lag structure. Simulation and empirical results demonstrate recovery of between-person variance and improvements over hierarchical and non-hierarchical baselines.

#state-space-model

Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM

arXiv cs.CL ↗ · 2026-06-04 Cached

This paper proposes a query-based cross-modal projector that compresses visual tokens via cross-attention to improve Mamba-based multimodal LLMs, boosting both performance and throughput on vision-language benchmarks while eliminating the need for manual 2D scan order design.

#state-space-model

LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling

arXiv cs.CL ↗ · 2026-06-04 Cached

LDARNet is a 120M-parameter hierarchical genomic foundation model that introduces learnable adaptive tokenization (inspired by H-Net's dynamic chunking) for masked language modeling on DNA sequences. It achieves state-of-the-art results on 5 histone modification tasks and outperforms models up to 20× larger on several genomic benchmarks, with learned token boundaries aligning with biological features like promoter motifs and splice junctions.

#state-space-model

EnergyMamba: An Uncertainty-Aware Graph-Enhanced Selective State Space Model for Energy Consumption Prediction

arXiv cs.AI ↗ · 2026-06-02 Cached

EnergyMamba proposes a novel spatiotemporal framework combining a graph-enhanced selective state space model and adaptive conformalized quantile regression for accurate and reliable energy consumption prediction with uncertainty estimates, achieving improvements on real-world datasets from Florida, New York, and California.

#state-space-model

Language Models Need Sleep

Hugging Face Daily Papers ↗ · 2026-05-25 Cached

This paper proposes a sleep-like consolidation mechanism for transformer models that uses fast weights and recurrent passes to improve long-context processing while maintaining inference speed.

#state-space-model

Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view Generation

Hugging Face Daily Papers ↗ · 2026-05-24 Cached

MVCHead is a novel method for generating 3D Gaussian head avatars from single 2D images without multi-view data, using hierarchical state space models and multi-view consistency enforcement.

#state-space-model

PIMSM: Physics-Informed Multi-Scale Mamba for Stable Neural Representations under Distribution Shift

arXiv cs.LG ↗ · 2026-05-19 Cached

This paper proposes Physics-Informed Multi-Scale Mamba (PIMSM), a state-space architecture that aligns model memory with physical timescales to improve robustness under distribution shift in scientific time series, demonstrating improvements on fMRI and weather forecasting tasks.

#state-space-model

@_albertgu: Introducing a new sequence model Raven which pushes the boundary of fixed-state-size sequence models! Raven bridges pop…

X AI KOLs Timeline ↗ · 2026-05-07

Researchers introduce Raven, a novel sequence model that merges state space model efficiency with a selective slot-updating mechanism inspired by sliding window attention to improve long-context retrieval. The approach offers a more principled alternative to existing linear-time models.
