spoken-language-models

Overcoming State Inertia in Full-Duplex Spoken Language Models via Activation Steering

arXiv cs.CL · 2026-06-11

This paper identifies 'state inertia' in full-duplex spoken language models, where the model's internal predictive focus lags during user interruptions, and proposes a training-free activation steering method to improve interruption handling.

Thinking-while-speaking: A Controlled, Interleaved Reasoning Method for Real-Time Speech Generation

arXiv cs.CL · 2026-05-21

This paper introduces InterRS, a method for real-time speech generation that interleaves reasoning steps during natural pauses in speech, achieving better performance on math and logic benchmarks while maintaining fluent and instant responses.
