spoken-language-models

标签

Cards List
#spoken-language-models

通过激活引导克服全双工语音语言模型中的状态惯性

arXiv cs.CL · 2026-06-11 缓存

本文识别了全双工语音语言模型中的"状态惯性",即在用户打断时,模型的内部预测焦点滞后,并提出了一种无需训练的激活引导方法来改善打断处理。

0 人收藏 0 人点赞
#spoken-language-models

Thinking-while-speaking: A Controlled, Interleaved Reasoning Method for Real-Time Speech Generation

arXiv cs.CL · 2026-05-21 缓存

This paper introduces InterRS, a method for real-time speech generation that interleaves reasoning steps during natural pauses in speech, achieving better performance on math and logic benchmarks while maintaining fluent and instant responses.

0 人收藏 0 人点赞
← 返回首页

提交意见反馈