speech-models

#speech-models

Perceptual compensation for tonal context in self-supervised speech models

arXiv cs.CL ↗ · 2026-06-17 Cached

This paper investigates whether the wav2vec2.0 architecture exhibits perceptual compensation for tonal context in Mandarin Chinese, finding limited evidence in the self-supervised model compared to human listeners and suggesting that supervised fine-tuning may be necessary for such phonological abstraction.

0 favorites 0 likes

#speech-models

@kyutai_labs: New paper: Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models We use RL to post-train speech models (Mo…

X AI KOLs Following ↗ · 2026-06-10 Cached

Kyutai Labs released a new paper on using reinforcement learning to post-train speech models (Moshi and PersonaPlex) for more human-like interaction, including when to respond, wait, or give listening cues.

0 favorites 0 likes

speech-models

Perceptual compensation for tonal context in self-supervised speech models

@kyutai_labs: New paper: Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models We use RL to post-train speech models (Mo…

Submit Feedback