full-duplex-speech

Tag

Cards List
#full-duplex-speech

Liberating LLM Capabilities in Full-Duplex Speech Models

Hugging Face Daily Papers · 2026-05-04 Cached

Proposes Listen-Write-Speak (LWS), a text-first tri-channel paradigm that allows a single autoregressive LLM to continuously listen, write visible text, and speak in real-time, enabling full-duplex speech interaction without architectural modifications.

0 favorites 0 likes
#full-duplex-speech

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Papers with Code Trending · 2024-10-23 Cached

OmniFlatten is a novel GPT-based model that enables real-time, full-duplex spoken dialogue through a multi-stage post-training technique that integrates speech and text without altering the original architecture.

0 favorites 0 likes
← Back to home

Submit Feedback