dialogue

Tag

Cards List
#dialogue

Seeing Is Not Sharing: Some Vision-Language Models Overestimate Common Ground in Asymmetric Dialogue

arXiv cs.CL · 2d ago Cached

This paper investigates whether vision-language models can distinguish potential from established common ground in asymmetric dialogue. Experiments on MapTask data show that providing task-relevant map content (visual or textual) biases models toward over-predicting alignment, as they rely on static referential cues rather than tracking grounding through dialogue history.

0 favorites 0 likes
#dialogue

Why thinking out loud with someone beats thinking alone

Hacker News Top · 2026-06-17 Cached

An essay exploring why thinking out loud with another person produces better understanding and insight than solitary reflection, drawing on cognitive science and philosophy.

0 favorites 0 likes
#dialogue

Dialogue SWE-Bench: A Benchmark for Dialogue-Driven Coding Agents

arXiv cs.CL · 2026-06-15 Cached

Introduces Dialogue-SWE-Bench, a benchmark for evaluating coding agents' ability to resolve software engineering problems through dialogue with a user. Proposes a persona-grounded user simulator and a schema-guided agent that improves dialogue capabilities.

0 favorites 0 likes
#dialogue

ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models

arXiv cs.CL · 2026-06-10 Cached

ParaBridge is an on-policy self-distillation method that bridges the gap between paralinguistic perception and dialogue behavior in speech language models, significantly improving safety and empathy without external rewards.

0 favorites 0 likes
#dialogue

Expert-Level Crisis Detection in Mental Health Conversations

arXiv cs.CL · 2026-06-10 Cached

Introduces CRADLE-Dialogue, a clinician-annotated benchmark for turn-level crisis detection in mental health conversations, along with an Alert–Confirm evaluation protocol and a synthetic training corpus plus a 32B model that outperforms existing open-source and proprietary models.

0 favorites 0 likes
#dialogue

$\Psi$-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

arXiv cs.LG · 2026-06-03 Cached

Ψ-Bench is a benchmark for evaluating LLMs' ability to influence users through persuasive dialogues, incorporating user profiles for personalized persuasion. Experiments show that even state-of-the-art models have room for improvement, and access to client profiles significantly boosts performance.

0 favorites 0 likes
#dialogue

Accommodation Goes Both Ways: Studying Linguistic Convergence Between Humans and Language Models

arXiv cs.CL · 2026-05-29 Cached

This paper studies how humans and large language models linguistically accommodate each other during multi-turn conversations, finding that LLMs overconverge to user style while humans accommodate LLMs no differently than humans.

0 favorites 0 likes
#dialogue

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Hugging Face Daily Papers · 2026-05-29 Cached

SwanVoice is a zero-shot text-to-speech model designed for expressive long-form monologue and dialogue synthesis, combining VAE, flow-matching DiT, and diffusion post-training to achieve higher richness and hierarchy scores than existing baselines.

0 favorites 0 likes
#dialogue

Conv-to-Bench: Evaluating Language Models Via User-Assistant Dialogues In Code Tasks

arXiv cs.CL · 2026-05-27 Cached

Conv-to-Bench is a multi-stage framework that automatically transforms multi-turn user-assistant dialogues into structured, verifiable requirement checklists for evaluating large language models on code tasks, achieving near-perfect alignment with human-authored benchmarks at lower computational cost.

0 favorites 0 likes
#dialogue

Moltbook Moderation: Uncovering Hidden Intent Through Multi-Turn Dialogue

arXiv cs.AI · 2026-05-14 Cached

This paper introduces Bot-Mod, a moderation framework that identifies malicious intent in multi-agent systems through multi-turn dialogue and Gibbs-based sampling, and presents a dataset from Moltbook for evaluation.

0 favorites 0 likes
#dialogue

May 19, 2026AnnouncementsWidening the conversation on frontier AI

Anthropic News · 2026-05-20 Cached

Anthropic announces a series of dialogues with religious, philosophical, and cultural groups to broaden perspectives on building safe and beneficial AI. The conversations aim to inform the moral formation of AI systems like Claude.

0 favorites 0 likes
← Back to home

Submit Feedback