multi-turn-reasoning

#multi-turn-reasoning

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

Hugging Face Daily Papers · 2026-05-29

SpatialAct is a new simulator-grounded benchmark that probes whether VLM agents can perform coherent spatial reasoning and translate it into actions in 3D environments across multi-turn feedback settings. Experiments reveal a significant reasoning-to-action gap, with current VLMs struggling to maintain spatial beliefs and produce reliable actions despite performing well on isolated reasoning tasks.


Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning

arXiv cs.AI · 2026-05-26

This paper introduces satisfiable drift, a failure mode in which multi-turn reasoning systems silently violate prior commitments while remaining internally consistent. The authors present DRIFT-Bench, a benchmark of 816 problems, and find that drift dominates outright contradiction: after repair, 98-100% of residual errors are drift errors.


MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs

arXiv cs.CL · 2026-05-11

This paper introduces MedAction, a framework for training LLMs on active, multi-turn clinical diagnosis by simulating iterative test ordering and hypothesis updates. It presents a new dataset, MedAction-32K, and demonstrates state-of-the-art performance for open-source models on medical benchmarks.
