compositional-reasoning

Tag

Cards List
#compositional-reasoning

R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search

arXiv cs.AI · 3d ago Cached

R-APS (Reflective Adversarial Pareto Search) is a novel method for constrained design tasks that addresses three structural failures in LLM-based agentic systems—error propagation, robustness evaluation, and knowledge invalidation—through reasoning-mode decomposition across three timescales, requiring no fine-tuning. Evaluated on planar mechanism synthesis, it achieves 3.5x tighter robustness certificates, 46% faster iterations-to-first-admission, and 2.1x Chamfer-distance reduction over baselines.

0 favorites 0 likes
#compositional-reasoning

MAVEN: Improving Generalization in Agentic Tool Calling

arXiv cs.AI · 6d ago Cached

MAVEN is a lightweight symbolic reasoning scaffold that improves generalization in agentic tool calling by using modular verification and adaptive tool orchestration. It achieves significant accuracy gains on a new stress-test benchmark (MAVEN-Bench) and remains competitive with proprietary models at a fraction of the cost.

0 favorites 0 likes
#compositional-reasoning

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

arXiv cs.AI · 2026-05-27 Cached

This paper introduces 'composition collapse', a phenomenon where language models with stable factual knowledge still fail to compose that knowledge into correct multi-hop reasoning, and proposes a double-gate protocol to isolate composition failure from atomic knowledge instability.

0 favorites 0 likes
#compositional-reasoning

Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning

arXiv cs.LG · 2026-05-08 Cached

This research paper investigates how shortcut solutions learned by Transformer models, specifically BERT, impair their ability to perform continual compositional reasoning. It contrasts BERT with ALBERT, finding that ALBERT's recurrent nature offers better inductive bias for continual learning tasks.

0 favorites 0 likes
#compositional-reasoning

The Amazing Agent Race: Strong Tool Users, Weak Navigators

arXiv cs.CL · 2026-04-20 Cached

The Amazing Agent Race (AAR) introduces a new benchmark with 1,400 directed acyclic graph (DAG) puzzle instances to evaluate LLM agents on fork-merge tool chains and Wikipedia navigation. Evaluations reveal agents excel at tool-use (errors <17%) but struggle with navigation (27-52% of failures), exposing a critical gap invisible to existing linear benchmarks.

0 favorites 0 likes
#compositional-reasoning

Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding

Hugging Face Daily Papers · 2026-04-14 Cached

Proposes Slipform, a training framework that uses lexical concreteness to select harder negatives and a margin-based Cement loss, boosting compositional reasoning in vision-language models.

0 favorites 0 likes
← Back to home

Submit Feedback