constraint-optimization

Tag

#constraint-optimization

Anchor: Mitigating Artifact Drift in Agent Benchmark Generation

arXiv cs.AI · 2026-05-27

Anchor is a task-generation pipeline that addresses artifact drift in AI agent benchmarks by jointly producing instructions, environments, solutions, and verifiers from a single constraint optimization specification, yielding consistent and auditable evaluation tasks for enterprise workflows. The paper introduces ERP-Bench, a benchmark of 300 long-horizon tasks in a production ERP system, showing that frontier models satisfy explicit constraints in 26.1% of trials but reach optimal solutions in only 17.4%.

Transforming Constraint Programs to Input for Local Search

arXiv cs.AI · 2026-05-20

This paper presents a method to automatically generate local search neighborhoods from constraint specifications using symmetry properties, evaluated on six optimization problems.
