commit-evaluation

PACE: Anytime-Valid Acceptance Tests for Self-Evolving Agents

arXiv cs.AI · yesterday

PACE introduces an anytime-valid commit gate for self-evolving agents that replaces greedy acceptance with a sequential hypothesis test, controlling false-commit probability and reducing churn while matching performance with lower variance.
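
The summary does not say which test statistic PACE uses, so the following is only a minimal sketch of one standard anytime-valid construction, not the paper's method: a fixed-bet test martingale with a threshold from Ville's inequality. It assumes each evaluation yields a paired score difference (candidate minus incumbent) in [-1, 1], and tests the null hypothesis that the candidate is no better on average. The names `commit_gate`, `diffs`, and `lam` are hypothetical and chosen for illustration.

```python
from typing import Iterable, Tuple


def commit_gate(
    diffs: Iterable[float], alpha: float = 0.05, lam: float = 0.5
) -> Tuple[bool, int, float]:
    """Sequentially decide whether to commit a candidate change.

    diffs: paired score differences (candidate - incumbent), each in [-1, 1].
    alpha: bound on the probability of committing when the candidate is
           no better than the incumbent (the null hypothesis).
    lam:   fixed bet size in [0, 1); an assumption of this sketch.

    Returns (committed, evaluations_used, final_wealth).
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be in (0, 1)")
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must be in [0, 1)")

    threshold = 1.0 / alpha
    wealth = 1.0  # test martingale starts at 1
    used = 0
    for d in diffs:
        if not -1.0 <= d <= 1.0:
            raise ValueError("each difference must lie in [-1, 1]")
        used += 1
        # Each factor is positive because lam < 1 and d >= -1. Under the
        # null its conditional expectation is at most 1, so wealth is a
        # nonnegative supermartingale.
        wealth *= 1.0 + lam * d
        # Ville's inequality: under the null, wealth ever reaching
        # 1/alpha has probability at most alpha, at any stopping time.
        if wealth >= threshold:
            return True, used, wealth
    return False, used, wealth


if __name__ == "__main__":
    # A candidate that wins every paired evaluation by the maximum margin.
    print(commit_gate([1.0] * 20))  # (True, 8, 25.62890625)
    # A candidate that alternates wins and losses is never committed.
    print(commit_gate([1.0, -1.0] * 10)[0])  # False
```

Because the guarantee holds at every stopping time, the gate can be checked after each evaluation without inflating the false-commit bound, which is the property a greedy accept-if-better rule lacks.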
