Tag
This paper introduces AgingBench, a benchmark for measuring how deployed AI agents degrade over time due to memory state changes, interaction history, and lifecycle events. It categorizes aging into four mechanisms and provides diagnostic tools for targeted repairs.