Tag
Claw-SWE-Bench is a new benchmark and adapter protocol that standardizes the evaluation conditions for comparing diverse coding agents on SWE-bench-style tasks. It shows that adapter design significantly affects both performance and cost.