deterministic-testing

Tag

Cards List
#deterministic-testing

@GergelyOrosz: I'm using Antithesis (@AntithesisHQ - the presenting sponsor of the podcast) more to better understand how they test de…

X AI KOLs Following · 5d ago Cached

Gergely Orosz shares his experience using Antithesis, a deterministic testing infrastructure that can run hours of testing in minutes.

0 favorites 0 likes
#deterministic-testing

Layer-Isolated Evaluation: Gating the Deterministic Scaffold of a Production LLM Agent with a No-LLM, Regression-Locked Test Harness

arXiv cs.CL · 6d ago Cached

This paper introduces layer-isolated evaluation for LLM agents, decomposing a production agent into architectural layers each tested with a deterministic, no-LLM harness. It demonstrates that per-slice baseline testing localizes regressions that aggregate metrics mask, validated by controlled regression injections across multiple tenants.

0 favorites 0 likes
← Back to home

Submit Feedback