agent-testing

#agent-testing

How do you actually test an agent harness when half of it is non-deterministic?

Reddit r/AI_Agents ↗ · yesterday

A discussion of how to test AI agent harnesses that include non-deterministic components. It covers approaches such as golden output diffing and using an LLM as a judge, and questions how valid those methods are.
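
As a rough illustration of the golden output diffing mentioned in this summary, the sketch below compares a recorded harness trace against a stored "golden" trace after masking fields that change on every run. The function names `normalize` and `golden_diff`, the trace format, and the choice to mask ISO timestamps and UUIDs are all assumptions made for this example; none of them come from the linked thread. It only covers the deterministic part of a harness, which is one reason the thread questions how far this method goes.

```python
import difflib
import re

# Hypothetical patterns for values that vary between runs (assumed trace format).
TIMESTAMP = re.compile(r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?Z?")
UUID = re.compile(
    r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b"
)


def normalize(text: str) -> str:
    """Replace run-specific values with placeholders before comparing."""
    text = TIMESTAMP.sub("<TS>", text)
    text = UUID.sub("<UUID>", text)
    return text


def golden_diff(golden: str, actual: str) -> list[str]:
    """Return unified-diff lines between normalized traces; empty means a match."""
    return list(
        difflib.unified_diff(
            normalize(golden).splitlines(),
            normalize(actual).splitlines(),
            fromfile="golden",
            tofile="actual",
            lineterm="",
        )
    )


golden = "tool_call search at 2024-01-01T00:00:00Z\nresult ok"
same_run = "tool_call search at 2025-06-02T12:30:45Z\nresult ok"
changed_run = "tool_call fetch at 2025-06-02T12:30:45Z\nresult ok"

print(golden_diff(golden, same_run))            # no differences after masking
print("\n".join(golden_diff(golden, changed_run)))
```

A test would fail whenever `golden_diff` returns a non-empty list. Free-form model text would still differ between runs, so a check like this fits tool-call sequences and structured state better than generated prose.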

#agent-testing

I need a model that gets stuck in loops.

Reddit r/LocalLLaMA ↗ · 4d ago

A developer seeks a model that frequently gets stuck in loops (e.g., GLM Flash) to test loop detection and recovery features for an agent, aiming to develop heuristics that score loop probability and enable backtracking.
