Need brutal feedback: I built a recorder for AI agent runs

Reddit r/AI_Agents Tools

Summary

A developer built agentproof-recorder to record AI agent runs and detect rule violations, seeking feedback on whether this is a common pain point.

I have been using AI coding agents more seriously lately, and one thing started annoying me: I needed something to control their access to sensitive material.

High on caffeine, I went to Codex and built agentproof-recorder. It records an agent run and checks whether the agent violated the rules during execution.

I am not sure yet whether this is a real pain for others or just something I personally got annoyed by while working with agents. So I want brutal feedback: Are you facing this while using agents, and how are you solving it? Would a tool like this be useful?
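
To make the "record a run, then check it against rules" idea concrete, here is a rough sketch of the general pattern. It is not agentproof-recorder's actual code or API: the `Event` and `Recorder` names, the glob-style deny patterns, and the sample paths are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatch


@dataclass
class Event:
    """One recorded agent action (hypothetical shape, not the tool's format)."""
    action: str  # e.g. "read_file", "write_file", "run_command"
    target: str  # the path or command the action touched


@dataclass
class Recorder:
    """Collects events during a run and checks them against deny patterns."""
    denied_patterns: list[str]
    events: list[Event] = field(default_factory=list)

    def record(self, action: str, target: str) -> None:
        # Append every action so the full run can be audited afterwards.
        self.events.append(Event(action, target))

    def violations(self) -> list[Event]:
        # An event is a violation if its target matches any deny pattern.
        return [
            e for e in self.events
            if any(fnmatch(e.target, p) for p in self.denied_patterns)
        ]


# Assumed rules: the agent must not touch .env files or anything under secrets/.
recorder = Recorder(denied_patterns=["*.env", "secrets/*"])
recorder.record("read_file", "src/main.py")
recorder.record("read_file", "secrets/api_key.txt")
recorder.record("read_file", ".env")

for v in recorder.violations():
    print(f"violation: {v.action} {v.target}")
```

This only audits after the fact. Blocking an action before it happens would need a hook inside the agent's tool-call path, which is a different design.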
