I got tired of W&B and Langfuse for debugging agents, so I built my own tracer looking for feedback
Summary
Built a new tracer for debugging AI agents that auto-detects loops, logs sessions as readable timelines, and allows side-by-side comparison. Seeking feedback.
Similar Articles
I'm tired of manually debugging traces
A developer builds a debugging tool for AI agents that compares replays against reference runs to identify where behavior first drifted, expressing frustration with manual trace debugging.
@benhylak: we built the first sane way to debug your agent locally. you can see your traces. codex/claude code can too. this lets …
A new open source tool enables local debugging of AI agents by viewing traces, allowing automated eval writing and testing with tools like codex and Claude code.
Need brutal feedback: I built a recorder for AI agent runs
A developer built agentproof-recorder to record AI agent runs and detect rule violations, seeking feedback on whether this is a common pain point.
Agent workflow visualizer: feedback and corrections
A tool for visualizing AI agent workflows is introduced, supporting multiple agent frameworks including Langgraph, CrewAI, AutoGen, Google ADK, and OpenAI Agents SDK. The creator seeks community feedback and corrections.
Need some feedback on this OSS tool called agentshark
The author introduces agentshark, an open-source tool for introspecting agent interactions, debugging tool-calling flows, and sharing session traces with teammates.