Tag
The article argues that the primary failure point for AI agents in production is not the model itself, but the lack of infrastructure such as stop buttons, billing oversight, and traceability for tool calls.
An engineer describes how their AI sales agent confidently invoiced $0.00 because it misinterpreted a null discount field as 100% off, highlighting the difficulty of debugging agent workflows and the need for full execution tracing.
Introduces DeepEval's @observe decorator for component-level tracing and evaluation of LLM apps, enabling granular insight into retrievers, tools, and models.
Seven leading financial firms, including Jane Street, Goldman Sachs, and JP Morgan, have open-sourced key engineering tools on GitHub, such as magic-trace, gs-quant, and perspective, offering high-performance solutions for tracing, derivative pricing, and real-time market monitoring.
Motus Tracing is a fully open-source observability layer for AI agents that captures every model call, tool call, sandbox interaction, and error, providing a unified span model for local development and cloud deployment with zero setup cost.
A developer asks for recommendations for open-source alternatives to LangSmith for tracing, evaluations, and debugging agent workflows, citing restrictive paywalls.
Raindrop Workshop is an open-source local debugger for AI agents that provides live streaming traces of tokens, tool calls, and decisions, with integration for coding agents like Claude Code to enable self-healing eval loops and replay.
A new open source tool enables local debugging of AI agents by viewing traces, allowing automated eval writing and testing with tools like codex and Claude code.
LangSmith Engine transforms traces from passive records into a driver for recursive agent self-improvement, automatically identifying failures and suggesting code changes and evaluations.
Tracea is a new product offering Datadog-like observability for AI agents, providing features such as tracing, root cause analysis, and team memory.
VictoriaMetrics presented retroactive sampling at KubeCon EU 2026, a new method that significantly reduces traffic, CPU, and memory overhead compared to traditional tail sampling in OpenTelemetry pipelines.
magic-trace is a high-resolution tracing tool using Intel Processor Trace to capture full control flow snapshots of Linux processes with low overhead, useful for debugging and performance analysis.
Langfuse open-sources its LLM engineering platform to offer self-hosted tracing, analytics, and evaluation tools for production AI applications.