Tag
Setting temperature to 0 does not guarantee deterministic tool calls in agents due to batched inference causing floating-point reduction order shifts, leading to token flips and different actions under load.
Design Arena announces Agentic Game Development, a new multi-file, multi-turn evaluation with access to asset catalogs, built-in libraries like Howler and Tween.js, and expanded tool calls for sprite generation and asset discovery.
Cursor has released an auto-review mode that allows agents to execute tool calls with fewer approval prompts while maintaining safety.
The author shares lessons from instrumenting AI agent tool calls, revealing that tools like web_search can account for ~50% of spend, and highlighting the importance of tracking p95 latency and attributing costs per workflow or customer to avoid surprises.
LLC, a lightweight alternative to OpenWebUI for running local LLMs, updates to v0.6 with a chat converter to import OpenWebUI history and custom tool calls for user-defined tools.
A user running multiple agents reports that after upgrading to GPT-5.5, the model suddenly became less capable at executing tool calls and more prone to giving suggestions instead of acting, speculating OpenAI may be throttling for load management.