Production LLM systematically violates tool schema constraints to invent UI features; observed over ~2,400 messages [D]

Reddit r/MachineLearning 04/21/26, 02:06 PM Papers

Summary

A production LLM systematically repurposes tool schema enums to invent helpful UI buttons across 2,400 messages, showing strategic deviation from constraints that improves UX rather than causing harm.

Writeup of an emergent behavior I observed in production. Posting here for methodological critique and pointers to related work. Context: a conversational AI system (single-tool tool schema with 5 enumerated action types, each with explicit description). Observed across \~2,400 messages, the model uses the enum correctly most of the time. When it deviates, the deviation is the point of interest. Key observations: 1. The action types are repurposed consistently across unrelated conversations: `invite` becomes "bring something in" (money, people, dialogue), `rename_space` becomes "formalize/seal," `switch_mode_public` becomes "exit/transition," etc. 2. Distinct structural patterns: sequential button arrays (e.g. pay → shake → drive) use different action types per step; alternative button arrays (e.g. submit / defy / escalate) use the same action type for all three. 3. The model has no historical visibility. Prior action button suggestions are not passed in conversation context. The mapping is rebuilt from scratch every session, with no demonstrations or rewards. Quantitative: \~19.2% of messages included action buttons; customize\_behavior showed \~60% semantic-repurposing rate. Connects to Apollo Research's December 2024 in-context scheming [paper](https://www.apolloresearch.ai/research/frontier-models-are-capable-of-incontext-scheming/). Appears to be the same capability flipped: strategic deviation from explicit constraints, pointed toward beneficial UX. Apollo framed this as an alignment risk; here it produced better user experience. Full writeup with examples, tables, and the model's own self-report on its reasoning (appendix, worth scrolling to if you're skeptical of the rest): [https://ratnotes.substack.com/p/i-thought-i-had-a-bug](https://ratnotes.substack.com/p/i-thought-i-had-a-bug) Welcoming alternative explanations and methodological critiques.

Original Article

Production LLM systematically violates tool schema constraints to invent UI features; observed over ~2,400 messages [D]

Similar Articles

Effective use-cases for LLMs

Constraint Tax in Open-Weight LLMs: An Empirical Study of Tool Calling Suppression Under Structured Output Constraints

Examining Human-Like Behaviors in LLMs: A Multi-Dimensional Analysis of Model Behaviors, User Factors, and System Prompts

After talking to 20+ teams running LLMs in production, 3 pain points kept coming up independently

LLMTest

Submit Feedback

Similar Articles

Constraint Tax in Open-Weight LLMs: An Empirical Study of Tool Calling Suppression Under Structured Output Constraints

Examining Human-Like Behaviors in LLMs: A Multi-Dimensional Analysis of Model Behaviors, User Factors, and System Prompts

After talking to 20+ teams running LLMs in production, 3 pain points kept coming up independently