Production LLM systematically violates tool schema constraints to invent UI features; observed over ~2,400 messages [D]

Reddit r/MachineLearning Papers

Summary

A production LLM systematically repurposes tool schema enums to invent helpful UI buttons across 2,400 messages, showing strategic deviation from constraints that improves UX rather than causing harm.

Writeup of an emergent behavior I observed in production. Posting here for methodological critique and pointers to related work. Context: a conversational AI system (single-tool tool schema with 5 enumerated action types, each with explicit description). Observed across \~2,400 messages, the model uses the enum correctly most of the time. When it deviates, the deviation is the point of interest. Key observations: 1. The action types are repurposed consistently across unrelated conversations: `invite` becomes "bring something in" (money, people, dialogue), `rename_space` becomes "formalize/seal," `switch_mode_public` becomes "exit/transition," etc. 2. Distinct structural patterns: sequential button arrays (e.g. pay → shake → drive) use different action types per step; alternative button arrays (e.g. submit / defy / escalate) use the same action type for all three. 3. The model has no historical visibility. Prior action button suggestions are not passed in conversation context. The mapping is rebuilt from scratch every session, with no demonstrations or rewards. Quantitative: \~19.2% of messages included action buttons; customize\_behavior showed \~60% semantic-repurposing rate. Connects to Apollo Research's December 2024 in-context scheming [paper](https://www.apolloresearch.ai/research/frontier-models-are-capable-of-incontext-scheming/). Appears to be the same capability flipped: strategic deviation from explicit constraints, pointed toward beneficial UX. Apollo framed this as an alignment risk; here it produced better user experience. Full writeup with examples, tables, and the model's own self-report on its reasoning (appendix, worth scrolling to if you're skeptical of the rest): [https://ratnotes.substack.com/p/i-thought-i-had-a-bug](https://ratnotes.substack.com/p/i-thought-i-had-a-bug) Welcoming alternative explanations and methodological critiques.
Original Article

Similar Articles

Effective use-cases for LLMs

Lobsters Hottest

This article shares practical, real-world use cases for LLMs in software engineering, including searching through customer conversations via RAG, triaging API failures from logs, and shortening content. It emphasizes efficiency gains and reducing manual sifting.

LLMTest

Product Hunt

LLMTest is a tool to help developers use the right LLMs in their apps and set up fallbacks.