@a1zhang: Good harness designs can get around extreme token costs when information is structured. There's really no need to feed …

X AI KOLs Following 06/15/26, 09:10 PM News

token-cost harness-design structured-information rlm-agent efficiency ai-optimization context-management

Summary

A discussion on how harness designs can reduce token costs by structuring information instead of feeding everything into a language model's context, citing an example of an RLM agent processing many lines of logs with few active tokens.

Good harness designs can get around extreme token costs when information is structured. There's really no need to feed everything into a language model's context all the time. We've conflated naively throwing everything into context with bitter-lesson pilled scaling for too long. A good harness goes a long way!

Original Article

View Cached Full Text

Cached at: 06/15/26, 11:08 PM

Good harness designs can get around extreme token costs when information is structured. There’s really no need to feed everything into a language model’s context all the time.

We’ve conflated naively throwing everything into context with bitter-lesson pilled scaling for too long. A good harness goes a long way!

diego 🧞‍♂️ (@diblacksmith): My RLM agent can effortlessly process ~80k lines of service logs from CloudWatch

in a single go. that’s worth like 8 million tokens.

The cool part is, after 53 steps, it had spent only 32k “active” tokens* (not through the full 8MM yet atp, more like half).

That’s nothing for

Similar Articles

best of the best agentic harnesses do this…

Reddit r/AI_Agents

The author shares insights on building effective agent harnesses: the best ones minimize LLM reliance for trivial tasks and reserve LLMs for complex reasoning, distinguishing genuine harnesses from simple wrappers.

@rajistics: Token costs are climbing. How do you avoid being locked into a single vendor's harness? Built a demo showing how @OpenH…

X AI KOLs Following

A demo showing how OpenHands acts as a control plane across multiple agent harnesses like Claude Code, Gemini CLI, and OpenHands itself, enabling swapping models or vendors without rewriting orchestration.

@omarsar0: // Scaling Laws for Agent Harnesses // If you build agent harnesses, this one is worth your time. (bookmark it) Most ha…

X AI KOLs Following

New research on scaling laws for agent harnesses reveals that most token and tool call volume does not matter; the work introduces an effective approach.

@mfpiccolo: Kaffu's "rich man's toy" line is the one of the sharp thing I've read on harnesses this year. He's right about the symp…

X AI KOLs Timeline

The tweet discusses the problem of bloat in AI agent harnesses, agreeing with Kaffu's critique that harnesses become "rich man's toys," and advocates for a composable architecture of small, replaceable workers to reduce drift and keep systems cheap and debuggable.

@dair_ai: // State-Externalizing Harnesses // A new paradigm is emerging on how to effectively build agents and harnesses. If the…

X AI KOLs Following

Harness-1 introduces a state-externalizing harness that separates routine bookkeeping from policy decisions in search agents, enabling a 20B model to outperform larger frontier searchers across multiple benchmarks.

Similar Articles

best of the best agentic harnesses do this…

@rajistics: Token costs are climbing. How do you avoid being locked into a single vendor's harness? Built a demo showing how @OpenH…

@omarsar0: // Scaling Laws for Agent Harnesses // If you build agent harnesses, this one is worth your time. (bookmark it) Most ha…

@mfpiccolo: Kaffu's "rich man's toy" line is the one of the sharp thing I've read on harnesses this year. He's right about the symp…

@dair_ai: // State-Externalizing Harnesses // A new paradigm is emerging on how to effectively build agents and harnesses. If the…

Submit Feedback