@AdamRLucek: Do agents listen to you… or themselves? While evaling subagent behavior in deep agent systems, we noticed an interestin…

X AI KOLs Following 05/21/26, 03:08 PM Papers

agents subagents alignment system-prompts orchestration behavior-evaluation

Summary

A researcher shares an observation from evaluating subagent behavior in deep agent systems: detailed instructions written by the orchestrator can override the looser, hand-written subagent system prompt and hurt end performance.

Original Article

Cached at: 05/21/26, 07:37 PM

Do agents listen to you… or themselves? While evaling subagent behavior in deep agent systems, we noticed an interesting quirk in our agents’ alignment with hand-written system prompts vs. the instructions given by the orchestrator 1/4

On a ‘needle in a haystack’ style classification eval, where a main agent relies on multiple subagents to parse through many large (million+ token) datapoints and cluster them into related groups, we saw varying performance and behavior changes depending on the length and specificity of additional instructions sent to the subagent by the orchestrator 2/4

While our subagent system prompt was generally directional and open-ended, some models provided detailed rubrics and guidelines that resulted in wayyyy too strict behavior and limited the subagent’s creative execution, hurting end performance. These larger briefs from the agent often directionally overrode the looser behavior we wanted to encourage from our prompting 3/4

The takeaway? It’s important to consider and measure not just how you are prompting a subagent, but how your primary agent is prompting it too. The relationship an agent has with its subagent delegations can make or break the overall system’s success 4/4
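One way to start measuring this is to log every brief the orchestrator sends to a subagent next to the subagent's eval score, then compare scores for short versus long briefs. The sketch below is a minimal illustration, not the setup used in the thread: the `Delegation` record, the word-count ratio, the directive marker list, and the threshold of 1.0 are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from statistics import mean

# Hypothetical record of one orchestrator -> subagent delegation.
@dataclass
class Delegation:
    system_prompt: str  # hand-written subagent system prompt
    brief: str          # extra instructions written by the orchestrator
    score: float        # eval score for this delegation's result

def brief_ratio(d: Delegation) -> float:
    """Word count of the brief relative to the system prompt."""
    return len(d.brief.split()) / max(1, len(d.system_prompt.split()))

def directive_count(text: str,
                    markers=("must", "never", "always", "exactly", "only")) -> int:
    """Rough proxy for strictness: count of hard-constraint words (assumed list)."""
    words = [w.strip(".,;:!?") for w in text.lower().split()]
    return sum(words.count(m) for m in markers)

def summarize(delegations, ratio_threshold: float = 1.0) -> dict:
    """Mean score for briefs at or below the ratio threshold vs. above it."""
    short = [d.score for d in delegations if brief_ratio(d) <= ratio_threshold]
    long_ = [d.score for d in delegations if brief_ratio(d) > ratio_threshold]
    return {
        "short": mean(short) if short else None,
        "long": mean(long_) if long_ else None,
    }

SYSTEM = "Group related datapoints. Use your judgment."
runs = [
    Delegation(SYSTEM, "Cluster these files.", 0.9),
    Delegation(SYSTEM, "You must always use exactly three clusters and never merge groups.", 0.6),
]
report = summarize(runs)
```

The scores in `runs` are made-up placeholders. A gap between `report["short"]` and `report["long"]` on real data would only be a hint that orchestrator briefs are overriding the system prompt, and would be worth checking per model.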

On the money! Directional over exact

Maybe we’ve hit AGI already…

Any tips?

Similar Articles

Quoting Andreas Påhlsson-Notini

been experimenting with custom agents, and the interesting part isn't task completion — it's what changes when they have memory

Anyone else feel like AI agents are amazing right up until things get complicated?

A right answer from your agent doesn't mean it did the right thing

@Vtrivedy10: my fave point from here: the earlier you think about your agent as a system that can be measured & improved, the faster…
