FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse
Summary
FlowBank is a three-stage framework (diversifying, curating, matching) for optimizing agentic workflows in LLM-based multi-agent systems. It precomputes a compact bank of complementary, reusable workflows and routes each query to the member with the best predicted utility. Across five benchmarks it achieves the highest average score among the evaluated methods while remaining cost-competitive.
# FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse

Source: [https://arxiv.org/abs/2606.11290](https://arxiv.org/abs/2606.11290) [View PDF](https://arxiv.org/pdf/2606.11290)

> Abstract: Large Language Model (LLM)-based multi-agent systems are increasingly powerful, but current agentic workflow optimization paradigms make an unsatisfying trade-off. Task-level methods spend substantial offline compute yet deploy only a single workflow, leaving complementary candidates unused, while query-level methods synthesize a new workflow per query at substantial inference cost. Our motivating analysis shows these paradigms are more complementary than competing: workflows discovered during offline search often solve different subsets of queries, and many queries handled by expensive query-level generation can already be solved by cheaper precomputed workflows. This suggests a different objective: rather than searching for one universally best workflow or regenerating one per instance, we should build a compact bank of reusable, complementary workflows and select among them adaptively at inference time. Doing so requires solving three coupled problems: generating complementary rather than redundant candidates, compressing them into a small deployable portfolio, and assigning each query to the right workflow under a performance-cost trade-off.
>
> To this end, we present FlowBank, a three-stage framework for portfolio-based agentic workflow optimization. Diversifying proposes DiverseFlow to steer search toward under-covered queries and produce a high-coverage candidate pool. Curating proposes CuraFlow to compress this pool into a compact portfolio with minimal redundancy. Matching casts deployment as edge-value prediction on a query-workflow bipartite graph and routes each incoming query to the portfolio member with the best predicted utility. Across five benchmarks, FlowBank achieves the highest average score among the evaluated methods while remaining cost-competitive, improving over the strongest automated and handcrafted baselines by 4.26% and 14.92% relative, respectively.

## Submission history

From: Lingzhi Yuan

**[v1]** Tue, 9 Jun 2026 17:58:21 UTC (545 KB)
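To make the curating and matching ideas from the abstract concrete, here is a minimal Python sketch. It is not the paper's method: the abstract does not specify how CuraFlow or the edge-value predictor work, so this sketch substitutes a plain greedy max-coverage selection for curating and a linear `score - lam * cost` utility for matching. The function names, the `lam` trade-off parameter, and the toy data are all hypothetical.

```python
from typing import Dict, List, Set


def curate_portfolio(solved: Dict[str, Set[str]], k: int) -> List[str]:
    """Greedy max-coverage stand-in for curating (assumption, not CuraFlow).

    `solved` maps a workflow name to the set of queries it solves.
    Repeatedly pick the workflow that adds the most not-yet-covered
    queries, stopping at k members or when nothing new is gained.
    """
    covered: Set[str] = set()
    portfolio: List[str] = []
    remaining = dict(solved)
    while remaining and len(portfolio) < k:
        # sorted() makes tie-breaking deterministic (alphabetical).
        best = max(sorted(remaining), key=lambda w: len(remaining[w] - covered))
        if not remaining[best] - covered:
            break  # every remaining workflow is redundant
        portfolio.append(best)
        covered |= remaining.pop(best)
    return portfolio


def route(pred_score: Dict[str, float], cost: Dict[str, float], lam: float) -> str:
    """Pick the workflow with the best predicted utility for one query.

    Utility is assumed to be predicted score minus lam times cost; in the
    paper the per-edge value comes from a learned predictor instead.
    """
    return max(sorted(pred_score), key=lambda w: pred_score[w] - lam * cost[w])


# Toy data: wf_b is redundant with wf_a, wf_c covers a query nobody else does.
solved = {
    "wf_a": {"q1", "q2", "q3"},
    "wf_b": {"q1", "q2"},
    "wf_c": {"q4"},
}
portfolio = curate_portfolio(solved, k=2)

# Hypothetical per-query predictions for the curated portfolio.
pred_score = {"wf_a": 0.9, "wf_c": 0.8}
cost = {"wf_a": 1.0, "wf_c": 0.2}
choice_quality_only = route(pred_score, cost, lam=0.0)
choice_cost_aware = route(pred_score, cost, lam=0.5)
```

With this toy data the redundant `wf_b` is left out of the portfolio, and raising `lam` shifts the routing decision from the higher-scoring `wf_a` to the cheaper `wf_c`, which is the performance-cost trade-off the abstract refers to.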
Similar Articles
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
This paper analyzes tradeoffs between latency, reliability, and cost in LLM-enabled agentic workflows, introducing performance models and deriving optimal resource allocation policies like water-filling token allocation.
AgentCo-op: Retrieval-Based Synthesis of Interoperable Multi-Agent Workflows
AgentCo-op is a retrieval-based synthesis framework for composing interoperable multi-agent workflows from reusable skills, tools, and external agents. It uses typed artifact handoffs and bounded self-guided local repair, achieving strong results on benchmarks and enabling collaborative discovery in open-world genomics tasks.
Tools as Continuous Flow for Evolving Agentic Reasoning
This paper introduces FlowAgent, a novel framework that reconceptualizes tool chaining as continuous trajectory generation using conditional flow matching to improve robustness in long-horizon agentic reasoning.
@AdamRLucek: I'm bullish on agent swarms (aka workflows). Agents are increasingly being used to analyze and collate massive amounts …
The author discusses the growing use of agent swarms/workflows for processing unstructured data at scale, noting that execution reliability drops significantly when more than 30 sub-agents are deployed in parallel, and teases a solution for combining intelligent decision-making with reliable task execution.
@hooeem: https://x.com/hooeem/status/2062266452921491934
A guide explaining how to make agentic workflows up to 462x cheaper by compiling fixed procedures into smaller fine-tuned models instead of repeatedly prompting frontier models.