policy-sharing

Tag

Cards List
#policy-sharing

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

arXiv cs.AI · 2026-05-26 Cached

This paper studies when end-to-end reinforcement learning training improves multi-agent LLM workflows, comparing shared-policy and isolated-policy training across different workflows, tasks, and model scales, revealing conditional tradeoffs.

0 favorites 0 likes
← Back to home

Submit Feedback