@rajistics: Token costs are climbing. How do you avoid being locked into a single vendor's harness? Built a demo showing how @OpenH…

X AI KOLs Following 05/14/26, 03:34 PM Tools

Summary

This demo shows how OpenHands acts as a control plane across multiple agent harnesses, such as Claude Code, Gemini CLI, and OpenHands itself, so you can swap models or vendors without rewriting the orchestration.

Token costs are climbing. How do you avoid being locked into a single vendor's harness? Built a demo showing how @OpenHands acts as a control plane across agent harnesses. Manage multiple harnesses from one place, swap models or vendors without rewriting your orchestration. https://github.com/rajshah4/openhands-multi-agent-demo…

Cached at: 05/15/26, 04:59 AM

rajshah4/openhands-multi-agent-demo

Source: https://github.com/rajshah4/openhands-multi-agent-demo

How OpenHands Orchestrates Multiple Agents

OpenHands lets you choose how agents share state, how they are isolated, and how the workflow is orchestrated.

Figure: OpenHands control plane for multiple agent harnesses

The diagram above shows the control-plane view: OpenHands coordinates the workflow while different harnesses and runtime models sit underneath it.

Why Multi-Agent Orchestration?

An agent harness wraps a model with tools, context, and execution — Claude Code, Gemini CLI, and OpenHands are all harnesses. Each has different strengths: Claude Code for implementation, Gemini CLI for fast test generation, OpenHands for code review with its own agent framework.

This repo treats OpenHands as the orchestration layer, or control plane, around those harnesses. The key idea is that the workflow is separate from the runtime: the same implement → test → review pipeline can run with different harnesses, with different state-sharing models, and with different isolation strategies.

The point is not that you must use three vendors. The point is that you can compose heterogeneous agent systems while keeping the workflow itself stable.
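One way to picture the separation of workflow from runtime is a small interface that every harness satisfies. The sketch below is illustrative only: `Harness`, `EchoHarness`, and `run_pipeline` are hypothetical names, not part of this repo or the OpenHands SDK, and the stand-in harness records what it was asked to do instead of calling a model.

```python
from dataclasses import dataclass
from typing import Dict, Protocol


class Harness(Protocol):
    """Hypothetical interface: anything that can run one phase of the workflow."""

    name: str

    def run(self, phase: str, context: str) -> str: ...


@dataclass
class EchoHarness:
    """Stand-in harness that appends a label instead of calling a real agent."""

    name: str

    def run(self, phase: str, context: str) -> str:
        return f"{context}[{phase} by {self.name}]"


# The workflow is fixed; only the harness assigned to each phase changes.
PHASES = ("implement", "test", "review")


def run_pipeline(harnesses: Dict[str, Harness], spec: str) -> str:
    """Run implement -> test -> review with whatever harnesses are supplied."""
    context = spec
    for phase in PHASES:
        context = harnesses[phase].run(phase, context)
    return context


mixed = {
    "implement": EchoHarness("Claude Code"),
    "test": EchoHarness("Gemini CLI"),
    "review": EchoHarness("OpenHands"),
}
single_vendor = {phase: EchoHarness("OpenHands") for phase in PHASES}

print(run_pipeline(mixed, "spec"))
print(run_pipeline(single_vendor, "spec"))
```

Both calls go through the same `run_pipeline`; swapping vendors only changes the dictionary passed in.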

The Pipeline

Every demo in this repo runs the same three-phase pipeline:

Phase	Default Harness	What it does
Implement	Claude Code (Anthropic)	Writes the code from a spec
Test	Gemini CLI (Google)	Reads the code and adds pytest coverage
Review	OpenHands	Reviews everything, reports findings with severity

You can swap harnesses within the pipeline, for example to use OpenHands for all phases, or move the same workflow between shared workspaces, isolated local clones, and managed cloud sandboxes.

Three Patterns for Multi-Agent Orchestration

This repo demonstrates three architectural patterns for running multiple agents. They produce the same output but differ in isolation, complexity, and infrastructure.

Figure: Three orchestration patterns for the same multi-agent workflow

📖 Read the full patterns guide for detailed architecture explanations, decision trees, and migration paths.

Pattern Comparison

	Pattern 1: Easy	Pattern 2: Isolated Local	Pattern 3: Enterprise
Script	`shared_workspace.py`	`multi_server_isolation.py`	`cloud_conversations.py`
Sandboxes	1 shared	N isolated (manual)	N isolated (automatic)
Local runtime shape	1 shared workspace	N isolated clones	Enterprise-managed
Coordination	Filesystem	Git (you orchestrate)	Git (Enterprise orchestrates)
Code complexity	Low	High	Medium
Infrastructure	None	Manual server management	Automatic provisioning
Observability	Terminal logs	Terminal logs	Web UI per agent

When to Use Each Pattern

Pattern 1 (Easy) — Agents share a workspace, simple code

✅ Quick local development
✅ Agents collaborate on same files
✅ Minimal infrastructure
❌ No isolation between agents

Pattern 2 (Isolated Local) — Full isolation, manual orchestration

✅ Complete isolation without Cloud
✅ Air-gapped environments
✅ Real local verification with pytest
❌ You manage git coordination and retry logic
❌ More complex orchestration code

Pattern 3 (Enterprise) — Full isolation, automatic orchestration

✅ Isolation + simple code
✅ Automatic sandbox provisioning
✅ Web UI for each agent
❌ Requires internet and Enterprise API key
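The checklists above can be condensed into a rough selection helper. This is a simplified sketch, not the decision tree from the patterns guide: `choose_pattern` and its two boolean inputs are hypothetical, and real projects will weigh more factors (observability, team size, retry needs).

```python
def choose_pattern(needs_isolation: bool, cloud_available: bool) -> str:
    """Return the script for the pattern that fits two coarse constraints.

    cloud_available means both internet access and a Cloud/Enterprise API key.
    """
    if not needs_isolation:
        return "shared_workspace.py"        # Pattern 1: one shared workspace
    if cloud_available:
        return "cloud_conversations.py"     # Pattern 3: platform-managed sandboxes
    return "multi_server_isolation.py"      # Pattern 2: isolated local clones


# An air-gapped team that needs isolation lands on Pattern 2.
print(choose_pattern(needs_isolation=True, cloud_available=False))
```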

Pattern 1: Easy — Single Agent-Server (`shared_workspace.py`)

All agents run in a single shared workspace using the OpenHands SDK. Claude Code and Gemini CLI connect as subprocesses via ACP (Agent Client Protocol).

shared_workspace.py (your laptop)
│
└─► Single Agent-Server (one workspace)
     ├─ Agent 1 [Claude Code]  → writes shortener.py
     ├─ Agent 2 [Gemini CLI]   → writes test_shortener.py
     └─ Agent 3 [OpenHands]    → reviews all files
        
        All share /workspace/project ✅

Architecture: One sandbox, agents coordinate via shared filesystem.

Best for: Quick local development, tight collaboration, minimal infrastructure.
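To make "coordinate via shared filesystem" concrete, here is a minimal sketch in which three plain functions stand in for the three agents. Nothing here uses the OpenHands SDK or ACP; the function names and file contents are assumptions chosen to mirror the diagram above.

```python
import tempfile
from pathlib import Path
from typing import List


def implement(workspace: Path) -> None:
    """Stand-in for the implementation agent: writes the module."""
    (workspace / "shortener.py").write_text(
        "def shorten(url):\n    return url[:8]\n"
    )


def write_tests(workspace: Path) -> None:
    """Stand-in for the test agent: reads the module, then writes a test beside it."""
    source = (workspace / "shortener.py").read_text()
    if "def shorten" not in source:
        raise RuntimeError("implementation phase did not produce shorten()")
    (workspace / "test_shortener.py").write_text(
        "from shortener import shorten\n\n"
        "def test_shorten():\n"
        "    assert len(shorten('https://example.com')) == 8\n"
    )


def review(workspace: Path) -> List[str]:
    """Stand-in for the reviewer: sees every file earlier phases left behind."""
    return sorted(p.name for p in workspace.glob("*.py"))


with tempfile.TemporaryDirectory() as tmp:
    workspace = Path(tmp)  # the single directory all phases share
    implement(workspace)
    write_tests(workspace)
    reviewed = review(workspace)

print(reviewed)
```

Because every phase reads and writes the same directory, no handoff step is needed. The flip side is that any phase can overwrite another's files.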

Setup and Run

git clone https://github.com/rajshah4/openhands-multi-agent-demo.git
cd openhands-multi-agent-demo

pip install openhands-sdk openhands-tools
export LLM_API_KEY="your-key"
export ANTHROPIC_API_KEY="your-key"
export GEMINI_API_KEY="your-key"

python shared_workspace.py               # ACP pipeline with all three harnesses
python shared_workspace.py --no-claude   # Pure OpenHands agent delegation
python shared_workspace.py --cloud       # Run on Cloud infrastructure (still single sandbox)

When run with `--no-claude`, the SDK uses `DelegateTool` to spawn OpenHands subagents, so the LLM decides the flow rather than a hardcoded script.

Pattern 2: Isolated Local — Multiple Workspaces (`multi_server_isolation.py`)

Each phase runs in its own isolated git clone under a different temporary directory. The script uses the OpenHands SDK for every phase, and changes move between workspaces through git push/pull.

multi_server_isolation.py (your laptop)
│
├─► Agent 1 [OpenHands SDK + Anthropic LLM]  → /tmp/workspace_claude/
│     └─ Implements code → git push
│
├─► Agent 2 [OpenHands SDK + Gemini LLM]     → /tmp/workspace_gemini/
│     └─ git pull → writes tests → pytest → optional repair → git push
│
└─► Agent 3 [OpenHands SDK reviewer]         → /tmp/workspace_reviewer/
      └─ git pull → reviews code

Architecture: Multiple isolated workspaces, manual git coordination, and a local bare repo used as the shared origin. Each phase has its own clone and the orchestrator runs local pytest verification before review.

Best for: Air-gapped environments, custom orchestration, learning how to build multi-agent systems.

Trade-off: Full isolation, but the local orchestrator has to manage repo mirroring, branch handoff, verification, and repair retries.
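The bare-origin handoff can be sketched with plain `git` commands driven from Python. This is an assumption-laden illustration, not the code in `multi_server_isolation.py`: the helper names, workspace directory names, and file contents are invented, the agents are replaced by fixed file writes, and each workspace is cloned when its phase starts rather than pulled into an existing clone. It needs `git` on the PATH.

```python
import subprocess
import tempfile
from pathlib import Path
from typing import List

# Identity is passed per command so the sketch does not rely on global git config.
GIT = ["git", "-c", "user.name=demo", "-c", "user.email=demo@example.com"]


def git(*args: str, cwd: Path) -> str:
    """Run one git command in cwd and return stdout; raises if git fails."""
    result = subprocess.run(
        [*GIT, *args], cwd=cwd, check=True, capture_output=True, text=True
    )
    return result.stdout


def clone(origin: Path, dest: Path) -> Path:
    """Create an isolated workspace by cloning the shared bare origin."""
    git("clone", str(origin), str(dest), cwd=origin.parent)
    return dest


def commit_and_push(workspace: Path, filename: str, content: str, message: str) -> None:
    """Write one file, commit it, and publish it to main on the origin."""
    (workspace / filename).write_text(content)
    git("add", filename, cwd=workspace)
    git("commit", "-m", message, cwd=workspace)
    git("push", "origin", "HEAD:refs/heads/main", cwd=workspace)


def run_handoff(root: Path) -> List[str]:
    origin = root / "origin.git"
    origin.mkdir()
    git("init", "--bare", cwd=origin)
    # Point the bare repo's HEAD at main so later clones check it out by default.
    git("symbolic-ref", "HEAD", "refs/heads/main", cwd=origin)

    implementer = clone(origin, root / "workspace_implementer")
    commit_and_push(
        implementer, "shortener.py",
        "def shorten(url):\n    return url[:8]\n", "implement",
    )

    tester = clone(origin, root / "workspace_tester")
    commit_and_push(
        tester, "test_shortener.py",
        "from shortener import shorten\n\n"
        "def test_shorten():\n    assert len(shorten('https://x.example')) == 8\n",
        "add tests",
    )

    reviewer = clone(origin, root / "workspace_reviewer")
    return sorted(p.name for p in reviewer.glob("*.py"))


with tempfile.TemporaryDirectory() as tmp:
    print(run_handoff(Path(tmp)))
```

Each workspace only ever sees what was pushed to the origin, which is what gives this pattern its isolation and also why the orchestrator has to own every push and clone.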

Setup and Run

# Prerequisites: Same as Pattern 1 (ANTHROPIC_API_KEY, GEMINI_API_KEY)
pip install openhands-ai pytest

python multi_server_isolation.py                    # Run full pipeline
python multi_server_isolation.py --no-claude        # OpenHands only
python multi_server_isolation.py --task csv-tool    # Different task

Notes:

multi_server_isolation.py creates a temporary bare git origin from your local checkout, then clones isolated workspaces from that origin.
The implementation phase defaults to Anthropic Sonnet, the test phase defaults to Gemini, and the reviewer falls back across configured LLM keys.
The tester workspace is verified with local pytest; if it fails, the script does one repair pass and retries.
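The verify-then-repair-once behavior in the last note can be expressed as a small control loop. The sketch below is a guess at the shape, not the script's actual code: `verify_with_one_repair` and `pytest_passes` are hypothetical names, and the demo uses fake callables in place of a real test run and a real repair agent.

```python
import subprocess
import sys
from pathlib import Path
from typing import Callable, Tuple


def pytest_passes(workspace: Path) -> bool:
    """One plausible verifier: run pytest in the workspace and check the exit code."""
    result = subprocess.run([sys.executable, "-m", "pytest", "-q"], cwd=workspace)
    return result.returncode == 0


def verify_with_one_repair(
    run_tests: Callable[[], bool], repair: Callable[[], None]
) -> Tuple[bool, int]:
    """Run tests; on failure do a single repair pass and re-run once.

    Returns (passed, number_of_test_runs).
    """
    if run_tests():
        return True, 1
    repair()
    return run_tests(), 2


# Demo with fakes: tests fail until the repair step flips the flag.
state = {"fixed": False}


def fake_tests() -> bool:
    return state["fixed"]


def fake_repair() -> None:
    state["fixed"] = True


print(verify_with_one_repair(fake_tests, fake_repair))
```

Capping the loop at one repair keeps the run bounded; a workspace that still fails after the second test run is reported as failed rather than retried indefinitely.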

Pattern 3: Enterprise — Automatic Multi-Sandbox (`cloud_conversations.py`)

Each agent runs in its own sandbox on OpenHands Cloud or Enterprise (self-hosted). The platform automatically provisions sandboxes, handles git coordination, and provides a web UI for each agent.

cloud_conversations.py (your laptop)
│
├─► ☁️ Conversation 1   [Claude Code / Anthropic]
│     └─ Platform provisions sandbox, implements, pushes to repo
│
├─► ☁️ Conversation 2   [Gemini CLI / Google]
│     └─ Platform provisions sandbox, pulls, tests, pushes
│
└─► ☁️ Conversation 3   [OpenHands]
      └─ Platform provisions sandbox, pulls, reviews

Architecture: Enterprise-managed sandboxes, automatic orchestration. You write the high-level workflow; the platform handles the infrastructure.

Best for: Production workflows, observability, auditability, team deployments.

Setup and Run

# Prerequisites: ANTHROPIC_API_KEY and GEMINI_API_KEY configured in platform
# Get an API key from https://app.all-hands.dev → Settings → API Keys (Cloud)
# Or from your self-hosted Enterprise instance

pip install requests
export OPENHANDS_CLOUD_API_KEY="your-cloud-api-key"

python cloud_conversations.py                          # default: url-shortener
python cloud_conversations.py --task csv-tool          # CSV-to-JSON converter
python cloud_conversations.py --task custom --custom-task "Build a rate limiter"
python cloud_conversations.py --repo youruser/yourrepo # your own repo
python cloud_conversations.py --no-claude              # OpenHands for all steps

You’ll see three conversation URLs — click each one to watch that agent work live in the Cloud UI.

Value: Same isolation goal as Pattern 2, but Cloud handles sandbox provisioning, cleanup, and observability for you.
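For readers who want to see how the flags in the commands above might fit together, here is a hedged `argparse` sketch. Only the flag names, the task names, and the `OPENHANDS_CLOUD_API_KEY` variable come from this README; the prompt strings, helper names, and validation rules are assumptions, and the sketch makes no API calls.

```python
import argparse
import os
from typing import Mapping

# Hypothetical prompts keyed by the task names shown in the commands above.
TASKS = {
    "url-shortener": "Build a URL shortener",
    "csv-tool": "Build a CSV-to-JSON converter",
}


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Sketch of the cloud_conversations.py flags")
    parser.add_argument("--task", choices=[*TASKS, "custom"], default="url-shortener")
    parser.add_argument("--custom-task", help="prompt text, used with --task custom")
    parser.add_argument("--repo", help="target repository as owner/name")
    parser.add_argument("--no-claude", action="store_true", help="use OpenHands for all steps")
    return parser


def resolve_task(args: argparse.Namespace) -> str:
    """Turn the parsed flags into the prompt text for the first conversation."""
    if args.task == "custom":
        if not args.custom_task:
            raise ValueError("--task custom requires --custom-task")
        return args.custom_task
    return TASKS[args.task]


def require_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Fail early, before any conversation is started, if the key is missing."""
    key = env.get("OPENHANDS_CLOUD_API_KEY")
    if not key:
        raise RuntimeError("OPENHANDS_CLOUD_API_KEY is not set")
    return key


args = build_parser().parse_args(["--task", "custom", "--custom-task", "Build a rate limiter"])
print(resolve_task(args))
```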

Files

File	What it does
`cloud_conversations.py`	Pattern 3 — Enterprise conversations via API (automatic multi-sandbox)
`shared_workspace.py`	Pattern 1 — SDK with ACP (single shared workspace)
`multi_server_isolation.py`	Pattern 2 — Isolated workspaces with manual git orchestration
`shortener.py`	Sample output — URL shortener generated by the pipeline
`.agents/agents/code-reviewer.md`	File-based agent definition for the reviewer

Architecture Insights

Why Three Patterns?

Each pattern represents a different isolation vs. complexity trade-off:

Pattern 1 is the “Goldilocks” for local development:

✅ Simple (~10 lines)
✅ Fast (no network calls)
✅ All SDK features (DelegateTool, ACP, file-based agents)
❌ No isolation (agents share filesystem)

Pattern 2 provides local isolation with higher operational complexity:

✅ Full isolation (separate workspaces and git clones)
✅ Air-gapped capability
❌ Complex local orchestration
❌ Manual git handoff, verification, and retry management

Pattern 3 is the “Goldilocks” for production:

✅ Full isolation (Cloud provisions sandboxes)
✅ Thin local orchestration script
✅ Observability (Web UI per agent)
✅ Automatic orchestration
❌ Requires Cloud connectivity

The Key Insight

Cloud conversations (Pattern 3) = Isolation (Pattern 2) + Simplicity (Pattern 1)

You get the full sandbox isolation of Pattern 2 without the orchestration complexity. Cloud handles:

✅ Sandbox provisioning and cleanup
✅ Port management
✅ Git integration
✅ Observability (Web UI)
✅ Error recovery

This is why cloud_conversations.py stays relatively thin while multi_server_isolation.py carries the local orchestration burden directly.

Enterprise Value

Multi-vendor flexibility — Anthropic implements, Google tests, OpenHands reviews
Observable workflows — Each agent in its own conversation, fully auditable
Distributed architecture — Agents communicate through artifacts (git), not tight coupling
Vendor-agnostic — Swap any agent without changing the pipeline
Extensible — Add new harnesses by adding entries to HARNESS_INSTRUCTIONS
Pattern flexibility — Start local (Pattern 1), scale to Cloud (Pattern 3)
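The extensibility point can be illustrated with a registry keyed by harness name. The repo names `HARNESS_INSTRUCTIONS`, but its real structure is not shown here, so the dictionary shape, keys, instruction strings, and `build_prompt` helper below are all assumptions.

```python
# Hypothetical shape: harness name -> instruction text prepended to each task.
HARNESS_INSTRUCTIONS = {
    "claude-code": "Implement the spec in the repository and commit the result.",
    "gemini-cli": "Read the implementation and add pytest coverage.",
    "openhands": "Review all changes and report findings with severity.",
}


def build_prompt(harness: str, task: str) -> str:
    """Combine a harness's instructions with the task description."""
    try:
        instructions = HARNESS_INSTRUCTIONS[harness]
    except KeyError:
        known = ", ".join(sorted(HARNESS_INSTRUCTIONS))
        raise ValueError(f"unknown harness {harness!r}; known: {known}") from None
    return f"{instructions}\n\nTask: {task}"


# Adding a harness is one new entry; build_prompt and the pipeline stay unchanged.
HARNESS_INSTRUCTIONS["my-harness"] = "Run the linter and summarize warnings."
print(build_prompt("my-harness", "url-shortener"))
```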
