Relay: A ledger-based middleware for reliable agent handoffs (Zero-dependency)

Reddit r/AI_Agents Tools

Summary

Relay is a ledger-based middleware for secure and auditable agent handoffs in multi-agent systems, featuring append-only context, snapshot recovery, and hard-cap budgeting to prevent context corruption and data leaks.

I’ve been seeing a lot of "Context Corruption" in multi-agent systems where agents slowly drift away from the facts or leak data they shouldn't. Things like context pollution and context exposure can leak major things like your API keys and credits. That's why you need something secure and auditable ..... You need **Relay** . **Key Architecture Decisions:** 1. **Append-only Ledger:** Context is never "overwritten." Every step creates a new signed envelope. 2. **Snapshot-First Recovery:** Instead of trying to prompt-engineer an agent back to sanity, Relay triggers a rollback to the last valid snapshot. 3. **Framework-Agnostic:** It works with LangChain, CrewAI, AutoGen, or just raw OpenAI/Ollama calls via adapters. 4. **Hard-Cap Budgeting:** It projects token costs *before* the call. If the agent is about to blow your budget, Relay kills the process. I’m looking for feedback on the Parallel Fork-Join model (v0.4). You can run 3 agents on the same context and join them via `UNION`, `VOTE`, or `FIRST_WINS`.
Original Article

Similar Articles

@dzhng: you should try /relay of duet agent:

X AI KOLs Following

duet-agent is a harness for persistent AI agent jobs that outlive a single chat session, enabling long-running, resumable tasks with cross-session memory and observable state through relay turns.

Relay

Product Hunt

Relay is a tool that allows users to interact with multiple AI models without repeating themselves, streamlining AI interactions.

Learned Relay Representations for Forward-Thinking Discrete Diffusion Models

arXiv cs.LG

This paper introduces Learned Relay Representations (Relay), a method that allows masked diffusion models to propagate latent information across denoising steps, overcoming the hard reset problem and improving performance-latency trade-offs. The method is shown to outperform standard supervised finetuning on coding tasks while reducing inference latency by up to 32%.