I built a Hermes Skill where 3 AI models argue with each other before giving you an answer - adversarial multi-model consensus with RRF + Borda Count ranking

Reddit r/AI_Agents 05/31/26, 10:06 AM Tools

adversarial multi-model consensus agentic open-source multi-agent orchestration

Summary

PolyGnosis is an adversarial multi-model consensus system built as a Hermes skill. It runs three AI models in parallel with different expert personas, then has a hostile critic phase, scoring via RRF and Borda Count, and a synthesis gate—all built agentically using DeepSeek V4-Pro.

I built an adversarial multi-model consensus system as a Hermes skill — 136 API calls, 16.8M tokens, entirely agent-built Most AI pipelines run one model, give you one answer, and call it a day. No second opinion. No pushback. I wanted something that actually argues with itself before giving me a result. So I built PolyGnosis. **What it does** Three frontier models solve the same problem independently in parallel. Each one gets a dynamically assigned expert persona - Security Auditor, Backend Architect, DBA Consultant - with different tool access per persona. Auditors can read. Builders can write. Not just different prompts, actual role-specific tooling. Then a hostile critic phase tears every solution apart. Bugs, hallucinations, edge cases, security flaws. Severe findings get logged to a Reflexion buffer so the system never repeats the same mistake. After that, two deterministic ranking algorithms - RRF (Reciprocal Rank Fusion) and Borda Count - score the solutions. When both algorithms agree on a winner, you know it's solid. Not vibes. Math. A synthesizer merges the strongest elements into one output, then a Constitutional Quality Gate checks whether the merge actually improved anything. If synthesis introduced errors or degraded quality, the gate rejects it and delivers the best individual solution instead. There's also an Early Resolution circuit - if all three solvers independently agree from the start, it skips critique and scoring entirely. No point burning compute on consensus that already exists. **How it was built** The whole thing was built through agentic prompt-driven development using DeepSeek V4-Pro on Hermes. 136 API calls. 16.8M tokens. 1h 54m. PolyGnosis is built on top of PolyBrain, my multi-model orchestration skill for Hermes. Open source. Feedback welcome. Interested to hear if anyone else is building multi-agent consensus systems on Hermes or other AI Agents.

Original Article

I built a Hermes Skill where 3 AI models argue with each other before giving you an answer - adversarial multi-model consensus with RRF + Borda Count ranking

Similar Articles

@PrajwalTomar_: Hermes Agent just shipped the most interesting feature I've seen in a while. It combines multiple AI models into one an…

Emergent Collaborative Deliberation in Multi-Model AI Systems: A BFT-Derived Protocol for Epistemic Synthesis

@akshay_pachaar: Hermes Mixture of Agents (MoA) explained. Every agent commits to a single model, and every model has blind spots the ot…

Built a platform where Claude, ChatGPT, and Gemini debate each other before giving you an answer

Watching AI models disagree with each other is surprisingly useful

Submit Feedback

Similar Articles

@PrajwalTomar_: Hermes Agent just shipped the most interesting feature I've seen in a while. It combines multiple AI models into one an…

Emergent Collaborative Deliberation in Multi-Model AI Systems: A BFT-Derived Protocol for Epistemic Synthesis

@akshay_pachaar: Hermes Mixture of Agents (MoA) explained. Every agent commits to a single model, and every model has blind spots the ot…

Built a platform where Claude, ChatGPT, and Gemini debate each other before giving you an answer

Watching AI models disagree with each other is surprisingly useful