If you use Hermes long enough you will hit the MEMORY md wall. Here is what we did about it.

Reddit r/AI_Agents Tools

Summary

AtomicMemory is a new memory layer for the Hermes agent that replaces the 6-turn flush cycle with per-turn classification and removes the 2.2KB memory cap by storing claims in Postgres, all running on a small local 3B model.

The MEMORY md consolidation lag is real. Hermes flushes every 6 turns which means if you update the same fact three times in one session, the agent processes multiple turns on the wrong version before the flush catches up. The 2.2KB cap is the other wall. Once you hit it you are manually deciding what to throw away and hoping nothing important gets pruned. We built AtomicMemory as the memory layer underneath Hermes to fix both of these. What changes: * Per-turn classification instead of 6-turn flush cycles. Every write gets decided on immediately, add, update, supersede, or no-op before it hits storage. * No size cap. Memory lives in Postgres so you can store thousands of claims with no pruning. * Every memory queryable directly from Postgres so you can see exactly what Hermes believes and why at any point. * Runs on a tiny 3B model locally so it does not eat into your main agent tokens. Setup is three steps. Run AtomicMemory in Docker, point it at Ollama, drop the plugin into Hermes with one address. Run hermes memory status to confirm. Done. This is an upgrade not a replacement. Hermes still handles everything it did. AtomicMemory just makes the memory layer underneath it honest
Original Article

Similar Articles