@antirez: Cool use case

X AI KOLs Following 06/25/26, 08:17 PM News

Summary

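Ivan Fioravanti reports running Hermes Agent as game master for a Blades in the Dark role-playing game, locally on an M3 Ultra, using DeepSeek V4 Flash q4-imatrix with ds4 by @antirez. In side-by-side testing against the online version, quality is nearly identical so far, apart from speed.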

Cached at: 06/27/26, 10:02 PM

Cool use case

Ivan Fioravanti ᯅ (@ivanfioravanti): Hermes Agent as Master in a GDR (Blades in the Dark here) running locally on M3 Ultra using DeepSeek V4 Flash q4-imatrix with ds4 by @antirez

Testing side by side with online version and apart from the speed, quality is nearly identical so far.

I’ll keep testing to see how

Similar Articles

@ivanfioravanti: For anyone wondering what does it mean to run ds4-agent locally on an M5 Max using DeepSeek V4 Flash q2-imatrix gguf mo…

X AI KOLs Timeline

A demo of running ds4-agent locally on an M5 Max with DeepSeek V4 Flash q2-imatrix gguf model, showing self-updating capabilities and integration with HF_HOME for gguf models.

@vmiss33: I installed Hermes Agent on Windows, and set it up with GPT 5.5. I gave it one of @above_spec's amazing twitter threads…

X AI KOLs Timeline

The user shares a report on successfully running the Qwen3.6 35B A3B model on Windows using Hermes Agent and an 8GB VRAM GPU.

@analogalok: I just got Gemma 4 26B A4B MoE model running fully locally with Hermes agent on an 8GB RTX 4060 and it's now backtestin…

X AI KOLs Following

A developer demonstrates running Gemma 4 26B MoE model locally on an 8GB RTX 4060 with Hermes agent to fully automate backtesting of trading strategies, highlighting the growing capability of local LLMs as autonomous agents.

@mr_r0b0t: If you have 24-128GB unified memory and use @NousResearch Hermes agents, this is for you! You now run FULLY LOCAL agent…

X AI KOLs Timeline

Announces the ability to run fully local agent teams using NousResearch Hermes agents on systems with 24-128GB unified memory. Each agent has its own Hermes session and works collaboratively via a local orchestrator on long-running tasks.

@shannholmberg: I've started experimenting with gBrain + Hermes Agent it's a shared memory layer that sits underneath my Hermes Agent c…

X AI KOLs Timeline

Shann Holmberg describes an experimental architecture using gBrain as a shared memory layer for a team of Hermes Agents, allowing specialists to read from a centralized brain before acting and write durable context back.
