MiroThinker-1.7, an open-weight deep research agent (Qwen3 MoE base) — mini is 30B/3B active, curious what tok/s people get on consumer hardware

Reddit r/LocalLLaMA 05/17/26, 03:26 PM Models

open-source open-weight deep-research agent qwen3 moe benchmark

Summary

MiroThinker-1.7 is an open-weight deep research agent built on Qwen3 MoE, with a mini version (30B total, 3B active) designed for consumer hardware; the team shares benchmarks and seeks feedback on local deployment.

As usual, disclosure first: I'm on the team that built this. Our MiroThinker-1.7-deepresearch and 1.7-mini-deepresearch API went live, mini is a deep research agent built on Qwen3 MoE (30B total, 3B active for mini). Weights on HuggingFace: [huggingface.co/miromind-ai/MiroThinker-1.7](https://huggingface.co/collections/miromind-ai/mirothinker-17) Posting here because the open-weight agent conversation mostly happens in this sub and I'd genuinely like feed because commenting in reddit and discussing did get me some feedback, but it was actually not enough. Tried to load a github APP on our DC server to get PR notified faster but realized there was actually not enough and one was a promo. Benchmarks (arxiv Table 1, cherry-picked to fit a table but full comparison in paper): |Model|BrowseComp|BrowseComp-ZH|HLE-Text|GAIA|xbench-DS|SEAL-0| |:-|:-|:-|:-|:-|:-|:-| |MiroThinker-1.7|74.0|75.3|42.9|82.7|62.0|53.0| |MiroThinker-1.7-mini (30B/3B active)|67.9|72.3|36.4|80.3|57.2|48.2| |Qwen3.5-397B|78.6|70.3|48.3|–|–|46.9| |DeepSeek-V3.2|67.6|65.0|40.8|–|–|49.5| |GPT-5 (closed, for context)|54.9|65.0|35.2|76.4|75.0|51.4| Two things I'd specifically want this sub to push back on: 1. The mini model is only 3B active params — anyone tried running it locally yet? Curious what tok/s people are getting on consumer hardware. 2. Our context management (sliding window K=5 + episode restarts) is opinionated. If you've run long-context agents locally you probably have opinions on this. Paper: arXiv:2603.15726 See y'all in the comments, will reply tomorrow\~ please don't downvote me, for a genuinely good open-source project we ARE not getting enough dev feedback and Reddit has been a good source so far.

Original Article

MiroThinker-1.7, an open-weight deep research agent (Qwen3 MoE base) — mini is 30B/3B active, curious what tok/s people get on consumer hardware

Similar Articles

Big Model Value Wars - DeepSeek V4 Pro vs MiMo-V2.5-Pro vs MiniMax M3

@cyrilXBT: Nemotron 3 Ultra versus DeepSeek V4 versus MiniMax M3 versus Qwen 3.7 Max. Same two prompts. Four frontier models. One …

@TeksEdge: With MiniMax M3 open source now out, here is what to expect on quants and sizes, including VRAM needed: MiniMax M3 (428…

Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments

Qwen3.7: The Agent Frontier (15 minute read)

Submit Feedback

Similar Articles

Big Model Value Wars - DeepSeek V4 Pro vs MiMo-V2.5-Pro vs MiniMax M3

@cyrilXBT: Nemotron 3 Ultra versus DeepSeek V4 versus MiniMax M3 versus Qwen 3.7 Max. Same two prompts. Four frontier models. One …
A comparison of four frontier AI models (Nemotron 3 Ultra, DeepSeek V4, MiniMax M3, Qwen 3.7 Max) on the same two prompts, with full results linked.

@TeksEdge: With MiniMax M3 open source now out, here is what to expect on quants and sizes, including VRAM needed: MiniMax M3 (428…

Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments

Qwen3.7: The Agent Frontier (15 minute read)