MiroThinker-1.7, an open-weight deep research agent (Qwen3 MoE base) — mini is 30B/3B active, curious what tok/s people get on consumer hardware

Reddit r/LocalLLaMA Models

Summary

MiroThinker-1.7 is an open-weight deep research agent built on Qwen3 MoE, with a mini version (30B total, 3B active) designed for consumer hardware; the team shares benchmarks and seeks feedback on local deployment.

As usual, disclosure first: I'm on the team that built this. Our MiroThinker-1.7-deepresearch and 1.7-mini-deepresearch API went live, mini is a deep research agent built on Qwen3 MoE (30B total, 3B active for mini). Weights on HuggingFace: [huggingface.co/miromind-ai/MiroThinker-1.7](https://huggingface.co/collections/miromind-ai/mirothinker-17) Posting here because the open-weight agent conversation mostly happens in this sub and I'd genuinely like feed because commenting in reddit and discussing did get me some feedback, but it was actually not enough. Tried to load a github APP on our DC server to get PR notified faster but realized there was actually not enough and one was a promo. Benchmarks (arxiv Table 1, cherry-picked to fit a table but full comparison in paper): |Model|BrowseComp|BrowseComp-ZH|HLE-Text|GAIA|xbench-DS|SEAL-0| |:-|:-|:-|:-|:-|:-|:-| |MiroThinker-1.7|74.0|75.3|42.9|82.7|62.0|53.0| |MiroThinker-1.7-mini (30B/3B active)|67.9|72.3|36.4|80.3|57.2|48.2| |Qwen3.5-397B|78.6|70.3|48.3|–|–|46.9| |DeepSeek-V3.2|67.6|65.0|40.8|–|–|49.5| |GPT-5 (closed, for context)|54.9|65.0|35.2|76.4|75.0|51.4| Two things I'd specifically want this sub to push back on: 1. The mini model is only 3B active params — anyone tried running it locally yet? Curious what tok/s people are getting on consumer hardware. 2. Our context management (sliding window K=5 + episode restarts) is opinionated. If you've run long-context agents locally you probably have opinions on this. Paper: arXiv:2603.15726 See y'all in the comments, will reply tomorrow\~ please don't downvote me, for a genuinely good open-source project we ARE not getting enough dev feedback and Reddit has been a good source so far.
Original Article

Similar Articles

Qwen3.7: The Agent Frontier (15 minute read)

TLDR AI

Alibaba's Qwen team has released Qwen3.7-Max, a proprietary agent-foundation model achieving top scores on multiple benchmarks including Terminal-Bench 2.0, SWE-Pro, and GPQA Diamond, with consistent performance across various code environments.

The Qwen 3.6 35B A3B hype is real!!!

Reddit r/LocalLLaMA

The author benchmarks small local LLMs, highlighting Qwen 3.6 35B A3B for its superior ability to map academic code to research papers compared to models like Gemma 4 and Nemotron 3 Nano.