@ModelScope2022: Qwen-AgentWorld just dropped two releases on ModelScope! An open 35B total / 3B active MoE world model with 256K contex…

X AI KOLs Timeline Models

Summary

Qwen-AgentWorld releases an open 35B total / 3B active MoE world model with 256K context, along with a 7-domain benchmark, achieving state-of-the-art performance on AgentWorldBench.

Qwen-AgentWorld just dropped two releases on ModelScope! An open 35B total / 3B active MoE world model with 256K context, plus a 7-domain benchmark grounded in real environment observations. https://modelscope.ai/collections/Qwen/Qwen-AgentWorld… Qwen-AgentWorld-35B-A3B One model for 7 agent environments: MCP, Search, Terminal, SWE, Web, OS, and Android 47.73 → 56.39 on AgentWorldBench, surpassing Claude Sonnet 4.6 at 56.04 Three-stage training: CPT injects environment knowledge, SFT activates next-state prediction reasoning, and RL sharpens simulation fidelity AgentWorldBench Covers 7 domains with 2,170 samples and 22.8 average turns Scores predictions on format, factuality, consistency, realism, and quality
Original Article
View Cached Full Text

Cached at: 06/24/26, 12:24 PM

Qwen-AgentWorld just dropped two releases on ModelScope! An open 35B total / 3B active MoE world model with 256K context, plus a 7-domain benchmark grounded in real environment observations. https://modelscope.ai/collections/Qwen/Qwen-AgentWorld…

Qwen-AgentWorld-35B-A3B One model for 7 agent environments: MCP, Search, Terminal, SWE, Web, OS, and Android 47.73 → 56.39 on AgentWorldBench, surpassing Claude Sonnet 4.6 at 56.04 Three-stage training: CPT injects environment knowledge, SFT activates next-state prediction reasoning, and RL sharpens simulation fidelity

AgentWorldBench Covers 7 domains with 2,170 samples and 22.8 average turns Scores predictions on format, factuality, consistency, realism, and quality

Similar Articles

Qwen-AgentWorld: Language World Models for General Agents

Hacker News Top

Qwen-AgentWorld introduces language world models for agentic environments, covering seven domains with long chain-of-thought reasoning. The work includes a new benchmark, AgentWorldBench, and shows that world modeling improves downstream agent performance.

Qwen-AgentWorld-397B-A17B

Reddit r/LocalLLaMA

Qwen released a new large language model, Qwen-AgentWorld-397B-A17B, as detailed on HuggingFace and the Qwen blog.

Qwen3.7-Max: The Agent Frontier

Hacker News Top

Qwen3.7-Max is a new AI model release focused on agent capabilities, pushing the boundaries of autonomous AI agents.