I can't get Qwen3.6 27B to outperform Qwen-Coder-Next and I'm not sure why

Reddit r/LocalLLaMA News

Summary

A user reports that Qwen-Coder-Next outperforms Qwen3.6 27B in both real-world tests and synthetic benchmarks, despite others praising 27B, and seeks advice on possible setup issues.

In my real-world usage (opencode) and in my synthetic benchmarks, Coder-Next (Q5) demolishes the whole Qwen3.6 family including the 27B Dense model (All Q8). Everybody else is hailing that 27B is superior and is an amazing model, but I haven't been able to replicate any of that. Coder-Next seems to overperform, and 27B seems to underperform. I am using the recommended settings on the model cards, and I have tried several 27B models including the MTP one Unsloth released. I'm using llama.cpp with a 96GB variant Strix Halo machine. I would think it's the speed that is causing it to trip up, but 35BA3B also performs poorly. Has anybody ran into this? Is 27B just being compared to other GPU sized models, or is something in my setup not optimal?
Original Article

Similar Articles

Qwen 3.6 35B A3B vs Qwen 3.5 122B A10B

Reddit r/LocalLLaMA

User reports Qwen 3.5 122B significantly outperforms Qwen 3.6 35B on multi-step tasks despite benchmark claims, questioning if quantization or setup issues are to blame.