I can't get Qwen3.6 27B to outperform Qwen-Coder-Next and I'm not sure why

Reddit r/LocalLLaMA 05/17/26, 06:15 PM News

model-comparison qwen coder performance benchmarking llama-cpp

Summary

A user reports that Qwen-Coder-Next outperforms Qwen3.6 27B in both real-world tests and synthetic benchmarks, despite others praising 27B, and seeks advice on possible setup issues.

In my real-world usage (opencode) and in my synthetic benchmarks, Coder-Next (Q5) demolishes the whole Qwen3.6 family including the 27B Dense model (All Q8). Everybody else is hailing that 27B is superior and is an amazing model, but I haven't been able to replicate any of that. Coder-Next seems to overperform, and 27B seems to underperform. I am using the recommended settings on the model cards, and I have tried several 27B models including the MTP one Unsloth released. I'm using llama.cpp with a 96GB variant Strix Halo machine. I would think it's the speed that is causing it to trip up, but 35BA3B also performs poorly. Has anybody ran into this? Is 27B just being compared to other GPU sized models, or is something in my setup not optimal?

Original Article

Similar Articles

Qwen 3.6 35B A3B vs Qwen 3.5 122B A10B

Reddit r/LocalLLaMA

User reports Qwen 3.5 122B significantly outperforms Qwen 3.6 35B on multi-step tasks despite benchmark claims, questioning if quantization or setup issues are to blame.

@populartourist: Having worked consistently with Qwen3.6 27B NVFP4 on repos - it's clear that this quant is not reliable, at least for c…

X AI KOLs Timeline

The user reports that the Qwen3.6 27B NVFP4 quantization is unreliable for coding, with inconsistent quality despite high throughput, and suggests that Q4_K_M may be more consistent.

Anyone use QwQ-32B? It's over a year old? Has Qwen 3.6 27b basically replaced it?

Reddit r/LocalLLaMA

A discussion on whether the older QwQ-32B model is still useful compared to newer alternatives like Qwen 3.6 27b and Gemma 4, particularly for coding tasks.

@KyleHessling1: Guys, I am absolutely astounded. The Qwen 3.6 27b is like a jump to Qwen 4 from Qwen 27B 3.5. I just did a full suite o…

X AI KOLs Following

Early user reports that Qwen 3.6 27B shows dramatic performance gains over 3.5, excelling in front-end design and agentic benchmarks.

Qwen3.6-35B-A3B and 9B are officially on the public Terminal-Bench 2.0 leaderboard!

Reddit r/LocalLLaMA

Qwen3.6-35B-A3B and Qwen3.5-9B models are officially on the Terminal-Bench 2.0 leaderboard, with little-coder achieving 24.6% on the 35B variant, surpassing Gemini 2.5 Pro and Qwen3-Coder-480B, while the 9B model shows that sub-10B local models can compete on hard agentic benchmarks.

Similar Articles

Qwen 3.6 35B A3B vs Qwen 3.5 122B A10B

@populartourist: Having worked consistently with Qwen3.6 27B NVFP4 on repos - it's clear that this quant is not reliable, at least for c…

Anyone use QwQ-32B? It's over a year old? Has Qwen 3.6 27b basically replaced it?

@KyleHessling1: Guys, I am absolutely astounded. The Qwen 3.6 27b is like a jump to Qwen 4 from Qwen 27B 3.5. I just did a full suite o…

Qwen3.6-35B-A3B and 9B are officially on the public Terminal-Bench 2.0 leaderboard!

Submit Feedback