Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding ?

Reddit r/LocalLLaMA News

Summary

The author inquires about potential distilled 9B and 14B variants of the Qwen-3.6 model for local coding, citing specific tool-calling and file structure issues encountered with Qwen-3.5 9B on limited hardware.

As the title suggests. I'm already testing (with some success, and few challenges) usage of Qwen-3.5 9B with a new work laptop that I've received with RTX 1000 6GB VRAM (I know it seems like a joke in today's time and age). I am using it with \`pi\` as the terminal coding harness. The issue I am facing with Qwen-3.5 9B is that I've encountered some (relatively infrequent) issues around: 1. How it handles directories / folders - more than once, strangely I got a deeply nested folder structure for final code/test artefacts 2. Recognized test run to be failure, while it was actually a success Same prompts when used with gemini-2.5-flash and gemini-2.5-flash-lite don't see such issues, indicating the possibility that the issue is not with \`pi\`. I've read some reports of \`pi\` sometimes struggling with Qwen-3.5 tool-calling, and that is apparently fixed in Qwen-3.6. Thus wondering if anyone heard or Qwen-3.6-27B dense model distillations with 9B, 14B might also be released, enabling using in smaller GPUs.
Original Article

Similar Articles

Qwen/Qwen3.6-35B-A3B-FP8

Hugging Face Models Trending

Alibaba releases Qwen3.6-35B-A3B-FP8, an open-weight quantized variant of Qwen3.6 with 35B parameters and 3B activated via MoE, featuring improved agentic coding capabilities and thinking preservation for iterative development.

The Qwen 3.6 35B A3B hype is real!!!

Reddit r/LocalLLaMA

The author benchmarks small local LLMs, highlighting Qwen 3.6 35B A3B for its superior ability to map academic code to research papers compared to models like Gemma 4 and Nemotron 3 Nano.