Quoting Georgi Gerganov

Simon Willison's Blog 06/16/26, 04:04 PM News

local-ai coding-assistant qwen llama-cpp offline-llm georgi-gerganov

Summary

Georgi Gerganov attests that Qwen3.6-27B is a very capable local coding model, which he uses daily on his M2 Ultra or RTX 5090 with a lightweight harness.

No content available

Original Article

View Cached Full Text

Cached at: 06/16/26, 07:33 PM

# A quote from Georgi Gerganov Source: [https://simonwillison.net/2026/Jun/16/georgi-gerganov/](https://simonwillison.net/2026/Jun/16/georgi-gerganov/) 16th June 2026 > I can 100% attest to the fact that Qwen3\.6\-27B is a very capable local model for coding tasks\. Over the last month and a half I've been using it almost daily, either on my M2 Ultra or on my RTX 5090 box\. I use it for small[mundane tasks at ggml\-org](https://github.com/search?q=%22Assisted-by%22+user%3Aggml-org&type=commits&ref=advsearch)\- nothing really impressive, but definitely a helpful tool for a maintainer\. I think I would be using it much more, if I didn't have to spend a lot of my time on reviewing PRs\. Currently, I have a very lightweight harness \- the pi agent with everything stripped \(`pi \-nc \-\-offline`\) and[a short system prompt](https://github.com/ggml-org/llama.cpp/blob/master/.pi/gg/SYSTEM.md)to align it a bit with my style\. —[Georgi Gerganov](https://news.ycombinator.com/item?id=48555993#48557304),Hacker News comment on[Running local models is good now](https://vickiboykis.com/2026/06/15/running-local-models-is-good-now/)by Boykis

Quoting Georgi Gerganov

Similar Articles

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable

Qwen3.6-27B-GGUF is here!

The Qwen 3.6 35B A3B hype is real!!!

High VRAM local coding model — still Qwen 3.6 27B?

Qwen/Qwen3.6-27B

Submit Feedback

Similar Articles

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable

The Qwen 3.6 35B A3B hype is real!!!

High VRAM local coding model — still Qwen 3.6 27B?