Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

Reddit r/LocalLLaMA | 05/30/26, 08:36 PM | Models

qwen local-llm m1-max coding-assistant mixture-of-experts apple-silicon zoo-code

Summary

A Reddit post about running the Qwen 3.6 35B Mixture-of-Experts model fully locally on an Apple M1 Max Mac with Zoo Code, presenting the setup as a capable coding assistant that works on battery power.

Similar Articles

Running Qwen3.6-35B-A3B Locally for Coding Agent: My Setup & Working Config

Reddit r/LocalLLaMA

A detailed guide for running the 35B-parameter Qwen3.6 model locally on Apple Silicon with llama.cpp to power the pi coding agent, including optimized configuration flags and sampling parameters.

Is anyone getting real coding work done with Qwen3.6-35B-A3B-UD-Q4_K_M on a 32GB Mac in opencode, claude code or similar?

Reddit r/LocalLLaMA

A user shares their experience running a Q4_K_M quantization of Qwen3.6-35B-A3B on an M2 MacBook Pro with 32GB RAM for coding tasks via opencode and llama.cpp. They find that the 32K context window limit causes critical loss of context during compaction, which makes complex coding tasks impractical, and conclude that meaningful agentic coding with this model likely requires at least 128K context, more than their hardware can support.

@Daniel_Farinax: Qwen3.6-27B on MacBook Pro M5 128GB MLX with custom coding CLI optimized for it. Should also work on M1, M2, M3, M4 Mac…

X AI KOLs Timeline

Daniel Farinax announces a custom CLI for running Qwen3.6-27B on MacBooks via MLX, seeking beta testers and moving to TypeScript for faster iteration.

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

Reddit r/LocalLLaMA

A developer benchmarked 21 local LLMs on a MacBook Air M5 using HumanEval+ and found that Qwen 3.6 35B-A3B (MoE) leads at 89.6% with 16.9 tok/s, while Qwen 2.5 Coder 7B offers the best RAM-to-performance ratio at 84.2% in 4.5 GB. Notably, Gemma 4 models fell well short of expectations (31.1% for the 31B model), possibly due to Q4_K_M quantization effects.

The Qwen 3.6 35B A3B hype is real!!!

Reddit r/LocalLLaMA

The author benchmarks small local LLMs, highlighting Qwen 3.6 35B A3B for its superior ability to map academic code to research papers compared to models like Gemma 4 and Nemotron 3 Nano.
