@Daniel_Farinax: Qwen3.6-27B on MacBook Pro M5 128GB MLX with custom coding CLI optimized for it. Should also work on M1, M2, M3, M4 Mac…

X AI KOLs Timeline 05/13/26, 06:13 PM Tools

qwen mlx macbook cli inference typescript open-source

Summary

Daniel Farinax announces a custom CLI for running Qwen3.6-27B on MacBooks via MLX, seeking beta testers and moving to TypeScript for faster iteration.

Qwen3.6-27B on MacBook Pro M5 128GB MLX with custom coding CLI optimized for it. Should also work on M1, M2, M3, M4 Macs. Created this in 8 prompts. I need beta testers for the CLI. Pivoting to TypeScript for faster iteration. https://t.co/DHESMMbcp3

Original Article

View Cached Full Text

Cached at: 05/13/26, 10:25 PM

Qwen3.6-27B on MacBook Pro M5 128GB MLX with custom coding CLI optimized for it. Should also work on M1, M2, M3, M4 Macs.

Created this in 8 prompts. I need beta testers for the CLI. Pivoting to TypeScript for faster iteration. https://t.co/DHESMMbcp3

Similar Articles

@Daniel_Farinax: Qwen3.6-27B on MacBook Pro M5 128GB. Third version of the game. This time a low-poly GTA, built overnight using a custo…

X AI KOLs Timeline

Daniel Farinax demonstrates running Qwen3.6-27B on a MacBook Pro M5 128GB, using a custom Rust CLI (MPTLX) to build a low-poly GTA game overnight, claiming blazing fast performance comparable to Claude 4.6 running locally.

Qwen3.6-35B-A3B-Abliterated-Heretic-MLX-4bit

Reddit r/LocalLLaMA

The user reviews a quantized and fine-tuned version of the Qwen3.6-35B model optimized for Apple Silicon via MLX, praising its speed, intelligence, and lack of safety disclaimers.

Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

Reddit r/LocalLLaMA

The article discusses running the Qwen 3.6 35b Mixture-of-Experts model locally on an Apple M1 Max Mac using Zoo Code, highlighting its capabilities as a battery-powered coding assistant.

@rohanpaul_ai: Qwen 3.6 27B on a MacBook Pro M5 Max 64GB hitting 34tokens per sec, locally with atomic[.]chat 90% acceptance rate, i.e…

X AI KOLs Following

Qwen 3.6 27B achieves 34 tokens/sec on a MacBook Pro M5 Max 64GB locally with 90% draft acceptance, enabled by TurboQuant, GGUF, and llama.cpp, showcasing a major advancement in laptop-based AI inference.

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

Reddit r/LocalLLaMA

A developer benchmarked 21 local LLMs on MacBook Air M5 using HumanEval+ and found Qwen 3.6 35B-A3B (MoE) leads at 89.6% with 16.9 tok/s, while Qwen 2.5 Coder 7B offers the best RAM-to-performance ratio at 84.2% in 4.5 GB. Notably, Gemma 4 models significantly underperformed expectations (31.1% for 31B), possibly due to Q4_K_M quantization effects.

Similar Articles

@Daniel_Farinax: Qwen3.6-27B on MacBook Pro M5 128GB. Third version of the game. This time a low-poly GTA, built overnight using a custo…

Qwen3.6-35B-A3B-Abliterated-Heretic-MLX-4bit

Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

@rohanpaul_ai: Qwen 3.6 27B on a MacBook Pro M5 Max 64GB hitting 34tokens per sec, locally with atomic[.]chat 90% acceptance rate, i.e…

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

Submit Feedback