qwen3-5

Qwen3.6 35B-A3B on a Laptop: My Zero to One Moment

Reddit r/LocalLLaMA ↗ · 2026-06-07

The author describes running Qwen3.6 35B-A3B locally on an ASUS Zenbook Pro 14 at 27 tokens per second (TPS) with a 32k context, a personal milestone toward fully local, privacy-preserving AI.

@xenovacom: Opus 4.7 just wrote a custom WebGPU kernel that runs Qwen3.5 up to 13x faster using a fused LinearAttention op! Agentic…

X AI KOLs Following ↗ · 2026-04-23

Opus 4.7 auto-generated a custom WebGPU kernel that speeds up Qwen3.5 inference by up to 13× using a fused LinearAttention op. The kernel now ships in Transformers.js v4.2.0.
