Llama.cpp B9406 MTP mmproj fix

Reddit r/LocalLLaMA Tools

Summary

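llama.cpp release B9406 fixes a GGML_ASSERT crash in get_rows, reached through mtmd_helper_decode_image_chunk, that occurred when Multi-Token Prediction (MTP) was used with a MoE vision model such as Qwen3.6-35B-A3B.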

[B9406](https://github.com/ggml-org/llama.cpp/releases/tag/b9406) Been waiting for this one. Building now. Report your results if you test!

> GGML_ASSERT(i01 >= 0 && i01 < ne01) crash in get_rows / mtmd_helper_decode_image_chunk when using MTP + MoE model + vision (Qwen3.6-35B-A3B)
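For anyone wondering what the assert is actually checking: the condition `i01 >= 0 && i01 < ne01` reads as a bounds check on a row index. Here is a minimal Python sketch of that kind of check, for illustration only. It is not ggml's implementation; the `get_rows` function, the `table` list, and the sample values are hypothetical, and reading `ne01` as the number of rows is an assumption based on the assertion text.

```python
def get_rows(table, row_ids):
    """Gather rows of `table` by index, with a bounds check on every index."""
    ne01 = len(table)  # assumed meaning: number of rows available
    out = []
    for i01 in row_ids:
        # Same condition as the reported GGML_ASSERT(i01 >= 0 && i01 < ne01)
        assert 0 <= i01 < ne01, f"row index {i01} out of range [0, {ne01})"
        out.append(table[i01])
    return out


# Hypothetical 3-row table standing in for an embedding matrix
embeddings = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
ok = get_rows(embeddings, [2, 0])  # both indices are in range
```

Calling `get_rows(embeddings, [3])` trips the assertion, which is the same kind of failure the crash report describes: a row lookup was handed an index outside the table.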
