For everyone that uses OpenCode / Pi - Heres your promptprocessing fix!
Summary
A pull request for llama.cpp fixes the constant prompt processing issue that occurs when using OpenCode or Pi with the library.
Similar Articles
PSA: If you haven’t updated Llama.cpp for a couple of days and find MTP to not be performing well, update llamacpp.
Update Llama.cpp for a significant token generation speed boost, up to 1.5-1.8x, and improved prompt processing.
llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp
This pull request optimizes llama.cpp by avoiding unnecessary copying of logits during prompt decode in multi-token prediction, improving inference performance.
can1357/oh-my-pi
Oh My Pi is an open-source coding agent built on Pi, offering a wired-in IDE, support for 40+ providers, built-in tools, and significant performance improvements across models.
Find bugs in YOUR code using OpenCode, Llama.cpp and Qwen3.6
This article describes how to use the coding agent OpenCode with llama.cpp and the Qwen3.6 model to find bugs in code, while emphasizing the critical safety measures needed to prevent LLMs from accessing sensitive data.
StepFun 3.5 MTP by pwilkin · Pull Request #23274 · ggml-org/llama.cpp
Pull request adding support for StepFun 3.5 MTP model in llama.cpp.