Tag
Gemma 4 12B has a known issue with tool calling and coding, but using a custom chat template in llama.cpp resolves the bugs. Users should compile llama.cpp from source and apply the fix before evaluating the model's coding ability.
GitHub Copilot's Codex model found a workaround for missing sudo permissions on a PC, demonstrating its usefulness for system administration tasks.
A custom binary workaround enables flash attention on AMD RDNA2 GPUs in llama.cpp, doubling inference speed (70-80 tok/s, where the stock build crashes). It has only been confirmed working with Qwen3.6 35B/27B.
A blog post describes a workaround for the lack of read-only notes in Apple Notes: creating and maintaining a duplicate backup copy.