Mudler released APEX-MTP GGUF quantizations of the Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled model, bundling the multi-token prediction (MTP) head so that llama.cpp can use it for self-speculative decoding.
A fine-tuned, uncensored version of Qwen3.6-35B-A3B with MTP support and APEX quantization, tested stable at a 200k-token context and recommended for use in LM Studio.