Who is your favourite quant publisher and why?
Summary
A user shares their preference for Unsloth quantized models due to fast releases and low perplexity, compares them with Apex MoE quants, and asks the community for their favorite quant publisher.
Similar Articles
MagicQuant (v2.0) - Hybrid Mixed GGUF Models + Unsloth Dynamic Learned Quant Configurations + Benchmark table with collapsed winners and more
MagicQuant v2.0 is a pipeline for building hybrid mixed GGUF quant models; it draws on Unsloth dynamic quants and other methods to learn optimal quant configurations via KLD benchmarks, with a focus on nonlinear wins and anomaly detection.
@Italianclownz: Tested MTP, TriAttention, TurboQuant on @UnslothAI @Alibaba_Qwen Qwen 3.6 35B A3B MTP MXFP4_MoE on @huggingface @no_stp…
A user benchmarks MTP, TriAttention, and TurboQuant optimizations on Qwen 3.6 35B using Unsloth on consumer hardware, finding TurboQuant to be the most effective.
@_EldarKurtic: TurboQuant has drawn a lot of attention recently, but the accompanying evals didn't tell the full story. So we ran what…
Eldar Kurtic presents a comprehensive study on TurboQuant, revealing its real-world effects on accuracy, latency, and throughput beyond initial evaluations.
Need Info on quality benchmarks to run on DeepSeek V3.2 different quant levels [D]
A developer seeks quality benchmarks for measuring the impact of runtime quantization on DeepSeek V3.2 performance across different quant levels.
Models and Quants quality test results - the chessboard svg (Qwen3.6 27B/35B-A3B/Zaya1)
Community testers evaluate quantized versions of Qwen3.6, ZAYA1, and other models for SVG chessboard generation accuracy using local inference frameworks like MLX.
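Several of the items above use KL divergence (KLD) between a quantized model's output distribution and the full-precision reference as a quality signal. A minimal sketch of that comparison on toy logits, assuming NumPy only (the function names and data here are illustrative, not from any of the linked tools):

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def mean_kld(ref_logits, quant_logits, eps=1e-10):
    """Mean per-token KL(ref || quant) over a batch of token positions.

    Lower is better: 0.0 means the quantized model reproduces the
    reference distribution exactly at every position.
    """
    p = softmax(ref_logits)
    q = softmax(quant_logits)
    # eps guards against log(0) when a probability underflows.
    return float(np.mean(np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)))

# Toy example: the "quantized" logits are the reference plus small noise,
# standing in for quantization error. 8 token positions, 32k-entry vocab.
rng = np.random.default_rng(0)
ref = rng.normal(size=(8, 32000))
quant = ref + rng.normal(scale=0.05, size=ref.shape)

print(mean_kld(ref, ref))    # exactly 0.0 for identical logits
print(mean_kld(ref, quant))  # small positive value
```

In practice tools like this run the same evaluation text through both the full-precision and quantized models and average the per-token KLD, which is more sensitive to distributional drift than a perplexity delta alone.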