rtx4090

#rtx4090

@davideciffa: If you have an Nvidia RTX 4090 --ddtree-budget 36 is the best configuration that buys you 2.5x speed up during decoding…

X AI KOLs Timeline ↗ · 2026-05-24 Cached

A tweet recommending --ddtree-budget 36 as the best configuration for the Nvidia RTX 4090, claiming a 2.5x speedup during decoding for Qwen3.6_27B.


#rtx4090

Stop wasting electricity

Reddit r/LocalLLaMA ↗ · 2026-05-12

The author demonstrates how to reduce RTX 4090 power consumption by up to 40% while running quantized Qwen models via llama.cpp, without sacrificing inference speed. By capping the GPU power limit through nvidia-smi and adjusting llama-server parameters, users can significantly lower heat and noise and extend hardware lifespan.
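
As a rough illustration of the power-capping step, the sketch below builds (but does not run) an nvidia-smi command that sets a lower power limit. The helper name power_cap_command is hypothetical, the 450 W default is an assumed reference value for the RTX 4090 (individual cards may differ), and the 40% figure is simply reused from the summary; the post's own limit and llama-server settings are not reproduced here.

```python
import shlex


def power_cap_command(default_watts: float, reduction: float, gpu_index: int = 0) -> list[str]:
    """Build an nvidia-smi invocation that caps the GPU power limit.

    default_watts: the card's stock power limit (assumed, not read from the GPU).
    reduction: fraction to cut from the stock limit, e.g. 0.40 for 40%.
    """
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    capped_watts = round(default_watts * (1.0 - reduction))
    # -i selects the GPU, -pl sets the power limit in watts.
    return ["nvidia-smi", "-i", str(gpu_index), "-pl", str(capped_watts)]


# Assumed stock limit of 450 W, cut by 40%.
cmd = power_cap_command(default_watts=450, reduction=0.40)
print(shlex.join(cmd))  # nvidia-smi -i 0 -pl 270
```

Setting the limit usually requires administrator rights, and the driver rejects values outside the range the card supports, so it is worth checking the allowed range with nvidia-smi -q -d POWER before applying a cap.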

