gpu-benchmarking

#gpu-benchmarking

[Benchmark] 5090RTX: Promt Parsing, Token Generation and Power Level

Reddit r/LocalLLaMA ↗ · 2026-05-14

A user benchmarks the Nvidia 5090 RTX GPU for LLM inference using llama.cpp, measuring prompt processing and token generation at various power levels, finding that prompt processing is more sensitive to power limits than token generation, and noting differences from the 4090 RTX.

0 favorites 0 likes

gpu-benchmarking

[Benchmark] 5090RTX: Promt Parsing, Token Generation and Power Level

Submit Feedback