threading

#threading

PSA: Test your "threads" argument in llama.cpp (+80% performance in my case)

Reddit r/LocalLLaMA ↗ · 4d ago

A user benchmarks thread counts for hybrid CPU-GPU inference with Gemma 4 in llama.cpp, finds an 80% performance uplift from using 16 threads instead of 6 on a hybrid-core CPU, and shares the optimal command configuration.

Why do you say that a COM STA thread must pump messages if I see sample code creating STA threads and not pumping messages?

The Old New Thing (Raymond Chen) ↗ · 2026-05-22

Raymond Chen explains that a COM STA thread is required to pump messages only while it is idle. Code that is always busy does not need an explicit message loop, but COM still creates a hidden window on the thread, so the thread must pump messages whenever it becomes idle to avoid jamming window broadcasts.
