numa

#numa

NUMA: Cores, memory, and the distance between them

Hacker News Top ↗ · 2026-06-24 Cached

The article explains Non-Uniform Memory Access (NUMA), its historical context, and how it affects performance in multi-socket servers, while also introducing Edera's work on making Xen-based virtualization NUMA-aware end-to-end.

0 favorites 0 likes

#numa

I forked ik_llama.cpp and added a "--numa mirror" mode to maximize performance on multi-socket CPU systems. Just sharing and looking for testers!

Reddit r/LocalLLaMA ↗ · 2026-06-21

A developer forked ik_llama.cpp and added a '--numa mirror' mode that duplicates model weights and KV cache across NUMA nodes to maximize multi-socket CPU inference performance, sharing benchmarks and seeking testers.

0 favorites 0 likes

numa

NUMA: Cores, memory, and the distance between them

I forked ik_llama.cpp and added a "--numa mirror" mode to maximize performance on multi-socket CPU systems. Just sharing and looking for testers!

Submit Feedback