ram

Tag

Cards List
#ram

Maybe KV cache offload to RAM isn't bad

Reddit r/LocalLLaMA · yesterday

A user shares their experience offloading the KV cache to RAM in llama.cpp, achieving comparable speeds while freeing VRAM for larger models and context windows, suggesting this trade-off is often worthwhile.

0 favorites 0 likes
#ram

Are the rich RAM /poor GPU people wrong here?

Reddit r/LocalLLaMA · 2026-05-15

Discusses the trade-off between dense and Mixture-of-Experts (MoE) models for local AI, noting that high-RAM users have limited MoE options beyond Qwen 3.5 122B, and questioning if large GPU is the only viable path.

0 favorites 0 likes
#ram

Why do Windows client editions on 32-bit x86 systems artificially limit RAM to 4 GB?

The Old New Thing (Raymond Chen) · 2026-05-12 Cached

Explains the historical reason why 32-bit Windows client editions artificially limit RAM to 4 GB: driver compatibility issues with Physical Address Extensions (PAE) and Data Execution Prevention (DEP), as opposed to any nefarious motive.

0 favorites 0 likes
← Back to home

Submit Feedback