gpu-memory

#gpu-memory

Are the rich RAM / poor GPU people wrong here?

Reddit r/LocalLLaMA · 3d ago

Discusses the trade-off between dense and Mixture-of-Experts (MoE) models for local AI, noting that users with plenty of RAM have few MoE options beyond Qwen 3.5 122B, and asking whether a large GPU is the only viable path.

#gpu-memory

@KL_Div: LLMs require more GPU memory as they generate longer responses. Can we make GPU memory constant without significantly s…

X AI KOLs Timeline · 2026-04-23

IceCache introduces Dynamic Continuous Indexing to keep GPU memory usage constant during long LLM generations with minimal accuracy loss.
