dual-gpu

#dual-gpu

Can't believe I got it working! Dual GPU - 48gb VRAM llama-cpp server - R7900 + 7800XT

Reddit r/LocalLLaMA ↗ · 2026-05-22

A user successfully set up a dual-GPU llama-cpp server with 48GB VRAM using an AMD Radeon PRO and 7800 XT via Vulkan in Docker on Kubuntu 24.04.

0 favorites 0 likes

#dual-gpu

Dual GPU llama.cpp speedup

Reddit r/LocalLLaMA ↗ · 2026-05-17

A fork of llama.cpp fixes the --split-mode tensor issue with quantized KV caches, achieving up to 40% speed improvement on dual GPU setups without quality loss.

0 favorites 0 likes

dual-gpu

Can't believe I got it working! Dual GPU - 48gb VRAM llama-cpp server - R7900 + 7800XT

Dual GPU llama.cpp speedup

Submit Feedback