Intel Arc Pro B70 llama.cpp benchmarks posted

Reddit r/LocalLLaMA News

Summary

Benchmark results for Intel Arc Pro B70 GPU running llama.cpp with SYCL on Qwen models show 63 tokens per second performance.

[https://www.reddit.com/r/LocalLLM/comments/1tuf6l1/intel\_arc\_pro\_b70\_llamacpp\_sycl\_63\_ts\_on\_qwen/](https://www.reddit.com/r/LocalLLM/comments/1tuf6l1/intel_arc_pro_b70_llamacpp_sycl_63_ts_on_qwen/)
Original Article

Similar Articles

Nvidia RTX 3090 vs Intel Arc Pro B70 llama.cpp Benchmarks

Reddit r/LocalLLaMA

Community benchmark shows Intel Arc Pro B70 averages ~71% slower prompt processing and ~54% slower token generation than RTX 3090 under llama.cpp, with SYCL backend sometimes beating Vulkan on the same card.