llama.cpp build b9095 introduces NCCL-free tensor parallelism for dual Blackwell PCIe GPUs, enabling efficient multi-GPU inference.
A user proposes building a heterogeneous AI cluster from Blackwell GPUs and high-memory servers connected via RDMA, and is seeking collaborators for Tinygrad driver development.
At Nvidia GTC 2026, CEO Jensen Huang introduced the next-generation Vera Rubin system, while Supermicro unveiled a full-stack AI Factory portfolio built on Nvidia Blackwell GPUs for turnkey enterprise AI deployment.