People are making single-slot, half height pcie v100 with nvlink in China

Reddit r/LocalLLaMA News

Summary

Chinese GPU enthusiasts have created a custom single-slot, half-height PCIe V100 GPU with NVLINK, retaining full performance and aiming to sell for around $220.

https://preview.redd.it/iysrmwmjr96h1.png?width=869&format=png&auto=webp&s=c44258020dc9bbde13dbf6ff3522a778c57baf05 https://preview.redd.it/yu3jolmnm96h1.png?width=899&format=png&auto=webp&s=4a938ad5675015cc6b5e4bb9d1f1bdf7e23d1792 https://preview.redd.it/1xoi8uw4n96h1.png?width=850&format=png&auto=webp&s=434fb60d256413af75fa435044ed24bf1f2986ab https://preview.redd.it/skl63bego96h1.jpg?width=857&format=pjpg&auto=webp&s=1de6255928505ce6697c1bd476720f62cb9181f3 The video was released on Bilibili two days ago, and the actual product is not out (for purchase) yet. But it seems real. Not an adapter, but actually soldered core on a custom PCB. Designed for passive cooling, so the default version comes with just PCIe power and capped at 75W, do have alternative version with the powerport enabled and support up to 300W though. 16cm length, 7.5cm height. Fully functional, and retains the full performance of the core. Benchmarks are included in the video. According to the video, a 32GB version is also coming. They expect to sell it (16GB version) around/below ¥1500, which is around $220 US dollars. One of my friends has already pre-ordered two... The creator of this called is called “显卡仙人”, which translates to "GPU god" or "The cultivator of GPU". If this is real, I guess you can really call them that.
Original Article

Similar Articles

I Put a Datacenter GPU in My Gaming PC for £200

Lobsters Hottest

A blogger describes how they acquired a Tesla V100 SXM2 datacenter GPU for £150 and used a custom adapter to install it in their gaming PC alongside an RTX 4080, achieving 32GB of total VRAM and enabling local inference of 27B parameter models at 32 tokens per second.

Cheap V100 32gb

Reddit r/LocalLLaMA

A deal for a used V100 32GB GPU on Aliexpress at approximately $526, including coupon codes.

Buying AI accelerators/GPUs in China...

Reddit r/LocalLLaMA

A user asks about buying Chinese AI accelerators/GPUs for inference, specifically looking for Huawei alternatives to Nvidia, with support for vLLM or Llama.cpp.