Tag
A hardware enthusiast built an immersion-cooled supercluster from 8192 CH570 RISC-V microcontrollers and shared the challenges and solutions they encountered with clock and SPI signals and with production capacity.
A setup guide for using a custom Docker/Podman toolbox with ROCm/RCCL RDMA support to cluster two AMD Strix Halo nodes, enabling vLLM with tensor parallelism across 256GB of unified memory.
SkyPilot Sandboxes lets AI teams run sandboxes on their own clusters, with sub-second launches, warm pools, and 4-10x cost savings compared to Modal.
A blog post that guides readers through setting up a Raspberry Pi cluster for distributed training and inference, part of a series aimed at making distributed AI accessible on affordable hardware.