Joining all GPUs to train a community model

Reddit r/LocalLLaMA 06/16/26, 08:46 AM News

Summary

A discussion about pooling a community's GPUs to train a massive AI model, asking whether it is feasible and whether any existing project has succeeded despite known bottlenecks such as latency, weight poisoning, and nodes disconnecting.

This sub controls an insane amount of collective VRAM. Why aren't we pooling our GPUs to train a massive community model? Are there any active distributed volunteer computing projects actually doing this right now? I know the bottlenecks (latency, weight poisoning, nodes disconnecting), but has anyone actually pulled off a successful community training run? Or is the latency bottleneck too bad?
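For a rough sense of the communication bottleneck the post mentions, here is a back-of-the-envelope sketch. Every number in it is a hypothetical assumption chosen for illustration (model size, precision, and home uplink speed are not from the post), and it only counts the time for one volunteer to upload one uncompressed copy of the gradients, ignoring latency, compression, and protocol overhead.

```python
def sync_seconds(params: float, bytes_per_param: float, uplink_mbps: float) -> float:
    """Seconds to upload one full, uncompressed copy of the gradients."""
    payload_bits = params * bytes_per_param * 8   # total bits to send
    return payload_bits / (uplink_mbps * 1e6)     # uplink given in megabits per second


# Hypothetical figures: 7e9 parameters, 2 bytes each (fp16), 20 Mbit/s home uplink.
seconds = sync_seconds(7e9, 2, 20)
print(f"{seconds:.0f} s, about {seconds / 3600:.1f} hours per full gradient upload")
```

Under these assumed figures a single naive synchronization takes 5,600 seconds, roughly 1.6 hours, which suggests why a volunteer setup would likely need to sync far less often or send far less data than a datacenter cluster does.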

Similar Articles

Get in here: Community model build thread

Reddit r/LocalLLaMA

A thread proposing a method for creating a community AI model using crowdsourced compute via Branch-Train-Stitch to build a Mixture-of-Experts model from independently trained submodels, with discussion of hardware requirements, participant involvement, and technical challenges.

Could AI training be decentralized like Bitcoin mining? [D]

Reddit r/MachineLearning

A discussion explores whether AI training could be decentralized like Bitcoin mining, with participants contributing GPU resources to train open-source models in exchange for tokens, raising questions about verification, fake gradients, and efficiency.

@andrewchen: finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then…

X AI KOLs Following

Andrew Chen shares his experience of buying multiple GPUs for local AI experimentation, running Qwen3.6 27B dense at 100 tok/s on a 5090 eGPU, and compares it to Sonnet 4.6.

@leopardracer: https://x.com/leopardracer/status/2055341758523883631

X AI KOLs Timeline

A user shares their experience setting up a dual-GPU local AI lab with RTX 4080 Super and 5060 Ti, running Qwen 3.6 models via llama.cpp and llama-swap to reduce API costs and enable unrestricted experimentation.

Why can't people just run gemini and claude code using their own gpus?

Reddit r/artificial

A commentary questioning why users cannot run Gemini and Claude Code locally on their own GPUs, implying compute cost constraints are limiting access to these AI models.