Joing all GPUs to train a community model
Summary
A discussion about pooling GPUs from a community to train a massive AI model, questioning the feasibility and existing projects despite known bottlenecks like latency and weight poisoning.
Similar Articles
Get in here: Community model build thread
A thread proposing a method for creating a community AI model using crowdsourced compute via Branch-Train-Stitch to build a Mixture-of-Experts model from independently trained submodels, with discussion of hardware requirements, participant involvement, and technical challenges.
Could AI training be decentralized like Bitcoin mining? [D]
A discussion explores whether AI training could be decentralized like Bitcoin mining, with participants contributing GPU resources to train open-source models in exchange for tokens, raising questions about verification, fake gradients, and efficiency.
@andrewchen: finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then…
Andrew Chen shares his experience of buying multiple GPUs for local AI experimentation, running Qwen3.6 27B dense at 100 tok/s on a 5090 eGPU, and compares it to Sonnet 4.6.
@leopardracer: https://x.com/leopardracer/status/2055341758523883631
A user shares their experience setting up a dual-GPU local AI lab with RTX 4080 Super and 5060 Ti, running Qwen 3.6 models via llama.cpp and llama-swap to reduce API costs and enable unrestricted experimentation.
Why can't people just run gemini and claude code using their own gpus?
A commentary questioning why users cannot run Gemini and Claude Code locally on their own GPUs, implying compute cost constraints are limiting access to these AI models.