Joining all GPUs to train a community model

Reddit r/LocalLLaMA News

Summary

A discussion about pooling community GPUs to train a massive AI model, asking whether it is feasible and whether any existing projects are doing it, given known bottlenecks such as latency, weight poisoning, and nodes disconnecting.

This sub controls an insane amount of collective VRAM. Why aren't we pooling our GPUs to train a massive community model? Are there any active distributed volunteer computing projects actually doing this right now? I know the bottlenecks (latency, weight poisoning, nodes disconnecting), but has anyone actually pulled off a successful community training run? Or is the latency bottleneck too bad?
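
To put a rough number on the communication bottleneck the post mentions, here is a back-of-the-envelope sketch. Everything in it is an assumption for illustration, not a figure from the thread: the function name `sync_seconds`, a 7B-parameter model, fp16 gradients, a 20 Mbit/s home upload link, 50 ms of latency, and a naive scheme where each node uploads a full, uncompressed set of gradients every sync.

```python
def sync_seconds(num_params, bytes_per_param, upload_mbps, latency_ms):
    """Rough time for one node to upload one full set of gradients.

    Ignores compression, protocol overhead, and download time.
    """
    payload_bits = num_params * bytes_per_param * 8
    transfer = payload_bits / (upload_mbps * 1_000_000)  # seconds on the wire
    return latency_ms / 1000 + transfer


# Hypothetical setup: 7B parameters, fp16 gradients (2 bytes each),
# 20 Mbit/s upload, 50 ms latency.
t = sync_seconds(7_000_000_000, 2, 20, 50)
print(f"{t / 60:.1f} minutes per full gradient upload")
```

Under these assumed numbers, the transfer term dwarfs the latency term, which suggests upload bandwidth rather than round-trip latency would be the first wall a naive volunteer setup hits. Real projects would presumably sync less often or compress updates, which this sketch does not model.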

Similar Articles

Get in here: Community model build thread

Reddit r/LocalLLaMA

A thread proposing a method for creating a community AI model using crowdsourced compute via Branch-Train-Stitch to build a Mixture-of-Experts model from independently trained submodels, with discussion of hardware requirements, participant involvement, and technical challenges.

Could AI training be decentralized like Bitcoin mining? [D]

Reddit r/MachineLearning

A discussion explores whether AI training could be decentralized like Bitcoin mining, with participants contributing GPU resources to train open-source models in exchange for tokens, raising questions about verification, fake gradients, and efficiency.