@charles_irl: why use many bytes when few do trick?


Summary

Nan Jiang of Modal says the team is working to ensure open-source RL frameworks have all the techniques needed to train frontier open-weights models. Jiang calls delta compression key and names weight sync, auto-scaling, and cross-cluster training as remaining open problems.


Cached at: 06/01/26, 03:07 AM

why use many bytes when few do trick?

Nan Jiang (@nanjiangwill): At @modal, we’re working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models.

Delta compression is key, but the job’s not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training.
