@charles_irl: why use many bytes when few do trick?

X AI KOLs Following 05/30/26, 07:59 PM News

Summary

Nan Jiang of Modal says the team is working to ensure open-source RL frameworks have the techniques needed to train frontier open-weights models. He calls delta compression key and names weight sync, auto-scaling, and cross-cluster training as remaining open problems.

Original Article

Cached at: 06/01/26, 03:07 AM

Nan Jiang (@nanjiangwill): At @modal, we’re working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models.

Delta compression is key, but the job’s not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training.
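
The post does not describe how the delta compression works, so the following is only a minimal sketch of the general idea, not Modal's method: when syncing weights from a trainer to inference workers, send only the entries that changed since the last sync rather than the full tensor. The function names `encode_delta` and `apply_delta`, the flat list representation, and the exact-equality change test are all assumptions made for illustration.

```python
from typing import List, Tuple

# A delta is a list of (index, new_value) pairs. Hypothetical format,
# chosen for illustration only.
Delta = List[Tuple[int, float]]


def encode_delta(old: List[float], new: List[float]) -> Delta:
    """Return (index, new_value) pairs for every entry that changed."""
    if len(old) != len(new):
        raise ValueError("weight vectors must have the same length")
    return [(i, n) for i, (o, n) in enumerate(zip(old, new)) if o != n]


def apply_delta(old: List[float], delta: Delta) -> List[float]:
    """Rebuild the new weights on the receiver from old weights plus a delta."""
    patched = list(old)  # copy so the caller's list is not mutated
    for index, value in delta:
        patched[index] = value
    return patched


# Toy example: only one of six weights changed between syncs.
old_weights = [0.10, -0.25, 0.00, 0.75, 0.50, -1.00]
new_weights = [0.10, -0.25, 0.02, 0.75, 0.50, -1.00]

delta = encode_delta(old_weights, new_weights)
synced = apply_delta(old_weights, delta)

print(f"entries sent: {len(delta)} of {len(new_weights)}")
```

The saving depends entirely on how sparse the update is: if most entries change between syncs, the index overhead makes this format larger than sending the full weights.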
