Open weights are not enough: we need open training frameworks for research and better algorithms [P]
Summary
A call for open training frameworks in AI research, introducing FeynRL, a modular and explicit framework for RL post-training of LLMs, VLMs, and agents, designed to make training processes visible and modifiable.
Similar Articles
@charles_irl: Proper post-training RL, deployed broadly, is a key step towards a future where software systems quietly improve themse…
Modal announces an open-source library for reinforcement learning on its platform, addressing infrastructure challenges in post-training RL with scalable deployment.
Open weights are quietly closing up - and that's a problem
The article argues that the trend of open-weights AI models becoming more restrictive poses a threat to market competition, as these models currently provide essential price discipline and privacy options against frontier closed-model providers.
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
Hugging Face publishes a comprehensive analysis of 16 open-source reinforcement learning libraries, examining architectural patterns for asynchronous RL training and presenting design lessons for TRL's async trainer to address generation bottlenecks and weight synchronization challenges.
@TheTuringPost: 10 open-source tools for the Agent RL stack ↓ OpenPipe ART verl-agent Agent Lightning Unsloth OpenRLHF SkyRL NVIDIA’s P…
A curated roundup of 10 open-source tools for training AI agents using reinforcement learning, covering frameworks like OpenPipe ART, verl-agent, Agent Lightning, and Unsloth, with details on their use cases and strengths.
@nanjiangwill: At @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weight…
Modal is enhancing OSS RL frameworks with delta compression and other techniques for training frontier open-weight models. The slime framework brings lossless delta sync to disaggregated training setups.