@Dorialexander: Well since I keep up with the RL env market: Anthropic really did tons of Slack RL
Summary
A tweet highlights that Anthropic conducted large-scale reinforcement learning using Slack conversations, with Andrej Karpathy emphasizing that it is not a trivial Slack bot feature as commonly misinterpreted.
View Cached Full Text
Cached at: 06/25/26, 12:08 AM
Well since I keep up with the RL env market: Anthropic really did tons of Slack RL
Andrej Karpathy (@karpathy): This is correct, I think a number of people on the tl didn’t read past the title and made inferences and comparisons that are just wrong and then use it as an opportunity to take cheap shots. This is not a “feature” like some crappy Slack bot and it’s certainly not a Claw, though
Similar Articles
@_djdumpling: very exciting work and thrilled to be working on RL this summer at @modal!
A user expresses excitement about working on reinforcement learning at Modal, referencing Modal's announcement of an open-source library and lessons learned for scaling RL training.
@didier_lopes: Incredible how Z. ai literally has their RL infrastructure open source. The entire OPD post-training of GLM-5.2 took on…
Z. ai has open-sourced its RL infrastructure, the slime framework, which enabled efficient OPD post-training of GLM-5.2 in about two days. slime is an LLM post-training framework for RL scaling that integrates Megatron and SGLang, and has been battle-tested by frontier models like GLM, Qwen, DeepSeek, and Llama.
@adithya_s_k: https://x.com/adithya_s_k/status/2054961319179420035
An analysis of why RL for coding tasks is gaining traction due to verifiable rewards, and why the emerging framework Harbor addresses the bottleneck of environment complexity in RL training.
@slime_framework: Modal put it clearly: frontier RL is no longer just about algorithms — it is an infrastructure problem. Happy to see sl…
A tweet highlights that frontier reinforcement learning is now an infrastructure problem, noting the use of the open-source slime library in Modal's RL stack and upstream contributions.
@charles_irl: Proper post-training RL, deployed broadly, is a key step towards a future where software systems quietly improve themse…
Modal announces an open-source library for reinforcement learning on its platform, addressing infrastructure challenges in post-training RL with scalable deployment.