@Dorialexander: Well since I keep up with the RL env market: Anthropic really did tons of Slack RL

X AI KOLs Following 06/24/26, 07:01 PM News

Summary

A tweet highlights that Anthropic conducted large-scale reinforcement learning using Slack conversations, with Andrej Karpathy emphasizing that it is not a trivial Slack bot feature as commonly misinterpreted.

Well since I keep up with the RL env market: Anthropic really did tons of Slack RL

Original Article

View Cached Full Text

Cached at: 06/25/26, 12:08 AM

Well since I keep up with the RL env market: Anthropic really did tons of Slack RL

Andrej Karpathy (@karpathy): This is correct, I think a number of people on the tl didn’t read past the title and made inferences and comparisons that are just wrong and then use it as an opportunity to take cheap shots. This is not a “feature” like some crappy Slack bot and it’s certainly not a Claw, though

Similar Articles

@_djdumpling: very exciting work and thrilled to be working on RL this summer at @modal!

X AI KOLs Timeline

A user expresses excitement about working on reinforcement learning at Modal, referencing Modal's announcement of an open-source library and lessons learned for scaling RL training.

@didier_lopes: Incredible how Z. ai literally has their RL infrastructure open source. The entire OPD post-training of GLM-5.2 took on…

X AI KOLs Following

Z. ai has open-sourced its RL infrastructure, the slime framework, which enabled efficient OPD post-training of GLM-5.2 in about two days. slime is an LLM post-training framework for RL scaling that integrates Megatron and SGLang, and has been battle-tested by frontier models like GLM, Qwen, DeepSeek, and Llama.

@adithya_s_k: https://x.com/adithya_s_k/status/2054961319179420035

X AI KOLs Timeline

An analysis of why RL for coding tasks is gaining traction due to verifiable rewards, and why the emerging framework Harbor addresses the bottleneck of environment complexity in RL training.

@slime_framework: Modal put it clearly: frontier RL is no longer just about algorithms — it is an infrastructure problem. Happy to see sl…

X AI KOLs Following

A tweet highlights that frontier reinforcement learning is now an infrastructure problem, noting the use of the open-source slime library in Modal's RL stack and upstream contributions.

@charles_irl: Proper post-training RL, deployed broadly, is a key step towards a future where software systems quietly improve themse…

X AI KOLs Following

Modal announces an open-source library for reinforcement learning on its platform, addressing infrastructure challenges in post-training RL with scalable deployment.

Similar Articles

@_djdumpling: very exciting work and thrilled to be working on RL this summer at @modal!

@didier_lopes: Incredible how Z. ai literally has their RL infrastructure open source. The entire OPD post-training of GLM-5.2 took on…

@adithya_s_k: https://x.com/adithya_s_k/status/2054961319179420035

@slime_framework: Modal put it clearly: frontier RL is no longer just about algorithms — it is an infrastructure problem. Happy to see sl…

@charles_irl: Proper post-training RL, deployed broadly, is a key step towards a future where software systems quietly improve themse…

Submit Feedback