llm-development

#llm-development

prompt caching, but for rl training - 7.5x speedup on long-prompt/short-response workloads

Reddit r/LocalLLaMA ↗ · 2026-05-11

A new optimization technique for open-source RL training engines introduces prompt caching during training, achieving up to 7.5x speedup on long-prompt, short-response workloads by reducing redundant compute.

0 favorites 0 likes

#llm-development

@bibryam: Claude Cookbook is worth bookmarking. 81 practical guides across 15 categories, covering agents, tools, RAG, evals, mul…

X AI KOLs Timeline ↗ · 2026-05-11 Cached

Anthropic has published the Claude Cookbook, a curated collection of 81 practical developer guides spanning AI agents, RAG, evaluations, multimodal apps, and production workflows. The resource offers actionable code examples and best practices for building and deploying applications with Claude.

0 favorites 0 likes

#llm-development

@0xLogicrw: Shunyu Yao, former Anthropic research scientist and current Google DeepMind research scientist, first revealed the internal R&D process of Claude 3.7 on @zhang_benita's podcast "Language is World". He joined Anthro…

X AI KOLs Timeline ↗ · 2026-05-11

Former Anthropic scientist Shunyu Yao revealed details on the R&D of Claude 3.7 in a podcast, along with Anthropic's strategic shift to heavily bet on coding capabilities, and compared the differences in decision-making structures between Anthropic and OpenAI.

0 favorites 0 likes

#llm-development

Notes from inside China's AI labs (18 minute read)

TLDR AI ↗ · 2026-05-08 Cached

The author reflects on a visit to China's AI labs, comparing cultural differences between Chinese and American labs in building LLMs. Chinese labs benefit from a culture of collective work and student involvement, while American labs face challenges from individual ego and career ambitions.

0 favorites 0 likes

llm-development

prompt caching, but for rl training - 7.5x speedup on long-prompt/short-response workloads

@bibryam: Claude Cookbook is worth bookmarking. 81 practical guides across 15 categories, covering agents, tools, RAG, evals, mul…

@0xLogicrw: Shunyu Yao, former Anthropic research scientist and current Google DeepMind research scientist, first revealed the internal R&D process of Claude 3.7 on @zhang_benita's podcast "Language is World". He joined Anthro…

Notes from inside China's AI labs (18 minute read)

Submit Feedback