rlm

Tag

Cards List
#rlm

@diblacksmith: [OSS RELEASE] This is my story of how I've been using RLMs at work. Since its launch (Jan26), I started using it for da…

X AI KOLs Following · 2026-06-27 Cached

The author shares his experience using RLMs for daily tasks like coding, processing multi-million-token logs, and browser automation, and releases it as an open-source Python package installable via pip.

0 favorites 0 likes
#rlm

Show HN: RLM-based local debugger for AI agent traces

Hacker News Top · 2026-06-23 Cached

HALO is an open-source desktop app that uses reinforcement learning from model-based (RLM) techniques to debug and optimize AI agent traces locally, providing analysis and actionable recommendations.

0 favorites 0 likes
#rlm

@dosco: use perplexity, parallel, google, x search whatever and build this in 5 minutes using DSPy+RLM (ax-agent) http://axllm.…

X AI KOLs Timeline · 2026-06-02 Cached

Ax is an open-source TypeScript library that implements DSPy-style typed signatures and agent frameworks for building reliable AI applications with minimal prompting. It supports multiple LLM providers and includes features like agents, flows, RAG, and self-improving pipelines.

0 favorites 0 likes
#rlm

@neural_avb: RLMs can now access MCP servers with `fast-rlm` - Connect any MCP via stdio or http - RLM accesses all MCP tools, resou…

X AI KOLs Timeline · 2026-06-01 Cached

fast-rlm enables reinforcement learning models to access MCP servers via stdio or HTTP, allowing tool use and resource fetching with results saved as Python variables in the REPL to save input tokens.

0 favorites 0 likes
#rlm

@tech_optimist: Absolutely amazing work combining RLMs and GEPA. Looking forward to part 2!

X AI KOLs Following · 2026-05-30

A tweet praising the combination of RLMs and GEPA, expressing anticipation for a follow-up.

0 favorites 0 likes
#rlm

@neural_avb: New `fast-rlm` update Check this demo where RLM web searches (exa), reviews Goodreads with tools, and recommends books!…

X AI KOLs Timeline · 2026-05-21

New `fast-rlm` update introduces REPL Tool Calling, allowing agents to invoke Python functions via REPL with outputs stored in variables. Demo shows web search and Goodreads review integration.

0 favorites 0 likes
#rlm

Reinforcing Recursive Language Models (18 minute read)

TLDR AI · 2026-05-13 Cached

The article explores reinforcement learning fine-tuning of small (4B) recursive language models (RLMs) to perform evidence selection from scientific documents, showing that RL-trained 4B models match Claude Sonnet 4.6 performance at a fraction of the size and cost.

0 favorites 0 likes
#rlm

@a1zhang: RLM arXiv paper update: depth>1 results, more comparisons, more training, and more error analysis! We add depth=2/3 exp…

X AI KOLs Following · 2026-05-12

This update to the RLM arXiv paper adds depth>1 experiments with recursive RLM calls, showing significant performance gains on OOLONG-Pairs and other benchmarks, along with new comparisons to OpenCode and Claude Code, additional training results on MRCRv2, and an expanded error analysis.

0 favorites 0 likes
#rlm

@isaac_flath: RLM means notebooks are gonna be back (I hope). Agent driving a REPL with interleaved prose. The exact backend the nb i…

X AI KOLs Following · 2026-04-21 Cached

Isaac Flath predicts RLM will revive notebooks by enabling agents to drive REPLs with interleaved prose.

0 favorites 0 likes
#rlm

@dosco: very cool writeup on applying RLM and DSPy to multi-modal data. this bit really got me thinking...

X AI KOLs Following · 2026-04-20 Cached

A social media post highlighting a writeup on applying RLM and DSPy to multi-modal data.

0 favorites 0 likes
#rlm

@sumeetrm: LongCoT is adding two new leaderboards! Due to the interest in agents (particularly RLMs), we’re adding a “Restricted H…

X AI KOLs Following · 2026-04-19 Cached

LongCoT introduces two new agent leaderboards (Restricted & Open Harness), with GPT 5.2 RLM topping the Open Harness at 25.12%.

0 favorites 0 likes
#rlm

@ekzhu: I read the RLM paper and it’s like, this is the simplest way to solve a general problem, seriously it’s just this simple.

X AI KOLs Timeline · 2026-04-19 Cached

A researcher comments on the simplicity and elegance of the RLM paper, comparing it to the influential ReAct paper and expressing appreciation for its straightforward approach to solving general problems.

0 favorites 0 likes
#rlm

@samhogan: RLMs pretty much solved context btw You can shove tens of millions of tokens into a good RLM harness and it just works.…

X AI KOLs Following · 2026-04-18 Cached

A developer shares their experience with Recurrent Language Models (RLMs), claiming they effectively handle extremely long context windows with tens of millions of tokens, representing a significant advancement in context handling capabilities.

0 favorites 0 likes
← Back to home

Submit Feedback