@samhogan: RLMs pretty much solved context btw You can shove tens of millions of tokens into a good RLM harness and it just works.…

X AI KOLs Following 04/18/26, 03:27 PM News

Summary

A developer shares their experience with Recurrent Language Models (RLMs), claiming they effectively handle extremely long context windows with tens of millions of tokens, representing a significant advancement in context handling capabilities.

RLMs pretty much solved context btw You can shove tens of millions of tokens into a good RLM harness and it just works. I’m spending all my free time here.

Original Article

View Cached Full Text

Cached at: 04/20/26, 09:39 AM

RLMs pretty much solved context btw You can shove tens of millions of tokens into a good RLM harness and it just works. I’m spending all my free time here.

Similar Articles

alexzhang13/rlm

GitHub Trending (daily)

Recursive Language Models (RLMs) introduce a task-agnostic inference paradigm enabling language models to handle near-infinite contexts by recursively calling themselves over input, with an accompanying open-source inference engine and training environment.

@ickma2311: Efficient AI Lecture 15: Long-Context LLM Long context is not just a bigger prompt window. The key question is: which p…

X AI KOLs Timeline

This post summarizes Efficient AI Lecture 15 on long-context LLMs, covering RoPE position interpolation for context extension, the needle-in-haystack evaluation, and StreamingLLM's attention sink phenomenon and KV cache eviction strategy.

@Pavel_Izmailov: New paper: Latent Context Language Models (LCLMs)! Idea: encode 16 tokens as 1 latent token, and have the LLM work on t…

X AI KOLs Timeline

Introduces Latent Context Language Models (LCLMs), which encode 16 tokens as 1 latent token to improve performance, speed, and memory usage.

Can Language Models Actually Retrieve In-Context? Drowning in Documents at Million Token Scale

arXiv cs.CL

This paper systematically studies in-context retrieval at million-token scale, introducing BlockSearch, a 0.6B LM retriever, and analyzing attention dilution. The model matches or outperforms dense retrieval on benchmarks like MS MARCO and NQ, and significantly outperforms on tasks requiring different similarity notions, highlighting the potential of in-context retrieval while emphasizing attention control under extreme context growth.

@TDataScience: Follow along @neural_avb's all-in-one deep dive to learn "what recursive language models (RLMs) are, why they are winni…

X AI KOLs Following

An educational deep dive into recursive language models (RLMs), explaining what they are, why they are winning long-context benchmarks, and how they differ from existing agentic harness designs like ReAct or CodeAct, using a simple case study.