@neural_avb: New `fast-rlm` update Check this demo where RLM web searches (exa), reviews Goodreads with tools, and recommends books!…
Summary
New `fast-rlm` update introduces REPL Tool Calling, allowing agents to invoke Python functions via REPL with outputs stored in variables. Demo shows web search and Goodreads review integration.
Similar Articles
@micka_design: New component drop: Chat-style message composer with an auto-resizing textarea, flexible left/right action slots. Clean…
Fluid Functionalism released an open-source chat-style message composer component with an auto-resizing textarea and flexible left/right action slots.
@victormustar: Before the week ends, let's acknowledge one of the most INSANE week ever for open AI, with 25+ notable open-weight drop…
A recap of an extraordinary week in open AI, featuring over 25 open-weight model releases across LLMs, image generation, audio/speech, vision, and video/3D, with notable contributions from NVIDIA, Google, and others.
Memory retrieval is broken under the hood.
PrecisionMemBench is an open-source benchmark that tests retrieval precision as a strict unit test, revealing that popular memory frameworks like Mem0, Zep, and Hindsight have very low precision (0.05-0.09) and rely on LLMs to compensate. The article argues for zero-tolerance hard fail on precision for production memory infrastructure.
OpenLumara - A different kind of AI agent, written from scratch, not vibecoded. Extremely token-efficient, super small system prompt, made for local models. Everything is modular.
OpenLumara is a new open-source AI agent framework designed for local models, emphasizing token efficiency, modularity, and security. It is written from scratch with a small system prompt and full control over AI capabilities.
LLM prefomance in Estonian
The Institute of the Estonian Language has released an open benchmark to evaluate LLM performance in Estonian, covering language proficiency, reasoning, factual accuracy, and resistance to propaganda, revealing that models strong on English benchmarks may perform differently in smaller language environments.