@maximelabonne: LFM2.5-ColBERT-350M is a surprisingly reliable smart tool selector. We gave it 151 tools, and it consistently surfaces …
Summary
LFM2.5-ColBERT-350M is a model that reliably selects the most relevant tools from a set of 151, saving tokens and improving accuracy, ideal for agentic edge models.
View Cached Full Text
Cached at: 06/18/26, 04:08 PM
LFM2.5-ColBERT-350M is a surprisingly reliable smart tool selector.
We gave it 151 tools, and it consistently surfaces the 5 most relevant ones based on the user prompt.
This saves tokens and improves accuracy. Ideal for hmmmm agentic edge models? 👀 https://t.co/IPyizctesU
Similar Articles
@liquidai: Storing too many tools in your context window increases latency and can lead to wrong tool selection. In this demo, we …
Liquid AI demonstrates using LFM2.5-ColBERT-350M as a filter to select only the five most relevant tools from 151 options, reducing latency and improving tool selection accuracy.
LiquidAI/LFM2.5-ColBERT-350M
LiquidAI releases LFM2.5-ColBERT-350M, a late-interaction multilingual retrieval model, along with a dense bi-encoder variant, both built on LFM2.5-350M-Base, supporting 11 languages and designed as drop-in replacements for RAG pipelines.
@liquidai: Introducing LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M: two multilingual retrieval models built for ultra-fast and a…
Liquid AI introduces LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M, two multilingual retrieval models optimized for fast and accurate search across 11 languages, with latency as low as 1.5ms.
@antoine_chaffin: Reason-ModernColBERT nearly solved BrowseComp-Plus, smashing SOTA and outperforming models models 54× bigger Not bad fo…
Reason-ModernColBERT achieves near-perfect results on BrowseComp-Plus, surpassing SOTA and models 54× larger, then Agent-ModernColBERT further improves with minimal training.
@LightOnIO: Reason-ModernColBERT topped BrowseComp-Plus with just 149M parameters. Now, Agent-ModernColBERT adds ~10% on top. Reach…
LightOn released Agent-ModernColBERT, a 149M parameter open-source retrieval model that achieves performance comparable to GPT-5 combined with Qwen3-Embed-8B by integrating agent reasoning traces into queries.