@raphaelsrty: At 140 million parameters, our LateOn model yield strong results Unrelated to LateOn, I'm really excited by what's happ…
Summary
The LateOn model with 140M parameters achieves strong results, and the community is excited about advances in multi-vector models including new CPU indexes and multilingual support.
View Cached Full Text
Cached at: 05/30/26, 10:27 PM
At 140 million parameters, our LateOn model yield strong results 😉
Unrelated to LateOn, I’m really excited by what’s happenning with multi-vector models right now
- New kind of indexes running on cpu
- New multilingual models
- Anisotropie being solved
- Sparse multi-vector
Omar Khattab (@lateinteraction): 20M downloads / month is a new record for colbertv2
but people should probably migrate from this ancient October 2021 model to the LateOn colbert model from @raphaelsrty @antoine_chaffin et al (@LightOnIO)
Similar Articles
@raphaelsrty: We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vecto…
Raphael released two open-source retrieval models, LateOn (ColBERT multi-vector) and DenseOn (single-vector), each 149M parameters and outperforming 4× larger models on BEIR.
@antoine_chaffin: The new generation of open state-of-the-art single and multi-vector retrieval models is here It's time, DenseOn with th…
LightOn releases DenseOn and LateOn, a new generation of open state-of-the-art single and multi-vector retrieval models that outperform existing ones.
@antoine_chaffin: It’s only BEIR but there are almost 10 points gap between v2 and LateOn We also have good evidence that the model gener…
LateOn, a new generation ColBERT model, achieves a nearly 10-point improvement over v2 on BEIR and generalizes well outside BEIR, with the same usage in PyLate.
@KrzakalaF: LightOn getting GPT-5-level Deep Research retrieval performance with a 150M-parameter late-interaction model is honestl…
LightOn achieves GPT-5-level deep research retrieval performance using a 150M-parameter late-interaction model, a remarkable feat.
@SilvioMartinico: The late-interaction multivector retrieval ecosystem is exploding right now. To help separate the signal from the noise…
A curated list of top models, engines, libraries, and datasets for late-interaction multivector retrieval, organized in an 'Awesome Multivector Retrieval' resource.