@raphaelsrty: We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vecto…

X AI KOLs Following Models

Summary

Raphael released two open-source retrieval models, LateOn (ColBERT multi-vector) and DenseOn (single-vector), each 149M parameters and outperforming 4× larger models on BEIR.

We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vector): 57.22 NDCG@10 on BEIR. DenseOn (dense, single-vector): 56.20. Both beat models up to 4× larger We're open-sourcing the weights under Apache 2.0
Original Article Export to Word Export to PDF
View Cached Full Text

Cached at: 04/21/26, 05:13 PM

We’re releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vector): 57.22 NDCG@10 on BEIR. DenseOn (dense, single-vector): 56.20. Both beat models up to 4× larger We’re open-sourcing the weights under Apache 2.0

Similar Articles

@lateinteraction: The keynote recording is now on YouTube, for everyone who asked us to host it outside X. https://youtube.com/watch?v=Z2…

X AI KOLs Timeline

A keynote recording argues that late interaction retrieval (e.g., ColBERT-style) is the most promising direction in AI-scale information retrieval research, contending that single-vector dense retrieval is fundamentally flawed and that the IR community must raise its ambitions significantly. The talk introduces the LIMIT benchmark as evidence of dense retrieval's generalization failures and calls for a paradigm shift by 2030.

New embedding models and API updates

OpenAI Blog

OpenAI released two new embedding models: text-embedding-3-small (5x cheaper than ada-002 with 40%+ MIRACL improvement) and text-embedding-3-large (best performance with up to 3072 dimensions). Both models show significant performance gains on standard benchmarks while reducing costs.