New and improved embedding model
Summary
OpenAI released text-embedding-ada-002, a unified embedding model that consolidates five previous models into one with superior performance, 4x longer context (8192 tokens), smaller dimensionality (1536), and 99.8% lower pricing than previous Davinci embeddings.
View Cached Full Text
Cached at: 04/20/26, 02:46 PM
Similar Articles
New embedding models and API updates
OpenAI released two new embedding models: text-embedding-3-small (5x cheaper than ada-002 with 40%+ MIRACL improvement) and text-embedding-3-large (best performance with up to 3072 dimensions). Both models show significant performance gains on standard benchmarks while reducing costs.
Introducing text and code embeddings
OpenAI introduces a new embeddings API endpoint that converts text and code into numerical vector representations for semantic search, clustering, and classification tasks. The models achieve state-of-the-art results on standard benchmarks including a 20% relative improvement in code search performance.
@JinaAI_: jina-embeddings-v5-omni is here! Our first universal embedding model for text, images, audio, and video. Available in t…
Jina AI has released jina-embeddings-v5-omni, a universal embedding model supporting text, images, audio, and video with back-compatible indexing capabilities.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
IBM releases Granite Embedding Multilingual R2, a family of open-source multilingual embedding models under Apache 2.0, featuring a compact 97M model that achieves best-in-class sub-100M retrieval quality and a 311M model with Matryoshka embeddings, both supporting 32K context and 200+ languages.
@_philschmid: Gemini Embedding 2 now GA! One embedding model that understand text, images, video, audio, and PDFs! 5 modalities in a …
Google releases Gemini Embedding 2 for general availability, offering a single model that embeds text, images, video, audio, and PDFs into one unified space across 100+ languages without needing audio transcription.