New and improved embedding model

OpenAI Blog 12/15/22, 08:00 AM Models

Summary

OpenAI released text-embedding-ada-002, a unified embedding model that consolidates five previous models into one with superior performance, 4x longer context (8192 tokens), smaller dimensionality (1536), and 99.8% lower pricing than previous Davinci embeddings.

We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.

Original Article

Cached at: 04/20/26, 02:46 PM

# New and improved embedding model

Source: [https://openai.com/index/new-and-improved-embedding-model/](https://openai.com/index/new-and-improved-embedding-model/)

The new model, `text-embedding-ada-002`, replaces five separate models for text search, text similarity, and code search, and outperforms our previous most capable model, Davinci, at most tasks, while being priced 99.8% lower.

**Unification of capabilities.** We have significantly simplified the interface of the [/embeddings](https://beta.openai.com/docs/api-reference/embeddings) endpoint by merging the five separate models (`text-similarity`, `text-search-query`, `text-search-doc`, `code-search-text`, and `code-search-code`) into a single new model. This single representation performs better than our previous embedding models across a diverse set of text search, sentence similarity, and code search benchmarks.

**Longer context.** The context length of the new model is increased by a factor of four, from 2048 to 8192 tokens, making it more convenient to work with long documents.

**Smaller embedding size.** The new embeddings have only 1536 dimensions, one-eighth the size of `davinci-001` embeddings, making the new embeddings more cost effective to store and query in vector databases.

**Reduced price.** We have reduced the price of new embedding models by 90% compared to old models of the same size. The new model achieves performance better than or similar to that of the old Davinci models at a 99.8% lower price.

Overall, the new embedding model is a much more powerful tool for natural language processing and code tasks. We are excited to see how our customers will use it to create even more capable applications in their respective fields.
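As a rough illustration that is not part of the original announcement, the Python sketch below shows what the single-model interface and the smaller embedding size could mean in practice. It builds, without sending, a request for `text-embedding-ada-002`, compares two vectors with cosine similarity, and estimates raw vector storage. The endpoint URL, the request body shape, the helper names, and the 4-bytes-per-value (float32) storage figure are assumptions made for the example; check the current API reference before relying on them. The 12288-dimension figure is derived only from the "one-eighth" statement above.

```python
import json
import math
import urllib.request

MODEL = "text-embedding-ada-002"
# Assumed endpoint URL, used here only to construct a request object.
EMBEDDINGS_URL = "https://api.openai.com/v1/embeddings"

ADA_002_DIMS = 1536
# Derived from "one-eighth the size of davinci-001 embeddings".
DAVINCI_001_DIMS = ADA_002_DIMS * 8


def build_embedding_request(text, api_key):
    """Build (but do not send) a POST request for one input string.

    The same model name is used whether the text is a query, a document,
    or code, which is the point of the unified model.
    """
    payload = json.dumps({"model": MODEL, "input": text}).encode("utf-8")
    return urllib.request.Request(
        EMBEDDINGS_URL,
        data=payload,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def storage_bytes(num_vectors, dims, bytes_per_value=4):
    """Raw storage for the vectors alone, assuming float32 values."""
    return num_vectors * dims * bytes_per_value


if __name__ == "__main__":
    req = build_embedding_request("How do I reset my password?", "YOUR_API_KEY")
    print(req.get_method(), req.full_url)
    print("ada-002, 1M vectors:", storage_bytes(1_000_000, ADA_002_DIMS), "bytes")
    print("davinci-001, 1M vectors:", storage_bytes(1_000_000, DAVINCI_001_DIMS), "bytes")
```

Under these assumptions, one million 1536-dimension vectors take roughly 6.1 GB of raw storage, against roughly 49 GB at eight times the dimensionality, before any index overhead a particular vector database adds.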
