New embedding models and API updates

OpenAI Blog 01/25/24, 08:00 AM Models

embedding-models api-update text-embedding-3 performance-improvement pricing-reduction openai

Summary

OpenAI released two new embedding models: text-embedding-3-small (5x cheaper than ada-002 with 40%+ MIRACL improvement) and text-embedding-3-large (best performance with up to 3072 dimensions). Both models show significant performance gains on standard benchmarks while reducing costs.

No content available

Original Article Export to Word Export to PDF

View Cached Full Text

Cached at: 04/20/26, 02:54 PM

# New embedding models and API updates Source: [https://openai.com/index/new-embedding-models-and-api-updates/](https://openai.com/index/new-embedding-models-and-api-updates/) `text\-embedding\-3\-small`is our new highly efficient embedding model and provides a significant upgrade over its predecessor, the`text\-embedding\-ada\-002`model released in[December 2022⁠](https://openai.com/index/new-and-improved-embedding-model/)\. **Stronger performance\.**Comparing`text\-embedding\-ada\-002`to`text\-embedding\-3\-small`, the average score on a commonly used benchmark for multi\-language retrieval $[MIRACL⁠\(opens in a new window$](https://github.com/project-miracl/miracl)\) has increased from 31\.4% to 44\.0%, while the average score on a commonly used benchmark for English tasks $[MTEB⁠\(opens in a new window$](https://github.com/embeddings-benchmark/mteb)\) has increased from 61\.0% to 62\.3%\. **Reduced price\.**`text\-embedding\-3\-small`is also substantially more efficient than our previous generation`text\-embedding\-ada\-002`model\. Pricing for`text\-embedding\-3\-small`has therefore been reduced by 5X compared to`text\-embedding\-ada\-002`, from a price per 1k tokens of $0\.0001 to $0\.00002\. We are not deprecating`text\-embedding\-ada\-002`, so while we recommend the newer model, customers are welcome to continue using the previous generation model\. A new large text embedding model:`text\-embedding\-3\-large` `text\-embedding\-3\-large`is our new next generation larger embedding model and creates embeddings with up to 3072 dimensions\. **Stronger performance\.**`text\-embedding\-3\-large`is our new best performing model\. Comparing`text\-embedding\-ada\-002`to`text\-embedding\-3\-large`: on MIRACL, the average score has increased from 31\.4% to 54\.9%, while on MTEB, the average score has increased from 61\.0% to 64\.6%\.

New embedding models and API updates

Similar Articles

New and improved embedding model

Introducing text and code embeddings

@raphaelsrty: We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vecto…

OpenAI cooked with the new Images 2 Model, the characters can stay extremely consistent, while text is clear and stays the same

Introducing next-generation audio models in the API

Submit Feedback

Similar Articles

New and improved embedding model

Introducing text and code embeddings

@raphaelsrty: We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vecto…

OpenAI cooked with the new Images 2 Model, the characters can stay extremely consistent, while text is clear and stays the same

Introducing next-generation audio models in the API