New embedding models and API updates

OpenAI Blog Models

Summary

OpenAI released two new embedding models: text-embedding-3-small (5x cheaper than ada-002 with 40%+ MIRACL improvement) and text-embedding-3-large (best performance with up to 3072 dimensions). Both models show significant performance gains on standard benchmarks while reducing costs.

No content available
Original Article Export to Word Export to PDF
View Cached Full Text

Cached at: 04/20/26, 02:54 PM

# New embedding models and API updates Source: [https://openai.com/index/new-embedding-models-and-api-updates/](https://openai.com/index/new-embedding-models-and-api-updates/) `text\-embedding\-3\-small`is our new highly efficient embedding model and provides a significant upgrade over its predecessor, the`text\-embedding\-ada\-002`model released in[December 2022⁠](https://openai.com/index/new-and-improved-embedding-model/)\. **Stronger performance\.**Comparing`text\-embedding\-ada\-002`to`text\-embedding\-3\-small`, the average score on a commonly used benchmark for multi\-language retrieval \([MIRACL⁠\(opens in a new window\)](https://github.com/project-miracl/miracl)\) has increased from 31\.4% to 44\.0%, while the average score on a commonly used benchmark for English tasks \([MTEB⁠\(opens in a new window\)](https://github.com/embeddings-benchmark/mteb)\) has increased from 61\.0% to 62\.3%\. **Reduced price\.**`text\-embedding\-3\-small`is also substantially more efficient than our previous generation`text\-embedding\-ada\-002`model\. Pricing for`text\-embedding\-3\-small`has therefore been reduced by 5X compared to`text\-embedding\-ada\-002`, from a price per 1k tokens of $0\.0001 to $0\.00002\. We are not deprecating`text\-embedding\-ada\-002`, so while we recommend the newer model, customers are welcome to continue using the previous generation model\. A new large text embedding model:`text\-embedding\-3\-large` `text\-embedding\-3\-large`is our new next generation larger embedding model and creates embeddings with up to 3072 dimensions\. **Stronger performance\.**`text\-embedding\-3\-large`is our new best performing model\. Comparing`text\-embedding\-ada\-002`to`text\-embedding\-3\-large`: on MIRACL, the average score has increased from 31\.4% to 54\.9%, while on MTEB, the average score has increased from 61\.0% to 64\.6%\.

Similar Articles

New and improved embedding model

OpenAI Blog

OpenAI released text-embedding-ada-002, a unified embedding model that consolidates five previous models into one with superior performance, 4x longer context (8192 tokens), smaller dimensionality (1536), and 99.8% lower pricing than previous Davinci embeddings.

Introducing text and code embeddings

OpenAI Blog

OpenAI introduces a new embeddings API endpoint that converts text and code into numerical vector representations for semantic search, clustering, and classification tasks. The models achieve state-of-the-art results on standard benchmarks including a 20% relative improvement in code search performance.

Introducing next-generation audio models in the API

OpenAI Blog

OpenAI introduced next-generation audio models for the API, including improved speech-to-text (gpt-4o-transcribe, gpt-4o-mini-transcribe) and customizable text-to-speech models that enable developers to build more intelligent and expressive voice agents with enhanced accuracy across challenging scenarios.