DeepSeek makes the V4 Pro price discount permanent

Hacker News Top Products

Summary

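DeepSeek has made the 75% discount on V4 Pro API pricing permanent: after the promotion ends on 2026/05/31 15:59 UTC, the official price becomes 1/4 of the original, i.e. $0.435 per 1M input tokens (cache miss) and $0.87 per 1M output tokens.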

> (3) The deepseek-v4-pro model API pricing will be officially adjusted to 1/4 of the original price after the 75% discount promotion ends on 2026/05/31 15:59 UTC.

[https://x.com/deepseek_ai/status/2057854261699195173](https://x.com/deepseek_ai/status/2057854261699195173)

Cached at: 05/22/26, 06:25 PM

# Models & Pricing | DeepSeek API Docs

Source: [https://api-docs.deepseek.com/quick_start/pricing](https://api-docs.deepseek.com/quick_start/pricing)

The prices listed below are per 1M tokens. A token, the smallest unit of text that the model recognizes, can be a word, a number, or even a punctuation mark. We will bill based on the total number of input and output tokens by the model.

---

## Model Details

| MODEL | deepseek-v4-flash (1) | deepseek-v4-pro |
| --- | --- | --- |
| BASE URL (OpenAI Format) | [https://api.deepseek.com](https://api.deepseek.com/) | [https://api.deepseek.com](https://api.deepseek.com/) |
| BASE URL (Anthropic Format) | [https://api.deepseek.com/anthropic](https://api.deepseek.com/anthropic) | [https://api.deepseek.com/anthropic](https://api.deepseek.com/anthropic) |
| MODEL VERSION | DeepSeek-V4-Flash | DeepSeek-V4-Pro |
| THINKING MODE | Supports both non-thinking and thinking (default) modes. See [Thinking Mode](https://api-docs.deepseek.com/guides/thinking_mode) for how to switch | Same as deepseek-v4-flash |
| CONTEXT LENGTH | 1M | 1M |
| MAX OUTPUT | MAXIMUM: 384K | MAXIMUM: 384K |
| FEATURES: [Json Output](https://api-docs.deepseek.com/guides/json_mode) | ✓ | ✓ |
| FEATURES: [Tool Calls](https://api-docs.deepseek.com/guides/tool_calls) | ✓ | ✓ |
| FEATURES: [Chat Prefix Completion (Beta)](https://api-docs.deepseek.com/guides/chat_prefix_completion) | ✓ | ✓ |
| FEATURES: [FIM Completion (Beta)](https://api-docs.deepseek.com/guides/fim_completion) | Non-thinking mode only | Non-thinking mode only |
| PRICING: 1M INPUT TOKENS (CACHE HIT) (2) | $0.0028 | $0.003625 (75% off (3); original $0.0145) |
| PRICING: 1M INPUT TOKENS (CACHE MISS) | $0.14 | $0.435 (75% off (3); original $1.74) |
| PRICING: 1M OUTPUT TOKENS | $0.28 | $0.87 (75% off (3); original $3.48) |
| Concurrency Limit (4) | 2500 | 500 |

(1) The model names `deepseek-chat` and `deepseek-reasoner` will be deprecated in the future. For compatibility, they correspond to the non-thinking mode and thinking mode of `deepseek-v4-flash`, respectively.

(2) For all models, the input cache hit price has been reduced to 1/10 of the launch price. This price adjustment takes effect from 2026/4/26 12:15 UTC.

(3) The deepseek-v4-pro model API pricing will be officially adjusted to 1/4 of the original price after the 75% discount promotion ends on 2026/05/31 15:59 UTC.

(4) For more details on concurrency limits, please refer to [Rate Limit & Isolation](https://api-docs.deepseek.com/quick_start/rate_limit).

---

## Deduction Rules

The expense = number of tokens × price. The corresponding fees will be directly deducted from your topped-up balance or granted balance, with a preference for using the granted balance first when both balances are available.

Product prices may vary and DeepSeek reserves the right to adjust them. We recommend topping up based on your actual usage and regularly checking this page for the most recent pricing information.
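The deduction rule above can be illustrated with a minimal sketch. The per-1M-token prices are copied from the pricing table (using the discounted deepseek-v4-pro rates); the function names `request_cost` and `deduct` are hypothetical and are not part of any DeepSeek SDK. The sketch also assumes the caller already knows how many input tokens were cache hits versus cache misses, and it raises an error on insufficient balance, which is an assumption since the docs do not describe that case.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table (discounted V4 Pro rates).
PRICES = {
    "deepseek-v4-flash": {
        "cache_hit": Decimal("0.0028"),
        "cache_miss": Decimal("0.14"),
        "output": Decimal("0.28"),
    },
    "deepseek-v4-pro": {
        "cache_hit": Decimal("0.003625"),
        "cache_miss": Decimal("0.435"),
        "output": Decimal("0.87"),
    },
}
TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def request_cost(model, cache_hit_tokens, cache_miss_tokens, output_tokens):
    """Expense = number of tokens x price, summed over the three token kinds."""
    p = PRICES[model]
    total = (
        cache_hit_tokens * p["cache_hit"]
        + cache_miss_tokens * p["cache_miss"]
        + output_tokens * p["output"]
    )
    return total / TOKENS_PER_UNIT


def deduct(cost, granted, topped_up):
    """Spend the granted balance first, then the topped-up balance.

    Returns the remaining (granted, topped_up) balances.
    Raising on insufficient funds is an assumption of this sketch.
    """
    from_granted = min(cost, granted)
    from_topped_up = cost - from_granted
    if from_topped_up > topped_up:
        raise ValueError("insufficient balance")
    return granted - from_granted, topped_up - from_topped_up


if __name__ == "__main__":
    # 1M uncached input tokens plus 1M output tokens on V4 Pro.
    cost = request_cost("deepseek-v4-pro", 0, 1_000_000, 1_000_000)
    print(cost)
    print(deduct(cost, Decimal("1.00"), Decimal("5.00")))
```

`Decimal` is used instead of floats so that small per-token prices such as $0.003625 do not pick up binary rounding error when summed.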

Similar Articles


TLDR AI

DeepSeek permanently reduced V4 Pro prices by 75%, undercutting leading AI models from OpenAI, Anthropic, and Google, escalating the AI price war.