@scaling01: DeepSeek just made their inference ~5x cheaper at 50 TPS
Summary
DeepSeek has reduced inference costs by approximately 5x while maintaining 50 tokens per second throughput.
View Cached Full Text
Cached at: 06/29/26, 02:26 AM
DeepSeek just made their inference ~5x cheaper at 50 TPS https://t.co/9lYUGsshdb
Lisan al Gaib (@scaling01): how can you not like deepseek
thank you lord wenfeng for continuing to make intelligence too cheap to meter
Similar Articles
@rohanpaul_ai: NVIDIA's newly published report says its Blackwell inference stack cut DeepSeek V4 token costs by up to 5x in one month.
NVIDIA reported that its Blackwell inference stack reduced DeepSeek V4 token costs by up to 5x in one month.
DeepSeek Announces Permanent Price Cut of 75% after Promotion Period
DeepSeek has announced a permanent 75% price reduction following a promotional period, making its AI services significantly cheaper for users.
DeepSeek makes the V4 Pro price discount permanent
DeepSeek has made the 75% discount on V4 Pro API pricing permanent, reducing input/output token costs significantly.
@seclink: Chinese startup DeepSeek announced on Friday that its 75% discount on the DeepSeek-V4-Pro API will become permanent, with prices as low as $0.003625 per million cached input tokens and $0.87 per million output tokens—approximately 34 times cheaper than OpenAI's GPT-5.5. The model has 1.6 trillion...
DeepSeek has made its V4-Pro API price cut of 75% permanent, with per-million cached input tokens at just $0.003625 and output tokens at $0.87, about 34 times cheaper than OpenAI's GPT-5.5. The model has 1.6 trillion parameters but requires only 49 billion active parameters, supports a 1-million-token context, and leads in coding and reasoning benchmark tests.
)
DeepSeek permanently reduced V4 Pro prices by 75%, undercutting leading AI models from OpenAI, Anthropic, and Google, escalating the AI price war.