Tag
DeepSeek V4 Pro, a 1.6-trillion-parameter model, is running via SSD streaming on a 128GB M5 Max MacBook, demonstrating local inference of a massive model.
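A rough back-of-the-envelope sketch of why SSD streaming is needed here. The bit widths below are illustrative assumptions, not details from the report, and the helper name `weight_footprint_gb` is hypothetical. It only estimates weight storage and ignores the KV cache, activations, and runtime overhead.

```python
# Sketch: compare approximate weight storage against available RAM.
# Assumption: the quantization levels (16/8/4 bits) are illustrative only;
# the report does not say which precision was actually used.

TOTAL_PARAMS = 1.6e12   # 1.6 trillion parameters (from the summary)
RAM_GB = 128            # MacBook memory (from the summary)


def weight_footprint_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for bits in (16, 8, 4):
        size_gb = weight_footprint_gb(TOTAL_PARAMS, bits)
        print(f"{bits}-bit weights: {size_gb:,.0f} GB, "
              f"{size_gb / RAM_GB:.1f}x the available RAM")
```

Under any of these assumed precisions the full weight set is several times larger than 128GB, which is consistent with the weights being streamed from SSD rather than held in memory.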
DeepSeek has made its 75% price cut on the V4-Pro API permanent: cached input tokens now cost just $0.003625 per million and output tokens $0.87 per million, about 34 times cheaper than OpenAI's GPT-5.5. The model has 1.6 trillion total parameters but activates only 49 billion of them, supports a 1-million-token context, and leads on coding and reasoning benchmarks.
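To make the pricing figures above concrete, here is a minimal sketch of the arithmetic. It assumes the quoted rates are the post-discount prices and that the 75% cut applies uniformly to both rates. The function names are hypothetical, and the price for uncached input tokens is not given in the summary, so it is left out.

```python
# Sketch: per-request cost and implied pre-discount price from the quoted rates.
# Assumptions: the quoted rates are already discounted, and the 75% cut
# applies equally to cached-input and output pricing.

CACHED_INPUT_USD_PER_M = 0.003625  # USD per million cached input tokens
OUTPUT_USD_PER_M = 0.87            # USD per million output tokens
DISCOUNT = 0.75                    # 75% price cut


def request_cost_usd(cached_input_tokens: int, output_tokens: int) -> float:
    """Cost of one request using only cached input and output tokens."""
    total = (cached_input_tokens * CACHED_INPUT_USD_PER_M
             + output_tokens * OUTPUT_USD_PER_M)
    return total / 1_000_000


def implied_list_price(discounted_price: float,
                       discount: float = DISCOUNT) -> float:
    """Price before the discount, given the price after it."""
    return discounted_price / (1.0 - discount)


if __name__ == "__main__":
    cost = request_cost_usd(cached_input_tokens=200_000, output_tokens=4_000)
    print(f"Example request cost: ${cost:.6f}")
    print(f"Implied pre-discount output price: "
          f"${implied_list_price(OUTPUT_USD_PER_M):.2f} per million tokens")
```

Under these assumptions, a 75% cut means the pre-discount price is four times the quoted one, so $0.87 per million output tokens would correspond to roughly $3.48 before the discount.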
DeepSeek makes the discount for DeepSeek-V4-Pro permanent, extending it until May 31, 2026.
DeepSeek has made the 75% discount on V4 Pro API pricing permanent, reducing input/output token costs significantly.