Tag
Update on running a non-quantized DeepSeek-v4-Flash model at 11 tok/s on a single DGX Spark using sglang inference and a custom mega-kernel, progressing towards GLM-5.2.
Fenng shares a self-media comparison between the fourth-generation WeLM-80B (80B total params, 3B activated, 3.75% activation rate) and DeepSeek-V4-Flash (284B total, 13B activated, 4.6% activation rate), with a humorous comment.
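The activation rates quoted in the comparison above follow directly from dividing activated parameters by total parameters. A minimal sketch of that arithmetic, using only the figures given in the summary (the function and variable names are illustrative, not from the original post):

```python
def activation_rate(active_b: float, total_b: float) -> float:
    """Return the percentage of parameters activated per token."""
    return 100.0 * active_b / total_b


# (activated, total) parameter counts in billions, as quoted in the summary.
models = {
    "WeLM-80B": (3, 80),
    "DeepSeek-V4-Flash": (13, 284),
}

rates = {name: activation_rate(a, t) for name, (a, t) in models.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:.2f}%")
```

The second figure comes out slightly below the quoted 4.6%, which is consistent with rounding to one decimal place.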
Maka, a desktop AI workstation, reports a DeepSeek cache hit rate of up to 95% and performs well on long-context tasks. It supports multiple models and tools, and is open source and local-first.
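How the 95% figure was measured is not stated in the post. One plausible way to compute such a rate, assuming it is derived from the `prompt_cache_hit_tokens` and `prompt_cache_miss_tokens` counters that the DeepSeek API returns in its `usage` object, is sketched below. The helper name and the sample numbers are hypothetical and chosen only for illustration:

```python
def cache_hit_rate(usage: dict) -> float:
    """Fraction of prompt tokens served from the context cache.

    Assumes `usage` carries DeepSeek-style cache counters; missing
    counters are treated as zero.
    """
    hit = usage.get("prompt_cache_hit_tokens", 0)
    miss = usage.get("prompt_cache_miss_tokens", 0)
    total = hit + miss
    return hit / total if total else 0.0


# Illustrative numbers only, not taken from the post.
example_usage = {"prompt_cache_hit_tokens": 9500, "prompt_cache_miss_tokens": 500}
rate = cache_hit_rate(example_usage)
print(f"{rate:.0%}")
```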
While running the dim-agent benchmark, the author noticed that DSv4's scores have been consistently improving, hinting at significant progress in model development.
Baidu has open-sourced the Unlimited OCR model, which uses the R-SWA attention mechanism to process hundreds of pages in a single pass without page splitting while keeping the KV cache size constant. The model mimics the attention pattern of a person copying a book by hand and shares technical lineage with DeepSeek OCR, sparking discussion about talent mobility.
User @sheriyuo praises LatePost as the best AI media in the Chinese-language region, criticizes competitors like Xin Zhi Yuan, Quantum Bit, and Heart of Machine, and mentions the depth of reports on DeepSeek and others.
The open-source OCR model Unlimited OCR, based on DeepSeek OCR, achieves 93.23 on OmniDocBench v1.5 with only 3B parameters, outperforming DeepSeek OCR, Gemini 2.5, and others.
Codex can self-configure to integrate third-party models like DeepSeek and Ollama by reading and modifying its config file automatically.
Following the recruitment link shared by Tianyi Cui, DeepSeek is hiring for roles in AI technology, infrastructure, and business, with mentions of compensation.
This article explains in depth why the evaluation framework (harness) matters in AI, analyzes the strategic significance of DeepSeek building its own harness team, and compares the open-source lm-evaluation-harness with an in-house system.
This tutorial explains how to install Codex++ and configure a DeepSeek API key to unlock the full feature set of the Codex AI tool in China, without needing a ChatGPT account or subscription.
An MCP web search service built on the DeepSeek API that provides web search to MCP-compatible clients (such as Claude Code and OpenCode) without relying on third-party search services. It requires only a single DeepSeek API key.
Mentions of the AI models DeepSeek and Kimi, possibly discussing recent updates or comparisons.
WeChat launches the AI agent product 'Xiao Wei'. Its main model is WeLM, some answers fall back to DeepSeek, and grayscale testing has begun.
DeepSeek researcher Deli Chen has open-sourced Deli AutoResearch SKILL, a SKILL.md protocol file that defines the operating rules for long-running autonomous AI research, including state persistence, stagnation detection, and a heartbeat mechanism. It aims to turn autonomous scientific research from a vision into a sustainable engineering closed loop.
DeepSeek reportedly requires investors to promise not to poach its talent as part of its $7.4 billion fundraising round, highlighting the intense competition for AI engineers in China.
In local testing on an AMD Ryzen AI Max+ 395 with 128GB RAM, DeepSeek V4 Flash, a 284B MoE model with 13B active parameters, reaches ~15 TPS. The machine costs $3,000 versus $25,000+ for a datacenter setup, highlighting the feasibility of running large models on consumer hardware.
Codex skills optimized for DeepSeek V4 Pro that save 60-80% of tokens by freezing skill files and minimizing output, with persistent memory across conversations.
DeepSeek announces a new vision capability, likely a vision-language model, expanding its AI offerings.
A detailed configuration guide that teaches users how to connect OpenAI Codex to third-party models like DeepSeek through the open-source proxy tool CC Switch, solving protocol incompatibility issues.