Tag
An analysis of political leanings in six major AI models, showing that 4 out of 6 lean left of center on the economic axis, with some models being unaware of their own bias.
This article evaluates the performance of running GLM 5.2 (IQ2M quantized version) on Dual Strix Halo (256GB VRAM). The generation speed is only about 7 tokens/s, and coding tasks take twice as long as DeepSeek V4 Flash. Its cost-performance ratio is far inferior to other models, so it is not recommended for use with this hardware configuration.
DeepSeek Flash is a new AI model that dramatically reduces the cost of building AI agents by 100x, potentially revolutionizing the agent market.
The article examines the dramatic cost difference between open-weight models like DeepSeek V4 and closed models from Anthropic and OpenAI, arguing that the latter sustain high prices through artificial scarcity and branding rather than technical superiority.
Achieved DeepSeek-V4-Flash MTP speculative decoding on 2× RTX PRO 6000 with a 38% throughput increase by fixing a mis-routed quantization format issue.
Update on running a non-quantized DeepSeek-v4-Flash model at 11 tok/s on a single DGX Spark using sglang inference and a custom mega-kernel, progressing towards GLM-5.2.
Fenng shares a self-media comparison between the fourth-generation WeLM-80B (80B total params, 3B activated, 3.75% activation rate) and DeepSeek-V4-Flash (284B total, 13B activated, 4.6% activation rate), with a humorous comment.
DeepSeek cache hit up to 95%, Maka desktop AI workstation performs excellently in long-context tasks, supports multiple models and tools, open source and local-first.
While running the dim-agent benchmark, the author noticed that DSv4's scores have been consistently improving, hinting at significant progress in model development.
Baidu has open-sourced the Unlimited OCR model, which uses the R-SWA attention mechanism to process hundreds of pages in a single pass without page splitting, with a constant KV Cache. The model innovatively mimics the attention pattern of humans copying books by hand and shares technical lineage with DeepSeek OCR, sparking discussions about talent mobility.
User @sheriyuo praises LatePost as the best AI media in the Chinese-language region, criticizes competitors like Xin Zhi Yuan, Quantum Bit, and Heart of Machine, and mentions the depth of reports on DeepSeek and others.
DeepSeek-V4-Fable is a distilled variant of Claude-5-Fable built on DeepSeek-V4-Flash, designed for autonomous offensive security research, CTF problem solving, and controlled environment exploitation planning, with strict authorization requirements.
The open-source OCR model Unlimited OCR, based on DeepSeek OCR, achieves 93.23 on OmniDocBench v1.5 with only 3B parameters, outperforming DeepSeek OCR, Gemini 2.5, and others.
Codex can self-configure to integrate third-party models like DeepSeek and Ollama by reading and modifying its config file automatically.
Following the recruitment link shared by Tianyi Cui, DeepSeek is hiring for roles in AI technology, infrastructure, and business, with mentions of compensation.
This article deeply explains the importance of the evaluation framework (Harness) in AI, analyzes the strategic significance of DeepSeek building its own Harness team, and compares the differences between the open-source lm-evaluation-harness and an in-house system.
This tutorial explains how to install Codex++ and configure a DeepSeek API key to unlock the full features of Codex AI tool in China, bypassing the need for a ChatGPT account or subscription.
MCP web search service based on DeepSeek API, providing web search capabilities for MCP-compatible clients (such as Claude Code, OpenCode), avoiding reliance on third-party search services. Only one DeepSeek API Key is needed to use it.
Mentions of AI models Deepseek and Kimi, possibly discussing recent updates or comparisons.
WeChat launches AI Agent product 'Xiao Wei', main model uses WeLM, some answers fallback to DeepSeek, grayscale testing has begun.