deepseek

Tag

Cards List
#deepseek

Political bias in AI: Where the AI models stand

Hacker News Top · 7h ago Cached

An analysis of political leanings in six major AI models, showing that 4 out of 6 lean left of center on the economic axis, with some models being unaware of their own bias.

0 favorites 0 likes
#deepseek

GLM 5.2 on Dual Strix Halo (256GB): Worth it?

Reddit r/LocalLLaMA · 13h ago Cached

This article evaluates the performance of running GLM 5.2 (IQ2M quantized version) on Dual Strix Halo (256GB VRAM). The generation speed is only about 7 tokens/s, and coding tasks take twice as long as DeepSeek V4 Flash. Its cost-performance ratio is far inferior to other models, so it is not recommended for use with this hardware configuration.

0 favorites 0 likes
#deepseek

DeepSeek Flash just revolutionized the agent market: 100x cheaper agents

Reddit r/AI_Agents · 16h ago

DeepSeek Flash is a new AI model that dramatically reduces the cost of building AI agents by 100x, potentially revolutionizing the agent market.

0 favorites 0 likes
#deepseek

The Unbearable Cheapness of Open Weight Models

Hacker News Top · 17h ago Cached

The article examines the dramatic cost difference between open-weight models like DeepSeek V4 and closed models from Anthropic and OpenAI, arguing that the latter sustain high prices through artificial scarcity and branding rather than technical superiority.

0 favorites 0 likes
#deepseek

@Hikari_07_jp: I got DeepSeek-V4-Flash MTP speculative decoding actually working on 2× RTX PRO 6000 +38% single-stream throughput. It …

X AI KOLs Timeline · 23h ago Cached

Achieved DeepSeek-V4-Flash MTP speculative decoding on 2× RTX PRO 6000 with a 38% throughput increase by fixing a mis-routed quantization format issue.

0 favorites 0 likes
#deepseek

@Ex0byt: Update: the road to GLM-5.2: we're getting there, folks! non-quantized, non-pruned DeepSeek-v4-Flash. 11tok/s on a sing…

X AI KOLs Timeline · yesterday Cached

Update on running a non-quantized DeepSeek-v4-Flash model at 11 tok/s on a single DGX Spark using sglang inference and a custom mega-kernel, progressing towards GLM-5.2.

0 favorites 0 likes
#deepseek

@Fenng: Saw this piece written by a self-media account — 'The latest fourth-generation WeLM-80B now has only 80 billion total parameters, with 3 billion activated, an activation rate of just 3.75%. For comparison — DeepSeek-V4-Flash, the domestic representative of extreme cost-performance, has 284 billion total parameters, 13 billion activated, activation rate of 4.6%...'

X AI KOLs Timeline · 2d ago Cached

Fenng shares a self-media comparison between the fourth-generation WeLM-80B (80B total params, 3B activated, 3.75% activation rate) and DeepSeek-V4-Flash (284B total, 13B activated, 4.6% activation rate), with a humorous comment.

0 favorites 0 likes
#deepseek

@jakevin7: DeepSeek cache hit rate 95%, feels great. Maka's performance under the latest round of long-context tasks with the Deepseek model is outstanding. Total runtime close to 18 hours, nearly 400 million tokens, cost 33 bucks. The Make builders are amazing…

X AI KOLs Timeline · 2d ago Cached

DeepSeek cache hit up to 95%, Maka desktop AI workstation performs excellently in long-context tasks, supports multiple models and tools, open source and local-first.

0 favorites 0 likes
#deepseek

@ashfold: Revealing the answer. While running the dim-agent benchmark, we noticed that DSv4's scores have been consistently improving. The whales are cooking!

X AI KOLs Timeline · 2d ago Cached

While running the dim-agent benchmark, the author noticed that DSv4's scores have been consistently improving, hinting at significant progress in model development.

0 favorites 0 likes
#deepseek

@berryxia: Wow, this move directly poached DeepSeek's talent! Last night I saw this interesting OCR open-source model on HuggingFace and the fascinating story behind it. This OCR model is completely different from traditional ones! Its speed and accuracy are absolutely unbeatable~~ Let me start with some background, for those who are familiar…

X AI KOLs Timeline · 2d ago Cached

Baidu has open-sourced the Unlimited OCR model, which uses the R-SWA attention mechanism to process hundreds of pages in a single pass without page splitting, with a constant KV Cache. The model innovatively mimics the attention pattern of humans copying books by hand and shares technical lineage with DeepSeek OCR, sparking discussions about talent mobility.

0 favorites 0 likes
#deepseek

@sheriyuo: I think @latepostnews's reporting is indisputably the best in the Chinese-language region, whether it's the earlier DeepSeek exclusive, Top Seed, or this latest big-package narrative. They really dare to write—Xin Zhi Yuan / Quantum Bit and the like can't even come close, and as for Heart of Machine, it's just distilling a few tweets on Twitter these days (...

X AI KOLs Timeline · 3d ago Cached

User @sheriyuo praises LatePost as the best AI media in the Chinese-language region, criticizes competitors like Xin Zhi Yuan, Quantum Bit, and Heart of Machine, and mentions the depth of reports on DeepSeek and others.

0 favorites 0 likes
#deepseek

Chunjiang-Intelligence/DeepSeek-v4-Fable

Hugging Face Models Trending · 3d ago Cached

DeepSeek-V4-Fable is a distilled variant of Claude-5-Fable built on DeepSeek-V4-Flash, designed for autonomous offensive security research, CTF problem solving, and controlled environment exploitation planning, with strict authorization requirements.

0 favorites 0 likes
#deepseek

@manateelazycat: Did a big shot come from Baidu's AI Whampoa Military Academy? The open-source Unlimited OCR, based on DeepSeek OCR, immediately drops a killer move. According to its published data, it scored 93.23 on OmniDocBench v1.5, surpassing DeepSeek OCR and...

X AI KOLs Timeline · 3d ago Cached

The open-source OCR model Unlimited OCR, based on DeepSeek OCR, achieves 93.23 on OmniDocBench v1.5 with only 3B parameters, outperforming DeepSeek OCR, Gemini 2.5, and others.

0 favorites 0 likes
#deepseek

@VincentLogic: Codex integrates third-party models. The easiest way: let Codex configure itself. Just tell it "Read ~/.codex/config.toml, add a custom model for DeepSeek, don't overwrite existing config, read API Key from environment variable". It will modify the config and…

X AI KOLs Timeline · 3d ago Cached

Codex can self-configure to integrate third-party models like DeepSeek and Ollama by reading and modifying its config file automatically.

0 favorites 0 likes
#deepseek

@Fenng: Following the recruitment link shared by Tianyi Cui ( @tianyi ), I checked out DeepSeek's current open positions: https://app.mokahr.com/social-recruitment/high-flyer/140576#/j…

X AI KOLs Following · 4d ago Cached

Following the recruitment link shared by Tianyi Cui, DeepSeek is hiring for roles in AI technology, infrastructure, and business, with mentions of compensation.

0 favorites 0 likes
#deepseek

@AntCaveClub: What exactly is Harness? Harness = Evaluation Harness. In AI, "harness" is industry jargon – a set of tools to "harness" a model and run standardized evaluations. The industry standard is EleutherAI's lm-e…

X AI KOLs Timeline · 4d ago Cached

This article deeply explains the importance of the evaluation framework (Harness) in AI, analyzes the strategic significance of DeepSeek building its own Harness team, and compares the differences between the open-source lm-evaluation-harness and an in-house system.

0 favorites 0 likes
#deepseek

@touxnplayai: https://x.com/touxnplayai/status/2068596799888388373

X AI KOLs Timeline · 4d ago Cached

This tutorial explains how to install Codex++ and configure a DeepSeek API key to unlock the full features of Codex AI tool in China, bypassing the need for a ChatGPT account or subscription.

0 favorites 0 likes
#deepseek

@QingQ77: MCP web search service based on DeepSeek API https://github.com/chengx-coding/forever-saint-liang-websearch... Provides web search capabilities for MCP-compatible clients (Claude Code, Op...

X AI KOLs Timeline · 5d ago Cached

MCP web search service based on DeepSeek API, providing web search capabilities for MCP-compatible clients (such as Claude Code, OpenCode), avoiding reliance on third-party search services. Only one DeepSeek API Key is needed to use it.

0 favorites 0 likes
#deepseek

Deepseek, kimi etc..

Reddit r/AI_Agents · 5d ago

Mentions of AI models Deepseek and Kimi, possibly discussing recent updates or comparisons.

0 favorites 0 likes
#deepseek

@Fenng: WeChat's AI Agent has arrived, product name 'Xiao Wei', main model uses WeLM, partial answers fallback to DeepSeek, grayscale testing has begun.

X AI KOLs Following · 5d ago

WeChat launches AI Agent product 'Xiao Wei', main model uses WeLM, some answers fallback to DeepSeek, grayscale testing has begun.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback