deepseek

#deepseek

@bookwormengr: Wonderful coverage on CANN (Huawei's CUDA) and DeepSeek V4 inference on Huawei chips.... "CANN (Compute Architecture fo…

X AI KOLs Timeline ↗ · 2026-06-09 Cached

Huawei has open-sourced its CANN software toolkit to compete with Nvidia's CUDA, and DeepSeek V4 shows significant inference performance improvements on Huawei Ascend chips.

0 favorites 0 likes

#deepseek

Here are some tips on hitting nearly 200 tok/s for DeepSeek v4 Flash on Hopper

Reddit r/LocalLLaMA ↗ · 2026-06-08 Cached

This blog post provides tips and benchmarks for achieving nearly 200 tokens per second inference on DeepSeek V4 Flash using vLLM on a dual GH200 workstation, highlighting the use of a quantized checkpoint from Canada-Quant and tensor parallelism optimizations.

0 favorites 0 likes

#deepseek

MiniMax is digging its own grave

Reddit r/AI_Agents ↗ · 2026-06-08

MiniMax's price increases and model limitations are driving users away to competitors like DeepSeek and premium options like Claude or ChatGPT, reversing its earlier reputation as a cheap, usable daily driver.

0 favorites 0 likes

#deepseek

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

Hacker News Top ↗ · 2026-06-08

DeepSeek V4 Pro reportedly outperforms GPT-5.5 Pro on precision, suggesting a significant advancement in model accuracy.

0 favorites 0 likes

#deepseek

@GoSailGlobal: Practical data on multi-agent AI collaboration: Use Opus 4.8 for planning, Deepseek/Gemma for execution — 10x cost reduction, 2x speed improvement. The secret is not using the most expensive model, but having cheap models do the heavy lifting and expensive models only make decisions. This is the same as company management: the CEO shouldn't write code, and interns shouldn't set strategy. A…

X AI KOLs Timeline ↗ · 2026-06-08 Cached

A practical sharing on multi-agent AI collaboration, proposing a hierarchical strategy using Opus 4.8 for planning and Deepseek/Gemma for execution, achieving a 10x cost reduction and 2x speed improvement, with open-source implementation.

0 favorites 0 likes

#deepseek

@jakevin7: DeepSeek V4's "Think Max" mode essentially just adds "You must think through every step clearly, no shortcuts" at the start of the prompt. So is reasoning ability emergent, or... is it scolded into existence?

X AI KOLs Following ↗ · 2026-06-06 Cached

DeepSeek V4's "Think Max" mode essentially just adds a prompt prefix requiring step-by-step reasoning, sparking debate on the origin of reasoning ability.

0 favorites 0 likes

#deepseek

@cyrilXBT: Nemotron 3 Ultra versus DeepSeek V4 versus MiniMax M3 versus Qwen 3.7 Max. Same two prompts. Four frontier models. One …

X AI KOLs Following ↗ · 2026-06-06 Cached

A comparison of four frontier AI models (Nemotron 3 Ultra, DeepSeek V4, MiniMax M3, Qwen 3.7 Max) on the same two prompts, with full results linked.

0 favorites 0 likes

#deepseek

@antirez: DeepSeek v4 PRO running via SSD streaming on my 128GB MacBook m5 max. 1.6 trillion parameters.

X AI KOLs Timeline ↗ · 2026-06-04 Cached

DeepSeek v4 PRO, a 1.6 trillion parameter model, is running via SSD streaming on a 128GB MacBook m5 max, demonstrating local inference of a massive model.

0 favorites 0 likes

#deepseek

@queen_nunaa: Someone set up a repo on GitHub that lets you use Claude Code for free, forever. It works by routing Claude Code requests to 10 free providers like DeepSeek, Kimi, etc. Setup takes about five minutes, and already...

X AI KOLs Timeline ↗ · 2026-06-04 Cached

Someone created a repository on GitHub that forwards Claude Code requests to 10 free providers such as DeepSeek and Kimi, allowing users to use Claude Code for free and permanently. Setup takes only five minutes, and over 20,000 developers are already using it.

0 favorites 0 likes

#deepseek

Big Model Value Wars - DeepSeek V4 Pro vs MiMo-V2.5-Pro vs MiniMax M3

Reddit r/LocalLLaMA ↗ · 2026-06-03

A discussion comparing DeepSeek V4 Pro, MiMo-V2.5-Pro, and MiniMax M3 for best value in local or openrouter use, with a focus on agentic and coding tasks, and mentions of Hermes Agent and Qwen 3.6 variants.

0 favorites 0 likes

#deepseek

@TheAhmadOsman: S-Tier Chinese Labs: Moonshot and DeepSeek These 2 are levels above everyone else

X AI KOLs Following ↗ · 2026-06-03 Cached

A brief opinion stating that Moonshot and DeepSeek are the top-tier Chinese AI labs, far ahead of others.

0 favorites 0 likes

#deepseek

Why Chinese AI Models Are Reshaping the Economics of AI

Reddit r/AI_Agents ↗ · 2026-06-03

Chinese AI models like DeepSeek and Qwen deliver competitive performance at 5x–20x lower cost than Western counterparts, reshaping the economics of AI and driving multi-model deployment strategies.

0 favorites 0 likes

#deepseek

@NeoResearchAI: We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab. Today we're publishing …

X AI KOLs Following ↗ · 2026-06-02 Cached

Neo Research (新衡), Asia's first independent frontier AI safety evaluation lab, announces its first report: a safety evaluation of DeepSeek v4 Pro.

0 favorites 0 likes

#deepseek

Observation: the best agent harness for each model will be from the model developer themselves

Reddit r/AI_Agents ↗ · 2026-06-01

A discussion on how AI models perform best with harnesses developed by their own creators, as third-party harnesses may cause underperformance despite strong benchmarks, citing examples like Claude Code for Claude and Codex for GPT.

0 favorites 0 likes

#deepseek

@danveloper: I can't believe this works, but I got DeepSeek-V4-Flash (284B params) running on a Raspberry Pi 5 (8GB edition) at >1to…

X AI KOLs Timeline ↗ · 2026-06-01 Cached

A developer successfully ran the 284B-parameter DeepSeek-V4-Flash model on a Raspberry Pi 5 at over 1 tok/s, using an untouched GGUF file from antirez after extensive experimentation.

0 favorites 0 likes

#deepseek

Deepseek V4 flash performance on DGX Spark

Reddit r/LocalLLaMA ↗ · 2026-06-01

A Reddit user shares their experience running DeepSeek V4 Flash on a dual-ASUS GX10 DGX Spark setup, detailing performance metrics, configuration, and power consumption, with throughput benchmarks across various context lengths.

0 favorites 0 likes

#deepseek

Building and managing AI agents in SAFi

Reddit r/AI_Agents ↗ · 2026-05-31

The author introduces SAFi, an open-source runtime governance engine for AI agents, detailing its memory system (ethical, conversational, profile, project) and practical use cases like a work assistant powered by DeepSeek V4.

0 favorites 0 likes

#deepseek

Confusing “run out of credits” error with OpenRouter DeepSeek V4 Pro model

Reddit r/openclaw ↗ · 2026-05-31

A user reports that the DeepSeek V4 Pro model via OpenRouter returned a misleading 'run out of credits' error, which turned out to be a model-specific issue, causing hours of wasted debugging.

0 favorites 0 likes

#deepseek

@royxy: You've all heard that you should use Codex for planning and Deepseek for implementation. But over the past couple of days, while pushing forward discussions on a highly complex project that has probably never been done before, I feel that Deepseek is more creative than Codex, while Codex's logical and engineering...

X AI KOLs Timeline ↗ · 2026-05-31

User shares experience using Deepseek and Codex for complex project planning and implementation, finding Deepseek more creative while Codex stronger in logic and engineering abilities.

0 favorites 0 likes

#deepseek

DeepSWE benchmarks indicate that DeepSeek v4 Pro only passes 8% of tasks

Reddit r/LocalLLaMA ↗ · 2026-05-31

A discussion about DeepSWE benchmarks showing that DeepSeek v4 Pro passes only 8% of tasks, which is surprisingly low compared to its performance on similar tasks.

0 favorites 0 likes

deepseek

Submit Feedback