deepseek

Tag

Cards List
#deepseek

@Ex0byt: Update: the road to GLM-5.2: we're getting there, folks! non-quantized, non-pruned DeepSeek-v4-Flash. 11tok/s on a sing…

X AI KOLs Timeline · 19h ago Cached

Update on running a non-quantized DeepSeek-v4-Flash model at 11 tok/s on a single DGX Spark using sglang inference and a custom mega-kernel, progressing towards GLM-5.2.

0 favorites 0 likes
#deepseek

@Fenng: Saw this piece written by a self-media account — 'The latest fourth-generation WeLM-80B now has only 80 billion total parameters, with 3 billion activated, an activation rate of just 3.75%. For comparison — DeepSeek-V4-Flash, the domestic representative of extreme cost-performance, has 284 billion total parameters, 13 billion activated, activation rate of 4.6%...'

X AI KOLs Timeline · yesterday Cached

Fenng shares a self-media comparison between the fourth-generation WeLM-80B (80B total params, 3B activated, 3.75% activation rate) and DeepSeek-V4-Flash (284B total, 13B activated, 4.6% activation rate), with a humorous comment.

0 favorites 0 likes
#deepseek

@jakevin7: DeepSeek cache hit rate 95%, feels great. Maka's performance under the latest round of long-context tasks with the Deepseek model is outstanding. Total runtime close to 18 hours, nearly 400 million tokens, cost 33 bucks. The Make builders are amazing…

X AI KOLs Timeline · yesterday Cached

DeepSeek cache hit up to 95%, Maka desktop AI workstation performs excellently in long-context tasks, supports multiple models and tools, open source and local-first.

0 favorites 0 likes
#deepseek

@ashfold: Revealing the answer. While running the dim-agent benchmark, we noticed that DSv4's scores have been consistently improving. The whales are cooking!

X AI KOLs Timeline · yesterday Cached

While running the dim-agent benchmark, the author noticed that DSv4's scores have been consistently improving, hinting at significant progress in model development.

0 favorites 0 likes
#deepseek

@berryxia: Wow, this move directly poached DeepSeek's talent! Last night I saw this interesting OCR open-source model on HuggingFace and the fascinating story behind it. This OCR model is completely different from traditional ones! Its speed and accuracy are absolutely unbeatable~~ Let me start with some background, for those who are familiar…

X AI KOLs Timeline · yesterday Cached

Baidu has open-sourced the Unlimited OCR model, which uses the R-SWA attention mechanism to process hundreds of pages in a single pass without page splitting, with a constant KV Cache. The model innovatively mimics the attention pattern of humans copying books by hand and shares technical lineage with DeepSeek OCR, sparking discussions about talent mobility.

0 favorites 0 likes
#deepseek

@sheriyuo: I think @latepostnews's reporting is indisputably the best in the Chinese-language region, whether it's the earlier DeepSeek exclusive, Top Seed, or this latest big-package narrative. They really dare to write—Xin Zhi Yuan / Quantum Bit and the like can't even come close, and as for Heart of Machine, it's just distilling a few tweets on Twitter these days (...

X AI KOLs Timeline · 2d ago Cached

User @sheriyuo praises LatePost as the best AI media in the Chinese-language region, criticizes competitors like Xin Zhi Yuan, Quantum Bit, and Heart of Machine, and mentions the depth of reports on DeepSeek and others.

0 favorites 0 likes
#deepseek

@manateelazycat: Did a big shot come from Baidu's AI Whampoa Military Academy? The open-source Unlimited OCR, based on DeepSeek OCR, immediately drops a killer move. According to its published data, it scored 93.23 on OmniDocBench v1.5, surpassing DeepSeek OCR and...

X AI KOLs Timeline · 2d ago Cached

The open-source OCR model Unlimited OCR, based on DeepSeek OCR, achieves 93.23 on OmniDocBench v1.5 with only 3B parameters, outperforming DeepSeek OCR, Gemini 2.5, and others.

0 favorites 0 likes
#deepseek

@VincentLogic: Codex integrates third-party models. The easiest way: let Codex configure itself. Just tell it "Read ~/.codex/config.toml, add a custom model for DeepSeek, don't overwrite existing config, read API Key from environment variable". It will modify the config and…

X AI KOLs Timeline · 2d ago Cached

Codex can self-configure to integrate third-party models like DeepSeek and Ollama by reading and modifying its config file automatically.

0 favorites 0 likes
#deepseek

@Fenng: Following the recruitment link shared by Tianyi Cui ( @tianyi ), I checked out DeepSeek's current open positions: https://app.mokahr.com/social-recruitment/high-flyer/140576#/j…

X AI KOLs Following · 3d ago Cached

Following the recruitment link shared by Tianyi Cui, DeepSeek is hiring for roles in AI technology, infrastructure, and business, with mentions of compensation.

0 favorites 0 likes
#deepseek

@AntCaveClub: What exactly is Harness? Harness = Evaluation Harness. In AI, "harness" is industry jargon – a set of tools to "harness" a model and run standardized evaluations. The industry standard is EleutherAI's lm-e…

X AI KOLs Timeline · 3d ago Cached

This article deeply explains the importance of the evaluation framework (Harness) in AI, analyzes the strategic significance of DeepSeek building its own Harness team, and compares the differences between the open-source lm-evaluation-harness and an in-house system.

0 favorites 0 likes
#deepseek

@touxnplayai: https://x.com/touxnplayai/status/2068596799888388373

X AI KOLs Timeline · 3d ago Cached

This tutorial explains how to install Codex++ and configure a DeepSeek API key to unlock the full features of Codex AI tool in China, bypassing the need for a ChatGPT account or subscription.

0 favorites 0 likes
#deepseek

@QingQ77: MCP web search service based on DeepSeek API https://github.com/chengx-coding/forever-saint-liang-websearch... Provides web search capabilities for MCP-compatible clients (Claude Code, Op...

X AI KOLs Timeline · 4d ago Cached

MCP web search service based on DeepSeek API, providing web search capabilities for MCP-compatible clients (such as Claude Code, OpenCode), avoiding reliance on third-party search services. Only one DeepSeek API Key is needed to use it.

0 favorites 0 likes
#deepseek

Deepseek, kimi etc..

Reddit r/AI_Agents · 4d ago

Mentions of AI models Deepseek and Kimi, possibly discussing recent updates or comparisons.

0 favorites 0 likes
#deepseek

@Fenng: WeChat's AI Agent has arrived, product name 'Xiao Wei', main model uses WeLM, partial answers fallback to DeepSeek, grayscale testing has begun.

X AI KOLs Following · 4d ago

WeChat launches AI Agent product 'Xiao Wei', main model uses WeLM, some answers fallback to DeepSeek, grayscale testing has begun.

0 favorites 0 likes
#deepseek

@Xudong07452910: When AI starts researching AI autonomously, what's truly open-sourced may not be the code, but a research protocol. DeepSeek researcher Deli Chen's open-source Deli AutoResearch SKILL is worth a look — it's a set of rules for AI to conduct long-term research. It's not a complex codebase, but a...

X AI KOLs Timeline · 4d ago Cached

DeepSeek researcher Deli Chen open-sourced Deli AutoResearch SKILL, a SKILL.md protocol file that defines the operating rules for AI's long-term autonomous research, including state persistence, stagnation detection, heartbeat mechanism, etc., aiming to decompose autonomous scientific research from a vision into a sustainable engineering closed loop.

0 favorites 0 likes
#deepseek

‘No poaching' our people, China's AI behemoth DeepSeek reportedly tells investors (3 minute read)

TLDR AI · 5d ago Cached

DeepSeek reportedly requires investors to promise not to poach its talent as part of its $7.4 billion fundraising round, highlighting the intense competition for AI engineers in China.

0 favorites 0 likes
#deepseek

@ciruai: Testing DeepSeek v4 Flash on the AMD Ryzen AI Max+ 395 Strix Halo with 128GB RAM. Getting ~15 TPS over a decently long …

X AI KOLs Timeline · 6d ago Cached

Testing DeepSeek v4 Flash on the AMD Ryzen AI Max+ 395 with 128GB RAM achieves ~15 TPS for a 284B MoE model (13B active) locally, costing $3,000 versus $25,000+ for a datacenter setup, highlighting the feasibility of running large models on consumer hardware.

0 favorites 0 likes
#deepseek

@NFTCPS: Guys, using DeepSeek V4 Pro to run Codex, the tokens burning a hole in your pocket? You gotta know these two skills. token-saver: after modifying code, just returns a path + done, no extra words. Tests show it saves 60-80% tokens memory…

X AI KOLs Timeline · 6d ago Cached

Codex skills optimized for DeepSeek V4 Pro, saves 60-80% tokens by freezing skill files and minimal output, with cross-conversation persistent memory capability.

0 favorites 0 likes
#deepseek

DeepSeek Introduces Vision

Hacker News Top · 6d ago

DeepSeek announces a new vision capability, likely a vision-language model, expanding its AI offerings.

0 favorites 0 likes
#deepseek

@anxue201: https://x.com/anxue201/status/2067477109816050119

X AI KOLs Timeline · 6d ago Cached

A detailed configuration guide that teaches users how to connect OpenAI Codex to third-party models like DeepSeek through the open-source proxy tool CC Switch, solving protocol incompatibility issues.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback