deepseek

Tag

Cards List
#deepseek

‘No poaching' our people, China's AI behemoth DeepSeek reportedly tells investors (3 minute read)

TLDR AI · 6d ago Cached

DeepSeek reportedly requires investors to promise not to poach its talent as part of its $7.4 billion fundraising round, highlighting the intense competition for AI engineers in China.

0 favorites 0 likes
#deepseek

@ciruai: Testing DeepSeek v4 Flash on the AMD Ryzen AI Max+ 395 Strix Halo with 128GB RAM. Getting ~15 TPS over a decently long …

X AI KOLs Timeline · 2026-06-18 Cached

Testing DeepSeek v4 Flash on the AMD Ryzen AI Max+ 395 with 128GB RAM achieves ~15 TPS for a 284B MoE model (13B active) locally, costing $3,000 versus $25,000+ for a datacenter setup, highlighting the feasibility of running large models on consumer hardware.

0 favorites 0 likes
#deepseek

@NFTCPS: Guys, using DeepSeek V4 Pro to run Codex, the tokens burning a hole in your pocket? You gotta know these two skills. token-saver: after modifying code, just returns a path + done, no extra words. Tests show it saves 60-80% tokens memory…

X AI KOLs Timeline · 2026-06-18 Cached

Codex skills optimized for DeepSeek V4 Pro, saves 60-80% tokens by freezing skill files and minimal output, with cross-conversation persistent memory capability.

0 favorites 0 likes
#deepseek

DeepSeek Introduces Vision

Hacker News Top · 2026-06-18

DeepSeek announces a new vision capability, likely a vision-language model, expanding its AI offerings.

0 favorites 0 likes
#deepseek

@anxue201: https://x.com/anxue201/status/2067477109816050119

X AI KOLs Timeline · 2026-06-18 Cached

A detailed configuration guide that teaches users how to connect OpenAI Codex to third-party models like DeepSeek through the open-source proxy tool CC Switch, solving protocol incompatibility issues.

0 favorites 0 likes
#deepseek

Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression

arXiv cs.LG · 2026-06-18 Cached

Proposes a structural pruning framework for MoE models that maximizes channel-score coverage via attribution-based approximation, achieving 50% or 25% pruning with 4-bit quantization and reducing memory footprint by 5.27x on Qwen3-30B-A3B.

0 favorites 0 likes
#deepseek

@VukRosic99: A DeepSeek researcher just open-sourced his AutoResearch personal project. For the first time, the AutoResearch Agent a…

X AI KOLs Timeline · 2026-06-18 Cached

A DeepSeek researcher open-sourced AutoResearch, an autonomous framework that can plan, execute, and debug RL experiments on the DeepSeek 285B model without human intervention, accompanied by a self-play survey paper.

0 favorites 0 likes
#deepseek

@jakevin7: Deepseek has recently been at the center of attention again over financing (did I use the phrase right?). Closely tied to financing is actually the most important thing: the team. Deepseek has been wildly popular for a while now, but as far as I’ve observed, the only core team members who have left are Guo Daya & Wang Bin...

X AI KOLs Following · 2026-06-17 Cached

Discussing DeepSeek's recent financing and the departure of core team members Guo Daya and Wang Binxuan, pointing out the extremely low turnover rate, which reflects a good team culture.

0 favorites 0 likes
#deepseek

@shaogefenhao: Recently set up E2E, AI automatically creates E2E test cases then completes development and debugging, passing acceptance in one go. Yesterday the team worked on a requirement, AI completed it end-to-end, passed acceptance in one go, everyone was amazed. And it's only using the cheap model DeepSeek V4 Flash.

X AI KOLs Timeline · 2026-06-17 Cached

Team members shared their experience of using AI (DeepSeek V4 Flash) to automatically create E2E test cases and complete development and debugging, passing acceptance in one go, demonstrating the potential of AI-assisted development.

0 favorites 0 likes
#deepseek

@victor207755822: Deli AutoResearch SKILL is now officially open source! https://victorchen96.github.io/auto_research/framework.html… Alo…

X AI KOLs Timeline · 2026-06-17 Cached

Deli AutoResearch SKILL is open-sourced, an autonomous framework that automates GPU experiments and RL pipelines, with a companion survey paper on Self-play.

0 favorites 0 likes
#deepseek

US holds off blacklisting DeepSeek, more than 100 firms deemed security risks

Hacker News Top · 2026-06-17

The US government has paused blacklisting DeepSeek but has designated over 100 other firms as security risks, impacting tech and AI companies.

0 favorites 0 likes
#deepseek

Update: DeepSeek AI and the Great Talent Competition

Reddit r/artificial · 2026-06-16 Cached

This analysis updates the study of DeepSeek's research team, revealing that their talent pool has grown to 356 researchers with increasing citation impact and that over half have only Chinese affiliations, highlighting challenges for U.S. talent retention and independence.

1 favorites 1 likes
#deepseek

@sheriyuo: The DeepSeek Harness team is really short-staffed right now, so anyone wanting to join DeepSeek should seize the opportunity. It's totally unlike DeepSeek's usual hiring style—they've split recruiting into Harness and non-Harness tracks.

X AI KOLs Timeline · 2026-06-16 Cached

The DeepSeek Harness team is in urgent need of talent; the hiring policy has been changed to separate Harness and non-Harness tracks.

0 favorites 0 likes
#deepseek

@PolymarketMoney: JUST IN: $MSFT weighs DeepSeek for Copilot Cowork.

X AI KOLs Following · 2026-06-16 Cached

Microsoft is reportedly considering integrating DeepSeek into its Copilot Cowork product.

0 favorites 0 likes
#deepseek

@Gorden_Sun: https://x.com/Gorden_Sun/status/2066919099016630286

X AI KOLs Following · 2026-06-16 Cached

A long-term study involving 26,000 Chinese middle and high school students found that after students independently used AI, homework performance improved by 18%, but closed-book exam scores dropped by 20% within six months. Zhongkao and Gaokao scores dropped by 24% and 18% respectively, and 81% of students used AI to complete their homework.

0 favorites 0 likes
#deepseek

@natolambert: New podcast with @finbarrtimbers! We survey the latest post-training recipes, from GLM 5.1, Kimi K2.6, DeepSeek V4, Xia…

X AI KOLs Timeline · 2026-06-16 Cached

Nathan Lambert and Finbarr Timbers discuss the latest post-training recipes for large language models, including DeepSeek V4, GLM 5.1, Kimi K2.6, and the industry shift to multi-teacher on-policy distillation.

0 favorites 0 likes
#deepseek

@huangjinbo: Reasonix is truly excellent. Don't be misled by its project name (DeepSeek-Reasonix). As long as the relay supports OpenAI-compatible, it can be supported... Recommending again. Mainly its skills, memory, Hooks, MCP and other features are all very useful... It was used to…

X AI KOLs Timeline · 2026-06-16 Cached

Reasonix (formerly named DeepSeek-Reasonix) is an AI coding agent CLI tool developed in Go, supporting features like skills, memory, Hooks, MCP, etc., and can replace OpenCode.

0 favorites 0 likes
#deepseek

Stop When Further Reasoning Won't Help: Attention-State Adaptive Generation in Reasoning Models

arXiv cs.CL · 2026-06-16 Cached

This paper proposes ASAG, a training-free method that adaptively stops reasoning in large reasoning models based on attention distributions, reducing token usage by ~40% while improving accuracy by 3.2% on benchmarks using DeepSeek-R1-Distill and Qwen3 models.

0 favorites 0 likes
#deepseek

@ziv_ravid: 1/I read the Nemotron 3 Ultra report and it's interesting to compare their post-training to DeepSeek V4's. Both now do …

X AI KOLs Timeline · 2026-06-15 Cached

The tweet compares the post-training methods of Nemotron 3 Ultra and DeepSeek V4, noting both use multiple specialist teachers and on-policy distillation into a single student, but differ in support overlap.

0 favorites 0 likes
#deepseek

How did China develop AI so quickly recently if most work was done in USA ?

Reddit r/ArtificialInteligence · 2026-06-14

This article discusses how China has rapidly advanced in AI despite being a latecomer, questioning the sources of datasets, computing power, and algorithms that enabled companies like DeepSeek to catch up with US leaders like OpenAI and Google.

0 favorites 0 likes
← Previous
Next →
← Back to home

Submit Feedback