deepseek

Tag

Cards List
#deepseek

@PolymarketMoney: JUST IN: $MSFT weighs DeepSeek for Copilot Cowork.

X AI KOLs Following · 2026-06-16 Cached

Microsoft is reportedly considering integrating DeepSeek into its Copilot Cowork product.

0 favorites 0 likes
#deepseek

@Gorden_Sun: https://x.com/Gorden_Sun/status/2066919099016630286

X AI KOLs Following · 2026-06-16 Cached

A long-term study involving 26,000 Chinese middle and high school students found that after students independently used AI, homework performance improved by 18%, but closed-book exam scores dropped by 20% within six months. Zhongkao and Gaokao scores dropped by 24% and 18% respectively, and 81% of students used AI to complete their homework.

0 favorites 0 likes
#deepseek

@natolambert: New podcast with @finbarrtimbers! We survey the latest post-training recipes, from GLM 5.1, Kimi K2.6, DeepSeek V4, Xia…

X AI KOLs Timeline · 2026-06-16 Cached

Nathan Lambert and Finbarr Timbers discuss the latest post-training recipes for large language models, including DeepSeek V4, GLM 5.1, Kimi K2.6, and the industry shift to multi-teacher on-policy distillation.

0 favorites 0 likes
#deepseek

@huangjinbo: Reasonix is truly excellent. Don't be misled by its project name (DeepSeek-Reasonix). As long as the relay supports OpenAI-compatible, it can be supported... Recommending again. Mainly its skills, memory, Hooks, MCP and other features are all very useful... It was used to…

X AI KOLs Timeline · 2026-06-16 Cached

Reasonix (formerly named DeepSeek-Reasonix) is an AI coding agent CLI tool developed in Go, supporting features like skills, memory, Hooks, MCP, etc., and can replace OpenCode.

0 favorites 0 likes
#deepseek

Stop When Further Reasoning Won't Help: Attention-State Adaptive Generation in Reasoning Models

arXiv cs.CL · 2026-06-16 Cached

This paper proposes ASAG, a training-free method that adaptively stops reasoning in large reasoning models based on attention distributions, reducing token usage by ~40% while improving accuracy by 3.2% on benchmarks using DeepSeek-R1-Distill and Qwen3 models.

0 favorites 0 likes
#deepseek

@ziv_ravid: 1/I read the Nemotron 3 Ultra report and it's interesting to compare their post-training to DeepSeek V4's. Both now do …

X AI KOLs Timeline · 2026-06-15 Cached

The tweet compares the post-training methods of Nemotron 3 Ultra and DeepSeek V4, noting both use multiple specialist teachers and on-policy distillation into a single student, but differ in support overlap.

0 favorites 0 likes
#deepseek

How did China develop AI so quickly recently if most work was done in USA ?

Reddit r/ArtificialInteligence · 2026-06-14

This article discusses how China has rapidly advanced in AI despite being a latecomer, questioning the sources of datasets, computing power, and algorithms that enabled companies like DeepSeek to catch up with US leaders like OpenAI and Google.

0 favorites 0 likes
#deepseek

@vista8: Enter any app name, automatically fetch AppStore user reviews. Use DeepSeek for information mining, turning reviews into useful insights for product managers: 1. What are users actually praising or complaining about? 2. Which issues are related to version updates? 3. Which represent product opportunities? 4. Visual charts. Product expected to...

X AI KOLs Following · 2026-06-14 Cached

An AI tool that will soon be open-source, using DeepSeek to automatically fetch AppStore user reviews and perform information mining, helping product managers understand user feedback, version issues, and product opportunities.

0 favorites 0 likes
#deepseek

You can run Deepseek 4 flash on mac (M3 Max, 96gb)

Reddit r/LocalLLaMA · 2026-06-14

A guide on running DeepSeek 4 flash on a Mac M3 Max with 96GB RAM using Antirez's ds4 engine and SSD streaming, achieving ~12 tokens/second inference speed.

0 favorites 0 likes
#deepseek

Local models in mid-2026

Reddit r/LocalLLaMA · 2026-06-14 Cached

A technical overview of the state of local AI models in mid-2026, highlighting how open-weight models have narrowed the gap to frontier models through advances in mixture-of-experts and sparse attention, enabling efficient local inference.

0 favorites 0 likes
#deepseek

@rosstaylor90: A few words on the Sovereign AI debate, having built several LLMs in Meta while in the UK and now working as a UK based…

X AI KOLs Following · 2026-06-14 Cached

A former Meta AI researcher shares a 10-point thread on the UK's sovereign AI debate, arguing that smaller, well-scoped teams can validate new directions without billions, and that nurturing local talent and managing expectations are crucial for the UK's AI ecosystem.

0 favorites 0 likes
#deepseek

@LinearUncle: If you're after the ultimate coding agent freedom, only Pi can give you peace of mind. If you've been wanting to try Pi + DeepSeek but don't know how to set it up, I highly recommend beginners check out this foreign guy's blog: https://deepakness.com/blog…

X AI KOLs Timeline · 2026-06-12 Cached

This tweet recommends using the Pi coding agent with DeepSeek and links to a detailed setup guide blog.

0 favorites 0 likes
#deepseek

We captured the network traffic of ChatGPT, Gemini and DeepSeek to see how each defines a "source" — they're three completely different mechanisms

Reddit r/artificial · 2026-06-11

A technical investigation captured and compared the network traffic of ChatGPT, Gemini, and DeepSeek to understand how each system technically defines and attaches sources to responses, revealing three fundamentally different mechanisms and distinct citation preferences.

0 favorites 0 likes
#deepseek

How can Deepseek v4 top the coding leaderboards and still sit 8 months behind the frontier?

Reddit r/LocalLLaMA · 2026-06-11

Analysis of DeepSeek V4's top coding scores versus its reported 8-month gap behind the frontier, highlighting differences between narrow benchmark optimization and broader reasoning tests, plus the practical performance hit when running quantized local versions.

0 favorites 0 likes
#deepseek

Notes on DeepSeek

Hacker News Top · 2026-06-10 Cached

A visit to DeepSeek's headquarters reveals its modest origins, young team, and unique culture. The company, operated out of a hedge fund, focuses on staying small and remains unconcerned about AGI risks, instead prioritizing societal concerns like job loss.

0 favorites 0 likes
#deepseek

@akshay_pachaar: https://x.com/akshay_pachaar/status/2064700531600458093

X AI KOLs Following · 2026-06-10 Cached

This article explains how to use GRPO to fine-tune an LLM (Qwen3-8B) for reliable JSON structured output, improving schema accuracy from 62% to 82%, surpassing GPT-4.1's 58%.

0 favorites 0 likes
#deepseek

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

arXiv cs.AI · 2026-06-10 Cached

This paper investigates instruction finetuning of DeepSeek-R1-8B using LoRA and NEFTune for financial named-entity recognition, achieving a micro-F1 of 0.912 and outperforming several baseline models.

0 favorites 0 likes
#deepseek

DeepSeek enters the fight for token volume, Anthropic continues to dominate spend (12 minute read)

TLDR AI · 2026-06-10 Cached

AI Gateway's May 2026 data shows DeepSeek's token share surged to 17% with minimal spend, while Anthropic retained 65% of spend, indicating cost-conscious routing and growing overall usage.

0 favorites 0 likes
#deepseek

FlashMemory DeepSeek-V4 Retriever (GitHub Repo)

TLDR AI · 2026-06-10 Cached

Introduces FlashMemory DeepSeek-V4 Retriever, a lightweight model that sparsifies DeepSeek-V4's CSA KV-cache by predicting which chunks will be attended to next, keeping only ~10-15% on-device while matching full-attention performance.

0 favorites 0 likes
#deepseek

We are buying something that clones itself

Reddit r/ArtificialInteligence · 2026-06-09

The article argues that the AI startup wave is unsustainable because intelligence is an infinitely replicable commodity with zero marginal cost, and most AI companies will collapse by 2029, leaving only a few giants owning the physical layer like energy and chips.

0 favorites 0 likes
← Previous
Next →
← Back to home

Submit Feedback