Top Stories

Articles with importance ≥ 6 from the past 48 hours

DeepSeek V4 paper full version is out, FP4 QAT details and stability tricks [D]
Reddit r/MachineLearning · 3h ago

DeepSeek released the full V4 paper detailing FP4 quantization-aware training, MoE training stability tricks (anticipatory routing and SwiGLU clamping), and a generative reward model for RLHF, achieving dramatic efficiency gains—V4-Flash uses only 10% of V3.2's FLOPs and 7% of its KV cache at 1M context length.

Chrome’s AI features may be hogging 4GB of your computer storage
Lobsters Hottest · 3h ago

Google Chrome is automatically downloading a 4GB Gemini Nano model weights file to users' devices to power on-device AI features like scam detection and writing assistance, often without clear notification about storage requirements. Users can disable the On-Device AI toggle in Chrome settings to remove the file and prevent re-downloads.

@akshay_pachaar: Naive RAG vs. Blockify! There's a new RAG approach that: - cuts corpus size by 40x. - reduces tokens per query by 3x. -…
X AI KOLs Following · 3h ago

Blockify is a new open-source RAG framework that replaces naive chunking with a patented 'IdeaBlocks' pipeline, claiming 40x corpus size reduction, 3x token efficiency, and 2.3x vector search accuracy improvements. It transforms enterprise documents into structured XML knowledge units for more coherent LLM retrieval.

@Prince_Canuma: mlx-audio v0.4.3 is here A massive release across models, server, and DX → 6 new TTS models: Higgs Audio v2 (voice clon…
X AI KOLs Timeline · 3h ago

mlx-audio v0.4.3 releases with 6 new TTS models including Higgs Audio v2 and OmniVoice (646+ languages), plus server improvements like concurrent requests and continuous batching, ~3x faster Voxtral Realtime on 4-bit, and slimmer dependencies for Apple Silicon.

@xiaochuan8688: ByteDance Quietly Shut Down 30% of Its AI Projects — Everything Outside Doubao Is Being Cut Back. Industry insider info: At ByteDance's internal AI strategy review meeting in April, the company axed 30% of its AI application projects, including "Maobox," "Xinghui," and parts of the overseas AI video tool Dreamina's product lines. On the surface…
X AI KOLs Timeline · 4h ago

At an internal AI strategy review meeting in April, ByteDance cut 30% of its AI application projects — including Maobox, Xinghui, and parts of Dreamina — as no product outside of Doubao met its target DAU goals. The company will now focus on Doubao, make a hardware bet, and scale back investment in standalone AI apps.

@tom_doerr: Fully open sources training data for 30B scale search agents https://github.com/PolarSeeker/OpenSeeker…
X AI KOLs Timeline · 4h ago

OpenSeeker fully open-sources training data and models for 30B-scale ReAct-based search agents, achieving state-of-the-art performance on multiple benchmarks including BrowseComp and Humanity's Last Exam. It is the first purely academic project to reach frontier search benchmark performance while releasing complete training data.

@garrytan: Downloading now... 1M token context window with supposedly usable coding agent capability all on a 128GB Macbook Pro is
X AI KOLs Following · 4h ago

Garry Tan highlights a model with a 1M token context window and coding agent capabilities running locally on a 128GB MacBook Pro, expressing excitement about the milestone.

@baispx: BREAKING: Michael Burry — the Big Short who predicted the 2008 crash — opens $1 billion short position betting on AI bubble collapse, with bets on $PLTR at $912M and $NVDA at $187M. Last time he went this big was the 2008 global financial crisis, and he was right. …
X AI KOLs Timeline · 5h ago

Famed short seller Michael Burry has reportedly established approximately $1 billion in short positions betting on an AI bubble collapse, targeting primarily Palantir ($912M) and NVIDIA ($187M). This is his largest short play since the 2008 financial crisis.

EU calls VPNs "a loophole that needs closing" in age verification push
Hacker News Top · 5h ago

The European Parliamentary Research Service (EPRS) has labeled VPNs 'a loophole that needs closing' in the context of online age-verification laws, raising concerns about children bypassing regional content restrictions. The push has sparked pushback from privacy advocates and VPN providers, highlighting tensions between child safety regulation and digital privacy rights.

killswitch: per-function short-circuit mitigation primitive
Lobsters Hottest · 5h ago

A new Linux kernel patch proposes a 'killswitch' primitive that allows admins to immediately disable vulnerable kernel functions (e.g., af_alg_sendmsg) by making them return -EPERM, providing a rapid temporary mitigation for security issues without requiring a reboot or kernel rebuild.

A Randomized Scheduler with Probabilistic Guarantees of Finding Bugs
Lobsters Hottest · 6h ago

This Microsoft Research paper introduces a randomized scheduling technique designed to provide probabilistic guarantees for uncovering bugs in software systems. Published for the ASPLOS conference, it focuses on systematic fault detection through algorithmic randomness.

@WY_mask: Currently #1 on GitHub Trending with 40k+ stars https://github.com/ruvnet/ruflo — An "AI Orchestration Hub" that can spin up dozens of Agents working in parallel, with multi-agent collaboration, RAG memory, distributed workflows, and even direct integration with Claude Co…
X AI KOLs Timeline · 7h ago

Ruflo (formerly Claude Flow) is a trending open-source GitHub project that supports orchestrating 100+ specialized AI Agents simultaneously, featuring RAG memory, distributed workflows, enterprise security, and direct integration with Claude Code and Codex. The project is currently ranked #1 on GitHub Trending with 40k+ stars.

@davis7: @0xSero helped me setup local models properly and I uh, had no idea these things had gotten this good Are they frontier…
X AI KOLs Following · 8h ago

The author highlights the impressive capabilities of the open-source Qwen 3.6-27B model running locally on an RTX 5090, noting its strong performance on programming tasks and comparing it favorably to commercial models, despite the complexity of local deployment.

A recent experience with ChatGPT 5.5 Pro
Hacker News Top · 8h ago

Mathematician Timothy Gowers recounts how ChatGPT 5.5 Pro produced PhD-level mathematical research in about an hour with minimal human input, solving open problems from a combinatorics/additive number theory paper and prompting him to significantly revise his assessment of LLMs' mathematical capabilities.

@cyrilXBT: CHINA JUST BUILT AN AI MODEL THAT IS COMPETING WITH OPENAI AND ANTHROPIC AT A FRACTION OF THE COST. And someone just dr…
X AI KOLs Timeline · 8h ago

DeepSeek, a Chinese AI model built by a quant hedge fund, is reportedly competing with GPT-4 level performance at roughly 5% of the training cost, causing significant market disruption including a $600B drop in NVIDIA's market cap. A free 1 hour 50 minute course has been released teaching users how to leverage DeepSeek V4 locally and via API.

@TechFlow99: BREAKING: Someone just built the exact tool Andrej Karpathy said someone should build. 48 hours after Karpathy posted h…
X AI KOLs Timeline · 8h ago

A new open-source tool called Graphify was built within 48 hours of Andrej Karpathy describing an LLM knowledge base workflow, enabling users to generate navigable knowledge graphs, Obsidian vaults, and wikis from any folder with 71.5x fewer tokens per query compared to reading raw files. It integrates with Claude Code and supports 13 programming languages, PDFs, images, and Markdown.

@Kangwook_Lee: https://x.com/Kangwook_Lee/status/2052925157606568217
X AI KOLs Timeline · 9h ago

The author argues that human-designed structural frameworks for AI agents should be replaced by AI-engineered ones, introducing a Three Regimes Framework to show how this shift unlocks mid-sized model capabilities. Citing projects like Meta Harness, they predict an imminent transition where AI will autonomously optimize its own system architecture.

@elliotchen100: Thariq from Anthropic’s viral HTML post hit 1.5M reads. On the surface, it’s about formatting aesthetics, but he’s actually outlining a brand-new workflow. Picking out the most technical points. First, HTML isn’t a document; it’s a throwaway editor. Take his example…
X AI KOLs Timeline · 10h ago

Analyzes a new AI development workflow shared by Anthropic employee Thariq, highlighting how replacing Markdown with HTML and SVG can dramatically improve multi-agent collaboration and interaction efficiency, offering a model better suited to human-AI synergy in the AI era.

METR evaluated an early version of Claude Mythos
Reddit r/singularity · 10h ago

METR evaluated an early version of Claude Mythos Preview in March 2026 using their time-horizons task suite, estimating a 50%-time-horizon of at least 16 hours, indicating the model is at the upper end of what current benchmarks can measure, with caveats about stability at longer time ranges.

@libapi_: Today, Hermes Agent secured the number one spot globally. This isn't just a ranking—it reflects the combined push from the open-source community, developers, contributors, and every real user. I'm also thrilled to see more AI Agent projects on @OpenRouter gaining visibility. CLI, Personal Agents, automated workflows, …
X AI KOLs Timeline · 10h ago

Hermes Agent tops the global rankings, highlighting the collaborative drive of the open-source community and developers, while signaling that the AI Agent ecosystem is rapidly scaling across platforms like OpenRouter.

Submit Feedback