Trending stories ranked by heat, importance and recency.
This paper investigates an alignment vulnerability in instruction-tuned LLMs, specifically Gemma-3-12B, by showing that pre-token hidden state shifts can act as an alignment policy traversal vector, potentially enabling bypass of safety measures.
Tesla Model 3 and Model Y have been ranked as the #1 and #2 most American-made vehicles for 2026, marking the sixth consecutive year Tesla leads the list.
A diffusion model that can transform any image into an interactive, playable hallucination, running locally on user hardware.
An analysis questioning whether OpenRouter's API pricing for open models like GLM-5.2 implies more aggressive quantization than assumed, given the economics of running large models on expensive hardware like 8xH200.
This article describes the loop engineering cycle for AI Product Managers, emphasizing building reusable systems that improve over time rather than one-off prompts.
Discussion on the need for local safety boundaries in AI coding agents to prevent unauthorized file access or command execution.
The article explores how AI is used in education and whether AI editing tools can replace human editors, seeking insights on their real benefits.
Mozilla and Cloudflare are collaborating with other browsers on a new initiative to combat bot abuse while preserving user privacy, proposing a rate-limiting approach with anonymous vouching instead of invasive verification methods like CAPTCHAs or Web Environment Integrity.
Cloudflare is working with major browsers to create a new privacy-first protocol for the global internet.
GLM 5.2 delivers major performance gains on Mac Studio with 512GB RAM, achieving prefill speeds above 100 t/s at high context lengths and enabling 4-bit quantization for contexts over 100k tokens, as detailed in a pull request by the oMLX creator.
Former White House AI advisor Dean Ball argues that China's efforts to achieve AI chip independence are largely performative and not substantive.
A discussion on the methodologies and challenges involved in evaluating AI features once they are deployed in production environments.
At least seven Chinese companies are shipping H100/H200-class AI accelerators, most having recently IPO'd, with several founded by former NVIDIA/AMD architects. Huawei's Ascend 950 targets H200-class performance, and China's domestic market share is rising as NVIDIA's declines.
Netflix is releasing an interactive horror game called Unhinged for its TV gaming platform, developed by Night School Studio, using a smartphone as controller. The game is designed to be simple and approachable, blurring the line between game and movie.
A benchmark of 8 LLMs for medical scribing found hallucinations rare but omissions a concern.
The article analyzes the unsustainable economics of AI platforms, revealing massive subsidies where companies like OpenAI and Anthropic lose billions by charging far below cost, leading to an affordability crisis.
KroWork turns AI chat interactions into reusable desktop applications that run locally without consuming tokens when restarted, allowing non-technical users to create deployable software via natural language.
Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.
Explains how prompt caching works in LLMs, using Claude as a case study, detailing the transformer's KV cache mechanism and the cost benefits of caching static prefixes in agentic workflows.
The article examines the cost of building a PC comparable to Valve's Steam Machine using off-the-shelf parts, concluding that a similar build is more expensive and larger, despite using comparable components.