Papers

The End of Code Review: Coding Agents Supersede Human Inspection

Hacker News Top ↗ · 1h ago Cached

This paper argues that LLM-based coding agents have reached a capability threshold making human code review redundant, and proposes replacing human inspection with agent-driven verification to reduce costs and latency.

A Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B. Pre-token hidden state shift as an alignment policy traversal vector in instruction-tuned LLMs

Reddit r/AI_Agents ↗ · 1h ago

This paper investigates an alignment vulnerability in instruction-tuned LLMs, specifically Gemma-3-12B, by showing that pre-token hidden state shifts can act as an alignment policy traversal vector, potentially enabling bypass of safety measures.

F3

Hacker News Top ↗ · 2h ago Cached

F3 is a next-generation open-source data file format that uses embedded WebAssembly decoders for interoperability and extensibility, addressing limitations of legacy formats like Parquet. It is currently a research prototype, described in a paper published by ACM.

I mapped the KLD of KV cache quantization for Qwen3.6-35B-A3B and Gemma4-E2B QAT

Reddit r/LocalLLaMA ↗ · 4h ago

The author maps the Kullback-Leibler divergence of KV cache quantization for the Qwen3.6-35B-A3B and Gemma4-E2B QAT models.

Agent Profiles Make AI Runs Safer, More Focused and Reusable

Reddit r/artificial ↗ · 5h ago

Agent Profiles is a new method that enhances AI safety, focus, and reusability by defining structured profiles for AI agents.

Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

Hacker News Top ↗ · 5h ago Cached

Lift4D is a test-time optimization framework that reconstructs complete 4D geometry, appearance, and deformation of dynamic objects from a single monocular in-the-wild video, improving over prior methods on challenging sequences with occlusions and non-rigid motion.

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069424192274252094

X AI KOLs Timeline ↗ · 5h ago Cached

Microsoft's NextLat introduces a training objective that rewards belief-state representations instead of relying solely on next-token prediction, pushing models toward compact world models for better generalization.

@Gracker_Gao: AI Papers: Strong AI Doesn't Write Code by Writing Code. Two recent arXiv papers reveal a counterintuitive finding: when encountering an unfamiliar programming language, GPT-5.4 and Claude Opus 4.6 don't directly write code in the target language; instead, they write a Python program to generate the target code, then debug it locally. This "meta-…

X AI KOLs Timeline ↗ · 10h ago Cached

Two recent arXiv papers found that GPT-5.4 and Claude Opus 4.6 use a metaprogramming strategy when handling unfamiliar programming languages: they generate the target code with a Python program and debug it locally rather than writing the target language directly. The papers report that this strategy separates top-tier agents from average ones, and that strategy sophistication matters more than model parameter count.

Show HN: Neural Particle Automata

Hacker News Top ↗ · 11h ago Cached

Introduces Neural Particle Automata, a method for learning self-organizing particle dynamics. Perception based on smoothed particle hydrodynamics gives each particle a local perception vector that feeds its update rule, analogous to Neural Cellular Automata but operating on continuous particle positions.

AI Built a Nuke and Still Lost

Hacker News Top ↗ · 11h ago Cached

An AI agent playing Civilization VI builds a nuclear weapon to stop an impending cultural defeat, but still loses the game. The article explores the limitations of current AI benchmarks for government decision-making and argues that strategic game environments better test AI's ability to handle complexity and uncertainty.

What a model reads beforehand changes how it answers later - and you can see it in the hidden states

Reddit r/artificial ↗ · 13h ago

This post reports an observation that reading a long, structured text before answering alters a model's later responses, with behavioral evidence from Claude and mechanistic analysis on open-weight Gemma models showing separable hidden states and sharper probability distributions in instruction-tuned variants.

What you read before a question changes how a language model answers it — even when the question has nothing to do with what you read. Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B

Reddit r/ArtificialInteligence ↗ · 13h ago

The article reports a potential alignment vulnerability in LLMs where processing a structured passage before an unrelated question can alter the model's response, with mechanistic evidence from Gemma-3-12B showing hidden-state separation.

VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

Hacker News Top ↗ · 17h ago Cached

This technical report introduces VibeThinker-3B, a 3B parameter dense model that achieves frontier-level reasoning performance on benchmarks like AIME26 and LiveCodeBench, matching or exceeding much larger models such as DeepSeek V3.2 and GLM-5 through a combination of curriculum-based SFT, multi-domain RL, and offline self-distillation.

Thermodynamic Measure Of Intelligence

Reddit r/singularity ↗ · 18h ago Cached

This paper proposes a thermodynamic measure of intelligence defined as 'rare-valid lift' and argues that recursive self-simulation is necessary and nearly sufficient for high thermodynamic intelligence, making intelligence measurable on a universal scale.

Prompt Injection as Role Confusion

Simon Willison's Blog ↗ · 19h ago Cached

A research paper shows that LLMs suffer from 'role confusion': they prioritize the style of a piece of text over its actual role tags, which enables prompt injection attacks. Destyling text reduces attack success from 61% to 10%, indicating a fundamental challenge for LLM security.

A Source of Mysterious Repeating Radio Signals From Space Has Been Identified

Wired ↗ · 21h ago Cached

An international research team identified the source of a mysterious repeating radio signal as a white dwarf pulling material from a companion red dwarf, solving a long-standing astronomical puzzle.

@Ankur_Samanta_: New work on credit assignment in multi-step reasoning RL post-training. Introducing Self-Reset Policy Optimization (SRPO…

X AI KOLs Timeline ↗ · yesterday Cached

Self-Reset Policy Optimization (SRPO) addresses credit assignment in multi-step reasoning RL post-training by localizing the first wrong reasoning step and learning from counterfactual continuations without external supervision.

Prompt Injection as Role Confusion

Hacker News Top ↗ · yesterday Cached

This paper presents a theory that prompt injection attacks on LLMs stem from a fundamental flaw in how models perceive roles, treating roles as a type system for language. It explains existing attacks, predicts new ones, and proposes a research agenda for a science of roles.

Attention Is All You Need

Reddit r/ArtificialInteligence ↗ · yesterday

A reflection on the landmark 'Attention Is All You Need' paper, highlighting how removing recurrence and relying solely on attention mechanisms revolutionized AI and led to modern LLMs like GPT and Claude.

Revised: Estimated share of newly written code exposed to AI generation and review

Reddit r/singularity ↗ · yesterday

This paper revises the estimated proportion of newly written code that is generated or reviewed by AI, analyzing its impact on software development.
