Papers

@rohanpaul_ai: This paper argues that intelligence is the ability to make rare but valid futures more likely. So an intelligent system…

X AI KOLs Following ↗ · 1h ago Cached

This paper proposes a thermodynamic measure of intelligence, defining intelligence as the ability to make rare but valid futures more likely. It introduces a metric called 'rare-valid lift' that quantifies how much more often a system produces unlikely but acceptable outcomes compared to a passive baseline.

0 favorites 0 likes

@rohanpaul_ai: LLMs often cannot tell when an attack made them say something unsafe. Asking an LLM whether its own previous answer was…

X AI KOLs Timeline ↗ · 2h ago Cached

This paper investigates whether LLMs can reliably self-report when their outputs have been compromised by adversarial prefills, finding that models often cannot distinguish between compromised and intentional outputs, and their limited recognition stems from normal refusal behavior rather than true self-awareness.

0 favorites 0 likes

@yoheinakajima: ActiveGraph: 1 month in: Paper #1: The Log is the Agent 3 LongMemEval Experiments Paper #2: Regimes, self-improvement l…

X AI KOLs Following ↗ · 4h ago Cached

ActiveGraph announces two new papers on agent memory (LongMemEval) and self-improvement regimes, along with reference agents, pack templates, and upcoming meetups in Seattle and San Francisco.

0 favorites 0 likes

Plants appear to detect the patter of falling rain

MIT Technology Review ↗ · 6h ago Cached

MIT engineers discovered that rice seeds germinate 30-40% faster when exposed to the sound vibrations of falling rain, providing the first direct evidence that plant seeds can sense sound as a cue for optimal growth depth.

0 favorites 0 likes

Engineered “mini livers” could be injected as an alternative to transplantation

MIT Technology Review ↗ · 6h ago Cached

MIT researchers developed injectable hydrogel microspheres that, combined with hepatocytes, form stable mini livers in mice, potentially offering a non-surgical alternative to liver transplantation.

0 favorites 0 likes

Ultrasound imaging turns a robot hand into a skillful mimic

MIT Technology Review ↗ · 6h ago Cached

MIT researchers developed a wristband with ultrasound stickers that images muscles and tendons, using AI to translate those images into hand movements to wirelessly control a robotic hand with high dexterity.

0 favorites 0 likes

Super Mario is mathier than you think

MIT Technology Review ↗ · 6h ago Cached

Research from the MIT Hardness Group proves that Super Mario levels can be undecidable, meaning no computer program can always determine if Mario can reach the castle, placing Super Mario in the hardest complexity class.

0 favorites 0 likes

I Figured Out What Causes 'Super Weights'

Reddit r/ArtificialInteligence ↗ · 6h ago

Explains that super weights in large language models arise from the SoftMax-Attention interaction creating a 'Nothing Dump' token that serves as a stable reference point; removing these weights cripples performance.

0 favorites 0 likes

OpenMythos benchmarks

Reddit r/LocalLLaMA ↗ · 8h ago

OpenMythos introduces a new open-source benchmark for evaluating AI models on mythological knowledge.

0 favorites 0 likes

The End of Code Review: Coding Agents Supersede Human Inspection

Hacker News Top ↗ · 8h ago Cached

This paper argues that LLM-based coding agents have reached a capability threshold making human code review redundant, and proposes replacing human inspection with agent-driven verification to reduce costs and latency.

0 favorites 0 likes

Certainty Is All You Need

Reddit r/artificial ↗ · 8h ago

This paper introduces a new approach leveraging certainty in transformer models, building on the 'Attention Is All You Need' paradigm.

0 favorites 0 likes

@_akhaliq: paper:

X AI KOLs Following ↗ · 8h ago Cached

This technical report presents Ling-2.6 and Ring-2.6, a family of trillion-parameter models designed for efficient and instant agentic intelligence, featuring architectural upgrades like hybrid linear attention and specialized training methods including KPop reinforcement learning. All checkpoints are open-sourced.

0 favorites 0 likes

Brain-inspired AI architecture could computing faster and far less power-hungry

Reddit r/singularity ↗ · 9h ago

A brain-inspired AI architecture promises to deliver faster computing while consuming far less power, potentially advancing energy-efficient AI hardware.

0 favorites 0 likes

A Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B . Pre-token hidden state shift as an alignment policy traversal vector in instruction-tuned LLMs

Reddit r/AI_Agents ↗ · 9h ago

This paper investigates an alignment vulnerability in instruction-tuned LLMs, specifically Gemma-3-12B, by showing that pre-token hidden state shifts can act as an alignment policy traversal vector, potentially enabling bypass of safety measures.

0 favorites 0 likes

F3

Hacker News Top ↗ · 10h ago Cached

F3 is a next-generation open-source data file format that uses embedded WebAssembly decoders for interoperability and extensibility, addressing limitations of legacy formats like Parquet. It is currently a research prototype from a paper published in ACM.

0 favorites 0 likes

I mapped the KLD of KV cache quantization for Qwen3.6-35B-A3B and Gemma4-E2B QAT

Reddit r/LocalLLaMA ↗ · 11h ago

The author maps the Kullback-Leibler divergence of KV cache quantization for the Qwen3.6-35B-A3B and Gemma4-E2B QAT models.

0 favorites 0 likes

Agent Profiles Make AI Runs Safer, More Focused and Reusable

Reddit r/artificial ↗ · 12h ago

Agent Profiles is a new method that enhances AI safety, focus, and reusability by defining structured profiles for AI agents.

0 favorites 0 likes

Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

Hacker News Top ↗ · 12h ago Cached

Lift4D is a test-time optimization framework that reconstructs complete 4D geometry, appearance, and deformation of dynamic objects from a single monocular in-the-wild video, improving over prior methods on challenging sequences with occlusions and non-rigid motion.

0 favorites 0 likes

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069424192274252094

X AI KOLs Timeline ↗ · 12h ago Cached

Microsoft's NextLat introduces a training objective that rewards belief-state representations instead of relying solely on next-token prediction, pushing models toward compact world models for better generalization.

0 favorites 0 likes

@nablabio: Today, we expand zero-shot drug design beyond binding to the design of multifunctional medicines, the intracellular pro…

X AI KOLs Following ↗ · 14h ago Cached

Nabla Bio unveils JAM-2, a model for zero-shot drug design achieving atomic-precision, computationally designed multispecific antibodies and dual-variant KRAS multispecifics with high potency and selectivity, validated with Cryo-EM and wet-lab experiments.

0 favorites 0 likes

Papers

Submit Feedback