WorldKernel: A World Model is the Coupling Kernel of Admissible Possible Worlds
Summary
The paper identifies a failure mode where predictors collapse to a point on unidentified counterfactual couplings and proposes a framework using a positive semidefinite coupling kernel to bound counterfactuals, showing that prediction cannot represent uncertainty over cross-world couplings and that enforcing kernel constraints yields tractable bounds.
View Cached Full Text
Cached at: 06/10/26, 06:18 AM
# A World Model is the Coupling Kernel of Admissible Possible Worlds Admissibility on the Diagonal, Counterfactual Coupling on the Off-Diagonal, and Ontology-Structured Bounds up to a Sly–Sun Counting Barrier
Source: [https://arxiv.org/html/2606.10934](https://arxiv.org/html/2606.10934)
\(June 2026\)
###### Abstract
A common assumption holds that enough observational and interventional data, given to a strong enough predictor, is effectively enough\. We report a recurring failure mode that contradicts it: across hundreds of structural causal models, on quantities the data identify a strong predictor and a Bayesian baseline succeed, but on*unidentified*quantities, the couplings between counterfactual worlds, the predictor collapses to a point, on28%28\\%of models to one no valid model can produce, while the truth is an admissible interval more data never narrows\. The gap is structural: prediction cannot represent uncertainty over counterfactual couplings\. We cast a world model as a single positive semidefinite*coupling kernel*KE\(T,T′\)K\_\{E\}\(T,T^\{\\prime\}\)over admissible worlds, whose diagonal is the ordinary posterior \(what a predictor recovers\) and whose*off\-diagonal*is the cross\-world coupling it cannot, and which every counterfactual reads\. The paper is the theory of that off\-diagonal, organised as five questions about one object\.*Is it real?*Two states with identical posteriors differ on a cross\-world query, and the off\-diagonal coincides with the coupling that fixes counterfactuals\.*Can we bound it?*Its positive semidefiniteness is partial\-identifying information the marginals lack; enforcing it \(a semidefinite relaxation\) bounds counterfactuals in polynomial time where the exact response\-type program is intractable, returning the exact bound at scales that program cannot reach\.*Can logical structure sharpen it?*Ontology axioms enter as further constraints and tighten the bound by up to a third, propagating through positive semidefiniteness to couplings they never touch, structure flat causal\-bounds tools cannot use\.*Can we acquire it?*Targeted “scars”, constraints learned from encountered infeasibilities, close the gap up to4×4\\timesfaster than untargeted ones \(the margin narrowing as both saturate\), with a spectral\-gap\-restoration guarantee\.*When is it reachable?*Accessing it is approximate counting of the admissible worlds, tractable below the Sly–Sun threshold and inapproximable above; we do not claim to beat the worst case, and we measure exactly where the phase\-extremal recourse works \(few\-phase\) and provably stalls \(the glass\)\. A world model is the coupling kernel of what can consistently be; its off\-diagonal is the part prediction will never supply\.
## 1Introduction
We began not from a definition but from a failure\. A widespread working assumption is that observational and interventional data, in sufficient quantity and given to a sufficiently strong predictor, suffice to reason about a domain\. Section[4\.4](https://arxiv.org/html/2606.10934#S4.SS4)reports a battery of three hundred structural causal models on which this assumption breaks in a specific, repeatable way\. Each model is queried for a counterfactual\. When the query is identified by observational and interventional data, a strong language model, a Bayesian structural\-causal baseline, and the representation of this paper all answer it\. When the query is*not*identified, the correct answer is properly an interval of admissible values, and the systems diverge: the Bayesian baseline emits one feasible point that is frequently far from truth and whose sign is unstable, while the language model emits a point that on28%28\\%of models is not even feasible, a value no structural causal model can produce\. More data does not help, because the unidentified quantity, the coupling between counterfactual worlds, is not a function of any observational or interventional distribution\. The failure is therefore not statistical but representational:*prediction cannot represent uncertainty over counterfactual couplings\.*
This paper develops the minimal object that can represent that uncertainty\. A world model is a single positive semidefinite*coupling kernel*
KE\(T,T′\)=⟨T∣ρE∣T′⟩K\_\{E\}\(T,T^\{\\prime\}\)=\\langle T\\mid\\rho\_\{E\}\\mid T^\{\\prime\}\\rangleover complete admissible worldsT,T′T,T^\{\\prime\}, conditioned on evidenceEE\. The diagonalKE\(T,T\)=μE\(T\)K\_\{E\}\(T,T\)=\\mu\_\{E\}\(T\)is the classical posterior over admissible worlds, exactly what observational and interventional data, hence any predictor, can recover\. The off\-diagonalKE\(T,T′\)K\_\{E\}\(T,T^\{\\prime\}\)links alternative worlds, and it is exactly the cross\-world coupling the predictor cannot represent\. Time links stages within a world; this kernel links worlds\. The posterior is only the diagonal shadow of the kernel, and the failure mode above is precisely the absence of the off\-diagonal\.
#### One object\.
This paper is about a single object: the*off\-diagonal*of the world kernel, the cross\-world couplingKE\(T,T′\)K\_\{E\}\(T,T^\{\\prime\}\)that links alternative worlds\. A predictor recovers only the diagonal, the posterior over what is the case; the off\-diagonal is the one thing it cannot represent, and it is exactly what a counterfactual reads\. Everything else in the paper is the theory of that object, organised as the questions any account of it must answer\.
1. 1\.*Is it real, or just prediction in disguise?*Real and separate\. Worlds can be predictively identical yet admissibility\-distinct \(Section[2](https://arxiv.org/html/2606.10934#S2)\), and two states with identical posteriors can differ on a cross\-world query \(Section[3](https://arxiv.org/html/2606.10934#S3)\); empirically the off\-diagonal coincides with the coupling that fixes counterfactuals, which strong predictors collapse \(Section[4](https://arxiv.org/html/2606.10934#S4)\)\.
2. 2\.*Can we bound it from data?*Yes, and the kernel’s own structure does the work\. Positive semidefiniteness is partial\-identifying information the marginals discard, tightening counterfactual bounds in polynomial time where the exact program is intractable \(Section[4\.5](https://arxiv.org/html/2606.10934#S4.SS5)\)\.
3. 3\.*Can logical structure sharpen it?*Yes\. An ontology’s axioms enter as further constraints and tighten the bound by up to a third, propagating to couplings they never touch, structure flat causal\-bounds tools cannot use \(Section[4\.6](https://arxiv.org/html/2606.10934#S4.SS6)\)\.
4. 4\.*Can we acquire it from experience?*Yes\. Targeted “scars”, constraints learned from encountered infeasibilities, close the identifiability gap up to4×4\\timesfaster than untargeted ones \(narrowing as both saturate\), with a Cheeger guarantee \(Section[5\.2](https://arxiv.org/html/2606.10934#S5.SS2)\)\.
5. 5\.*When is it reachable?*Sharply bounded\. Accessing it is approximate counting of the admissible worlds, tractable below the Sly–Sun threshold and inapproximable above it, the order parameter\(d−1\)η\(d\-1\)\\etacrossing11between constraint\-graph degree55and66\(Section[6](https://arxiv.org/html/2606.10934#S6)\)\.
One object, five questions, not five ideas\. Section[7](https://arxiv.org/html/2606.10934#S7)states what the object buys: intelligence as closure\-preserving counterfactual competence\.
#### What is new and what is borrowed\.
We do not claim a new sampling\-to\-counting theorem, a new shielding method, or a new partial\-identification algorithm\. The contribution is the synthesis: the identification of a single object \(the positive semidefinite kernel\), the proof that its off\-diagonal is a distinct and counterfactually meaningful direction, the demonstration that its semidefinite structure tightens counterfactual bounds beyond the marginals on instances where the exact response\-type program is intractable, the placement of a known approximate\-counting barrier as the boundary for full off\-diagonal access, and the empirical demonstration that the off\-diagonal is load\-bearing for causal reasoning where strong predictors collapse\.
## 2Possible worlds, queries, and the insufficiency of prediction
###### Definition 1\(Worlds and query classes\)\.
LetΩ\\Omegabe a set of complete admissible worlds \(or histories\)\. A query classQQis a set of functionsq:Ω→Yqq:\\Omega\\to Y\_\{q\}\. We distinguish predictive queriesQpredQ\_\{\\mathrm\{pred\}\}\(e\.g\.p\(ot\+1∣o≤t,at\)p\(o\_\{t\+1\}\\mid o\_\{\\leq t\},a\_\{t\}\)\), entailment queriesQentQ\_\{\\mathrm\{ent\}\}\(𝒪⊧φ\\mathcal\{O\}\\models\\varphi\), admissibility queriesQadmQ\_\{\\mathrm\{adm\}\}\(a∈Γ𝒪\(s\)a\\in\\Gamma\_\{\\mathcal\{O\}\}\(s\)\), intervention queriesQcausQ\_\{\\mathrm\{caus\}\}\(P\(Y∣do\(a\),E\)P\(Y\\mid do\(a\),E\)\), and kernel queriesQkerQ\_\{\\mathrm\{ker\}\}\(KE\(ω,ω′\)K\_\{E\}\(\\omega,\\omega^\{\\prime\}\)\)\. The world query class isQworld=Qpred∪Qent∪Qadm∪Qcaus∪QkerQ\_\{\\mathrm\{world\}\}=Q\_\{\\mathrm\{pred\}\}\\cup Q\_\{\\mathrm\{ent\}\}\\cup Q\_\{\\mathrm\{adm\}\}\\cup Q\_\{\\mathrm\{caus\}\}\\cup Q\_\{\\mathrm\{ker\}\}\.
###### Definition 2\(Query equivalence and completeness\)\.
For a classQQ, setω∼Qω′\\omega\\sim\_\{Q\}\\omega^\{\\prime\}iffq\(ω\)=q\(ω′\)q\(\\omega\)=q\(\\omega^\{\\prime\}\)for allq∈Qq\\in Q\. A representationϕ:Ω→Z\\phi:\\Omega\\to ZisQQ\-complete if for eachq∈Qq\\in Qthere is a decoderfqf\_\{q\}withq=fq∘ϕq=f\_\{q\}\\circ\\phi\.
###### Theorem 1\(Query quotient\)\.
The quotient mapπQ:Ω→Ω/∼Q\\pi\_\{Q\}:\\Omega\\to\\Omega/\\\!\\sim\_\{Q\}isQQ\-complete, and anyQQ\-completeϕ\\phifactors through it:ϕ\(ω\)=ϕ\(ω′\)⇒ω∼Qω′\\phi\(\\omega\)=\\phi\(\\omega^\{\\prime\}\)\\Rightarrow\\omega\\sim\_\{Q\}\\omega^\{\\prime\}\. HenceΩ/∼Q\\Omega/\\\!\\sim\_\{Q\}is the coarsest representation sufficient forQQ\.
###### Proof\.
Definefq\(\[ω\]\)=q\(ω\)f\_\{q\}\(\[\\omega\]\)=q\(\\omega\), well defined since\[ω\]=\[ω′\]\[\\omega\]=\[\\omega^\{\\prime\}\]impliesq\(ω\)=q\(ω′\)q\(\\omega\)=q\(\\omega^\{\\prime\}\); thenq=fq∘πQq=f\_\{q\}\\circ\\pi\_\{Q\}\. Conversely ifϕ\\phiis complete andϕ\(ω\)=ϕ\(ω′\)\\phi\(\\omega\)=\\phi\(\\omega^\{\\prime\}\)thenq\(ω\)=fq\(ϕ\(ω\)\)=fq\(ϕ\(ω′\)\)=q\(ω′\)q\(\\omega\)=f\_\{q\}\(\\phi\(\\omega\)\)=f\_\{q\}\(\\phi\(\\omega^\{\\prime\}\)\)=q\(\\omega^\{\\prime\}\)for allqq, soω∼Qω′\\omega\\sim\_\{Q\}\\omega^\{\\prime\}\. ∎
###### Corollary 1\(Formal intelligence criterion\)\.
IfQpred⊊QworldQ\_\{\\mathrm\{pred\}\}\\subsetneq Q\_\{\\mathrm\{world\}\}strictly on a domain, a predictive representation can be complete for prediction yet incomplete for world understanding\. World competence requires a representation at least as fine asΩ/∼Qworld\\Omega/\\\!\\sim\_\{Q\_\{\\mathrm\{world\}\}\}\.
###### Theorem 2\(Predictive insufficiency\)\.
There exist worldsW0,W1W\_\{0\},W\_\{1\}that are predictively equivalent but not world\-equivalent\. No representation trained only to be prediction\-complete is guaranteed complete for world understanding\.
###### Proof\.
Take a deterministic arena with statesSS, actionsAA, transitionT:S×A→ST:S\\times A\\to S, and a regionL⊂SL\\subset Sthat is physically traversable\. LetW0=\(S,A,T,O,𝒪0\)W\_\{0\}=\(S,A,T,O,\\mathcal\{O\}\_\{0\}\)andW1=\(S,A,T,O,𝒪1\)W\_\{1\}=\(S,A,T,O,\\mathcal\{O\}\_\{1\}\)share the same dynamics and observation map, with𝒪0\\mathcal\{O\}\_\{0\}entailing no restriction onLLand𝒪1\\mathcal\{O\}\_\{1\}entailing that transitions intoLLare forbidden\. All predictive conditionals over observations coincide, soW0,W1W\_\{0\},W\_\{1\}are predictively equivalent\. Chooses,as,awithT\(s,a\)∈LT\(s,a\)\\in L\. Thena∈Γ𝒪0\(s\)a\\in\\Gamma\_\{\\mathcal\{O\}\_\{0\}\}\(s\)buta∉Γ𝒪1\(s\)a\\notin\\Gamma\_\{\\mathcal\{O\}\_\{1\}\}\(s\), so the admissibility query separates them\. By Theorem[1](https://arxiv.org/html/2606.10934#Thmtheorem1)any admissibility\-complete representation must separate them, and a prediction\-only representation has no information to do so\. ∎
#### The diagonal is certifiable\.
The diagonal of the kernel is admissibility, and it can be maintained with a guarantee independent of the learned model\. LetΦ𝒪⊆S\\Phi\_\{\\mathcal\{O\}\}\\subseteq Sbe the entailed forbidden set andV𝒪V\_\{\\mathcal\{O\}\}the entailed transition\-violation predicate\.
###### Theorem 3\(Closure\-preserving horizon soundness\)\.
Let a projectorg𝒪g\_\{\\mathcal\{O\}\}be single\-step sound: fors∉Φ𝒪s\\notin\\Phi\_\{\\mathcal\{O\}\}and any proposed actionaa,g𝒪\(s,a\)∉Φ𝒪g\_\{\\mathcal\{O\}\}\(s,a\)\\notin\\Phi\_\{\\mathcal\{O\}\}and¬V𝒪\(s,g𝒪\(s,a\)\)\\neg V\_\{\\mathcal\{O\}\}\(s,g\_\{\\mathcal\{O\}\}\(s,a\)\)\. Ifs0∉Φ𝒪s\_\{0\}\\notin\\Phi\_\{\\mathcal\{O\}\}andst\+1=g𝒪\(st,at\)s\_\{t\+1\}=g\_\{\\mathcal\{O\}\}\(s\_\{t\},a\_\{t\}\)for an arbitrary, possibly arbitrarily inaccurate action sequence, thenst∉Φ𝒪s\_\{t\}\\notin\\Phi\_\{\\mathcal\{O\}\}and¬V𝒪\(st,st\+1\)\\neg V\_\{\\mathcal\{O\}\}\(s\_\{t\},s\_\{t\+1\}\)for alltt\.
###### Proof\.
Induction\. Bases0∉Φ𝒪s\_\{0\}\\notin\\Phi\_\{\\mathcal\{O\}\}\. Ifst∉Φ𝒪s\_\{t\}\\notin\\Phi\_\{\\mathcal\{O\}\}, single\-step soundness givesg𝒪\(st,at\)∉Φ𝒪g\_\{\\mathcal\{O\}\}\(s\_\{t\},a\_\{t\}\)\\notin\\Phi\_\{\\mathcal\{O\}\}and¬V𝒪\(st,g𝒪\(st,at\)\)\\neg V\_\{\\mathcal\{O\}\}\(s\_\{t\},g\_\{\\mathcal\{O\}\}\(s\_\{t\},a\_\{t\}\)\), andst\+1=g𝒪\(st,at\)s\_\{t\+1\}=g\_\{\\mathcal\{O\}\}\(s\_\{t\},a\_\{t\}\)\. ∎
The point is that this competence is independent of the learned modelFθF\_\{\\theta\}: an arbitrarily inaccurate predictor still cannot leave the admissible set\. Empirically, in a permissive\-lava arena that we use as the admissibility witness, an unshielded predictor reaches goals by entering a forbidden region on36\.4%36\.4\\%of episodes \(91/25091/250\), while the diagonal projector, using the same wall\-blind learned model, commits zero forbidden transitions over250250episodes \(Figure[1](https://arxiv.org/html/2606.10934#S2.F1)\)\. The same OWL\-derived transition grammar improves competence and not only safety: transplanted onto an unmodified learned planner it raises cross\-room success across a range of wall topologies without retraining\. The entailed constraint is not a safety patch bolted on, it is information the predictor lacked\.
Figure 1:Permissive\-lava arena\. The unshielded planner reaches the goal by crossing the forbidden band; the diagonal projector, on the same learned model, threads the entailed door and never enters the band\.
#### The diagonal curves space\.
Admissibility is not only logical, it is geometric, which is the cleanest way to see that the entailed structure is a property of the action space and not a post\-hoc filter\.
###### Proposition 1\(Entailed geometry\)\.
LetΦ𝒪⊂X⊂ℝ2\\Phi\_\{\\mathcal\{O\}\}\\subset X\\subset\\mathbb\{R\}^\{2\}be the entailed forbidden set andgM=e2M𝟏Φ𝒪δg\_\{M\}=e^\{2M\\mathbf\{1\}\_\{\\Phi\_\{\\mathcal\{O\}\}\}\}\\deltathe conformal metric with barrier heightMM\. Any limit asM→∞M\\to\\inftyof minimizinggMg\_\{M\}\-paths fromaatobbis a shortest path among rectifiable paths avoidingΦ𝒪\\Phi\_\{\\mathcal\{O\}\}, and the Gaussian curvatureK=−e−2ϕΔϕK=\-e^\{\-2\\phi\}\\Delta\\phiconcentrates on∂Φ𝒪\\partial\\Phi\_\{\\mathcal\{O\}\}\.
###### Proof sketch\.
A path avoidingΦ𝒪\\Phi\_\{\\mathcal\{O\}\}hasgMg\_\{M\}\-length equal to its Euclidean length, independent ofMM\. A path spending Euclidean lengthℓM\\ell\_\{M\}insideΦ𝒪\\Phi\_\{\\mathcal\{O\}\}hasgMg\_\{M\}\-length≥eMℓM\\geq e^\{M\\ell\_\{M\}\}, so minimality forcesℓM→0\\ell\_\{M\}\\to 0; on the complement ofΦ𝒪\\Phi\_\{\\mathcal\{O\}\}the metric is Euclidean, so the limit minimizes Euclidean length among avoiding paths\. ∎
Forbidden regions become curvature ridges and entailed openings become flat passes: knowledge has become metric\.
## 3The kernel and its off\-diagonal
LetΩ=𝒯E\\Omega=\\mathcal\{T\}\_\{E\}be the finite set of complete admissible worlds under evidenceEE, withℋ=ℓ2\(𝒯E\)\\mathcal\{H\}=\\ell^\{2\}\(\\mathcal\{T\}\_\{E\}\)and basis\{\|T⟩\}\\\{\\,\|T\\rangle\\,\\\}\. A weighted generator assignswE\(T\)≥0w\_\{E\}\(T\)\\geq 0with normalizerZE=∑TwE\(T\)Z\_\{E\}=\\sum\_\{T\}w\_\{E\}\(T\)and posteriorμE\(T\)=wE\(T\)/ZE\\mu\_\{E\}\(T\)=w\_\{E\}\(T\)/Z\_\{E\}\.
###### Definition 3\(World kernel\)\.
A world kernel is a density operatorρE\\rho\_\{E\}with matrix elementsKE\(T,T′\)=⟨T∣ρE∣T′⟩K\_\{E\}\(T,T^\{\\prime\}\)=\\langle T\\mid\\rho\_\{E\}\\mid T^\{\\prime\}\\ranglewhose diagonal is the classical posterior,KE\(T,T\)=μE\(T\)K\_\{E\}\(T,T\)=\\mu\_\{E\}\(T\)\. The coherent posterior state is\|ΠE⟩=∑TμE\(T\)\|T⟩\|\\Pi\_\{E\}\\rangle=\\sum\_\{T\}\\sqrt\{\\mu\_\{E\}\(T\)\}\\,\|T\\rangle, with pure kernelKEpure\(T,T′\)=μE\(T\)μE\(T′\)K\_\{E\}^\{\\mathrm\{pure\}\}\(T,T^\{\\prime\}\)=\\sqrt\{\\mu\_\{E\}\(T\)\\mu\_\{E\}\(T^\{\\prime\}\)\}\.
The diagonal is ordinary Bayesian uncertainty\. The off\-diagonal says two worlds are not merely alternatives: they can interfere, be coherently transformed, and be compared under non\-classical operations\. We first confirm the kernel is conservative, then that the off\-diagonal is not\.
###### Lemma 1\(Classical shadow\)\.
For any classical eventA⊆𝒯EA\\subseteq\\mathcal\{T\}\_\{E\}withPA=∑T∈A\|T⟩⟨T\|P\_\{A\}=\\sum\_\{T\\in A\}\|T\\rangle\\langle T\|,Tr\(ρEPA\)=∑T∈AKE\(T,T\)\\mathrm\{Tr\}\(\\rho\_\{E\}P\_\{A\}\)=\\sum\_\{T\\in A\}K\_\{E\}\(T,T\)\. If evidenceBBis observed andρB=PBρEPB/Tr\(ρEPB\)\\rho\_\{B\}=P\_\{B\}\\rho\_\{E\}P\_\{B\}/\\mathrm\{Tr\}\(\\rho\_\{E\}P\_\{B\}\), then on the diagonalKB\(T,T\)=KE\(T,T\)/∑T′∈BKE\(T′,T′\)K\_\{B\}\(T,T\)=K\_\{E\}\(T,T\)/\\sum\_\{T^\{\\prime\}\\in B\}K\_\{E\}\(T^\{\\prime\},T^\{\\prime\}\)forT∈BT\\in Band0otherwise\.
###### Proof\.
PAP\_\{A\}is diagonal in the world basis, soTr\(ρEPA\)=∑T∈A⟨T\|ρE\|T⟩\\mathrm\{Tr\}\(\\rho\_\{E\}P\_\{A\}\)=\\sum\_\{T\\in A\}\\langle T\|\\rho\_\{E\}\|T\\rangle\. For the update,⟨T\|PBρEPB\|T⟩=𝟏\[T∈B\]KE\(T,T\)\\langle T\|P\_\{B\}\\rho\_\{E\}P\_\{B\}\|T\\rangle=\\mathbf\{1\}\[T\\in B\]K\_\{E\}\(T,T\); divide byTr\(ρEPB\)=∑T′∈BKE\(T′,T′\)\\mathrm\{Tr\}\(\\rho\_\{E\}P\_\{B\}\)=\\sum\_\{T^\{\\prime\}\\in B\}K\_\{E\}\(T^\{\\prime\},T^\{\\prime\}\)\. ∎
So measuring the diagonal reproduces Bayesian conditioning: the kernel extends, it does not replace, probabilistic ontology\.
###### Example 1\(Conservativity is exact\)\.
Let a generator propose three complete taxonomy worlds with evidence weightsw\(T0\)=6,w\(T1\)=3,w\(T2\)=1w\(T\_\{0\}\)=6,\\,w\(T\_\{1\}\)=3,\\,w\(T\_\{2\}\)=1, soZ=10Z=10andμ=\(0\.6,0\.3,0\.1\)\\mu=\(0\.6,0\.3,0\.1\)\. Suppose an intervention certifier rules outT2T\_\{2\}, accepting the eventB=\{T0,T1\}B=\\\{T\_\{0\},T\_\{1\}\\\}\. Classical conditioning givesμ\(T0∣B\)=6/9=2/3\\mu\(T\_\{0\}\\mid B\)=6/9=2/3andμ\(T1∣B\)=1/3\\mu\(T\_\{1\}\\mid B\)=1/3\. The kernel projectionρB=PBρPB/Tr\(ρPB\)\\rho\_\{B\}=P\_\{B\}\\rho P\_\{B\}/\\mathrm\{Tr\}\(\\rho P\_\{B\}\)has diagonal\(23,13,0\)\(\\tfrac\{2\}\{3\},\\tfrac\{1\}\{3\},0\), identical\. The off\-diagonal changes nothing classical marginal observers can see; it adds structure only off the diagonal\.
###### Proposition 2\(Same diagonal, different kernel\)\.
There exist two world kernels with the same posterior diagonal but different answers to a coherent linkage query\.
###### Proof\.
Let𝒯=\{T0,T1\}\\mathcal\{T\}=\\\{T\_\{0\},T\_\{1\}\\\}\. Takeρdiag=12\|T0⟩⟨T0\|\+12\|T1⟩⟨T1\|\\rho\_\{\\mathrm\{diag\}\}=\\tfrac\{1\}\{2\}\|T\_\{0\}\\rangle\\langle T\_\{0\}\|\+\\tfrac\{1\}\{2\}\|T\_\{1\}\\rangle\\langle T\_\{1\}\|andρcoh=\|Π⟩⟨Π\|\\rho\_\{\\mathrm\{coh\}\}=\|\\Pi\\rangle\\langle\\Pi\|with\|Π⟩=\(\|T0⟩\+\|T1⟩\)/2\|\\Pi\\rangle=\(\|T\_\{0\}\\rangle\+\|T\_\{1\}\\rangle\)/\\sqrt\{2\}\. Both have diagonal\(12,12\)\(\\tfrac\{1\}\{2\},\\tfrac\{1\}\{2\}\)\. For the projectorP\+=\|Π⟩⟨Π\|P\_\{\+\}=\|\\Pi\\rangle\\langle\\Pi\|,Tr\(ρcohP\+\)=1\\mathrm\{Tr\}\(\\rho\_\{\\mathrm\{coh\}\}P\_\{\+\}\)=1whileTr\(ρdiagP\+\)=12\\mathrm\{Tr\}\(\\rho\_\{\\mathrm\{diag\}\}P\_\{\+\}\)=\\tfrac\{1\}\{2\}\. Identical posteriors, different kernels\. ∎
This is the conceptual pivot: “the posterior is the whole story” is incomplete, because the posterior is only the diagonal\. The full object is the kernel, and Section[4](https://arxiv.org/html/2606.10934#S4)shows the off\-diagonal is precisely the cross\-world coupling that fixes counterfactuals, the quantity a posterior cannot carry\.
As a density matrix the kernel stays computable: the Shannon entropy of its diagonalH\(μE\)H\(\\mu\_\{E\}\)is a remaining\-uncertainty scalar that falls to zero as evidence pins a unique world; conditioning is projection\-and\-renormalize at costO\(\|ℳ\(𝒯\)\|2\)O\(\|\\mathcal\{M\}\(\\mathcal\{T\}\)\|^\{2\}\); and the world history is realized concretely as a deterministic event\-graph substrate\[[21](https://arxiv.org/html/2606.10934#bib.bib21)\], with Open Ontologies\[[19](https://arxiv.org/html/2606.10934#bib.bib19)\]supplying the grammar and CIVeX\[[20](https://arxiv.org/html/2606.10934#bib.bib20)\]certifying admissible interventions\. Its positive semidefiniteness is itself identifying information the marginals lack, which Section[4\.5](https://arxiv.org/html/2606.10934#S4.SS5)turns into counterfactual bounds tighter than the marginals allow, in polynomial time, where the exact program is intractable\.
## 4Empirical validation: the off\-diagonal is load\-bearing
We now show, on real computations and against a strong language model used as the predictor baseline, that the off\-diagonal carries information the diagonal cannot, and that it is exactly the counterfactual\. First we make the kernel\-counterfactual correspondence a theorem rather than a metaphor\.
###### Theorem 4\(Kernel–counterfactual isomorphism\)\.
Fix a binary treatment with potential outcomes\(Y0,Y1\)\(Y\_\{0\},Y\_\{1\}\)andv=\(Y0,Y1\)⊤v=\(Y\_\{0\},Y\_\{1\}\)^\{\\top\}\. For a counterfactual lawPPon\{0,1\}2\\\{0,1\\\}^\{2\}letM\(P\)=𝔼P\[vv⊤\]M\(P\)=\\mathbb\{E\}\_\{P\}\[vv^\{\\top\}\], a2×22\\times 2PSD matrix withMii=P\(Yi=1\)M\_\{ii\}=P\(Y\_\{i\}\{=\}1\)andM01=P\(Y0=1,Y1=1\)M\_\{01\}=P\(Y\_\{0\}\{=\}1,Y\_\{1\}\{=\}1\)\. ThenP↦M\(P\)P\\mapsto M\(P\)is a bijection between counterfactual equivalence classes \(laws agreeing on every counterfactual query\) and the set
𝒦=\{M⪰0:Mii∈\[0,1\],M01∈\[max\(0,M00\+M11−1\),min\(M00,M11\)\]\}\.\\mathcal\{K\}=\\Big\\\{\\,M\\succeq 0:\\ M\_\{ii\}\\in\[0,1\],\\ M\_\{01\}\\in\\big\[\\max\(0,M\_\{00\}\{\+\}M\_\{11\}\{\-\}1\),\\ \\min\(M\_\{00\},M\_\{11\}\)\\big\]\\Big\\\}\.Under this isomorphism: \(i\) the rung\-1/2 \(observational and interventional\) identifiable data are exactlydiagM\\mathrm\{diag\}M; \(ii\) every counterfactual query is a function ofMM, for instancePN=\(M11−M01\)/M11\\mathrm\{PN\}=\(M\_\{11\}\-M\_\{01\}\)/M\_\{11\}; \(iii\) two laws with equaldiagM\\mathrm\{diag\}Mbut different off\-diagonalM01M\_\{01\}are observationally and interventionally indistinguishable yet counterfactually distinct\.
###### Proof\.
The four cells recover fromMMbyp11=M01p\_\{11\}=M\_\{01\},p10=M00−M01p\_\{10\}=M\_\{00\}\-M\_\{01\},p01=M11−M01p\_\{01\}=M\_\{11\}\-M\_\{01\},p00=1−M00−M11\+M01p\_\{00\}=1\-M\_\{00\}\-M\_\{11\}\+M\_\{01\}, soP↦\(M00,M11,M01\)P\\mapsto\(M\_\{00\},M\_\{11\},M\_\{01\}\)is affine and invertible on the simplex; nonnegativity of the four cells is exactly the Fréchet–Hoeffding range onM01M\_\{01\}, and for\{0,1\}\\\{0,1\\\}variablesM=𝔼\[vv⊤\]⪰0M=\\mathbb\{E\}\[vv^\{\\top\}\]\\succeq 0holds automatically, so the image is precisely𝒦\\mathcal\{K\}\. Counterfactual queries are functions ofPP, hence ofMM; the necessity formula isPN=P\(Y0=0∣Y1=1\)=p01/M11\\mathrm\{PN\}=P\(Y\_\{0\}\{=\}0\\mid Y\_\{1\}\{=\}1\)=p\_\{01\}/M\_\{11\}\. Under randomizedXXthe rungs11and22identify the marginalsM00,M11=diagMM\_\{00\},M\_\{11\}=\\mathrm\{diag\}Mand place no constraint onM01M\_\{01\}beyond𝒦\\mathcal\{K\}, giving \(i\) and \(iii\)\. ∎
Algorithm[1](https://arxiv.org/html/2606.10934#alg1)is the constructive content of Theorem[4](https://arxiv.org/html/2606.10934#Thmtheorem4)and Proposition[2](https://arxiv.org/html/2606.10934#Thmproposition2)at once, and is exactly the witness computed in Section[4\.1](https://arxiv.org/html/2606.10934#S4.SS1): it fixes the diagonal, varies only the off\-diagonal across its Fréchet range, and reads off two distinct counterfactuals that no diagonal \(predictor\) answer can match\.
Algorithm 1Off\-diagonal witness and probability of necessity \(proves Thm\.[4](https://arxiv.org/html/2606.10934#Thmtheorem4), Prop\.[2](https://arxiv.org/html/2606.10934#Thmproposition2)\)1:marginals
r0=P\(Y0=1\)r\_\{0\}=P\(Y\_\{0\}\{=\}1\),
r1=P\(Y1=1\)r\_\{1\}=P\(Y\_\{1\}\{=\}1\)⊳\\trianglerightthe rung\-1/2 identifiable diagonal
2:
cmin←max\(0,r0\+r1−1\)c\_\{\\min\}\\leftarrow\\max\(0,r\_\{0\}\{\+\}r\_\{1\}\{\-\}1\),
cmax←min\(r0,r1\)c\_\{\\max\}\\leftarrow\\min\(r\_\{0\},r\_\{1\}\)⊳\\trianglerightFréchet–Hoeffding range ofM01M\_\{01\}
3:forcoupling
c∈\{cmax\(monotonic\),r0r1\(independent\)\}c\\in\\\{\\,c\_\{\\max\}\\ \(\\text\{monotonic\}\),\\ r\_\{0\}r\_\{1\}\\ \(\\text\{independent\}\)\\,\\\}do
4:
M←\(r0ccr1\)M\\leftarrow\\begin\{pmatrix\}r\_\{0\}&c\\\\ c&r\_\{1\}\\end\{pmatrix\}⊳\\trianglerightsame diagonal\(r0,r1\)\(r\_\{0\},r\_\{1\}\)for everycc
5:assert
M⪰0M\\succeq 0and all four cells
pij≥0p\_\{ij\}\\geq 0⊳\\trianglerightM∈𝒦M\\in\\mathcal\{K\}
6:
PN\(c\)←\(M11−M01\)/M11=\(r1−c\)/r1\\mathrm\{PN\}\(c\)\\leftarrow\(M\_\{11\}\-M\_\{01\}\)/M\_\{11\}=\(r\_\{1\}\-c\)/r\_\{1\}
7:endfor
8:predictor:given only
diagM=\(r0,r1\)\\mathrm\{diag\}M=\(r\_\{0\},r\_\{1\}\), returns a single point
p^\\hat\{p\}
9:return
PN\(cmax\)≠PN\(r0r1\)\\mathrm\{PN\}\(c\_\{\\max\}\)\\neq\\mathrm\{PN\}\(r\_\{0\}r\_\{1\}\), yet
p^\\hat\{p\}cannot equal both⊳\\trianglerightdiagonal underdetermines the counterfactual
### 4\.1A counterfactual is an off\-diagonal query
Consider a binary treatmentXXand outcomeYY\. The two potential\-outcome worlds areT0T\_\{0\}\(untreated,Y0Y\_\{0\}\) andT1T\_\{1\}\(treated,Y1Y\_\{1\}\)\. The diagonal of the kernel is the pair of marginalsP\(Y0\),P\(Y1\)P\(Y\_\{0\}\),P\(Y\_\{1\}\), which are exactly what observational and interventional data reveal\. The off\-diagonalK\(T0,T1\)K\(T\_\{0\},T\_\{1\}\)is the cross\-world jointP\(Y0,Y1\)P\(Y\_\{0\},Y\_\{1\}\), the coupling between the two worlds, which no single intervention probes\. The probability of necessityPN=P\(Y0=0∣X=1,Y=1\)\\mathrm\{PN\}=P\(Y\_\{0\}=0\\mid X=1,Y=1\)reads this off\-diagonal\.
We construct two models with identical marginalsP\(Y0=1\)=0\.5P\(Y\_\{0\}\{=\}1\)=0\.5,P\(Y1=1\)=0\.7P\(Y\_\{1\}\{=\}1\)=0\.7\(hence identical observational distribution under randomizedXXand identical interventionalP\(Y∣do\(X\)\)P\(Y\\mid do\(X\)\), average causal effect0\.20\.2in both\), but different cross\-world coupling: modelAAmaximally positively coupled \(monotone, treatment never hurts\), modelBBwith independent potential outcomes\. They givePNA=0\.286\\mathrm\{PN\}\_\{A\}=0\.286andPNB=0\.500\\mathrm\{PN\}\_\{B\}=0\.500\. We posed the question to a strong language model \(Claude, headless, given the full observational and interventional tables\) five times; it returned0\.400\.40on every call, the cross\-world\-independence point estimate\. A single number cannot equal both0\.2860\.286and0\.5000\.500: the predictor collapses two worlds that the off\-diagonal separates \(Figure[2](https://arxiv.org/html/2606.10934#S4.F2)\)\. The LLM is not wrong so much as committed to one coupling without warrant; the separating information is the off\-diagonal it does not carry\.
Figure 2:The off\-diagonal witness\. Both models share rungs11and22\(the same two worldsT0,T1T\_\{0\},T\_\{1\}, the same marginals andACE=0\.20\\mathrm\{ACE\}=0\.20, left box\), then fork on the cross\-world coupling thread alone: monotonic \(Model A\) versus independent \(Model B\)\. That single difference moves the counterfactual fromPN=0\.29\\mathrm\{PN\}=0\.29to0\.500\.50\(blue, the off\-diagonal/kernel values, right\)\. A strong predictor given all identifiable data returns one number,0\.400\.40\(red\), matching neither\. This is Proposition[2](https://arxiv.org/html/2606.10934#Thmproposition2)on Pearl’s ladder: the off\-diagonal is the rung\-33information\.
### 4\.2Scaling to mediation: the sign of an effect is unidentified
We scale to the canonical multivariable case, mediationX→M→YX\\to M\\to Y, with the natural direct effectNDE=P\(Y1,M0=1\)−P\(Y0,M0=1\)\\mathrm\{NDE\}=P\(Y\_\{1,M\_\{0\}\}=1\)\-P\(Y\_\{0,M\_\{0\}\}=1\), a nested cross\-world counterfactual that holds the mediator at its untreated value while setting the outcome’s treatment on\. Using a response\-function representation over6464types, we fix every quantity an experiment can measure \(the per\-arm distribution ofMM, the in\-world joint of\(M,Y\)\(M,Y\)under each arm, and all controlledP\(Y∣do\(X\),do\(M\)\)P\(Y\\mid do\(X\),do\(M\)\)\) and compute the identified interval ofNDE\\mathrm\{NDE\}by linear programming over the response\-type polytope\. The interval is\[−0\.381,\+0\.187\]\[\-0\.381,\+0\.187\], width0\.5680\.568, and it*spans zero*: the same complete experiment is consistent with the direct effect being harmful or helpful, and only the off\-diagonal coupling decides the sign \(Figure[3](https://arxiv.org/html/2606.10934#S4.F3)\)\. The two endpoint models reproduce byte\-identical rungs11and22\. The language model returned−0\.09\-0\.09on every call, one point inside the interval\. The honest statement is not that the model is wrong, it is that the quantity is unidentified and the model hides this; computing the exact0\.5680\.568\-wide interval requires the polytope program, which is the kernel’s job, not the predictor’s\.
Figure 3:MediationX→M→YX\\to M\\to Y\. Fixing everything an experiment measures leaves the natural direct effect unidentified over an interval that spans zero\. The endpoint models \(blue\) share identical rungs11and22; the predictor \(red\) commits to one coupling\.
### 4\.3The barrier meets the measurement on one axis
Consider an aggregate counterfactual over a population of units in which units sharing a cause cannot both be deemed necessary, a mutual\-exclusion constraint graphGGof degreedd\. The admissible joint necessity\-attributions are exactly the independent sets ofGG, and the aggregate quantity \(for instance the expected number of units for whom treatment was necessary\) is a sum of hard\-core occupation marginals\. Belief propagation computes those marginals correctly below the Sly–Sun threshold \(correlation decay\) and fails above it\. Sweeping the degreedd\(at fugacityλ=1\\lambda=1, the odds form of a per\-unit probability of necessity0\.50\.5, which keeps the threshold at the clean value below\), the relative error of the belief\-propagation estimate of the aggregate counterfactual against exact enumeration tracks the cavity order parameter\(d−1\)η\(d\-1\)\\eta, both turning atdc≈5\.14d\_\{c\}\\approx 5\.14\(the Sly–Sun implementation of Figure[11](https://arxiv.org/html/2606.10934#S6.F11)\)\. This is the single figure where the encoding \(independent\-set count\), the barrier \(Sly–Sunη\\eta\), and the measurement \(the counterfactual aggregate\) appear on one axis: what the off\-diagonal decides, and when it is reachable\. We are deliberate about the claim: at finitennthis is a monotonic correspondence, not a measured cliff, and it illustrates the proved asymptotic transition of Section[6](https://arxiv.org/html/2606.10934#S6)rather than independently proving it\.
### 4\.4A structural\-causal\-model battery
To move past single instances we draw300300random structural causal models per query type \(direct effect, queryPN\\mathrm\{PN\}; mediation, queryNDE\\mathrm\{NDE\}\), each a full counterfactual law with computable ground truth\. The principled comparison is between a*diagonal\-only*Bayesian SCM baseline \(assuming cross\-world independence, the standard naive default, which uses only the marginals\) and the*full kernel*\(reporting the identified interval, and, given the coupling, the exact value\); this isolates exactly the off\-diagonal\. As a secondary probe, not a partial\-identification method, we also query a frontier*language model*on the rung\-1/2 data, only to show that a strong predictor inherits the same collapse rather than as a baseline any practitioner would use\. The task is the one causal\-inference researchers care about: recover the admissible counterfactual*class*, not a point\.
Table 1:SCM battery,300300random models per query\. “Infeasible” means the answer lies outside the mathematically possible interval \(no valid SCM produces it\)\. “Covers” is the fraction of models whose true counterfactual lies in the reported interval\. All entries were locked by passing hard assertions \(T1–T5 in the text\)\.The pattern is uniform and was locked by hard assertions, all passing\. The full\-kernel interval covers the truth on100%100\\%of models and, given the coupling, is exact to machine precision: the off\-diagonal is the sufficient statistic \(T1, T2\)\. The diagonal\-only baseline is always feasible but commits to a point far from truth \(meanPN\\mathrm\{PN\}error0\.180\.18\) and gets the*sign*of the mediation effect wrong on27%27\\%of models \(T4\): interventionally identical data, opposite conclusions\. The predictor not only errs \(meanPN\\mathrm\{PN\}error0\.230\.23\) but returns*infeasible*answers on28%28\\%of models \(T3\), point estimates no valid SCM can produce\. It does not guess imprecisely, it hallucinates a point where the truth is a set \(Figure[4](https://arxiv.org/html/2606.10934#S4.F4)\)\.
Figure 4:SCM battery\.*Left:*distribution of the diagonal\-only error on the natural direct effect over300300models, against the full\-kernel oracle \(exact\)\.*Right:*the full kernel covers the truth on every model; the diagonal\-only baseline is sign\-unstable; the predictor returns infeasible answers on more than a quarter of models\.
### 4\.5The kernel’s PSD structure bounds counterfactuals where the exact program cannot
*The second question: can we bound the off\-diagonal from data?*The battery shows the off\-diagonal is necessary; we now show the kernel’s defining property earns its keep computationally\. Writev=\(Y0,…,Yk−1\)v=\(Y\_\{0\},\\dots,Y\_\{k\-1\}\)forkkbinary potential outcomes \(e\.g\.kktreatment arms\) andM=𝔼\[vv⊤\]M=\\mathbb\{E\}\[vv^\{\\top\}\]for the second\-moment matrix: this is the kernel restricted to second order, with diagonalMii=P\(Yi=1\)M\_\{ii\}=P\(Y\_\{i\}\{=\}1\)fixed by akk\-arm trial and off\-diagonalsMij=P\(Yi=1,Yj=1\)M\_\{ij\}=P\(Y\_\{i\}\{=\}1,Y\_\{j\}\{=\}1\)the unidentified cross\-world couplings\. A counterfactual aggregate such asQ=∑i<jP\(Yi=1,Yj=1\)Q=\\sum\_\{i<j\}P\(Y\_\{i\}\{=\}1,Y\_\{j\}\{=\}1\)is a linear functional ofMM\.
###### Proposition 3\(The PSD kernel is a valid, strictly tighter, poly\-time bound\)\.
For any counterfactual query linear inMM, optimizing over\{M⪰0,Mii=di,Mij∈\[max\(0,di\+dj−1\),min\(di,dj\)\]\}\\\{M\\succeq 0,\\ M\_\{ii\}=d\_\{i\},\\ M\_\{ij\}\\in\[\\max\(0,d\_\{i\}\{\+\}d\_\{j\}\{\-\}1\),\\min\(d\_\{i\},d\_\{j\}\)\]\\\}gives an interval that*\(i\)*contains the exact identified set \(the projection of the response\-type polytope\),*\(ii\)*is contained in the marginal/Fréchet bound, and*\(iii\)*is a semidefinite program of sizeO\(k2\)O\(k^\{2\}\), whereas the exact polytope has2k2^\{k\}vertices\.
Positive semidefiniteness ofMMis the constraint the marginals miss: with the diagonal fixed, the off\-diagonals cannot be chosen independently within their Fréchet boxes, because the resultingMMmust remain a genuine second\-moment matrix\. We verify all three claims numerically \(Figure[5](https://arxiv.org/html/2606.10934#S4.F5)\)\. The PSD bound contains the exact identified set on every instance tested \(40/4040/40at eachk≤14k\\leq 14, where the2k2^\{k\}LP is still solvable\)\. It is strictly tighter than the marginal bound, and the gap*grows*with the number of worlds: negligible fork≤6k\\leq 6, then2\.4%2\.4\\%atk=14k=14,5\.1%5\.1\\%atk=20k=20,8\.2%8\.2\\%atk=40k=40\(per\-instance up to11%11\\%\)\. Crucially it is computed by anO\(k2\)O\(k^\{2\}\)semidefinite program throughout: atk=40k=40the exact response\-type LP has240≈1\.1×10122^\{40\}\\approx 1\.1\\times 10^\{12\}variables and is dead, while the kernel SDP has820820entries\. The PSD relaxation is the standard second\-moment instance of the moment/polynomial\-programming approach to causal bounds\[[1](https://arxiv.org/html/2606.10934#bib.bib1)\]; our point is the reading, that this relaxation is nothing but the world kernel’s own positive\-semidefiniteness, so the density\-matrix structure is itself partial\-identifying information, delivering non\-trivial bounds exactly where the exact program is intractable\.
Figure 5:The kernel’s PSD structure as partial\-identifying information, on the counterfactual aggregateQ=∑i<jP\(Yi=1,Yj=1\)Q=\\sum\_\{i<j\}P\(Y\_\{i\}\{=\}1,Y\_\{j\}\{=\}1\)overkk\-arm potential outcomes\.*Left:*the PSD kernel bound \(green\) sits strictly between the trivial marginal/Fréchet bound \(grey\) and the exact response\-type LP \(blue, only while the2k2^\{k\}program is solvable\)\.*Right:*the tightening over the marginal bound grows withkk\(to∼8%\\sim 8\\%atk=40k=40\), while the PSD problem stays polynomial \(∼k2/2\\sim k^\{2\}/2entries\) where the exact LP variable count2k2^\{k\}diverges\.The companion is the query whereMMdoes*not*suffice \(Figure[6](https://arxiv.org/html/2606.10934#S4.F6)\)\. On the third\-order queryP\(Y0=Y1=Y2=1\)P\(Y\_\{0\}\{=\}Y\_\{1\}\{=\}Y\_\{2\}\{=\}1\), recovery under all three arms, the second\-moment bound is strictly looser than the exact response\-type program, and the slack is not noise but a closed\-form law: working out which cell\-nonnegativity theMM\-only bound omits \(only the all\-zero cell\), the slack is exactlymax\(0,min\(minijsij,minidi\)−P^0\)\\max\\\!\\big\(0,\\ \\min\(\\min\_\{ij\}s\_\{ij\},\\min\_\{i\}d\_\{i\}\)\-\\hat\{P\}\_\{0\}\\big\), whereP^0=1−∑idi\+∑ijsij\\hat\{P\}\_\{0\}=1\-\\sum\_\{i\}d\_\{i\}\+\\sum\_\{ij\}s\_\{ij\}is the second\-order inclusion–exclusion estimate of “no arm succeeds”\. Every random instance lies on this hinge to machine precision, positive on24%24\\%:MMis loose by exactly the third\-order room the all\-zero cell leaves, which is the untrackedE\[YiYjYk\]E\[Y\_\{i\}Y\_\{j\}Y\_\{k\}\]\(Remark[2](https://arxiv.org/html/2606.10934#Thmremark2)\)\. One query whereMMis tight, one where its slack is a law, the same machinery\.
Figure 6:MMsuffices for second\-order queries but not third\.*Left:*onP\(Y0=Y1=Y2=1\)P\(Y\_\{0\}\{=\}Y\_\{1\}\{=\}Y\_\{2\}\{=\}1\)the exact response\-type interval \(green\) is strictly inside theMM\-only bound \(grey\); the slack \(red\) is the untracked third momentE\[Y0Y1Y2\]E\[Y\_\{0\}Y\_\{1\}Y\_\{2\}\]\.*Right:*the slack is a closed\-form law,max\(0,min\(minijsij,minidi\)−P^0\)\\max\(0,\\,\\min\(\\min\_\{ij\}s\_\{ij\},\\min\_\{i\}d\_\{i\}\)\-\\hat\{P\}\_\{0\}\)withP^0=1−∑idi\+∑ijsij\\hat\{P\}\_\{0\}=1\-\\sum\_\{i\}d\_\{i\}\+\\sum\_\{ij\}s\_\{ij\}, every random instance on the hinge to machine precision \(positive on24%24\\%\); any≤\\leqsecond\-order query has zero slack\. The companion to the40/4040/40match of Figure[5](https://arxiv.org/html/2606.10934#S4.F5)\.Algorithm 2Counterfactual bound from the kernel’s positive semidefiniteness \(Prop\.[3](https://arxiv.org/html/2606.10934#Thmproposition3)\)1:marginals
d1,…,dkd\_\{1\},\\dots,d\_\{k\}\(the identified diagonal\); linear query
Q\(M\)Q\(M\); sense
∈\{min,max\}\\in\\\{\\min,\\max\\\}
2:variable
M∈𝕊kM\\in\\mathbb\{S\}^\{k\}\(symmetric\)
3:constraints:
M⪰0M\\succeq 0;
Mii=diM\_\{ii\}=d\_\{i\};
Mij∈\[max\(0,di\+dj−1\),min\(di,dj\)\]M\_\{ij\}\\in\[\\max\(0,d\_\{i\}\{\+\}d\_\{j\}\{\-\}1\),\\,\\min\(d\_\{i\},d\_\{j\}\)\]for
i<ji<j
4:solve the SDP
optQ\(M\)s\.t\. the above\\;\\mathrm\{opt\}\\ Q\(M\)\\ \\text\{s\.t\. the above\}⊳\\trianglerightO\(k2\)O\(k^\{2\}\)variables; no2k2^\{k\}blow\-up
5:returnthe optimum⊳\\trianglerightvalid outer bound on the exact identified set, tighter than marginal/Fréchet
### 4\.6Ontology structure tightens the bound: logic the flat tools cannot use
*The third question: can logical structure sharpen the off\-diagonal?*The previous result used only the generic constraint that the kernel is positive semidefinite\. A world model, however, is not a flat list of binary outcomes: its worlds satisfy an ontology, and the ontology’s axioms are*additional*structure on the kernel\. We show this structure does measurable work, and it is structure that no flat causal\-bounds tool \(which sees the outcomes as unconstrained binaries\) can use\. Let the outcomes bemmontology\-governed attributes under two arms; the kernel is the2m×2m2m\\times 2msecond\-moment matrixMM\. Three families of axiom enter as moment constraints: subsumptionAi⊑AjA\_\{i\}\\sqsubseteq A\_\{j\}fixesM\(i,a\),\(j,a\)=di,aM\_\{\(i,a\),\(j,a\)\}=d\_\{i,a\}\(within an arm\); disjointnessAi⊓Aj⊑⊥A\_\{i\}\\sqcap A\_\{j\}\\sqsubseteq\\botfixesM\(i,a\),\(j,a\)=0M\_\{\(i,a\),\(j,a\)\}=0; and a cross\-world domain axiom such as monotone subsumption \(treatment never ejects a unit from a superclass,ap\(0\)⇒ap\(1\)a\_\{p\}^\{\(0\)\}\\\!\\Rightarrow a\_\{p\}^\{\(1\)\}\) fixes the*cross\-arm*entryM\(p,0\),\(p,1\)=dp,0M\_\{\(p,0\),\(p,1\)\}=d\_\{p,0\}\.
###### Proposition 4\(Ontology\-aware bounds dominate ontology\-blind bounds\)\.
Adding the ontology’s subsumption, disjointness, and cross\-world axioms as equality constraints onMMyields a counterfactual bound that is valid, never looser than the ontology\-blind PSD bound of Proposition[3](https://arxiv.org/html/2606.10934#Thmproposition3), and computed by the sameO\(m2\)O\(m^\{2\}\)semidefinite program\.
The effect is large and, crucially, non\-local \(Figure[7](https://arxiv.org/html/2606.10934#S4.F7)\)\. When an axiom touches the queried attribute, the counterfactual is point\-identified: a monotone\-subsumption axiom collapses the probability of necessity for that attribute from a wide interval to a single value \(100%100\\%of instances\)\. The deeper result is propagation: on a multi\-coupling counterfactual aggregate overm=6m=6attributes, axioms placed on*other*attributes tighten the whole bound by32%32\\%on average \(up to40%40\\%\) beyond the ontology\-blind SDP, strictly tighter on every instance, because positive semidefiniteness carries the pinned entries into the unconstrained ones\. The gap persists as the world space2m2^\{m\}grows, where the exact joint program \(with22m2^\{2m\}cells\) is intractable and the SDP is not\. This is the synthesis doing something the parts cannot: flat partial\-identification tools bound counterfactuals from marginals, but they have no slot for “AiA\_\{i\}is a subclass ofAjA\_\{j\}” or “treatment does not eject fromAA”\. The world kernel does, and the logic measurably sharpens the counterfactual\.
Figure 7:Ontology\-structured counterfactual bounds\.*Left:*as more ontology axioms are imposed \(subsumption, disjointness, cross\-world monotonicity\), the ontology\-aware kernel bound \(green\) pulls below the ontology\-blind PSD bound \(grey\) on a multi\-coupling query, a34%34\\%tightening at seven axioms; the constraints on some attributes propagate through positive semidefiniteness to others\.*Right:*the gap persists as the world space2m2^\{m\}grows, where the exact joint program \(22m2^\{2m\}cells\) is intractable while the SDP staysO\(m2\)O\(m^\{2\}\)\. No flat causal\-bounds tool ingests this logical structure\.
### 4\.7A strictly richer capability hierarchy
Collecting the lava result \(admissibility\) and the battery \(counterfactuals\) alongside prediction yields a capability scorecard whose every entry is measured, not asserted \(Table[2](https://arxiv.org/html/2606.10934#S4.T2)\)\. Prediction is shared\. Admissibility separates the predictor \(36\.4%36\.4\\%violations\) from the diagonal and kernel \(both0/2500/250\)\. Counterfactuals separate the diagonal \(sign\-unstable, cannot report the class\) from the kernel \(covers the truth class, exact with the coupling\)\. WorldKernel is therefore not a re\-description of the same capability; it is a strictly larger capability set\.
Table 2:Measured capability hierarchy\. Each ✓/✗ is backed by a number from the lava arena \(admissibility\) or the SCM battery \(counterfactual\)\. The hierarchy is strict: each column to the right answers everything to its left and strictly more\.
### 4\.8Joint\-only failure modes, and scope
The diagonal/off\-diagonal split recurs wherever a system is monitored by marginal statistics\. In agent\-execution logs with an injected*purely\-joint*failure \(every variable’s marginal AUC≤0\.54\\leq 0\.54, invisible\), a marginal\-blind rank\-interaction screen recovers the responsible pair with power1\.01\.0at0%0\\%false\-discovery, and the complexity mirrors the counting barrier: an order\-kkpurely\-joint mode is akk\-sparse parity, so certifying coverage is statistical\-query hard \(dΩ\(k\)d^\{\\Omega\(k\)\}\)\. A monitor that audits only marginals is blind to these, as a predictor that sees only the diagonal is blind to counterfactuals\. We are likewise explicit about scope: the scaling\-law break, that a small kernel\-augmented model beats a much larger predictor on counterfactual reasoning, we do not attempt at frontier scale, evaluating the strongest accessible predictor baselines and leaving large\-scale world\-model training as future work; as a finite\-budget proxy, a frontier model given unlimited identifiable data still returns infeasible counterfactuals on28%28\\%of models while the kernel, with no learning, is exact\.
## 5The scarred world model
The ontology\-constrained kernel of Section[4\.6](https://arxiv.org/html/2606.10934#S4.SS6)suggests a sharper picture\. The admissible worlds are a region carved out of the space of all configurations, defined by binding constraints the model carries a priori from its ontology and refines online from experience\. We call those binding constraints the model’s*scars*: the disjointness and subsumption axioms a priori, and the encountered infeasibilities online\. Two results follow, and a single phase\-coexistence bottleneck governs both\.
### 5\.1Exact bounds at scale, where the exact program dies
We benchmark against the exact response\-type program with the same ontology constraints\. For discrete structural causal models this program is exactly what automated partial\-identification tools \(autobounds, Duarte et al\.\[[1](https://arxiv.org/html/2606.10934#bib.bib1)\]\) construct and solve; for our queries it is a linear program, which we solve to optimality, so its optimum is the tightest achievable bound and the reference any such tool returns\. Two findings\. First, tightness: on40/4040/40random instances the kernel SDP returns the*exact*bound \(maximum gap0\.00\.0\), so the relaxation loses nothing here\. Second, scale: the exact program over ontology\-consistent world pairs has up to22m2^\{2m\}variables, while the kernel SDP isO\(m2\)O\(m^\{2\}\)\. Table[3](https://arxiv.org/html/2606.10934#S5.T3)runs both to the point where the exact program becomes unbuildable: atm=18m=18it has2\.22\.2billion variables \(intractable for any solver,autoboundsincluded\), while the kernel SDP returns the identical bound in0\.780\.78seconds\. The scoped capability is precise:*the same answer, in polynomial time, where the exact program is dead*\. The edge is scale, not tightness, the SDP cannot beat the exact program; and the ontology constraints are linear, so a tool that accepts them gets the same bound where it can run\. The contribution is matching that bound at scales the exact program cannot reach\.
Table 3:Exact ontology\-constrained counterfactual bound, exact response\-type program versus kernel SDP\. Both return the identical bound where both run \(40/4040/40, zero gap\); pastm=12m=12the exact program is intractable while the SDP stays sub\-second\.
### 5\.2Online scarring: localized hazard\-memory tightens the bound
*The fourth question: can we acquire the off\-diagonal from experience?*The ontology need not be given in full\. A world model can*acquire*constraints by getting scarred: a trajectory that crosses a forbidden boundary leaves a detectable trace, the trace localizes the violated constraint, and the model stores it as a persistent scar\. Accumulated scars shrink the feasible set, so the counterfactual bound tightens with experience\. The mechanism is more than constraint\-counting: scarring the constraint that the current worst\-case bound*relies on*\(the configuration its extremal solution puts mass on, the one that bleeds\) is a cutting plane\. Empirically, blood\-localized scars close the identifiability gap up to4×4\\timesfaster early \(Figure[8](https://arxiv.org/html/2606.10934#S5.F8), left:37%37\\%vs9%9\\%at five scars, a4×4\\timesmargin that narrows to1\.6×1\.6\\timesby eighteen,64%64\\%vs40%40\\%, as both approach saturation\); a no\-memory control that encounters hazards but does not retain them never moves\. The reason is spectral, and it is the same bottleneck that drives the Sly–Sun barrier of Section[6](https://arxiv.org/html/2606.10934#S6)\.
###### Theorem 5\(Targeted scarring restores the spectral gap\)\.
Let the admissible\-world reconfiguration graph carry a phase\-coexistence bottleneck of conductanceΦ=exp\(−Ω\(n\)\)\\Phi=\\exp\(\-\\Omega\(n\)\)splitting the worlds into a majority component and a minority phase of measureμmin\\mu\_\{\\min\}\. Then \(i\)λ2≤2Φ=exp\(−Ω\(n\)\)\\lambda\_\{2\}\\leq 2\\Phi=\\exp\(\-\\Omega\(n\)\)by Cheeger; \(ii\) scarring \(deleting\) the minority phase leaves a residual withλ2=Ω\(1\)\\lambda\_\{2\}=\\Omega\(1\); \(iii\) the gap is restored only once the scarred fraction reachesμmin\\mu\_\{\\min\}and only if scarring targets the bottleneck cut: a random scarred fraction leavesΦ\\Phi, henceλ2\\lambda\_\{2\}, exponentially small\.
###### Proof\.
\(i\) is Cheeger’s inequality\. For \(ii\)–\(iii\), the bottleneck is by definition the cut of conductanceΦ\\Phi, and in the hard\-core coexistence regime it is exactly the boundary between the majority and minority phases \(Sly’s construction\)\. Deleting the minority phase removes one side of that cut, so the residual is the majority component, whose internal conductance isΩ\(1\)\\Omega\(1\); henceλ2=Ω\(1\)\\lambda\_\{2\}=\\Omega\(1\)\. A deletion not aligned with the cut leaves both sides and the connecting boundary intact, so the conductance, and by Cheeger the gap, remainexp\(−Ω\(n\)\)\\exp\(\-\\Omega\(n\)\)\. The threshold fraction is the measureμmin\\mu\_\{\\min\}of the side removed\. ∎
This is why localized scarring works and random does not \(Figure[8](https://arxiv.org/html/2606.10934#S5.F8), right\): the Fiedler value of the residual graph stays exponentially small until the scarred fraction removes the bottleneck phase, then jumps toΩ\(1\)\\Omega\(1\), a threshold rather than a proportionality\. Scarring is the operation that converts the intractable \(bottlenecked\) regime into the tractable one, by spending observations on exactly the constraints that bind\. Algorithm[3](https://arxiv.org/html/2606.10934#alg3)is the cutting plane: it scars the constraint the current worst\-case bound leans on\.
Algorithm 3Online scar\-accumulation \(blood\-localized cutting plane; operationalizes Thm\.[5](https://arxiv.org/html/2606.10934#Thmtheorem5)\)1:marginals
dd; query
QQ; hazard oracle
ℋ\\mathcal\{H\}\(flags infeasible configurations\)
2:
scars←∅\\mathrm\{scars\}\\leftarrow\\varnothing
3:repeat
4:
J⋆←J^\{\\star\}\\leftarrowthe bound\-*maximizing*joint over configurations matching
ddwith
scars\\mathrm\{scars\}set to
0
5:
\(i,j\)←argmax\(i,j\)\\leftarrow\\arg\\maxmass
Jij⋆J^\{\\star\}\_\{ij\}over cells with
ℋ\(i,j\)=infeasible\\mathcal\{H\}\(i,j\)=\\text\{infeasible\}⊳\\trianglerightwhere the worst case bleeds
6:ifno such cellthenbreak⊳\\trianglerightbound is exact on current information
7:endif
8:
scars←scars∪\{\(i,j\)\}\\mathrm\{scars\}\\leftarrow\\mathrm\{scars\}\\cup\\\{\(i,j\)\\\}⊳\\trianglerightstore the localized constraint as a scar
9:untilbudget exhausted
10:returnthe tightened bound on
QQ⊳\\trianglerightup to4×4\\timesfewer scars than random, early
Figure 8:Online scarring\.*Left:*blood\-localized scars \(the constraints the worst\-case bound bleeds on\) close the counterfactual identifiability gap up to4×4\\timesfaster \(early\) than the same number of random constraints; a no\-memory control stays at0%0\\%\.*Right:*Theorem[5](https://arxiv.org/html/2606.10934#Thmtheorem5)in numbers: the residual Fiedler valueλ2\\lambda\_\{2\}is exponentially small until targeted scarring removes the bottleneck phase, then jumps toΩ\(1\)\\Omega\(1\)\(a threshold, not a proportionality\); random scarring never restores it\.
### 5\.3The generative direction
The same scar structure points to a generative use\. A generator whose transition operator block\-decomposes across the admissible/forbidden partition keeps the admissible set invariant, so it generates only admissible worlds by construction\. This is the generative counterpart of the diagonal projector of Section[2](https://arxiv.org/html/2606.10934#S2): instead of predict\-then\-shield, the dynamics cannot leave the ontology\. It is a route to the generative kernel the predictive world\-model program lacks, and we leave its learning to future work; for the bounds above it adds nothing beyond imposing the constraints\.
## 6When the off\-diagonal is reachable: the Sly–Sun barrier
*The fifth question: when is the off\-diagonal reachable?*A disjointness ontology maps its admissible worlds to the independent sets of a graphGG, and the kernel’s diagonal to the \(uniform or weighted\) measureμ\\muoverℳ\(G\)\\mathcal\{M\}\(G\)\. Accessing the off\-diagonal at scale means computing the kernel normalizerZG=∑𝐦∈ℳ\(G\)w\(𝐦\)Z\_\{G\}=\\sum\_\{\\mathbf\{m\}\\in\\mathcal\{M\}\(G\)\}w\(\\mathbf\{m\}\)and its restricted conditional occupation marginals: uniform model sampling, counting, and the conditionals that bound counterfactuals all reduce to these\. We state when they are tractable\.
###### Theorem 6\(Counting transition\)\.
At unit fugacity, on graphs of maximum degreeΔ\\Delta: \(i\) forΔ≤5\\Delta\\leq 5the normalizerZGZ\_\{G\}and every restricted conditional occupation marginal are approximable to additive1/poly\(n\)1/\\mathrm\{poly\}\(n\)in deterministicpoly\(n\)\\mathrm\{poly\}\(n\)time; \(ii\) forΔ≥6\\Delta\\geq 6nopoly\(n\)\\mathrm\{poly\}\(n\)\-time algorithm approximates them unlessNP=RP\\mathrm\{NP\}=\\mathrm\{RP\}\. The two regimes are separated by the Sly–Sun threshold, and the order parameter is the Bethe cavity fieldη\\eta, with\(d−1\)η\(d\-1\)\\etacrossing11at the transition\.
The mechanism is locality\. A counterfactual ratio such asPN\\mathrm\{PN\}needs no global normalizer \(it cancels between numerator and denominator\); it needs only restricted conditional occupation marginals, and below the threshold each of these is a*local*computation, Weitz’s self\-avoiding\-walk tree rooted at the query vertex, correct precisely because correlation decays so that distant worlds do not affect it, a nearest\-neighbour read of world space\. The threshold is exactly where this locality breaks: correlation stops decaying, the neighbourhood that determines the marginal grows to the whole graph, and an efficient approximation would then yield an FPRAS for a quantity Sly–Sun proves inapproximable\. A proof, the cavity recursion, and the unconditional spectral \(Fiedler\) witness of the upper regime are given in Appendix[A](https://arxiv.org/html/2606.10934#A1)\. The single fieldη\\etagoverns both the counting verdict \(the sign of\(d−1\)η−1\(d\-1\)\\eta\-1\) and the rate at which evidence collapses the diagonal\. Algorithm[4](https://arxiv.org/html/2606.10934#alg4)is the constructive content of both directions: a Weitz correlation\-decay estimator below threshold, and the counting reduction that forbids it above\.
Algorithm 4Bound the counterfactual conditionals, or witness the counting barrier \(proves Thm\.[6](https://arxiv.org/html/2606.10934#Thmtheorem6)\)1:constraint graph
GG, max degree
Δ\\Delta, unit fugacity
2:if
Δ≤5\\Delta\\leq 5then⊳\\trianglerightbelow the Sly–Sun threshold: uniqueness regime
3:fix a vertex order
v1,…,vnv\_\{1\},\\dots,v\_\{n\}
4:for
k=1k=1to
nndo
5:
qk←P\(vk∈S∣v1…vk−1fixed\)q\_\{k\}\\leftarrow P\(v\_\{k\}\\\!\\in\\\!S\\mid v\_\{1\}\\\!\\dots v\_\{k\-1\}\\text\{ fixed\}\)via Weitz self\-avoiding\-walk tree to
±1/poly\\pm 1/\\mathrm\{poly\}⊳\\trianglerightcorrelation decay
6:endfor
7:return
ZG=∏k\(conditional factors\)Z\_\{G\}=\\prod\_\{k\}\(\\text\{conditional factors\}\)and all restricted occupation marginals to
1/poly\(n\)1/\\mathrm\{poly\}\(n\)
8:else⊳\\trianglerightΔ≥6\\Delta\\geq 6: assume a poly\-time approximator and derive a collapse
9:an approximate sampler/counter for
ℳ\(G\)\\mathcal\{M\}\(G\)within
TV≤1/poly\\mathrm\{TV\}\\leq 1/\\mathrm\{poly\}follows from the conditionals
10:by downward self\-reducibility \(Jerrum–Valiant–Vazirani\), assemble an FPRAS for
I\(G\)I\(G\)
11:Sly–Sun: approximating
I\(G\)I\(G\)at
λ=1,Δ≥6\\lambda\{=\}1,\\Delta\\geq 6is NP\-hard
⇒\\RightarrowNP=RP\\mathrm\{NP\}=\\mathrm\{RP\}
12:return“not approximable unless
NP=RP\\mathrm\{NP\}=\\mathrm\{RP\}”
13:endif
###### Corollary 2\(Kernel reconstruction is at least as hard as counting\)\.
Any procedure that reconstructs enough of the kernel to recover the restricted diagonal conditionals for every adaptive restriction also approximates the normalizer\. Hence full restriction\-uniform off\-diagonal access inherits the approximate\-counting barrier\.
This is the honesty boundary the program needs\. Old ontology AI assumed it could store and traverse the world; the barrier says full kernel access is a counting\-limited primitive, free only on the tractable side of degree5/65/6\. We implement the transition directly \(Figure[11](https://arxiv.org/html/2606.10934#S6.F11)\): the cavity order parameter\(d−1\)η\(d\-1\)\\etacrosses11betweend=5d=5\(0\.9800\.980\) andd=6d=6\(1\.1101\.110\), and at exactly that crossing the belief\-propagation error in counting the admissible worlds jumps from2%2\\%to84%84\\%, while the Fiedler value of the world\-reconfiguration graph declines as its bottleneck tightens\. Below the threshold the kernel is reconstructible \(Weitz correlation decay computes the conditionals exactly\)\.
#### Above the threshold: bounded, and crossable by scarring\.
The barrier forbids*exact*full access forΔ≥6\\Delta\\geq 6, but it is not a dead end: two polynomial\-time recourses remain, and together they are the practical answer to the hard regime\. First, the kernel SDP of Section[5\.1](https://arxiv.org/html/2606.10934#S5.SS1)is degree\-independent, returning a valid outer bound on every counterfactual at anyΔ\\DeltainO\(m2\)O\(m^\{2\}\)time, with no counting at all\. Second, and more pointedly, the hardness above the threshold is a*phase\-coexistence*phenomenon, two competing clusters of worlds joined by an exponentially thin neck, and Theorem[5](https://arxiv.org/html/2606.10934#Thmtheorem5)shows that this neck is exactly what scarring removes\. Conditioning on \(scarring to\) one phase leaves a single component on which correlation decay is restored andλ2=Ω\(1\)\\lambda\_\{2\}=\\Omega\(1\), so Weitz and Glauber compute the within\-phase conditionals exactly in polynomial time; the only genuinely inaccessible content, the cross\-phase coupling, is precisely what the SDP brackets\. The decomposition is complete: within\-phase quantities exact by phase\-conditioned scarring, the cross\-phase coupling bounded by the PSD relaxation, both polynomial above the threshold\. The barrier bounds naive enumeration of the whole world space; it is crossed by paying for one bit, which phase, and the off\-diagonal answer survives as an exact\-within\-phase value plus a tight cross\-phase bracket\.
#### Sly–Sun calipers, and their measured limit\.
This recourse has a clean geometric form, and we verify both where it works and where it fails\. When the regime is few\-phase, the measure is a convex combinationμ=∑kαkμk\\mu=\\sum\_\{k\}\\alpha\_\{k\}\\mu\_\{k\}of extremal phases, and a counterfactual aggregate inherits it,Eμ\[f\]=∑kαkEμk\[f\]E\_\{\\mu\}\[f\]=\\sum\_\{k\}\\alpha\_\{k\}E\_\{\\mu\_\{k\}\}\[f\]\. Each phase sits in the correlation\-decay regime, soEμk\[f\]E\_\{\\mu\_\{k\}\}\[f\]is a local computation with no global count, and the identified value lies in the convex hull of the phase values\. With two phases this is a bracket whose jaws are the two phase\-expectations: the two phases*are*the calipers \(Figure[9](https://arxiv.org/html/2606.10934#S6.F9)\), and the only unidentified degree of freedom is the mixing weightα\\alpha\. We confirm it on bipartitedd\-regular graphs above the threshold \(d=6,7d=6,7,λ=1\\lambda=1\): the two phases, recovered as the two belief\-propagation fixed points, give jaws that bracket the true value on every instance \(for example\[0\.28,3\.27\]∋1\.96\[0\.28,3\.27\]\\ni 1\.96atn=16,d=6n=16,d=6\), each jaw a local computation\. We need not even assume the phases: a block power iteration with two probe vectors, spun by the single\-flip dynamics, makes them emerge as the top two modes, the second aligning with the true phase split at correlation1\.0001\.000while the spectral gap it exposes halves with system size \(0\.013→0\.0065→0\.00330\.013\\to 0\.0065\\to 0\.0033\), tracing the bottleneck rather than presupposing it\.
This works precisely when the top of the spectrum is sparse, and we measure where it stops\. In a glass the number of near\-degenerate phasesKKgrows and the top eigenvalues crowd together \(gap→0\\to 0\); a fixed two\-probe iteration then captures two ofKKmodes and underestimates the spread \(Figure[10](https://arxiv.org/html/2606.10934#S6.F10)\): on a tunableKK\-phase instance the two\-probe range is2\.02\.0against a true phase spread of7\.07\.0atK=8K=8, and the two\-probe iteration count grows from1919to860860as the modes crowd\. Recovering allKKphases needs a block of sizeKK, which iseΩ\(n\)e^\{\\Omega\(n\)\}in a genuine glass\. The calipers cross the barrier exactly in the few\-phase regime and provably stall in the glass: the spectral signature of the same Sly–Sun wall, measured rather than asserted\.
Figure 9:Sly–Sun calipers\.*Left:*the two coexisting phases are the caliper jaws, each a within\-phase local computation \(no global count\)\.*Right:*the counterfactual is bracketed between the two phase\-expectationsEA\[f\],EB\[f\]E\_\{A\}\[f\],E\_\{B\}\[f\]; the true value \(for the unknown phase weightα\\alpha\) lies inside, and the bracket width is that single unidentified scalar\.Figure 10:Where the calipers stop, measured\.*Left:*few\-phase regime, two modes near11that two probes resolve cleanly \(phase split recovered at correlation1\.01\.0\)\.*Right:*glass,KKnear\-degenerate modes crowd the top \(gap≈10−16\\approx 10^\{\-16\}\); two probes capture two ofKKand underestimate the spread \(2\.02\.0vs true7\.07\.0atK=8K=8\), and recovering allKKneeds a block of sizeK∼eΩ\(n\)K\\sim e^\{\\Omega\(n\)\}\. This is the Sly–Sun wall as a spectral phenomenon\.Figure 11:The counting transition, implemented\.*Left:*the cavity order parameter\(d−1\)η\(d\-1\)\\etacrosses11between degree55and66, and belief\-propagation error in counting independent sets explodes from2%2\\%to84%84\\%at the crossing\.*Right:*the Fiedler value of the independent\-set reconfiguration graph declines as the bottleneck tightens \(the slow\-mixing witness; theexp\(−Ω\(n\)\)\\exp\(\-\\Omega\(n\)\)collapse is asymptotic, so finitennshows a trend\)\. Above the threshold exact kernel access is intractable and the SDP bound is the only survivor\.The single fieldη=1−u\\eta=1\-ualso fixes, in closed form, the rate at which evidence collapses the diagonal, monotonically with ontology density \(Appendix[A](https://arxiv.org/html/2606.10934#A1)\)\.
## 7Intelligence as closure\-preserving counterfactual competence
###### Definition 4\(Closure\-preserving counterfactual competence\)\.
A model is closure\-preserving counterfactually competent for a domain and classQworldQ\_\{\\mathrm\{world\}\}if its latent state is sufficient for the quotientΩ/∼Qworld\\Omega/\\\!\\sim\_\{Q\_\{\\mathrm\{world\}\}\}and its executed transitions preserve the entailed admissible set under every intervention sequence\.
###### Theorem 7\(WorldKernel intelligence theorem\)\.
Suppose \(i\) the latent statezt=\(ht,et,Γt,κt\)z\_\{t\}=\(h\_\{t\},e\_\{t\},\\Gamma\_\{t\},\\kappa\_\{t\}\)is complete forQworldQ\_\{\\mathrm\{world\}\}on the domain, withhth\_\{t\}learned perception,ete\_\{t\}entailed closure,Γt\\Gamma\_\{t\}the admissible set, andκt\\kappa\_\{t\}a slice of the kernel; \(ii\) the diagonal projector is single\-step sound; \(iii\) kernel queries are either not required or lie in the tractable regime of Theorem[6](https://arxiv.org/html/2606.10934#Thmtheorem6)\. Then the model is closure\-preserving counterfactually competent, and on any domain with predictively equivalent but admissibility\-distinct worlds no prediction\-only latent satisfies the same criterion\.
###### Proof\.
Completeness follows from \(i\) and Theorem[1](https://arxiv.org/html/2606.10934#Thmtheorem1)\. Horizon admissibility under interventions follows from \(ii\) and Theorem[3](https://arxiv.org/html/2606.10934#Thmtheorem3)\. Kernel\-query coverage follows from \(iii\), with the diagonal/projected model sufficient otherwise\. Separation from prediction\-only latents is Theorem[2](https://arxiv.org/html/2606.10934#Thmtheorem2)\. ∎
This is a scoped, precise notion, not a mystical one: it names a competence \(closure\-preserving execution plus off\-diagonal counterfactual access\) that prediction\-only models provably cannot guarantee, and it states exactly when that competence is computationally available\.
## 8Related work, discussion, and limitations
#### Predictive, physical, and spatial world models\.
The modern world\-model program, from Ha and Schmidhuber’s recurrent world models\[[23](https://arxiv.org/html/2606.10934#bib.bib23)\]and the Dreamer line\[[24](https://arxiv.org/html/2606.10934#bib.bib24)\]to LeCun’s JEPA agenda\[[25](https://arxiv.org/html/2606.10934#bib.bib25)\], V\-JEPA 2\[[2](https://arxiv.org/html/2606.10934#bib.bib2)\], NVIDIA Cosmos\[[3](https://arxiv.org/html/2606.10934#bib.bib3)\], DeepMind Genie\[[26](https://arxiv.org/html/2606.10934#bib.bib26)\], and World Labs’ Marble\[[4](https://arxiv.org/html/2606.10934#bib.bib4)\], compresses observations, trajectories, or scenes into a predictive latent\. WorldKernel is orthogonal to all of these: it models the admissible\-possibility space \(semantic, causal, deontic, evidential\), and its central object is a kernel over worlds, not a frame, trajectory, or pixel field\. The closest relative in spirit is the event\-graph substrate\[[21](https://arxiv.org/html/2606.10934#bib.bib21)\], which answers counterfactual queries by forking an append\-only log under a structured intervention vocabulary, a concrete diagonal\-plus\-off\-diagonal realization without learned components\.
#### Prediction is not understanding\.
High predictive accuracy does not imply a correct world model\. Vafa et al\. exhibit generative sequence models with near\-perfect next\-token accuracy whose implicit world model is incoherent\[[32](https://arxiv.org/html/2606.10934#bib.bib32)\], and models that predict orbital trajectories yet fail to recover the underlying mechanics\[[33](https://arxiv.org/html/2606.10934#bib.bib33)\]; the stochastic\-parrots critique\[[34](https://arxiv.org/html/2606.10934#bib.bib34)\]is the conceptual ancestor\. We make this precise for one axis: the missing structure is the off\-diagonal, and it is not a function of any predictive distribution\.
#### LLMs and causal reasoning\.
Whether language models reason causally is contested\. Optimistic results report strong performance on causal benchmarks\[[27](https://arxiv.org/html/2606.10934#bib.bib27)\]; the rebuttal is that such models recite causal facts without causal structure\[[28](https://arxiv.org/html/2606.10934#bib.bib28)\]\. Rung\-stratified benchmarks sharpen this: CLadder tests all three rungs of Pearl’s hierarchy\[[29](https://arxiv.org/html/2606.10934#bib.bib29)\], Corr2Cause finds near\-random performance on pure causal inference\[[30](https://arxiv.org/html/2606.10934#bib.bib30)\], and CounterBench finds frontier models near chance on genuine counterfactuals\[[31](https://arxiv.org/html/2606.10934#bib.bib31)\]\. Our battery \(Section[4\.4](https://arxiv.org/html/2606.10934#S4.SS4)\) localizes the failure: the counterfactual coupling the model would need is absent from its inputs\.
#### Neuro\-symbolic and safe learning\.
Semantic\-loss and predicate\-learning systems inject symbolic constraints as losses or vocabularies; shielding\[[6](https://arxiv.org/html/2606.10934#bib.bib6)\]and control\-barrier\[[7](https://arxiv.org/html/2606.10934#bib.bib7)\]methods enforce safety\. Our admissibility construction compiles OWL closure into a per\-transition projector over a learned model \(the lava witness of Section[2](https://arxiv.org/html/2606.10934#S2)\)\. On the implementation side, the learned proposer may be a frozen predictor whose forward pass is steered toward admissible proposals by sparse signed log\-gate controllers\[[8](https://arxiv.org/html/2606.10934#bib.bib8)\], while the hard projector preserves the guarantee of Theorem[3](https://arxiv.org/html/2606.10934#Thmtheorem3)regardless of the controller, so the grammar can enter the forward pass without weakening certified soundness\. Our diagonal/admissibility result reuses the shielding mechanism and claims no novelty for it; the contribution is that admissibility is the diagonal of a kernel whose off\-diagonal is a separate, counterfactual dimension, realized physically and bounded computationally\.
#### Joint\-only failure modes\.
The failures we identify are*purely joint*: visible only in the coupling between worlds and in no marginal \(diagonal\) check\. Verification that audits only the marginal statistics of a predictor therefore cannot detect them, which is the diagonal/off\-diagonal distinction of this paper stated operationally\.
#### Probabilistic ontologies, weighted model counting, and its barriers\.
PR\-OWL\[[9](https://arxiv.org/html/2606.10934#bib.bib9)\], ProbLog\[[10](https://arxiv.org/html/2606.10934#bib.bib10)\], Markov logic networks\[[54](https://arxiv.org/html/2606.10934#bib.bib54)\], and probabilistic description logics\[[55](https://arxiv.org/html/2606.10934#bib.bib55)\]give distributions over logical worlds; these are diagonal objects, and their inference reduces to weighted model counting\[[56](https://arxiv.org/html/2606.10934#bib.bib56)\]over compiled circuits\[[57](https://arxiv.org/html/2606.10934#bib.bib57)\], as used by semantic loss\[[5](https://arxiv.org/html/2606.10934#bib.bib5)\], DeepProbLog\[[53](https://arxiv.org/html/2606.10934#bib.bib53)\], and neuro\-symbolic predicate learners\[[52](https://arxiv.org/html/2606.10934#bib.bib52)\]\. Our normalizer is exactly a weighted model count, and the off\-diagonal inherits the counting structure: \#P\-hardness of the permanent\[[58](https://arxiv.org/html/2606.10934#bib.bib58)\], the sampling\-counting equivalence\[[11](https://arxiv.org/html/2606.10934#bib.bib11)\], hard\-core tractability below the tree threshold\[[14](https://arxiv.org/html/2606.10934#bib.bib14)\], and matching inapproximability above it\[[15](https://arxiv.org/html/2606.10934#bib.bib15),[16](https://arxiv.org/html/2606.10934#bib.bib16),[59](https://arxiv.org/html/2606.10934#bib.bib59)\]\. Counterfactual partial\-identification bounds via moment and polynomial\-programming relaxations are developed by\[[1](https://arxiv.org/html/2606.10934#bib.bib1)\]; our semidefinite bound \(Section[4\.5](https://arxiv.org/html/2606.10934#S4.SS5)\) is the second\-moment instance, read as the kernel’s own positive\-semidefiniteness, and extended with ontology axioms \(Section[4\.6](https://arxiv.org/html/2606.10934#S4.SS6)\) and online cutting\-plane acquisition \(Section[5\.2](https://arxiv.org/html/2606.10934#S5.SS2)\)\. We contribute the off\-diagonal, its counting boundary, this PSD reading, the ontology\-structured tightening, and the scarring acquisition strategy\.
#### Causal hierarchy, partial identification, and the cross\-world coupling\.
Our empirical core is classical in causal inference, assembled here around the off\-diagonal\. Pearl’s structural semantics and the Causal Hierarchy Theorem establish that the rungs almost never collapse, so interventional data generically underdetermine counterfactuals\[[17](https://arxiv.org/html/2606.10934#bib.bib17),[35](https://arxiv.org/html/2606.10934#bib.bib35),[36](https://arxiv.org/html/2606.10934#bib.bib36)\], and neural SCMs inherit this as a learnability limit\[[37](https://arxiv.org/html/2606.10934#bib.bib37)\]\. The unidentified object is the cross\-world joint of potential outcomes: probabilities of causation are only bounded, not identified\[[18](https://arxiv.org/html/2606.10934#bib.bib18),[40](https://arxiv.org/html/2606.10934#bib.bib40)\]; treatment\-effect counterfactuals are partially identified\[[41](https://arxiv.org/html/2606.10934#bib.bib41)\], computed by linear programming over response\-function types\[[42](https://arxiv.org/html/2606.10934#bib.bib42),[39](https://arxiv.org/html/2606.10934#bib.bib39),[51](https://arxiv.org/html/2606.10934#bib.bib51)\]; nested counterfactuals are identifiable only under graphical conditions\[[38](https://arxiv.org/html/2606.10934#bib.bib38)\]\. The mediation literature is the same story: natural direct and indirect effects require a cross\-world independence assumption no experiment can test\[[43](https://arxiv.org/html/2606.10934#bib.bib43),[44](https://arxiv.org/html/2606.10934#bib.bib44),[46](https://arxiv.org/html/2606.10934#bib.bib46),[47](https://arxiv.org/html/2606.10934#bib.bib47)\], whose boundary is mapped by the recanting\-witness criterion\[[45](https://arxiv.org/html/2606.10934#bib.bib45)\], single\-world intervention graphs\[[48](https://arxiv.org/html/2606.10934#bib.bib48)\], and direct analyses of the assumption\[[49](https://arxiv.org/html/2606.10934#bib.bib49)\]; Dawid’s skepticism that the cross\-world joint is even meaningful\[[50](https://arxiv.org/html/2606.10934#bib.bib50)\]is the contrarian articulation of our thesis\. Our contribution is not a new identifiability result: it is the identification of this cross\-world coupling with the off\-diagonal of a single positive semidefinite world kernel, the demonstration that the kernel’s PSD structure tightens the bound in polynomial time beyond the marginals \(Section[4\.5](https://arxiv.org/html/2606.10934#S4.SS5)\), and its counting boundary, with the demonstration \(Section[4\.4](https://arxiv.org/html/2606.10934#S4.SS4)\) that predictors and Bayesian baselines collapse it while the kernel reports it\. Scale does not remove the gap, consistent with evidence that predictive scaling laws saturate\[[22](https://arxiv.org/html/2606.10934#bib.bib22)\]\.
#### Limitations\.
The off\-diagonal becomes accessible only on the tractable side of the Sly–Sun threshold; above it, the theory predicts its own intractability, which is a feature but bounds applicability\. The keystone experiment is a finite\-nnillustration of an asymptotic theorem, not an independent proof\. The language\-model baseline is a strong but single predictor; the claim is structural \(the off\-diagonal is not in rung\-1/21/2data\), not about any particular model’s competence\. The generative use of the kernel \(sampling admissible worlds rather than verifying them, Section[5\.3](https://arxiv.org/html/2606.10934#S5.SS3)\) is left to future work; symbolic\-discovery methods in this direction exhibit saturating returns\[[22](https://arxiv.org/html/2606.10934#bib.bib22)\], which suggests the generative gain is bounded and must be combined with the verification primitives developed here rather than replacing them\.
## 9Conclusion
A world model is not a predictor of futures\. It is the coupling kernel of admissible possible worlds\. The diagonal of that kernel is the posterior, and maintaining it certifiably is admissibility, the part prediction can approximate and a projector can guarantee\. The off\-diagonal is a genuinely separate direction: it is the cross\-world coupling that fixes counterfactuals, it is invisible to the predictor and to any marginal check, its positive\-semidefinite structure already tightens what can be inferred beyond the marginals in polynomial time, and its full reconstruction is reachable only below the Sly–Sun counting threshold\. Prediction asks what happens next\. Intelligence asks what can consistently be, what follows, what is permitted, what intervention is licensed, and which alternative worlds remain linked\. That last question is the kernel\.
## Appendix AThe counting transition
#### Order parameter\.
On the infinitedd\-regular tree \(the locally tree\-like limit of randomdd\-regular graphs\) at fugacityλ\\lambda, the hard\-core cavity vacancy probability satisfies
u=11\+λud−1,u=\\frac\{1\}\{1\+\\lambda u^\{d\-1\}\},whose root in\(0,1\)\(0,1\)exists and is unique for allλ\\lambda\(it is the unique zero of the increasingg\(u\)=u\+λud−1g\(u\)=u\+\\lambda u^\{d\}\-1atλ=1\\lambda=1\), though the naive iterationu↦1/\(1\+λud−1\)u\\mapsto 1/\(1\+\\lambda u^\{d\-1\}\)converges to it only belowλc\(d\)=\(d−1\)d−1/\(d−2\)d\\lambda\_\{c\}\(d\)=\(d\-1\)^\{d\-1\}/\(d\-2\)^\{d\}; above that one must solveg\(u\)=0g\(u\)=0directly \(bisection\)\. The occupation field isη=1−u\\eta=1\-u, and atλ=1\\lambda=1the fixed point obeysud=1−u=ηu^\{d\}=1\-u=\\eta\. The recursion’s contraction factor at the fixed point is\|T′\(u∗\)\|=\(d−1\)λud=\(d−1\)η\|T^\{\\prime\}\(u^\{\*\}\)\|=\(d\-1\)\\lambda u^\{d\}=\(d\-1\)\\eta, the counting order parameter\.
Two distinct criteria locate the same integer transition, and we keep them separate to avoid a common conflation\. The contraction crossing\(d−1\)η=1\(d\-1\)\\eta=1occurs at the continuous valued≈5\.14d\\approx 5\.14\(numerically\(d−1\)η=0\.980\(d\-1\)\\eta=0\.980atd=5d=5and1\.1101\.110atd=6d=6\)\. The fugacity\-threshold crossingλc\(d\)=1\\lambda\_\{c\}\(d\)=1occurs atd≈5\.66d\\approx 5\.66\(λc\(5\)=44/35=1\.053\>1\\lambda\_\{c\}\(5\)=4^\{4\}/3^\{5\}=1\.053\>1,λc\(6\)=55/46=0\.763<1\\lambda\_\{c\}\(6\)=5^\{5\}/4^\{6\}=0\.763<1\)\. Both place the integer transition strictly between degree55\(tractable\) and degree66\(hard\); our figures use the contraction valuedc≈5\.14d\_\{c\}\\approx 5\.14\.
#### Evidence\-collapse rate in closed form\.
The Bethe free energy gives the per\-site entropyh\(η\)=log2\(1\+λud\)−d2log2\(1−η2\)h\(\\eta\)=\\log\_\{2\}\(1\+\\lambda u^\{d\}\)\-\\tfrac\{d\}\{2\}\\log\_\{2\}\(1\-\\eta^\{2\}\), the rate at which evidence collapses the diagonal, withO\(1/n\)O\(1/n\)corrections from short cycles; exact enumeration ton≤22n\\leq 22matches the closed form to three decimals, monotone in ontology density through the single fieldη\\eta\.
#### Below threshold \(tractable\)\.
The normalizer telescopes into adaptive conditional occupation marginals,ZG=∏k\(conditional factors\)Z\_\{G\}=\\prod\_\{k\}\(\\text\{conditional factors\}\), so it suffices to compute each to additive1/poly1/\\mathrm\{poly\}error\. Conditioning deletes vertices, so every conditional marginal is an occupation marginal of a vertex\-deleted subgraph of maximum degree≤Δ\\leq\\Delta, still in the uniqueness regime forΔ≤5\\Delta\\leq 5\. Weitz’s self\-avoiding\-walk\-tree algorithm computes each to±δ\\pm\\deltainpoly\(n,1/δ\)\\mathrm\{poly\}\(n,1/\\delta\)via exponential correlation decay; the product of thennconditionals gives the claim, uniformly\.
#### Above threshold \(hard\)\.
The conditional marginals yield a sampler within total variation1/poly\(n\)1/\\mathrm\{poly\}\(n\)of the uniform measure over independent sets\. Downward self\-reducibility \(Jerrum–Valiant–Vazirani\) turns a uniform sampler for every vertex\-deleted subgraph into an FPRAS forI\(G\)I\(G\)\. ForΔ≥6\\Delta\\geq 6atλ=1\\lambda=1this approximate count is NP\-hard \(Sly, Sly–Sun, Galanis–Stefankovic–Vigoda\), so a poly\-time approximator would giveNP=RP\\mathrm\{NP\}=\\mathrm\{RP\}\.
#### Unconditional spectral witness\.
For any local \(single\-flip\) dynamics on the admissible worlds, the obstruction is unconditional: the Fiedler value of the single\-flip adjacency onℳ\(G\)\\mathcal\{M\}\(G\)collapses asλ2=exp\(−Ω\(n\)\)\\lambda\_\{2\}=\\exp\(\-\\Omega\(n\)\)above threshold \(Sly slow mixing\), so the world space fragments into exponentially weakly connected clusters and no local\-move sampler mixes in polynomial time\. This is the finite\-nntrend of Figure[11](https://arxiv.org/html/2606.10934#S6.F11)and the bottleneck of Theorem[5](https://arxiv.org/html/2606.10934#Thmtheorem5); covering all \(non\-local\) algorithms is what requires theNP=RP\\mathrm\{NP\}=\\mathrm\{RP\}assumption\.
## Appendix BCounting barrier for restriction\-uniform kernel access
Letwx\(y\)≥0w\_\{x\}\(y\)\\geq 0be explicit weights ony∈\{0,1\}ny\\in\\\{0,1\\\}^\{n\}with normalizerZxZ\_\{x\}and posteriorμx\\mu\_\{x\}\. Assume restrictions are efficiently constructible, terminal weights computable to relative1±O\(ε\)1\\pm O\(\\varepsilon\), and that an oracle reproduces every restricted conditionalμx\|a\\mu\_\{x\|a\}for an adaptively chosen prefixaato additive errorγ=ε/\(20n\)\\gamma=\\varepsilon/\(20n\)\. ThenZxZ\_\{x\}is approximable to1±O\(ε\)1\\pm O\(\\varepsilon\)in polynomial time\. The proof is a Bayes\-greedy telescoping: query the next\-bit conditional, follow the branch of probability≥1/3\\geq 1/3, and multiply,μx\(y\)=∏kpk\\mu\_\{x\}\(y\)=\\prod\_\{k\}p\_\{k\}, so thatZx=wx\(y\)/∏kpkZ\_\{x\}=w\_\{x\}\(y\)/\\prod\_\{k\}p\_\{k\}\. Any conditionally diagonal\-complete kernel\-reconstruction procedure supplies exactly these conditionals, hence inherits the counting barrier\.
## References
- \[1\]G\. Duarte, N\. Finkelstein, D\. Knox, J\. Mummolo, I\. Shpitser\. An Automated Approach to Causal Inference in Discrete Settings\. Journal of the American Statistical Association, 2024\.
- \[2\]M\. Assran et al\. V\-JEPA 2: Self\-Supervised Video Models Enable Understanding, Prediction and Planning\. arXiv:2506\.09985, 2025\.
- \[3\]NVIDIA\. Cosmos 3: The Open Physical AI Foundation Model\. Technical report, 2026\.
- \[4\]World Labs\. Marble: A Multimodal World Model\. 2025\.
- \[5\]J\. Xu, Z\. Zhang, T\. Friedman, Y\. Liang, G\. Van den Broeck\. A Semantic Loss Function for Deep Learning with Symbolic Knowledge\. ICML, 2018\.
- \[6\]M\. Alshiekh, R\. Bloem, R\. Ehlers, B\. Könighofer, S\. Niekum, U\. Topcu\. Safe Reinforcement Learning via Shielding\. AAAI, 2018\.
- \[7\]A\. D\. Ames et al\. Control Barrier Functions: Theory and Applications\. ECC, 2019\.
- \[8\]L\. Chlon\. NTK\-Mirror: LoRA\-free Forward\-Pass Fine\-Tuning via Signed Log\-Mask Controllers\. Software repository, 2026\.[https://github\.com/leochlon/ntkmirror](https://github.com/leochlon/ntkmirror)\.
- \[9\]P\. C\. G\. da Costa, K\. B\. Laskey, K\. J\. Laskey\. PR\-OWL: A Bayesian Ontology Language for the Semantic Web\. URSW, 2005\.
- \[10\]L\. De Raedt, A\. Kimmig, H\. Toivonen\. ProbLog: A Probabilistic Prolog\. IJCAI, 2007\.
- \[11\]M\. Jerrum, L\. Valiant, V\. Vazirani\. Random Generation of Combinatorial Structures from a Uniform Distribution\. TCS, 1986\.
- \[12\]D\. Aharonov, A\. Ta\-Shma\. Adiabatic Quantum State Generation and Statistical Zero Knowledge\. STOC, 2003\.
- \[13\]L\. Grover, T\. Rudolph\. Creating Superpositions that Correspond to Efficiently Integrable Probability Distributions\. arXiv:quant\-ph/0208112, 2002\.
- \[14\]D\. Weitz\. Counting Independent Sets up to the Tree Threshold\. STOC, 2006\.
- \[15\]A\. Sly\. Computational Transition at the Uniqueness Threshold\. FOCS, 2010\.
- \[16\]A\. Sly, N\. Sun\. Counting in Two\-Spin Models ondd\-Regular Graphs\. Annals of Probability, 2014\.
- \[17\]J\. Pearl\. Causality: Models, Reasoning, and Inference\. Cambridge University Press, 2009\.
- \[18\]J\. Tian, J\. Pearl\. Probabilities of Causation: Bounds and Identification\. Annals of Mathematics and AI, 2000\.
- \[19\]F\. Rovai\. Open Ontologies: Tool\-Augmented Ontology Engineering with Stable Matching Alignment\. arXiv:2605\.09184, 2026\.
- \[20\]F\. Rovai\. CIVeX: Causal Intervention Verification for Language Agents\. arXiv:2605\.09168, 2026\.
- \[21\]F\. Rovai\. Deterministic Event\-Graph Substrates as World Models for Counterfactual Reasoning\. arXiv:2605\.15967, 2026\.
- \[22\]F\. Rovai\. Saturating Scaling Laws for Equational Discovery\. arXiv:2605\.23983, 2026\.
- \[23\]D\. Ha, J\. Schmidhuber\. Recurrent World Models Facilitate Policy Evolution\. NeurIPS, 2018\.
- \[24\]D\. Hafner, J\. Pasukonis, J\. Ba, T\. Lillicrap\. Mastering Diverse Control Tasks through World Models \(DreamerV3\)\. Nature, 2025\.
- \[25\]Y\. LeCun\. A Path Towards Autonomous Machine Intelligence\. OpenReview, 2022\.
- \[26\]J\. Bruce et al\. Genie: Generative Interactive Environments\. ICML, 2024\.
- \[27\]E\. Kıcıman, R\. Ness, A\. Sharma, C\. Tan\. Causal Reasoning and Large Language Models: Opening a New Frontier for Causality\. arXiv:2305\.00050, 2023\.
- \[28\]M\. Zečević, M\. Willig, D\. S\. Dhami, K\. Kersting\. Causal Parrots: Large Language Models May Talk Causality But Are Not Causal\. TMLR, 2023\.
- \[29\]Z\. Jin et al\. CLadder: Assessing Causal Reasoning in Language Models\. NeurIPS, 2023\.
- \[30\]Z\. Jin et al\. Can Large Language Models Infer Causation from Correlation? ICLR, 2024\.
- \[31\]Y\. Chen, V\. K\. Singh, J\. Ma, R\. Tang\. CounterBench: Evaluating and Improving Counterfactual Reasoning in LLMs\. arXiv:2502\.11008, 2025\.
- \[32\]K\. Vafa, J\. Y\. Chen, A\. Rambachan, J\. Kleinberg, S\. Mullainathan\. Evaluating the World Model Implicit in a Generative Model\. NeurIPS, 2024\.
- \[33\]K\. Vafa, P\. G\. Chang, A\. Rambachan, S\. Mullainathan\. What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models\. ICML, 2025\.
- \[34\]E\. M\. Bender, T\. Gebru, A\. McMillan\-Major, S\. Shmitchell\. On the Dangers of Stochastic Parrots\. FAccT, 2021\.
- \[35\]E\. Bareinboim, J\. D\. Correa, D\. Ibeling, T\. Icard\. On Pearl’s Hierarchy and the Foundations of Causal Inference\. In Probabilistic and Causal Inference, ACM Books, 2022\.
- \[36\]D\. Ibeling, T\. Icard\. Probabilistic Reasoning across the Causal Hierarchy\. AAAI, 2020\.
- \[37\]K\. Xia, K\.\-Z\. Lee, Y\. Bengio, E\. Bareinboim\. The Causal\-Neural Connection: Expressiveness, Learnability, and Inference\. NeurIPS, 2021\.
- \[38\]J\. D\. Correa, S\. Lee, E\. Bareinboim\. Nested Counterfactual Identification from Arbitrary Surrogate Experiments\. NeurIPS, 2021\.
- \[39\]J\. Zhang, J\. Tian, E\. Bareinboim\. Partial Counterfactual Identification from Observational and Experimental Data\. ICML, 2022\.
- \[40\]A\. Li, J\. Pearl\. Probabilities of Causation: Role of Observational Data\. AISTATS, 2023\.
- \[41\]C\. F\. Manski\. Nonparametric Bounds on Treatment Effects\. American Economic Review, 80\(2\):319–323, 1990\.
- \[42\]A\. Balke, J\. Pearl\. Counterfactual Probabilities: Computational Methods, Bounds and Applications\. UAI, 1994\.
- \[43\]J\. M\. Robins, S\. Greenland\. Identifiability and Exchangeability for Direct and Indirect Effects\. Epidemiology, 3\(2\):143–155, 1992\.
- \[44\]J\. Pearl\. Direct and Indirect Effects\. UAI, 2001\.
- \[45\]C\. Avin, I\. Shpitser, J\. Pearl\. Identifiability of Path\-Specific Effects\. IJCAI, 2005\.
- \[46\]K\. Imai, L\. Keele, T\. Yamamoto\. Identification, Inference and Sensitivity Analysis for Causal Mediation Effects\. Statistical Science, 25\(1\):51–71, 2010\.
- \[47\]T\. J\. VanderWeele\. Explanation in Causal Inference: Methods for Mediation and Interaction\. Oxford University Press, 2015\.
- \[48\]J\. M\. Robins, T\. S\. Richardson\. Alternative Graphical Causal Models and the Identification of Direct Effects\. In Causality and Psychopathology, Oxford University Press, 2010\.
- \[49\]R\. M\. Andrews, V\. Didelez\. Insights into the Cross\-World Independence Assumption of Causal Mediation Analysis\. Epidemiology, 32\(2\):209–219, 2021\.
- \[50\]A\. P\. Dawid\. Causal Inference without Counterfactuals\. JASA, 95\(450\):407–424, 2000\.
- \[51\]D\. Heckerman, R\. Shachter\. Decision\-Theoretic Foundations for Causal Reasoning\. JAIR, 3:405–430, 1995\.
- \[52\]Y\. Liang et al\. VisualPredicator: Learning Abstract World Models with Neuro\-Symbolic Predicates for Robot Planning\. ICLR, 2025\.
- \[53\]R\. Manhaeve, S\. Dumančić, A\. Kimmig, T\. Demeester, L\. De Raedt\. DeepProbLog: Neural Probabilistic Logic Programming\. NeurIPS, 2018\.
- \[54\]M\. Richardson, P\. Domingos\. Markov Logic Networks\. Machine Learning, 62:107–136, 2006\.
- \[55\]F\. Riguzzi, E\. Bellodi, E\. Lamma, R\. Zese\. Probabilistic Description Logics under the Distribution Semantics\. Semantic Web, 6\(5\):477–501, 2015\.
- \[56\]M\. Chavira, A\. Darwiche\. On Probabilistic Inference by Weighted Model Counting\. Artificial Intelligence, 172\(6–7\):772–799, 2008\.
- \[57\]A\. Darwiche, P\. Marquis\. A Knowledge Compilation Map\. JAIR, 17:229–264, 2002\.
- \[58\]L\. G\. Valiant\. The Complexity of Computing the Permanent\. Theoretical Computer Science, 8\(2\):189–201, 1979\.
- \[59\]A\. Galanis, D\. Štefankovič, E\. Vigoda\. Inapproximability of the Partition Function for the Antiferromagnetic Ising and Hard\-Core Models\. Combinatorics, Probability and Computing, 25\(4\):500–559, 2016\.Similar Articles
@CSProfKGD: .@ylecun’s definition of what is a world model.
A tweet referencing Yann LeCun's definition of a world model.
PatchWorld: Gradient-Free Optimization of Executable World Models
PatchWorld introduces a gradient-free framework that transforms offline trajectories into executable Python world models via counterexample-guided code repair, enabling interpretable and inspectable belief-state programs for planning in partially observable environments.
stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation
Stable-Worldmodel (SWM) is a modular and standardized research framework for developing and evaluating world models, designed to improve reproducibility and support robustness and continual learning research.
One Lens, Many Worlds : A Capability-Typed Interface for World-Model Interpretability
This paper introduces WorldModelLens, an open-source substrate for interpretability of world models, using a capability-typed adapter interface that generalizes across diverse architectures like PlaNet, Dreamer, IRIS, and I-JEPA. The framework provides a unified hook-and-cache layer for activation analysis and adds only ~12% overhead when inactive.
World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning
This paper proposes Privileged-Future On-Policy Self-Distillation (PF-OPSD) for controlled concrete reasoning, combining world models' visual simulation with language models' abstract reasoning to improve prediction accuracy and robustness on two new benchmarks.