Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images
Summary
This paper introduces sparse autoencoders to resolve superposition in neural networks, improving interpretability and geometric fidelity of latent spaces, and presents GW-map for cross-modal alignment between image representations and single-cell RNA sequencing data.
View Cached Full Text
Cached at: 07/01/26, 05:35 AM
# Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images
Source: [https://arxiv.org/html/2606.31394](https://arxiv.org/html/2606.31394)
Jisung Park1,Seohyeon Kang1,Daeun Yoo1,Eunsu Lee2,Seoin Cho1,Wooyeop Choi1 Ian Choi1,James R\. Evan4,Daesoo Kim1,3,Sonia Gandhi4,Minee L\. Choi1,3 1KAIST,2Konyang University,3Chang Gung University, 4UCL Queen Square Institute of Neurology & The Francis Crick Institute
###### Abstract
Artificial intelligence is transforming our capability to solve biological challenges\. In dimensionality bottleneck regimes exacerbated by high\-dimensional biological data, Neural networks force distinct concepts into the lower dimensions known as superposition\. Although this superposition is widely known to hinder interpretability, its impact on corrupting the geometry of latent spaces remains critically overlooked\. Here, we utilized sparse autoencoders \(SAEs\) trained on over 100,000 multiplexed images of patient\-derived Parkinson’s disease and healthy neurons to resolve superposition\. This approach bypasses the mathematical non\-uniqueness of feature attribution by shifting to interpretable latent representation analysis\. We theoretically and empirically demonstrate that superposition contaminates representational metric spaces, and thereby SAEs successfully recover geometric fidelity\. By treating these geometrically purified representations as single\-cell state vectors, we adapted single\-cell RNA sequencing \(scRNA\-seq\) data analysis methodologies directly to the image domain\. Finally, we introduce GW\-map, utilizing Gromov\-Wasserstein optimal transport to align these image representations with authentic scRNA\-seq data*de novo*\. This coupling reconstructs hierarchical neuronal pathology pathways such as Calcium\-AIS scaffold, without reference spatial transcriptomics, establishing a scalable foundation for spatial biology\. Code is available at https://github\.com/jijihihi/Bio\_superposition
## 1Introduction and Related Work
### 1\.1Limitation of Feature Attribution XAI
Deep learning has become indispensable in biological image analysis, with pathology foundation models routinely extracting high\-dimensional representations from visual information\. To establish model reliability and generate biological hypotheses, post\-hoc feature attribution techniques such as Gradient\-weighted Class Activation Mapping \(Grad\-CAM\)Selvarajuet al\.\([2017](https://arxiv.org/html/2606.31394#bib.bib163)\)and Shapley Additive Explanations \(SHAP\)Lundberg and Lee \([2017](https://arxiv.org/html/2606.31394#bib.bib164)\)are widely employedAravindkumaret al\.\([2026](https://arxiv.org/html/2606.31394#bib.bib143)\); Zhouet al\.\([2023](https://arxiv.org/html/2606.31394#bib.bib136)\)\. However, these methods confront a fundamental tension rooted in geometry\. Under the manifold hypothesis, component\-wise credit assignment becomes mathematically non\-uniqueSundararajan and Najmi \([2020](https://arxiv.org/html/2606.31394#bib.bib123)\)\. This ambiguity is severely exacerbated in biology due to the dense interdependency of biological variables\. Conversely, bypassing this ambiguity by ignoring the data’s intrinsic geometry yields off\-manifold explanationsFryeet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib129)\); Anderset al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib124)\)\. Given this manifold\-attribution paradox, compounded by frequent inconsistencies across explainable AI \(XAI\) methodsKrishnaet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib126)\)and other limitationsBilodeauet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib133)\); Huang and Marques\-Silva \([2024](https://arxiv.org/html/2606.31394#bib.bib193)\), the application of XAI in biology urgently requires an intrinsic alternative\.
### 1\.2Mechanistic Interpretability and Representation Contamination Exacerbated in Biology
Mechanistic Interpretability \(MI\) has recently emerged as a powerful alternativeElhageet al\.\([2021](https://arxiv.org/html/2606.31394#bib.bib188)\)\. MI posits that interpretability challenges stem from concept superposition: the forced compression of multiple distinct concepts into fewer dimensions in a neural networkElhageet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib189)\)\. To resolve this, MI utilizes sparse Autoencoders \(SAEs\) to disentangle superposed concepts into monosemantic unitsBrickenet al\.\([2023](https://arxiv.org/html/2606.31394#bib.bib156)\); Templetonet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib157)\)\. This conceptual isolation renders the model’s internal representations directly interpretable, obviating manifold\-attribution paradox\. It achieves human\-centric interpretability without sacrificing predictive capacity, altering model architecturesChenet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib153)\)or predefined conceptsKohet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib154)\); Kimet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib155)\)\. While the impact of superposition on interpretability is increasingly recognizedSomvanshiet al\.\([2026](https://arxiv.org/html/2606.31394#bib.bib190)\), its role in corrupting representation geometry has been critically overlookedElhageet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib189)\); Brickenet al\.\([2023](https://arxiv.org/html/2606.31394#bib.bib156)\); Longonet al\.\([2026](https://arxiv.org/html/2606.31394#bib.bib178)\)\. Because neural networks densely pack concept vectors to optimize lossElhageet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib189)\); Parket al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib250)\), this packing contaminates the metric space and destroys the mathematical isometry of the latent space rendering local measurements \(e\.g\., cosine distances\) unreliable\.
Crucially, superposition becomes extraordinarily prominent in the biological domain\. The internal dimensions of neural networks are vastly narrower than the extreme diversity of biological states, creating severe dimensionality constraints\. Furthermore, because interference fundamentally arises from inner products between sparse concept vectors, data sparsity is key factor in superposition\. Given that extreme high\-dimensionality and zero\-inflated sparsity are inherent to biological dataJianget al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib222)\), this geometric corruption is profoundly pronounced in biological AI, fundamentally threatening the reliability of prevalent representation\-based downstream analysesChaiet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib182)\); Caicedoet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib194)\)\.
### 1\.3Our Proposed Framework
To address this, we employed SAEs to disentangle the superposition\. We validated this framework using patient induced pluripotent stem cell \(iPSC\)\-derived cortical neurons in a high\-content assay\. Through theoretical and empirical analysis, we demonstrate that superposition contaminates representation and thereby disentangling superposition and lifting the dimensionality via SAEs restores the corrupted metric space, empowering the representations to faithfully reflect intrinsic data semantics\. Leveraging these geometrically purified representations and structural similarity between SAE representation and single\-cell RNA sequencing \(scRNA\-seq\) data \(see section 5\), we successfully deployed graph\-based scRNA\-seq analytical methods tailored to the intrinsic noise and continuous biological properties of the data directly to the image domain\.
Furthermore, we developed the GW\-map framework utilizing Gromov\-Wasserstein \(GW\) optimal transportPeyré and Cuturi \([2019](https://arxiv.org/html/2606.31394#bib.bib161)\); Mémoli \([2011](https://arxiv.org/html/2606.31394#bib.bib162)\); Demetciet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib100)\)to*de novo*align image and single\-cell transcriptomic data without reference spatial transcriptomics \(ST\)\. By autonomously reconstructing the hierarchical pathway architecture of neuronal pathology, this approach overcomes the fundamental limitations of ST, including restricted cellular resolution, high costs, limited gene coverage, and chronic cross\-modal alignment challengesGuoet al\.\([2025](https://arxiv.org/html/2606.31394#bib.bib197)\); Khanet al\.\([2025](https://arxiv.org/html/2606.31394#bib.bib199)\)\.
The main contributions of this paper are summarized as follows:
- •We theoretically demonstrate that superposition mathematically contaminates the representation metric space\.
- •We empirically demonstrate this geometric contamination in biological imaging models and show that SAE\-based superposition disentanglement faithfully recovers the intrinsic geometric fidelity of the data
- •Leveraging geometrically purified SAE representations, we successfully adapted advanced scRNA\-seq analysis algorithms to the image domain, establishing a powerful framework for evaluating representations and uncovering biological hypotheses\.
- •We introduce GW\-map, which autonomously couples purified SAE representations with authentic scRNA\-seq data via Gromov\-Wasserstein optimal transport, suggesting a highly scalable new approach to spatial biology
## 2Method
In this section, we present the computational framework that transforms high\-content imaging data into interpretable, geometrically purified representations, adapts of scRNA\-seq methodologies to the image domain, and aligns with authentic scRNA\-seq data via GW coupling\. All experimental results are reported as the average over eight independent random seeds\. Detiled methods in Appendix E\.
Figure 1:\(a\)MoCo\-based contrastive representation learning framework\.\(b\)Integration of the SAE to CNN\. SAE GAP latent vectors were used for pseudo\-transcriptomic analysis and multimodal integration\.\(c\)Left: Comparison between entangled CNN attention and disentangled attention of the SAE\. Right: SAE more faithfully reflects the data topological relationships\.\(d\)Single\-cell methods were applied to SAE representations\.\(e\)Unsupervised cross\-modal alignment of SAE representations with authentic scRNA\-seq data\. Downstream analyses validated the biological plausiblity of this cross\-modal alignment\. Panels \(a\-e\) were created using BioRender; Choi, ML\. \(2026\) https://BioRender\.com/tk1hkpy### 2\.1Contrastive Learning of CNN on Cortical Neuronal Images
Cortical neurons were differentiated from hiPSCs of healthy controls \(3 lines\) and familial PD patients*\(SNCA×3\\times 3, LRRK2, GBA\)*\. High\-content live\-cell imaging was performed using four fluorescent markers: Hoechst 33342 \(nuclei\), TMRM \(mitochondria\), LysoTracker \(lysosomes\), and Sytox Green \(dead cells for apoptosis quantification\)D’Saet al\.\([2023](https://arxiv.org/html/2606.31394#bib.bib253)\)\. A custom CNN architecture was chosen over a Vision Transformer \(ViT\) due to its spatial localityLecunet al\.\([1998](https://arxiv.org/html/2606.31394#bib.bib90)\); Olahet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib111)\); Burgertet al\.\([2025](https://arxiv.org/html/2606.31394#bib.bib149)\); Brendel and Bethge \([2019](https://arxiv.org/html/2606.31394#bib.bib151)\); Dosovitskiyet al\.\([2021](https://arxiv.org/html/2606.31394#bib.bib147)\); Raghuet al\.\([2021](https://arxiv.org/html/2606.31394#bib.bib148)\), aiding mechanistic interpretation\. The network was trained on over 100,000 images using a three\-channel input \(excluding the dead\-cell marker\)\. Figure 1 \(a\) shows the supervised contrastive learning frameworkKhoslaet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib144)\)based on MoCoHeet al\.\([2020b](https://arxiv.org/html/2606.31394#bib.bib142)\)for four\-class classification \(control and three PD mutations\)\. To minimize technical artifacts and variations in cell density, instance normalization \(IN\) was applied to all input images\.
### 2\.2Sparse Autoencoder Integration and Representation Evaluation
To identify the optimal latent space, we evaluated feature vectors extracted via Global Average Pooling \(GAP\) from the final three CNN layers preceding the projection head\. Gated sparse autoencoders \(SAE\)Brickenet al\.\([2023](https://arxiv.org/html/2606.31394#bib.bib156)\); Templetonet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib157)\); Rajamanoharanet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib114)\)were trained on point\-wise vectors \(vij∈RCv\_\{ij\}\\in\\ R^\{C\}\) from the spatial locations of the optimal CNN feature map \(fstage5∈ℝH×W×Cf\_\{stage5\}\\in\\mathbb\{R\}^\{H\\times W\\times C\}\) \(Fig\. 1\(b\)\)\. Following standard practice, each point\-wise vector was batch\-centeredL2L\_\{2\}\-normalized prior to SAE training\. We evaluated superposition disentanglement using inner product distributions, spatial attention monosemanticity and effective dimensionality \(eRank\) across various expansion factors and sparsity penalties\. The SAE successfully decomposed complex spatial attention into human\-interpretable feature maps \(Fig\. 1\(c\), left\) and preserved biological semantics with significantly higher topological fidelity than the baseline CNN representations\. \(Fig\. 1\(c\), right\)\. A sparsity threshold of1×10−51\\times\{10\}^\{\-5\}was applied to the SAE activations for all subsequent analyses unless otherwise stated\.
### 2\.3Manifold Analysis via Adaptation of scRNA\-seq Methodologies to Image Domain
Leveraging the and semantically resolved features from the SAE, we treated these representations as transcriptomic\-like single\-cell profiles\. This conceptual bridge enabled the direct adaptation of scRNA\-seq analytical frameworks, which offer the tools for non\-linear manifold analysis\. Specifically, we utilized Diffusion Pseudotime \(DPT\)Haghverdiet al\.\([2016](https://arxiv.org/html/2606.31394#bib.bib18)\), which approximates geodesic distances across complex, non\-Euclidean data manifolds by modeling stochastic random walks and heat diffusion kernels\. We also applied the Potential of Heat\-diffusion for Affinity\-based Transition Embedding \(PHATE\)Moonet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib165)\)to visualize information\-theoretic distances, and Partition\-based Graph Abstraction \(PAGA\)Wolfet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib167)\)to rigorously quantify topological connectivity\.
### 2\.4Cross\-Modal Coupling via GW\-map
We coupled image and authentic scRNA\-seq via GW\-map\. Specifically, we batch\-corrected publicly available*SNCA×3\\times 3*3D organoid scRNA\-seq dataJinet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib84)\)using single\-cell variational inference \(scVI\)Lopezet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib257)\)and computationally filtered them to retain 2D\-representative populations\. The PCA\-reduced SAE feature vectors were then structurally aligned with the scVI latent vectors via GW coupling\. To robustly quantify gene\-specific predictability without variance\-induced inflation, we calculatedZZ\-scores by standardizing the predictiveR2R^\{2\}values against a null distribution generated via rigorous permutation testing\. Downstream analyses validated the biological plausibility of this cross\-modal alignment\.
## 3Biological Information Encoded in the CNN Representation
Our custom CNN achieved a linear probe accuracy of 90\.31 ± 0\.38% \(mean ± SD; Fig\. 2\(a\), \(b\)\), and Centered Kernel Alignment \(CKA\)Kornblithet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib109)\)confirmed robust representational similarity across independently trained models \(512 dimensions, 2,500 samples/classNguyenet al\.\([2021](https://arxiv.org/html/2606.31394#bib.bib263)\); Murphyet al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib260)\); Fig\. 2\(c\)\)\. To evaluate whether the network captures biological information beyond classification, we predicted cell death rates using representations from the final three CNN layers \(fstage5midf\_\{stage5\_\{m\}id\},fstage5outf\_\{stage5\_\{o\}ut\}, andfrefineoutf\_\{refine\_\{o\}ut\}\) via ridge regressionChen and Guestrin \([2016](https://arxiv.org/html/2606.31394#bib.bib200)\)\. Although the model was trained exclusively on organelle markers without cell death labels, all three layers significantly predicted cell death rates across mutation lines \(Fig\. 2 \(d\)\)\.
Notably,fstage5outf\_\{stage5\_\{o\}ut\}yielded the highest predictive performance \(cross\-validationR2R^\{2\}up to 0\.52 for*SNCA×3\\times 3*, permutationp<0\.01p<0\.01\), making it the optimal input layer for the subsequent SAE training\. In contrast, the finalfrefineoutf\_\{refine\_\{o\}ut\}layer exhibited dimensional collapsePapyanet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib63)\), marked by sharp declines in both predictiveR2R^\{2\}and eRankRoy and Vetterli \([2007](https://arxiv.org/html/2606.31394#bib.bib170)\)\(Fig\. 2\(d\), \(e\)\)\. UMAP projections offstage5outf\_\{stage5\_\{o\}ut\}revealed distinct class\-specific topologies, where the implicitly encoded biological information manifested as a continuous gradient of cell death \(Fig\. 2\(f\), \(h\)\. This robust prediction of*SNCA×3\\times 3*\-associated cell death aligns with establishedα\\alpha\-synuclein pathology in Parkinson’s diseaseSingletonet al\.\([2003](https://arxiv.org/html/2606.31394#bib.bib273)\); Chartier\-Harlinet al\.\([2004](https://arxiv.org/html/2606.31394#bib.bib274)\); Ludtmannet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib276)\)Finally, because CNN feature magnitudes are susceptible to spatial artifacts such as cell density variations, we appliedL2L\_\{2\}\-normalization to all feature vectors prior to subsequent analyses \(Justifications in Appendix A\)\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x2.png)
Figure 2:\(a, b\)Linear classification performance and confusion matrix of CNN\.\(c\)CKA representational similarity between independent models showing robust convergence\.\(d–e\)Ridge regression prediction of cell death rates and effective rank analysis using the final three CNN layer representations\.\(f–h\)UMAP projection of CNNfstage5\_outf\_\{\\text\{stage5\\\_out\}\}\. Panels\(f–g\)are color\-coded by class, showing mutation classes in\(g–h\), while\(h\)is colored by cell death rate\. Statistical significance in \(d\) was assessed using a two\-sided Wilcoxon signed\-rank test, with each cross\-validation fold treated as an independent observation\. \(\*p < 0\.05, \*\*p < 0\.01, \*\*\*p < 0\.001\)
## 4Disentanglement of Superposition: Interpretability and Geometric Fidelity
### 4\.1SAE Superposition Disentanglement
To investigate the resolution of concept superposition, we swept Gated SAE dictionary sizes \(dd\) and sparsity penalties \(λ\\lambda\) \(Table 1\)\. We observed a clear Pareto trade\-off between reconstruction fidelity \(FVU\), linear probe accuracy, andL0L\_\{0\}sparsity\. As a baseline proxy for the CNN, low\-dimensional, low\-sparsity SAE was trained\. Scalingddandλ\\lambdagenerally decreased pairwise cosine similarities \(inner products\) between concept vectors while increasing the number of active feature maps, confirming the unraveling of dense CNN entanglement\.
Table 1:Evaluation of Gated SAE scaling behavior across dictionary sizes \(dd\) and sparsity penalties \(λ\\lambda\)\. Alive Feat\. denotes the number of active feature maps\.ModelDimensionλ\\lambda𝑳𝟎L\_\{0\}Alive Feat\.FVUAccuracyInner ProductCNN512N/AN/A512N/A90\.31N/ACNN Proxy600504\.95870\.3489\.240\.160Gated SAE10248002\.710210\.7388\.480\.104Gated SAE20488004\.220350\.5389\.630\.089Gated SAE40968007\.639700\.4089\.520\.082Gated SAE81928009\.837330\.3089\.910\.104Gated SAE409620013\.238500\.2489\.520\.099Gated SAE40968007\.639700\.4089\.880\.082Gated SAE409632002\.938660\.7288\.230\.052![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x3.png)
Figure 3:The eRank consistently increases from the baseline CNN \(fstage5outf\_\{stage5\_\{o\}ut\}\) as the SAE dictionary size scales under a fixedλ\\lambdaregime\. This trend is robustly observed across raw latent vectors, PCA projections, and PCA on standardized features \(left to right\)\. SD denotes standard deviation\.
Furthermore, increasingddunder a fixedλ\\lambdaconsistently expanded the effective rank \(eRank\) relative to the CNN baseline across raw and PCA\-reduced latent spaces \(Fig\. 3\)\. Although an increase in inner product magnitude was observed atd=8192d=8192, increasingddunder a fixedλ\\lambdaconsistently expanded the eRank relative to the CNN baseline\. This eRank expansion quantitatively demonstrates that polysemantic concepts, previously packed into overlapping dimensions, are successfully disentangled into independent directions\.
### 4\.2Intrinsic Interpretability Bypassing the Manifold\-Attribution Paradox
We used an SAE withλ=3200\\lambda=3200for interpretability analyses\. While conventional CNN feature maps polysemantically co\-activate across distinct concepts \(e\.g\., simultaneously firing for mitochondria and lysosomes\), the SAE effectively isolates these into monosemantic activations representing specific, human\-interpretable biological structures \(e\.g\., nuclear envelopes\), confirming successful superposition disentanglement \(Fig\. 4\(a\)\)\. This result indicates that superposition fundamentally compromises the reliability of spatial aggregation metrics such as GAP\. Because polysemantic CNN feature maps collapse distinct biological structures into a single activation vector, high GAP scores cannot be reliably attributed to specific causal features, thereby undermining post\-hoc attribution methods like Grad\-CAM\. Conversely, the monosemanticity achieved by the SAE transforms GAP into a trustworthy quantifier of isolated concepts\. Consequently, the SAE yields significantly more class\-specific feature maps than the CNN baseline \(Fig\. 4\(b\)\), establishing a rigorous link between classes and specific visual characteristics\. This intrinsic interpretability obviates the need for post\-hoc feature attribution, and effectively bypasses the manifold\-attribution paradox\.
Theoretically, in the dimensionality bottleneck regime, networks pack disparate concepts into the same direction, which severely inflates the local Lipschitz constant\. Because many post\-hoc XAI methods assume local gradient stabilityLundberg and Lee \([2017](https://arxiv.org/html/2606.31394#bib.bib164)\); Anconaet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib216)\)\(requiring a small Lipschitz constant\), heavily superposed networks inherently violate these prerequisites across class boundaries\. By resolving feature entanglement, the SAE restores local perturbation additivity, justifying the approximation of neural network units as local additive linear models\. Thus, SAE disentanglement not only provides intrinsic interpretability but also satisfies the fundamental structural prerequisites required for traditional XAI methods to function reliably\. Consequently, integrating existing XAI methods to this disentangled SAE latent space offers a framework to further improve interpretability\.
The biological fidelity of these monosemantic concepts was further validated via Activation Maximization \(AM\), which consistently aligned with human\-interpretable structures such as the perinuclear clustering of mitochondria and lysosomes \(Fig\. 4\(c\); additional images in Appendix F\.3\)\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x4.png)
Figure 4:\(a\)Visual disentanglement of superposed concepts in SAE\. Representative original images and bilinear\-interpolated attention maps\. The SAE reconstruction loss was spatially weighted by theL2L\_\{2\}norm of point\-wise feature vectorsvij∈RCv\_\{ij\}\\in\\ R^\{C\}to preserve the original CNN spatial activation magnitudes indicative of local token importance\.\(b\)\. Quantification of mutation\-specific feature maps across three pairs of healthy to three PD mutation groups via differential activation \(\|log2FC\|\>0\.58\|\{log\}\_\{2\}FC\|\>0\.58with sparsity threshold5×10−55\\times 10^\{\-5\}\.\)\(c\)\. Visual validation of mutation\-specific SAE concepts, displaying original images, spatial attention maps, and overlays \(left\), alongside AM images and corresponding attention maps \(right\)
### 4\.3Superposition\-Induced Geometric Contamination and Recovery of Semantic Fidelity
#### Theoretical Analysis: Geometric Contamination due to Superposition
In this section, we analyze the representational impact caused by superposition \(Details in Appendix B\.\)\. According to the Linear Representation HypothesisElhageet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib189)\); Parket al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib250)\), a latent representationhi∈ℝdh\_\{i\}\\in\\mathbb\{R\}^\{d\}is formed through a linear combination of learned concept vectorsw1,…,wm⊂ℝd\{w\_\{1\},\\dots,w\_\{m\}\}\\subset\\mathbb\{R\}^\{d\}, wheremmis the number of encoded concepts \(m\>dm\>d\)\. That is,hi=∑k=1mvi,kwkh\_\{i\}=\\sum\_\{k=1\}^\{m\}v\_\{i,k\}w\_\{k\}, wherevi,kv\_\{i,k\}denotes the semantic activation of conceptkk\. This dimensional bottleneck forces neural networks to strategically tolerate superposition, compressing distributed features into overlapping directions to optimize the lossTropp \([2004](https://arxiv.org/html/2606.31394#bib.bib251)\); Livezeyet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib252)\)\. This systemic geometric packing stress remains a critical bottleneck\. In real\-world data, intrinsic concepts are fundamentally non\-independent, exhibiting dense semantic correlations\. To capture this context\-dependent geometry, we model the representations as residing on a non\-linear Riemannian manifoldℳ\\mathcal\{M\}\. For any given semantic contextp∈ℳp\\in\\mathcal\{M\}, the unconstrained semantic proximity of concepts is governed by the local target Gram matrix𝐆p∈ℝm×m\\mathbf\{G\}\_\{p\}\\in\\mathbb\{R\}^\{m\\times m\}induced by the Riemannian metric tensorgpg\_\{p\}\. Let𝐖=\[w1,…,wm\]∈ℝd×m\\mathbf\{W\}=\[w\_\{1\},\\dots,w\_\{m\}\]\\in\\mathbb\{R\}^\{d\\times m\}be the dictionary matrix corresponding to the network weights \(or SAE decoder\)\. Under this localized framework, embedding these correlated concepts into a strictly lowerdd\-dimensional space is limited by the generalized Welch boundWelch \([1974](https://arxiv.org/html/2606.31394#bib.bib204)\); Waldron \([2003](https://arxiv.org/html/2606.31394#bib.bib266)\), which imposes a strict algebraic lower bound on the overall metric distortion error tensor𝐄p\(d\)=𝐖T𝐖−𝐆p\\mathbf\{E\}\_\{p\}\(d\)=\\mathbf\{W\}^\{T\}\\mathbf\{W\}\-\\mathbf\{G\}\_\{p\}:
‖𝐄p\(d\)‖F2=‖𝐖T𝐖−𝐆p‖F2≥\(Tr\(𝐆p\)−d\)2m−d\\\|\\mathbf\{E\}\_\{p\}\(d\)\\\|\_\{F\}^\{2\}=\\\|\\mathbf\{W\}^\{T\}\\mathbf\{W\}\-\\mathbf\{G\}\_\{p\}\\\|\_\{F\}^\{2\}\\geq\\frac\{\(\\text\{Tr\}\(\\mathbf\{G\}\_\{p\}\)\-d\)^\{2\}\}\{m\-d\}\(1\)
This strictly positive lower bound mathematically guarantees that the empirical dictionary geometry𝐖T𝐖\\mathbf\{W\}^\{T\}\\mathbf\{W\}cannot be free from packing artifactsTropp \([2004](https://arxiv.org/html/2606.31394#bib.bib251)\)\. In our generalized non\-independent regime, this constraint forces𝐖\\mathbf\{W\}towards a deformed, generalized ETF\-like configurationPapyanet al\.\([2020](https://arxiv.org/html/2606.31394#bib.bib63)\); Jianget al\.\([2024](https://arxiv.org/html/2606.31394#bib.bib138)\)\. Consequently, the empirical inner product inherits this geometric deformation:wkTwl=\(𝐆p\)k,l\+\(𝐄p\)k,l\(d\)w\_\{k\}^\{T\}w\_\{l\}=\(\\mathbf\{G\}p\)\{k,l\}\+\(\\mathbf\{E\}p\)\{k,l\}\(d\)\.
This directly pollutes downstream representations\. The empirical inner product between any two latent vectorshih\_\{i\}andhjh\_\{j\}within the local tangent spaceTpℳT\_\{p\}\\mathcal\{M\}expands into:
hiThj=∑k=1mvi,kvj,k\(𝐆p\)k,k\+∑k≠lvi,kvj,l\(𝐆p\)k,l⏟Pure Local Semantic Component \(Sij\)\+∑k≠lvi,kvj,l\(𝐄p\)k,l\(d\)⏟Coupled Geometric Contamination \(Gij\)h\_\{i\}^\{T\}h\_\{j\}=\\underbrace\{\\sum\_\{k=1\}^\{m\}v\_\{i,k\}v\_\{j,k\}\(\\mathbf\{G\}\_\{p\}\)\_\{k,k\}\+\\sum\_\{k\\neq l\}v\_\{i,k\}v\_\{j,l\}\(\\mathbf\{G\}\_\{p\}\)\_\{k,l\}\}\_\{\\text\{Pure Local Semantic Component \($S\_\{ij\}$\)\}\}\+\\underbrace\{\\sum\_\{k\\neq l\}v\_\{i,k\}v\_\{j,l\}\(\\mathbf\{E\}\_\{p\}\)\_\{k,l\}\(d\)\}\_\{\\text\{Coupled Geometric Contamination \($G\_\{ij\}$\)\}\}\(2\)
Since‖𝐄p\(d\)‖F2\>0\\\|\\mathbf\{E\}\_\{p\}\(d\)\\\|F^\{2\}\>0strictly holds by Eq\. \(1\), the geometric contaminationGijG\_\{ij\}cannot be nullified in a bottlenecked model\. Because theL2L\_\{2\}metric is derived from the inner product, this directly corrupts the Euclidean distances\. WithinTpℳT\_\{p\}\\mathcal\{M\}, thisL2L\_\{2\}packing distortion propagates across all localLpL\_\{p\}spaces \(p≥1p\\geq 1\) via the equivalence of norms \(C1\|h\|2≤\|h\|p≤C2\|h\|2C\_\{1\}\|h\|\_\{2\}\\leq\|h\|\_\{p\}\\leq C\_\{2\}\|h\|\_\{2\}\)\. Consequently, latent Euclidean distances fail to purely reflect true semantic distances, fundamentally corrupted by the geometric packing stress of the overcomplete mapping\. Furthermore, assuming semantic data resides on a non\-linear manifold, dimensional bottlenecks structurally preclude exact isometric embeddings, extending this geometric contamination to arbitrary metric spaces\.
SAEs circumvent this structural limitation by projecting into a higher\-dimensional space to unpack superposed concepts, thereby resolving the rank deficiency and recovering a metric space that faithfully reflects the underlying semantic similarities\. This dimensional expansion, coupled with the extreme activation sparsity of SAEs, restores orthogonality by providing sufficient white space to mathematically decouple the superposed concepts \(driving‖𝐄p\(d\)‖F2→0\\\|\\mathbf\{E\}\_\{p\}\(d\)\\\|\_\{F\}^\{2\}\\rightarrow 0\)\. Consequently, from a theoretical standpoint, the disentangled metric space of SAEs is semantically far more reliable than the geometrically contaminated latent spaces of dimensionally bottlenecked models\.
#### Empirical Validation of Geometric Contamination in Bottlenecked Representations
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x5.png)
Figure 5:Empirical validation of geometric contamination and its recovery\.\(a\)Quantification of the local semantic dispersion metric across individual mutation lines to account for confounding effects\.\(b\)Moran’sIIare evaluated for each mutation group, exposing suppressed spatial consistency in the CNN baseline\.\(c\)Predictive performance \(cross\-validationR2R^\{2\}\) of cell death rates via ridge regression\(d\)UMAP projection of the SAE latent space, colored by the continuous cell death rate\.
To empirically demonstrate that bottlenecked representations suffer from geometric contamination , we leveraged continuous, image\-specific cell death rates, an entirely unseen biological state\. If a latent space is geometrically uncorrupted, semantic neighbors should naturally exhibit highly similar functional states\. By employing the SAE to systematically relieve dimensional packing stress, we reveal this underlying distortion\.
We quantified this geometric distortion using a local semantic dispersion metric: the ratio of the k\-nearest neighbors \(kNN\) cell death SD to each mutation\-specific SD Across allkkvalues, the baseline CNNfstage5outf\_\{stage5\_\{o\}ut\}exhibited significant semantic dispersion\. Conversely, relieving the dimensional bottleneck via SAE significantly reduced this ratio across all mutation lines \(Fig\. 5\(a\)\)\. Scaling the SAE dictionary size systematically mitigated this metric, confirming that the initial semantic distortion in the CNN was indeed an artifact of geometric packing\. Spatial autocorrelation analysis via Moran’sIIcorroborated this finding\. The CNN representations exhibited artificially suppressed spatial consistency, whereas SAE dimensional expansion monotonically restored Moran’sIIscores \(Fig\. 5b\)\.
Consequently, this geometric contamination directly hindered downstream functional utility\. Ridge regression to predict cell death rates revealed that the CNN representation’s predictive capacity was artificially constrained; the SAE \(d=8192,λ=800d=8192,\\lambda=800\) consistently unlocked higher performance across all seeds and mutations \(e\.g\.,*LRRK2*predictiveR2R^\{2\}improving from 0\.288 to 0\.345; Fig\. 5\(c\)\)\. UMAP projections visually confirmed this recovered structural integrity as a smooth functional gradient \(Fig\. 5d\)\.
Collectively, these empirical findings validate our theoretical predictions: conventional dimensional bottlenecks inherently inflict severe geometric contamination, and unpacking this superposition is strictly necessary to faithfully capture the intrinsic semantic geometry of the data\.
## 5Adaptation scRNA\-seq Methodologies to Evaluate Representations
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x6.png)
Figure 6:Adaptation of single\-cell manifold algorithms to SAE representations\. All panels display pairwise comparisons between healthy controls \(gray\) and PD mutations \(*SNCA×3\\times 3*×3\\times 3: pink, GBA: orange, LRRK2: blue\)\.\(a\)PHATE embeddings revealing continuous structure\.\(b\)PAGA graphs with pie chart colors indicating the ratio of control to mutation cells in each node\.\(c\)Scatter plots and GAM fits of cell death rates of individual images against geodesic Diffusion Pseudotime \(DPT\)\.\(d\)Average cell death rates across DPT\-binned tertiles\.\(e\)Robustness of the DPT\-cell death correlation across varying hyperparameters
By effectively resolving geometric contamination and mitigating cell density variation through IN as well asL2L\_\{2\}normalization, the SAE latent space functionally parallels the high\-resolution topology of single\-cell transcriptomic profiles\. Intriguingly, theL0L\_\{0\}/L1L\_\{1\}regularization imposed on SAEs mathematically mirrors the evolutionary constraints of energy\-efficient, sparse molecular expression, yielding similarly skewed, zero\-inflated activation distributions\. Leveraging this structural similarity, we directly adapted scRNA\-seq analytical frameworks to the image domain\.
Following standard scRNA\-seq preprocessing \(log\-normalization and standardization\), PHATE revealed continuous structures \(Fig\. 6\(a\)\), while PAGA mapped the global connectivity between control and mutation states \(Fig\. 6\(b\)\)\. Biologically, cells undergoing cell death converge toward a uniform, generic phenotypic stateO’Flanaganet al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib267)\); Kopylovaet al\.\([2026](https://arxiv.org/html/2606.31394#bib.bib272)\)\. Consequently, we hypothesized that highly toxic cells collapse toward the intersecting boundary of the control and mutation manifolds\. That is, mutation cells closer to the healthy medoid exhibit higher cell death rates, whereas more distant cells remain viable\. To quantify this trajectory, we calculated geodesic distances for mutation cells via diffusion pseudotime \(DPT\) rooted at the control medoid\.
As anticipated, Generalized Additive Model \(GAM\) fitting demonstrated that cell death rates significantly decreased as cells diverged further from the control medoid across all mutations \(p<0\.001p<0\.001; Fig\. 6\(c\), \(d\)\)\. This negative correlation was robustly maintained across diverse hyperparameters \(Fig\. 6\(e\)\)\. Notably,*SNCA×3\\times 3*neurons exhibited the steepest trajectory \(ρ=−0\.513\\rho=\-0\.513\), aligning with prior literature that establishesα\\alpha\-synuclein pathology as the primary driver of progressive cellular toxicityLudtmannet al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib276)\)\. \(Appendix F\.1 for trajectory reliability\)
Taken together, these findings suggest that SAE\-decontaminated image vectors can be reliably deployed as transcriptomic\-like profiles, successfully bridging non\-linear scRNA\-seq analytical frameworks with high\-content biological imaging\. Crucially, translating high\-content cellular images into single\-cell state biological vectors circumvents chronic scRNA\-seq artifacts—such as stochastic dropoutsStegleet al\.\([2015](https://arxiv.org/html/2606.31394#bib.bib217)\); Hickset al\.\([2018](https://arxiv.org/html/2606.31394#bib.bib218)\), doubletsWolocket al\.\([2019](https://arxiv.org/html/2606.31394#bib.bib219)\), and cell\-type biasesvan den Brinket al\.\([2017](https://arxiv.org/html/2606.31394#bib.bib220)\)while offering unprecedented cost and time efficiency\.
## 6Cross\-Modal Integration of Image and scRNA\-seq Data via GW\-map
Because cellular images and single\-cell transcriptomes are multimodal projections of the identical underlying biological state, their intrinsic geometric structures should be preserved irrespective of data modality\. Given that SAE disentanglement resolves geometric contamination, we applied GW\-map to couple these image representations with authentic scRNA\-seq data\. This coupling successfully bridged the modalities, achieving label transfer accuracies exceeding random chance \(50%\) for both strictly one\-to\-one and probabilistic barycentric matchings \(barycentric accuracy\>70%\>70\\%; Fig\. 8\(a\)\)\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x7.png)
Figure 7:Cross\-modal alignment via GW\-map\.\(a\)Label transfer accuracy of unconstrained GW coupling\. One\-to\-one accuracy measures exact maximum\-probability matches; barycentric accuracy evaluates probability\-weighted matches\.\(b\)Predictive performance \(R2R^\{2\}\) of XGBoost and MLP predicting scVI latent dimensions, standard log\-transformed genes expressions, and scVI\-denoised expressions directly from image representations\.\(c\)Gene\-specific predictability ranked byZZ\-score \(ZSNCA×3Z\_\{SNCA\\times 3\}of standard log\-normalized expression\), highlighting top/bottom 10 genes\.\(d\)2D predictability landscape of the top 200ZSNCA×3Z\_\{SNCA\\times 3\}genes reconstructing biological functional modules\.
To evaluate this alignment while accounting for genotype\-specific confounding factors, we constrained couple vectors exclusively within identical biological classes \(intra\-class coupling\)\. We then trained simple non\-linear regressors \(XGBoost and Multilayer Perceptrons\) to predict transcriptomic profiles directly from the SAE image representations\. Both models achieved remarkably high predictive fidelity \(R2R^\{2\}\) across scVI latent dimensions, standard log\-normalized expressions, and scVI\-denoised expressions, significantly outperforming permutation null baselines \(mean permutationR2<0\.1R^\{2\}<0\.1\)\. Notably, MLP\-based prediction of scVI\-denoised genes yieldedR2R^\{2\}values exceeding 0\.7 across both classes \(Fig\. 8\(b\)\)\.
If this alignment is biologically faithful, the capacity to predict a specific gene’s expression from an image should proportionally reflect the gene’s pathological relevance\. To test this hypothesis while preventing variance\-induced inflation ofR2R^\{2\}values, we quantified gene\-specific predictability usingZZ\-scores derived from the permutation null distributions\. Ranked by predictability \(ZSNCA×3Z\_\{SNCA\\times 3\}of standard log\-normalized expression\), the top genes included members of the ’axon guidance’ cluster \(e\.g\.,*EFNA5*,*DAB1*\) intimately associated withα\\alpha\-synuclein pathological progressionDijkstraet al\.\([2015](https://arxiv.org/html/2606.31394#bib.bib277)\)\(Fig\. 8\(c\)\)\. Conversely, the least predictable genes \(e\.g\.,*S100A11*,*PRSS56*,*FNDC1*,*TM4SF4*\) were non\-specific markers of tissue damage or inflammation lacking distinct morphological signatures\.
Furthermore, because genes operate within co\-regulated networks, plotting genes in a \(ZControlZ\_\{Control\},ZSNCA×3Z\_\{SNCA\\times 3\}\) predictability coordinate space would reveal a pathway\-specific topography\. Projecting the top 200ZSNCA×3Z\_\{SNCA\\times 3\}genes onto a 2D landscape \(Fig\. 8\(d\)\) unveiled an exceptional alignment with*SNCA×3\\times 3*mutation\-associated neuronal architecture\. Unsupervised Leiden clustering within this predictability space precisely reconstructed functional biological modules, notably isolating ’Presynaptic Architecture’ and the adjacent ’Calcium\-AIS Scaffold’ pathways\.
Taken together, these results demonstrate the high fidelity of GW\-map and reveals that resolving geometric contamination via SAE allows imaging to integrate with authentic single\-cell transcriptomics\.
## 7Discussion
Superposition is critical to address for two fundamental reasons\. First, from an interpretability perspective, disentangling superposition allows for the direct interpretation of internal network representations, bypassing the manifold\-attribution paradox\. Moreover, while superposed networks exhibit continuous rotational symmetry, permitting arbitrary coarse isometric embeddings, compressed sensing theory demonstrates that explicit sparsity constraints break this continuous symmetryElhageet al\.\([2022](https://arxiv.org/html/2606.31394#bib.bib189)\); Donoho and Elad \([2003](https://arxiv.org/html/2606.31394#bib.bib224)\); Donoho \([2006](https://arxiv.org/html/2606.31394#bib.bib223)\)\. This mathematically aligns the representational axes with objective biological bases, enabling data\-aligned interpretation\.
Second, while numerous studies on superposition are predominantly focused on interpretability, we revealed that superposition fundamentally corrupts the representational metric spaece\. Given the importance of constructing representations that capture the underlying semantics, geometric contamination deserves further attention\. By recovering this geometric fidelity, we successfully adapted scRNA\-seq analytical frameworks directly to image domain\. Given the structural similarity, scRNA\-seq methods such as high\-dimensional visualization, and gene co\-expression networks can be directly adapted\. Finally, we aligned SAE representations with authentic scRNA\-seq data via GW\-map with high biological plausibility\. This establishes a powerful new paradigm for scalable spatial biology while corroborating that SAE representations faithfully reflect the intrinsic semantics of data\.
#### Limitations and Future Directions
First, while we demonstrated this framework using contrastive CNN representations, further investigation is required to validate our framework in other architectures, such as Vision Transformers \(ViTs\)Dosovitskiyet al\.\([2021](https://arxiv.org/html/2606.31394#bib.bib147)\)\. Second, future studies should validate whether this geometric corruption and subsequent SAE\-based recovery hold consistently across diverse data modalities and distinct clinical domains\. Finally, while our GW\-map framework provides a powerful*de novo*cross\-modal alignment, both training overcomplete SAE dictionaries and computing Gromov\-Wasserstein optimal transport incur additional computational overhead\.
Acknowledgments
This work was supported by the National Research Foundation of Korea, funded by the Korean government \(RS\-2023\-00266872 to M\.L\.C\. and D\.K\.; RS\-2024\-00343012 to M\.L\.C\.; RS\-2025\-00521226 to D\.K\.\)\. This work was supported by the InnoCORE program of the Ministry of Science and ICT \(M\.L\.C\.; N10260004\)\. M\.L\.C\. and D\.K\. acknowledge support from the KAIST – Formosa R&D Center\.
Author Contributions
J\.P\. and M\.L\.C\. conceived and designed the study\. J\.P\. developed the software code, performed the experiments, and wrote the original draft of the manuscript\. M\.L\.C\. supervised the study, wrote, and edited the manuscript\. S\.K\. performed the data analysis and generated the figures\. D\.Y\. conducted the cellular experiments\. E\.L\., S\.C\., W\.C\., and I\.C\. contributed to the data analysis\. J\.E\. provided critical intellectual input\. D\.K\. contributed to the conceptualization of the study, provided critical input, and edited the manuscript\. S\.G\. provided the data sources and edited the manuscript\. All authors reviewed and approved the final version of the manuscript\.
Conflicts of Interests
The authors declare no conflicts of interest\.
Data Availability Statement
The image dataset has been deposited in Zenodo, with a preview subset available immediately, and the full dataset will be released upon publication of the manuscript\.
## References
- M\. Ancona, E\. Ceolini, C\. Öztireli, and M\. Gross \(2018\)Towards better understanding of gradient\-based attribution methods for deep neural networks\.InInternational Conference on Learning Representations \(ICLR\),Vancouver, Canada\(en\)\.Cited by:[§4\.2](https://arxiv.org/html/2606.31394#S4.SS2.p2.1)\.
- C\. Anders, P\. Pasliev, A\. Dombrowski, K\. Müller, and P\. Kessel \(2020\)Fairwashing explanations with off\-manifold detergent\.InProceedings of the 37th International Conference on Machine Learning,pp\. 314–323\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v119/anders20a.html)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- A\. Aravindkumar, M\. Ramadoss, S\. A\. Fakhruddin Ahmed, V\. Sampath, and K\. Lakshminarayanan \(2026\)Explainable AI in healthcare: a systematic review of XAI use cases in imaging, diagnostics, and rehabilitation\.Frontiers in Artificial Intelligence9\(English\)\.External Links:ISSN 2624\-8212,[Link](https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2026.1749527/full),[Document](https://dx.doi.org/10.3389/frai.2026.1749527)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- B\. Bilodeau, N\. Jaques, P\. W\. Koh, and B\. Kim \(2024\)Impossibility theorems for feature attribution\.Proceedings of the National Academy of Sciences121\(2\),pp\. e2304406120\.External Links:[Link](https://www.pnas.org/doi/10.1073/pnas.2304406120),[Document](https://dx.doi.org/10.1073/pnas.2304406120)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- W\. Brendel and M\. Bethge \(2019\)Approximating CNNs with Bag\-of\-local\-Features models works surprisingly well on ImageNet\.InInternational Conference on Learning Representations,New Orleans\(en\)\.External Links:[Link](https://openreview.net/forum?id=SkfMWhAqYQ)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- T\. Bricken, A\. Templeton, J\. Batson, B\. Chen, A\. Jermyn, T\. Conerly, and N\. L\. Turner \(2023\)Towards Monosemanticity: Decomposing Language Models With Dictionary Learning\.Transformer Circuits Thread\.External Links:[Link](https://transformer-circuits.pub/2023/monosemantic-features)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1),[§2\.2](https://arxiv.org/html/2606.31394#S2.SS2.p1.4)\.
- T\. Burgert, O\. Stoll, P\. Rota, and B\. Demir \(2025\)ImageNet\-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression\.InAdvances in Neural Information Processing Systems,Vol\.38\.Note:Version Number: 5External Links:[Link](https://arxiv.org/abs/2509.20234),[Document](https://dx.doi.org/10.48550/ARXIV.2509.20234)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- J\. C\. Caicedo, C\. McQuin, A\. Goodman, S\. Singh, and A\. E\. Carpenter \(2018\)Weakly Supervised Learning of Single\-Cell Feature Embeddings\.Proceedings\. IEEE Computer Society Conference on Computer Vision and Pattern Recognition2018,pp\. 9309–9318\(eng\)\.External Links:ISSN 1063\-6919,[Document](https://dx.doi.org/10.1109/CVPR.2018.00970)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p2.1)\.
- B\. Chai, C\. Efstathiou, H\. Yue, and V\. M\. Draviam \(2024\)Opportunities and challenges for deep learning in cell dynamics research\.Trends in Cell Biology34\(11\),pp\. 955–967\(English\)\.External Links:ISSN 0962\-8924, 1879\-3088,[Link](https://www.cell.com/trends/cell-biology/abstract/S0962-8924(23)00228-3),[Document](https://dx.doi.org/10.1016/j.tcb.2023.10.010)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p2.1)\.
- M\. Chartier\-Harlin, J\. Kachergus, C\. Roumier, V\. Mouroux, X\. Douay, S\. Lincoln, C\. Levecque, L\. Larvor, J\. Andrieux, M\. Hulihan, N\. Waucquier, L\. Defebvre, P\. Amouyel, M\. Farrer, and A\. Destée \(2004\)Alpha\-synuclein locus duplication as a cause of familial Parkinson’s disease\.Lancet364\(9440\),pp\. 1167–1169\(eng\)\.External Links:ISSN 1474\-547X,[Document](https://dx.doi.org/10.1016/S0140-6736%2804%2917103-1)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p2.10)\.
- C\. Chen, O\. Li, D\. Tao, A\. Barnett, C\. Rudin, and J\. K\. Su \(2019\)This Looks Like That: Deep Learning for Interpretable Image Recognition\.InAdvances in Neural Information Processing Systems,Vol\.32\.External Links:[Link](https://proceedings.neurips.cc/paper/2019/hash/adf7ee2dcf142b0e11888e72b43fcb75-Abstract.html)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- T\. Chen and C\. Guestrin \(2016\)XGBoost: A Scalable Tree Boosting System\.Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining\.External Links:[Link](https://dl.acm.org/doi/10.1145/2939672.2939785)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p1.3)\.
- Y\. Chu, G\. A\. Morfini, L\. B\. Langhamer, Y\. He, S\. T\. Brady, and J\. H\. Kordower \(2012\)Alterations in axonal transport motor proteins in sporadic and experimental Parkinson’s disease\.Brain: A Journal of Neurology135\(Pt 7\),pp\. 2058–2073\(eng\)\.External Links:ISSN 1460\-2156,[Document](https://dx.doi.org/10.1093/brain/aws133)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p2.2)\.
- K\. D’Sa, J\. R\. Evans, G\. S\. Virdi, G\. Vecchi, A\. Adam, O\. Bertolli, J\. Fleming, H\. Chang, C\. Leighton, M\. H\. Horrocks, D\. Athauda, M\. L\. Choi, and S\. Gandhi \(2023\)Prediction of mechanistic subtypes of Parkinson’s using patient\-derived stem cell models\.Nature Machine Intelligence5\(8\),pp\. 933–946\(en\)\.External Links:ISSN 2522\-5839,[Link](https://www.nature.com/articles/s42256-023-00702-9),[Document](https://dx.doi.org/10.1038/s42256-023-00702-9)Cited by:[§E\.1](https://arxiv.org/html/2606.31394#A5.SS1.SSS0.Px1.p1.1),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- P\. Demetci, R\. Santorella, B\. Sandstede, W\. S\. Noble, and R\. Singh \(2022\)SCOT: Single\-Cell Multi\-Omics Alignment with Optimal Transport\.Journal of Computational Biology: A Journal of Computational Molecular Cell Biology29\(1\),pp\. 3–18\(eng\)\.External Links:ISSN 1557\-8666,[Document](https://dx.doi.org/10.1089/cmb.2021.0446)Cited by:[§1\.3](https://arxiv.org/html/2606.31394#S1.SS3.p2.1)\.
- A\. A\. Dijkstra, A\. Ingrassia, R\. X\. d\. Menezes, R\. E\. v\. Kesteren, A\. J\. M\. Rozemuller, P\. Heutink, and W\. D\. J\. v\. d\. Berg \(2015\)Evidence for Immune Response, Axonal Dysfunction and Reduced Endocytosis in the Substantia Nigra in Early Stage Parkinson’s Disease\.PLOS ONE10\(6\),pp\. e0128651\(en\)\.External Links:ISSN 1932\-6203,[Link](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0128651),[Document](https://dx.doi.org/10.1371/journal.pone.0128651)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4),[§6](https://arxiv.org/html/2606.31394#S6.p3.4)\.
- D\.L\. Donoho \(2006\)Compressed sensing\.IEEE Transactions on Information Theory52\(4\),pp\. 1289–1306\(en\)\.External Links:ISSN 0018\-9448,[Link](http://ieeexplore.ieee.org/document/1614066/),[Document](https://dx.doi.org/10.1109/TIT.2006.871582)Cited by:[§7](https://arxiv.org/html/2606.31394#S7.p1.1)\.
- D\. L\. Donoho and M\. Elad \(2003\)Optimally sparse representation in general \(nonorthogonal\) dictionaries via ℓ1\{\}^\{\\textrm\{1\}\}minimization\.Proceedings of the National Academy of Sciences100\(5\),pp\. 2197–2202\(en\)\.External Links:ISSN 0027\-8424, 1091\-6490,[Link](https://pnas.org/doi/full/10.1073/pnas.0437847100),[Document](https://dx.doi.org/10.1073/pnas.0437847100)Cited by:[§7](https://arxiv.org/html/2606.31394#S7.p1.1)\.
- A\. Dosovitskiy, L\. Beyer, A\. Kolesnikov, D\. Weissenborn, X\. Zhai, T\. Unterthiner, M\. Dehghani, M\. Minderer, G\. Heigold, S\. Gelly, J\. Uszkoreit, and N\. Houlsby \(2021\)An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale\.In9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3\-7, 2021,External Links:[Link](https://openreview.net/forum?id=YicbFdNTTy)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1),[§7](https://arxiv.org/html/2606.31394#S7.SS0.SSS0.Px1.p1.1)\.
- N\. Elhage, T\. Hume, C\. Olsson, N\. Schiefer, T\. Henighan, and S\. Kravec \(2022\)Toy Models of Superposition\.Transformer Circuits Thread\.External Links:[Link](https://transformer-circuits.pub/2022/toy_model/index.html)Cited by:[§B\.1](https://arxiv.org/html/2606.31394#A2.SS1.p1.3),[§B\.1](https://arxiv.org/html/2606.31394#A2.SS1.p4.3),[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1),[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14),[§7](https://arxiv.org/html/2606.31394#S7.p1.1)\.
- N\. Elhage, N\. Nanda, C\. Olsson, T\. Henighan, N\. Joseph, B\. Mann, A\. Askell, and Y\. Bai \(2021\)A Mathematical Framework for Transformer Circuits\.Transformer Circuits Thread\.External Links:[Link](https://transformer-circuits.pub/2021/framework/index.html)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- D\. Erhan, Y\. Bengio, A\. Courville, and P\. Vincent \(2009\)Visualizing Higher\-Layer Features of a Deep Network\.Technical Report 1341, University of Montreal\(en\)\.Cited by:[§E\.9](https://arxiv.org/html/2606.31394#A5.SS9.SSS0.Px3.p1.1)\.
- C\. Frye, C\. Rowat, and I\. Feige \(2020\)Asymmetric Shapley values: incorporating causal knowledge into model\-agnostic explainability\.InAdvances in Neural Information Processing Systems,Vol\.33,pp\. 1229–1239\.External Links:[Link](https://proceedings.neurips.cc/paper/2020/hash/0d770c496aa3da6d2c3f2bd19e7b9d6b-Abstract.html)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- B\. Guo, W\. Ling, S\. H\. Kwon, P\. Panwar, S\. Ghazanfar, K\. Martinowich, and S\. C\. Hicks \(2025\)Integrating Spatially\-Resolved Transcriptomics Data Across Tissues and Individuals: Challenges and Opportunities\.Small Methods9\(5\),pp\. 2401194\(en\)\.Note:\_eprint: https://onlinelibrary\.wiley\.com/doi/pdf/10\.1002/smtd\.202401194External Links:ISSN 2366\-9608,[Link](https://onlinelibrary.wiley.com/doi/abs/10.1002/smtd.202401194),[Document](https://dx.doi.org/10.1002/smtd.202401194)Cited by:[§1\.3](https://arxiv.org/html/2606.31394#S1.SS3.p2.1)\.
- G\. Gut, M\. D\. Tadmor, D\. Pe’er, L\. Pelkmans, and P\. Liberali \(2015\)Trajectories of cell\-cycle progression from fixed cell populations\.Nature Methods12\(10\),pp\. 951–954\(en\)\.External Links:ISSN 1548\-7091, 1548\-7105,[Link](https://www.nature.com/articles/nmeth.3545),[Document](https://dx.doi.org/10.1038/nmeth.3545)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p1.6)\.
- L\. Haghverdi, M\. Büttner, F\. A\. Wolf, F\. Buettner, and F\. J\. Theis \(2016\)Diffusion pseudotime robustly reconstructs lineage branching\.Nature Methods13\(10\),pp\. 845–848\(en\)\.External Links:ISSN 1548\-7091, 1548\-7105,[Link](https://www.nature.com/articles/nmeth.3971),[Document](https://dx.doi.org/10.1038/nmeth.3971)Cited by:[§2\.3](https://arxiv.org/html/2606.31394#S2.SS3.p1.1)\.
- B\. He, L\. Bergenstråhle, L\. Stenbeck, A\. Abid, A\. Andersson, Å\. Borg, J\. Maaskola, J\. Lundeberg, and J\. Zou \(2020a\)Integrating spatial gene expression and breast tumour morphology via deep learning\.Nature Biomedical Engineering4\(8\),pp\. 827–834\(en\)\.External Links:ISSN 2157\-846X,[Link](https://www.nature.com/articles/s41551-020-0578-x),[Document](https://dx.doi.org/10.1038/s41551-020-0578-x)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p1.6),[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4)\.
- K\. He, H\. Fan, Y\. Wu, S\. Xie, and R\. Girshick \(2020b\)Momentum Contrast for Unsupervised Visual Representation Learning\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,pp\. 9729–9738\.External Links:[Link](https://openaccess.thecvf.com/content_CVPR_2020/html/He_Momentum_Contrast_for_Unsupervised_Visual_Representation_Learning_CVPR_2020_paper.html),[Document](https://dx.doi.org/10.1109/CVPR42600.2020.00975)Cited by:[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- S\. C\. Hicks, F\. W\. Townes, M\. Teng, and R\. A\. Irizarry \(2018\)Missing data and technical variability in single\-cell RNA\-sequencing experiments\.Biostatistics19\(4\),pp\. 562–578\.External Links:ISSN 1465\-4644,[Link](https://doi.org/10.1093/biostatistics/kxx053),[Document](https://dx.doi.org/10.1093/biostatistics/kxx053)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p4.1)\.
- X\. Huang and J\. Marques\-Silva \(2024\)On the failings of Shapley values for explainability\.International Journal of Approximate Reasoning171,pp\. 109112\.External Links:ISSN 0888\-613X,[Link](https://www.sciencedirect.com/science/article/pii/S0888613X23002438),[Document](https://dx.doi.org/10.1016/j.ijar.2023.109112)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- J\. Jiang, J\. Zhou, P\. Wang, Q\. Qu, D\. G\. Mixon, C\. You, and Z\. Zhu \(2024\)Generalized neural collapse for a large number of classes\.InProceedings of the 41st International Conference on Machine Learning,ICML’24, Vol\.235,Vienna, Austria,pp\. 22010–22041\.Cited by:[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p3.3)\.
- R\. Jiang, T\. Sun, D\. Song, and J\. J\. Li \(2022\)Statistics or biology: the zero\-inflation controversy about scRNA\-seq data\.Genome Biology23\(1\),pp\. 31\(en\)\.External Links:ISSN 1474\-760X,[Link](https://doi.org/10.1186/s13059-022-02601-5),[Document](https://dx.doi.org/10.1186/s13059-022-02601-5)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p2.1)\.
- Y\. Jin, F\. Li, Z\. Li, T\. C\. Ikezu, J\. O’Leary, M\. Selvaraj, Y\. Zhu, Y\. A\. Martens, S\. Koga, H\. Santhakumar, Y\. Li, W\. Lu, Y\. You, K\. Lolo, M\. DeTure, A\. I\. Beasley, M\. D\. Davis, P\. J\. McLean, O\. A\. Ross, T\. Kanekiyo, T\. Ikezu, T\. Caulfield, J\. Carr, Z\. K\. Wszolek, G\. Bu, D\. W\. Dickson, and N\. Zhao \(2024\)Modeling Lewy body disease withSNCAtriplication iPSC\-derived cortical organoids and identifying therapeutic drugs\.Science Advances10\(37\),pp\. eadk3700\(en\)\.External Links:ISSN 2375\-2548,[Link](https://www.science.org/doi/10.1126/sciadv.adk3700),[Document](https://dx.doi.org/10.1126/sciadv.adk3700)Cited by:[§E\.8](https://arxiv.org/html/2606.31394#A5.SS8.SSS0.Px1.p1.1),[§2\.4](https://arxiv.org/html/2606.31394#S2.SS4.p1.3)\.
- S\. S\. Kang, Z\. Zhang, X\. Liu, F\. P\. Manfredsson, M\. J\. Benskey, X\. Cao, J\. Xu, Y\. E\. Sun, and K\. Ye \(2017\)TrkB neurotrophic activities are blocked by α\-synuclein, triggering dopaminergic cell death in Parkinson’s disease\.Proceedings of the National Academy of Sciences of the United States of America114\(40\),pp\. 10773–10778\(eng\)\.External Links:ISSN 1091\-6490,[Document](https://dx.doi.org/10.1073/pnas.1713969114)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p2.2)\.
- M\. Khan, S\. Arslanturk, and S\. Draghici \(2025\)A comprehensive review of spatial transcriptomics data alignment and integration\.Nucleic Acids Research53\(12\),pp\. gkaf536\(eng\)\.External Links:ISSN 1362\-4962,[Document](https://dx.doi.org/10.1093/nar/gkaf536)Cited by:[§1\.3](https://arxiv.org/html/2606.31394#S1.SS3.p2.1)\.
- P\. Khosla, P\. Teterwak, C\. Wang, A\. Sarna, Y\. Tian, P\. Isola, A\. Maschinot, C\. Liu, and D\. Krishnan \(2020\)Supervised Contrastive Learning\.InAdvances in Neural Information Processing Systems,Vol\.33,pp\. 18661–18673\.External Links:[Link](https://proceedings.neurips.cc/paper/2020/hash/d89a66c7c80a29b1bdbab0f2a1a94af8-Abstract.html)Cited by:[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- B\. Kim, M\. Wattenberg, J\. Gilmer, C\. Cai, J\. Wexler, F\. Viegas, and R\. Sayres \(2018\)Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors \(TCAV\)\.InProceedings of the 35th International Conference on Machine Learning,pp\. 2668–2677\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v80/kim18d.html)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- P\. W\. Koh, T\. Nguyen, Y\. S\. Tang, S\. Mussmann, E\. Pierson, B\. Kim, and P\. Liang \(2020\)Concept Bottleneck Models\.InProceedings of the 37th International Conference on Machine Learning,pp\. 5338–5348\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v119/koh20a.html)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- I\. V\. Kopylova, A\. B\. Ivanov, L\. R\. Eidelman, E\. N\. Zaitseva, E\. D\. Kulikova, D\. A\. Grehnyov, A\. N\. Bogomazova, V\. A\. Vigont, E\. V\. Kaznacheyeva, M\. A\. Lagarkova, O\. S\. Lebedeva, and E\. I\. Olekhnovich \(2026\)Convergent transcriptomic signature in iPSC\-dopaminergic neurons of hereditary Parkinson’s disease\.Life Science Alliance9\(7\) \(en\)\.External Links:ISSN 2575\-1077,[Link](https://www.life-science-alliance.org/content/9/7/e202503551),[Document](https://dx.doi.org/10.26508/lsa.202503551)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p2.1)\.
- S\. Kornblith, M\. Norouzi, H\. Lee, and G\. Hinton \(2019\)Similarity of Neural Network Representations Revisited\.InProceedings of the 36th International Conference on Machine Learning,pp\. 3519–3529\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v97/kornblith19a.html)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p1.3)\.
- S\. Krishna, T\. Han, A\. Gu, S\. Wu, S\. Jabbari, and H\. Lakkaraju \(2024\)The Disagreement Problem in Explainable Machine Learning: A Practitioner’s Perspective\.Transactions on Machine Learning Research\.External Links:ISSN 2835\-8856,[Link](https://openreview.net/forum?id=jESY2WTZCe)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- Y\. Lecun, L\. Bottou, Y\. Bengio, and P\. Haffner \(1998\)Gradient\-based learning applied to document recognition\.Proceedings of the IEEE86\(11\),pp\. 2278–2324\.External Links:ISSN 1558\-2256,[Link](https://ieeexplore.ieee.org/document/726791),[Document](https://dx.doi.org/10.1109/5.726791)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- C\. Leterrier \(2018\)The Axon Initial Segment: An Updated Viewpoint\.The Journal of Neuroscience: The Official Journal of the Society for Neuroscience38\(9\),pp\. 2135–2145\(eng\)\.External Links:ISSN 1529\-2401,[Document](https://dx.doi.org/10.1523/JNEUROSCI.1922-17.2018)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p2.2)\.
- J\. A\. Livezey, A\. F\. Bujan, and F\. T\. Sommer \(2019\)Learning Overcomplete, Low Coherence Dictionaries with Linear Inference\.Journal of Machine Learning Research20\(174\),pp\. 1–42\.External Links:ISSN 1533\-7928,[Link](http://jmlr.org/papers/v20/18-703.html)Cited by:[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14)\.
- M\. A\. E\. Lohrum, R\. L\. Ludwig, M\. H\. G\. Kubbutat, M\. Hanlon, and K\. H\. Vousden \(2003\)Regulation of HDM2 activity by the ribosomal protein L11\.Cancer Cell3\(6\),pp\. 577–587\(eng\)\.External Links:ISSN 1535\-6108,[Document](https://dx.doi.org/10.1016/s1535-6108%2803%2900134-x)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p3.1)\.
- A\. Longon, D\. Klindt, and M\. Khosla \(2026\)Superposition disentanglement of neural representations reveals hidden alignment\.InProceedings of UniReps: the Third Edition of the Workshop on Unifying Representations in Neural Models,pp\. 342–359\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v322/longon26a.html)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- R\. Lopez, J\. Regier, M\. B\. Cole, M\. I\. Jordan, and N\. Yosef \(2018\)Deep generative modeling for single\-cell transcriptomics\.Nature Methods15\(12\),pp\. 1053–1058\(en\)\.External Links:ISSN 1548\-7105,[Link](https://www.nature.com/articles/s41592-018-0229-2),[Document](https://dx.doi.org/10.1038/s41592-018-0229-2)Cited by:[§2\.4](https://arxiv.org/html/2606.31394#S2.SS4.p1.3)\.
- M\. H\. R\. Ludtmann, P\. R\. Angelova, M\. H\. Horrocks, M\. L\. Choi, M\. Rodrigues, A\. Y\. Baev, A\. V\. Berezhnov, Z\. Yao, D\. Little, B\. Banushi, A\. S\. Al\-Menhali, R\. T\. Ranasinghe, D\. R\. Whiten, R\. Yapom, K\. S\. Dolt, M\. J\. Devine, P\. Gissen, T\. Kunath, M\. Jaganjac, E\. V\. Pavlov, D\. Klenerman, A\. Y\. Abramov, and S\. Gandhi \(2018\)Α\-synuclein oligomers interact with ATP synthase and open the permeability transition pore in Parkinson’s disease\.Nature Communications9\(1\),pp\. 2293\(en\)\.External Links:ISSN 2041\-1723,[Link](https://www.nature.com/articles/s41467-018-04422-2),[Document](https://dx.doi.org/10.1038/s41467-018-04422-2)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p2.10),[§5](https://arxiv.org/html/2606.31394#S5.p3.4)\.
- S\. M\. Lundberg and S\. Lee \(2017\)A Unified Approach to Interpreting Model Predictions\.InAdvances in Neural Information Processing Systems,Vol\.30\.External Links:[Link](https://papers.nips.cc/paper_files/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1),[§4\.2](https://arxiv.org/html/2606.31394#S4.SS2.p2.1)\.
- F\. Mémoli \(2011\)Gromov–Wasserstein Distances and the Metric Approach to Object Matching\.Foundations of Computational Mathematics11\(4\),pp\. 417–487\(en\)\.External Links:ISSN 1615\-3375, 1615\-3383,[Link](http://link.springer.com/10.1007/s10208-011-9093-5),[Document](https://dx.doi.org/10.1007/s10208-011-9093-5)Cited by:[§1\.3](https://arxiv.org/html/2606.31394#S1.SS3.p2.1)\.
- K\. R\. Moon, D\. van Dijk, Z\. Wang, S\. Gigante, D\. B\. Burkhardt, W\. S\. Chen, K\. Yim, A\. v\. d\. Elzen, M\. J\. Hirn, R\. R\. Coifman, N\. B\. Ivanova, G\. Wolf, and S\. Krishnaswamy \(2019\)Visualizing structure and transitions in high\-dimensional biological data\.Nature Biotechnology37\(12\),pp\. 1482–1492\(en\)\.External Links:ISSN 1546\-1696,[Link](https://www.nature.com/articles/s41587-019-0336-3),[Document](https://dx.doi.org/10.1038/s41587-019-0336-3)Cited by:[§2\.3](https://arxiv.org/html/2606.31394#S2.SS3.p1.1)\.
- C\. Morcelle, S\. Menoyo, F\. D\. Morón\-Duran, A\. Tauler, S\. C\. Kozma, G\. Thomas, and A\. Gentilella \(2019\)Oncogenic MYC Induces the Impaired Ribosome Biogenesis Checkpoint and Stabilizes p53 Independent of Increased Ribosome Content\.Cancer Research79\(17\),pp\. 4348–4359\(eng\)\.External Links:ISSN 1538\-7445,[Document](https://dx.doi.org/10.1158/0008-5472.CAN-18-2718)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p3.1)\.
- A\. Murphy, J\. Zylberberg, and A\. Fyshe \(2024\)Correcting biased centered kernel alignment measures in biological and artificial neural networks\.ICLR 2024 Re\-Align Workshop,ICLR 2024 Re\-Align Workshop,ICLR 2024 Re\-Align Workshop\.Note:arXiv:2405\.01012 \[q\-bio\.NC\]External Links:[Link](https://arxiv.org/)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p1.3)\.
- T\. Nguyen, M\. Raghu, and S\. Kornblith \(2021\)Do wide and deep networks learn the same things? Uncovering how neural network representations vary with width and depth\.InInternational Conference on Learning Representations,Note:arXiv:2010\.15327 \[cs\.LG\]Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p1.3)\.
- C\. H\. O’Flanagan, K\. R\. Campbell, A\. W\. Zhang, F\. Kabeer, J\. L\. P\. Lim, J\. Biele, P\. Eirew, D\. Lai, A\. McPherson, E\. Kong, C\. Bates, K\. Borkowski, M\. Wiens, B\. Hewitson, J\. Hopkins, J\. Pham, N\. Ceglia, R\. Moore, A\. J\. Mungall, J\. N\. McAlpine, S\. P\. Shah, and S\. Aparicio \(2019\)Dissociation of solid tumor tissues with cold active protease for single\-cell RNA\-seq minimizes conserved collagenase\-associated stress responses\.Genome Biology20,pp\. 210\.External Links:ISSN 1474\-7596,[Link](https://pmc.ncbi.nlm.nih.gov/articles/PMC6796327/),[Document](https://dx.doi.org/10.1186/s13059-019-1830-0)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p2.1)\.
- C\. Olah, N\. Cammarata, L\. Schubert, G\. Goh, M\. Petrov, and S\. Carter \(2020\)Zoom In: An Introduction to Circuits\.Distill5\(3\),pp\. 10\.23915/distill\.00024\.001\.External Links:ISSN 2476\-0757,[Link](https://distill.pub/2020/circuits/zoom-in),[Document](https://dx.doi.org/10.23915/distill.00024.001)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- C\. Olah, A\. Mordvintsev, and L\. Schubert \(2017\)Feature Visualization\.Distill2\(11\),pp\. 10\.23915/distill\.00007\.External Links:ISSN 2476\-0757,[Link](https://distill.pub/2017/feature-visualization),[Document](https://dx.doi.org/10.23915/distill.00007)Cited by:[§E\.9](https://arxiv.org/html/2606.31394#A5.SS9.SSS0.Px3.p1.1)\.
- V\. Papyan, X\. Y\. Han, and D\. L\. Donoho \(2020\)Prevalence of neural collapse during the terminal phase of deep learning training\.Proceedings of the National Academy of Sciences117\(40\),pp\. 24652–24663\(en\)\.External Links:ISSN 0027\-8424, 1091\-6490,[Link](https://pnas.org/doi/full/10.1073/pnas.2015509117),[Document](https://dx.doi.org/10.1073/pnas.2015509117)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p2.10),[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p3.3)\.
- K\. Park, Y\. J\. Choe, and V\. Veitch \(2024\)The linear representation hypothesis and the geometry of large language models\.InProceedings of the International Conference on Machine Learning,ICML’24, Vol\.235,Vienna, Austria,pp\. 39643–39666\.Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1),[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14)\.
- G\. Peyré and M\. Cuturi \(2019\)Computational Optimal Transport: With Applications to Data Science\.Vol\.11,Now Publishers\.External Links:ISSN 1935\-8237,[Link](https://doi.org/10.1561/2200000073),[Document](https://dx.doi.org/10.1561/2200000073)Cited by:[§1\.3](https://arxiv.org/html/2606.31394#S1.SS3.p2.1)\.
- M\. Pokotylo, N\. Brüggemann, and J\. Prasuhn \(2025\)Metabolic Dysregulation in Parkinson’s Disease: Non\-Oxidative Phosphorylation and Its Role in Brain Energy Metabolism\.Aging and Disease16\(5\),pp\. 2721–2738\(eng\)\.External Links:ISSN 2152\-5250,[Document](https://dx.doi.org/10.14336/AD.2025.0619)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p3.1)\.
- M\. Raghu, T\. Unterthiner, S\. Kornblith, C\. Zhang, and A\. Dosovitskiy \(2021\)Do Vision Transformers See Like Convolutional Neural Networks?\.InAdvances in Neural Information Processing Systems,Vol\.34,pp\. 12116–12128\.External Links:[Link](https://proceedings.neurips.cc/paper/2021/hash/652cf38361a209088302ba2b8b7f51e0-Abstract.html)Cited by:[§A\.1](https://arxiv.org/html/2606.31394#A1.SS1.p3.3),[§2\.1](https://arxiv.org/html/2606.31394#S2.SS1.p1.1)\.
- S\. Rajamanoharan, A\. Conmy, L\. Smith, T\. Lieberum, V\. Varma, J\. Kramár, R\. Shah, and N\. Nanda \(2024\)Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders\.Advances in Neural Information Processing Systems37,pp\. 775–818\(en\)\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2024/hash/01772a8b0420baec00c4d59fe2fbace6-Abstract-Conference.html),[Document](https://dx.doi.org/10.52202/079017-0024)Cited by:[§2\.2](https://arxiv.org/html/2606.31394#S2.SS2.p1.4)\.
- O\. Roy and M\. Vetterli \(2007\)The effective rank: A measure of effective dimensionality\.In2007 15th European Signal Processing Conference,pp\. 606–610\.External Links:[Link](https://ieeexplore.ieee.org/abstract/document/7098875)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p2.10)\.
- T\. Schirinzi, G\. Madeo, G\. Martella, M\. Maltese, B\. Picconi, P\. Calabresi, and A\. Pisani \(2016\)Early synaptic dysfunction in Parkinson’s disease: Insights from animal models\.Movement Disorders: Official Journal of the Movement Disorder Society31\(6\),pp\. 802–813\(eng\)\.External Links:ISSN 1531\-8257,[Document](https://dx.doi.org/10.1002/mds.26620)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4),[Appendix D](https://arxiv.org/html/2606.31394#A4.p2.2)\.
- R\. R\. Selvaraju, M\. Cogswell, A\. Das, R\. Vedantam, D\. Parikh, and D\. Batra \(2017\)Grad\-cam: visual explanations from deep networks via gradient\-based localization\.InProceedings of the IEEE International Conference on Computer Vision \(ICCV\),pp\. 618–626\.Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- A\. B\. Singleton, M\. Farrer, J\. Johnson, A\. Singleton, S\. Hague, J\. Kachergus, M\. Hulihan, T\. Peuralinna, A\. Dutra, R\. Nussbaum, S\. Lincoln, A\. Crawley, M\. Hanson, D\. Maraganore, C\. Adler, M\. R\. Cookson, M\. Muenter, M\. Baptista, D\. Miller, J\. Blancato, J\. Hardy, and K\. Gwinn\-Hardy \(2003\)Alpha\-Synuclein locus triplication causes Parkinson’s disease\.Science302\(5646\),pp\. 841\(eng\)\.External Links:ISSN 1095\-9203,[Document](https://dx.doi.org/10.1126/science.1090278)Cited by:[§3](https://arxiv.org/html/2606.31394#S3.p2.10)\.
- S\. Somvanshi, M\. M\. Islam, A\. Rafe, A\. G\. Tusti, A\. Chakraborty, A\. Baitullah, T\. I\. Chowdhury, N\. Alnawmasi, A\. Dutta, and S\. Das \(2026\)Bridging the Black Box: A Survey on Mechanistic Interpretability in AI\.ACM Comput\. Surv\.58\(8\),pp\. 210:1–210:35\.External Links:ISSN 0360\-0300,[Link](https://dl.acm.org/doi/10.1145/3787104),[Document](https://dx.doi.org/10.1145/3787104)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1)\.
- O\. Stegle, S\. A\. Teichmann, and J\. C\. Marioni \(2015\)Computational and analytical challenges in single\-cell transcriptomics\.Nature Reviews Genetics16\(3\),pp\. 133–145\(en\)\.External Links:ISSN 1471\-0064,[Link](https://www.nature.com/articles/nrg3833),[Document](https://dx.doi.org/10.1038/nrg3833)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p4.1)\.
- M\. Sundararajan and A\. Najmi \(2020\)The Many Shapley Values for Model Explanation\.InProceedings of the 37th International Conference on Machine Learning,pp\. 9269–9278\(en\)\.External Links:ISSN 2640\-3498,[Link](https://proceedings.mlr.press/v119/sundararajan20b.html)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
- A\. Templeton, T\. Conerly, J\. Marcus, J\. Lindsey, T\. Bricken, and B\. Chen \(2024\)Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet\.Transformer Circuits Thread\.External Links:[Link](https://transformer-circuits.pub/2024/scaling-monosemanticity/)Cited by:[§1\.2](https://arxiv.org/html/2606.31394#S1.SS2.p1.1),[§2\.2](https://arxiv.org/html/2606.31394#S2.SS2.p1.4)\.
- J\.A\. Tropp \(2004\)Greed is good: algorithmic results for sparse approximation\.IEEE Transactions on Information Theory50\(10\),pp\. 2231–2242\.External Links:ISSN 1557\-9654,[Link](https://ieeexplore.ieee.org/abstract/document/1337101),[Document](https://dx.doi.org/10.1109/TIT.2004.834793)Cited by:[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14),[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p3.3)\.
- S\. C\. van den Brink, F\. Sage, Á\. Vértesy, B\. Spanjaard, J\. Peterson\-Maduro, C\. S\. Baron, C\. Robin, and A\. van Oudenaarden \(2017\)Single\-cell sequencing reveals dissociation\-induced gene expression in tissue subpopulations\.Nature Methods14\(10\),pp\. 935–936\(en\)\.External Links:ISSN 1548\-7105,[Link](https://www.nature.com/articles/nmeth.4437),[Document](https://dx.doi.org/10.1038/nmeth.4437)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p4.1)\.
- M\. Vos \(2022\)Mitochondrial Complex I deficiency: guilty in Parkinson’s disease\.Signal Transduction and Targeted Therapy7\(1\),pp\. 136\(en\)\.External Links:ISSN 2059\-3635,[Link](https://www.nature.com/articles/s41392-022-00983-3),[Document](https://dx.doi.org/10.1038/s41392-022-00983-3)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p3.1)\.
- S\. Waldron \(2003\)Generalized Welch bound equality sequences are tight frames\.IEEE Transactions on Information Theory49\(9\),pp\. 2307–2309\.External Links:ISSN 1557\-9654,[Link](https://ieeexplore.ieee.org/document/1226620),[Document](https://dx.doi.org/10.1109/TIT.2003.815788)Cited by:[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14)\.
- G\. P\. Way, T\. Natoli, A\. Adeboye, L\. Litichevskiy, A\. Yang, X\. Lu, J\. C\. Caicedo, B\. A\. Cimini, K\. Karhohs, D\. J\. Logan, M\. H\. Rohban, M\. Kost\-Alimova, K\. Hartland, M\. Bornholdt, S\. N\. Chandrasekaran, M\. Haghighi, E\. Weisbart, S\. Singh, A\. Subramanian, and A\. E\. Carpenter \(2022\)Morphology and gene expression profiling provide complementary information for mapping cell state\.Cell Systems13\(11\),pp\. 911–923\.e9\(English\)\.External Links:ISSN 2405\-4712, 2405\-4720,[Link](https://www.cell.com/cell-systems/abstract/S2405-4712(22)00402-1),[Document](https://dx.doi.org/10.1016/j.cels.2022.10.001)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4)\.
- L\. Welch \(1974\)Lower bounds on the maximum cross correlation of signals \(Corresp\.\)\.IEEE Transactions on Information Theory20\(3\),pp\. 397–399\.External Links:ISSN 1557\-9654,[Link](https://ieeexplore.ieee.org/abstract/document/1055219),[Document](https://dx.doi.org/10.1109/TIT.1974.1055219)Cited by:[§4\.3](https://arxiv.org/html/2606.31394#S4.SS3.SSS0.Px1.p1.14)\.
- F\. A\. Wolf, F\. K\. Hamey, M\. Plass, J\. Solana, J\. S\. Dahlin, B\. Göttgens, N\. Rajewsky, L\. Simon, and F\. J\. Theis \(2019\)PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells\.Genome Biology20\(1\),pp\. 59\(en\)\.External Links:ISSN 1474\-760X,[Link](https://doi.org/10.1186/s13059-019-1663-x),[Document](https://dx.doi.org/10.1186/s13059-019-1663-x)Cited by:[§2\.3](https://arxiv.org/html/2606.31394#S2.SS3.p1.1)\.
- S\. L\. Wolock, R\. Lopez, and A\. M\. Klein \(2019\)Scrublet: Computational Identification of Cell Doublets in Single\-Cell Transcriptomic Data\.Cell Systems8\(4\),pp\. 281–291\.e9\(eng\)\.External Links:ISSN 2405\-4720,[Document](https://dx.doi.org/10.1016/j.cels.2018.11.005)Cited by:[§5](https://arxiv.org/html/2606.31394#S5.p4.1)\.
- R\. Xie, K\. Pang, S\. Chung, C\. Perciani, S\. MacParland, B\. Wang, and G\. Bader \(2023\)Spatially Resolved Gene Expression Prediction from Histology Images via Bi\-modal Contrastive Learning\.Advances in Neural Information Processing Systems36,pp\. 70626–70637\(en\)\.External Links:[Link](https://proceedings.neurips.cc/paper_files/paper/2023/hash/df656d6ed77b565e8dcdfbf568aead0a-Abstract-Conference.html)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4)\.
- S\. Xue, F\. Zhu, J\. Chen, and W\. Min \(2025\)Inferring single\-cell resolution spatial gene expression via fusing spot\-based spatial transcriptomics, location, and histology using GCN\.Briefings in Bioinformatics26\(1\),pp\. bbae630\.External Links:ISSN 1477\-4054,[Link](https://doi.org/10.1093/bib/bbae630),[Document](https://dx.doi.org/10.1093/bib/bbae630)Cited by:[Appendix D](https://arxiv.org/html/2606.31394#A4.p1.4)\.
- S\. Yang, S\. E\. Corbett, Y\. Koga, Z\. Wang, W\. E\. Johnson, M\. Yajima, and J\. D\. Campbell \(2020\)Decontamination of ambient RNA in single\-cell RNA\-seq with DecontX\.Genome Biology21\(1\),pp\. 57\(en\)\.External Links:ISSN 1474\-760X,[Link](https://doi.org/10.1186/s13059-020-1950-6),[Document](https://dx.doi.org/10.1186/s13059-020-1950-6)Cited by:[§E\.8](https://arxiv.org/html/2606.31394#A5.SS8.SSS0.Px1.p1.1)\.
- Z\. Zhou, M\. Hu, M\. Salcedo, N\. Gravel, W\. Yeung, A\. Venkat, D\. Guo, J\. Zhang, N\. Kannan, and S\. Li \(2023\)XAI meets Biology: A Comprehensive Review of Explainable AI in Bioinformatics Applications\.arXiv\.Note:arXiv:2312\.06082 \[cs\]External Links:[Link](http://arxiv.org/abs/2312.06082),[Document](https://dx.doi.org/10.48550/arXiv.2312.06082)Cited by:[§1\.1](https://arxiv.org/html/2606.31394#S1.SS1.p1.1)\.
## Appendix ACell Density Variation Normalization via GAP\-L2 Scaling
### A\.1Theoretical Analyis
As established in previous studiesGutet al\.\[[2015](https://arxiv.org/html/2606.31394#bib.bib42)\], Heet al\.\[[2020a](https://arxiv.org/html/2606.31394#bib.bib172)\], variations in cell density can bias model representations\. We denote the feature vector as𝐯=\(v1,v2,…,vn\)∈ℝn\\mathbf\{v\}=\(v\_\{1\},v\_\{2\},\\dots,v\_\{n\}\)\\in\\mathbb\{R\}^\{n\}, where each dimensionviv\_\{i\}is influenced by several factors: cell density \(dd\), the biological state of cells \(bb\), and the stochasticity of the training process\. Since the weight matrix is normalized by its per\-output\-channelL2L\_\{2\}norm \(see Appendix C\. for detailed methods\), the influence of training stochasticity is significantly mitigated\. Consequently,viv\_\{i\}can be expressed as a function of the remaining dominant factors:
vi≔fi\(d,b\)v\_\{i\}\\coloneqq f\_\{i\}\(d,b\)\(3\)
To understand the role of cell density, consider a scenario wherebbis held constant whileddvaries\. It is plausible to think this scienario because the microscopic imaging process is independent of the state of the cell state\. Given the local receptive fields of CNNsLecunet al\.\[[1998](https://arxiv.org/html/2606.31394#bib.bib90)\], Olahet al\.\[[2020](https://arxiv.org/html/2606.31394#bib.bib111)\], Burgertet al\.\[[2025](https://arxiv.org/html/2606.31394#bib.bib149)\], Brendel and Bethge \[[2019](https://arxiv.org/html/2606.31394#bib.bib151)\], Dosovitskiyet al\.\[[2021](https://arxiv.org/html/2606.31394#bib.bib147)\], Raghuet al\.\[[2021](https://arxiv.org/html/2606.31394#bib.bib148)\], the feature extraction process can be approximated as a homogeneous function with respect toddbecause GAP is linear\. Specifically, we assume:
fi\(nd,b\)≈nfi\(d,b\)for anyn∈ℝ\+and\(d,b\)∈𝒟f\_\{i\}\(nd,b\)\\approx nf\_\{i\}\(d,b\)\\quad\\text\{for any \}n\\in\\mathbb\{R\}^\{\+\}\\text\{ and \}\(d,b\)\\in\\mathcal\{D\}\(4\)
This linear scaling property allows us to decompose the representation asvi≈d⋅f^i\(b\)v\_\{i\}\\approx d\\cdot\\hat\{f\}\_\{i\}\(b\)\. Under this assumption, theL2L\_\{2\}normalization of the feature vector𝐯\\mathbf\{v\}yields:
𝐯‖𝐯‖2≈d⋅𝐟^\(b\)∑i=1nd2f^i2\(b\)=𝐟^\(b\)∑i=1nf^i2\(b\)\\frac\{\\mathbf\{v\}\}\{\\\|\\mathbf\{v\}\\\|\_\{2\}\}\\approx\\frac\{d\\cdot\\mathbf\{\\hat\{f\}\}\(b\)\}\{\\sqrt\{\\sum\_\{i=1\}^\{n\}d^\{2\}\\hat\{f\}\_\{i\}^\{2\}\(b\)\}\}=\\frac\{\\mathbf\{\\hat\{f\}\}\(b\)\}\{\\sqrt\{\\sum\_\{i=1\}^\{n\}\\hat\{f\}\_\{i\}^\{2\}\(b\)\}\}\(5\)
As shown, the termddis canceled out, rendering the final representation invariant to cell density\. This ensures that the model captures the intrinsic biological statebbwithout being confounded by density variations\.
### A\.2Emperical Validation
As theoretically predicted, ifL2L\_\{2\}normalization technically conditions out the influence of cell density variations, the representation will reflect the underlying semantics more accurately\. To validate this, we trained ridge regression and XGBoost on CNN representations\. For CNN which represetntaion isL2L\_\{2\}normalized before the projector, normalization substantially improvedR2R^\{2\}values across all mutation lines\. \(Fig\. 9 \(a\)\)
Even in models trained without normalization \(trained with three different random seeds\), post hocL2L\_\{2\}transformation consistently enhanced prediction accuracy \(R2R^\{2\}\), with statistically significant improvements observed for SNCA ×3 XGBoost, LRRK2 ridge regression, and LRRK2 XGBoost models \(Fig\. 8 \(b\)\)\. Overall representational geometry was likwse preserved measured by CKA similarity between normalized and non\-normalized models \(Fig\. 9\)\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x8.png)
Figure 8:\(a\)Effect ofL2L\_\{2\}normalization on feature vectors derivend from CNN which feature vectors wereL2L\_\{2\}normalized during training\. showing It shows sensitivity to image cell\-density variation, as quantifiedR2R^\{2\}for cell death rate prediction using Ridge regression and XGBoost\.\(b\)Effect ofL2L\_\{2\}normalization on feature vectors derivend from CNN which feature vectors were notL2L\_\{2\}normalized during training\. SNCA ×3 XGBoost, LRRK2 ridge regression, and LRRK2 XGBoost models were statistically significance\. Statistical significance was assessed using a two\-sided Wilcoxon signed\-rank test, with each cross\-validation fold treated as an independent observation \(\*p < 0\.05, \*\*p < 0\.01, \*\*\*p < 0\.001\)
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x9.png)
Figure 9:CKA analysis comparing models trained with and without L2 normalization on feature vectors\. CKA analysis reveals that the model similarity remains considerably consistent regardless of whether GAP and L2 normalization are applied to the feature vectors before projector during training\. This consistency suggests that L2 norm does not reflect the biological state\. CKA was calculated 2500 images per class
Collectively, these results demonstrate that normalization effectively suppresses spurious density\-dependent variation and enhances biologically meaningful latent representations\. Consequently, all subsequent analyses were performed usingL2L\_\{2\}normalized CNN feature vectors\.
## Appendix BTheoretical Analysis of Geometric Interference Due to Superposition
### B\.1Strategic Superposition under dimensional bottleneck
In modern neural architectures, it is plausible to assume that the number of conceptsnnin real world typically exceeds the model’s hidden dimensiondd\(n\>\>dn\>\>d\)\. Recent researchElhageet al\.\[[2022](https://arxiv.org/html/2606.31394#bib.bib189)\]shows that this dimensional bottleneck forces the model to employ superposition\.
Definitions:Letvi∈ℝnv\_\{i\}\\in\\mathbb\{R\}^\{n\}be input features\. The encoding process into the hidden spaceℝd\\mathbb\{R\}^\{d\}is given byσ\(Wvi\+b\)\\sigma\(Wv\_\{i\}\+b\), whereW∈ℝd×nW\\in\\mathbb\{R\}^\{d\\times n\}\(d<nd<n\) andσ\(⋅\)\\sigma\(\\cdot\)denotes the ReLU activation function\. By introducing a diagonal matrixDiD\_\{i\}representing the activation state of ReLU forviv\_\{i\}, the resulting representationhi∈ℝdh\_\{i\}\\in\\mathbb\{R\}^\{d\}is:
hi=σ\(Wvi\+b\)=Di\(Wvi\+b\)h\_\{i\}=\\sigma\(Wv\_\{i\}\+b\)=D\_\{i\}\(Wv\_\{i\}\+b\)\(6\)
\(Di\)kk=\{1if\(Wvi\+b\)k\>00otherwise\(D\_\{i\}\)\_\{kk\}=\\begin\{cases\}1&\\text\{if \}\(Wv\_\{i\}\+b\)\_\{k\}\>0\\\\ 0&\\text\{otherwise\}\\end\{cases\}\(7\)
Derivation:Sincerank\(W\)≤d<n\\text\{rank\}\(W\)\\leq d<n, the model cannot form an identity mapping for allnninput concepts\. Following recent studiesElhageet al\.\[[2022](https://arxiv.org/html/2606.31394#bib.bib189)\], as ReLU and bias gating can cancel out trivial noise, the model opts for superposition rather than concepts discarding\. Therefore, this rank deficiency ensures that any readout or distance calculation inℝd\\mathbb\{R\}^\{d\}is fundamentally contaminated by the overlapping projections of non\-orthogonal concepts\.
The inner product between two representationshi,hj∈ℝdh\_\{i\},h\_\{j\}\\in\\mathbb\{R\}^\{d\}inherits this structural interference:
⟨hi,hj⟩=viT\(WTDiDjW\)⏟Φijvj\+𝒞ij\(W,D,b\)\\langle h\_\{i\},h\_\{j\}\\rangle=v\_\{i\}^\{T\}\\underbrace\{\(W^\{T\}D\_\{i\}D\_\{j\}W\)\}\_\{\\Phi\_\{ij\}\}v\_\{j\}\+\\mathcal\{C\}\_\{ij\}\(W,D,b\)\(8\)whereΦij∈ℝn×n\\Phi\_\{ij\}\\in\\mathbb\{R\}^\{n\\times n\}is the effective projection matrix and𝒞ij\\mathcal\{C\}\_\{ij\}aggregates bias\-related noise terms \(bTDiDjWvj\+viTWTDiDjb\+bTDiDjbb^\{T\}D\_\{i\}D\_\{j\}Wv\_\{j\}\+v\_\{i\}^\{T\}W^\{T\}D\_\{i\}D\_\{j\}b\+b^\{T\}D\_\{i\}D\_\{j\}b\)\.
Due to the bottleneck,rank\(Φij\)≤d<n\\text\{rank\}\(\\Phi\_\{ij\}\)\\leq d<n, forcesΦij\\Phi\_\{ij\}into the formIn\+EijI\_\{n\}\+E\_\{ij\}whereEijE\_\{ij\}≠0\\neq 0represents the structural error matrix\. Consequently, the inner product in the latent space is coupled with both rank\-induced interference and bias noise:
⟨hi,hj⟩=⟨vi,vj⟩\+viTEijvj⏟Rank\-induced Interference\+𝒞ij\(W,b,D\)⏟Bias Noise\\langle h\_\{i\},h\_\{j\}\\rangle=\\langle v\_\{i\},v\_\{j\}\\rangle\+\\underbrace\{v\_\{i\}^\{T\}E\_\{ij\}v\_\{j\}\}\_\{\\text\{Rank\-induced Interference\}\}\+\\underbrace\{\\mathcal\{C\}\_\{ij\}\(W,b,D\)\}\_\{\\text\{Bias Noise\}\}\(9\)
The squared Euclidean distance between these latent representationshi,hj∈ℝdh\_\{i\},h\_\{j\}\\in\\mathbb\{R\}^\{d\}is:
‖hi−hj‖22=‖hi‖22\+‖hj‖22−2⟨hi,hj⟩\\\|h\_\{i\}\-h\_\{j\}\\\|\_\{2\}^\{2\}=\\\|h\_\{i\}\\\|\_\{2\}^\{2\}\+\\\|h\_\{j\}\\\|\_\{2\}^\{2\}\-2\\langle h\_\{i\},h\_\{j\}\\rangle\(10\)Consequently, this confirms that the metric itself, computed directly on thedd\-dimensional bottleneck, is a function of the superposition noiseEijE\_\{ij\}\.
### B\.2LpL\_\{p\}Norm Generalization via Local Norm Equivalence
As discussed in section 3\.4\.1, superposition contaminates the Euclidean distances\. The geometric contamination induced by the dimensional bottleneck is a metric\-invariant flaw that propagates across the representation space\. Within the local tangent spaceTpℳT\_\{p\}\\mathcal\{M\}at any given semantic contextpp, the neighborhood behaves as a local linear vector space\. Consequently, by the equivalence of norms \(C1‖h‖2≤‖h‖p≤C2‖h‖2C\_\{1\}\\\|h\\\|\_\{2\}\\leq\\\|h\\\|\_\{p\}\\leq C\_\{2\}\\\|h\\\|\_\{2\}\), the local geometric packing stress governed by the Generalized Welch Bound inherently permeates all localLpL\_\{p\}spaces \(p≥1p\\geq 1\)\. Substituting the coupled local inner product expansion \(Eq\. 10\) into theL2L\_\{2\}distance metric yields:
‖hi−hj‖2=‖hi‖22\+‖hj‖22−2\(Sij\+Gij\)\\\|h\_\{i\}\-h\_\{j\}\\\|\_\{2\}=\\sqrt\{\\\|h\_\{i\}\\\|\_\{2\}^\{2\}\+\\\|h\_\{j\}\\\|\_\{2\}^\{2\}\-2\(S\_\{ij\}\+G\_\{ij\}\)\}\(11\)
whereGij=∑k≠lvi,kvj,l\(𝐄p\)k,l\(d\)G\_\{ij\}=\\sum\_\{k\\neq l\}v\_\{i,k\}v\_\{j,l\}\(\\mathbf\{E\}\_\{p\}\)\_\{k,l\}\(d\)represents the coupled geometric contamination andSij=∑k=1mvi,kvj,k\(𝐆p\)k,k\+∑k≠lvi,kvj,lS\_\{ij\}=\\sum\_\{k=1\}^\{m\}v\_\{i,k\}v\_\{j,k\}\(\\mathbf\{G\}\_\{p\}\)\_\{k,k\}\+\\sum\_\{k\\neq l\}v\_\{i,k\}v\_\{j,l\}represents the pure local semantic Component as discussed in section 3\.4\.1\. The dual nature of this local packing error𝐄p\(d\)\\mathbf\{E\}\_\{p\}\(d\)establishes two fundamental boundaries that corrupt the metric space\.
#### The Local Resolution Ceiling \(Upper Bound on Separation\)
For semantic instances composed of locally disjoint ground\-truth concepts withinTpℳT\_\{p\}\\mathcal\{M\}\(i\.e\.,Sij=0S\_\{ij\}=0\), the latent distance should ideally reach its unconstrained orthogonal maximum \(hi⟂hjh\_\{i\}\\perp h\_\{j\}\)\. However, because the Generalized Welch Bound guarantees a strict positive distortion energy \(‖𝐄p\(d\)‖F2\>0\\\|\\mathbf\{E\}\_\{p\}\(d\)\\\|\_\{F\}^\{2\}\>0\) due to the bottleneckm\>dm\>d, severe structural collisions \(\(𝐄p\)k,l\(d\)\>0\(\\mathbf\{E\}\_\{p\}\)\_\{k,l\}\(d\)\>0\) become mathematically unavoidable\. For instances activating these colliding concept pairs, the positive local interferenceGij\>0G\_\{ij\}\>0forces an artificial proximity, capping the achievable separation even under otherLpL\_\{p\}metrics:
‖hi−hj‖p≤C2‖hi‖22\+‖hj‖22−2⋅Collision\(𝐄p\)<C2‖hi‖22\+‖hj‖22\\\|h\_\{i\}\-h\_\{j\}\\\|\_\{p\}\\leq C\_\{2\}\\sqrt\{\\\|h\_\{i\}\\\|\_\{2\}^\{2\}\+\\\|h\_\{j\}\\\|\_\{2\}^\{2\}\-2\\cdot\\text\{Collision\}\(\\mathbf\{E\}\_\{p\}\)\}<C\_\{2\}\\sqrt\{\\\|h\_\{i\}\\\|\_\{2\}^\{2\}\+\\\|h\_\{j\}\\\|\_\{2\}^\{2\}\}\(12\)This ceiling confirms that the compressed representation space fails to provide sufficient white space for true semantic orthogonality, artificially pulling unrelated instances together and truncating contrastive resolution\.
#### The Geometric Noise Floor \(Lower Bound on Proximity\)
Conversely, minimizing contrastive loss under the dimensional bottleneck forces the local dictionary𝐖\\mathbf\{W\}into a tightly packed, generalized ETF\-like configuration, triggering ubiquitous background repulsion \(\(𝐄p\)k,l\(d\)<0\(\\mathbf\{E\}\_\{p\}\)\_\{k,l\}\(d\)<0\) across non\-correlated concepts to optimize the packing density\. Because each instancehih\_\{i\}is a linear combination of multiple distributed concepts, the probability that all cross\-terms inGijG\_\{ij\}vanish simultaneously is asymptotically zero\. This persistent negative contamination \(Gij<0G\_\{ij\}<0\) creates an inescapable geometric noise floor that artificially inflates proximity, preventing identical semantic states from achieving perfect metric convergence:
‖hi−hj‖p≥C1‖hi‖22\+‖hj‖22−2\(Sij−\|Repulsion\|\)\>0\\\|h\_\{i\}\-h\_\{j\}\\\|\_\{p\}\\geq C\_\{1\}\\sqrt\{\\\|h\_\{i\}\\\|\_\{2\}^\{2\}\+\\\|h\_\{j\}\\\|\_\{2\}^\{2\}\-2\(S\_\{ij\}\-\|\\text\{Repulsion\}\|\)\}\>0\(13\)Consequently, the localLpL\_\{p\}metric space remains permanently distorted, failing to preserve the true unconstrained isometry of the underlying semantic manifold\.
### B\.3Conclusion: Metric Entrapment
Consequently, anyLpL\_\{p\}measurement is trapped between a geometric noise floor that obscures fine\-grained similarities and a resolution ceiling that prevents true semantic separation\. The latent distance does not measure pure semantic proximity; rather, it reflects the equilibrium between structural collisions and heterogeneous repulsive packing\. This metric entrapment mathematically necessitates the use of sparse autoencoders \(SAEs\) to expand the dimensionality toM≥mM\\geq m, thereby ecoupling superposed features \(wkTwl→0w\_\{k\}^\{T\}w\_\{l\}\\to 0asymptotically\), nullifyingGijG\_\{ij\}, and fully restoring the integrity of semantic distances\.
## Appendix CThoretical Analysis of Metric Restoration SAE Sparsity
### C\.1Mechanism of Noise Nullification:
SAE resolve the rank deficiency by projecting bottlenecked representations into an expanded latent spaceℝM\\mathbb\{R\}^\{M\}\(M≫dM\\gg d\)\. This expansion, coupled with extreme sparsity, restores orthogonality by providing sufficient white space to decouple superposed concepts\. The interference terms derived in Section 1, governed byDiDjD\_\{i\}D\_\{j\}, are theoretically nullified: lettingp=K/Mp=K/Mbe the activation probability \(K≪MK\\ll M\), the expected overlap evaluates to:
∑k=1M𝔼\[\(Di\)kk\(Dj\)kk\]=M⋅p2=K2M\\sum\_\{k=1\}^\{M\}\\mathbb\{E\}\[\(D\_\{i\}\)\_\{kk\}\(D\_\{j\}\)\_\{kk\}\]=M\\cdot p^\{2\}=\\frac\{K^\{2\}\}\{M\}\(14\)AsM→∞M\\to\\inftyandp→0p\\to 0, the overlapK2/MK^\{2\}/Mvanishes\. This ensuresDiDj≈𝟎D\_\{i\}D\_\{j\}\\approx\\mathbf\{0\}, which effectively orthogonalizes the concept dictionary \(wkTwl→0w\_\{k\}^\{T\}w\_\{l\}\\to 0\)\. Consequently, the Geometric Contamination \(GijG\_\{ij\}\) from Eq\. \(10\) is nullified, eliminating both rank\-induced interference and structural noise\.
### C\.2Geometric Flattening andLpL\_\{p\}Integrity:
Sparsity induces geometric flattening by isolating the active latent concept set within linear, low\-dimensional subspaces\. In this decompressed spaceℝM\\mathbb\{R\}^\{M\}, the expanded dimensionMMallows themmground\-truth concepts superposed within the bottleneck \(d<m<Md<m<M\) to transition from a crushed Softmax Code\-like state to near\-orthogonality\. This expansion breaks the Metric Entrapment shown in Appendix B; by satisfying M\>m, the resolution ceiling is lifted, and the geometric noise floor is cleared\.
Lettingzi∈ℝMz\_\{i\}\\in\\mathbb\{R\}^\{M\}be the sparse representation, anyLpL\_\{p\}metric \(p≥1p\\geq 1\) now faithfully reflects the pure semantic distance of the ground\-truth conceptsvv:
‖zi−zj‖p≈\(∑k∈active\|vi,k−vj,k\|p\)1/p\\\|z\_\{i\}\-z\_\{j\}\\\|\_\{p\}\\approx\\left\(\\sum\_\{k\\in\\text\{active\}\}\|v\_\{i,k\}\-v\_\{j,k\}\|^\{p\}\\right\)^\{1/p\}\(15\)This structural restoration ensures that latent distances measure the singular presence of semantic units, rather than the equilibrium of forced inter\-feature competition\.
### C\.3Conclusion
In the dimensionality bottleneck \(d<md<m\) regime, generalized welch bound ensure a non\-zero interference Consequently, latent distances inℝd\\mathbb\{R\}^\{d\}remain trapped between a geometric noise floor and a resolution ceiling, failing to reliably reflect pure semantic proximity\. This implies that such a vulnerability could be inherent to many representation learning paradigms, extending to modern architectures like CNNs, ViTs and transformers\. This intrinsic metric corruption indicates that optimization alone cannot restore feature independence; instead, a transition to high\-dimensional sparse representations via SAEs is required to nullify interference\. Advanced architectures like Gated SAE further optimize this restoration by decoupling feature detection from magnitude estimation, thereby eliminatingL1L\_\{1\}shrinkage bias and non\-linear shifts\. Consequently, SAEs restore the metric integrity and local linearity required for reliable manifold learning, clustering, and graph\-based semantic analysis\.
## Appendix DBiological Plausiblity of GW\-Map
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x10.png)
Figure 10:\(a\)\. Expression heatmap showing Z\-scores \(standard log\-transformed and denoised\) forSNCA×3SNCA\\times 3and Control intra\-class coupling across the top and bottom 25 genes, which were selected based onSNCA×3SNCA\\times 3intra\-class coupling Z\-scores of standard log\-normalized HVGs \(computed against permutation\-based null distributions\)\.\(b\)\. Sankey diagram tracking the hierarchical emergence of functional gene modules among the top 200 genes across increasing Leiden resolutions \(0\.08, 0\.25, 0\.4\)\. The genes was selected based onSNCA×3SNCA\\times 3intra\-class coupling Z\-scores of standard log\-normalized HVGs\. Subsequent analyses were also conducted using these selected genes\(c\)\. Global transcriptional module discovery\. All valid genes were Leiden\-clustered \(resolutions=0\.002, 0\.2, 0\.3\) and functionally annotated via over\-representation analysis \(ORA; GO, KEGG, MSigDB Hallmark; FDR < 0\.05\)\.\(d\)Force\-directed gene interaction network \(Fruchterman\-Reingold\) resolving the hierarchical bifurcation of a higher\-order transcriptional module\. Nodes are colored by their fine\-grained sub\-module identities derived from top\-ranked significant ORA hits\. Node size is proportional to graph degree\. Genes lacking significant enrichment for either sub\-module are depicted as peripheral nodes\. <Note\> For \(b–d\), Graph adjacency matrices constructed via k\-NN on dimension\-wise rank\-normalized coordinates bounded to \[0,1\] to enhance k\-NN stability\. \(k=5 for \(b\); k=4 for \(c\), \(d\)\)\. Genes exhibiting \>99\.5% sparsity \(zero counts\) were strictly excluded\. Comprehensive cluster gene lists are provided in the https://github\.com/jijihihi/Bio\_superposition
We further conducted subsequent analysis to verify the biological faithfulness of coupling\. Previous studiesHeet al\.\[[2020a](https://arxiv.org/html/2606.31394#bib.bib172)\], Xieet al\.\[[2023](https://arxiv.org/html/2606.31394#bib.bib174)\], Xueet al\.\[[2025](https://arxiv.org/html/2606.31394#bib.bib175)\], Wayet al\.\[[2022](https://arxiv.org/html/2606.31394#bib.bib173)\]suggest that deep learning\-derived phenotypical representations encode underlying molecular information\. Following the same procedure described in Section 6 \(Figure 7\), we selected the top/botton 25 genes based onSNCA×3SNCA\\times 3intra\-class coupling Z\-scores of standard log\-normalized HVGs \(Fig\. 10 \(a\)\)\. This reveals that genes with highZSNCA×3Z\_\{SNCA\\times 3\}scores include ‘axon guidance’ cluster \(e\.g\., EFNA5, DAB1, CDH10, CDH7\)Dijkstraet al\.\[[2015](https://arxiv.org/html/2606.31394#bib.bib277)\]as well as the ‘synaptic transmission and ion channel activity’Schirinziet al\.\[[2016](https://arxiv.org/html/2606.31394#bib.bib278)\]\(e\.g\., GRM5, KCNQ5, NKAIN3\)\. These pathways are associated withα\\alphasynuclein pathological progression\. Conversely, genes withZSNCA×3Z\_\{SNCA\\times 3\}approaching near zero \(e\.g\., S100A6, S100A11, PRSS56, FNDC1, TM4SF4\) are generally non\-specific markers of tissue damage or inflammation with lower specific pathological relevance\.
Based on the coordinates of genes on theZcontrolZ\_\{control\}andZSNCA×3Z\_\{SNCA\\times 3\}in the section 6 Figure 7 \(d\), increasing the Leiden clustering resolution \(from res=0\.08 to 1\.0\) induced a hierarchical refinement, transitioning from a broad ’Synaptic Function’ ontologySchirinziet al\.\[[2016](https://arxiv.org/html/2606.31394#bib.bib278)\]into specialized regulatory sub\-modules \(Fig\. 10 \(b\)\)\. At the finest granularity, this approach unveiled specific pathological vulnerabilities: the segregation of ’TrkB Survival’Kanget al\.\[[2017](https://arxiv.org/html/2606.31394#bib.bib279)\]and ‘Axonal Transport’Chuet al\.\[[2012](https://arxiv.org/html/2606.31394#bib.bib280)\]anchored by key effectors such as DYNC1I1 and MUTS2 and the isolation of the ANK3\-centric ’Calcium\-AIS Scaffold’Leterrier \[[2018](https://arxiv.org/html/2606.31394#bib.bib281)\], confirming that our image\-based prediction scores recognize and align distinct biological states in 2D space\.
To validate these findings across the entire transcriptome, we performed Gene Set Over\-representation Analysis \(ORA\) across increasing resolutions \(Fig\. 10 \(c\)\)\. GW\-map precisely deconstructed the ’Ribosome\-Translation Core’ into its constituent regulatory axes\. Notably, increasing resolution from res=0\.2 to 0\.3 induced a systematic bifurcation of ’Myc\-Regulated Ribosome Biogenesis’Morcelleet al\.\[[2019](https://arxiv.org/html/2606.31394#bib.bib282)\]into a specialized ’Myc\-p53 Regulated Translation’ axis and a discrete ’Cation Homeostasis’ program\. This specification directly aligns with the established coupling between nucleolar stress and p53\-mediated checkpointsLohrumet al\.\[[2003](https://arxiv.org/html/2606.31394#bib.bib284)\], while effectively isolating the concurrent collapse in ionic buffering and Oxidative PhosphorylationVos \[[2022](https://arxiv.org/html/2606.31394#bib.bib286)\], Pokotyloet al\.\[[2025](https://arxiv.org/html/2606.31394#bib.bib285)\]\. Furthermore, force\-directed layouts confirmed that functionally synergetic genes form dense, autonomous interaction hubs such as RPL/RPS77,80, KCNN273, and ANK378 \(Fig\. 10 \(d\)\)\. This granular alignment with canonical pathways confirms the robust fidelity of GW\-map in coupling the image and scRNA\-seq data\.
Collectively, these findings demonstrate that CNN\-SAE captures essential genetic information from cellular images\. Furthermore, the autonomous reconstruction of the hierarchical architecture in SNCA ×3 mutation\-associated neuronal pathology confirms the high fidelity of GW\-map in coupling neuronal images with organoid transcriptomic data, thereby validating the biological integrity of the GW\-coupling framework and supporting the notion that disease\-specific information is intrinsically encoded across heterogeneous data modalities\.
## Appendix EDetailed Method
### E\.1Data Preparation and Cell Death Quantification
#### Generation of Human iPSC\-derived Cortical Neurons and High\-content Image Acquisition
Figure 11 illustrates the data acquisition process\. Human iPSC\-derived cortical neurons from three healthy controls and Parkinson’s disease\-associated mutation lines are used in this study\. Cell lines used are listed in Supplementary Table 1\. hiPSCs were maintained on Geltrex\-coated plates in E8 or mTeSR\. Cortical differentiation was induced through dual SMAD inhibition and neuronal cultures were maintained as previously described by Choi et alD’Saet al\.\[[2023](https://arxiv.org/html/2606.31394#bib.bib253)\]\. All cell lines were routinely tested for mycoplasma contamination\.
Cortical neuronal identity was verified by immunocytochemistry using antibodies against MAP2 \(Abcam, \# ab32454\) and SATB2 \(Santa Cruz Biotechnology, \# SC\-81376\)\. Immunostaining was performed following previously described procedures\.
Live\-cell imaging was carried out as previously described by Choi et al\. Cortical neurons were plated onto 96\-well plates at 20,000\-40,000 cells per well and maintained until imaging\. Before imaging, cells were stained with 10μ\\muM Hoechst 33342, 25nM TMRM, 250nM LysoTracker Deep, and 500nM SYTOX green\. Images were acquired using the Opera Phenix High\-Content Screening System \(PerkinElmer\)\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x11.png)
Figure 11:Illustration of data acquisition\. Cortical neurons derived from Parkinson’s disease patients were cultured, and data were acquired using a high\-content imaging platform
#### Image Preprocessing
To fully utilize the 16\-bit dynamic range while ensuring numerical stability during training, raw pixel intensities were divided by the maximum theoretical value \(65535\) to map them into the \[0, 1\] range\. Following this scale transformation, each original1024×10241024\\times 1024image was systematically partitioned into an8×88\\times 8grid, generating non\-overlapping128×128128\\times 128cropped patches\. These cropped patches served as the localized input images for the downstream framework\. To account for microscopy artifacts, such as variations in focus, and to normalize variations in cell density across different patches, we applied instance normalization to each individual input patch\.
However, in cropped patches lacking specific organelles \(e\.g\., the absence of nuclei in the blue channel due to uneven cell distribution\), the pixel standard deviation within that channel can become exceptionally low\. Applying standard instance normalization in these scenarios leads to the disproportionate amplification of background noise\. To prevent this artifact, we implemented a clipped standard deviation approach, termed safenormalization: if the standard deviation of a given channel falls below an empirical threshold of 0\.01, instance normalization is bypassed\. This threshold was determined by systematically comparing patches with and without target organelles, thereby ensuring an optimal balance between signal preservation and noise suppression\.
#### Quantification of Cell Death Rate
To calculate the cell death rate, nuclear segmentation was initially performed on the1024×10241024\\times 1024MIP images using StarDist \(v0\.9\.1\) with the pre\-trained’2D\_versatile\_fluo’model\. The segmentation utilized the default probability threshold \(0\.479\) and a non\-maximum suppression \(NMS\) threshold of 0\.3\. To ensure data integrity, segmented nuclei with an area smaller than 40 pixels were discarded as debris\.
Subsequently, the1024×10241024\\times 1024images were cropped into128×128128\\times 128sub\-images \(patches\)\. During this process, any128×128128\\times 128cropped images containing edge\-touching nucleus relative to the original1024×10241024\\times 1024boundaries were strictly excluded\. This filtering step was necessary because nuclei located at the outermost boundaries of the original image might not be fully masked or segmented, which could introduce artifacts into the downstream cell death calculation\. Furthermore, cropped images where nuclei occupied less than 25% of the total pixels were removed to guarantee a sufficient cell density for a reliable estimation\.
The Sytox Green threshold was then established via Multi\-Otsu thresholding \(n=3n=3\) based on the pixel intensities within the remaining nuclear masks\. Among the two generated thresholds, the highest value was selected as the final criterion for Sytox Green\-positive signals\. Rather than relying on a binary cell\-count metric \(i\.e\., alive versus dead\), which obscures the continuous nature of cell degeneration, the cell death rate was quantified by calculating the ratio of the total area of Sytox Green\-positive pixels to the total nuclear pixel area within each valid patch\. This area\-based metric robustly captures the continuous, multi\-stage progression of cellular apoptosis and necrosis\.
### E\.2CNN Framework with Contrastive Learning
#### Network Architecture
To ensure spatial interpretability, we employ a CNN\-based architecture rather than a Vision Transformer \(ViT\)\. This choice is motivated by the CNN’s local receptive field , which allows for a direct mapping between feature activations and specific image regions\. We designed a custom CNN architecture to maintain the feature maps at a high spatial resolution of64×6464\\times 64\(H×WH\\times W\) for better interpretability\. This architectural choice ensures that the upsampled activation maps via bilinear interpolation provide fine grained and precise localization of the identified features\. The model consists of a stem layer followed by four stages and a refine stage\. The stem layer utilizes a convolutional layer with a kernel size of 3 and a stride of 2, downsampling the 128×128 input to 64×64\. The core architecture includes 9 residual blocks grouped into four stages \(2, 2, 2, 3 blocks, respectively\), with each block comprising two convolutional layers with skip connections to facilitate gradient flow\. The network starts with 64 channels in the stem layer and progressively increases the feature depth to 128, 256, and 512 channels across stages 2 through 4, while stage 5 maintains 512 channels\. A final refine stage containing one additional residual block was appended to further process the high\-level semantic features\. ReLU activation was used throughout the network\. The output feature maps from the refine stage were global average pooled\(GAP\) and then selectively L2\-normalized to adjust the cell density variation before being passed to the projector\(Supplementary Note 1\)\. The projector consists of two linear layers with a hidden dimension of 2048, mapping the 512\-dimensional input to a 512\-dimensional embedding space, which was then L2\-normalized for the contrastive loss computation\.
#### Contrastive Learning Framework via Moco
The model was trained using Supervised Contrastive Learning \(SupCon\) combined with Momentum Contrast \(MoCo\)\. For each training step, two random rotated augmented views \(v1, v2\) were generated from each image\. The query encoder \(modelqmodel\_\{q\}\) processed v1 to produce query representations, while the momentum encoder \(modelkmodel\_\{k\}\) processed v2 to produce key representations without gradient computation\. The momentum encoder was updated via exponential moving average \(EMA\) of the query encoder parameters with a momentum coefficient of m = 0\.995:
The parameters are updated as follows:
θk←m⋅θk\+\(1−m\)⋅θq\\theta\_\{k\}\\leftarrow m\\cdot\\theta\_\{k\}\+\(1\-m\)\\cdot\\theta\_\{q\}\(16\)wheremmdenotes the momentum coefficient\.
A large FIFO queue with a capacity of 65,536 entries stored historical key representations and their labels, enabling the contrastive loss to be computed against a large and diverse set of negatives beyond the current 512 mini\-batch\. At each step, the key pool consisted of the current batch keys concatenated with the queue\. Subsequent analysis was conducted using only the query encoder after training\.
#### CNN Training Strategy
Since conventional augmentation techniques, such as Gaussian blurring, can compromise subtle textures or distort local morphological features of cellular images, we restricted our augmentation to random rotations to preserve fine\-grained biological details\. To mitigate the GAP fluctuation due to the stochastic weight updates during the training, we applied the L2 normalization every step\.
For each Conv2d and Linear layer, the weight matrix is divided by its per\-output\-channelL2L\_\{2\}norm:
𝐰←𝐰/‖𝐰‖2\\mathbf\{w\}\\leftarrow\\mathbf\{w\}/\\\|\\mathbf\{w\}\\\|\_\{2\}\(17\)This mitigate the GAP fluctuation due to the stochastic weight updates during the training\.
To prevent the model from learning batch\-specific confounders such as plate effects or cell line imbalances, we employed a strictly balanced batch sampling strategy\. Each mini\-batch was constructed to contain an equal number of images from every combination of genotype class, cell line, and culture plate, ensuring that no single biological or technical covariate was over\-represented within a batch\.
We selected Stochastic Gradient Descent \(SGD\) over Adam to ensure compatibility with our per\-iteration L2 normalization strategy\. While Adam’s adaptive moment estimation adjusts individual weight scales independently, it can interfere with the constrained feature space imposed by the L2 norm\. In contrast, SGD’s consistent gradient updates allowed for more stable convergence\.
A linear warmup over 4 epochs was used, ramping the learning rate from 0 to the base learning rate of 0\.1\. After warmup, a cosine annealing schedule decayed the learning rate to the minimum value of 0\.0\. Gradient clipping with a maximum norm of 1\.0 is applied\.
The dataset was partitioned into training, validation, and testing sets with a ratio of 3:1:2, respectively\. The model was trained using 8 seeds for L2\-normalized projector inputs and 3 seeds for non\-normalized projector inputs\.
#### Loss Function
Supcon loss was computed as:
L=−1\|P\(i\)\|∑p∈P\(i\)log\[exp\(qi⋅kp/τ\)∑jexp\(qi⋅kj/τ\)\]L=\-\\frac\{1\}\{\|P\(i\)\|\}\\sum\_\{p\\in P\(i\)\}\\log\\left\[\\frac\{\\exp\(q\_\{i\}\\cdot k\_\{p\}/\\tau\)\}\{\\sum\_\{j\}\\exp\(q\_\{i\}\\cdot k\_\{j\}/\\tau\)\}\\right\]\(18\)
whereP\(i\)P\(i\)denotes the set of positive keys sharing the same genotype label as queryqiq\_\{i\},kpk\_\{p\}represents a positive key, andτ=0\.07\\tau=0\.07is the temperature parameter\. Numerical stability was ensured by subtracting the maximum logit before exponentiation\.
#### Linear Probing and Evaluation
To evaluate the representation quality of the trained CNN framework, we conducted a linear probing analysis\. Weight parameters in the CNN encoder were frozen after contrastive learning, ensuring a static weight configuration that was consistently applied to all subsequent analyses\. A bias\-free linear classifier \(ℝ512→ℝ4\\mathbb\{R\}^\{512\}\\to\\mathbb\{R\}^\{4\}\) was then attached to theL2L\_\{2\}\-normalized encoder outputs\.
The linear classifier was trained for 50 epochs on the training set without data augmentation, using SGD with a learning rate of 0\.1, a momentum of 0\.9, and no weight decay\. At the end of each training epoch, the representation quality was monitored on the validation set\. The checkpoint achieving the highest linear probe accuracy on the validation set was selected as the final model\. The classification performance of this selected model was ultimately evaluated and reported using the held\-out test set without augmentation\.
### E\.3Representational Similarity Analysis via CKA
To evaluate and compare the CNN model representations learned across different seeds, we utilize linear Centered Kernel Alignment \(CKA\) on the feature vectors of thefrefinef\_\{\\text\{refine\}\}layer at both the class and global levels\. The CKA similarity is formulated based on the Hilbert\-Schmidt Independence Criterion \(HSIC\)\.
LetK=XXTK=XX^\{T\}andL=YYTL=YY^\{T\}denote the Gram matrices\. The Linear CKA coefficient is computed as follows:
CKA\(K,L\)=HSIC\(K,L\)HSIC\(K,K\)HSIC\(L,L\)\\text\{CKA\}\(K,L\)=\\frac\{\\text\{HSIC\}\(K,L\)\}\{\\sqrt\{\\text\{HSIC\}\(K,K\)\\text\{HSIC\}\(L,L\)\}\}\(19\)where the centered HSIC is defined as:
HSIC\(K,L\)=1\(n−1\)2tr\(KHLH\)\\text\{HSIC\}\(K,L\)=\\frac\{1\}\{\(n\-1\)^\{2\}\}\\text\{tr\}\(KHLH\)\(20\)
In this expression,H=In−1n𝟏𝟏TH=I\_\{n\}\-\\frac\{1\}\{n\}\\mathbf\{11\}^\{T\}represents the centering matrix\. The resulting CKA score ranges from 0 to 1, where a value of 1 indicates that the two models have captured identical representations\.
For this analysis, 2,500 images per class were used as input, with each image represented by a 512\-dimensional feature vector\. Given that the CKA analysis revealed a remarkably high architectural similarity between representations obtained with and without theL2L\_\{2\}\-normalized projector, we exclusively utilized the GAPL2L\_\{2\}\-normalized features for all subsequent analyses\.
### E\.4Sparse Autoencoder
#### Gated Sparse Autoencoder Architecture
To resolve concept superposition, we utilize a gated sparse autoencoder\(SAE\) to decompose feature maps into monosemantic and interpretable components\. This decomposition ensures that vector distances more faithfully reflect the underlying biological states\. Unlike conventional sparse autoencoders that often suffer from shrinkage bias due toL1L\_\{1\}regularization, the gated sparse autoencoder architecture effectively decouples the activation magnitude through a gating mechanism\. The dictionary size \(dsaed\_\{sae\}\) and its corresponding sparsity\(λ\\lambda\) were evaluated under various hyperparameter configurations and detailed evaluation metrics are shown in section 4\.1
We integrated an SAE into a each standalone CNN encoder backbone\. Each spatial location in the feature map is treated as an individual 512\-dimensional vector, serving as the input token for point\-wise SAE training\. The input tokens are batch\-centered\(batch size N=64\) andL2L\_\{2\}\-normalized\. The same processing was applied to all subsequent analyses\. The gating and magnitude encoders share the same weight matrix \(Wgate=WmagW\_\{gate\}=W\_\{mag\}\) while maintaining independent norms and biases, following the standard weight\-tying configuration in SAE\. The gating biasesbgateb\_\{gate\}are initialized with a small positive constant 0\.1 to ensure every latent feature maps are sufficiently updated\.
We constrain the columns of the decoder weight matrix \(Wdec∈ℝdin×dsaeW\_\{dec\}\\in\\mathbb\{R\}^\{d\_\{in\}\\times d\_\{sae\}\}\) to have exactly unitL2L\_\{2\}norm by renormalizing them after every update to prevent scaling ambiguity:
‖𝐰i‖2=1,∀i∈\{1,…,dsae\}\\\|\\mathbf\{w\}\_\{i\}\\\|\_\{2\}=1,\\quad\\forall i\\in\\\{1,\\dots,d\_\{sae\}\\\}\(21\)
Additionally, we project out the gradient components parallel to these rows before applying the optimizer update:
∇𝐰iprojℒ=∇𝐰iℒ−\(∇𝐰iℒ⋅𝐰i\)𝐰i\\nabla\_\{\\mathbf\{w\}\_\{i\}\}^\{proj\}\\mathcal\{L\}=\\nabla\_\{\\mathbf\{w\}\_\{i\}\}\\mathcal\{L\}\-\\left\(\\nabla\_\{\\mathbf\{w\}\_\{i\}\}\\mathcal\{L\}\\cdot\\mathbf\{w\}\_\{i\}\\right\)\\mathbf\{w\}\_\{i\}\(22\)
The resulting gated activationhhis formulated as the element\-wise product of the binary gate and the magnitude vector:
h=𝟙\(gpre\>0\)⊙ReLU\(mpre\)h=\\mathds\{1\}\(g\_\{pre\}\>0\)\\odot\\text\{ReLU\}\(m\_\{pre\}\)\(23\)where𝟙\(⋅\)\\mathds\{1\}\(\\cdot\)denotes the indicator function\. The pre\-activations are computed as:
gpre=Wgatev\+bgate,mpre=Wmagv\+bmagg\_\{pre\}=W\_\{gate\}v\+b\_\{gate\},\\quad m\_\{pre\}=W\_\{mag\}v\+b\_\{mag\}Here,v∈ℝdinv\\in\\mathbb\{R\}^\{d\_\{in\}\}represents the input activation vector andWgate,Wmag∈ℝdin×dsaeW\_\{gate\},W\_\{mag\}\\in\\mathbb\{R\}^\{d\_\{in\}\\times d\_\{sae\}\}are the encoder weights for the gating and magnitude paths, respectively\.
During backpropagation, the gating mechanism is implemented using a Straight\-Through Estimator \(STE\); while the forward pass employs a binary gate based on a step function, the backward pass utilizes the sigmoid gradient as an approximation to ensure stable parameter updates\.
#### Gated Sparse Autoencoder Loss Function and Train Strategy
The SAE is trained by minimizing a loss function composed of reconstruction, sparsity, and auxiliary terms\. The total lossℒ\\mathcal\{L\}is defined as follows:
ℒ=ℒrecon\+λℒsparsity\+αℒaux\\mathcal\{L\}=\\mathcal\{L\}\_\{recon\}\+\\lambda\\mathcal\{L\}\_\{sparsity\}\+\\alpha\\mathcal\{L\}\_\{aux\}\(24\)whereλ\\lambdaandα\\alphaare hyperparameters controlling the strength of the sparsity penalty and the auxiliary task, respectively\. Sparsity warmup period was set to 100 steps\. The auxiliary loss weightα\\alphais fixed at 0\.03125\.
The model was trained for 8 epochs with a total batch size of 262,144, which was processed in 4 distinct chunks to manage memory constraints\. To prevent batch\-level data bias and ensure a balanced feature distribution, tokens were randomly permutated prior to being input into the SAE\. We employed an initial learning rate of3×10−43\\times 10^\{\-4\}with a 10% learning rate warmup fraction, and a weight decay of1×10−41\\times 10^\{\-4\}regulated by a cosine scheduler\. The model from the final epoch is selected for downstream tasks\. To prioritize regions identified as significant by the CNN, we selectively multiply the token L2 norm by the reconstruction loss for instances utilizingλ\\lambda= 3200
Reconstruction loss is computed as mean squared error\(MSE\)\.
ℒrecon=‖v−\(Wdech\+bdec\)‖22\\mathcal\{L\}\_\{recon\}=\\\|v\-\(W\_\{dec\}h\+b\_\{dec\}\)\\\|\_\{2\}^\{2\}\(25\)
Penalizing only the gating signal, the model learns sparse activation boundaries without suppressing the feature magnitudes:
ℒsparsity=∑i=1dsaeReLU\(gpre,i\)\\mathcal\{L\}\_\{sparsity\}=\\sum\_\{i=1\}^\{d\_\{sae\}\}\\text\{ReLU\}\(g\_\{pre,i\}\)\(26\)
To prevent the gating mechanism from becoming trapped in sub\-optimal dead states, an auxiliary reconstruction loss is employed\. Rather than simply using all magnitude information, we utilize a specialized computational path that trains only a subset of inactive features\. The auxiliary activations,mauxm\_\{aux\}, are formulated by masking out the currently active features and retaining only the top\-KKinstances \(withK=32K=32\) based on their magnitude:
maux=TopK\(ReLU\(mpre\)⊙𝟙\(gpre≤0\)\)m\_\{aux\}=\\text\{Top\}\_\{K\}\\Big\(\\text\{ReLU\}\(m\_\{pre\}\)\\odot\\mathds\{1\}\(g\_\{pre\}\\leq 0\)\\Big\)\(27\)The auxiliary reconstruction loss is then computed using the stop\-gradient operator \(sg\(⋅\)\\text\{sg\}\(\\cdot\)\), which detaches the decoder weights and biases during backpropagation\. This critical detachment prevents the auxiliary error gradients from inappropriately altering the main decoder weight distributions:
ℒaux=‖v−\(sg\(Wdec\)maux\+sg\(bdec\)\)‖22\\mathcal\{L\}\_\{aux\}=\\big\\\|v\-\\big\(\\text\{sg\}\(W\_\{dec\}\)m\_\{aux\}\+\\text\{sg\}\(b\_\{dec\}\)\\big\)\\big\\\|\_\{2\}^\{2\}\(28\)
#### Evaluation of Gated Sparse Autoencoder
For each input image, the pre\-trained CNN activations are passed through the SAE to obtain a sparse feature maph∈ℝH×W×dsaeh\\in\\mathbb\{R\}^\{H\\times W\\times d\_\{sae\}\}\. To eliminate the randomness of batch centering and per\-image artifacts, we perform centering at the token level within each individual image\. The extracted feature representations were subsequently cached and utilized for all downstream evaluation analyses\. We applied GAP and getdsaed\_\{sae\}\-dimensional latent feature vector\. Following the standard linear probing protocol, we train a single linear classification layer \(a linear probe\) on top of these pooled vectors while keeping the SAE and CNN weights frozen and test with a test images\.
The feature activation frequency is tracked via an EMA \(decay = 0\.99\)\. Linear probe is trained with SGD \(momentum=0\.9\) optimizer and cross entroy loss\. Because the L2 norm of CNN token is information indicating degree of focus the model assigns to specific features, we test two distinct representations: raw feature vector and its L2 norm\-scaled counterpart\. L2 norm\-scaled vector was consistently employed in the following analytical steps yielding activation maps that preserve relative spatial magnitude\. Finally, the global average\-pooled SAE latent feature vectors wereL2L\_\{2\}\-normalized\. This step is conducted to correct for the inherent reduction in Euclidean magnitude that occurs when a singular representation is disentangled into multiple sparse variables \(i\.e\.,\|a\|≈\|x\|\+\|y\|\+\|z\|\|a\|\\approx\|x\|\+\|y\|\+\|z\|, buta2≥x2\+y2\+z2a^\{2\}\\geq x^\{2\}\+y^\{2\}\+z^\{2\}\), ensuring that highly fractionated concepts do not artificially lose their discriminative influence in downstream distance metrics\.
We further quantified the Fraction of Variance Unexplained \(FVU\) and the mean inner product of the concept vector \(column of SAE decoder\)\. The FVU was computed by comparing the reconstructed outputs with the activations of the CNN to assess reconstruction fidelity\.
### E\.5Predicting the Cell Death Rate using Feature Vector
#### Feature Vector Extraction and Preprocessing
We trained Ridge regression and XGBoost models to predict per\-image cell death rates directly from CNN and SAE feature vectors\. To eliminate genotype confounding effects, regression models were trained and evaluated independently within each genotype\. Briefly, feature vectors were extracted from three layers of a single trained CNN encoder \(fstage5\_midf\_\{stage5\\\_mid\},fstage5\_outf\_\{stage5\\\_out\}, andfrefine\_outf\_\{refine\\\_out\}\) and collapsed via GAP\. For the SAE, pre\-trained CNNfstage5\_outf\_\{stage5\\\_out\}activations for each input image were fed into the SAE to yield a sparse latent feature map,h∈ℝH×W×dsaeh\\in\\mathbb\{R\}^\{H\\times W\\times d\_\{\\text\{sae\}\}\}\. We then applied GAP to obtain adsaed\_\{\\text\{sae\}\}\-dimensional feature vector, while restoring the originalL2L\_\{2\}\-norm magnitude within this latent representation\. Consequently, all downstream representation analyses were performed using global average\-pooled feature vectors\.
The extracted features were then partitioned by genotype and matched with their respective cell death rate labels\. To mitigate overfitting and ensure a fair comparison across disparate feature spaces, all feature dimensions were reduced to 250 via Principal Component Analysis \(PCA\) and standardized to zero mean and unit variance prior to downstream modeling\.
#### Model Configurations
Ridge Regression was trained as a linear baseline\. The regularization parameter was optimized via internal leave\-one\-out cross\-validation, performing a grid search over 50 candidate values spaced evenly on a log scale from10−310^\{\-3\}to10510^\{5\}\.
XGBoost was trained to capture nonlinear relationships\. we utilized a gradient\-boosted tree ensemble \(XGBRegressor\)\. The hyperparameters were set as follows: number of estimators = 300, maximum tree depth = 5, learning rate = 0\.04, row subsampling rate = 0\.8, and column subsampling rate per tree = 0\.8 \(colsample\_bytree\)\. The histogram\-based tree method \(tree\_method="hist"\) was used\. Within each training fold, 15% of the training samples were held out as an internal validation set for early stopping, with a patience of 30 rounds to prevent overfitting\.
#### Validation and Robustness
We employed a repeated 5\-fold cross\-validation \(CV\) scheme to assess performance\. All folds were stratified and shuffled with a fixed random seed\. The coefficient of determination \(R2R^\{2\}\) was calculated for each fold as:
R2=1−∑i\(yi−y^i\)2∑i\(yi−y¯test\)2R^\{2\}=1\-\\frac\{\\sum\_\{i\}\(y\_\{i\}\-\\hat\{y\}\_\{i\}\)^\{2\}\}\{\\sum\_\{i\}\(y\_\{i\}\-\\bar\{y\}\_\{\\text\{test\}\}\)^\{2\}\}\(29\)whereyi,y^iy\_\{i\},\\hat\{y\}\_\{i\}, andy¯test\\bar\{y\}\_\{\\text\{test\}\}represent the observed labels, predicted values, and the mean of the test\-fold labels, respectively\. To ensure the reliability of our findings, the entire pipeline was repeated across 8 independent seeds for CNN and corresponding SAE on a fixed backhone\. Additionally, a permutation test was conducted by shuffling apoptosis labels while maintaining the original feature structures and CV splits to establish a null distribution\. Statistical significance between the CNN and SAE was determined by a two\-sided Mann\-Whitney U test and each seed treated as an independent observation\.
#### Layer Comparision of CNN layers andL2L\_\{2\}Nomralization Effect\.
The efficacy ofL2L\_\{2\}normalization \(cell density correction\) on CNNfstage5\_outf\_\{\\text\{stage5\\\_out\}\}representation was evaluated by quantifying the performance gain relative to raw features\. Statistical significance for CNN layer comparisons, normalization effects and comparision between CNN and SAE were assessed using a two\-sided Wilcoxon signed\-rank test\.
### E\.6A Geometric Analysis of Vector spaces: Comparing CNN and SAE Representations
#### Quantification of Representation Fidelity viaKK\-NN Phenotypic Coherence and Moran’s I
If SAE representation more faithfully reflect the data intrinsic sturcture by disentangling the superposition, the variance of phenotypic labels within akk\-nearest neighbor \(kk\-NN\) neighborhood is expected to be significantly lower than the global variance across the class\. To quantify this local semantic consistency, we constructedkk\-NN graphs on both the CNN and SAE feature vectors across varying neighborhood sizes \(k∈\{5,10,15,20,25\}k\\in\\\{5,10,15,20,25\\\}\), computed independently within each mutation class Phenotypic coherence was evaluated using the standard deviation ratio:
Ratio=σlocalσglobal\\text\{Ratio\}=\\frac\{\\sigma\_\{\\text\{local\}\}\}\{\\sigma\_\{\\text\{global\}\}\}\(30\)whereσlocal\\sigma\_\{\\text\{local\}\}is the standard deviation of the cell death rate within a given data point’skk\-NN neighborhood, andσglobal\\sigma\_\{\\text\{global\}\}denotes the baseline standard deviation across the each mutation class\. A lower ratio indicates that the spatial clustering of the feature space closely aligns with semantic relatedness\. To statistically compare the representational spaces, we computed the mean of the standard deviation ratios across all data points\.
To complement the local dispersion metric, we evaluated the spatial autocorrelation of phenotypic labels using Global Moran’s I calculated per mutation class across the samekk\-NN defined neighborhoods\. The statistic is defined as:
I=n∑i=1n∑j=1nwij∑i=1n∑j=1nwij\(zi−z¯\)\(zj−z¯\)∑i=1n\(zi−z¯\)2I=\\frac\{n\}\{\\sum\_\{i=1\}^\{n\}\\sum\_\{j=1\}^\{n\}w\_\{ij\}\}\\frac\{\\sum\_\{i=1\}^\{n\}\\sum\_\{j=1\}^\{n\}w\_\{ij\}\(z\_\{i\}\-\\bar\{z\}\)\(z\_\{j\}\-\\bar\{z\}\)\}\{\\sum\_\{i=1\}^\{n\}\(z\_\{i\}\-\\bar\{z\}\)^\{2\}\}\(31\)wherennis the total number of observations within the given class,ziz\_\{i\}represents the phenotypic label value of theii\-th instance,z¯\\bar\{z\}is the sample mean of that class, andwijw\_\{ij\}denotes the spatial weight matrix derived from the intra\-classkk\-NN topology \(wherewij=1w\_\{ij\}=1ifjjis akk\-nearest neighbor ofii\(i≠ji\\neq j\), and0otherwise\)\. While the standard deviation ratio captures the local dispersion of labels within immediate neighborhoods, Global Moran’s I rigorously evaluates the overall spatial continuity across the entire intra\-class manifold\.
#### Assessing Information Capacity via Effective Rank
To compare the information capacity of the CNN 3 layers and SAE feature spaces, we computed their Effective Rank \(eRanke\\text\{Rank\}\) under three settings: \(1\) raw high\-dimensional features, \(2\) 250\-dimensional PCA\-reduced features, and \(3\) 250\-dimensional PCA\-reduced features with prior feature\-wise standardization\. PCA projection to a fixed dimension \(250D\) ensures a fair comparison by aligning the capacity ceiling across disparate architectures \(e\.g\., CNN vs\. overcomplete SAE\)\. Additionally, the standardization step \(scaling features to unit variance\) was selectively applied to evaluate whether low\-variance directions encode meaningful biological structures, preventing them from being dominated by a few high\-variance components\.
TheeRanke\\text\{Rank\}quantifies the effective dimensionality of a feature matrix based on the Shannon entropy of its variance distribution\. Letλi\\lambda\_\{i\}\(i=1,…,mi=1,\\dots,m\) denote the eigenvalues of the covariance matrix sorted in descending order, wheremmis the number of principal components\. These eigenvalues are normalized to form a discrete probability distribution:
pi=λi∑j=1mλjp\_\{i\}=\\frac\{\\lambda\_\{i\}\}\{\\sum\_\{j=1\}^\{m\}\\lambda\_\{j\}\}\(32\)The effective rank is defined as the exponential of this spectral entropy:
eRank=exp\(−∑i=1mpilnpi\)e\\text\{Rank\}=\\exp\\left\(\-\\sum\_\{i=1\}^\{m\}p\_\{i\}\\ln p\_\{i\}\\right\)\(33\)A highereRanke\\text\{Rank\}indicates a more uniform variance distribution across orthogonal dimensions, signaling higher information capacity\. Conversely, aneRanke\\text\{Rank\}close to 1 implies that the representational space has collapsed into a single dominant dimension\.
### E\.7Adaptation of Single\-cell seq Methods to the Representation Derived from image
#### Visualization of Cell States via PHATE
We employed Potential of Heat\-diffusion for Affinity\-based Trajectory Embedding \(PHATE\), which preserves both local and global data structure\. SAE representations were treated as analogues of single\-cell gene expression profiles: each image\-level feature vector represents a point in the learned feature space, where local Euclidean distances reflect semantic similarity owing to the disentangled nature of the SAE representation\.
To directly visualise the relationship between phenotypic progression and cell death, a separate PHATE embedding was constructed each control and mutations pair \(Control\-*SNCA x3*, Control\-*GBA*, Control\-*LRRK2*\)\. After log\-transformation followed by z\-score standardization \(log\-z\-standardization\), the filtered feature matrix was reduced to 50 dimensions via PCA\. PHATE was then computed withk=5k=5nearest neighbours,α\\alpha\-decay=120=120, Euclidean distance, and the diffusion time was set to 35\.
#### Pseudotime Analysis and Partition\-based Graph Abstraction
We adopted diffusion pseudotime to SAE representations derived from images to compute trajectories of control\-mutatnt pairs\. To prevent circular analysis, the dataset was stratified into a feature\-selection set \(50%\) and an evaluation set \(50%\); differential activation was computed on the former, and all downstream statistics were assessed on the latter\.
Feature vectors wereL2L\_\{2\}\-normalized and log\-zz\-standardization\. For each mutation, a Control\-mutation cell pair was constructed\. The cells were then projected into a diffusion map space with 15 components\. Akk\-NN graph was constructed using Euclidean distance\. These diffusion pseudotime \(DPT\) analyses were implemented using Scanpy’s sc\.tl\.diffmap \(with n\_comps=15\) and sc\.tl\.dpt functions\. The root cell was selected as the control cell closest to the control centroid in diffusion\-map coordinates, serving as an approximation of the Fréchet centroid\. Partition\-based graph abstraction \(PAGA\) was employed to verify pathological continuity and connectivity between mutation and control clusters with resolution 0\.65\.
The monotonic association between DPT and apoptosis was assessed by Spearman’sρ\\rhoand Pearson’srr\. A generalized additive model \(GAM; cubic splines\) was fitted to capture nonlinear trends, and adjustedR2R^\{2\}was computed to quantify the explained variance of the DPT–apoptosis relationship\. To test the robustness of root cell selection, we performed a root perturbation analysis by randomly resamplingn=10n=10alternative root cells were selected from the neighborhood of the Control medoid and computing the resulting distribution of Spearman’sρ\\rho\. Cells were categorized into three DPT terciles based on the 5th–95th percentile range to exclude outliers\. A Jonckheere–Terpstra \(JT\) test was applied to assess the significance of the ordered trend in apoptosis rates across these stages\. Post\-hoc pairwise comparisons \(Proximal vs\. Intermediate, Intermediate vs\. Distal, and Proximal vs\. Distal\) were performed using Dunn’s test with Bonferroni correction for multiple comparisons\.
#### Characterization of Feature Map Activation Dynamics along the Pseudotime Trajectory
To visualize the continuous dynamics of the learned representations across DPT, feature vectors were first ordered by their DPT values and aggregated into 100 equal\-sized pseudotime bins to compute stable mean feature activations\. To emphasize the relative temporal dynamics, the binned activation profile of each feature was Z\-score normalized and subsequently smoothed using a 1D Gaussian filter \(σ=1\.5\\sigma=1\.5\)\.
To identify modules of features exhibiting synchronized activation patterns, we constructed akk\-NN \(kk= 15\) graph and applied the Leiden algorithm with a resolution of 1\.0 on the smoothed temporal profiles\. The resulting feature modules were then hierarchically ordered according to their peak activation time along the pseudotime trajectory, visualizing a sequential cascade of phenotypic transformations from the early \(Control\-like\) to late \(pathological\) stages\.
Furthermore, to quantify model the continuous, non\-linear activation trends of interpretable individual feature map activation along the DPT axis, we fitted GAM utilizing penalized cubic regression splines\. To mitigate the potential leverage of extreme density tails, the GAM fitting was strictly confined to cells within the 1st and 99th percentiles of the DPT distribution\. The confidence interval \(95% CI\) was computed\. Both the original cell image and its corresponding attention map overlay were visualized\.
### E\.8scRNA Integration
#### Public Data Acquisition and Preprocessing
Single\-cell RNA sequencing \(scRNA\-seq\) data from SNCAX3 and control 3D cortical organoids were obtained from the NCBI Gene Expression Omnibus \(GEO\) database under accession number GSE236002Jinet al\.\[[2024](https://arxiv.org/html/2606.31394#bib.bib84)\]\. The dataset comprised 12 samples \(6 controls and 6 SNCAX3\) derived from four subclones per genotype, sequenced across three flow cells\. One SNCAX3 sample was excluded following the original study’s quality\-control criteria\. Per\-sample quality control was performed using Scanpy\. Cells expressing fewer than 250 genes or exceeding 25% mitochondrial transcript fraction were removed\. To eliminate putative doublets, cells exceeding both the 99th percentile of detected genes and UMI counts within each sample were excluded\. Ambient RNA contamination was corrected per sample using DecontXYanget al\.\[[2020](https://arxiv.org/html/2606.31394#bib.bib256)\], with raw \(unfiltered\) barcode matrices supplied as background profiles\. After concatenation, genes detected in fewer than three cells were removed, and sex\-linked genes \(Y\-chromosome genes, XIST, and TTTY\-family transcripts\) were excluded to eliminate sex\-confounded variation\. The 3,000 most highly variable genes\(HVG\) were selected using the variance\-stabilizing transformation \(Seurat v3 flavour\) computed for each batch individually, where each batch was defined as the combination of genotype and subclone identity\. Batch correction was performed using scVI with genotype–subclone as the batch key and sex and flow cell as categorical covariates\. Mitochondrial and ribosomal transcript fractions were included as continuous covariates\. The model was trained for up to 400 epochs with early stopping \(patience = 20\) using a negative binomial likelihood, yielding a 30\-dimensional batch\-corrected latent representation used for all downstream analyses\. This latent space was subsequently utilized for dimensional reduction and downstreama analysis
#### Isolation of 2D\-Equivalent Neuronal Lineages from 3D Organoid scRNA\-seq Data
To ensure a fair and direct comparison between 2D monolayer cultures and 3D organoids, we specifically subsampled cell populations from the 3D dataset that transcriptomically represent lineages typically yielded in 2D neuronal environments\. We evaluated a range of Leiden clustering resolution parameters \(0\.4, 0\.5, and 0\.6\) based on the scVI latent space, and selected 0\.5 as the optimal resolution, as it provided the clearest separation of major cell lineages without over\-clustering\.
Cluster\-specific marker genes were identified using differential expression gene \(DEG\) analysis based on raw counts\. Based on these canonical markers, we selectively isolated the specific clusters that exhibited transcriptional profiles corresponding to 2D\-cultured neurons—primarily comprising "Mature Excitatory Neurons" and "Forebrain Excitatory Neurons"—for subsequent comparative analysis\. The detailed DEG profiles defining each cluster are provided in Supplementary Fig\. S14\.
#### Gromov\-wasserstein Coupling of Image Representations and scRNA\-seq data
Figure 12 provides an overview of the implementation pipeline for GW\-map\. After the isolation of 2D\-Equivalent Neuronal Lineage, image\-level SAE representation dimensions were filtered to retain those related to Control andSNCAX3\-specific feature maps \(Wilcoxon rank\-sum DE, adjustedp<0\.05p<0\.05,\|log2FC\|≥0\.58\|\\log\_\{2\}\\text\{FC\}\|\\geq 0\.58\)\. The resulting feature vectors were then log\-transformed,zz\-score standardized, andL2L\_\{2\}\-normalized and projected to 30 principal components for Gromov\-wasserstein\(GW\) optimal transport \(OT\) coupling\. On the scRNA side, the 30\-dimensional scVI latent representation was used directly without further transformation\. Up to 15,000 cells per modality were subsampled with genotype\-balanced selection \(equal Control and SNCA counts\)\.
An entropic GW\-OT coupling matrix was computed between the image PCA \(d=50\) space and the scVI latent space using geodesic intra\-modal cost matrices \(kk\-NN graph shortest paths;k=60k=60\) and Sinkhorn regularization \(ε=0\.003\\varepsilon=0\.003\)\. The resulting coupling matrixπ∈ℝnimg×nrna\\pi\\in\\mathbb\{R\}^\{n\_\{\\text\{img\}\}\\times n\_\{\\text\{rna\}\}\}defines a soft probabilistic assignment between image and transcriptomic cells\. Coupling quality was assessed via genotype matching accuracy \(hard assignment and barycentric projection\) and GW loss\.
Per\-gene expression profiles were obtained by applying the trained scVI decoder to yield denoised normalized expression across highly variable genes \(3,000\)\. For each image cellii, a soft gene expression target was constructed via barycentric projection:y^i=∑jπ~ij⋅gj\\hat\{y\}i=\\sum\_\{j\}\\tilde\{\\pi\}\{ij\}\\cdot g\_\{j\}, whereπ~ij\\tilde\{\\pi\}\_\{ij\}denotes the row\-normalized coupling weight andgjg\_\{j\}the gene expression vector of RNA celljj\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x12.png)
Figure 12:Overview of the GW\-map implementation pipeline\.
#### Prediction of Coupled Image Representations and scRNA\-seq Data
Two regression models were trained via five\-fold stratified cross\-validation: XGBoost and a multilayer perceptron\(MLP\)\. First, both \(i\) pre\-scVI log\-transformed HVG expression and \(ii\) post\-scVI denoised HVG expression were predicted from 100\-dimensional image PCA feature vectors via a two\-hidden\-layer MLP \(128 units, dropout 0\.4, batch normalization, cosine annealing with learning rate5×10−45\\times 10^\{\-4\}and weight decay1\.2×10−31\.2\\times 10^\{\-3\}; early stopping with patience 50, max 200 epochs\)\. MLP inputs and targets were standardized per fold using training\-set statistics, and predictions were clipped to the observed training range to prevent numerical instability from sparse genes\. Second, \(iii\) scVI latent vectors were predicted from 30\-dimensional image PCA feature vectors via XGBoost \(200 estimators, max depth 5, learning rate 0\.1\)\. Performance was quantified byR2R^\{2\}averaged across folds\. To eliminate genotype as a confounding factor, the coupling matrix was restricted to within\-genotype sub\-blocks \(Control–Control and SNCA–SNCA\), and prediction was performed independently within each genotype ensuring that predictive performance reflects genuine cell\-level Phenotype–transcriptome correspondence rather than trivial genotype separation\.
Coupling rows were randomly permuted to generate null distributions ofR2R^\{2\}\. For per\-gene significance, z\-scoredR2R^\{2\}values were computed fromn=800n=800Ridge \(utilized for computational efficiency in large\-scale permutation\) andn=1,000n=1\{,\}000MLP permutations, enabling to mitigate potential inflation of values due to data distribution or variance structures\.
#### Gene Network Inference Based on Coefficient of Determination
To identify coherent biological programes among the predicted genes, each gene was represented as a two\-dimensional point using its within\-genotype permutationzz\-scoredR2R^\{2\}\-one axis for Control and one for SNCA\-quantifying how strongly that gene’s expression was predicted from phenotypical features within each genotype\. Akk\-nearest\-neighbour graph \(k=15k=15, Euclidean distance\) was constructed on these two\-dimensional coordinates and community detection was performed using the Leiden algorithm at multiple resolutions \(0\.002, 0\.2, 0\.3\)\. Gene flow across resolutions was visualized as a Sankey diagram, allowing hierarchical decomposition of functionally coherent gene modules\. Each cluster was annotated via over\-representation analysis \(ORA; Fisher’s exact test, Benjamini–Hochberg adjustedp<0\.05p<0\.05\) against GO Biological Process, KEGG, Reactome, and MSigDB Hallmark gene sets\. The identity of each cluster was subsequently determined based on the most significantly enriched biological terms and the expression of known canonical markers\.
We also selected top 200 high*SNCA x3*z score genes and similarly employed sankey diagram using Leiden algorithm at multiple resolutions \(0\.08, 0\.25, 0\.4, 1\.0\)\. The identity of each cluster was manually assigned by directly examining the lists of RNA\.
Force\-directed networks were generated for selected clusters to visualize intra\-cluster gene connectivity and sub\-cluster organization, with node size proportional to graph degree\.
### E\.9CNN Interpretability through the lens of Mechanistic Interpretability
#### Quantification of class\-specific feature maps across models
The analysis was coducted with sparsity 3200 instance\. The class\-specific feature map was selected based on image\-level mean activations\. Considering that the opposite sign concepts can be superposed to single feature map, raw CNN GAP values were transformed to absolute magnitude\. Then identified via differential activation comparing Control to each mutation \(sparsity threshold5×10−55\\times 10^\{\-5\}; Wilcoxon rank\-sum test, adjustedp<0\.05p<0\.05;\|log2FC\|\>0\.58\|\\log\_\{2\}\\text\{FC\}\|\>0\.58\)\.
#### Visualizing the Feature map via Bilinear Interpolation
We employ bilinear interpolation to visualize class\-specific latent feature maps\. For a given input images, the CNN encoder produces a spatial feature map at \(fstage5outf\_\{stage5\_\{o\}ut\}\)\. The normalized tokens are passed through the trained SAE encoder, producing an activation vector of the dictionary size per spatial position, excluding dead feature maps\.
For each CNN and SAE latent feature map, the resulting activation map \(64×6464\\times 64\) is bilinearly upsampled to the original image resolution \(128×128128\\times 128\) and visualized as a jet\-colormap overlay\. The activation map is normalized by clipping values below the 50th percentile \(median\) to zero and mapping the 99\.9th percentile to the maximum\. The overlay is alpha\-blended \(α=0\.5\\alpha=0\.5\) onto the Fiji\-style linearly scaled microscopy image, where per\-channel intensity is linearly mapped from the 10th percentile to the 99\.5th percentile into the 8\-bit range\. The top 20 highest\-activating images are displayed for each concept\. Concept selection is performed via a differential expression filter\.
Additionally, to simulate the CNN’s perspective, the input image is processed with per\-channel SafeInstanceNormalization\. The resulting three\-channel tensor is then linearly mapped to 8\-bit using a global \(cross\-channel\) 1st to 99th percentile clipping\.
#### Activation Maximization
To visualize the learned concept of individual SAE feature maps, we employ activation maximization \(AM\)Erhanet al\.\[[2009](https://arxiv.org/html/2606.31394#bib.bib108)\], Olahet al\.\[[2017](https://arxiv.org/html/2606.31394#bib.bib110)\]in Fourier\-preconditioned space\. For each target feature map, a synthetic 128×128 three\-channel input image is optimized to maximally activate the feature map’s GAP response through the frozen CNN encoder and SAE\.
Optimization is performed in the frequency domain: learnable complex\-valued Fourier coefficients are initialized with standard deviation 0\.01 and converted to pixel space via inverse FFT weighted by a1/fα1/f^\{\\alpha\}decay mask, which penalizes high\-frequency components to encourage naturalistic images\. Because the SAE’s binary gating produces zero gradients in evaluation mode, we replace the hard gate with a temperature\-scaled sigmoid \(T = 10\), providing differentiable approximation\.
The objective function maximizes the target feature map’s GAP activation while minimizing L2 decay on Fourier coefficients \(λ\\lambda=1×10−41\\times 10^\{\-4\}\) and L1 pixel sparsity \(λ\\lambda=1×10−41\\times 10^\{\-4\}\) to promote dark backgrounds\. At each optimization step, the pixel\-space image is clamped to \[\-3\.5, 3\.5\] to remain within the distribution seen by the safeinstancenormalized encoder, then subjected to random spatial jitter, rotation \(±15∘\\pm 15^\{\\circ\}\), and scaling \(0\.9–1\.1\) for transformation robustness\. Optimization uses Adam with cosine annealing over 2,048 steps at an initial learning rate of 40\.0\.
To control the visual granularity, AM is performed across a grid of Fourier decay powers \(α∈\\alpha\\in\{0\.4, 0\.8, 1\.2, 1\.6\}\), spatial jitter magnitudes \(\{4, 8, 12, 16\} pixels\), and three random seeds\. The resulting synthetic images are visualized per\-channel alongside spatial activation heatmaps obtained by bilinear upsampling of the SAE neuron’s per\-token activation map to the input resolution\.
## Appendix FSupplementary Figures
### F\.1Robustness of Diffusion Pseudotime \(DPT\) Analysis
Control
Mutation
A\. Control \(GBA\)
A\. Mutation \(GBA\)
B\. Control \(SNCA\)
B\. Mutation \(SNCAX3\)
C\. Control \(LRRK2\)
C\. Mutation \(LRRK2\)
Figure 13:Distribution of geodesic distance between control medoid and mutation
Left column shows control samples, while the right column shows mutation saples \(A\) GBA, \(B\) SNCAX3, and \(C\) LRRK2 lines of geodesic distance \(pseudotime\) from control medoid\.Figure 14:Robustness of DPT to Root Cell Selection
To evaluate the stability of our pseudotime estimates, we performed a root permutation analysis by shifting the starting point within the vicinity of the control centroid \(n = 10 iterations\)\. For each iteration, the relationship between the distance from the control medoid and the cell death rate within each mutation population was quantified using Spearman’s rank correlation\. The consistent distribution of DPT and the stability of these coefficients across permutations demonstrate the robustness of our trajectory model against initial root selection\."
### F\.2Characterization of Leiden Clusters and Cell\-Type Annotation of scRNA\-seq data
Figure 15:Expression patterns of cluster\-specific marker genes
Dot plot visualizing the expression of the top 10 differentially expressed genes \(DEGs\) for each cell cluster\. The size of each dot represents the percentage of cells expressing the gene, and the color intensity indicates the average expression level \(scaled\) within the clusterFigure 16:Statistical summary of top 10 DEGs\(Continued on next page\)Figure 17:Statistical summary of top 10 DEGs per cell cluster
List of the top 10 most significant differentially expressed genes for each identified cluster\. Statistics include gene symbols, average log2 fold\-change, and adjusted p\-values calculated using the Wilcoxon Rank Sum test\.
### F\.3Interpretation of Individual Feature Map of SAE \(Fig\. 17 \- Fig\. 21\)
Original cell image, attention map cell image, overlay, activation maximization with individual channel, combined and attention map of activation maximization are included\. These figures indicate that the individual feature map of SAE is activated by specific concept like mitochondria, mitochondiria\-lysosome spatial colocalization, nuclear envelope making each feature map interpretable\. This monosemanticity indicates that SAE successfully disentangle the entangled superposition\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x23.png)
Figure 18:Visual illustration of SAE feature map\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x24.png)
Figure 19:Visual illustration of SAE feature map\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x25.png)
Figure 20:Visual illustration of SAE feature map\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x26.png)
Figure 21:Visual illustration of SAE feature map\.
![[Uncaptioned image]](https://arxiv.org/html/2606.31394v1/x27.png)
Figure 22:Visual illustration of SAE feature map\.Similar Articles
Sparse Autoencoders Map Brain-LLM Alignment onto Cortical Semantic Topography
This paper uses sparse autoencoders to decompose LLMs into interpretable features and shows that semantic features explain brain alignment with cortical semantic topography, generalizing across English, Chinese, and French.
A Geometric View for Understanding Concept Learning and Neuron Interpretation in Sparse Autoencoders
This paper proposes a unified geometric framework for understanding concept learning and neuron interpretation in sparse autoencoders, formalizing concepts as sets and defining detection, separation, and approximation. It provides error bounds, capacity constraints, and links to formal concept analysis, with experiments on synthetic data.
Effects of sparsity and superposition on loss in simple autoencoders
This paper provides a mathematical analysis of superposition in neural networks, deriving upper and lower bounds on L2 reconstruction loss for simple autoencoders with power activation functions, corroborating empirical findings by Elhage et al.
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
This paper applies TopK Sparse Autoencoders to three EEG foundation models (SleepFM, REVE, LaBraM) to extract interpretable feature dictionaries and introduces a framework for concept steering, revealing representational failures and clinical entanglements.
3D Masked Autoencoders are Robust Learners of Volumetric and Multimodal Cellular Representations for Microscopy
This paper presents 3D masked autoencoders for volumetric microscopy data, demonstrating that 3D modeling outperforms 2D max-projection and slice-based variants on downstream single-cell tasks, with cross-modal alignment to a protein language model further improving performance.