latent-knowledge

#latent-knowledge

Forgetting is Not Erasure: Recovering Latent Knowledge via Transport Keys

arXiv cs.LG ↗ · 2d ago Cached

This paper argues that catastrophic forgetting in neural networks is not erasure but an interface alignment problem. It introduces 'transport keys' to recover latent task-specific features from sequentially trained models, demonstrating significant performance recovery on split CIFAR-100.

0 favorites 0 likes

#latent-knowledge

MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models

arXiv cs.CL ↗ · 2026-05-29 Cached

MechELK is a three-stage framework combining mechanistic interpretability tools (SAE, activation patching, causal probing) with representation engineering to elicit latent knowledge from LLMs, achieving 84.7% accuracy and outperforming existing methods like CCS and linear probing.

0 favorites 0 likes

latent-knowledge

Forgetting is Not Erasure: Recovering Latent Knowledge via Transport Keys

MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models

Submit Feedback