information-leakage

#information-leakage

Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

arXiv cs.CL ↗ · 2026-05-27 Cached

This paper investigates the risk of sensitive information inference from exported LLM representations in clinical summarization, showing that reducing leakage from one vector artifact does not guarantee privacy in others. It introduces SurfaceLoRA, a fine-tuning method that reduces race recovery from targeted vectors while preserving utility.

0 favorites 0 likes

information-leakage

Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

Submit Feedback