llm-privacy

#llm-privacy

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

arXiv cs.LG · 2026-05-21

Proposes Complementary Self-Distillation (SelfCI), a method for improving contextual integrity in LLMs by balancing utility and privacy. The method is evaluated on the CI-RL and PrivacyLens benchmarks across multiple models.

#llm-privacy

Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting

arXiv cs.CL · 2026-04-20

This paper introduces Attention-Shifting (AS), a framework for selective machine unlearning in LLMs that removes sensitive information while preventing hallucinations and preserving model utility. The method combines importance-aware attention suppression with retention enhancement and reports up to 15% higher accuracy preservation than existing unlearning approaches on standard benchmarks.
