disentanglement

Tag

Cards List
#disentanglement

Natively Unlearnable Large Language Models

arXiv cs.LG · yesterday Cached

The paper proposes NULLs (Natively Unlearnable LLMs), a model class that isolates source-specific contributions in sparsely activated sinks while sharing backbone neurons, enabling clean unlearning of individual data sources without retraining and preserving general language capabilities.

0 favorites 0 likes
← Back to home

Submit Feedback