Tag
This paper proposes a post-training refinement approach using interventional contrastive learning to disentangle speech foundation model representations into separate content and speaker subspaces. The method shows improved out-of-domain speaker verification performance and evidence of successful separation.