leakage

#leakage

MosaicLeaks: Can your research agent keep a secret?

Hugging Face Blog ↗ · 13h ago Cached

MosaicLeaks introduces a new benchmark for measuring privacy leakage in deep-research AI agents, showing that agents often leak private information through external queries and proposing a training method (PA-DR) to reduce leakage while improving task performance.

0 favorites 0 likes

#leakage

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

arXiv cs.CL ↗ · 2026-06-01 Cached

Introduces MosaicLeaks, a benchmark of 1,001 multi-hop deep research tasks that chain private enterprise documents with public web queries to evaluate privacy leakage. Finds that models leak sensitive information at multiple levels, and proposes PA-DR, a reinforcement learning framework that reduces leakage while improving task accuracy.

0 favorites 0 likes

leakage

MosaicLeaks: Can your research agent keep a secret?

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

Submit Feedback