PAAC: Privacy-Aware Agentic Device-Cloud Collaboration

Hugging Face Daily Papers Papers

Summary

This paper introduces PAAC, a privacy-aware agentic framework for device-cloud collaboration that uses a decoupled architecture and LLM-driven sanitization to protect sensitive data while maintaining high performance.

Large language model (LLM) agents face a structural tension: cloud agents provide strong reasoning but expose user data, while on-device agents preserve privacy at the cost of overall capability. Existing device-cloud designs treat this boundary as a compute split rather than a trust boundary suited to agentic workloads, and existing sanitizers force a choice between policy flexibility and the structural fidelity tool calls require. In this work, we develop PAAC, a privacy-aware agentic framework that aligns planner--executor decomposition with the device-cloud boundary so that role specialization itself becomes the privacy mechanism. The cloud agent reasons over typed placeholder tokens that preserve each sensitive value's reasoning role while discarding its content, while the on-device agent identifies sensitive spans and distills each step's execution outcome into compact key findings. Sanitization confines the on-device LLM to proposing which spans to mask, while a deterministic registry performs all substitution and reversal, keeping actions directly executable on device. On three agentic benchmarks under strict privacy settings, PAAC dominates the Pareto frontier of privacy and accuracy, improving average accuracy by 15-36\% and reducing average leakage by 2-6times over state-of-the-art device-cloud baselines, with the largest margins on privacy targets outside fixed entity taxonomies. We find consistent improvements on 17 additional benchmarks spanning 10 domains, including math, science, and finance.
Original Article
View Cached Full Text

Cached at: 05/13/26, 12:15 PM

Paper page - PAAC: Privacy-Aware Agentic Device-Cloud Collaboration

Source: https://huggingface.co/papers/2605.08646

https://huggingface.co/papers/2605.08646#%F0%9F%94%91-tldr🔑 TL;DR

PAAC reframes the device-cloud split as atrust boundaryrather than a compute split, with two contributions working in tandem: a decoupled agentic architecture and an LLM-driven privacy sanitizer.

https://huggingface.co/papers/2605.08646#%F0%9F%A4%9D-decoupled-architecture🤝 Decoupled Architecture

Cloud-reason-and-plan, device-execute-and-judge. The cloud agent reasons and plans over typed placeholder tokens (e.g.,\{BALANCE: \.\.\.\}); the on-device agent identifies sensitive spans, executes tools with real values, and distills each step’s outcome into compact key findings. Role specialization itself becomes the privacy mechanism, and per-step distillation keeps each agent’s input compact across turns, avoiding the trajectory-coupled context growth that breaks single-agent pipelines.

https://huggingface.co/papers/2605.08646#%E2%9A%99%EF%B8%8F-proposerverifierregistry-sanitization⚙️ Proposer–Verifier–Registry Sanitization

The on-device LLM onlyproposes(span, proxy token) pairs; a deterministic append-only regex registry handles all substitution and reversal. This preserves tool-call fidelity, gives cross-round consistency, and locks in first-turn protection even if the on-device LLM is later compromised.

https://huggingface.co/papers/2605.08646#%F0%9F%93%8A-results-qwen3-4b–gemini-3-flash📊 Results (Qwen3-4B + Gemini 3 Flash)

  • 📈+15-36% accuracyand2-6× lower leakagevs SOTA device-cloud baselines on \\tau^2-Bench Airline/Retail and GAIA
  • 🎯 0% leakage on open-vocab targets (CLUTRR names) where pattern-based methods hit 38.6%
  • 🪶 Stable accuracy and token cost as privacy tightens; gains hold across 17 more benchmarks in 10 domains

Similar Articles

CONCORD: Asynchronous Sparse Aggregation for Device-Cloud RAG under Document Isolation

arXiv cs.AI

CONCORD is an asynchronous sparse aggregation framework for retrieval-augmented generation (RAG) under document isolation in device-cloud setups. It improves throughput and reduces communication by orders of magnitude by treating the cloud as an asynchronously arriving evidence source rather than a continuously synchronized co-generator.

MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents

Hugging Face Daily Papers

MemPrivacy is a research paper introducing a framework for privacy-preserving personalized memory management in edge-cloud AI agents, using type-aware placeholders to protect sensitive data while maintaining semantic utility. It includes a new benchmark dataset and demonstrates superior performance over general-purpose models like GPT-5.2 and Gemini-3.1-Pro.