PAAC: Privacy-Aware Agentic Device-Cloud Collaboration
Summary
This paper introduces PAAC, a privacy-aware agentic framework for device-cloud collaboration that uses a decoupled architecture and LLM-driven sanitization to protect sensitive data while maintaining high performance.
View Cached Full Text
Cached at: 05/13/26, 12:15 PM
Paper page - PAAC: Privacy-Aware Agentic Device-Cloud Collaboration
Source: https://huggingface.co/papers/2605.08646
https://huggingface.co/papers/2605.08646#%F0%9F%94%91-tldr🔑 TL;DR
PAAC reframes the device-cloud split as atrust boundaryrather than a compute split, with two contributions working in tandem: a decoupled agentic architecture and an LLM-driven privacy sanitizer.
https://huggingface.co/papers/2605.08646#%F0%9F%A4%9D-decoupled-architecture🤝 Decoupled Architecture
Cloud-reason-and-plan, device-execute-and-judge. The cloud agent reasons and plans over typed placeholder tokens (e.g.,\{BALANCE: \.\.\.\}); the on-device agent identifies sensitive spans, executes tools with real values, and distills each step’s outcome into compact key findings. Role specialization itself becomes the privacy mechanism, and per-step distillation keeps each agent’s input compact across turns, avoiding the trajectory-coupled context growth that breaks single-agent pipelines.
https://huggingface.co/papers/2605.08646#%E2%9A%99%EF%B8%8F-proposerverifierregistry-sanitization⚙️ Proposer–Verifier–Registry Sanitization
The on-device LLM onlyproposes(span, proxy token) pairs; a deterministic append-only regex registry handles all substitution and reversal. This preserves tool-call fidelity, gives cross-round consistency, and locks in first-turn protection even if the on-device LLM is later compromised.
https://huggingface.co/papers/2605.08646#%F0%9F%93%8A-results-qwen3-4b–gemini-3-flash📊 Results (Qwen3-4B + Gemini 3 Flash)
- 📈+15-36% accuracyand2-6× lower leakagevs SOTA device-cloud baselines on \\tau^2-Bench Airline/Retail and GAIA
- 🎯 0% leakage on open-vocab targets (CLUTRR names) where pattern-based methods hit 38.6%
- 🪶 Stable accuracy and token cost as privacy tightens; gains hold across 17 more benchmarks in 10 domains
Similar Articles
When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems
This paper systematically studies hybrid multi-agent systems combining cloud-based LLMs and on-device SLMs, revealing task-dependent optimal architectures and challenging the assumption that more frontier compute always improves performance.
CONCORD: Asynchronous Sparse Aggregation for Device-Cloud RAG under Document Isolation
CONCORD is an asynchronous sparse aggregation framework for retrieval-augmented generation (RAG) under document isolation in device-cloud setups. It improves throughput and reduces communication by orders of magnitude by treating the cloud as an asynchronously arriving evidence source rather than a continuously synchronized co-generator.
MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents
MemPrivacy is a research paper introducing a framework for privacy-preserving personalized memory management in edge-cloud AI agents, using type-aware placeholders to protect sensitive data while maintaining semantic utility. It includes a new benchmark dataset and demonstrates superior performance over general-purpose models like GPT-5.2 and Gemini-3.1-Pro.
"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"
The article introduces OncoAgent, a dual-tier multi-agent framework designed for privacy-preserving clinical decision support in oncology. It details a system architecture that combines corrective RAG, a reflexion safety loop, and dual-tier QLoRA fine-tuning optimized for AMD hardware.
PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets
PrivFusion is a privacy-preserving multi-agent framework that automates the harmonization of structured datasets across institutions before federated training, reducing manual effort and enabling collaborative analytics on sensitive clinical data.