Tag
PolicyGuard is a sub-agent verifier that enhances LLM agent policy adherence by providing contextual reasoning and conversation-specific feedback across multi-turn interactions, achieving significant improvements on the tau^2-BENCH benchmark.
LedgerAgent is a method for customer service agents that maintains task states in a separate ledger to improve policy adherence and state management during tool calling. It improves average passk over standard approaches across four domains.