Tag
This paper introduces an LLM-powered comparative pipeline for analyzing governance discourse in AI agent protocols, applying it to ERC-8004 and Google A2A to examine how institutional design shapes thematic priorities and community structure.
A study by Emory University and IBM Research introduces a verifiable context governance approach for LLMs, achieving 97% accuracy at one-third the cost.
This paper introduces MemClaw, a governed shared memory architecture for multi-agent LLM systems, formalizing failure modes like unauthorized leakage and stale propagation, and evaluating the system via the ArgusFleet harness.
A study reveals that 74% of companies have pulled AI agents from production, with even higher rollback rates among those with mature AI governance. The core issue is not the AI models themselves but the messy, disconnected infrastructure and data they rely on.
The article argues that the AI race may ultimately be about trust and organizational intelligence rather than model benchmark competition, as enterprise adoption requires integration, governance, and accountability beyond raw intelligence.
Explores the challenge of enforcing authorization when AI agents take real-world actions, questioning where security controls should be placed.
Anthropic's recent IPO filing and a safety paper advocating for pausing AI development expose the tension between commercial growth and safety commitments, raising questions about who holds the authority to slow or stop model training as the company goes public.
This paper proposes AgenticRei, a framework for runtime governance of LLM-driven agentic AI systems using deontic policies expressed in OWL, enabling obligations, dispensations, and conflict resolution beyond traditional policy engines.
A critique of poorly built automation systems created by so-called experts who ignore error handling, documentation, and governance, leaving clients with fragile workflows that fail in production.
Dean Ball announces he will join OpenAI as Head of Strategic Futures, a new team focusing on frontier AI policy and internal governance, while remaining a Nonresident Senior Fellow at the Foundation for American Innovation.
An analysis arguing that companies fail at AI because they focus on the model rather than the foundational layers—process design, governance, knowledge architecture, human judgment, and feedback loops—which are the true sources of value. The article cites Nadella's 'token capital' concept, Apple's model-swappable Siri, and survey data showing a wide gap between strategy and execution.
Google DeepMind introduces the AI Control Roadmap, a defense-in-depth framework for securing AI agents against risks from misalignment, calling for collaborative prioritization across AI labs, government, and academia.
The article discusses the problem of agent sprawl in teams using multiple AI agents with overlapping permissions and workflows. It proposes a basic control layer with owner, read/write systems, budget, stop rule, and four agent classes: readers, routers, operators, spenders.
This paper introduces the vulnerable world hypothesis, which posits that at some level of technological development, civilization is almost certain to be devastated unless robust global governance and policing mechanisms are in place. It analyzes historical and speculative vulnerabilities and argues for the need to balance technological progress with preventive measures.
This paper proposes a behavioral measure of trust between AI agents based on costly verification in a cooperative survival game, studying trust formation, breakage, and recovery across six frontier model snapshots. It finds that models differ in trust calibration and that persistent over-verification is associated with indecision rather than safety.
The author discusses the need for a fourth governance loop in self-improving AI agent systems to prevent objective drift, proposing periodic human review, withheld benchmarks, and rotating evaluators as practical controls.
An experiment with a local governance harness for AI coding agents shows that when the agent's own governance record is surfaced in its context, the agent begins to self-correct by following policies and asking for intent declarations, without hard enforcement.
SpaceX's record $75 billion IPO highlights its culture of extreme ownership and Elon Musk's dominant control, drawing both investor excitement and skepticism.
The author argues that as AI agents become more autonomous, a governance layer is needed for control, observability, and auditability, and introduces Bendex Arc as a solution with components like Arc Gate, Arc Replay, Arc Approve, and Arc Memory.
This paper demonstrates that content moderation systems can cause disproportionate harm to bridge users connecting separate communities, even when aggregate accuracy metrics appear satisfactory, with governance loss increasing under false-positive-heavy conditions.