Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols
Summary
This paper introduces an LLM-powered comparative pipeline for analyzing governance discourse in AI agent protocols, applying it to ERC-8004 and Google A2A to examine how institutional design shapes thematic priorities and community structure.
View Cached Full Text
Cached at: 06/26/26, 05:11 AM
# Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols
Source: [https://arxiv.org/html/2606.26203](https://arxiv.org/html/2606.26203)
andLuyao ZhangDuke Kunshan UniversityKunshanChina
\(2026\)
###### Abstract\.
As AI agent protocols proliferate, the governance structures shaping their interoperability standards remain empirically underexamined\. We introduce an LLM\-powered comparative pipeline for large\-scale governance discourse analysis, integrating automated annotation, neural topic modeling, and multi\-layer network analysis to study socio\-technical power structures at scale\. We validate it on two contrasting standards for agent interoperability: ERC\-8004 \(permissionless, on\-chain\) and Google A2A \(corporate\-led\)\. Analyzing 4,323 governance participation records, we combine LLM\-assisted coding, topic modeling, and multi\-layer network analysis to examine how institutional design shapes thematic priorities and community structure\. We find that while governance form influences substantive focus, both regimes exhibit comparable levels of participation inequality and community fragmentation\. Discourse alignment is denser in the permissionless setting, suggesting that open governance may foster greater thematic convergence despite decentralized participation\. These findings illustrate how LLM\-assisted methods can advance the empirical study of technology governance, with implications for designing more equitable agentic AI standards\. All data and code are openly available\.
DAO governance, consortium governance, AI agents, LLM research pipelines, blockchain
††conference:ACM Conference Name; Month DD–DD, YYYY; City, Country††journalyear:2026††isbn:978\-x\-xxxx\-xxxx\-x/YY/MM††doi:10\.1145/nnnnnnn\.nnnnnnn## 1\.Introduction
Interoperability is an inherited characteristic of decentralized applications\(Harvey and Rabetti,[2024](https://arxiv.org/html/2606.26203#bib.bib46)\)\. As AI agent protocols proliferate, the governance of their interoperability standards remains empirically underexamined\. Who controls the rules by which autonomous agents discover, negotiate, and coordinate across organizational boundaries? This question sits at the intersection of artificial intelligence and institutional design\(Evanset al\.,[2026](https://arxiv.org/html/2606.26203#bib.bib2); World Economic Forum and Accenture,[2026](https://arxiv.org/html/2606.26203#bib.bib5)\), yet the public discourse through which such standards are negotiated has been difficult to study at scale\. Prior work relies predominantly on manual coding with fixed categories, limiting both thematic discovery and structural analysis across large text corpora\.
We address this gap with an LLM\-powered comparative pipeline for large\-scale governance discourse analysis, integrating automated annotation, neural topic modeling, and multi\-layer network analysis\. We validate the pipeline on two standards: ERC\-8004 and Google A2A\. The two protocols address the same technical problem of AI agentic communication across systems, and their governance processes are both public on GitHub or Forums\. Therefore, comparing them isolates the effect of governance form on who participates and what they discuss\.
ERC\-8004, which belongs to Ethereum Improvement Proposals \(EIP\), defines a smart\-contract interface standard for on\-chain AI agent identity\. It “proposes to use blockchains to discover, choose, and interact with agents across organizational boundaries without pre\-existing trust”\(Ethereum Magicians,[2025](https://arxiv.org/html/2606.26203#bib.bib12)\)\. Its lifecycle follows the stages defined in EIP\-1, and is advanced by rough consensus among forum participants and EIP editors, with no formal votes and permissionless implementations\(Ethereum Foundation,[2015](https://arxiv.org/html/2606.26203#bib.bib13)\)\. Google A2A introduces “An open protocol enabling communication and interoperability between opaque agentic applications”\(A2A Authors,[2025a](https://arxiv.org/html/2606.26203#bib.bib14)\)\. It was initiated by Google and donated to the Linux Foundation in June 2025, now governed by an eight\-seat Technical Steering Committee \(TSC\) composed entirely of corporate representatives\(A2A Authors,[2025b](https://arxiv.org/html/2606.26203#bib.bib15); Google Cloud,[2025](https://arxiv.org/html/2606.26203#bib.bib16)\)\.
Comparing the two cases, we ask:
RQ: Compared to corporate hierarchy, does the governance structure of a permissionless DAO really achieve a higher degree of decentralization?
We decompose this into three sub\-questions:
- •RQ1 \(Decision Architecture\):How do the formal decision procedures, entry rights, and authority structures of the two regimes differ?
- •RQ2 \(Discourse Composition\):How does governance form shape the topical and argumentative composition of participation discourse?
- •RQ3 \(Relational Networks\):How does governance form shape co\-participation ties, discourse\-level consensus and conflict structures, and actor–topic divisions of collaborative labor?
To answer these questions, we first reconstruct the two governance mechanisms from their official documents and visualize them as decision\-flow diagrams\. We then collect 4,323 governance participation records from their public repositories and apply an LLM\-assisted annotation pipeline to label each record’s argumentative function, stance, and stakeholder affiliation\(Carlson and Burbano,[2026](https://arxiv.org/html/2606.26203#bib.bib17)\)\. On top of this annotation, we run two topic\-discovery methods: unsupervised BERTopic\(Grootendorst,[2022](https://arxiv.org/html/2606.26203#bib.bib18)\), and LLM\-inductive Thematic\-LM\(Qiaoet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib19)\)\. Moreover, we run three network analyses: co\-participation social network analysis \(SNA\)\(Aoet al\.,[2023](https://arxiv.org/html/2606.26203#bib.bib20)\), discourse network analysis\(Leifeld,[2013](https://arxiv.org/html/2606.26203#bib.bib21)\), and socio\-semantic bipartite networks\(Roth and Cointet,[2010](https://arxiv.org/html/2606.26203#bib.bib22)\)\. The methodological triad mirrors the three sub\-questions\.
Three findings emerge\. First, the two cases instantiate structurally opposite decision architectures: ERC\-8004 advances by rough consensus with permissionless deployment, while A2A vests binding authority in a corporate technical steering committee\. Second, governance form shapes the thematic focus of deliberation: DAO governance concentrates discourse on security mechanisms and protocol principles, whereas corporate governance distributes deliberation across engineering\-execution workstreams\. Third, both networks exhibit comparable levels of participation inequality and community fragmentation; discourse congruence is nonetheless denser in ERC\-8004, reflecting tighter within\-community consensus formation\.
This paper makes three contributions\. Methodologically, the architecture can generalize to any large\-scale governance discourse analysis, offering a computational lens on the black box of public text negotiation, communication and coordination\. Empirically, it provides the first matched\-case, multi\-method comparison of DAO and corporate governance of agentic standardization with text corpora at scale\. Theoretically, it pioneers the fundamental problem of who controls the future AI agentic infrastructure, with the interdisciplinary standing of institutional design and AI\.
## 2\.Related Work
Three streams of literature converge on this study\. For a more specific comparison, see Appendix[H](https://arxiv.org/html/2606.26203#A8)\.
Decentralized governance via blockchain\.First, Harvey provides a comprehensive overview of the technologies, theories, and practices inherent to decentralized finance \(DeFi\) and blockchain technology\(Harveyet al\.,[2021](https://arxiv.org/html/2606.26203#bib.bib29)\)\. Beck et al\.\(Becket al\.,[2018](https://arxiv.org/html/2606.26203#bib.bib30)\)provided the foundational Information System framework mapping blockchain governance along decision rights, accountability and incentives dimensions\. Ziolkowski et al\.\(Ziolkowskiet al\.,[2020](https://arxiv.org/html/2606.26203#bib.bib31)\)identified six major governance challenges specific to the blockchain system, and half of them are unique to this technology\. Kiayias and Lazos\(Kiayias and Lazos,[2023](https://arxiv.org/html/2606.26203#bib.bib32)\)systematised the field with a SoK review of blockchain governance mechanisms\. More recently, Ellinger et al\.\(Ellingeret al\.,[2024](https://arxiv.org/html/2606.26203#bib.bib8)\)explored the balanced polycentric governance of digital commons\. Reineke et al\.\(Reinekeet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib9)\)developed an integrative theoretical framework, showing the evolution of decentralization concepts\. Sunyaev et al\.\(Sunyaevet al\.,[2026](https://arxiv.org/html/2606.26203#bib.bib28)\)called for purposeful, design\-oriented decentralization rather than ending at ideology\. Motea and Oba\(Motea and Oba,[2026](https://arxiv.org/html/2606.26203#bib.bib33)\)interrogated the democratic legitimacy of blockchain governance structures\.
Governance of decentralized versus corporate structure\.Murray et al\.\(Murrayet al\.,[2021](https://arxiv.org/html/2606.26203#bib.bib34)\)examined how smart contracts and DAOs alter agency costs in corporate contracting, providing leading opinions\. Lumineau et al\.\(Lumineauet al\.,[2021](https://arxiv.org/html/2606.26203#bib.bib35)\)highlights tacitness of transactions and social implications of this technology\. Rahman et al\.\(Rahmanet al\.,[2024](https://arxiv.org/html/2606.26203#bib.bib36)\)and Hunt et al\.\(Huntet al\.,[2024](https://arxiv.org/html/2606.26203#bib.bib37)\)analyzed the dynamics and power accumulation of platforms\. Hui and Tucker\(Hui and Tucker,[2025](https://arxiv.org/html/2606.26203#bib.bib38)\)address AI along with a decentralized ecosystem, proposing an innovative governance framework\. However, these studies rely on theoretical or interview\-based methods; none of them use governance\-participation data or computational methods to test structural difference between DAO and corporate governance in the same domain\.
Computational studies of cooperative work\.Researchers have long examined coordination in open, online communities\. Mockus et al\.\(Mockuset al\.,[2002](https://arxiv.org/html/2606.26203#bib.bib39)\)documented participation inequality in Apache and Mozilla\. Im et al\.\(Imet al\.,[2018](https://arxiv.org/html/2606.26203#bib.bib23)\)analyzed Wikipedia’s Requests for Comments \(RfC\), a deliberation mechanism structurally analogous to EIP rough consensus, and revealed a persistent imbalance between deliberation and resolution\. Germonprez et al\.\(Germonprezet al\.,[2018](https://arxiv.org/html/2606.26203#bib.bib26)\)catalogued structural realities of contemporary open\-source projects, including pervasive corporate engagement; Li et al\.\(Liet al\.,[2021](https://arxiv.org/html/2606.26203#bib.bib25)\)examined code\-of\-conduct conversations on GitHub as a window into informal governance norms in open\-source repositories\. Kulakowski and Frasincar\(Kulakowski and Frasincar,[2023](https://arxiv.org/html/2606.26203#bib.bib41)\)introduced CryptoBERT, a domain\-adapted BERT variant pre\-trained on 3\.2 million cryptocurrency social\-media posts, establishing a specialized embedding backbone for blockchain\-native corpora\. More recently, Wu et al\.\(Quanet al\.,[2024](https://arxiv.org/html/2606.26203#bib.bib42)\)applied sentiment and discourse analysis to six DAO forums; Stine and Agarwal\(Stine and Agarwal,[2020](https://arxiv.org/html/2606.26203#bib.bib24)\)proposed comparative discourse analysis via topic models; Qiao et al\.\(Qiaoet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib19)\)introduced Thematic\-LM for LLM\-assisted inductive thematic analysis of large corpora; Ao et al\.\(Aoet al\.,[2023](https://arxiv.org/html/2606.26203#bib.bib20)\)used social network analysis on on\-chain Aave data to show voting\-power concentration; Leifeld\(Leifeld,[2013](https://arxiv.org/html/2606.26203#bib.bib21)\)proposed discourse network analysis concentrating on participants’ stance; Roth and Cointet\(Roth and Cointet,[2010](https://arxiv.org/html/2606.26203#bib.bib22)\)connects semantic analysis with social topology\. Wang el al\.\(Wanget al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib43)\)found the threat of power concentration via scaled empirical analysis of scaleSnapShotdata\. Özdemir Sönmez et al\.\(Jungnickelet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib10)\)quantified voting power concentration and participation apathy across DAO governance models, especially in token\-based organizations\. Chen et al\.\(Chenet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib44)\)applied a quasi\-experimental PSM\-DID design to 98,000 Steemit users, finding that governance token ownership boosts curation effort but reduces creation novelty\.
## 3\.Methodology
We selected MiniMax\-M2\.5\(MiniMax,[2026](https://arxiv.org/html/2606.26203#bib.bib48)\)as the LLM backbone for its reasoning capability and low cost\. The model is trained on complex real\-world environments and achieves 80\.2% on SWE\-Bench Verified; frontier models of this capability tier have been shown to complete complex multi\-turn analytical tasks with high reliability\(Maet al\.,[2024](https://arxiv.org/html/2606.26203#bib.bib1)\), making them suitable for assigning nuanced governance\-process labels such asArgument TypeandConsensus Signal\. For design of the comparative case, see Appendix[B](https://arxiv.org/html/2606.26203#A2)\.
### 3\.1\.Data Collection and LLM Annotation
We collect governance participation records from the public repositories\. For ERC\-8004, data originate from two sources: \(1\) 113 posts from the Ethereum Magician forum thread\(Ethereum Magicians,[2025](https://arxiv.org/html/2606.26203#bib.bib12)\); and \(2\) 36 GitHub records from the nine pull requests that directly modifiedERCS/erc\-8004\.md\(ERC\-8004 Authors,[2026](https://arxiv.org/html/2606.26203#bib.bib47)\)\. For Google A2A, data originate from three streams in thea2aproject/A2Arepository\(A2A Authors,[2025a](https://arxiv.org/html/2606.26203#bib.bib14)\): \(1\) 3,104 issue and issue\-comment records, \(2\) 1,955 pull requests and review\-comment records, and \(3\) 822 GitHub Discussion records\.
The raw records totaled 6,030\. We removed the following data: records with fewer than 20 characters of body text, which primarily are CI notifications, merge\-conflict markers, and bot\-generated status messages; and records attributed to verified bot accounts\. After such filtering, 4,323 records are retained \(ERC\-8004: 142; Google A2A: 4,181\)\. SHA\-256 checksums for all raw data fields are provided in the authors’ GitHub repository111https://github\.com/kl41r3/erc8004\-a2a\-case\-study\. See also Appendix[C](https://arxiv.org/html/2606.26203#A3)\.
The retained records was annotated with four categorical fields:
- •Stakeholder Institution: Google / MetaMask / Ethereum Foundation / Coinbase / Independent / Unknown
- •Argument Type: Technical / Governance\-Principle / Economic / Process / Off\-topic
- •Stance: Support / Oppose / Modify / Neutral / Off\-topic
- •Consensus Signal: Adopted / Rejected / Pending / N/A
The institution affiliations of the top 109 contributors were also manually reviewed\. The cascade is detailed in Appendix[F](https://arxiv.org/html/2606.26203#A6)\.
### 3\.2\.Discourse Composition Analysis
To characterize the topical and argumentative composition of the two governance discourses, we apply three methods of increasing inductive depth\.
#### 3\.2\.1\.Supervised argument typing
Treating the LLM\-assignedargument\_typeas the record’s communicative function, we test cross\-case independence with a chi\-square test\(Cohen,[1988](https://arxiv.org/html/2606.26203#bib.bib49)\)and within\-ERC\-8004 temporal change across three consecutive two\-month phases spanning submission to mainnet deployment \(phase boundaries in Appendix[D](https://arxiv.org/html/2606.26203#A4)\)\.
#### 3\.2\.2\.BERTopic comparative discourse analysis
Following Stine and Agarwal\(Stine and Agarwal,[2020](https://arxiv.org/html/2606.26203#bib.bib24)\), we fit BERTopic\(Grootendorst,[2022](https://arxiv.org/html/2606.26203#bib.bib18)\)jointly on the combined corpus\. Texts are embedded withall\-MiniLM\-L6\-v2\(Reimers and Gurevych,[2019](https://arxiv.org/html/2606.26203#bib.bib50)\), reduced via UMAP \(nneighbors=15n\_\{\\text\{neighbors\}\}=15, cosine, seed=42=42\), and clustered with HDBSCAN \(min cluster size1010\) intoK=19K=19topics plus a noise class\. For each topicttand casecc, the within\-case share is
pt\(c\)=nt\(c\)/n\(c\)\.p\_\{t\}^\{\(c\)\}=n\_\{t\}^\{\(c\)\}/n^\{\(c\)\}\.Cross\-case divergence is measured by Jensen–Shannon divergence\(Lin,[1991](https://arxiv.org/html/2606.26203#bib.bib51)\):
JSD\(p,q\)=12KL\(p,m\)\+12KL\(q,m\)\\mathrm\{JSD\}\(p,q\)=\\tfrac\{1\}\{2\}\\mathrm\{KL\}\(p,m\)\+\\tfrac\{1\}\{2\}\\mathrm\{KL\}\(q,m\)with
KL\(p,m\)=∑ip\(i\)logp\(i\)m\(i\),m=12\(p\+q\),\\mathrm\{KL\}\(p,m\)=\\sum\_\{i\}p\(i\)\\log\\frac\{p\(i\)\}\{m\(i\)\},\\qquad m=\\tfrac\{1\}\{2\}\(p\+q\),andJSD=0\\mathrm\{JSD\}\\\!=\\\!0denotes identical distributions,JSD=1\\mathrm\{JSD\}\\\!=\\\!1disjoint\.
To assess whether ERC\-8004’s topic concentration is an artefact of a general\-purpose embedding model rather than a genuine structural property, we also embed the 142 ERC\-8004 records with CryptoBERT, a model further pre\-trained on cryptocurrency social\-media posts \(StockTwits, Reddit, Telegram, Twitter\)\(Kulakowski and Frasincar,[2023](https://arxiv.org/html/2606.26203#bib.bib41)\)\.
#### 3\.2\.3\.Thematic\-LM inductive themes
To complement the embedding based view with human\-interpretable labels, we apply the method of Thematic\-LM\(Qiaoet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib19)\), a four\-stage multi\-agent pipeline: \(1\)*open coding*assigns a short code to each record; \(2\)*aggregation*groups 300 sampled codes into 14 raw clusters; \(3\)*codebook review*merges and refines to 19 themes \(T01–T19\); \(4\)*theme assignment*labels every record with its best\-fitting theme, orUnclassifiedwhen confidence is insufficient\. We again report per\-theme shares and JSD between cases\.
### 3\.3\.Relational Network Analysis
We construct three complementary networks for each case, each layer adding discursive information to the previous\. All graphs use actors as nodes; what changes is the semantic content of an edge\.
#### 3\.3\.1\.Co\-participation network \(SNA\)
Following Ao et al\.\(Aoet al\.,[2023](https://arxiv.org/html/2606.26203#bib.bib20)\), an undirected edge connects two contributors who both posted to the same discussion thread \(forum topic, GitHub issue, pull request, or discussion\); edge multiplicity counts co\-occurrences\. We report six structural measures:
- •Density: the realized fraction of possible ties\. Computed by ρ=2EN\(N−1\)\\rho=\\frac\{2E\}\{N\(N\-1\)\}whereEEdenotes the number of edges andNNdenotes the number of nodes\.
- •Degree Gini: inequality of interaction counts \(did\_\{i\}is node degree\)\. Computed by G=∑i,j\|di−dj\|2N∑idiG=\\frac\{\\sum\_\{i,j\}\|d\_\{i\}\-d\_\{j\}\|\}\{2N\\sum\_\{i\}d\_\{i\}\}wheredid\_\{i\}is the number of edges of nodeii\.G=0G=0implies perfectly equal participation;G→1G\\to 1implies that interactions are concentrated in a small elite\.
- •Components and giant\-component ratio: fragmentation into disconnected threads\. Computed by GCR=Nmax/N\.\\mathrm\{GCR\}=N\_\{\\max\}/N\.
- •Newman–Girvan modularity: reported for institution partitions and Louvain partitions\(Blondelet al\.,[2008](https://arxiv.org/html/2606.26203#bib.bib52)\)\(Newman,[2006](https://arxiv.org/html/2606.26203#bib.bib53)\)\. Computed by Q=12m∑i,j\[Aij−kikj2m\]δ\(ci,cj\)Q=\\frac\{1\}\{2m\}\\sum\_\{i,j\}\\left\[A\_\{ij\}\-\\frac\{k\_\{i\}k\_\{j\}\}\{2m\}\\right\]\\delta\(c\_\{i\},c\_\{j\}\)wheremmis the total number of edges,AijA\_\{ij\}is the adjacency matrix entry,kik\_\{i\}is the degree of nodeii, andδ\(ci,cj\)=1\\delta\(c\_\{i\},c\_\{j\}\)=1if nodesiiandjjbelong to the same community, 0 otherwise\.
- •Core–periphery: Borgatti–Everett coreness\(Borgatti and Everett,[2000](https://arxiv.org/html/2606.26203#bib.bib54)\)\. Computed by ρBE=corr\(A,Δ\),Δij=δi⋅δj\\rho\_\{\\mathrm\{BE\}\}=\\mathrm\{corr\}\(A,\\,\\Delta\),\\qquad\\Delta\_\{ij\}=\\delta\_\{i\}\\cdot\\delta\_\{j\}whereδi∈\{0,1\}\\delta\_\{i\}\\in\\\{0,1\\\}is the coreness label of nodeii\. Statistical significance is evaluated by comparing the observedρBE\\rho\_\{\\mathrm\{BE\}\}against a null distribution of random graphs that preserve the empirical degree sequence \(configuration model\), following theqq\-sstest of Kojaku and Masuda\(Kojaku and Masuda,[2018](https://arxiv.org/html/2606.26203#bib.bib55)\)\.
- •Betweenness centrality and network efficiency: identifies governance brokers—actors who mediate between otherwise disconnected groups: b\(v\)=∑s≠v≠tσst\(v\)σstb\(v\)=\\sum\_\{s\\neq v\\neq t\}\\frac\{\\sigma\_\{st\}\(v\)\}\{\\sigma\_\{st\}\}whereσst\\sigma\_\{st\}is the total number of shortestss–ttpaths andσst\(v\)\\sigma\_\{st\}\(v\)the number that pass throughvv\.
- •Network efficiency: mean normalized harmonic centrality, measured by h¯=1n\(n−1\)∑u≠vd\(u,v\)−1\\bar\{h\}=\\frac\{1\}\{n\(n\-1\)\}\\sum\_\{u\\neq v\}d\(u,v\)^\{\-1\}whered\(u,v\)−1=0d\(u,v\)^\{\-1\}=0for unreachable pairs\(Latora and Marchiori,[2001](https://arxiv.org/html/2606.26203#bib.bib56)\)\.
#### 3\.3\.2\.Discourse Network Analysis \(DNA\)
Following Leifeld\(Leifeld,[2013](https://arxiv.org/html/2606.26203#bib.bib21)\), we construct the co\-participation layer with stance\-aware edges that capture shared discursive positions\. LetAAbe the actor set andT=\{T01,…,T19\}T=\\\{T\_\{01\},\\ldots,T\_\{19\}\\\}the Thematic\-LM codebook\. For each record we encode stance into Support=\+1=\+1, Modify=\+0\.5=\+0\.5, Neutral=0=0, Oppose=−1=\-1and obtain an actor–theme stance matrixMMsimilar to the pseudo\-matrix below:
Two networks are projected fromMM: the*congruence network*G\+G^\{\+\}connects actors who take same\-sign stances on at least one shared theme \(Alice and Bob agree on Theme A\), and the*conflict network*G−G^\{\-\}connects actors who take strictly opposite\-sign stances \(Alice and Bob conflict on Theme B\); edge weights count the number of themes satisfying each criterion\. We report density, Louvain modularity and betweenness centralityb\(v\)b\(v\)of the top\-5 discourse brokers\.
#### 3\.3\.3\.Socio\-semantic bipartite network
Following Roth and Cointet\(Roth and Cointet,[2010](https://arxiv.org/html/2606.26203#bib.bib22)\), we construct a two\-mode \(bipartite\) networkℬ=\(A∪T,E\)\\mathcal\{B\}=\(A\\cup T,\\,E\), whereAAis the set of actors,TTthe set of Thematic\-LM themes, and an edge\(a,t\)∈E\(a,t\)\\in Eexists whenever actoraaauthored at least one record assigned to themett\. The weightBatB\_\{at\}equals the number of such records, forming a non\-negative integer matrixB∈ℤ≥0\|A\|×\|T\|B\\in\\mathbb\{Z\}\_\{\\geq 0\}^\{\|A\|\\times\|T\|\}\.
Two one\-mode projections are derived fromBB\. The*actor–actor projection*WA=BB⊤W^\{A\}=BB^\{\\top\}links any two actors by the number of themes they co\-discussed \(self\-loops removed\); the*theme–theme projection*WT=B⊤BW^\{T\}=B^\{\\top\}Blinks any two themes by the number of actors who engaged in both\.
Per\-actor topic diversity is measured by Shannon entropy:
H\(a\)=−∑t=1\|T\|p^atlog2p^at,p^at=Bat∑t′Bat′,H\(a\)=\-\\sum\_\{t=1\}^\{\|T\|\}\\hat\{p\}\_\{at\}\\log\_\{2\}\\hat\{p\}\_\{at\},\\qquad\\hat\{p\}\_\{at\}=\\frac\{B\_\{at\}\}\{\\textstyle\\sum\_\{t^\{\\prime\}\}B\_\{at^\{\\prime\}\},\}whereH\(a\)=0H\(a\)=0for a pure specialist \(all posts in one theme\) andH\(a\)=log2\|T\|H\(a\)=\\log\_\{2\}\|T\|for a perfect generalist \(uniform spread across all\|T\|\|T\|themes\)\.
We further characterize the distribution of\{H\(a\)\}a∈A\\\{H\(a\)\\\}\_\{a\\in A\}via its Gini coefficient\. The higher the Gini, the more thematic breadth is concentrated in a few actors\. We also measure per\-theme actor concentration as the Gini of the column vector\{Bat\}a∈A\\\{B\_\{at\}\\\}\_\{a\\in A\}for eachtt\.
The thematic overlap coefficient is measured by
Ω=\|T1∩T2\|min\(\|T1\|,\|T2\|\),\\Omega=\\frac\{\|T\_\{1\}\\cap T\_\{2\}\|\}\{\\min\(\|T\_\{1\}\|,\|T\_\{2\}\|\)\},whereTcT\_\{c\}denotes the active theme set of casec∈\{1,2\}c\\in\\\{1,2\\\}, quantifies cross\-case thematic alignment;Ω=1\\Omega=1means one community’s thematic space is a subset of the other’s\.
Actor\-set sizes across the three filtering stages are detailed in Appendix[G](https://arxiv.org/html/2606.26203#A7)\.
## 4\.Results
### 4\.1\.Decision Architectures
Figure 1\.Left: ERC\-8004 governance decision flow\. Right: A2A governance structure and decision flow\.Two\-panel flow diagram\. Left panel: ERC\-8004 lifecycle stages \(Idea, Draft, Review, Last Call, Final/Living/Stagnant/Withdrawn\) with arrows showing progression via rough consensus and EIP editor review; a dashed arrow indicates decoupled mainnet deployment\. Right panel: A2A governance hierarchy showing Google/Linux Foundation stewardship, an eight\-seat TSC with one seat per corporate member, and two decision paths: lazy consensus for routine PRs and GitVote for contested changes\.ERC\-8004 and Google A2A represent contrasting governance arche\-types, as detailed in figure[1](https://arxiv.org/html/2606.26203#S4.F1)\. ERC\-8004 remains at the stage of “draft” when the data was fetched \(Mar 2026\), nevertheless, its three canonical registry contracts were deployed on Ethereum mainnet on January 29, 2026\(Etherscan,[2026](https://arxiv.org/html/2606.26203#bib.bib58)\)\. Proposals are discussed through the Ethereum Magicians forum and are recorded on GitHub\. Ownership of Google A2A transitioned from Google\-controlled to Linux Foundation governance, and Contested decisions result in a GitVote\(a2aproject,[2025](https://arxiv.org/html/2606.26203#bib.bib59)\)\. For more details and pseudocode, see Appendix[E](https://arxiv.org/html/2606.26203#A5)\.
### 4\.2\.Discourse Composition
#### 4\.2\.1\.Supervised argument typing
Figure 2\.Left: argument\-type distribution of both cases\. Right: ERC\-8004 argument\-type distribution across three two\-month phases\. Technical arguments dominate in both regimes, but corporate governance carries roughly double the procedural coordination burden\.Two\-panel figure\. Left panel shows pie charts comparing argument\-type categories \(Technical, Process, Governance\-Principle, Economic, Off\-topic\) between ERC\-8004 and Google A2A\. ERC\-8004: 74\.3% Technical, 13\.9% Process, 5\.4% Governance\-Principle, 1\.4% Economic, 5\.0% Off\-topic\. Google A2A: 62\.1% Technical, 25\.4% Process, 2\.7% Governance\-Principle, 0\.7% Economic, 9\.0% Off\-topic\. Right panel is a stacked bar chart for ERC\-8004 across Phase 1 \(Aug–Oct 2025, $n=100$\), Phase 2 \(Oct–Dec 2025, $n=11$\), and Phase 3 \(Dec 2025–Feb 2026, $n=17$\)\. Technical arguments dominate Phases 1–2 \($\\geq$80%\); Process discussion surges to 53% in Phase 3\.Figure 3\.Stance×\\timesargument\-type cross\-tabulation \(% within stance row\)\.Two side\-by\-side heatmaps, one for ERC\-8004 \(left\) and one for Google A2A \(right\)\. Rows represent stance categories \(Support, Modify, Neutral, Oppose, Off\-topic\); columns represent argument types \(Technical, Process, Governance\-Principle, Economic, Off\-topic\)\. Cell shading encodes percentage within each stance row\. Both cases are Technical\-dominated under Support and Modify; the Neutral row in A2A shows a markedly higher Process share \(44%\) compared to ERC\-8004 \(26%\), and Governance\-Principle arguments are more prominent under Oppose in ERC\-8004\.Figure[2](https://arxiv.org/html/2606.26203#S4.F2)compares the LLM\-assigned argument type distributions\. Both communities are predominantly Technical \(74\.3% in ERC\-8004 vs\. 62\.1% in A2A\), but A2A devotes almost twice the share to Process arguments \(25\.4% vs\. 13\.9%\)\. The cross\-case difference is significant with a small effect size \(χ2\(3\)=52\.88\\chi^\{2\}\(3\)=52\.88,p<\.001p<\.001, Cramér’sV=\.103V=\.103\): both cases remain technically grounded, but corporate governance carries substantively heavier coordination overhead\.
Within ERC\-8004, argument composition shifts significantly across three two\-month phases \(χ2\(6\)=25\.32\\chi^\{2\}\(6\)=25\.32,p<\.001p<\.001,V=\.315V=\.315\)\. Technical arguments dominate Phases 1–2 \(≥\\geq80%\); in Phase 3 Process discussion surges to 53% as deliberation moves from substantive design to editorial ratification\. This two\-stage pattern is consistent with rough\-consensus norms in open standards bodies\. Small\-nncaution applies to Phases 2–3 \(n=11n=11andn=17n=17\)\.
Figure[3](https://arxiv.org/html/2606.26203#S4.F3)cross\-tabulates stance with argument type\. Support and Modify stances are dominated by Technical arguments in both cases \(ERC Support 73%; A2A Support 87%\): substantive engagement is predominantly technical regardless of governance form\. Governance\-Principle arguments, particularly underOppose, are more visible in ERC\-8004, reflecting the principled deliberation characteristic of the EIP process\. The sharpest cross\-case contrast appears in theNeutralrow: 44% of A2A Neutral records are Process in nature, against 26% in ERC\-8004—corporate governance generates procedural commentary even among non\-committal participants\.
#### 4\.2\.2\.BERTopic
Figure 4\.BERTopic cross\-case divergence\. ERC\-8004 concentrates on foundational agent architecture, while A2A distributes deliberation across engineering\-execution workstreams\.Horizontal diverging bar chart with 20 BERTopic topics \(19 named topics plus a noise class\) sorted by the absolute difference in within\-case share between ERC\-8004 and Google A2A\. Blue bars extend left to indicate ERC\-8004\-dominant topics \(e\.g\., Topic 0: broad agent discourse\); orange bars extend right for A2A\-dominant topics \(e\.g\., Task/Message management, JSON/proto spec, PR\-contribution workflows\)\. The chart highlights moderate but meaningful structural separation at global JSD$=0\.288$\.BERTopic identifies 19 topics plus a noise class over the combined corpus\. The global Jensen–Shannon divergence between the two cases’ topic distributions isJSDBERTopic=0\.288\\mathrm\{JSD\}\_\{\\text\{BERTopic\}\}=0\.288, a moderate but meaningful structural separation \(Figure[4](https://arxiv.org/html/2606.26203#S4.F4)\)\. ERC\-8004 is strikingly concentrated: 67\.6% of its records fall into Topic 0 \(agent, agents, for, of\), the broad agent\-discourse cluster, versus 21\.2% for A2A\. Implementation\-heavy topics—Task/Message management \(8\.2%\), JSON/proto spec \(7\.7%\), PR\-contribution workflows \(7\.2%\), SDK samples \(5\.9%\)—carry substantial A2A weight but zero ERC\-8004 records, confirming that low\-level engineering is absent from the EIP forum\. The only topic with a higher ERC share than A2A is Topic 9 \(suggestion, sap, linkedin; 2\.8% vs\. 0\.9%\), signalling corporate voices \(SAP, LinkedIn\) entering the EIP thread\.
CryptoBERT on ERC data shows more precise results\. It divides the ”agent” topic to T0 \(Onchain, Reputation\) and T2 \(Trust, Feedback\), and the total percentage is 73\.2%, which aligns with the 67\.6% of pure BERTopic \(figure[5](https://arxiv.org/html/2606.26203#S4.F5)\)\.
Figure 5\.CryptoBERT topic frequency for ERC\-8004\.Horizontal bar chart showing CryptoBERT topic frequency distribution for ERC\-8004 records\.
#### 4\.2\.3\.Thematic\-LM
The Thematic\-LM pipeline yields a 19\-theme codebook \(T01–T19\)\. Figure[6](https://arxiv.org/html/2606.26203#S4.F6)overlays per\-case record share \(diverging bars\) with per\-theme actor participation rate \(lines with markers\)\. Table[1](https://arxiv.org/html/2606.26203#S4.T1)reports the six themes with the largest cross\-case divergence\. The global Jensen–Shannon divergence isJSD=0\.216\\mathrm\{JSD\}=0\.216\.
The most discriminating theme isT08 Trust & Security Mechanisms\. It covers agent\-trust scoring, reputation schemes and on\-chain credential verification—the foundational problem of establishing trustworthy agency\. By contrast, A2A spreads deliberation across T06 Documentation & Examples, T18 Clarifications & Information Requests, T07 Community Collaboration & Contributions, and T01 Protocol Specification & Versioning\. A2A additionally covers three themes: T09 \(Transport & Protocol Mechanisms\), T14 \(Project Governance & Process\), and T16 \(Streaming & Real\-time Communication\)\. T09 and T16 are purely engineering\-execution concerns entirely absent from the EIP forum; T14’s ERC\-8004 records were authored by automated review bots and thus fall outside the actor network\.
The two methods agree on the direction and approximate magnitude of divergence, with BERTopic’sJSD=0\.288\\mathrm\{JSD\}=0\.288slightly exceeding Thematic\-LM’sJSD=0\.216\\mathrm\{JSD\}=0\.216\. The gap is expected: Thematic\-LM compresses divergence by absorbing related concerns into shared conceptual themes, while BERTopic retains finer\-grained lexical distinctions \(ERC’s “trustless”, “reputation” versus A2A’s “samples”, “proto”\)\. Together the two estimates bracket the structural divergence and confirm its robustness: the two governance processes occupy meaningfully different but overlapping discourse spaces, with ERC\-8004 concentrated on the constitutive layer of the design space \(“what to build and why”\) and A2A on executive ones \(how to build, document, and ship it\)\.
Table 1\.Thematic\-LM themes with largest cross\-case divergence\. The result is aligned with figure[4](https://arxiv.org/html/2606.26203#S4.F4)\.Figure 6\.Thematic\-LM overlaid chart\. Bars show per\-case record share; lines show actor participation rate\. The DAO concentrates deliberation on security and trust; the corporate project distributes effort across documentation, tooling, and engineering\-execution themes\.Overlaid diverging bar and line chart\. Bars show record share diverging from center: ERC\-8004 bars extend right, A2A bars extend left\. Lines with diamond \(ERC\) and square \(A2A\) markers show actor participation rates, overlaid on the same axes\. Theme rows sorted by ERC minus A2A share difference\. Nineteen themes with full codebook labels on the y\-axis\.
### 4\.3\.Relational Networks
#### 4\.3\.1\.Co\-participation network \(SNA\): core and periphery
Table[2](https://arxiv.org/html/2606.26203#S4.T2)reports structural metrics for the co\-participation networks \(ERC\-8004:N=67N=67; Google A2A:N=771N=771\)\. For network visualization, see[7](https://arxiv.org/html/2606.26203#S4.F7)\.
Figure 7\.Your network diagram caption here\.Both communities exhibit a high degree of inequality\. The Gini coefficient of degree is 0\.804 for ERC\-8004 and 0\.779 for A2A, indicating that in both cases, a small elite drives the majority of interactions\. The top\-3 contributors account for 32\.3% of all ERC\-8004 interactions and 14\.9% of A2A interactions\. In A2A, two out of three of Google employees, and another is from Microsoft\. In ERC\-8004, the three contributors are: MarcoMetaMask \(Marco De Rossi\), who obviously by name is fromMetamask; spengrah \(Spencer Graham\), cofounder ofHats Protocol; and pcarranzav \(Pablo Carranza Vélez\), who is fromThe Graph, a decentralized protocol for indexing and querying blockchain data\. Their formal organizational mandate is indeterminate from public records, but their persistent participation accrues reputational standing\.
Both networks are structurally fragmented, and neither shows statistically significant core\-periphery structure\. Louvain community detection recovers 46 communities for ERC\-8004 and 358 for A2A, closely matching the component counts and confirming that participation organizes around parallel threads rather than a coherent deliberative body\. Institution labels also do not predict interaction structure in either case\. The giant component covers 32\.8% of ERC\-8004 nodes and 53\.4% of A2A nodes; 333 of 771 A2A participants \(43\.2%\) are complete isolates with no co\-participation edges\. Density indicates that the vast majority of possible co\-participation pairs are unrealized\. Under the Borgatti\-Everett test\(Borgatti and Everett,[2000](https://arxiv.org/html/2606.26203#bib.bib54)\), ERC\-8004 returnsp=\.095p=\.095and A2A returnsp=1\.000p=1\.000; neither crosses theα=\.05\\alpha=\.05threshold\.
Table 2\.Governance Network Structural Metrics\. Despite structurally opposite decision architectures, both networks exhibit comparably high participation inequality: a small elite drives most interactions regardless of governance form\.
#### 4\.3\.2\.Discourse network: congruence and conflict
The stance layer \(Table[3](https://arxiv.org/html/2606.26203#S4.T3)\) reveals structures that are invisible in the co\-participation graph\.
ERC\-8004 achieves denser within\-community consensus, while A2A generates far greater absolute conflict volume\. Congruence density is0\.1480\.148for ERC\-8004 versus0\.0820\.082for A2A: within the tighter EIP community, participants more often share positions\. Conflict edges in A2A are 34×\\timeslarger in absolute count \(2,531 vs\. 74\), which is not only an effect of actor numbers, but also from the broader range of technical positions supported by a multi\-vendor engineering project\.
The top\-3 betweenness share in the congruence network drops to34\.5%34\.5\\%for ERC\-8004 and12\.2%12\.2\\%for A2A, compared with70\.8%70\.8\\%and48\.5%48\.5\\%in the co\-participation network \(Table[2](https://arxiv.org/html/2606.26203#S4.T2)\)\. Discourse brokerage is more distributed than structural interaction brokerage in both cases; yet ERC\-8004’s congruence network remains markedly more concentrated, consistent with a compact EIP core mediating contested positions on trust and protocol security\.
Table 3\.Discourse Network Analysis metrics\. The DAO achieves denser within\-community agreement; the corporate regime generates a much larger volume of conflict edges, reflecting the broader technical surface of a multi\-vendor engineering project\.
#### 4\.3\.3\.Socio\-semantic bipartite network: thematic concentration of discursive labor
The actor–theme bipartite layer \(Table[4](https://arxiv.org/html/2606.26203#S4.T4), Figure[8](https://arxiv.org/html/2606.26203#S4.F8)\) reveals where each governance community directs its deliberative effort\. Peripheral participation is the structural baseline in both cases: median actor Shannon entropyH=0H=0in both ERC\-8004 and A2A, meaning the majority of contributors engage a single theme across their entire record\(Mockuset al\.,[2002](https://arxiv.org/html/2606.26203#bib.bib39)\)\. This is not a governance\-specific feature but an intrinsic property of large\-scale deliberative systems\.
Figure 8\.Per\-actor Shannon entropy over Thematic\-LM themes\.H=0H=0marks pure specialists; theH=0H=0bin dominates both distributions\.Density curve plot showing the distribution of per\-actor Shannon entropy $H$ over Thematic\-LM themes for ERC\-8004 \(blue, $N=66$\) and Google A2A \(red, $N=710$\)\. Both distributions are strongly right\-skewed with a dominant spike at $H=0$ \(pure specialists who engaged only a single theme\)\. The A2A distribution has a heavier tail toward higher entropy values, consistent with its higher mean $H=0\.617$ versus ERC\-8004’s mean $H=0\.348$, reflecting that A2A core contributors range across more themes on average\.Table 4\.Socio\-semantic bipartite network metrics\. Both regimes share the same thematic space, but governance form drives which themes absorb deliberative effort and how tightly specialized those subgroups become\.What differs across governance forms is*which*themes absorb deliberative effort, and how broadly the most active participants range\. Mean actor entropy is0\.3480\.348for ERC\-8004 and0\.6170\.617for A2A: A2A’s core contributors span roughly twice as many themes on average\. Gini\(H\)\(H\)remains high in both cases \(0\.7730\.773vs\.0\.7070\.707\), confirming that thematic breadth is itself concentrated in a small number of generalist actors in both communities\.
The cross\-case divergence is sharpest at the theme level\. In ERC\-8004,34\.5%34\.5\\%of actors participated in T08 \(Trust & Security Mechanisms\), and13\.1%13\.1\\%participated in T01 \(Protocol Specification & Versioning\)\. For A2A, the pattern inverts for engineering\-execution themes: Documentation & Examples \(T06\) engaged10\.8%10\.8\\%of A2A actors but only2\.1%2\.1\\%of ERC\-8004 actors\. Mean theme actor\-concentration Gini is0\.4530\.453for A2A against0\.0850\.085for ERC\-8004, indicating that each A2A theme attracts a narrower, more dedicated subgroup\. The thematic overlap coefficientΩ=1\.0\\Omega=1\.0: all 16 active ERC\-8004 themes reappear in A2A\.
## 5\.Discussion
### 5\.1\.Decentralization More a Design than a Fact
Decentralized governance is appealing for its idealistic design of distributed authority and trustless coordination\. In practice, however, the ERC\-8004 process exhibits a centralization paradox\.
Routine decisions are not subject to voting\. The power to decide, for example, which proposals are “ready”, which objections are “substantive”, which positions count as “rough consensus”, accumulates in the hands of a few minorities, mirroring the participation inequality patterns repeatedly documented in voluntary online cooperation\(Mockuset al\.,[2002](https://arxiv.org/html/2606.26203#bib.bib39); Germonprezet al\.,[2018](https://arxiv.org/html/2606.26203#bib.bib26)\)\.
More fundamentally, no formal hierarchy is provided to distribute the power of decision\. One possible reason for this is that the small scale of community size does not request such hierarchy; however, one direct drawback is that people tend to engage in groupthink, resulting in the denser discourse congruence\.
These two features compound: a certain minority holds the voice\. The decentralization promised by the EIP architecture exists at the level of entry rights; in practice, routine decision authority concentrates around whoever has the time and reputation to remain in the room\.
### 5\.2\.Role of Open Source and the Limitations
A natural objection is that the convergence we observe stems not from governance form but from open\-source\-society \(OSS\) norms shared by both cases\. First, the topical divergence between the two cases \(JSD=0\.288\\mathrm\{JSD\}=0\.288on BERTopic,0\.2160\.216on Thematic\-LM\) cannot be reduced to OSS norms; it is governance\-driven, reflecting the constitutive–regulative distinction between proposing a standard \(what the protocol*is*\) and shipping an implementation \(*how*it is built\)\. Second, ERC\-8004’s discourse\-congruence density is nearly twice that of A2A, the opposite of what an OSS\-scale\-only account would predict\. The cautious framing that follows is therefore layered: open\-source publication establishes a baseline of high participation inequality and fragmented community structure; governance form then redirects which themes this skewed participation engages and which actors accrue informal authority\.
Here are two more possible limitations\. First, ERC\-8004 contributes 142 records against A2A’s 4,181, so themes of low frequency carry wide confidence intervals\. Second, A2A’s TSC meetings, internal Google design reviews, and partner negotiations occur outside the public repository\. The structural concentration we observe in A2A may therefore understate deliberation among its core members\.
### 5\.3\.Who Controls Future Directions?
The answer is, in both cases, a small group of elites, despite the different constitutions\. In ERC\-8004, they are those who stay in the room; for A2A, corporation representatives are the gatekeepers\. The deeper risk of the DAO model is not that elites emerge—they emerge in any sustained deliberation\(Mockuset al\.,[2002](https://arxiv.org/html/2606.26203#bib.bib39); Germonprezet al\.,[2018](https://arxiv.org/html/2606.26203#bib.bib26)\)—but that, absent formal accountability structures, their influence operates through reputational authority that is opaque to outsiders and difficult for newcomers to contest\.
Moreover, the governance form determines what problems communities treat as worth deliberating about\. ERC\-8004’s deliberation is dominated by trust and security, while A2A also spreads its deliberation across engineering and execution issues\. The concentration differences of DAO and corporate discourse reflect a value distribution that propagates into deployed systems: communities that deliberate about accountability will architect for accountability; those that deliberate about velocity will architect for velocity\(World Economic Forum and Accenture,[2026](https://arxiv.org/html/2606.26203#bib.bib5)\)\. As agentic AI scales from research prototypes to critical infrastructure, these upstream deliberative choices become societal choices\.
Three implications follow for those positioned to act on these findings\.*Standards bodies*should pair open entry rights with procedural accountability mechanisms, because open access alone does not redistribute deliberative authority\.*Protocol architects*should treat the deliberative agenda itself as a design artifact: topics absent from the docket become embedded defaults, so a periodic audit of what is not discussed is as essential as auditing what is\.*Practitioners and policymakers*adopting either standard should consult its deliberation record alongside its specification, because the values that shaped the rules are rarely fully recoverable from the rules themselves\.
## References
- A2A Authors \(2025a\)Agent2Agent \(A2A\) Protocol\.Note:GitHubAccessed: Mar\. 28, 2026External Links:[Link](https://github.com/a2aproject/A2A)Cited by:[§1](https://arxiv.org/html/2606.26203#S1.p3.1),[§3\.1](https://arxiv.org/html/2606.26203#S3.SS1.p1.1)\.
- A2A Authors \(2025b\)GOVERNANCE\.md\.Note:GitHubAccessed: Mar\. 28, 2026External Links:[Link](https://github.com/a2aproject/A2A/blob/main/GOVERNANCE.md)Cited by:[Appendix E](https://arxiv.org/html/2606.26203#A5.p3.1),[§1](https://arxiv.org/html/2606.26203#S1.p3.1)\.
- a2aproject \(2025\)\.gitvote\.yml\.Note:GitHubExternal Links:[Link](https://github.com/a2aproject/A2A/blob/main/.gitvote.yml)Cited by:[Appendix E](https://arxiv.org/html/2606.26203#A5.p3.1),[§4\.1](https://arxiv.org/html/2606.26203#S4.SS1.p1.1)\.
- Z\. Ao, L\. W\. Cong, Gergely\. Horvath, and L\. Zhang \(2023\)Is decentralized finance actually decentralized? A social network analysis of the Aave protocol on the Ethereum blockchain\.Note:arXiv:2206\.08401 \[econ\.GN\]Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.13.9.4.1.1),[§1](https://arxiv.org/html/2606.26203#S1.p8.1),[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.3\.1](https://arxiv.org/html/2606.26203#S3.SS3.SSS1.p1.1)\.
- R\. Beck, C\. Mueller\-Bloch, and J\. King \(2018\)Governance in the blockchain economy: a framework and research agenda\.Journal of the Association for Information Systems19,pp\. 1020–1034\.External Links:[Document](https://dx.doi.org/10.17705/1jais.00518)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.21.5.4.1.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- V\. D\. Blondel, J\. Guillaume, R\. Lambiotte, and E\. Lefebvre \(2008\)Fast unfolding of communities in large networks\.Journal of Statistical Mechanics: Theory and Experiment2008\(10\),pp\. P10008\.External Links:[Document](https://dx.doi.org/10.1088/1742-5468/2008/10/P10008)Cited by:[4th item](https://arxiv.org/html/2606.26203#S3.I2.i4.p1.8)\.
- S\. P\. Borgatti and M\. G\. Everett \(2000\)Models of core/periphery structures\.Social Networks21\(4\),pp\. 375–395\.External Links:ISSN 0378\-8733,[Document](https://dx.doi.org/https%3A//doi.org/10.1016/S0378-8733%2899%2900019-2),[Link](https://www.sciencedirect.com/science/article/pii/S0378873399000192)Cited by:[5th item](https://arxiv.org/html/2606.26203#S3.I2.i5.p1.6),[§4\.3\.1](https://arxiv.org/html/2606.26203#S4.SS3.SSS1.p3.3)\.
- N\. A\. Carlson and V\. Burbano \(2026\)The use of llms to annotate data in management research: foundational guidelines and warnings\.Strategic Management Journal47\(3\),pp\. 699–725\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/smj.70023),[Link](https://sms.onlinelibrary.wiley.com/doi/abs/10.1002/smj.70023),https://sms\.onlinelibrary\.wiley\.com/doi/pdf/10\.1002/smj\.70023Cited by:[§1](https://arxiv.org/html/2606.26203#S1.p8.1)\.
- K\. Chen, Y\. Fan, Y\. Fang, and X\. \(\. Luo \(2025\)Beyond money: incentive effects of tokenized ownership on user contribution in daos\.Journal of Operations Management71\(7\),pp\. 988–1016\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/joom.1351),[Link](https://onlinelibrary.wiley.com/doi/abs/10.1002/joom.1351),https://onlinelibrary\.wiley\.com/doi/pdf/10\.1002/joom\.1351Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- J\. Cohen \(1988\)Statistical power analysis for the behavioral sciences\.2nd edition,Lawrence Erlbaum Associates,Hillsdale, NJ\.Cited by:[§3\.2\.1](https://arxiv.org/html/2606.26203#S3.SS2.SSS1.p1.1)\.
- E\. W\. Ellinger, R\. W\. Gregory, T\. Mini, T\. Widjaja, and O\. Henfridsson \(2024\)Skin in the game: the transformational potential of decentralized autonomous organizations\.MIS quarterly48\(1\),pp\. 245–272\.Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- ERC\-8004 Authors \(2026\)Erc\-8004\.md\.Note:GitHubAccessed: Mar\. 10, 2026External Links:[Link](https://github.com/ethereum/ERCs/blob/c8cd50348b111d13f62e1d71d2b4c8101bd3514f/ERCS/erc-8004.md)Cited by:[§3\.1](https://arxiv.org/html/2606.26203#S3.SS1.p1.1)\.
- Ethereum Foundation \(2015\)EIP\-1: EIP purpose and guidelines\.Note:Ethereum Improvement ProposalsAccessed: Mar\. 28, 2026External Links:[Link](https://eips.ethereum.org/EIPS/eip-1)Cited by:[Appendix E](https://arxiv.org/html/2606.26203#A5.p1.1),[Appendix E](https://arxiv.org/html/2606.26203#A5.p2.1),[§1](https://arxiv.org/html/2606.26203#S1.p3.1)\.
- Ethereum Magicians \(2025\)ERC\-8004: trustless agents\.Note:Ethereum Magicians Forum, topic 25098Accessed: Mar\. 10, 2026External Links:[Link](https://ethereum-magicians.org/t/erc-8004-trustless-agents/25098)Cited by:[§1](https://arxiv.org/html/2606.26203#S1.p3.1),[§3\.1](https://arxiv.org/html/2606.26203#S3.SS1.p1.1)\.
- Etherscan \(2026\)ERC\-8004 identity registry contract \(0x8004a169…\)\.Note:Ethereum MainnetDeployed Jan\. 29, 2026External Links:[Link](https://etherscan.io/address/0x8004A169FB4a3325136EB29fA0ceB6D2e539a432)Cited by:[§4\.1](https://arxiv.org/html/2606.26203#S4.SS1.p1.1)\.
- J\. Evans, B\. Bratton, and B\. Agüera y Arcas \(2026\)Agentic ai and the next intelligence explosion\.Science391\(6791\)\.External Links:[Document](https://dx.doi.org/10.1126/science.aeg1895)Cited by:[§1](https://arxiv.org/html/2606.26203#S1.p1.1)\.
- M\. Germonprez, G\. J\.P\. Link, K\. Lumbard, and S\. Goggins \(2018\)Eight observations and 24 research questions about open source projects: illuminating new realities\.Proc\. ACM Hum\.\-Comput\. Interact\.2\(CSCW\)\.External Links:[Link](https://doi.org/10.1145/3274326),[Document](https://dx.doi.org/10.1145/3274326)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§5\.1](https://arxiv.org/html/2606.26203#S5.SS1.p2.1),[§5\.3](https://arxiv.org/html/2606.26203#S5.SS3.p1.1)\.
- Google Cloud \(2025\)Google Cloud donates A2A to Linux Foundation\.Note:Google Developers BlogAccessed: Mar\. 28, 2026External Links:[Link](https://developers.googleblog.com/en/google-cloud-donates-a2a-to-linux-foundation/)Cited by:[Appendix E](https://arxiv.org/html/2606.26203#A5.p3.1),[§1](https://arxiv.org/html/2606.26203#S1.p3.1)\.
- M\. Grootendorst \(2022\)BERTopic: neural topic modeling with a class\-based TF\-IDF procedure\.Note:arXiv:2203\.05794Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.21.6.5.1.1.1),[§1](https://arxiv.org/html/2606.26203#S1.p8.1),[§3\.2\.2](https://arxiv.org/html/2606.26203#S3.SS2.SSS2.p1.6)\.
- C\. R\. Harvey and D\. Rabetti \(2024\)International business and decentralized finance\.Journal of International Business Studies55\(4\),pp\. 840–863\.External Links:[Document](https://dx.doi.org/10.1057/s41267-024-00705-7)Cited by:[Appendix B](https://arxiv.org/html/2606.26203#A2.p1.1),[§1](https://arxiv.org/html/2606.26203#S1.p1.1)\.
- C\. R\. Harvey, A\. Ramachandran, and J\. Santoro \(2021\)DeFi and the future of finance\.Wiley,Hoboken, NJ\.Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- X\. Hui and C\. Tucker \(2025\)Decentralization, blockchain, artificial intelligence \(ai\): challenges and opportunities\.Journal of Product Innovation Management42\(5\),pp\. 947–957\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1111/jpim.12800),[Link](https://onlinelibrary.wiley.com/doi/abs/10.1111/jpim.12800),https://onlinelibrary\.wiley\.com/doi/pdf/10\.1111/jpim\.12800Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.4.6.1.1.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p3.1)\.
- R\. A\. Hunt, D\. M\. Townsend, R\. Nugent, J\. J\. Simpson, M\. Stallkamp, and E\. Bozdag \(2024\)Digital battlegrounds: The power dynamics and governance of contemporary platforms\.Academy of Management Annals19\(1\)\.External Links:[Document](https://dx.doi.org/10.5465/annals.2022.0188)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p3.1)\.
- J\. Im, A\. X\. Zhang, C\. J\. Schilling, and D\. Karger \(2018\)Deliberation and resolution on wikipedia: a case study of requests for comments\.2\(CSCW\)\.External Links:[Link](https://doi.org/10.1145/3274343),[Document](https://dx.doi.org/10.1145/3274343)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- J\. Ji, R\. Lei, X\. Pan, Z\. Wei, H\. Sun, Y\. Lin, X\. Chen, Y\. Yang, Y\. Li, B\. Ding, and J\. Wen \(2026\)Leveraging LLM\-based agents for social science research: insights from citation network simulations\.Humanities and Social Sciences Communications13,pp\. 127\.External Links:[Document](https://dx.doi.org/10.1057/s41599-025-06193-w)Cited by:[§A\.2](https://arxiv.org/html/2606.26203#A1.SS2.p2.3)\.
- M\. Jungnickel, F\. Özdemir Sönmez, C\. Mulligan, and W\. J\. Knottenbelt \(2025\)DAO governance: voting power, participation, and controversy \- a review and an empirical analysis\.Distrib\. Ledger Technol\.\.Note:Just AcceptedExternal Links:[Link](https://doi.org/10.1145/3777416),[Document](https://dx.doi.org/10.1145/3777416)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- A\. Kiayias and P\. Lazos \(2023\)SoK: blockchain governance\.InProceedings of the 4th ACM Conference on Advances in Financial Technologies,AFT ’22,New York, NY, USA,pp\. 61–73\.External Links:ISBN 9781450398619,[Link](https://doi.org/10.1145/3558535.3559794),[Document](https://dx.doi.org/10.1145/3558535.3559794)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- S\. Kojaku and N\. Masuda \(2018\)A generalised significance test for individual communities in networks\.Scientific Reports8\.External Links:[Document](https://dx.doi.org/10.1038/s41598-018-25560-z)Cited by:[5th item](https://arxiv.org/html/2606.26203#S3.I2.i5.p1.5)\.
- M\. Kulakowski and F\. Frasincar \(2023\)Sentiment classification of cryptocurrency\-related social media posts\.38\(4\),pp\. 5–9\.External Links:ISSN 1541\-1672,[Link](https://doi.org/10.1109/MIS.2023.3283170),[Document](https://dx.doi.org/10.1109/MIS.2023.3283170)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.2\.2](https://arxiv.org/html/2606.26203#S3.SS2.SSS2.p2.1)\.
- V\. Latora and M\. Marchiori \(2001\)Efficient behavior of small\-world networks\.Physical Review Letters87\(19\),pp\. 198701\.External Links:[Document](https://dx.doi.org/10.1103/PhysRevLett.87.198701)Cited by:[7th item](https://arxiv.org/html/2606.26203#S3.I2.i7.p1.1)\.
- P\. Leifeld \(2013\)Reconceptualizing major policy change in the advocacy coalition framework: a discourse network analysis of german pension politics\.Policy Studies Journal41\(1\),pp\. 169–198\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1111/psj.12007),[Link](https://onlinelibrary.wiley.com/doi/abs/10.1111/psj.12007),https://onlinelibrary\.wiley\.com/doi/pdf/10\.1111/psj\.12007Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.16.12.4.1.1),[Table 11](https://arxiv.org/html/2606.26203#A8.T11.21.4.3.1.1.1),[§1](https://arxiv.org/html/2606.26203#S1.p8.1),[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.3\.2](https://arxiv.org/html/2606.26203#S3.SS3.SSS2.p1.7)\.
- R\. Li, P\. Pandurangan, H\. Frluckaj, and L\. Dabbish \(2021\)Code of conduct conversations in open source software projects on github\.Proc\. ACM Hum\.\-Comput\. Interact\.5\(CSCW1\)\.External Links:[Link](https://doi.org/10.1145/3449093),[Document](https://dx.doi.org/10.1145/3449093)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- J\. Lin \(1991\)Divergence measures based on the shannon entropy\.IEEE Transactions on Information Theory37\(1\),pp\. 145–151\.External Links:[Document](https://dx.doi.org/10.1109/18.61115)Cited by:[§3\.2\.2](https://arxiv.org/html/2606.26203#S3.SS2.SSS2.p1.9)\.
- F\. Lumineau, W\. Wang, and O\. Schilke \(2021\)Blockchain governance—a new way of organizing collaborations?\.Organization Science32\(2\),pp\. 500–521\.External Links:ISSN 1526\-5455,[Link](https://doi.org/10.1287/orsc.2020.1379),[Document](https://dx.doi.org/10.1287/orsc.2020.1379)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.2.2.2.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p3.1)\.
- C\. Ma, J\. Zhang, Z\. Zhu, C\. Yang, Y\. Yang, Y\. Jin, Z\. Lan, L\. Kong, and J\. He \(2024\)AgentBoard: an analytical evaluation board of multi\-turn llm agents\.InProceedings of the 38th International Conference on Neural Information Processing Systems,NIPS ’24,Red Hook, NY, USA\.External Links:ISBN 9798331314385Cited by:[§3](https://arxiv.org/html/2606.26203#S3.p1.1)\.
- MiniMax \(2026\)MiniMax M2\.5: built for real\-world productivity\.Note:MiniMaxAccessed: Apr\. 11, 2026External Links:[Link](https://www.minimax.io/news/minimax-m25)Cited by:[§3](https://arxiv.org/html/2606.26203#S3.p1.1)\.
- A\. Mockus, R\. T\. Fielding, and J\. D\. Herbsleb \(2002\)Two case studies of open source software development: apache and mozilla\.ACM Trans\. Softw\. Eng\. Methodol\.11\(3\),pp\. 309–346\.External Links:ISSN 1049\-331X,[Link](https://doi.org/10.1145/567793.567795),[Document](https://dx.doi.org/10.1145/567793.567795)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§4\.3\.3](https://arxiv.org/html/2606.26203#S4.SS3.SSS3.p1.1),[§5\.1](https://arxiv.org/html/2606.26203#S5.SS1.p2.1),[§5\.3](https://arxiv.org/html/2606.26203#S5.SS3.p1.1)\.
- M\. Motea and P\. Oba \(2026\)Who governs the ledger: rethinking blockchain governance through democratic innovation\.Humanities and Social Sciences Communications13\(1\),pp\. 770\.External Links:[Document](https://dx.doi.org/10.1057/s41599-026-06980-z)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- A\. Murray, S\. Kuban, M\. Josefy, and J\. Anderson \(2021\)Contracting in the smart era: The implications of blockchain and decentralized autonomous organizations for contracting and corporate governance\.Academy of Management Perspectives35\(4\),pp\. 622–641\.External Links:[Document](https://dx.doi.org/10.5465/amp.2018.0066)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.1.1.2.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p3.1)\.
- M\. E\. J\. Newman \(2006\)Modularity and community structure in networks\.Proceedings of the National Academy of Sciences103\(23\),pp\. 8577–8582\.External Links:[Document](https://dx.doi.org/10.1073/pnas.0601602103)Cited by:[4th item](https://arxiv.org/html/2606.26203#S3.I2.i4.p1.8)\.
- T\. Qiao, C\. Walker, C\. Cunningham, and Y\. S\. Koh \(2025\)Thematic\-lm: a llm\-based multi\-agent system for large\-scale thematic analysis\.InProceedings of the ACM on Web Conference 2025,WWW ’25,New York, NY, USA,pp\. 649–658\.External Links:ISBN 9798400712746,[Link](https://doi.org/10.1145/3696410.3714595),[Document](https://dx.doi.org/10.1145/3696410.3714595)Cited by:[§A\.3](https://arxiv.org/html/2606.26203#A1.SS3.p1.2),[Table 11](https://arxiv.org/html/2606.26203#A8.T11.10.6.4.1.1),[§1](https://arxiv.org/html/2606.26203#S1.p8.1),[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.2\.3](https://arxiv.org/html/2606.26203#S3.SS2.SSS3.p1.1)\.
- Y\. Quan, X\. Wu, W\. Deng, and L\. Zhang \(2024\)Decoding social sentiment in dao: a comparative analysis of blockchain governance communities\.In2024 IEEE 24th International Conference on Software Quality, Reliability, and Security Companion \(QRS\-C\),pp\. 216–224\.External Links:[Document](https://dx.doi.org/10.1109/QRS-C63300.2024.00037)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- H\. A\. Rahman, A\. Karunakaran, and L\. D\. Cameron \(2024\)Taming platform power: Taking accountability into account in the management of platforms\.Academy of Management Annals18\(1\),pp\. 251–294\.External Links:[Document](https://dx.doi.org/10.5465/annals.2022.0090)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p3.1)\.
- N\. Reimers and I\. Gurevych \(2019\)Sentence\-bert: sentence embeddings using siamese bert\-networks\.pp\. 3973–3983\.External Links:[Document](https://dx.doi.org/10.18653/v1/D19-1410)Cited by:[§3\.2\.2](https://arxiv.org/html/2606.26203#S3.SS2.SSS2.p1.6)\.
- P\. Reineke, R\. Katila, and K\. M\. Eisenhardt \(2025\)Decentralization in organizations: A revolution or a mirage?\.Academy of Management Annals19\(1\),pp\. 298–342\.External Links:[Document](https://dx.doi.org/10.5465/annals.2022.0206)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.3.3.2.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- C\. Roth and J\. Cointet \(2010\)Social and semantic coevolution in knowledge networks\.Social Networks32\(1\),pp\. 16–29\.Note:Dynamics of Social NetworksExternal Links:ISSN 0378\-8733,[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.socnet.2009.04.005),[Link](https://www.sciencedirect.com/science/article/pii/S0378873309000215)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.19.15.4.1.1),[Table 11](https://arxiv.org/html/2606.26203#A8.T11.21.3.2.1.1.1),[§1](https://arxiv.org/html/2606.26203#S1.p8.1),[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.3\.3](https://arxiv.org/html/2606.26203#S3.SS3.SSS3.p1.8)\.
- A\. L\. Russell \(2006\)‘Rough Consensus and Running Code’ and the Internet\-OSI Standards War\.IEEE Annals of the History of Computing28\(3\),pp\. 48–61\.External Links:[Document](https://dx.doi.org/10.1109/MAHC.2006.42)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.21.2.1.1.1.1)\.
- Z\. K\. Stine and N\. Agarwal \(2020\)Comparative discourse analysis using topic models: contrasting perspectives on china from reddit\.InInternational Conference on Social Media and Society,SMSociety’20,New York, NY, USA,pp\. 73–84\.External Links:ISBN 9781450376884,[Link](https://doi.org/10.1145/3400806.3400816),[Document](https://dx.doi.org/10.1145/3400806.3400816)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.7.3.4.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p4.1),[§3\.2\.2](https://arxiv.org/html/2606.26203#S3.SS2.SSS2.p1.6)\.
- A\. Sunyaev, M\. Avital, and M\. C\. Lacity \(2026\)From ideology to design: Toward purposeful decentralization of information systems governance\.Journal of the Association for Information Systems41\(1\)\.External Links:[Document](https://dx.doi.org/10.1177/02683962261430919)Cited by:[Table 11](https://arxiv.org/html/2606.26203#A8.T11.4.4.2.1.1),[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
- Q\. Wang, G\. Yu, Y\. Sai, C\. Sun, L\. D\. Nguyen, and S\. Chen \(2025\)Understanding daos: an empirical study on governance dynamics\.IEEE Transactions on Computational Social Systems12\(5\),pp\. 2814–2832\.External Links:[Document](https://dx.doi.org/10.1109/TCSS.2025.3539889)Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p4.1)\.
- World Economic Forum and Accenture \(2026\)Organizational transformation in the age of AI: how organizations maximize AI’s potential\.Note:World Economic ForumExternal Links:[Link](https://www.weforum.org/publications/organizational-transformation-in-the-age-of-ai-how-organizations-maximize-ais-potential/)Cited by:[§1](https://arxiv.org/html/2606.26203#S1.p1.1),[§5\.3](https://arxiv.org/html/2606.26203#S5.SS3.p2.1)\.
- R\. K\. Yin \(2018\)Case study research and applications: Design and methods\.6th edition,SAGE,Thousand Oaks, CA\.Cited by:[Appendix B](https://arxiv.org/html/2606.26203#A2.p1.1)\.
- R\. Ziolkowski, G\. Miscione, and G\. Schwabe \(2020\)Decision problems in blockchain governance: old wine in new bottles or walking in someone else’s shoes?\.Journal of Management Information Systems37\(2\),pp\. 316–348\.External Links:[Document](https://dx.doi.org/10.1080/07421222.2020.1759974),[Link](https://doi.org/10.1080/07421222.2020.1759974),https://doi\.org/10\.1080/07421222\.2020\.1759974Cited by:[§2](https://arxiv.org/html/2606.26203#S2.p2.1)\.
## Appendix AData Expansion and Robustness Verification
To verify that main\-text findings survive annotator choice and case scope, we expanded the DAO corpus from a single ERC\-8004 thread to a 34\-ERC agent\-standardization cluster, re\-annotated both cases with multiple independent models, and compared results against the original MiniMax\-M2\.5 labels\.
### A\.1\.Design
Data Expansion\.Beyond ERC\-8004, we added 33 contemporaneous \(≥\\geq2025\-08\) agent\-standardization ERCs from the Ethereum Magicians forum, totaling 34 unique ERC standards: 8001, 8004, 8033, 8041, 8107, 8118, 8122, 8126, 8150, 8160, 8162, 8165, 8166, 8171, 8181, 8183, 8184, 8196, 8203, 8210, 8217, 8220, 8226, 8239, 8240, 8242, 8257, 8259, 8263, 8264, 8273, 8274, 8275, 8294\. Among these, ERC\-8183 \(Agentic Commerce\), ERC\-8274 \(AI Inference Proof Verification\), and ERC\-8210 \(Agent Assurance\) are the three most deliberatively active additions\. The expanded ERC corpus contains 1,664 annotated records, up from 142 in the main text\. The A2A corpus was reused\.
Cross\-Model Annotation\.Three models \(DeepSeek\-V4\-Flash, GLM\-4\-Plus, and Moonshot\-v1\-auto\) independently annotated every record on all five fields \(stakeholder\_institution, argument\_type, stance, consensus\_signal, key\_point\)\. Majority voting \(2 out of 3\) yielded consensus labels for 1,664 ERC and 4,058 A2A records\. Together with the original MiniMax\-M2\.5, four independent annotators provide inter\-annotator agreement estimates\.
Table 5\.Three\-model cross\-consensus agreement rates \(majority voting across 3 rounds within each model, then 2\-of\-3 across models\)\. “No maj\.” = all three models disagree on that field\.Cross\-Round Annotation\.Each of the three models independently re\-annotated every record three times on the three core fields \(argument\_type, stance, consensus\_signal\)\. Majority vote across rounds within each model produced a per\-model consensus; majority vote across models then produced the final cross\-consensus\. Per\-field cross\-consensus agreement rates are reported in Table[5](https://arxiv.org/html/2606.26203#A1.T5)\. When all three models disagree for a given field \(no majority;<2%<2\\%–3\.5%3\.5\\%of records\), the label is assigned by plurality with an effectively arbitrary tie\-break and is treated as low\-confidence in subsequent analyses\.
### A\.2\.Robustness Verification
Cross\-model reliability\.Four\-model Fleiss’κ\\kappawas computed \(Table[6](https://arxiv.org/html/2606.26203#A1.T6)\)\. All three fields reach Moderate agreement in both cases, with argument type—the core field driving theχ2\\chi^\{2\}, BERTopic, and Thematic\-LM analyses—atκ=0\.545\\kappa=0\.545\(ERC\) andκ=0\.529\\kappa=0\.529\(A2A\)\. The strongest pairwise agreement occurs between GLM\-4\-Plus and Moonshot\-v1\-auto \(argument typeκ=0\.671\\kappa=0\.671ERC,0\.5550\.555A2A\), suggesting these two models share similar interpretive biases relative to DeepSeek\-V4\-Flash and MiniMax\-M2\.5\.
Table 6\.Four\-model Fleiss’κ\\kappa\(MiniMax\-M2\.5, DeepSeek\-V4\-Flash, GLM\-4\-Plus, Moonshot\-v1\-auto\)\. ERCN=144N=144, A2AN=3,760N=3\{,\}760\.Cross\-round self\-consistency\.Within\-model test\-retest reliability \(Table[7](https://arxiv.org/html/2606.26203#A1.T7)\) reveals clear model\-level differences\. GLM\-4\-Plus and Moonshot\-v1\-auto achieve almost perfect self\-consistency \(Fleiss’κ\>0\.81\\kappa\>0\.81on all three fields in both cases\), while DeepSeek\-V4\-Flash yields Moderate\-to\-Substantial values \(κ=0\.49\\kappa=0\.49–0\.630\.63\)\. The gap confirms that annotator model choice dominates stochastic variation: two models are near\-deterministic in their labeling behavior, while one exhibits higher intrinsic variance\(Jiet al\.,[2026](https://arxiv.org/html/2606.26203#bib.bib3)\)\. Notably, Moonshot\-v1\-auto, a generic auto\-routing model with no special reasoning configuration, outperforms both alternatives in self\-consistency, challenging the intuition that reasoning models are inherently more reliable annotators\.
Table 7\.Within\-model cross\-round Fleiss’κ\\kappa\(3 rounds, majority vote\)\. ERCN=1,664N\{=\}1\{,\}664, A2AN≈3,845N\{\\approx\}3\{,\}845\.
### A\.3\.Substantive Replication
We re\-ran the annotation pipeline with three additional models \(Moonshot\-v1\-auto, GLM\-4\-Plus, DeepSeek\-V4\-Flash; ICRκ\\kappa≈\\approx0\.7–0\.9 for the top two, § A\.1\) and replicated the Thematic\-LM discourse analysis\(Qiaoet al\.,[2025](https://arxiv.org/html/2606.26203#bib.bib19)\)using Moonshot\-v1\-auto as the LLM backbone, yielding a 12\-theme codebook with 96\.6% coverage\. Table[8](https://arxiv.org/html/2606.26203#A1.T8)reports the key combined metrics\.
Table 8\.Replication metrics: multi\-model consensus annotation \+ Moonshot Thematic\-LM codebook\.Three of four main\-text findings replicate: \(1\) Technical argument types dominate regardless of annotator, with A2A receiving roughly twice the Process share; \(2\) both networks remain steeply unequal \(betweenness Gini≈\\approx0\.8\); \(3\) ERC discourse congruence density exceeds A2A \(0\.403 vs\. 0\.252\)\. Thematic content also replicates: ERC concentrates on constitutive themes \(compliance, standards, verification\), A2A on executive themes \(documentation, coordination, community\)\. However,network connectivity reverses:the expanded ERC network coalesces into one dominant component \(GCR 0\.328→\\rightarrow0\.917\), while the re\-annotated A2A network fragments further \(GCR 0\.534→\\rightarrow0\.285\)\. The cross\-case thematic JSD of 0\.092 is lower than the main\-text BERTopic value \(0\.288\), partly reflecting Moonshot’s coarser 12\-theme granularity\. We interpret the connectivity reversal as an*observability*differential: permissionless DAO governance externalizes deliberation into a genuinely interconnected public record, whereas corporate governance internalizes decisive coordination channels \(TSC calls, private forums\), leaving the public GitHub trace persistently fragmented\. Participation*concentration*is comparable across both governance forms, but*observability*is not\.
## Appendix BComparative Case Study Design
We adopt a comparative case study design to examine how governance structure shapes participation dynamics in AI protocol standardization\(Yin,[2018](https://arxiv.org/html/2606.26203#bib.bib45)\)\. Interoperability is an inherited characteristic of decentralized applications \(dApps\) in DeFi\(Harvey and Rabetti,[2024](https://arxiv.org/html/2606.26203#bib.bib46)\), and ERC\-8004 as a protocol specifically designed for AI agents extends this feature\. Google A2A, on the corporation’s side, realized similar functions in different contexts\. Moreover, they originated within the same year \(2025\)\. Nevertheless, they differ sharply in governance architecture\. ERC\-8004 belongs to Ethereum, a permissionless and community\-driven open\-source DAO, and Google A2A is now governed by TSC\. Therefore, the two protocols constitute a theoretically matched pair\. Holding the technical domain constant while varying the governance form enables structured comparison of participation patterns, discourse composition, and network analysis\.
## Appendix CData Provenance
All raw data files are versioned with SHA\-256 checksums stored indata/raw/CHECKSUMS\.jsonin the project repository\. Table[9](https://arxiv.org/html/2606.26203#A3.T9)lists the file names, record counts, and collection dates for the primary raw data files used in this study\.
Table 9\.Raw Data Files and Record Counts
## Appendix DERC\-8004 Lifecycle Phase Boundaries
Topic analysis within ERC\-8004 was conducted across three consecutive two\-month phases spanning the full proposal lifecycle \(2025\-08\-13 to 2026\-01\-29\):
- •Phase 1\(Aug 13 – Oct 13, 2025\): Initial submission and early community review\.n=100n=100records\.
- •Phase 2\(Oct 13 – Dec 13, 2025\): Sustained review period prior to Last Call designation\.n=11n=11records\.
- •Phase 3\(Dec 13, 2025 – Feb 13, 2026\): Last Call through Final ratification \(mainnet deployment 2026\-01\-29\)\.n=17n=17records\.
## Appendix EDecision Architectures
Algorithm 1EIP Lifecycle1:Data:
Stages=\{IDEA,DRAFT,REVIEW,LAST\_CALL,Stages=\\\{IDEA,DRAFT,REVIEW,LAST\\\_CALL,
2:
FINAL,STAGNANT,WITHDRAWN\}FINAL,STAGNANT,WITHDRAWN\\\}
3:
stage←IDEAstage\\leftarrow IDEA
4:while
stage≠FINALstage\\neq FINALand
stage≠WITHDRAWNstage\\neq WITHDRAWNdo
5:Proceed to
NEXT\(stage\)NEXT\(stage\)based on community consensus;
6:ifNo activity for a long periodthen
7:
stage←STAGNANTstage\\leftarrow STAGNANT;
8:elseifAuthor decides to cancelthen
9:
stage←WITHDRAWNstage\\leftarrow WITHDRAWN;
10:endif
11:endwhileDeployment:Theoretically any partyMAYdeploy registry independently of
stagestage
Algorithm 2A2A Decision Process \(TSC Governance\)1:Input:GitHub Pull Request or Issue \(
prpr\)
2:Output:Decision Status \(Merged or Rejected\)
3:if
prpris NOT Contestedthen
4:// Apply Lazy Consensus
5:
Status←MERGEDStatus\\leftarrow MERGED;
6:else
7:// Escalation to Technology Steering Committee \(TSC\)
8:InitiateGitVote\(Restricted to 1 seat per company\);
9:ifVote == PASSthen
10:
Status←MERGEDStatus\\leftarrow MERGED;
11:else
12:
Status←REJECTEDStatus\\leftarrow REJECTED;
13:endif
14:endif
15:return
StatusStatus;
There are 3 types of EIPs: standard track EIP, Meta EIP and Informational EIP\. The standards track EIP can be broken down to 4 categories: core, networking, interface and ERC\(Ethereum Foundation,[2015](https://arxiv.org/html/2606.26203#bib.bib13)\)\. All EIPs, including ERC\-8004 in our case, undergo 5 stages: idea, draft, review, last call and final\. EIPs updated continuously are assigned to the special stage of “living”\. All EIPs are decided by rough consensus: Core EIPs by Ethereum core developers at the meeting calledAllCoreDevs, and others by common developers and users\.
This reflects a structural feature of ERCs as application\-layer specifications: unlike Core EIPs, which modify the Ethereum protocol itself and requires hard\-fork activation when deployment, ERCs define smart\-contract interface standards that any party may implement independently, regardless of the proposal’s formal lifecycle status\(Ethereum Foundation,[2015](https://arxiv.org/html/2606.26203#bib.bib13)\)\.
Google A2A launched undergoogle/A2Ain April 2025 and was donated to the Linux Foundation in June 2025\(Google Cloud,[2025](https://arxiv.org/html/2606.26203#bib.bib16)\), migrating to the vendor\-neutrala2aproject/A2Aorganization\. Governance authority rests with an eight\-seat Technical Steering Committee \(TSC\), comprising representatives each from Google, Microsoft, Cisco, AWS, Salesforce, SAP, IBM, and ServiceNow\. Independent contributors cannot join the TSC during an 18\-month startup phase\(A2A Authors,[2025b](https://arxiv.org/html/2606.26203#bib.bib15)\)\. Routine pull requests are merged upon maintainer approval without a formal vote\(A2A Authors,[2025b](https://arxiv.org/html/2606.26203#bib.bib15)\); contested specification changes escalate to a GitVote\(a2aproject,[2025](https://arxiv.org/html/2606.26203#bib.bib59)\), a GitHub\-native ballot in which only TSC members cast binding votes at a 51% threshold\.
For pseudocodes, see Algorithm[1](https://arxiv.org/html/2606.26203#alg1)and[2](https://arxiv.org/html/2606.26203#alg2)\.
## Appendix FInstitution Label Provenance Cascade
Institution labels were assigned using the following three\-tier cascade, applied in priority order:
1. \(1\)Manual investigation\(high confidence\): A systematic review of GitHub profiles, LinkedIn pages, personal websites, and EIP commit metadata was conducted for the top 109 contributors across both cases \(all 71 ERC\-8004 participants plus the top 38 A2A contributors by post count\)\. This review yielded 40 institutional upgrades, replacing LLM\-inferred labels with verified affiliations\. The original LLM\-inferred label is preserved in a separateinstitution\_lmfield for all records to enable sensitivity checks\.
2. \(2\)LLM inference\(low confidence\): For the remaining 517 authors, institution was inferred by MiniMax\-M2\.5 from contextual signals in the record text \(e\.g\., repository ownership, self\-identification, email patterns where visible\)\.
## Appendix GActor Filtering Stages
The three network analyses operate on successively filtered actor sets \(Table[10](https://arxiv.org/html/2606.26203#A7.T10)\)\. Starting from the annotated records \(ERC: 142 records / 71 contributors; A2A: 4,181 / 778\), the co\-participation network drops four ERC and seven A2A bot/short\-text residuals, yieldingN=67N=67andN=771N=771respectively\. The DNA and socio\-semantic analyses further \(a\) inner\-join against the Thematic\-LMcoded\_records\.json\(12 records lose a theme assignment\) and \(b\) exclude records whose stance isOff\-topicorUnclassified, dropping actors whose*only*contributions fall into those categories\. This produces the final stance\-bearing actor sets ofN=66N=66\(ERC\-8004\) andN=710N=710\(Google A2A\)\. All SNA statistics useN=67N=67/N=771N=771; all DNA and socio\-semantic statistics useN=66N=66/N=710N=710\.
Table 10\.Actor and record counts through the three filtering stages\.
## Appendix HRelated Work Table
Table[11](https://arxiv.org/html/2606.26203#A8.T11)positions sixteen representative works into three classes by paper type, each evaluated by criteria appropriate to that class\. Panel \(a\) lists*foundational works*that originated the methods or concepts we inherit; panel \(b\) lists*perspective papers*that advance theoretical claims about DAO and corporate governance without conducting their own empirical tests; panel \(c\) lists*empirical and methodological studies*are the key methodological works evaluated additionally by openness\. This study extends panel \(c\) with all data and codes open\.
Table 11\.Positioning of this study relative to prior work, organized by paper type\. Each panel applies an evaluation rubric appropriate to its class; the synthesis line at the bottom describes how this study bridges all three\.\(a\) Foundational / Seminal Works
\(b\) Perspective / Opinion PapersSimilar Articles
A sobering tale of AI governance
This Reddit post discusses a research paper highlighting fundamental challenges in AI governance, including social attack surfaces, failures of social coherence in LLM-backed agents, and the inadequacy of current governance tools for agentic systems.
Deontic Policies for Runtime Governance of Agentic AI Systems
This paper proposes AgenticRei, a framework for runtime governance of LLM-driven agentic AI systems using deontic policies expressed in OWL, enabling obligations, dispensations, and conflict resolution beyond traditional policy engines.
Governance by Design: Architecting Agentic AI for Organizational Learning and Scalable Autonomy
This paper presents a qualitative case study of a large IT services company's 2025 development and rollout of an agentic AI system, distilling seven lessons for embedding governance into system architecture and operations to balance autonomy with accountability.
Redrawing the AI Map: A Theory of Accountability Boundaries in Agentic Ecosystems
This paper introduces accountability assets and rule debt, proposing a theory of how agentic AI orchestrators affect organizational boundaries, arguing that accountability constraints limit the modularizing effects of technical decomposability.
different approach for agentic AI for regulated industry - questions
Summarizes a deterministic, constraint-based approach for building AI agents in regulated finance, where the LLM only generates prose, numbers are cryptographically sealed, and auditability is ensured through separated layers.