Voices in the Loop: Mapping Participatory AI

arXiv cs.AI Papers

Summary

This paper presents a reproducible protocol for building an open repository and interactive atlas of participatory AI initiatives, analyzing 131 records to reveal geographic and lifecycle patterns, and proposes a framework for participatory-by-default AI infrastructures.

arXiv:2605.16827v1 Announce Type: new Abstract: Participatory approaches to artificial intelligence are increasingly documented across public, civic, and humanitarian settings, but evidence about how participation is organized remains fragmented. This paper reports on the construction of an open repository and interactive atlas of participatory AI initiatives, using records harmonized from Maga~na and Shilton's Trustworthy AI corpus, and additional audited cases from research and practice. We contribute three elements. First, we specify a reproducible protocol for discovery, vetting, harmonization, geocoding, provenance tracking, and release-based publication of participatory AI records. Second, we report corpus-level patterns in geography, participation tiers, lifecycle loci, organizational form, verification status, and remaining documentation gaps. Documented initiatives remain concentrated in a small number of countries, while participation is most often coded at problem formulation, evaluation, and governance rather than model development or training. Third, we show how the atlas operationalizes a design and governance framework for participatory-by-default AI infrastructures through versioned releases, record-linked issue and annotation channels, schema feedback workflows, and redaction or restricted-disclosure requests. The atlas is intended to support comparative research, policy learning, and community scrutiny through a living inventory that can be updated, contested, and reused.
Original Article
View Cached Full Text

Cached at: 05/19/26, 06:36 AM

# Voices in the Loop: Mapping Participatory AI
Source: [https://arxiv.org/html/2605.16827](https://arxiv.org/html/2605.16827)
Rashid Mushkani

\(2026\)

###### Abstract\.

Participatory approaches to artificial intelligence are increasingly documented across public, civic, and humanitarian settings, but evidence about how participation is organized remains fragmented\. This paper reports on the construction of an open repository and interactive atlas of participatory AI initiatives, using 131 records harmonized from Magaña and Shilton’s Trustworthy AI corpus, and additional audited cases from research and practice\. We contribute three elements\. First, we specify a reproducible protocol for discovery, vetting, harmonization, geocoding, provenance tracking, and release\-based publication of participatory AI records\. Second, we report corpus\-level patterns in geography, participation tiers, lifecycle loci, organizational form, verification status, and remaining documentation gaps\. Documented initiatives remain concentrated in a small number of countries, while participation is most often coded at problem formulation, evaluation, and governance rather than model development or training\. Third, we show how the atlas operationalizes a design and governance framework for participatory\-by\-default AI infrastructures through versioned releases, record\-linked issue and annotation channels, schema feedback workflows, and redaction or restricted\-disclosure requests\. The atlas is intended to support comparative research, policy learning, and community scrutiny through a living inventory that can be updated, contested, and reused\.

participatory AI, public interest technology, AI governance, registries, documentation, mapping studies

††journalyear:2026††copyright:cc††conference:The 2026 ACM Conference on Fairness, Accountability, and Transparency; June 25–28, 2026; Montreal, QC, Canada††booktitle:The 2026 ACM Conference on Fairness, Accountability, and Transparency \(FAccT ’26\), June 25–28, 2026, Montreal, QC, Canada††ccs:Applied computing Governance††ccs:Applied computing Law, social and behavioral sciences††ccs:Computing methodologies Machine learning## 1\.Introduction

Artificial intelligence systems are developed and deployed in contexts where decisions are contested, impacts are unevenly distributed, and knowledge about local conditions is held by actors outside development teams\. In response, public agencies, civic organizations, and researchers have adopted participatory methods intended to incorporate affected stakeholders into the design, evaluation, and governance of AI\-enabled systems\. Such efforts draw on traditions in participatory design and democratic governance that treat participation as a question of decision influence rather than consultation alone\(Arnstein,[1969](https://arxiv.org/html/2605.16827#bib.bib1); Schuler and Namioka,[1993](https://arxiv.org/html/2605.16827#bib.bib4); Fung,[2006](https://arxiv.org/html/2605.16827#bib.bib5); Robertson and Simonsen,[2012](https://arxiv.org/html/2605.16827#bib.bib6); Mushkani,[2026](https://arxiv.org/html/2605.16827#bib.bib2)\)\. Within AI research and practice, participatory approaches have been discussed as a route to improve problem formulation, data practices, and accountability, while also raising concerns about extractive engagement and “participation\-washing”\(Sloaneet al\.,[2022](https://arxiv.org/html/2605.16827#bib.bib8); Birhaneet al\.,[2022](https://arxiv.org/html/2605.16827#bib.bib9); Irani,[2019](https://arxiv.org/html/2605.16827#bib.bib13); Mushkaniet al\.,[2025](https://arxiv.org/html/2605.16827#bib.bib3)\)\. In this paper, our focus is participation*for*AI: participatory arrangements that shape the design, deployment, or oversight of an AI system\. We include “AI for participation” only when the AI system itself is governed through such mechanisms; tools that merely support meetings, consultation, or translation without affecting an AI system’s own design or governance are out of scope\.

Public sector adoption of machine learning has expanded the range of settings where these concerns arise\. Algorithmic tools are used to allocate services, target inspections, and shape administrative priorities, including in urban governance and social service delivery\. Prior work documents how data\-driven systems can reinforce existing inequalities when institutional constraints and political choices are treated as technical parameters\(Eubanks,[2018](https://arxiv.org/html/2605.16827#bib.bib34); Selbstet al\.,[2019](https://arxiv.org/html/2605.16827#bib.bib14)\)\. As a result, governance responses combine demands for transparency about algorithmic systems with demands for participation in decisions about whether, how, and under what conditions such systems should be used\(Wieringa,[2020](https://arxiv.org/html/2605.16827#bib.bib35); Kitchin,[2017](https://arxiv.org/html/2605.16827#bib.bib36)\)\. Participation in this setting is not limited to user experience design; it includes institutional mechanisms such as oversight boards, labor representation, community data governance, and channels for contesting automated decisions\.

Despite the growth of participatory claims, the landscape of participatory AI remains difficult to navigate\. Documentation is distributed across project websites, academic publications, government portals, and grey literature, with heterogeneous terminology and inconsistent reporting of basic metadata\. The result is a fragmented evidence base that hinders comparative research, impedes learning across sectors, and makes it difficult for communities and policymakers to identify relevant precedents\. In urban governance, the absence of systematic visibility has motivated the development of algorithm registers and transparency standards that publish structured information about algorithmic tools used by public bodies\(City of Amsterdam,[2022](https://arxiv.org/html/2605.16827#bib.bib29); City of Helsinki,[2020](https://arxiv.org/html/2605.16827#bib.bib27); UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28); Digital Forum Lab and Eurocities,[2022](https://arxiv.org/html/2605.16827#bib.bib30)\)\. Related concerns occur in humanitarian and civic technology, where short project cycles, shifting funders, and limited archival capacity contribute to loss of institutional memory and to limited comparability across pilots\(Berditchevskaiaet al\.,[2021](https://arxiv.org/html/2605.16827#bib.bib22); Meier,[2015](https://arxiv.org/html/2605.16827#bib.bib21)\)\.

This paper addresses these limitations by constructing an open repository and interactive atlas of participatory AI initiatives assembled from public documentation available through early 2026\. The repository is designed as a living infrastructure rather than a static catalog\. It is intended to support ongoing additions, corrections, and disputes about how participatory claims are operationalized in practice\.

Our empirical focus is 131 projects\. The dataset harmonizes records from the 95\-project Trustworthy AI corpus reported by Magaña and Shilton\(Magaña and Shilton,[2025](https://arxiv.org/html/2605.16827#bib.bib45)\), along with additional audited searches across research and practice documentation\. Because records can enter through more than one discovery path, we treat the corpus as an integrated collection rather than partitioning it by source\.

Across this corpus, we curate geospatial, documentary, and participation\-specific metadata to support a global map view and comparative analysis of where participation is located in the AI lifecycle, how strong the underlying evidence is, and which records remain contested or provisional\. We also examine what can be learned from the corpus\.

Participation tier, participants, methods, and lifecycle stages are coded for every record, shifting the main analytical bottleneck from the absence of participation fields to the uneven evidentiary quality and archival durability of public documentation\. Using this dataset, we identify four empirical patterns: concentration in a small set of jurisdictions; the dominance of community\-led, co\-design, and governance\-oriented forms of participation; a long tail of provenance domains; and persistent gaps in partner constellations, closure timelines, and third\-source corroboration\.

The contribution of the paper is threefold\. We specify a reproducible protocol for discovery, vetting, and harmonization of project records, including deduplication, geocoding, provenance tracking, and release\-based archiving\. We then report empirical patterns in the corpus, emphasizing geographic concentration, participation tiers, lifecycle loci, organizational forms, verification status, and remaining documentation gaps\. Finally, we propose a design and governance framework for participatory\-by\-default AI infrastructures\. The framework synthesizes lessons from public algorithm registers, collective crisis intelligence pilots, indigenous data governance, and community\-maintained open science\(Carrollet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib23); Kukutai and Taylor,[2016](https://arxiv.org/html/2605.16827#bib.bib24); Berditchevskaiaet al\.,[2021](https://arxiv.org/html/2605.16827#bib.bib22); Bowker and Star,[1999](https://arxiv.org/html/2605.16827#bib.bib32)\)\. Rather than treating participation as an optional layer added to technical work, the framework treats participation as a default property of infrastructures that mediate data collection, model development, and public accountability\. The contribution is therefore both infrastructural and analytical: we provide a reusable, versioned corpus and atlas, but we also use that corpus to make claims about the current conditions under which participatory AI can be compared, contested, and critically evaluated\.

The remainder of the paper is organized as follows\. Section[2](https://arxiv.org/html/2605.16827#S2)reviews prior work relevant to participatory AI mapping and documentation\. Section[3](https://arxiv.org/html/2605.16827#S3)defines the operational scope used for mapping\. Section[4](https://arxiv.org/html/2605.16827#S4)presents the mapping protocol and repository design\. Section[5](https://arxiv.org/html/2605.16827#S5)reports corpus\-level patterns and documentation gaps\. Section[6](https://arxiv.org/html/2605.16827#S6)proposes governance and design requirements for participatory\-by\-default infrastructures\. Section[7](https://arxiv.org/html/2605.16827#S7)discusses implications and limitations, followed by conclusions\.

## 2\.Background and related work

Mapping participatory AI draws on theory from participatory design and democratic governance, and on practical ways to document and compare sociotechnical systems\. We review four areas: \(1\) participation in design and governance, \(2\) participatory approaches in AI/ML, \(3\) documentation and evaluation artifacts, and \(4\) registers, inventories, and systematic mapping in public technology governance\.

Participatory design emerged from labor and workplace democratization movements that treated technology design as a locus of power, negotiation, and situated expertise\(Schuler and Namioka,[1993](https://arxiv.org/html/2605.16827#bib.bib4); Simonsen and Robertson,[2012](https://arxiv.org/html/2605.16827#bib.bib7)\)\. Its methods emphasize in\-context inquiry, iterative prototyping with stakeholders, and explicit attention to whose interests are represented\. In governance research, participation is often assessed by influence, representation, and institutional form\. Arnstein’s ladder distinguishes tokenistic consultation from delegated power\(Arnstein,[1969](https://arxiv.org/html/2605.16827#bib.bib1)\), while Fung’s typology differentiates participation by who participates, how they deliberate, and what authority they hold\(Fung,[2006](https://arxiv.org/html/2605.16827#bib.bib5)\)\. Public administration work highlights how these mechanisms operate under legal, budgetary, and capacity constraints, including when participation is mandated without commensurate resources\(Nabatchi,[2012](https://arxiv.org/html/2605.16827#bib.bib33)\)\.

Within AI and machine learning, participatory methods have been argued to matter across the lifecycle, including problem formulation, data practices, evaluation, and monitoring\(Birhaneet al\.,[2022](https://arxiv.org/html/2605.16827#bib.bib9); Sloaneet al\.,[2022](https://arxiv.org/html/2605.16827#bib.bib8)\)\. A central critique is that “participation” can become extractive labor or validation theater when communities supply data or legitimacy without meaningful decision influence or reciprocal benefit\(Sloaneet al\.,[2022](https://arxiv.org/html/2605.16827#bib.bib8); Irani,[2019](https://arxiv.org/html/2605.16827#bib.bib13)\)\. Human\-centered AI research also emphasizes tensions between local specificity and the pressures of scale, especially for systems deployed across jurisdictions with different norms and institutional capacities\(Selbstet al\.,[2019](https://arxiv.org/html/2605.16827#bib.bib14)\)\. In parallel, public interest technology and design justice emphasize participation as a strategy to redistribute design authority and align system goals with community\-defined priorities\(Costanza\-Chock,[2020](https://arxiv.org/html/2605.16827#bib.bib11); D’Ignazio and Klein,[2020](https://arxiv.org/html/2605.16827#bib.bib12)\)\. Together, these debates motivate mapping that distinguishes participation claims from documented mechanisms, and that treats missing documentation as an empirical finding rather than a nuisance\. Two recent lines of work provide particularly direct context for our contribution\. Delgado et al\. synthesize the theoretical foundations of the “participatory turn” in AI design and show that current practice varies substantially in how much agency it grants stakeholders\(Delgadoet al\.,[2023](https://arxiv.org/html/2605.16827#bib.bib42)\)\. We draw directly on that distinction between participation claims and decision influence when defining inclusion criteria\. Kawakami et al\.’s Situate AI Guidebook provides a co\-designed toolkit for early\-stage, multi\-stakeholder deliberation around proposed public sector AI systems\(Kawakamiet al\.,[2024](https://arxiv.org/html/2605.16827#bib.bib43)\)\. That work is prospective and intervention\-oriented; by contrast, our atlas documents already public initiatives as an empirical case base that can be compared, contested, and revisited over time\.

Documentation artifacts enable comparison across diverse projects\. Proposals such as datasheets and model cards standardize reporting about datasets/models, intended use, and performance\(Gebruet al\.,[2021](https://arxiv.org/html/2605.16827#bib.bib15); Mitchellet al\.,[2019](https://arxiv.org/html/2605.16827#bib.bib16)\); data statements formalize provenance and collection conditions in NLP\(Bender and Friedman,[2018](https://arxiv.org/html/2605.16827#bib.bib17)\); and algorithmic impact assessments and auditing frameworks aim to structure disclosure about risks, mitigations, and oversight\(Reismanet al\.,[2018](https://arxiv.org/html/2605.16827#bib.bib19); Rajiet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib18)\)\. These approaches inform repository schema design but typically foreground technical artifacts rather than participatory process metadata\. Evaluation traditions add relevant distinctions: value sensitive design treats values as explicit design constraints that can be elicited and negotiated\(Friedman and Hendry,[2019](https://arxiv.org/html/2605.16827#bib.bib37)\), while participatory evaluation differentiates participation\-as\-data\-collection from shared control over evaluation questions, interpretation, and use\(Cousins and Whitmore,[1998](https://arxiv.org/html/2605.16827#bib.bib38)\)\. For participatory AI mapping, this suggests documenting not only who was engaged, but how engagement shaped decisions and how outcomes were assessed\.

Work on infrastructures and classification frames an atlas or repository as a governance instrument rather than a neutral index\. What becomes visible and actionable depends on standards, installed bases, and classification choices\(Star and Ruhleder,[1996](https://arxiv.org/html/2605.16827#bib.bib39); Star,[1999](https://arxiv.org/html/2605.16827#bib.bib40); Bowker and Star,[1999](https://arxiv.org/html/2605.16827#bib.bib32)\)\. Registries enact categories and defaults that shape accountability; a “living” participatory AI atlas must therefore anticipate how schemas and review processes can reproduce exclusions, and must support contestation and revision over time\.

Public\-sector technology governance offers concrete precedents for living inventories\. Municipal algorithm registers \(Helsinki and Amsterdam\) publish structured descriptions of deployed systems and provide feedback channels\(City of Helsinki,[2020](https://arxiv.org/html/2605.16827#bib.bib27); City of Amsterdam,[2022](https://arxiv.org/html/2605.16827#bib.bib29)\)\. The UK Algorithmic Transparency Recording Standard provides a disclosure template covering purpose, data, oversight, and impact considerations\(UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28)\), while transnational efforts aim to harmonize transparency categories across jurisdictions\(Digital Forum Lab and Eurocities,[2022](https://arxiv.org/html/2605.16827#bib.bib30)\)\. The New York City ADS Task Force report illustrates both the demand for visibility and the institutional difficulty of standardizing disclosure\(New York City Automated Decision Systems Task Force,[2019](https://arxiv.org/html/2605.16827#bib.bib41)\)\. While these instruments focus on systems inside public administrations, participatory AI initiatives often span civic organizations, academia, and humanitarian networks; nonetheless, register practices around provenance, versioning, and public contestation are instructive for participatory AI documentation\.

A closely related line of work is Magaña and Shilton’s systematic review of 95 global projects connecting participatory AI to trustworthy AI\(Magaña and Shilton,[2025](https://arxiv.org/html/2605.16827#bib.bib45)\)\. Their contribution is an analytic synthesis of frameworks, methods, and shared tasks across documented cases\. Our work instead provides a reusable documentation infrastructure: a release\-based atlas with harmonized project records, provenance and verification fields, participation\-tier and lifecycle coding, and governance mechanisms for contestation, redaction, and schema evolution\. More broadly, repositories such as Participedia demonstrate the value of large\-scale, crowdsourced case libraries for democratic innovations\(Fung and Warren,[2011](https://arxiv.org/html/2605.16827#bib.bib46); Landry and von Lieres,[2022](https://arxiv.org/html/2605.16827#bib.bib47)\)\. Our atlas draws inspiration from this infrastructural model, but differs in being AI\-specific and in encoding fields that broader participation repositories typically do not, including technical modality, the locus of participation within the AI lifecycle, evidence of decision influence, provenance links, and sensitivity controls such as redaction or restricted disclosure\.

Finally, systematic mapping has established protocols in evidence synthesis, specifying search strategies, inclusion criteria, and extraction templates\(Kitchenham and Charters,[2007](https://arxiv.org/html/2605.16827#bib.bib20)\)\. Participatory AI mapping adds complications: conceptual ambiguity, uneven participation across lifecycle stages, and documentation bias or intentional non\-disclosure for sensitive contexts\. We therefore treat the repository itself as an evolving artifact, whose categories and governance must remain revisable as new cases, critiques, and community feedback emerge\. Taken together, these literatures motivate the design of a documentation infrastructure that enables systematic comparison, critical scrutiny, and ongoing revision of participatory AI claims\.

## 3\.Operational definition and scope

“Participatory AI” is used inconsistently across research and practice, so mapping requires an operational definition that is both extractable and flexible across institutional contexts\. We define a*participatory AI project*as an initiative that meets two conditions: \(1\) it develops, deploys, or maintains an AI\-enabled system or workflow, where AI refers to machine learning or related statistical inference used to automate, support, or structure decisions; and \(2\) it includes a*documented*mechanism through which affected stakeholders, community representatives, or frontline practitioners can influence decisions about problem definition, data governance, model development, deployment conditions, or oversight\.

This definition excludes projects that only use public data or conduct user testing without decision influence, and it distinguishes concrete participation mechanisms from aspirational ethical claims\. Consistent with participation research, the key criterion is the existence of decision points where stakeholder input can shape outcomes\(Arnstein,[1969](https://arxiv.org/html/2605.16827#bib.bib1); Fung,[2006](https://arxiv.org/html/2605.16827#bib.bib5)\)\. Participation is treated as a feature of governance arrangements rather than of any single workshop, interface, or engagement activity\. This boundary also distinguishes participation*for*AI from AI*for*participation\. We focus on the former\. We include the latter only when public documentation shows that the AI system itself was shaped or governed through participatory mechanisms\.

Because documentation practices vary by sector, participation may be visible through different traces\. Public algorithm registers can provide feedback channels and structured templates\(City of Helsinki,[2020](https://arxiv.org/html/2605.16827#bib.bib27); UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28)\)\. Citizen science platforms often operationalize participation via volunteer data collection and distributed validation\(Bonneyet al\.,[2009](https://arxiv.org/html/2605.16827#bib.bib31)\)\. Indigenous language technologies may center participation through community authority over data licensing and use conditions\(Carrollet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib23); Jones,[2025](https://arxiv.org/html/2605.16827#bib.bib25)\)\. The repository therefore stores participation as a harmonized curatorial field linked to provenance, verification status, and review state, so that inclusions can be audited rather than taken as self\-evident\.

We bound the mapping in three ways\.Temporal scope:the current analysis uses the early 2026 release snapshot, with known project start years ranging from 2006 to 2026 rather than a fixed intake window\.Geographic scope:global coverage, with location recorded at the granularity supported by public sources\.Substantive scope:emphasis on public, civic, and humanitarian settings\. Projects whose primary contribution is an AI tool for facilitating deliberation, translation, or consultation are only included when the tool’s own design or governance was itself participatory\.

To enable comparison across heterogeneous cases, extraction records five dimensions: participation locus, stakeholder roles, decision influence, documentation artifacts, and institutional setting\. Appendix Table[4](https://arxiv.org/html/2605.16827#A1.T4)lists the public indicators used for each dimension\. These dimensions align the repository with AI lifecycle and governance research while remaining grounded in evidence that can be verified from public documentation\.

Participation mechanisms also cluster into recurring, non\-exclusive families that differ in how influence is exercised and what traces they leave\. Table[1](https://arxiv.org/html/2605.16827#S3.T1)provides the families used as a conceptual guide for extraction and later coding\.

Table 1\.Families of participation mechanisms observed in documented participatory AI initiatives and their typical evidence traces\.This operationalization structures the mapping protocol in Section[4](https://arxiv.org/html/2605.16827#S4)and motivates the governance framework in Section[6](https://arxiv.org/html/2605.16827#S6), which treats documentation and contestability as core features of participatory\-by\-default infrastructures\.

## 4\.Mapping protocol and repository design

This section describes the data sources, the mapping protocol used to construct project records, and the repository and atlas design decisions that support ongoing maintenance\.

### 4\.1\.Data sources

The current release integrates two input streams\. The first is the 95\-project Trustworthy AI corpus reported by Magaña and Shilton\(Magaña and Shilton,[2025](https://arxiv.org/html/2605.16827#bib.bib45)\)\. The second is an audited set of additional cases drawn from research and practice, selected to broaden coverage of governance\-oriented participation mechanisms, including public algorithm registers, volunteer coordination infrastructures, and community\-governed language technologies\.

### 4\.2\.Discovery and vetting protocol

The protocol aims to be reproducible, while acknowledging that search results and web content change over time\. Discovery uses a combination of keyword searches, network expansion, and targeted domain queries\. Keyword searches identify projects that self\-describe using terms such as “co\-design”, “citizen science”, “community governance”, “participatory”, and “public register” in proximity to AI\-related terms\. Network expansion follows links from known civic technology organizations, humanitarian innovation labs, and municipal digital service units\. Targeted queries focus on domains where participation mechanisms have established precedents, including public sector transparency registers and community\-run data stewardship initiatives\. Because the public\-interest dataset and audited searches were conducted primarily through English\- and German\-language web sources, the protocol is better understood as a structured recovery procedure than as a language\-neutral discovery method\. This matters for interpretation: over\-representation of European and North American cases may reflect both underlying activity and differential recoverability through those documentation ecologies\.

Vetting applies the operational definition in Section[3](https://arxiv.org/html/2605.16827#S3)\. A project must have an identifiable AI\-enabled component and a documented participation mechanism\. Documentation must be publicly accessible at the time of extraction and must provide enough information to identify at least a country\-level location and a responsible organization or consortium\. Projects are excluded when participation claims are purely aspirational, when documentation is limited to press releases without operational details, or when the only evidence of participation is passive data collection without stakeholder influence\. Borderline cases are retained when participation can be inferred from governance artifacts, for example when a project record is part of a municipal register with documented oversight or when community data licensing terms are explicitly specified\.

### 4\.3\.Extraction template

For each included project, the repository stores a canonical project name, a primary provenance URL that identifies the source used during extraction, and a set of harmonized metadata fields\. Descriptive attributes include location, lead organization, organization type, activity status, application domain, AI modality, and start and end years where available\. The dataset also codes participation\-specific fields, including participation tier, participants, participation methods, and AI lifecycle stages\. Evidence fields include up to three corroborating source URLs, free\-text participatory evidence notes, verification status, evidence grade, and review status\. Extraction therefore distinguishes descriptive codings from the documentary basis on which those codings rest\.

Harmonization resolves differences in naming and formatting across sources\. Name normalization resolves superficial differences such as whitespace and punctuation\. Country names are normalized to reduce duplication caused by variant spellings\. Where the same initiative appears in multiple sources, the repository retains one canonical record and preserves alternate links for audit\. The repository also retains completeness information for fields that remain uneven across public documentation, notably partner organizations, end years, and multi\-source corroboration\.

### 4\.4\.Geocoding, location resolution, and data sensitivity

Geocoding is performed using available location information in project documentation\. When a city and country are available, the record is geocoded to a representative point coordinate for that locality\. When only a country is available, the record is geocoded to a country\-level reference coordinate and flagged accordingly\. In the current release, 130 of 131 records have mappable coordinates; one global record is intentionally left ungeocoded\. Coordinates are treated as approximate anchors for atlas visualization, not as precise operational sites\. This approach avoids implying that participation occurs uniformly within a city or country, and it limits the risk of exposing sensitive operational locations in humanitarian and conservation contexts\.

The repository does not store personal data about individual participants\. Stakeholder roles are recorded at the level of organizations or collectives when they are publicly documented\. For projects involving Indigenous communities or other groups with collective governance requirements, the repository prioritizes documentation that specifies community authority and consent conditions\(Carrollet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib23)\)\. The governance framework in Section[6](https://arxiv.org/html/2605.16827#S6)further proposes mechanisms for redaction and restricted disclosure requests\.

### 4\.5\.Provenance

Provenance tracking is implemented as a mandatory field\. Each record includes at least one URL pointing to the source used for extraction, and many records include additional corroborating links\. The repository is designed for versioned publication with archival snapshots so that analyses and citations can be tied to stable releases\. This design is aligned with documentation practices in algorithm registers, where transparency relies on traceable records that can be contested and revised\(UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28); City of Amsterdam,[2022](https://arxiv.org/html/2605.16827#bib.bib29)\)\. In addition to provenance, the repository stores a change log for edits, including the reason for change and the contributor identity when available\. Releases are intended to be periodic rather than continuously mutating public state, so citations can point to stable snapshots even as corrections accumulate between releases\. All numeric values in Section[5](https://arxiv.org/html/2605.16827#S5)refer to a frozen release snapshot rather than to the continuously updated live atlas\. Alongside each release, we therefore archive the CSV snapshot, schema version, manifest, and derived JSON and GeoJSON artifacts used for the paper’s counts, so that readers can reproduce the reported tables and figures against the same state of the corpus\.

### 4\.6\.Atlas interface

The atlas interface renders the repository as an interactive map with filtering by geography, application domain, organization type, participation tier, and review\-related fields\. It supports drill\-down to project pages that display provenance, evidence signals, and links to documentation artifacts\. A core design decision is to treat each record as a pointer to external documentation rather than as an authoritative representation of project practice\. The atlas therefore emphasizes transparency about what is known, what is missing, and what is contested\. In the current implementation accompanying this paper, each public record exposes provenance, documentation completeness, release version, evidence grade, and review status; stores edit history; and links to archived release artifacts\. Record pages include separate channels for \(i\) correction or dispute submissions, \(ii\) contextual annotations, and \(iii\) redaction or restricted\-disclosure requests\. New records enter through a moderated intake form rather than immediate publication, while schema\-change proposals are handled in a separate governance queue\. These design choices operationalize participation through multiple contribution channels, contestability through record\-linked issue handling, and stewardship through archived releases and disclosure controls\. Figure[1](https://arxiv.org/html/2605.16827#S4.F1)makes these mechanisms concrete\.

Figure[2](https://arxiv.org/html/2605.16827#S4.F2)summarizes the end\-to\-end workflow from discovery to atlas publication and feedback\. The workflow is designed so that external contributors can propose additions and corrections, while maintaining a review process that records disagreements and preserves provenance\.

![Refer to caption](https://arxiv.org/html/2605.16827v1/figures/atlas_interface_composite.png)Figure 1\.Atlas interface showing the explorer view, map markers, and filter controls for the 131\-project release\.A screenshot of the atlas explorer\. A left sidebar contains search and region filters\. The main panel shows a world map with project markers\. A header at the top includes navigation items for the atlas and contribution workflow\.Discoveryand candidate listVettingand inclusion decisionExtractionand harmonizationVersionedrepositoryAtlasinterfacePublic feedbackand correction proposalsQuality checksand audits

Figure 2\.Workflow for constructing and maintaining a living repository and atlas of participatory AI projects\. The feedback loop is treated as part of the infrastructure, not as a post\-publication add\-on\.A flow diagram with boxes labeled Discovery, Vetting, Extraction, Repository, and Atlas connected left to right, with additional boxes for Quality checks and Public feedback feeding into the versioned repository\.The protocol and schema are intended to be extensible\. Sections[5](https://arxiv.org/html/2605.16827#S5)and[6](https://arxiv.org/html/2605.16827#S6)use the corpus to identify documentation gaps and to motivate governance mechanisms that support long\-term maintenance, including stewardship structures, contribution review, and policies for sensitive information\.

## 5\.Empirical findings from the mapped corpus

This section reports empirical patterns in the mapped corpus\. Because participation tier, participants, methods, and AI lifecycle stages are coded across all 131 records, the central questions concern where these codings are concentrated, the strength of the evidence supporting them, and which documentary gaps remain\. When we refer below to low, medium, and high missingness, we mean fields absent in less than 30%, between 30–70%, and more than 70% of records, respectively\. Because the corpus combines existing datasets with audited expansion and interpretive harmonization, the descriptive results reported here should be understood as properties of the dataset rather than unbiased prevalence estimates of the full participatory AI landscape\.

### 5\.1\.Geographic distribution

Figure[3](https://arxiv.org/html/2605.16827#S5.F3)shows the geographic distribution of projects using a combined view of top countries and mapped coordinates\. The corpus spans 36 normalized country labels, alongside global and multi\-country records\. North America and Europe remain the largest regions, with 38 and 27 projects, followed by Africa \(16\), Asia \(11\), Latin America \(7\), and Oceania \(5\)\. Multi\-region and global records add 11 and 16 entries\. The four largest single\-country labels are the United States \(35\), the United Kingdom \(16\), Canada \(13\), and Kenya \(7\), which together account for 71 of 131 projects\.

![Refer to caption](https://arxiv.org/html/2605.16827v1/x1.png)Figure 3\.Geographic distribution of participatory AI records \(131 projects\)\. The bar chart uses normalized country labels; the map shows approximate anchors for the 130 geocoded records\.A combined figure with a horizontal bar chart of the most frequent normalized country labels and a world map with colored project markers by region\. The largest bars are the United States, the United Kingdom, Canada, and Kenya\. Markers cluster most densely in Europe and North America, with additional records across Africa, Asia, Latin America, and Oceania\.North America and Europe together account for 65 of 131 records \(49\.6%\)\. The concentration should therefore not be read as a clean proxy for where participatory AI is most active in absolute terms\. It captures an interaction between underlying activity, institutional visibility, and recoverability through the protocol\. Public algorithm registers and other web\-visible disclosure infrastructures are especially legible in Europe and North America, while cases documented only in other languages or in less indexable formats are less likely to be recovered\. An aggregate regional view of the same pattern is provided in Appendix[A](https://arxiv.org/html/2605.16827#A1)\(Figure[5](https://arxiv.org/html/2605.16827#A1.F5)\)\. The concentration implies that generalizations about participatory AI based on documented projects will over\-represent institutional contexts and governance regimes prevalent in those regions\.

### 5\.2\.Participation tiers, lifecycle loci, and substantive domains

Three tiers dominate the release: community\-led initiatives \(39 projects\), co\-design \(26\), and participatory governance \(22\)\. Together they account for 87 of 131 records\. Public consultation \(6\), participatory audit \(6\), and co\-governance \(3\) appear, but much less frequently\. This distribution matters because it shows that the mapped landscape is not only about interface co\-design; governance\-oriented and community\-led arrangements constitute a substantial share of documented cases\.

The same pattern appears across lifecycle loci\. Counting each stage whenever a project lists more than one, participation is most often documented in problem formulation \(85 projects\), governance \(72\), evaluation \(57\), data collection \(37\), deployment \(33\), and design \(24\)\. By contrast, model training \(16\) and model development \(9\) are relatively rare\. In the current release, public documentation therefore more often locates participation around agenda setting, evaluation, and oversight than around core model\-building stages\. Appendix[A](https://arxiv.org/html/2605.16827#A1)provides the full participation\-tier and lifecycle distributions \(Figures[6](https://arxiv.org/html/2605.16827#A1.F6)and[7](https://arxiv.org/html/2605.16827#A1.F7)\)\.

Substantively, the most frequent exact application\-domain labels concern healthcare, AI governance and trustworthy AI, AI policy and public administration, assistive technology and accessibility, and human rights documentation and accountability\. At the category level, public dialogue and governance, inclusive stakeholder engagement, public\-health mental\-health support, and trustworthy AI co\-design are especially common\. Together, these patterns suggest that participatory AI is documented most densely in settings where legitimacy, accountability, and situated expertise are central to adoption\.

### 5\.3\.Organizational forms, activity status, and temporal profile

Organizational form is heterogeneous, but the release is anchored in public\-interest and collaborative settings\. The most common lead\-organization labels are university/research lab and nonprofit/NGO \(1010each\), standalone university and research consortium \(99each\), university plus civic methodology partner \(88\), and nonprofit/research collaboration \(77\)\. This mix reinforces that many participatory AI initiatives operate through hybrid institutional arrangements rather than clearly bounded firms or agencies\.

Status fields are available per project\. 79 projects are marked active, 19 completed, 15 published case, 9 pilot, 5 funded, and 4 legacy\. Start years are available for 131 projects and range from 2006 to 2026, but the corpus is heavily recent: 124 dated records begin in 2018 or later and 96 in 2021 or later\. The atlas therefore captures not only mature precedents such as registers and long\-running language initiatives, but also a wave of recent governance, deliberation, and auditing efforts\. These dates should not be read as a complete time series of the field, because inclusion still depends on recoverable public documentation rather than exhaustive discovery\.

### 5\.4\.Documentation completeness, verification, and remaining gaps

Table[2](https://arxiv.org/html/2605.16827#S5.T2)summarizes completeness of selected fields\. Participation mode, participants, participation methods, AI lifecycle stages, lead organization, primary provenance URL, and official project URL are coded for all 131 records\. The documented subset includes second corroborating sources for all records\. Remaining gaps are concentrated elsewhere: third sources are present for 112 records, partner organizations for 95, and end years for only 29\. By the thresholds above, third corroborating sources and partner organizations exhibit low missingness, while end years exhibit high missingness\. These gaps matter because they limit inference about institutional durability, coalition structure, and project closure\.

Table 2\.Completeness of selected repository fields \(131 projects\)\.Completeness alone does not settle reliability\. The release differentiates verification status, evidence grade, and review state\. 88 records are live\_verified, 25 indirect\_verified, 11 mixed\_verified, and 7 paper\_verified\. Evidence grades are A for 98 records, B for 8, and C for 25\. The atlas also makes curatorial uncertainty explicit: 70 records are marked core, 37 cautious, and 24 review\_candidate\. Review\-candidate entries disproportionately carry B/C evidence grades \(17 of 24\) and are often coded as participatory governance or public consultation \(9 of 24\), indicating that scope boundary cases cluster in policy\-facing and oversight\-oriented material rather than in the most concrete co\-design cases\.

Figure[4](https://arxiv.org/html/2605.16827#S5.F4)visualizes completeness for selected fields\. Completeness reflects the presence of a non\-empty value and does not assess the specificity or verifiability of the value\.

![Refer to caption](https://arxiv.org/html/2605.16827v1/x2.png)Figure 4\.Completeness of selected atlas fieldsA horizontal bar chart showing completeness percentages for selected fields\. Lead organization, provenance URL, official URL, and participation\-related fields are complete, while partner organizations and end year have substantially lower coverage\.
### 5\.5\.Provenance sources

The corpus draws on documentation hosted across 107 distinct provenance domains, indicating a long tail of organizational websites, research pages, and repositories rather than a small set of durable registries\. The most common domain suffixes in provenance URLs are\.org\(46 records\),\.com\(16\),\.uk\(10\),\.net\(8\), and\.io\(7\)\. This distribution reflects both organizational heterogeneity and the continuing dependence on web pages that may not provide long\-term archival stability\. The long tail of 107 source domains is itself a result: participatory AI is presently documented through a dispersed and often unstable web ecology rather than through a few durable registries or repositories\. This helps explain why a versioned atlas matters\. Without release snapshots and archived provenance, the empirical basis for comparison can disappear as organizational pages change or vanish\. Appendix[A](https://arxiv.org/html/2605.16827#A1)provides a ranked list of the most frequent provenance domains \(Table[5](https://arxiv.org/html/2605.16827#A1.T5)\)\.

### 5\.6\.Implications of curation and bias

The mapped corpus should not be interpreted as a census of participatory AI\. It is a curated release built from recoverable public documentation and harmonized through interpretive coding\. The atlas materially improves participation legibility by coding tier, participants, methods, and lifecycle stage for every record, but it does so through curatorial judgment that remains open to correction\. Records with lower evidence grades or review\-candidate status are therefore not noise; they mark where the boundary between participation for AI and adjacent governance, deliberation, or advocacy infrastructures is hardest to draw\. Documentation bias remains visible in multiple ways\. Language, searchability, funding ecosystems, and disclosure mandates shape recoverability\. Public\-sector registers and English\-language research sites remain easier to surface than informal, non\-indexed, or local\-language initiatives\. These biases motivate the governance requirements in Section[6](https://arxiv.org/html/2605.16827#S6), where the atlas is treated as an infrastructure that must expose not only cases but also uncertainty, disagreement, and the conditions under which records were included\. The atlas is therefore best read as a map of documented legibility\. Cases with disclosure mandates, funder reporting obligations, or dedicated communications staff become easier to recover than equally consequential local initiatives documented through informal, non\-indexed, or non\-English channels\.

## 6\.Design and governance framework

Section[5](https://arxiv.org/html/2605.16827#S5)shows that documentation of participatory AI is uneven\. A living atlas can support accountability only if participation, contestability, and stewardship are treated as core infrastructure properties\. The framework below is therefore a governance contribution\. It treats documentation, contestability, and disclosure control as infrastructural conditions for evaluating participatory AI claims\.

### 6\.1\.From project catalogs to infrastructure stewardship

A participatory AI repository has a reflexive risk: it can reproduce extractive dynamics if communities are merely documented rather than positioned as contributors to how records are interpreted, corrected, and governed\. Public algorithm registers offer a partial model through structured disclosure and feedback channels\(City of Helsinki,[2020](https://arxiv.org/html/2605.16827#bib.bib27); City of Amsterdam,[2022](https://arxiv.org/html/2605.16827#bib.bib29); UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28)\), but participatory AI initiatives often span public agencies, community organizations, and volunteer networks\. The atlas in Figure[1](https://arxiv.org/html/2605.16827#S4.F1)is designed around this premise\. Participation is operationalized through contribution channels; contestability through record\-linked corrections, disputes, and annotations; and stewardship through release governance, provenance logging, and redaction pathways\. The dataset also makes evidentiary uncertainty visible through verification status, evidence grade, and review status fields, so that inclusion does not imply equal evidentiary strength\.

### 6\.2\.Design requirements for a participatory\-by\-default atlas

We propose five requirements: verifiable provenance; minimal viable participation documentation; community rights and licensing; contestability and change tracking; and interoperability with technical documentation artifacts\.

Verifiable provenance\.Each entry should include at least one stable source reference, and the atlas should preserve prior versions for audit and citation\.

Minimal viable participation documentation\.A minimally interpretable record should specify \(i\) lifecycle locus, \(ii\) participants as collectives or organizations, \(iii\) decision points influenced, and \(iv\) the mechanism translating input into decisions\. These criteria establish a minimum documentation threshold rather than a certification of participatory quality\. Records that invoke participation but do not specify locus, mechanism, or decision influence are treated as documentation\-insufficient, rather than being implicitly accepted as participatory\.

Community rights and licensing\.Records should represent data governance and consent conditions when relevant, and communities should be able to request redaction or restricted disclosure for sensitive information\(Carrollet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib23); Kukutai and Taylor,[2016](https://arxiv.org/html/2605.16827#bib.bib24); Jones,[2025](https://arxiv.org/html/2605.16827#bib.bib25)\)\.

Contestability and change tracking\.Because participatory practice is described differently by project teams and stakeholders, the atlas should support issue tracking, structured disagreement, and links to third\-party assessments\(Rajiet al\.,[2020](https://arxiv.org/html/2605.16827#bib.bib18); Reismanet al\.,[2018](https://arxiv.org/html/2605.16827#bib.bib19)\)\. Annotations should remain distinct from direct record edits so that critical commentary and counter\-documentation do not disappear into a revised canonical entry\.

Interoperability with technical artifacts\.Where available, project records should link to datasheets, model cards, and data statements\(Gebruet al\.,[2021](https://arxiv.org/html/2605.16827#bib.bib15); Mitchellet al\.,[2019](https://arxiv.org/html/2605.16827#bib.bib16); Bender and Friedman,[2018](https://arxiv.org/html/2605.16827#bib.bib17)\), while preserving participation metadata that those artifacts usually omit\.

Table[3](https://arxiv.org/html/2605.16827#S6.T3)summarizes governance elements that operationalize these requirements\.

Table 3\.Governance elements for a participatory\-by\-default atlas, operationalized in the current implementation and motivated by prior infrastructures\.
### 6\.3\.Participatory evaluation

An atlas supports evaluation only if it links participation mechanisms to decision outcomes\. Participatory evaluation research treats evaluation criteria as governance because they define success and encode values\(Cousins and Whitmore,[1998](https://arxiv.org/html/2605.16827#bib.bib38)\)\. The atlas can therefore help build shared scrutiny standards without becoming a certification scheme: it does not score participation quality on its own, but it can establish what must minimally be documented if a participatory claim is to be examinable\. Community annotations and linked third\-party audits then provide a way to record disagreement when project self\-description overstates stakeholder influence\.

### 6\.4\.Implementing “living” participation in the atlas

A living atlas requires durable responsibility for maintenance and channels for contribution beyond project owners\. In practice, new records and record edits enter a moderated intake queue with required provenance and minimal participation fields\. Accepted changes are published through periodic releases\. Each release is archived with a manifest and downloadable artifacts so that analyses remain citable\.

Record\-linked corrections and disputes are handled through issue threads tied to the relevant entry\. Annotations are distinct from edits: they can attach contextual notes, missing documentation, or critical commentary without overwriting the source record\. Schema evolution is handled separately from record maintenance through a stewardship review process documented in release notes\. These mechanisms make the atlas “living” in two senses: the corpus can be corrected and expanded, and the schema itself can change in response to critique\.

### 6\.5\.Reducing documentation and coverage bias

Because the atlas inherits the biases of searchable documentation, maintenance must include incentives and partnerships, not only better search queries\. Multilingual submission templates and translation support can lower entry barriers; regional curators or partner networks can audit for missing cases; compensated community review can reduce extractive reporting burdens; and interoperability with public registers or existing repositories can reduce duplicate documentation work\. These steps do not eliminate bias, but they make recoverability an object of stewardship rather than an invisible artifact of search ranking and language privilege\.

### 6\.6\.Case\-based grounding

Appendix Table[6](https://arxiv.org/html/2605.16827#A2.T6)summarizes illustrative cases showing why different participation mechanisms imply different documentation needs\. Across municipal registers, humanitarian pilots, Indigenous language technologies, and community\-facing platforms, the common requirement is a documented link between participation mechanisms and decision influence\.

## 7\.Discussion and limitations

The mapping and governance framework have implications for research, policy, and participatory practice\.

For research, the repository enables comparative analysis of participation mechanisms across domains, jurisdictions, and organizational forms, and supports sampling and systematic reviews that do not depend on a small set of canonical cases\. It also enables work linking participation to evaluation, for example by comparing whether stakeholder\-defined success criteria are documented alongside technical artifacts such as datasheets and model cards\(Gebruet al\.,[2021](https://arxiv.org/html/2605.16827#bib.bib15); Mitchellet al\.,[2019](https://arxiv.org/html/2605.16827#bib.bib16)\)\. Because tier and lifecycle codings are now available across the full release, the main empirical constraint is not the total absence of participation fields, but the labor required to triangulate heterogeneous documentation and to keep those codings auditable over time\.

For policy and governance, the atlas can complement algorithm registers by covering initiatives outside public administrations, including civic and humanitarian projects that still shape public decision making\. The framework suggests that transparency mandates should include participation reporting requirements, not only templates for purpose, data, and oversight\(UK Central Digital and Data Office,[2023](https://arxiv.org/html/2605.16827#bib.bib28)\)\. This helps move discussion of participation\-washing from “was participation claimed?” to “what was documented about locus, mechanism, participants, and decision influence?” That shift does not settle normative adequacy, but it creates a baseline for scrutiny\.

For practice, a living atlas can help communities and developers find related efforts, reuse documentation templates, and reduce duplicated work\. Linking records to datasheets and model cards can support technical transparency, but these artifacts rarely capture decision influence or governance arrangements\(Bender and Friedman,[2018](https://arxiv.org/html/2605.16827#bib.bib17)\)\. The practical implication is that the atlas should accumulate not only cases, but also the conditions for scrutiny: release history, disagreement, third\-party commentary, and documentation of what remains unknown\.

The work has limitations\. The corpus reflects projects with publicly accessible documentation, under\-representing informal, safety\-sensitive, and local\-language initiatives\. Location fields are approximate anchors rather than full operational geographies\. Furthermore, many values are harmonized curatorial summaries rather than verbatim source statements, and they remain sensitive to boundary judgments\. End years are available for only 29 projects and partner organizations for 95, limiting inference about project closure and coalition structure\. Start years are available across the documented subset, but start\-year values still reflect what public sources make comparable rather than a field\-wide timeline\. The atlas has also not yet been validated through formal user studies or through structured consultation with researchers and practitioners in underrepresented regions to determine which cases remain invisible because they are documented differently rather than not documented at all\.

Addressing these limits will require consultation with regional research and practice networks, multilingual intake and review workflows, compensated community curation, crosswalks to existing public registers and repositories, and future interface evaluation with the atlas’s intended users\. These are governance tasks, not ancillary improvements\.

## 8\.Conclusion

Participatory approaches to AI are increasingly documented in public, civic, and humanitarian settings, yet their distribution and implementation remain difficult to assess from scattered public materials\. This paper reported the construction of a versioned repository and interactive atlas of 131 participatory AI projects\. We specified a reproducible mapping protocol, reported empirical patterns in geography, participation tiers, lifecycle loci, organizational forms, and evidence status, and identified the remaining documentary gaps that constrain comparison\. Across the release, documented initiatives remain geographically concentrated, while participation is most often located in problem formulation, evaluation, and governance rather than model development or training\. The central problem, in other words, is not only locating participatory AI initiatives, but documenting them in ways that keep claims about participation open to scrutiny, correction, and reuse\.

To address these gaps, we proposed a design and governance framework for participatory\-by\-default AI infrastructures\. The framework treats provenance, contestability, community rights, and interoperability with technical documentation as core infrastructure requirements\. A living atlas can support research, policy learning, and collaboration only if it is itself governed as a participatory artifact, with mechanisms for contribution, annotation, and dispute resolution\. The repository and atlas provide an initial basis for such work and can be extended through community\-led stewardship and alignment with emerging transparency standards\.

### Generative AI usage

Generative AI tools were used to assist with language editing and grammar\. No generative AI system was used to create the dataset or generate the empirical counts reported in the paper\.

## References

- S\. R\. Arnstein \(1969\)A ladder of citizen participation\.Journal of the American Institute of Planners35\(4\),pp\. 216–224\.External Links:[Document](https://dx.doi.org/10.1080/01944366908977225)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p2.1),[§3](https://arxiv.org/html/2605.16827#S3.p2.1)\.
- E\. M\. Bender and B\. Friedman \(2018\)Data statements for natural language processing: toward mitigating system bias and enabling better science\.Transactions of the Association for Computational Linguistics6,pp\. 587–604\.External Links:[Document](https://dx.doi.org/10.1162/tacl%5Fa%5F00041)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p6.1),[§7](https://arxiv.org/html/2605.16827#S7.p4.1)\.
- A\. Berditchevskaia, E\. Malliaraki, and K\. Peach \(2021\)Participatory AI for humanitarian innovation\.Note:Nesta Briefing PaperAccessed 2026\-01\-14External Links:[Link](https://media.nesta.org.uk/documents/Nesta_Participatory_AI_for_humanitarian_innovation_Final.pdf)Cited by:[Table 6](https://arxiv.org/html/2605.16827#A2.T6.1.3.2.1.1.1),[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[§1](https://arxiv.org/html/2605.16827#S1.p8.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.4.3.3.1.1)\.
- A\. Birhane, W\. Isaac, V\. Prabhakaran, M\. Diaz, M\. C\. Elish, I\. Gabriel, and S\. Mohamed \(2022\)Power to the people? opportunities and challenges for participatory AI\.InProceedings of the 2nd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization,EAAMO ’22,New York, NY, USA\.External Links:[Document](https://dx.doi.org/10.1145/3551624.3555290),[Link](https://doi.org/10.1145/3551624.3555290),ISBN 9781450394772Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- R\. Bonney, C\. B\. Cooper, J\. Dickinson, S\. Kelling, T\. Phillips, K\. V\. Rosenberg, and J\. Shirk \(2009\)Citizen science: a developing tool for expanding science knowledge and scientific literacy\.BioScience59\(11\),pp\. 977–984\.External Links:[Document](https://dx.doi.org/10.1525/bio.2009.59.11.9)Cited by:[§3](https://arxiv.org/html/2605.16827#S3.p3.1)\.
- G\. C\. Bowker and S\. L\. Star \(1999\)Sorting things out: classification and its consequences\.MIT Press,Cambridge, MA\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p8.1),[§2](https://arxiv.org/html/2605.16827#S2.p5.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.5.4.3.1.1)\.
- S\. R\. Carroll, I\. Garba, O\. L\. Figueroa\-Rodríguez, J\. Holbrook, R\. Lovett, S\. Materechera, M\. Parsons, K\. Raseroka, D\. Rodriguez\-Lonebear, R\. Rowe, R\. Sara, J\. D\. Walker, J\. Anderson, and M\. Hudson \(2020\)The CARE principles for indigenous data governance\.Data Science Journal19\(1\),pp\. 43\.External Links:[Document](https://dx.doi.org/10.5334/dsj-2020-043),[Link](https://doi.org/10.5334/dsj-2020-043)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p8.1),[§3](https://arxiv.org/html/2605.16827#S3.p3.1),[§4\.4](https://arxiv.org/html/2605.16827#S4.SS4.p2.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p4.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.7.6.3.1.1)\.
- City of Amsterdam \(2022\)Grip on algorithms: the algorithm lifecycle approach\.Note:GuidelineAccessed 2026\-01\-14External Links:[Link](https://assets.amsterdam.nl/publish/pages/1053010/playbook_algorithms.pdf)Cited by:[Table 6](https://arxiv.org/html/2605.16827#A2.T6.1.2.1.1.1.1),[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[§2](https://arxiv.org/html/2605.16827#S2.p6.1),[§4\.5](https://arxiv.org/html/2605.16827#S4.SS5.p1.1),[§6\.1](https://arxiv.org/html/2605.16827#S6.SS1.p1.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.2.1.3.1.1)\.
- City of Helsinki \(2020\)AI register\.Note:Municipal websiteAccessed 2026\-01\-14External Links:[Link](https://ai.hel.fi/en/ai-register/)Cited by:[Table 6](https://arxiv.org/html/2605.16827#A2.T6.1.2.1.1.1.1),[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[§2](https://arxiv.org/html/2605.16827#S2.p6.1),[§3](https://arxiv.org/html/2605.16827#S3.p3.1),[§6\.1](https://arxiv.org/html/2605.16827#S6.SS1.p1.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.3.2.3.1.1)\.
- S\. Costanza\-Chock \(2020\)Design justice: community\-led practices to build the worlds we need\.MIT Press,Cambridge, MA\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- J\. B\. Cousins and E\. Whitmore \(1998\)Framing participatory evaluation\.New Directions for Evaluation1998\(80\),pp\. 5–23\.External Links:[Document](https://dx.doi.org/10.1002/ev.1114)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.3](https://arxiv.org/html/2605.16827#S6.SS3.p1.1)\.
- C\. D’Ignazio and L\. F\. Klein \(2020\)Data feminism\.MIT Press,Cambridge, MA\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- F\. Delgado, S\. Yang, M\. Madaio, and Q\. Yang \(2023\)The participatory turn in AI design: theoretical foundations and the current state of practice\.InProceedings of the 3rd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization,EAAMO ’23,New York, NY, USA\.External Links:[Document](https://dx.doi.org/10.1145/3617694.3623261),[Link](https://doi.org/10.1145/3617694.3623261),ISBN 9798400703812Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- Digital Forum Lab and Eurocities \(2022\)Algorithmic transparency standard\.Note:Accessed 2026\-02\-16External Links:[Link](https://standard.algorithmictransparency.org/)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[§2](https://arxiv.org/html/2605.16827#S2.p6.1)\.
- V\. Eubanks \(2018\)Automating inequality: how high\-tech tools profile, police, and punish the poor\.St\. Martin’s Press,New York, NY\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p2.1)\.
- B\. Friedman and D\. G\. Hendry \(2019\)Value sensitive design: shaping technology with moral imagination\.MIT Press,Cambridge, MA\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1)\.
- A\. Fung and M\. E\. Warren \(2011\)The participedia project: an introduction\.International Public Management Journal14\(3\),pp\. 341–362\.External Links:[Document](https://dx.doi.org/10.1080/10967494.2011.618309)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p7.1)\.
- A\. Fung \(2006\)Varieties of participation in complex governance\.Public Administration Review66\(s1\),pp\. 66–75\.External Links:[Document](https://dx.doi.org/10.1111/j.1540-6210.2006.00667.x)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p2.1),[§3](https://arxiv.org/html/2605.16827#S3.p2.1)\.
- T\. Gebru, J\. Morgenstern, B\. Vecchione, J\. W\. Vaughan, H\. Wallach, H\. Daumé III, and K\. Crawford \(2021\)Datasheets for datasets\.Communications of the ACM64\(12\),pp\. 86–92\.External Links:[Document](https://dx.doi.org/10.1145/3458723),[Link](https://dl.acm.org/doi/10.1145/3458723)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p6.1),[§7](https://arxiv.org/html/2605.16827#S7.p2.1)\.
- L\. Irani \(2019\)Chasing innovation: making entrepreneurial citizens in modern india\.Princeton University Press,Princeton, NJ\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- P\. Jones \(2025\)Kaitiaki: closing the door on open indigenous data\.International Journal on Digital Libraries26\(1\),pp\. 1\.External Links:[Document](https://dx.doi.org/10.1007/s00799-025-00410-2),[Link](https://link.springer.com/article/10.1007/s00799-025-00410-2)Cited by:[Table 6](https://arxiv.org/html/2605.16827#A2.T6.1.4.3.1.1.1),[§3](https://arxiv.org/html/2605.16827#S3.p3.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p4.1)\.
- A\. Kawakami, A\. Coston, H\. Zhu, H\. Heidari, and K\. Holstein \(2024\)The situate ai guidebook: co\-designing a toolkit to support multi\-stakeholder, early\-stage deliberations around public sector ai proposals\.InProceedings of the 2024 CHI Conference on Human Factors in Computing Systems,New York, NY, USA,pp\. 1–22\.External Links:[Document](https://dx.doi.org/10.1145/3613904.3642849)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- B\. Kitchenham and S\. Charters \(2007\)Guidelines for performing systematic literature reviews in software engineering\.Technical reportTechnical ReportEBSE\-2007\-01,Keele University and Durham University Joint Report\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p8.1)\.
- R\. Kitchin \(2017\)Thinking critically about and researching algorithms\.Information, Communication & Society20\(1\),pp\. 14–29\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p2.1)\.
- T\. Kukutai and J\. Taylor \(Eds\.\) \(2016\)Indigenous data sovereignty: toward an agenda\.ANU Press,Canberra\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p8.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p4.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.7.6.3.1.1)\.
- J\. Landry and B\. von Lieres \(2022\)Strengthening democracy through knowledge mobilization: Participedia – a citizen\-led global platform for transformative and democratic learning\.Journal of Transformative Education20\(3\),pp\. 206–225\.External Links:[Document](https://dx.doi.org/10.1177/15413446221103191),[Link](https://doi.org/10.1177/15413446221103191)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p7.1)\.
- G\. Leoni, L\. Steven, K\. Tūreiti, K\. Mahelona, P\. Jones, and S\. Duncan \(2024\)Solving failure modes in the creation of trustworthy language technologies\.InProceedings of the 3rd Annual Meeting of the Special Interest Group on Under\-resourced Languages @ LREC\-COLING 2024,M\. Melero, S\. Sakti, and C\. Soria \(Eds\.\),Torino, Italia,pp\. 325–330\.External Links:[Link](https://aclanthology.org/2024.sigul-1.39/)Cited by:[Table 6](https://arxiv.org/html/2605.16827#A2.T6.1.4.3.1.1.1)\.
- M\. I\. Magaña and K\. Shilton \(2025\)Frameworks, methods and shared tasks: connecting participatory ai to trustworthy ai through a systematic review of global projects\.InProceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency,New York, NY, USA,pp\. 2166–2179\.External Links:[Document](https://dx.doi.org/10.1145/3715275.3732148)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p5.1),[§2](https://arxiv.org/html/2605.16827#S2.p7.1),[§4\.1](https://arxiv.org/html/2605.16827#S4.SS1.p1.1)\.
- P\. Meier \(2015\)Digital humanitarians: how big data is changing the face of humanitarian response\.CRC Press,Boca Raton, FL\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.4.3.3.1.1)\.
- M\. Mitchell, S\. Wu, A\. Zaldivar, P\. Barnes, L\. Vasserman, B\. Hutchinson, E\. Spitzer, I\. D\. Raji, and T\. Gebru \(2019\)Model cards for model reporting\.InProceedings of the Conference on Fairness, Accountability, and Transparency,New York, NY, USA,pp\. 220–229\.External Links:[Document](https://dx.doi.org/10.1145/3287560.3287596)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p6.1),[§7](https://arxiv.org/html/2605.16827#S7.p2.1)\.
- R\. Mushkani, H\. Berard, A\. Cohen, and S\. Koseki \(2025\)Position: the right to AI\.InProceedings of the 42nd International Conference on Machine Learning,A\. Singh, M\. Fazel, D\. Hsu, S\. Lacoste\-Julien, F\. Berkenkamp, T\. Maharaj, K\. Wagstaff, and J\. Zhu \(Eds\.\),Proceedings of Machine Learning Research, Vol\.267,pp\. 81876–81896\.External Links:[Link](https://proceedings.mlr.press/v267/mushkani25b.html)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1)\.
- R\. Mushkani \(2026\)Measuring what matters: the AI pluralism index\.Note:Proceedings of the International Association for Safe & Ethical AI \(IASEAI\), 2026External Links:[Document](https://dx.doi.org/10.48550/arXiv.2510.08193),[Link](https://doi.org/10.48550/arXiv.2510.08193)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1)\.
- T\. Nabatchi \(2012\)Putting the “public” back in public values research: designing participation to identify and respond to values\.Public Administration Review72\(5\),pp\. 699–708\.External Links:[Document](https://dx.doi.org/10.1111/j.1540-6210.2012.02544.x)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p2.1)\.
- New York City Automated Decision Systems Task Force \(2019\)Automated decision systems task force report\.Note:ReportAccessed 2026\-01\-14External Links:[Link](https://www.nyc.gov/site/adstaskforce/index.page)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p6.1)\.
- I\. D\. Raji, A\. Smart, R\. N\. White, M\. Mitchell, T\. Gebru, B\. Hutchinson, J\. Smith\-Loud, D\. Theron, and P\. Barnes \(2020\)Closing the AI accountability gap: defining an end\-to\-end framework for internal algorithmic auditing\.InProceedings of the 2020 Conference on Fairness, Accountability, and Transparency,New York, NY, USA,pp\. 33–44\.External Links:[Document](https://dx.doi.org/10.1145/3351095.3372873)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p5.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.3.2.3.1.1)\.
- D\. Reisman, J\. Schultz, K\. Crawford, and M\. Whittaker \(2018\)Algorithmic impact assessments: a practical framework for public agency accountability\.Note:AI Now Institute ReportAccessed 2026\-01\-14External Links:[Link](https://ainowinstitute.org/wp-content/uploads/2023/04/aiareport2018.pdf)Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p4.1),[§6\.2](https://arxiv.org/html/2605.16827#S6.SS2.p5.1)\.
- T\. Robertson and J\. Simonsen \(2012\)Participatory design: an introduction\.InRoutledge International Handbook of Participatory Design,J\. Simonsen and T\. Robertson \(Eds\.\),pp\. 1–17\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1)\.
- D\. Schuler and A\. Namioka \(Eds\.\) \(1993\)Participatory design: principles and practices\.Lawrence Erlbaum Associates,Hillsdale, NJ\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p2.1)\.
- A\. D\. Selbst, D\. Boyd, S\. A\. Friedler, S\. Venkatasubramanian, and J\. Vertesi \(2019\)Fairness and abstraction in sociotechnical systems\.InProceedings of the Conference on Fairness, Accountability, and Transparency,New York, NY, USA,pp\. 59–68\.External Links:[Document](https://dx.doi.org/10.1145/3287560.3287598)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p2.1),[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- J\. Simonsen and T\. Robertson \(Eds\.\) \(2012\)Routledge international handbook of participatory design\.Routledge,New York, NY\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p2.1)\.
- M\. Sloane, E\. Moss, O\. Awomolo, and L\. Forlano \(2022\)Participation is not a design fix for machine learning\.InProceedings of the 2nd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization,EAAMO ’22,New York, NY, USA\.External Links:[Document](https://dx.doi.org/10.1145/3551624.3555285),[Link](https://doi.org/10.1145/3551624.3555285),ISBN 9781450394772Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p1.1),[§2](https://arxiv.org/html/2605.16827#S2.p3.1)\.
- S\. L\. Star and K\. Ruhleder \(1996\)Steps toward an ecology of infrastructure: design and access for large information spaces\.Information Systems Research7\(1\),pp\. 111–134\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p5.1)\.
- S\. L\. Star \(1999\)The ethnography of infrastructure\.American Behavioral Scientist43\(3\),pp\. 377–391\.Cited by:[§2](https://arxiv.org/html/2605.16827#S2.p5.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.5.4.3.1.1)\.
- UK Central Digital and Data Office \(2023\)Algorithmic transparency recording standard\.Note:GOV\.UK guidanceAccessed 2026\-01\-14External Links:[Link](https://www.gov.uk/government/collections/algorithmic-transparency-recording-standard-hub)Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p3.1),[§2](https://arxiv.org/html/2605.16827#S2.p6.1),[§3](https://arxiv.org/html/2605.16827#S3.p3.1),[§4\.5](https://arxiv.org/html/2605.16827#S4.SS5.p1.1),[§6\.1](https://arxiv.org/html/2605.16827#S6.SS1.p1.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.2.1.3.1.1),[Table 3](https://arxiv.org/html/2605.16827#S6.T3.1.5.4.3.1.1),[§7](https://arxiv.org/html/2605.16827#S7.p3.1)\.
- M\. Wieringa \(2020\)What to account for when accounting for algorithms: a systematic literature review on algorithmic accountability\.InProceedings of the 2020 Conference on Fairness, Accountability, and Transparency,New York, NY, USA,pp\. 1–18\.Cited by:[§1](https://arxiv.org/html/2605.16827#S1.p2.1)\.

## Appendix ASupplementary figures and tables

This appendix collects supplementary operationalization details, distributions, and ranked domain summaries that are useful for interpretation\.

### A\.1\.Operationalization reference table

Table 4\.Participation\-related dimensions used in the repository schema\. The mapping emphasizes evidence that can be verified from public documentation\.
### A\.2\.Regional, participation, and lifecycle distributions

![Refer to caption](https://arxiv.org/html/2605.16827v1/x3.png)Figure 5\.Projects by mapped region in the March 2026 release\.A horizontal bar chart showing counts by region\. North America and Europe are the largest regional groups, followed by global, Africa, multi\-region, Asia, Latin America, and Oceania\.![Refer to caption](https://arxiv.org/html/2605.16827v1/x4.png)Figure 6\.Distribution of participation tiers across the March 2026 release\.A horizontal bar chart showing counts by participation tier\. Community\-led, co\-design, and participatory governance are the largest categories\.![Refer to caption](https://arxiv.org/html/2605.16827v1/x5.png)Figure 7\.Most frequently coded AI lifecycle stages, counting projects that list more than one stage\.A horizontal bar chart showing the most common lifecycle stages\. Problem formulation is the largest category, followed by governance, evaluation, data collection, deployment, and design, while model training and model development are much smaller\.
### A\.3\.Provenance domains

Table 5\.Most frequent provenance domains in the corpus \(top ten\)\.

## Appendix BIllustrative cases for grounding governance requirements

Table 6\.Illustrative cases used to ground governance requirements for a living participatory AI atlas\.

Similar Articles

Democratic inputs to AI grant program: lessons learned and implementation plans

OpenAI Blog

OpenAI announces completion of its Democratic Inputs to AI grant program, selecting 10 teams from nearly 1,000 applications across 113 countries to explore participatory approaches to AI governance. The program produced diverse projects spanning video deliberation, crowdsourced audits, and representation guarantees, with OpenAI releasing the teams' code and documentation.

Democratic inputs to AI

OpenAI Blog

OpenAI is launching a grant program to fund teams worldwide in developing proof-of-concept democratic processes for determining the rules and behaviors AI systems should follow, aiming to ensure diverse public input into AI governance. The initiative is funded by the OpenAI non-profit and results will be freely accessible.

Moving AI governance forward

OpenAI Blog

OpenAI publishes AI governance recommendations committing companies to internal and external red-teaming for safety risks, information sharing on emerging capabilities, and mechanisms for detecting AI-generated audio and visual content.

Insights from global conversations

OpenAI Blog

OpenAI reports on a four-week global tour across 25 cities in 6 continents led by CEO Sam Altman to gather feedback from users, developers, and policymakers. The company commits to improving product accessibility, developing governance practices for foundation models, and unlocking AI's benefits through literacy and creator support.