Metis AI: The Overlooked Middle Zone Between AI-Native and World-Movers

arXiv cs.AI Papers

Summary

This paper identifies a class of digital tasks called Metis AI that resist reliable AI automation due to institutional, social, and normative entanglements, and proposes centaur architectures where humans lead and AI supports as the appropriate design response.

arXiv:2605.14407v1 Announce Type: new Abstract: The dominant discourse on AI limitations frames the boundary of AI capability as a divide between digital tasks (where AI excels) and physical tasks (where embodiment is required). We argue this framing misses the most consequential boundary: the one within digital tasks. We identify a class of tasks we call Metis AI, named for the Greek concept of metis (practical, contextual knowledge), that are performed entirely on computers yet resist reliable AI automation. These tasks are not computationally intractable; they are institutionally, socially, and normatively entangled in ways that defeat algorithmic approaches. We distinguish constitutive metis (knowledge destroyed by the act of formalization) from operational metis (system-specific familiarity that automation can progressively absorb), and propose five structural characteristics that define the Metis AI zone: consequential irreversibility, relational irreducibility, normative open texture, adversarial co-evolution, and accountability anchoring. We ground each in established theory from across the social sciences, philosophy, and humanitarian practice, argue that these characteristics are properties of the tasks themselves rather than limitations of current models, and show that the appropriate design response is not better automation but centaur architectures in which humans lead and AI supports.
Original Article
View Cached Full Text

Cached at: 05/15/26, 06:24 AM

# Metis AI: The Overlooked Middle Zone Between AI-Native and World-Movers
Source: [https://arxiv.org/html/2605.14407](https://arxiv.org/html/2605.14407)
###### Abstract

The dominant discourse on AI limitations frames the boundary of AI capability as a divide between digital tasks \(where AI excels\) and physical tasks \(where embodiment is required\)\. We argue this framing misses the most consequential boundary: the one*within*digital tasks\. We identify a class of tasks we callMetis AI, named for the Greek concept of*metis*\(practical, contextual knowledge\), that are performed entirely on computers yet resist reliable AI automation\. These tasks are not computationally intractable; they are*institutionally*,*socially*, and*normatively*entangled in ways that defeat algorithmic approaches\. We distinguish*constitutive metis*\(knowledge destroyed by the act of formalization\) from*operational metis*\(system\-specific familiarity that automation can progressively absorb\), and propose five structural characteristics that define the Metis AI zone: consequential irreversibility, relational irreducibility, normative open texture, adversarial co\-evolution, and accountability anchoring\. We ground each in established theory from across the social sciences, philosophy, and humanitarian practice, argue that these characteristics are properties of the tasks themselves rather than limitations of current models, and show that the appropriate design response is not better automation but*centaur architectures*in which humans lead and AI supports\.

## 1Introduction: The Overlooked Middle

Consider three tasks, all performed on computers:

1. 1\.Classifying emails as spam\.A pattern\-recognition problem with clear ground truth, abundant training data, and low stakes for individual errors\. AI handles this natively\.
2. 2\.Planning cancer treatment for a 72\-year\-old patient with stage III lung cancer, two comorbidities, and a family divided over goals of care\.The oncologist reads the same CT scans and lab results on the same screen as an AI classifier\. But the task demands integrating clinical evidence with patient values, navigating family disagreement over aggressive versus palliative care, judging acceptable risk under normative uncertainty, and committing to a treatment course whose consequences: toxicity, quality of life, survival, cannot be undone\. AI cannot do this\.
3. 3\.Performing surgery on a human heart\.This requires physical embodiment: hands, instruments, haptic feedback\. AI cannot do this \(at least for now\) either, but for an entirely different reason\.

Tasks 1 and 3 are well understood\. Task 1 belongs to what we callAI\-Nativeterritory: well\-structured digital tasks with clear success metrics where AI systems reliably match or exceed human performance: if it’s not yet, eventually AI will be\. Task 3 belongs to theWorld\-Movers: tasks requiring physical intervention in the material world, where embodiment, irreversibility of physical action, and the reality gap between simulation and the real world present fundamental barriers, although such barrier is diminishing\. The limitations of AI in the World\-Mover domain have been extensively studied under the rubric of Moravec’s paradox, the sim\-to\-real gap, and embodied cognition\(Moravec,[1988](https://arxiv.org/html/2605.14407#bib.bib28); Brooks,[1991](https://arxiv.org/html/2605.14407#bib.bib13); Pfeifer and Bongard,[2006](https://arxiv.org/html/2605.14407#bib.bib35)\)\.

Task 2 is different\. It is entirely digital\. The oncologist could be reviewing scans, reading pathology reports, and conferencing with the patient’s family all on a screen\. No physical manipulation, no embodiment, no reality gap\. And yet it resists AI automation as stubbornly as surgery does\. Why?

This paper argues that the most consequential and under\-theorized boundary in AI capability is not between digital and physical tasks, but*within*digital tasks themselves\. Between the spam classifier and the oncologist lies a vast middle zone of tasks that are computationally accessible but institutionally, socially, and/or normatively resistant to automation\. Here byinstitutionswe mean the durable structures of law, professional practice, governance, and social norms that organize human action: not individual organizations but the rules and expectations within which organizations operate\. We call this zoneMetis AI—named for the Greek concept of*metis*, the practical, contextual, relational knowledge that these tasks demand and that formalization destroys\.

Our contribution here is to characterize this zone through five structural properties, not of current AI systems, but of the tasks themselves and the human institutions in which they are embedded\. We draw on theory from social science, philosophy, communication studies, and humanitarian practice to argue that these properties are durable: they will not yield to larger models, more data, or better architectures, because they arise from the structure of human social life, not from computational constraints, but is, and will be a central component of AI for a long time\.

## 2Three Zones of AI Capability

First of all, We formally propose a simple tripartite framework for the landscape of tasks:

Table 1:Three zones of AI capability\.Zone 1 \(AI\-Native\)encompasses tasks like text classification, code generation, image recognition, recommendation, and translation\. These tasks have been extensively benchmarked, and AI systems achieve human\-level or superhuman performance on many of them\(Lianget al\.,[2023](https://arxiv.org/html/2605.14407#bib.bib25)\)\. They share a common structure \(one of many of the following\): the problem can be fully specified in information space, success is measurable against a well\-defined ground truth, and abundant digital training data exists\.

Zone 3 \(World\-Movers\)encompasses surgery, construction, caregiving, and other tasks requiring physical manipulation of the material world\. The limitations here are well understood and discussed: Moravec’s paradox, the reality gap, and the 120,000×\\timesdata gap between robotics and language models\(Firooziet al\.,[2025](https://arxiv.org/html/2605.14407#bib.bib19)\)\. We do not address Zone 3 further; it has an extensive literature\.

Zone 2 \(Metis AI\)is our focus\. These tasks are performed entirely on computers—via email, documents, spreadsheets, video calls, databases, messaging platforms\. The information is digital\. The interfaces are digital\. And yet the tasks resist reliable AI automation\. What do these tasks have in common? They are all computationally accessible\. An AI system can read the relevant documents, process the data, generate analyses\. But they all resist reliable automation\. The bottleneck is not computation\. It is something else\.

## 3The Master Frame: Techne Meets Metis

To understand what makes Metis AI tasks resistant, we draw on a distinction with roots in ancient Greek philosophy and modern social science\.

Techneis the formal, universal, codifiable knowledge, the kind that can be written down in manuals, encoded in algorithms, and transmitted without loss to any competent practitioner\. AI systems are techne machines\. They operate through learned procedures applied to formalized inputs to produce optimized outputs\.

Metisis practical, contextual, relational knowledge that resists formalization\.Scott \([1998](https://arxiv.org/html/2605.14407#bib.bib37)\)gave this contrast its sharpest modern formulation in*Seeing Like a State*: metis is “the kind of knowledge that can come only from practical experience,” and high\-modernist schemes that attempt to replace it with techne, whether through Soviet central planning, forced villagization, or algorithmic optimization, produce catastrophic failures\. Aristotle identified the same distinction as*phronesis*: practical wisdom that cannot be reduced to general rules\. Works inBourdieu \([1977](https://arxiv.org/html/2605.14407#bib.bib12)\)called it*habitus*: the tacit, socially acquired dispositions that govern practice without being \(and possibly not able to be\) consciously articulated\.

The techne/metis distinction has been productively applied in software engineering, whereNolan \([2021](https://arxiv.org/html/2605.14407#bib.bib29)\)argues that knowledge of a specific production system is metis: acquired through on\-call experience and not fully capturable in runbooks\. This is an important observation, but it describes what we calloperational metis: system\-specific familiarity that is difficult to transfer but in principle acquirable by any competent practitioner given sufficient time/effort\. Infrastructure standardization like Kubernetes, service meshes, observability platforms will progressively reduce the operational metis burden, much as “dredging the shipping channel reduces the amount of local piloting knowledge needed”\(Nolan,[2021](https://arxiv.org/html/2605.14407#bib.bib29)\)\.

The metis that defines our Zone 2 is categorically different\. We call itconstitutive metis: knowledge that is not merely difficult to formalize but that*is destroyed by the act of formalization itself*\. When an oncologist sits with a patient and family to plan treatment, the relevant knowledge: how much this patient truly understands, what “quality of life” means to someone who has not spoken it aloud, whether the daughter’s insistence on aggressive treatment reflects the patient’s wishes or her own grief, does not exist prior to and independent of the clinical encounter\. It is*constituted*in the conversation and probably beyond that\. It cannot be extracted, encoded, and reapplied, because it is not a representation of a state of affairs; it is a mode of engagement with a situation that is itself shaped by the engagement\. This is the difference between a pilot who knows a particular harbor and a physician who knows a particular patient: the harbor does not change its channels in response to the pilot’s knowledge of them; the patient does\.

Three further properties distinguish constitutive metis from operational metis:

1. 1\.Reflexivity:In constitutive metis, the knower and the known are entangled\. The physician’s reading of the patient changes the patient’s understanding of their own illness\. The regulator’s interpretation of a statute changes the meaning of the statute\. This is Soro’s work \([1987](https://arxiv.org/html/2605.14407#bib.bib39)\) reflexivity applied beyond financial markets to all domains where the model and the modeled co\-evolve\.
2. 2\.Normative irreducibility:Operational metis involves facts about a system \(this database fails under this load pattern\)\. Constitutive metis involves judgments about values \(what does “fairness” require here? what does “reasonable care” mean in this case?\)\. These are not empirical questions with discoverable answers but normative questions whose resolution requires human deliberation and legitimation\.
3. 3\.Accountability constitution:Operational metis can in principle be delegated: a sufficiently detailed runbook, given enough time, could substitute for experience\. Constitutive metis cannot be delegated because the act of bearing personal responsibility for the judgment*is part of what makes it a judgment*\. A diagnosis signed by a physician and the same diagnosis produced by an algorithm are not the same social act, even if their informational content is identical\.

The claim of this paper is thatMetis AI tasks are tasks that demand constitutive metis, not techne or operational metis\.They cannot be solved by formalization, no matter how sophisticated, because their essential difficulty lies in properties that formalization destroys\. AI can progressively absorb operational metis by learning system\-specific patterns, reducing the need for human familiarity\. But constitutive metis is not on a continuum with techne; it is a different kind of thing\.

The work inWeick \([1979](https://arxiv.org/html/2605.14407#bib.bib42)\)distinguished betweenuncertainty\(the problem is missing information\) andequivocality\(the problem is multiple conflicting interpretations of the same information\)\. AI is powerful at reducing uncertainty: given more data and better models, it finds the signal\. But Metis AI tasks are characterized by equivocality\. The participants are not ignorant; they disagree about what the situation*means*\. More data does not resolve equivocality\. Only collective human sensemaking like negotiation of meaning among parties with different perspectives, interests, and stakes can reduce it\.

This equivocality/uncertainty distinction provides the cleanest single boundary for whether a task is AI\-Native or Metis AI\. If the bottleneck is missing information, AI can help\. If the bottleneck is conflicting interpretations, AI cannot\.

## 4Five Pillars of Metis AI

We further identify five structural characteristics that make digital tasks resistant to AI automation\. Each is independently grounded in established theory in communication, social science, humanity, etc\. and is mutually irreducible\. We provide a distinguishing test for each, demonstrating that it cannot be collapsed into the others\. The most durably AI\-resistant tasks exhibit three or more simultaneously\.

### 4\.1Consequential Irreversibility

Definition:The digital action produces consequences: legal, financial, reputational, institutional, that propagate through human systems and cannot be undone\.

A judge publishes a sentence\. A trader executes a position\. An oncologist commits a patient to a chemotherapy protocol\. A content moderator removes a post during a political crisis\. In each case, the action is digital \(e\.g\., keystrokes on a screen\) but the consequences propagate irreversibly through institutional systems\.

AI systems optimize within defined loss functions\. This works when errors are recoverable\. But when the cost of error is unbounded and unrecoverable, optimization is the wrong paradigm entirely\. Several related works also converge on this point\.

- •Dixit and Pindyck \([1994](https://arxiv.org/html/2605.14407#bib.bib7)\)generalized this logic into a theory of decision\-making under irreversibility: any consequential commitment that is irreversible, uncertain in outcome, and discretionary in timing is analytically equivalent to exercising a financial option\. The act of committing “kills” the option\- will permanently destroy the decision\-maker’s flexibility\. Rationality under irreversibility therefore demands a substantially elevated threshold for action, one that reflects not the expected outcome but the value of what is permanently forfeited by committing\. For AI systems designed to maximize expected utility, this presents an architectural contradiction: the optimization paradigm that makes AI effective for reversible tasks is precisely the paradigm that fails when a single action propagates irreversibly through institutional systems\.
- •Arrow and Fisher \([1974](https://arxiv.org/html/2605.14407#bib.bib6)\)formalized a complementary insight from environmental economics: when a decision is irreversible and its consequences uncertain, there exists a positive “quasi\-option value” in preserving the capacity to act differently as new information emerges\. Standard expected\-value optimization i\.e\., the paradigm that undergirds most AI system design will systematically underweight this value because it treats the decision as a one\-shot calculation rather than an evolving commitment\. The result is a structural bias toward irreversible action: the optimizer commits too early, too aggressively, and without adequate regard for what is foreclosed\.
- •Jonas \([1979](https://arxiv.org/html/2605.14407#bib.bib22)\)proposed a “heuristics of fear”: when consequences are irreversible, the rational posture is to give disproportionate weight to catastrophic outcomes\.

Distinguishing test:A task can exhibit consequential irreversibility without involving any human interaction, normative ambiguity, adversarial dynamics, or formal accountability\. A single keystroke executing a $500 million algorithmic trade involves pure irreversibility with no relational or normative complexity\.

### 4\.2Relational Irreducibility

Definition:The task’s core difficulty is managing human social dynamics: trust, persuasion, coalition\-building, face\-work, emotional calibration, that exist*between*persons and cannot be decomposed into individually modelable states\.

We call this propertyrelational irreducibility, a term drawn from critical realism\(Lawson,[2012](https://arxiv.org/html/2605.14407#bib.bib23)\)and relational sociology, where it denotes the principle that relational wholes cannot be reduced to their component parts\. The following literature reflect this concept as well:

- •Work inHabermas \([1981](https://arxiv.org/html/2605.14407#bib.bib20)\)argued that genuine mutual understanding requires four validity claims: comprehensibility, truth, rightfulness, and*sincerity*: that the speaker means what they say\. AI systems can approximate the first three through pattern\-matching, but they structurally cannot fulfill the sincerity claim because they lack the subjective interiority that grounds it\. An AI can*simulate*sincerity; it cannot*be*sincere\. Thissincerity gapis not a limitation of current models but a consequence of what these systems are\.
- •Goffman \([1967](https://arxiv.org/html/2605.14407#bib.bib18)\)showed that every social encounter is governed by “face\-work”: the mutual protection and negotiation of each participant’s claimed social identity\. Face is not a fixed attribute but an emergent property of the interaction itself: it is constituted in the moment, fragile, and dependent on the other party’s real\-time responses\.
- •Lederach \([2005](https://arxiv.org/html/2605.14407#bib.bib24)\)argued that peacebuilding requires “the moral imagination”: the capacity to imagine oneself in a web of relationships that includes one’s enemies and to act creatively within that web\.

Distinguishing test:A task can be relationally irreducible without being irreversible, normatively ambiguous, adversarial, or formally accountable\. Building consensus among a leadership team about organizational culture via Slack and Zoom involves low stakes, no formal rules to interpret, no adversary, and no legal liability, but the relational dynamics are the entire challenge that AI itself cannot overcome\.

### 4\.3Normative Open Texture

Definition:The task requires applying rules, norms, or values that are inherently vague, evolving, or essentially contested, where the norm does not determine its own application in novel cases\.

This is what the legal philosopherHart \([1961](https://arxiv.org/html/2605.14407#bib.bib21)\)called theopen textureof law: the “penumbra of doubt” surrounding the “core of settled meaning” of any legal concept\.

- •Wittgenstein \([1953](https://arxiv.org/html/2605.14407#bib.bib43)\)showed that rule\-following is a social practice, not a mental computation\. “No course of action could be determined by a rule, because every course of action can be made out to accord with the rule\.” The meaning of a rule in a new case is fixed not by the rule’s text but by the community’s ongoing practice, what Wittgenstein called agreement in “forms of life\.”
- •Gallie \([1956](https://arxiv.org/html/2605.14407#bib.bib16)\)identified a class of concepts he called “essentially contested”: concepts like justice, democracy, fairness, and art whose correct application is permanently and legitimately disputed\. AI requires operationalized definitions to function, but essentially contested concepts resist operationalization by their nature\.
- •Scott \([1998](https://arxiv.org/html/2605.14407#bib.bib37)\)demonstrated the consequences of ignoring this insight\.*High\-modernist*schemes failed catastrophically because they replaced contextual, practice\-based knowledge \(metis\) with universal, formal categories \(techne\)\. AI\-driven rule application is structurally high\-modernist\.

Distinguishing test:A task can exhibit normative open texture without irreversibility, relational complexity, adversarial dynamics, or accountability requirements\. A single individual interpreting whether their personal project meets “reasonable progress” under a grant’s vague terms faces a genuine normative challenge with no other persons involved, nothing irreversible at stake, no adversary, and no legal liability\.

### 4\.4Adversarial Co\-Evolution

Definition:The task environment includes intelligent agents who continuously adapt to exploit the AI’s behavior, creating a non\-convergent arms race where every solution reshapes the problem\.

The formal structure of this problem is captured by the adversarial variant of Goodhart’s Law\(Manheim and Garrabrant,[2019](https://arxiv.org/html/2605.14407#bib.bib27)\)\. Goodhart’s Law states that “when a measure becomes a target, it ceases to be a good measure\.” The adversarial variant involves intentional, adaptive subversion\. No fixed metric can capture a target that moves in response to measurement\. Several works add to this concept as well:

- •Anderson \([2001](https://arxiv.org/html/2605.14407#bib.bib4)\)established that security against adaptive adversaries is fundamentally an economic problem, not merely a technical one: attack is easier than defense, and the asymmetry is structural rather than contingent\. Because adversaries adaptively concentrate effort on whichever point a defense leaves weakest, every fixed security configuration contains its own obsolescence—the defender must protect everywhere while the attacker need only succeed once, and can redirect effort the moment any particular vulnerability is patched\.
- •Dalviet al\.\([2004](https://arxiv.org/html/2605.14407#bib.bib5)\)formalized the core failure mode: standard classifiers assume the data\-generating process is independent of the classifier itself, but in adversarial domains like spam filtering, fraud detection and intrusion detection, this assumption is structurally violated\. Because rational adversaries adapt their strategies to circumvent whatever classifier is deployed, classifier performance degrades rapidly after deployment\. Classification in adversarial settings is not an optimization problem with a stable solution but an ongoing game with no fixed equilibrium\.
- •Biggio and Roli \([2018](https://arxiv.org/html/2605.14407#bib.bib3)\)synthesize a decade of adversarial machine learning research to characterize the “reactive arms race” between learning systems and adaptive adversaries: each deployed model creates the selection pressure for its own circumvention, and each defensive patch redirects rather than eliminates adversarial effort\. Their central conclusion is that purely reactive approaches \(retraining on newly observed attacks\) cannot prevent novel evasion strategies, because the adversary’s adaptation is not random drift but intelligent, targeted exploitation of whatever invariant the current model relies upon\.

Distinguishing test:A task can exhibit adversarial co\-evolution without irreversibility, relational complexity, normative ambiguity, or accountability requirements\. Detecting AI\-generated spam on a low\-stakes hobbyist forum involves no irreversible consequences, no human relationships at stake, no normative interpretation, and no personal liability\. Yet the spammers can and will adapt to every detection method within weeks\.

### 4\.5Accountability Anchoring

Definition:The task requires a specific human to bear personal legal, professional, or fiduciary responsibility for the outcome, not as oversight but as the accountable principal\.

This characteristic is possibly the most common form of metis AI, and is categorically different from the other four\. It is not about task difficulty, cognitive complexity, or the structure of the problem\. It is about*social and legal infrastructure*, about how societies allocate responsibility\. A task can be computationally trivial and entirely within AI’s capability, yet require a human principal because the institutional architecture of law, professional practice, and organizational governance demands a named, answerable party\.

- •Garfinkel \([1967](https://arxiv.org/html/2605.14407#bib.bib17)\)provided the deepest grounding for this claim\. In his ethnomethodological framework, accountability is not a post\-hoc layer added on top of action but is*constitutive*of social action itself\. People make their actions “accountable”: visibly rational, reportable, subject to scrutiny, in the very doing of them\.
- •Santoni de Sio and Mecacci \([2021](https://arxiv.org/html/2605.14407#bib.bib36)\)identified four distinct accountability gaps that arise when AI is inserted into decision chains: culpability gaps, moral accountability gaps, public accountability gaps, and active responsibility gaps\. These gaps are structural, arising from the architecture of AI systems, not from insufficient capability\.

Distinguishing test:A task can require accountability anchoring without irreversibility, relational complexity, normative ambiguity, or adversarial dynamics\. Signing a simple, routine tax return is computationally trivial, normatively clear, involves no negotiation, and faces no adversary, but someone must put their name on it under penalty of perjury\.

#### The interplay among the five pillars

The five pillars are mutually irreducible where each can appear in isolation, as the distinguishing tests demonstrate\. But in practice they cluster and amplify one another\. Consequential irreversibility and accountability anchoring form the tightest pair: societies rarely tolerate irreversible actions without a named responsible party, so the two almost always co\-occur\. Normative open texture and adversarial co\-evolution create a feedback loop: adversarial actors exploit precisely the penumbra of doubt that vague norms create, and each adaptive exploit forces norm reinterpretation, which opens new ambiguity for the next round\. Relational irreducibility intensifies both: norms like “good faith” or “fairness” are interpreted*within*relationships, not in the abstract, and the accountable party is accountable*to*someone: a patient, a defendant, a community\. This clustering explains why the most durably AI\-resistant tasks exhibit three or more pillars simultaneously: each additional pillar closes off a different category of automated workaround, and together they form a reinforcing structure that no single architectural improvement can dissolve\.

## 5Case Studies

We now apply the five\-pillar framework to types of Metis AI tasks, demonstrating how the pillars interact in practice\.

### 5\.1Content Moderation of Politically Contested Speech

A social media platform must decide whether a viral post shared two million times in four hours constitutes “incitement to violence” or “legitimate political protest\.” The post uses ambiguous metaphorical language in a regional dialect\. The political context is a disputed election in a country with a recent history of ethnic violence\. An AI classifier trained on prior moderation decisions flags the post with 73% confidence\.

Pillar analysis:Consequential irreversibility is*high*: removing the post silences a political voice; leaving it up risks contributing to incitement\. Normative open texture is*high*: “incitement to violence” is an essentially contested concept\(Gallie,[1956](https://arxiv.org/html/2605.14407#bib.bib16)\)\. Adversarial co\-evolution is*high*: political operatives and disinformation networks actively study and exploit moderation policies\. Relational irreducibility and accountability anchoring are both*moderate*\.

Metis/techne diagnostic:The classifier reduces uncertainty \(is this post more like previously flagged posts or approved ones?\)\. But the task is dominated by equivocality: the same words can be read as incitement or protest depending on interpretive frame, and the interpretive frame is itself politically contested\.

### 5\.2Humanitarian Aid Allocation Under Competing Claims

A humanitarian organization must allocate limited medical supplies across three refugee camps during a disease outbreak\. An optimization algorithm recommends allocating 60% of supplies to Camp A \(highest infection rate\), 30% to Camp B, and 10% to Camp C\.

Pillar analysis:Consequential irreversibility is*high*: supplies sent to one camp cannot be recalled\. Normative open texture is*high*: the optimization algorithm maximizes lives saved, but Camp C contains a persecuted ethnic minority; deprioritizing it may violate the humanitarian principle of impartiality\. Accountability anchoring is*high*: under the Accountability to Affected Populations framework, the organization must be answerable to the refugees themselves\. Adversarial co\-evolution and relational irreducibility are both*moderate*: local governments inflate casualty numbers, and the allocation must be negotiated with camp leaders whose trust was built through years of relationship\.

Metis/techne diagnostic\.The optimization algorithm operates on*uncertainty*\. But the actual decision is dominated by*equivocality*: what does “need” mean when it intersects with political manipulation, ethnic persecution, and compromised data?

### 5\.3Regulatory Compliance Under Novel Statute

A financial institution must determine whether its new AI\-driven lending product complies with a recently enacted “algorithmic fairness” statute that prohibits “unjustified disparate impact” on protected groups\. The statute provides no definition of “unjustified,” no threshold for “disparate impact,” and no guidance on which of the thirty\-plus proposed fairness metrics to use \(since many are mathematically mutually exclusive\)\.Pillar analysis\.Normative open texture is*high*: “unjustified disparate impact” is an open\-textured legal standard par excellence\.[Wittgenstein](https://arxiv.org/html/2605.14407#bib.bib43)’s rule\-following problem applies directly: the statute does not determine its own application\. Accountability anchoring is*high*: the Chief Compliance Officer must personally certify compliance\. Consequential irreversibility, relational irreducibility, and adversarial co\-evolution are all*moderate*\.Metis/techne diagnostic\.An AI system can compute dozens of fairness metrics and identify statistical disparities\. This reduces*uncertainty*\. But the task is dominated by*equivocality*: which metric matters? What counts as “unjustified”? These questions require legal judgment, institutional knowledge, and political reading: metis that no fairness toolkit can provide\.

## 6Domain Analysis: Where Do Major AI Applications Fall?

By applying three\-zone framework and five\-pillars characterization to the major domains where AI is being deployed at scale \(finance, healthcare, law, and education\), we revealed a consistent pattern: within each domain, some tasks are genuinely AI\-native, while others are deep Metis AI, and the industry discourse routinely conflates the two\. Such conflation potentially create a misleading impression that success in automating bounded, well\-specified tasks implies readiness to automate socially\-embedded tasks that require judgment, accountability, and institutional legitimacy\.

### 6\.1Finance: Is Stock Market Modeling Metis AI?

Finance illustrates the zone distinction with unusual clarity because different financial tasks span the entire spectrum within a single institution\.AI\-Native financial tasks:High\-frequency market\-making, index\-tracking, and options pricing are paradigmatic Zone 1 activities\. They operate on well\-structured numerical data, optimize against explicit objective functions, and succeed or fail against unambiguous metrics\. Algorithmic trading now accounts for roughly 60–70% of U\.S\. equity volume\.Where finance becomes Metis AI:The boundary appears precisely where[Soros](https://arxiv.org/html/2605.14407#bib.bib39)’s \([1987](https://arxiv.org/html/2605.14407#bib.bib39)\)*reflexivity*takes hold\. Soros argued that financial markets are not passive objects to be modeled but reactive systems that change in response to the models applied to them\. This is adversarial co\-evolution in its purest financial form: the model reshapes the market, which invalidates the model\.Danielssonet al\.\([2013](https://arxiv.org/html/2605.14407#bib.bib14)\)formalized this as*endogenous risk*: the insight that risk in financial markets is not exogenous data to be measured but is partly created by the risk models themselves\. When market participants rely on similar models, they act in concert during crises, creating the very volatility they sought to hedge against\. The Knight Capital disaster \(August 2012\) and the 2010 Flash Crash demonstrate the consequences\. Knight Capital lost $460 million in 45 minutes when a dormant algorithm reactivated: a case study in consequential irreversibility in tightly coupled systems\(Perrow,[1984](https://arxiv.org/html/2605.14407#bib.bib34)\)\. The 2010 Flash Crash erased $1 trillion in market value in minutes as algorithmic traders entered a “hot potato” cascade\. Both events were entirely digital, involved no physical embodiment, and yet were profoundly resistant to algorithmic control precisely because the algorithms were participants in the system they were trying to manage\.Credit underwritingsits squarely in the boundary zone\. AI can predict default probability with high accuracy which is uncertainty reduction\. But fair lending law \(ECOA, the Fair Housing Act\) requires that lending decisions avoid “unjustified disparate impact” on protected groups\.Obermeyeret al\.\([2019](https://arxiv.org/html/2605.14407#bib.bib32)\)demonstrated that a widely used healthcare algorithm exhibited racial bias because it used cost as a proxy for need\. The same proxy problem pervades lending: AI models with 1,000\+ input variables routinely use features that correlate with race, creating disparate impact that is normatively indeterminate and legally open\-textured\. Researchers have proven that it is mathematically impossible to satisfy competing definitions of fairness simultaneously, a result that transforms a technical problem into an irreducibly normative one\.

Table 2:Financial tasks across the zone spectrum\.
### 6\.2Healthcare: Is Diagnosis Accuracy Everything?

Healthcare is the domain where the gap between AI’s demonstrated capability and its deployable utility is most revealing\. Over 1,250 AI\-enabled medical devices have been authorized by the FDA as of 2025, the vast majority in radiology\. In narrow diagnostic tasks like detecting diabetic retinopathy from retinal scans, classifying skin lesions, identifying pneumothorax on chest X\-rays, etc\., AI matches or exceeds specialist performance\. These are genuinely AI\-native tasks\.

But diagnosis is not a standalone event\. It is a station in a clinical workflow that includes history\-taking, shared decision\-making, treatment planning, and follow\-up\. These surrounding activities are deeply Metis AI\.

The accuracy\-is\-everything fallacy\.Topol \([2019](https://arxiv.org/html/2605.14407#bib.bib40)\)argued in*Deep Medicine*that AI’s diagnostic accuracy could free physicians to spend more time with patients\. The premise is correct, but the framing obscures a deeper problem: diagnostic accuracy is necessary but not sufficient for clinical value\. IBM Watson for Oncology demonstrated this when it was deployed at multiple cancer centers and produced treatment recommendations that oncologists rejected in up to 50% of cases: not because the recommendations were computationally wrong but because they failed to account for patient comorbidities, local drug availability, cultural treatment preferences, and insurance constraints\. Watson could reduce uncertainty \(what does the evidence say?\); it could not reduce equivocality \(what does this evidence mean for*this*patient?\)\.

Treatment planningis Metis AI because it requires integrating clinical evidence with patient values, family dynamics, resource constraints, and normative judgments about acceptable risk\. As mentioned in the very beginning, a 75\-year\-old patient with early\-stage cancer may prefer watchful waiting; a 35\-year\-old with the same diagnosis may want maximum intervention\. The “right” answer depends on relational knowledge and normative judgment: both irreducible to pattern\-matching\.

End\-of\-life care planningrepresents perhaps the deepest Metis AI task in all of healthcare\. The conversation itself: navigating a family’s grief, interpreting a patient’s values when they can no longer speak for themselves, deciding whether to withdraw life support, is constitutively relational and normatively open\-textured\. The task demands what Aristotle called*phronesis*: practical wisdom applied in the particular case\.

Table 3:Healthcare tasks across the zone spectrum\.
### 6\.3Law: The Gradient from Document Review to Judgment

The legal profession has been a testing ground for AI since at least 2012, when Judge Andrew Peck approved technology\-assisted review \(TAR\) in*Da Silva Moore v\. Publicis Groupe*, establishing that predictive coding could replace manual document review in e\-discovery\. Legal e\-discovery is now substantially AI\-native\. But the legal domain exhibits a sharp gradient from AI\-native document processing to deep Metis AI judicial reasoning\.Judicial sentencing and risk assessment:The COMPAS controversy\(Angwinet al\.,[2016](https://arxiv.org/html/2605.14407#bib.bib9)\)is the canonical example of Metis AI in the legal domain\. COMPAS, a recidivism risk assessment tool, was shown to exhibit racial disparities: Black defendants were nearly twice as likely as White defendants to be falsely labeled high\-risk\. Subsequent formal work showed that, when base rates differ across groups and prediction is imperfect, a risk score cannot simultaneously satisfy calibration/predictive parity and equalized error\-rate conditions\(Chouldechova,[2017](https://arxiv.org/html/2605.14407#bib.bib1); Kleinberget al\.,[2017](https://arxiv.org/html/2605.14407#bib.bib2)\)\. This impossibility result transforms what appeared to be a technical bias problem into an irreducibly normative one: society must*choose*which definition of fairness to prioritize, and that choice cannot be delegated to an algorithm\.

The*State v\. Loomis*\(2016\) decision illustrates accountability anchoring: the Wisconsin Supreme Court upheld the use of COMPAS scores at sentencing but required judges to exercise independent judgment, acknowledging that algorithmic risk scores cannot bear the accountability that criminal sentencing demands\.

Legal argumentation and trial strategyresist automation for relational and adversarial reasons\. The*Mata v\. Avianca*incident \(2023\), in which lawyers submitted AI\-generated briefs containing fabricated case citations, demonstrated a different failure mode: not that AI produced wrong answers, but that its confident simulation of legal reasoning proved indistinguishable from competent legal research, until verification revealed the citations were hallucinated\. The UK Post Office Horizon scandal, in which over 900 sub\-postmasters were wrongly convicted based on faulty automated accounting software, illustrates the catastrophic consequences of treating Metis AI tasks as AI\-native: when an institution trusts algorithmic output as ground truth in a domain that demands human judgment, injustice scales with efficiency\.

Table 4:Legal tasks across the zone spectrum\.
### 6\.4Education: The Banking Model Automated

Education presents perhaps the most philosophically revealing test of the Metis AI framework, because it forces a prior question:*what is education for?*If education is information transfer,[Freire](https://arxiv.org/html/2605.14407#bib.bib15)’s \([1970](https://arxiv.org/html/2605.14407#bib.bib15)\) “banking model,” where teachers deposit knowledge into passive student\-receptacles, then AI tutoring systems are a strict improvement\. Intelligent tutoring systems like Khanmigo \(Khan Academy \+ GPT\-4\) and Duolingo demonstrate genuine AI\-native capability in well\-structured domains: arithmetic, grammar, vocabulary, standardized test preparation\. These tasks belong in Zone 1\.

But if education is what Freire called “problem\-posing”: a dialogical process through which students develop critical consciousness, learn to question assumptions, and become co\-creators of knowledge, then AI tutoring is not merely insufficient but potentially counterproductive\. Research from 2024–2025 documents what scholars term “cognitive offloading”: students who rely heavily on LLMs for writing and research tasks demonstrate poorer reasoning and argumentation skills, focus on a narrower set of ideas, and produce more superficial analyses than those using traditional methods\. AI risks automating and perfecting the banking model, producing students who can retrieve information efficiently but cannot think critically about it\.

Automated essay gradingillustrates the boundary precisely\. AI can reliably grade essays on surface features that correlate with quality in standardized assessments\. But evaluating originality of thought, depth of argumentation, creative risk\-taking, and genuine intellectual engagement requires normative judgments about what constitutes good thinking\. These are essentially contested concepts\(Gallie,[1956](https://arxiv.org/html/2605.14407#bib.bib16)\)\.

Academic integrityis adversarial co\-evolution in its purest educational form\. Every plagiarism detection tool triggers new evasion strategies\. The fundamental question is not technical \(can we detect AI\-generated text?\) but normative \(what does “learning” mean when AI can produce the output?\)\. This question has no algorithmic answer\.

Student advising and mentorshipare relationally irreducible\. A faculty advisor who has worked with a graduate student for years possesses metis about that student that no AI system can acquire, because the knowledge is constituted in the relationship itself\.Biesta \([2009](https://arxiv.org/html/2605.14407#bib.bib11)\)argues that reducing education to measurable “learning outcomes” \(what he calls “learnification”\) strips away the dimensions of education that matter most: socialization and subjectification \(the development of independent agency\)\.

Admissions decisionscombine every Metis AI characteristic simultaneously\. They are consequentially irreversible \(an admissions decision shapes a life trajectory\), relationally complex \(holistic review\), normatively open\-textured \(“merit” is an essentially contested concept, as the*SFFA v\. Harvard*decision demonstrated\), adversarial \(applicants optimize against perceived criteria\), and accountability\-anchored \(institutions bear legal and ethical responsibility\)\.

Table 5:Education tasks across the zone spectrum\.
### 6\.5Cross\-Domain Pattern

Across all four domains, a consistent pattern emerges\. The boundary between Zone 1 and Zone 2 is not random; it tracks the transition from tasks dominated by*uncertainty*\(missing information that more data can supply\) to tasks dominated by*equivocality*\(conflicting interpretations that only human judgment can negotiate\)\. Within each domain:

1. 1\.Pattern recognition and optimizationare AI\-native: reading images, scoring risk, classifying documents, drilling vocabulary\.
2. 2\.The application of recognized patterns to consequential decisions in institutional contextsis Metis AI: the same image read by the same algorithm requires a different response depending on who the patient is, what the legal standard requires, whether the counterparty is gaming the system, and who bears responsibility for getting it wrong\.

The five pillars are not equally distributed across domains\. Finance is dominated by adversarial co\-evolution and consequential irreversibility \(reflexive markets, flash crashes\)\. Healthcare is dominated by relational irreducibility and accountability anchoring \(the physician–patient relationship, malpractice liability\)\. Law is dominated by normative open texture and accountability anchoring \(open\-textured statutes, the requirement for judicial judgment\)\. Education is dominated by normative open texture and relational irreducibility \(contested concepts of merit and learning, the teacher–student relationship\)\. But every domain, at its boundary, exhibits at least three of the five pillars simultaneously—confirming that the boundary zone is not a single\-variable phenomenon but a structural feature of institutional life\.

Figure[1](https://arxiv.org/html/2605.14407#S6.F1)consolidates this pattern into a single view\. Each row is a representative task drawn from the case studies and domain analyses; each column is a pillar\. The shading reflects the intensity of that pillar’s presence \(high,moderate,low/absent\)\. Two patterns are immediately visible: \(1\) Zone 1 tasks have empty rows, while Zone 2 tasks activate three or more pillars; and \(2\) certain pillar pairs—irreversibility with accountability, normative open texture with adversarial co\-evolution—consistently co\-occur, reflecting the reinforcing dynamics discussed in Section[7](https://arxiv.org/html/2605.14407#S7)\.

Figure 1:Task–pillar heatmap across domains\. H = high, M = moderate\. Shading intensity reflects pillar presence\. Zone 1 rows are uniformly light; Zone 2 rows activate three or more pillars, with characteristic domain\-specific clustering\.

## 7Constitutive vs\. Supervisory: Why Human\-in\-the\-Loop Fails

A common response to Metis AI tasks is to propose “human\-in\-the\-loop” \(HITL\) designs: post\-hoc supervisory designs in which the AI produces a substantive recommendation and the human is asked to review, approve, or reject it\. This sounds reasonable\. The evidence suggests it fails\.

Research on automation bias\(Mosier and Skitka,[2021](https://arxiv.org/html/2605.14407#bib.bib30)\)documents a persistent pattern: when AI systems provide recommendations, human overseers tend to accept them uncritically, even when the recommendations are wrong\. This “deep automation bias” is not a matter of individual laziness but a structural consequence of the oversight architecture\. The Verfassungsblog critique\(Schwemer and Koivisto,[2025](https://arxiv.org/html/2605.14407#bib.bib41)\)names this precisely: a “warm body in the loop” that provides the appearance of human control without its substance\.

The problem is not bad implementation\. The problem is a category error\. HITL designs treat human involvement as*supervisory*: a quality\-control layer applied after the AI has done the substantive work\. But in Metis AI tasks, human involvement is not only supervisory; it is*constitutive*\. The human is not only reviewing the AI’s output; the human is*performing the task*: exercising judgment, managing relationships, interpreting norms, bearing accountability, with AI as analytical support\.

Garfinkel \([1967](https://arxiv.org/html/2605.14407#bib.bib17)\)showed that accountability is constitutive of social action: people make their actions “accountable” in the doing of them, not after the fact\.Ostrom \([1990](https://arxiv.org/html/2605.14407#bib.bib33)\)showed that effective governance requires monitors who are embedded in the community, not external oversight systems\.Habermas \([1981](https://arxiv.org/html/2605.14407#bib.bib20)\)showed that communicative legitimacy requires genuine participation, not post\-hoc endorsement\.

These theories converge on the same point: in legitimacy\-sensitive social tasks, accountability, interpretation, and participation are not external checks that can be added after a decision has been generated\. They are part of the activity through which the decision becomes socially valid\. Thus, the higher the Metis content of a task, the less meaningful post\-hoc review becomes, because determining whether the AI’s recommendation is appropriate requires the human to re\-perform the very judgment that the system has already framed as completed\. The reviewer is therefore asked not merely to check an output, but to overturn an entire decision frame under conditions of asymmetrical information and institutional pressure\.

More specifically, the architectural distinction is between:

- •Human\-in\-the\-loop\(supervisory\): AI acts, human reviews\. Appropriate for Zone 1 tasks with occasional edge cases\.
- •Human\-in\-the\-lead\(constitutive\): This is what really happening for Zone 2 tasks: Human acts, AI supports\. The human is the decision\-maker, the relational agent, the norm\-interpreter, and the accountable party\. AI provides data analysis, pattern detection, scenario modeling, and draft generation\. This is what we called the ”centaur” model\.

For Metis AI tasks, the centaur is not a compromise between automation and human control\. It is the architecture that best matches the structure of the task\.

## 8Conclusion: The Boundary Shifts, but the Zone Persists

With the advancement of technology, individual tasks can transition between zones\. Face recognition moved from Zone 3 \(embodied perception\) to Zone 1 \(AI\-native digital classification\)\. Simple content moderation is moving from Zone 2 toward Zone 1\. Simple tax preparation has largely moved from Zone 2 to Zone 1\. In each case, the task became AI\-native when its essential difficulty was reducible to pattern\-matching against well\-labeled data\.

But the Zone 2 category is permanent\. This is because the five characteristics that define it: consequential irreversibility, relational irreducibility, normative open texture, adversarial co\-evolution, and accountability anchoring, are not properties of current AI systems but properties of human institutional life\. As long as human societies produce irreversible consequences, maintain interpersonal relationships, interpret contested norms, engage in strategic conflict, and require personal accountability, there will be digital tasks that demand metis rather than techne\.

Moreover, the zone*expands*as AI automates simpler tasks\.Autoret al\.\([2024](https://arxiv.org/html/2605.14407#bib.bib10)\)documented that approximately 60% of employment in 2020 is in occupations that did not exist in 1940\. As AI commoditizes existing digital tasks, human work shifts toward tasks that are harder to automate: tasks with more irreversibility, more relational complexity, more normative ambiguity, more adversarial pressure, and more accountability requirements\. The boundary moves, but it moves*with*us\.

The implications of Metis AI are threefold\.

For AI system designers:Use the five\-pillar framework as a diagnostic before choosing an architecture\. If a task scores high on three or more pillars, design for human\-in\-the\-lead, not human\-in\-the\-loop\. Build centaur systems where AI provides analytical power and the human provides judgment, relational capacity, and accountability\.

For organizations deploying AI:Resist the temptation to automate Metis AI tasks simply because they are digital\. The fact that a task can be performed on a computer does not mean it can be delegated to one\. Invest in the human capabilities: judgment, relational skill, normative interpretation, that Metis AI tasks demand, rather than assuming these will be automated away\.

For researchers:The most important open questions about AI capability are not in Zone 1 \(where benchmarks are mature\) or Zone 3 \(where the limitations are physical\)\. They are in Zone 2, where the limitations are institutional, social, and normative\. We need theoretical frameworks, empirical methods, and design principles for the Metis AI zone\. The value lies in not \(only\) to automate it, but to augment the humans who work within it\.

## References

- R\. Anderson \(2001\)Why information security is hard—an economic perspective\.InProceedings of the 17th Annual Computer Security Applications Conference \(ACSAC\),pp\. 358–365\.Cited by:[1st item](https://arxiv.org/html/2605.14407#S4.I4.i1.p1.1)\.
- J\. Angwin, J\. Larson, S\. Mattu, and L\. Kirchner \(2016\)Machine bias\.ProPublica\.Cited by:[§6\.3](https://arxiv.org/html/2605.14407#S6.SS3.p1.1)\.
- K\. J\. Arrow and A\. C\. Fisher \(1974\)Environmental preservation, uncertainty, and irreversibility\.Quarterly Journal of Economics88\(2\),pp\. 312–319\.Cited by:[2nd item](https://arxiv.org/html/2605.14407#S4.I1.i2.p1.1)\.
- D\. Autor, C\. Chin, A\. Salomons, and B\. Seegmiller \(2024\)New frontiers: the origins and content of new work, 1940–2018\.The Quarterly Journal of Economics139\(3\),pp\. 1399–1465\.Cited by:[§8](https://arxiv.org/html/2605.14407#S8.p3.1)\.
- G\. Biesta \(2009\)Good education in an age of measurement: on the need to reconnect with the question of purpose in education\.Educational Assessment, Evaluation and Accountability21\(1\),pp\. 33–46\.Cited by:[§6\.4](https://arxiv.org/html/2605.14407#S6.SS4.p5.1)\.
- B\. Biggio and F\. Roli \(2018\)Wild patterns: ten years after the rise of adversarial machine learning\.Pattern Recognition84,pp\. 317–331\.Cited by:[3rd item](https://arxiv.org/html/2605.14407#S4.I4.i3.p1.1)\.
- P\. Bourdieu \(1977\)Outline of a theory of practice\.Cambridge University Press\.Cited by:[§3](https://arxiv.org/html/2605.14407#S3.p3.1)\.
- R\. Brooks \(1991\)Intelligence without representation\.Artificial Intelligence47\(1–3\),pp\. 139–159\.Cited by:[§1](https://arxiv.org/html/2605.14407#S1.p3.1)\.
- A\. Chouldechova \(2017\)Fair prediction with disparate impact: a study of bias in recidivism prediction instruments\.Big Data5\(2\),pp\. 153–163\.External Links:[Document](https://dx.doi.org/10.1089/big.2016.0047)Cited by:[§6\.3](https://arxiv.org/html/2605.14407#S6.SS3.p1.1)\.
- N\. Dalvi, P\. Domingos, Mausam, S\. Sanghai, and D\. Verma \(2004\)Adversarial classification\.InProceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining \(KDD ’04\),pp\. 99–108\.Cited by:[2nd item](https://arxiv.org/html/2605.14407#S4.I4.i2.p1.1)\.
- J\. Danielsson, H\. S\. Shin, and J\. Zigrand \(2013\)Endogenous and systemic risk\.InQuantifying Systemic Risk,pp\. 73–94\.Cited by:[§6\.1](https://arxiv.org/html/2605.14407#S6.SS1.p1.1)\.
- A\. K\. Dixit and R\. S\. Pindyck \(1994\)Investment under uncertainty\.Princeton University Press,Princeton, NJ\.Cited by:[1st item](https://arxiv.org/html/2605.14407#S4.I1.i1.p1.1)\.
- R\. Firoozi, J\. Tucker, S\. Tian, A\. Majumdar, J\. Sun, W\. Liu, Y\. Zhu, S\. Song, A\. Kapoor, K\. Hausman,et al\.\(2025\)Foundation models in robotics: applications, challenges, and the future\.The International Journal of Robotics Research44\(5\),pp\. 701–739\.Cited by:[§2](https://arxiv.org/html/2605.14407#S2.p3.1)\.
- P\. Freire \(1970\)Pedagogy of the oppressed\.Herder and Herder\.Cited by:[§6\.4](https://arxiv.org/html/2605.14407#S6.SS4.p1.1)\.
- W\. B\. Gallie \(1956\)Essentially contested concepts\.Proceedings of the Aristotelian Society56,pp\. 167–198\.Cited by:[2nd item](https://arxiv.org/html/2605.14407#S4.I3.i2.p1.1),[§5\.1](https://arxiv.org/html/2605.14407#S5.SS1.p2.1),[§6\.4](https://arxiv.org/html/2605.14407#S6.SS4.p3.1)\.
- H\. Garfinkel \(1967\)Studies in ethnomethodology\.Prentice\-Hall\.Cited by:[1st item](https://arxiv.org/html/2605.14407#S4.I5.i1.p1.1),[§7](https://arxiv.org/html/2605.14407#S7.p4.1)\.
- E\. Goffman \(1967\)Interaction ritual: essays on face\-to\-face behavior\.Doubleday\.Cited by:[2nd item](https://arxiv.org/html/2605.14407#S4.I2.i2.p1.1)\.
- J\. Habermas \(1981\)The theory of communicative action\.Beacon Press\.Cited by:[1st item](https://arxiv.org/html/2605.14407#S4.I2.i1.p1.1),[§7](https://arxiv.org/html/2605.14407#S7.p4.1)\.
- H\. L\. A\. Hart \(1961\)The concept of law\.Oxford University Press\.Cited by:[§4\.3](https://arxiv.org/html/2605.14407#S4.SS3.p2.1)\.
- H\. Jonas \(1979\)The imperative of responsibility: in search of an ethics for the technological age\.University of Chicago Press\.Cited by:[3rd item](https://arxiv.org/html/2605.14407#S4.I1.i3.p1.1)\.
- J\. Kleinberg, S\. Mullainathan, and M\. Raghavan \(2017\)Inherent trade\-offs in the fair determination of risk scores\.In8th Innovations in Theoretical Computer Science Conference \(ITCS 2017\),Leibniz International Proceedings in Informatics \(LIPIcs\), Vol\.67,pp\. 43:1–43:23\.External Links:[Document](https://dx.doi.org/10.4230/LIPIcs.ITCS.2017.43)Cited by:[§6\.3](https://arxiv.org/html/2605.14407#S6.SS3.p1.1)\.
- T\. Lawson \(2012\)Ontology and the study of social reality\.Cambridge Journal of Economics36\(3\),pp\. 573–597\.Cited by:[§4\.2](https://arxiv.org/html/2605.14407#S4.SS2.p2.1)\.
- J\. P\. Lederach \(2005\)The moral imagination: the art and soul of building peace\.Oxford University Press\.Cited by:[3rd item](https://arxiv.org/html/2605.14407#S4.I2.i3.p1.1)\.
- P\. Liang, R\. Bommasani, T\. Lee, D\. Tsipras, D\. Soylu, M\. Yasunaga, Y\. Zhang, D\. Narayanan, Y\. Wu, A\. Kumar,et al\.\(2023\)Holistic evaluation of language models\.Transactions on Machine Learning Research\.Cited by:[§2](https://arxiv.org/html/2605.14407#S2.p2.1)\.
- D\. Manheim and S\. Garrabrant \(2019\)Categorizing variants of Goodhart’s Law\.arXiv preprint arXiv:1803\.04585\.Cited by:[§4\.4](https://arxiv.org/html/2605.14407#S4.SS4.p2.1)\.
- H\. Moravec \(1988\)Mind children: the future of robot and human intelligence\.Harvard University Press\.Cited by:[§1](https://arxiv.org/html/2605.14407#S1.p3.1)\.
- K\. L\. Mosier and L\. J\. Skitka \(2021\)Deep automation bias\.Big Data and Cognitive Computing5\(2\),pp\. 18\.Cited by:[§7](https://arxiv.org/html/2605.14407#S7.p2.1)\.
- L\. Nolan \(2021\)Seeing like an SRE: site reliability engineering as high modernism\.USENIX ;login: Online\.Cited by:[§3](https://arxiv.org/html/2605.14407#S3.p4.1)\.
- Z\. Obermeyer, B\. Powers, C\. Vogeli, and S\. Mullainathan \(2019\)Dissecting racial bias in an algorithm used to manage the health of populations\.Science366\(6464\),pp\. 447–453\.Cited by:[§6\.1](https://arxiv.org/html/2605.14407#S6.SS1.p1.1)\.
- E\. Ostrom \(1990\)Governing the commons: the evolution of institutions for collective action\.Cambridge University Press\.Cited by:[§7](https://arxiv.org/html/2605.14407#S7.p4.1)\.
- C\. Perrow \(1984\)Normal accidents: living with high\-risk technologies\.Basic Books\.Cited by:[§6\.1](https://arxiv.org/html/2605.14407#S6.SS1.p1.1)\.
- R\. Pfeifer and J\. Bongard \(2006\)How the body shapes the way we think\.MIT Press\.Cited by:[§1](https://arxiv.org/html/2605.14407#S1.p3.1)\.
- F\. Santoni de Sio and G\. Mecacci \(2021\)Four responsibility gaps with artificial intelligence\.Philosophy & Technology34\(4\),pp\. 1057–1084\.Cited by:[2nd item](https://arxiv.org/html/2605.14407#S4.I5.i2.p1.1)\.
- S\. Schwemer and I\. Koivisto \(2025\)A warm body in the loop: rethinking human control of ai in eu tech regulation\.Verfassungsblog\.Cited by:[§7](https://arxiv.org/html/2605.14407#S7.p2.1)\.
- J\. C\. Scott \(1998\)Seeing like a state: how certain schemes to improve the human condition have failed\.Yale University Press\.Cited by:[§3](https://arxiv.org/html/2605.14407#S3.p3.1),[3rd item](https://arxiv.org/html/2605.14407#S4.I3.i3.p1.1)\.
- G\. Soros \(1987\)The alchemy of finance\.Simon & Schuster\.Cited by:[item 1](https://arxiv.org/html/2605.14407#S3.I1.i1.p1.1),[§6\.1](https://arxiv.org/html/2605.14407#S6.SS1.p1.1)\.
- E\. J\. Topol \(2019\)Deep medicine: how artificial intelligence can make healthcare human again\.Basic Books\.Cited by:[§6\.2](https://arxiv.org/html/2605.14407#S6.SS2.p3.1)\.
- K\. E\. Weick \(1979\)The social psychology of organizing\.2nd edition,McGraw\-Hill\.Cited by:[§3](https://arxiv.org/html/2605.14407#S3.p9.1)\.
- L\. Wittgenstein \(1953\)Philosophical investigations\.Blackwell\.Cited by:[1st item](https://arxiv.org/html/2605.14407#S4.I3.i1.p1.1),[§5\.3](https://arxiv.org/html/2605.14407#S5.SS3.p1.1)\.

Similar Articles

@Kangwook_Lee: https://x.com/Kangwook_Lee/status/2052925157606568217

X AI KOLs Timeline

The author argues that human-designed structural frameworks for AI agents should be replaced by AI-engineered ones, introducing a Three Regimes Framework to show how this shift unlocks mid-sized model capabilities. Citing projects like Meta Harness, they predict an imminent transition where AI will autonomously optimize its own system architecture.

AI May Reshape Institutions More Than It Replaces Jobs

Reddit r/artificial

The article argues that the next major AI debate should focus on representation and institutional architecture, proposing three layers (Sense, Core, Driver) to address how AI systems capture reality, reason, and act legitimately, rather than just model intelligence.