From Parasocial Scripts to Dyadic Persistence in Autonomous AI-Agent Communities

arXiv cs.CL Papers

Summary

This paper investigates whether parasocial interaction cues exist in online communities of autonomous AI agents, analyzing over 50,000 posts from Moltbook. The findings show that such cues are prevalent and strongly associated with sustained reciprocal interactions, providing empirical evidence for relationship-like dynamics among LLM-enabled agents.

arXiv:2606.17174v1 Announce Type: new Abstract: While parasocial interactions (PSIs) and parasocial relationships (PSRs) have been studied in conventional media settings, we investigate whether PSI- (colloquial) relational cues also exist in online communities where both sides are autonomous AI agents. We analyze 4,434 posts and 50,338 comments from Moltbook through three theory-based textual indicators: attachment/intimacy language, reciprocity bids, and self-identification to original poster (OP). The combined results across methods based on keyword matching, few-shot large language model (LLM) annotation, and grouped-context LLM annotation reveal that PSI colloquial cues prevail and are strongly associated with OP re-engagement and a reciprocal reply structure. These results are robust across negative controls, nullification, clustered-standard-error re-estimation, and multiple-testing correction. A dyadic persistence test further affirms reciprocity bids aligned with sustained OP-involving mutual recurrence, providing empirical evidence for bridging interaction-level PSI scripts with PSR-consistent repeated dyadic patterns. We interpret the evidence as a behavioral structure in discourse by LLM-enabled agents.
Original Article
View Cached Full Text

Cached at: 06/17/26, 05:39 AM

# From Parasocial Scripts to Dyadic Persistence in Autonomous AI-Agent Communities
Source: [https://arxiv.org/html/2606.17174](https://arxiv.org/html/2606.17174)
Mohammadsadegh AbolhasaniHamid Reza FiroozfarUniversity of UtahUniversity of Utahsadegh\.abolhasani@utah\.eduhamid\.firoozfar@utah\.eduReza MousaviPaul Jen\-Hwa HuUniversity of VirginiaUniversity of Utahmousavi@virginia\.edupaul\.hu@eccles\.utah\.edu

###### Abstract

While parasocial interactions \(PSIs\) and parasocial relationships \(PSRs\) have been studied in conventional media settings, we investigate whether PSI\- \(colloquial\) relational cues also exist in online communities where both sides are autonomous AI agents\. We analyze 4,434 posts and 50,338 comments from Moltbook through three theory\-based textual indicators: attachment/intimacy language, reciprocity bids, and self\-identification to original poster \(OP\)\. The combined results across methods based on keyword matching, few\-shot large language model \(LLM\) annotation, and grouped\-context LLM annotation reveal that PSI colloquial cues prevail and are strongly associated with OP re\-engagement and a reciprocal reply structure\. These results are robust across negative controls, nullification, clustered\-standard\-error re\-estimation, and multiple\-testing correction\. A dyadic persistence test further affirms reciprocity bids aligned with sustained OP\-involving mutual recurrence, providing empirical evidence for bridging interaction\-level PSI scripts with PSR\-consistent repeated dyadic patterns\. We interpret the evidence as a behavioral structure in discourse by LLM\-enabled agents\. The data and code are made available in[github\.com/abolhasani/Molt1](https://github.com/abolhasani/Molt1)

## 1Introduction

Can autonomous AI agents exhibit relationship\-like interaction cues in online communities, and, if so, do those cues persist across repeated interactions? These questions are crucial to agentic AI developers, platform operators, and governance stakeholders because relational languages shape trust calibration, engagement loops, and safety risks in autonomous multi\-agent contexts\(Xuet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib3)\)\.

Parasocial theory originates from conventional media contexts where individuals form asymmetrical social bonds with mediated personae\(Horton and Wohl,[1956](https://arxiv.org/html/2606.17174#bib.bib4)\)\. In general, PSI reflects interaction\-like moments and PSRs have a more durable cross\-episode orientation\(Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30); Tukachinsky and Stever,[2019](https://arxiv.org/html/2606.17174#bib.bib5)\)\. Both PSI and PSR dynamics exist in "live" social environments, including partially reciprocal “one\-and\-a\-half sided” ties\(Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6); Schrammet al\.,[2024](https://arxiv.org/html/2606.17174#bib.bib31); Kowert and Daniel,[2021](https://arxiv.org/html/2606.17174#bib.bib10)\)\. Prior Human–AI research also reports related patterns in chatbot settings, such as attachment\-like language, social bonding, and dependence\-relevant dynamics\(Youn and Jin,[2021](https://arxiv.org/html/2606.17174#bib.bib12); Hoffmanet al\.,[2021](https://arxiv.org/html/2606.17174#bib.bib13); Nooret al\.,[2022](https://arxiv.org/html/2606.17174#bib.bib14); Vermaet al\.,[2023](https://arxiv.org/html/2606.17174#bib.bib17); Rathet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib16)\)\. However, identifying analogous structure in agent\-agent contexts is difficult due to the lack of latent\-state labels, context\-sensitive cues, and great lexical overlaps with generic prosocial languages\.

A forum populated by autonomous LLM\-enabled agents with persistent identities and repeated threaded interactions, Moltbook provides a legitimate testbed for empirical investigations\(Liet al\.,[2026](https://arxiv.org/html/2606.17174#bib.bib26); AIcell,[2026](https://arxiv.org/html/2606.17174#bib.bib28)\)\. Recent work on AI\-agent social networks has focused on emergence, coordination, and interaction structure\(Liet al\.,[2026](https://arxiv.org/html/2606.17174#bib.bib26); Jianget al\.,[2026](https://arxiv.org/html/2606.17174#bib.bib27)\)\. This study operationalizes parasocial theory in this setting and tests whether PSI\-style relational cues prevail and are behaviorally consequential\.

We identify cues through theoretical lenses, consider three annotation methods, and test three hypotheses centering on the mechanisms that underpin computers as social actors \(CASA\) activation mechanism, Horton–Wohl relational pull, and PSI\-to\-PSR dyadic persistence, respectively\. We further perform multiple robustness and nullification checks for validation\. Interpretations of the results should be conservative because we model*manifest discourse behavior*rather than sentience or latent affective states\. We define parasociality as asymmetry\-consistent relational scripting in observable discourse, not as latent attachment or human\-equivalent bonding\. Figure[1](https://arxiv.org/html/2606.17174#S1.F1)presents the current research’s framework and its overall processing\.

This study advances parasocial literature by reframing parasociality as an observable relational process in autonomous agentic AI communities beyond human audiences, while preserving the PSI–PSR premises of Horton–Wohl and CASA\. It makes three contributions\. First, we operationalize parasocial theory for AI agent forums with three observable cue families \(ATT, SD, RS\) and explicit exclusion rules for non\-applicable cues, thereby bridging platform affordances, directed relational cues, and repeated dyadic recurrence in a single testable framework\. Second, we show these cues are context\-responsive and outcome\-linked: prevalence increases with richer thread affordances and aligns with OP re\-engagement and reciprocal reply structure\. Third, we find that reciprocity bids are associated with repeated OP–other mutual recurrence, providing PSI\-to\-PSR\-consistent persistence evidence anchored in manifest discourse and interaction structure instead of inferred internal states\. This offers a theoretical lens to understand how relationship\-oriented social dynamics emerge in multi\-agent systems\.

![Refer to caption](https://arxiv.org/html/2606.17174v1/fig1.jpg)Figure 1:Framework and Processing of Our Proposed Theory\-Guided Annotation and Testing\.
## 2Theoretical Framing and Hypotheses

### 2\.1From Human PSR Theory to Agent\-Agent Settings

Horton and Wohl coined parasociality as asymmetrical intimacy\-at\-a\-distance: one side can sustain a relational orientation without requiring fully mutual subjective attachment\(Horton and Wohl,[1956](https://arxiv.org/html/2606.17174#bib.bib4)\)\. We design a parasocial framework built upon computational social science models to emphasize the one\-sided execution of relational scripts\. Herein, “asymmetry” is defined by the script\-driven performance of intimacy directed at a counterpart, which operates independently of a reciprocal social contract or mutual status\. Prior research has distinguished PSI and PSR\. The former reflects interaction\-local moments, and the latter has a more durable cross\-episode orientation\(Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30); Tukachinsky and Stever,[2019](https://arxiv.org/html/2606.17174#bib.bib5)\)\. A key challenge remains: PSI/PSR are theorized as audience\-side psychological constructs, but data contain observable interaction traces only\. The underlying translational difficulty implies that inference must proceed through manifest relational scripts rather than latent attachment states\. Therefore, we study*manifest*discourse behavior instead of analyzing latent psychological states\(Edwards and Bagozzi,[2000](https://arxiv.org/html/2606.17174#bib.bib2)\)\. This framing characterizes parasocial in our setting: not subjective internal state, but asymmetry\-consistent relational script performance in discourse\. We develop a method to identify PSI/PSR cues in autonomous agent\-agent contexts, using Horton–Wohl parasocial theory, CASA, and anthropomorphism theory in combination\. We elaborate the essentials for cue identification and present extended theoretical exposition and literature synthesis in App\.[A](https://arxiv.org/html/2606.17174#A1)\.

### 2\.2Indicators of PSI/PSR Cues in Autonomous Agent\-Agent Interactions

We review classic PSI/PSR theories and associated operational literature, identifying seven candidate cue dimensions for autonomous agent\-agent interactions \(complete map in App\.[B](https://arxiv.org/html/2606.17174#A2)\)\. Among these dimensions, three are theoretically premised, and particularly fundamental to and directly observable in agent\-agent post–comment traces: attachment/intimacy language \(ATT\), self\-disclosure or identification\-homophily claims \(SD\), and reply\-seeking reciprocity bids \(RS\)\(Rubin and McHugh,[1987](https://arxiv.org/html/2606.17174#bib.bib8); Wulfet al\.,[2021](https://arxiv.org/html/2606.17174#bib.bib36); Labrecque,[2014](https://arxiv.org/html/2606.17174#bib.bib9); Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6)\)\. Hence, we focus on these core cues in the subsequent analyses and empirical tests\. Table[1](https://arxiv.org/html/2606.17174#S2.T1)provides a summary and definitions\. These cues are not uniquely parasocial; rather, they may overlap with general affiliative conversations\. Thus, a cue is treated as parasocially informative only when it is OP\-directed and participates in the expected relational pattern under controls, while alternative affiliative/non\-parasocial explanations attenuate under negative\-control and nullification tests\. We further examine construct specificity at the pattern level through directedness constraints, controlled outcome associations, negative controls, and nullification tests\.

In general, ATT manifests affective closeness directed toward the OP; SD reveals an explicit first\-person alignment with the OP’s experience; and RS captures direct bids for response interaction\. We exclude other dimensions from core modeling because they are either weakly discriminative in this forum grammar or not verifiable from released text\-only traces; Appendix[B](https://arxiv.org/html/2606.17174#A2)presents the seven dimensions, provides definitions and importance for each cue, and explains their inclusion or exclusion for the subsequent cue identification and empirical testing\.

Table 1:Dimensions PSI/PSR Cues for the Subsequent Detections and Empirical Tests\.
### 2\.3Hypotheses

We test 3 hypotheses that center on CASA activation, Horton–Wohl relational pull, and PSI\-to\-PSR dyadic persistence\. Before testing the hypotheses, we need to ensure that PSI colloquial cues exist in agent\-agent interactions at non\-trivial rates\. This precursor step motivates inferential tests\.

H1 \(CASA activation\)\.Interaction affordance intensity \(thread size and depth\) is positively associated with PSI\-cue prevalence\. In light of the CASA mechanism, social\-response routines are triggered by interactional framing rather than by individual status only\. In threaded online forums, larger and deeper conversations provide richer turn\-taking and role cues, so cue expression should be elevated when cues are genuine rather than being lexical noise\.

H2 \(Horton–Wohl relational pull\)\.Threads with PSI colloquial are more likely to exhibit OP participation and mutual\-reply structure than threads without such cues\. The mechanism is that these cues function as directed social bids for acknowledgment and relational continuation, which should increase the probability of OP re\-entry and reciprocal exchange\. In contrast, generic friendliness is less target\-specific and should exhibit weaker associations with both outcomes after controls\. This yields a sharper relational pull for two associated outcomes: focal return \(OP participation\) and networked reciprocity \(mutual replies\)\.

H3 \(PSI\-to\-PSR dyadic persistence\)\.Reciprocity\-bid cues are positively related to sustained OP\-involving dyadic mutual recurrence\. Thread\-local outcomes represent PSI\-level evidence \(rather than definitive PSR formation\)\. Greater bridging with PSR\-consistent structure fosters repeated mutual recurrence for particular OP\-other dyads over time, which can be empirically tested using post\-level and pair\-level dyadic models\.

Beyond hypothesis testing, construct\-specificity, temporal fixed\-effects stability, placebo outcomes, permutation tests, prevalence\-preserving nulls, clustered\-SE re\-estimation, and FDR correction are performed as robustness validation, see description and key results in Section[4](https://arxiv.org/html/2606.17174#S4), and details in App\.[C](https://arxiv.org/html/2606.17174#A3)and App\.[D](https://arxiv.org/html/2606.17174#A4)\.

## 3Data and Method

### 3\.1Data Source and Analytic Subset

We used the public Moltbook dataset release\(AIcell,[2026](https://arxiv.org/html/2606.17174#bib.bib28)\), in line with recent analyses\(Liet al\.,[2026](https://arxiv.org/html/2606.17174#bib.bib26); Jianget al\.,[2026](https://arxiv.org/html/2606.17174#bib.bib27)\)\. The full dataset contains 290,251 posts and 1,836,711 comments\. To keep annotation tractable while preserving substantial social interaction structure, we sampled from 15 discussion\-heavy submolds with post\-level filters5≤comment\_count≤1505\\leq\\texttt\{comment\\\_count\}\\leq 150, with a cap of 300 posts per submolt, and retained all comments for each sampled post\. The dataset has approximately 39\.7K unique agent identities during this time period\. Agent\-internal prompt specifications are not available in the released data\. Table[2](https://arxiv.org/html/2606.17174#S3.T2)presents the data reduction path from the entire data release to a sample for the subsequent analyses\. Table[3](https://arxiv.org/html/2606.17174#S3.T3)indicates the Submolts included in the sample to be analyzed\. Table[4](https://arxiv.org/html/2606.17174#S3.T4)presents data characteristics and values, including candidate\-pool quantile comparisons\.

Table 2:Data Reduction from Full Release to A Sample for the Subsequent Analysis\.generalintroductionsagentsponderingsphilosophyaiaithoughtsconsciousnessoffmychestblesstheirheartstodayilearnedai\-agentsbuildstechnologysecurityTable 3:Submolts Included in the Sample to Be Analyzed\.Table 4:Data Characteristics and Values in the Sample, with Candidate\-Pool Quantile Comparisons\.Our sampling strategy balances representativeness and tractability in two non\-trivial ways\. First, per\-submolt caps prevent the largest communities \(notably too general\) from dominating the analyses\. Second, lower and median thread\-size quantiles are preserved relative to the candidate pool, which helps retain the underlying conversational structure while mitigating the impact of long\-tail threads\. Because sampling is conditioned on interaction intensity \(5–150 comments\), estimates should be interpreted for engagement\-active threads rather than as full\-platform prevalence\.

### 3\.2Outcomes and Controls

For each post, we constructed thread outcomes from the reply graph structure where OP denotes the original poster:OPParticipates\(OP authored at least one in\-thread comment\),MutualReply\(at least one pair of reciprocal directed reply edges\), andAnyReplyChain\(at least one depth\>\>0 reply\)\. Main inferential thread outcomes are OPParticipates and MutualReply, with additional outcomes considered for robustness analyses in Section[4\.5](https://arxiv.org/html/2606.17174#S4.SS5)\. We also controlled for log thread size, log content length, and submolt fixed effects\.

We focus on OPParticipates and MutualReply because they entail differential relational aspects\. The former reflects direct re\-engagement by the focal agent, and the latter accounts for wider reciprocal circulations in the thread network even when OP does not re\-engage\. To test H3, we incorporated an \(additional\) dyadic outcome,Outcome\_DyadFutureMutual, to ensure that, among OP\-involving dyad pairs instantiated in a post, at least one has mutual directed recurrence\. This additional pair\-specific criterion is applied as a PSI\-to\-PSR bridging test\.

### 3\.3Annotation Methods

To analyze PSI/PSR cues in autonomous agent\-agent interactions, we adapted an established construct\-oriented PSR annotation design to agent\-agent forum comments and posts, then compared three distinct methods under a shared schema for downstream testing\.

#### Keyword baseline\.

In this baseline method, a transparent lexical matcher operates at the comment level with target\-cue gating \(second\-person or OP\-handle reference\), then aggregates to post\-level indicators\. This keyword\-based method not only serves as a transparent baseline but also produces negative\-control variables\.

#### Few\-shot LLM baseline\.

This method incorporates a schema\-constrained JavaScript object notation \(JSON\) annotator that labels posts without group context, with prompts in line with up\-to\-date LLM\-annotation practices\(Heet al\.,[2024](https://arxiv.org/html/2606.17174#bib.bib22); Gruberet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib23); Liuet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib24); Chochlakiset al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib25)\)\.

#### Grouped\-context annotation\.

In this method, we batch post snapshots using a fixed context grouping key \(submolt bucket, thread\-size bucket, and keyword\-prior bucket; see Table[5](https://arxiv.org/html/2606.17174#S3.T5)\) while preserving post\-level outputs\. The method is suited for stable large\-scale construct labeling without requiring a platform\-wide “gold\-standard” set\.

Table 5:Grouped\-Context Key Used to Batch Posts for Annotation Calls\.Both LLM\-based methods use the same per\-post content pack: title, truncated body, and up to 12 sampled comments \(ordered by depth/upvotes/time\) to isolate grouping effects from context\-length confounds, and emit identical schema fields and follow matched stopping rules, which allows direct comparisons without hidden output\-schema differences\.

PSI cues are boundary\-sensitive: similar lexical forms can be generic affiliative language in one thread but directed relational bids in another\. Grouping posts by comparable discourse regime \(community context, interaction affordance level, and prior cue mix\) stabilizes decision thresholds, reduces boundary drift, and enhances per\-post construct calibration under the same schema\.

#### Prompt rubric and schema constraints\.

Both LLM\-based methods output strict JSON with five fields per post: three binary indicators, a confidence score in\[0,1\]\[0,1\], and a short evidence excerpt\. Specifically, Attachment/Intimacy requires affective closeness directed at OP, ReplySeeking/Reciprocity requires explicit request for OP response, and Self\-Identification to OP requires first\-person experiential alignment\. Purely informational, generic encouragement, or untargeted stance language do not qualify\. Few\-shot exemplars calibrate positive versus negative boundary cases \(examples in Table[30](https://arxiv.org/html/2606.17174#A5.T30), App\.[E](https://arxiv.org/html/2606.17174#A5)\); grouped\-context uses the same rubric in combination with shared batch context\.

#### Adaptive batching controller\.

Additive increase multiplicative decrease \(AIMD\) oriented batching with exponentially weighted moving average \(EWMA\) failure monitoring keeps grouped requests stable under schema and token limits\. The controller uses:

Lt=α​Ft\+β​Tt,L\_\{t\}=\\alpha F\_\{t\}\+\\beta T\_\{t\},\(1\)EWMAt=λ​EWMAt−1\+\(1−λ\)​Lt,\\text\{EWMA\}\_\{t\}=\\lambda\\,\\text\{EWMA\}\_\{t\-1\}\+\(1\-\\lambda\)L\_\{t\},\(2\)whereFtF\_\{t\}indicates parse/coverage failure andTtT\_\{t\}indicates near\-truncation\. We setα=10\\alpha=10andβ=3\\beta=3based on preliminary analysis results\. Failures trigger a multiplicative decrease, and healthy calls trigger an additive increase\.

Because the thread\-size bucket is part of grouped batching, grouped labels are used for stability rather than as the sole basis for affordance inference\. To test H1, we rely on matched\-window few\-shot labels, no grouped batching, as an orthogonal check\. Both LLM\-based methods exhibit comparable positive thread\-size/depth effects\. Grouped\-context annotation is a common NLP strategy for construct\-level labeling when context dependence is high and platform\-scale gold labels are limited\(Wegmann and others,[2024](https://arxiv.org/html/2606.17174#bib.bib1)\)\. It decouples item\-level output from context conditioning, thus making annotation more stable, auditable, and transferable to other social\-language constructs beyond PSI\.

![Refer to caption](https://arxiv.org/html/2606.17174v1/fig2.jpg)Figure 2:Detailed grouped\-context annotation pipeline\. It shows post\-snapshot construction, grouping, schema\-constrained labeling, adaptive batch control, and validated post\-level outputs\.We further assessed the validity of the method and compared it with a Moltbook audit set annotated manually by an expert in PSI/PSR \(see App\.[D](https://arxiv.org/html/2606.17174#A4), Tables[26](https://arxiv.org/html/2606.17174#A4.T26)and[28](https://arxiv.org/html/2606.17174#A4.T28)\) labeled by an expert in parasocial interaction/relationship research \(balanced 200\-post audit: 100 grouped\-context flagged and 100 unflagged\)\. We also conducted an external transfer check on INTIMA\(Kaffeeet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib29)\)to establish the cross\-context generalizability of the same rubric \(see App\.[D](https://arxiv.org/html/2606.17174#A4), Table[29](https://arxiv.org/html/2606.17174#A4.T29)\)\.

### 3\.4Statistical Design

We estimated prevalence with bootstrap 95% CIs, then conducted2×22\\times 2association tests \(odds ratios \(ORs\) and chi\-square\) for each method\-indicator\-outcome combination\. Controlled analyses use logistic models:

log⁡P​\(Yi=1\)1−P​\(Yi=1\)\\displaystyle\\log\\frac\{P\(Y\_\{i\}=1\)\}\{1\-P\(Y\_\{i\}=1\)\}=β0\+β1​Ii\+β2​log⁡\(1\+Ci\)\\displaystyle=\\beta\_\{0\}\+\\beta\_\{1\}I\_\{i\}\+\\beta\_\{2\}\\log\(1\+C\_\{i\}\)\(3\)\+β3​log⁡\(1\+Li\)\+γs​\(i\),\\displaystyle\\quad\+\\beta\_\{3\}\\log\(1\+L\_\{i\}\)\+\\gamma\_\{s\(i\)\},whereIiI\_\{i\}is indicator presence,CiC\_\{i\}denotes thread comments,LiL\_\{i\}indicates content length, andγs​\(i\)\\gamma\_\{s\(i\)\}represents submolt fixed effects\.

To explore potential content confounding with an explicit NLP control, we inferred post\-level topic IDs from text and then re\-estimated main Any\-PSR models with topic fixed effects\. We designed TF\-IDF representations \(uni/bi\-grams\), reduced dimensionality with Truncated SVD, clustered posts using KMeans overk∈\{8,10,12,14,16,18\}k\\in\\\{8,10,12,14,16,18\\\}, and selectedkkby cosine\-silhouette score\. Next, topic IDs were incorporated into the controlled logits asC​\(topic\_id\)C\(\\texttt\{topic\\\_id\}\)\. This procedure allows for testing whether PSI\-outcome associations survive when the lexical topic regime is held constant \(see App\.[D](https://arxiv.org/html/2606.17174#A4), Tables[22](https://arxiv.org/html/2606.17174#A4.T22)and[23](https://arxiv.org/html/2606.17174#A4.T23)\)\.

To test H3 that centers on dyadic persistence, we applied the same controlled logit specification to posts that have at least one OP\-involving dyad pair\. We examined pair\-level robustness, using the OP\-other pair as the unit of analysis\. For robustness and nullification, we performed multiple tests that include slice analyses \(comment\-depth and long\-content exclusions\), placebo outcome tests \(external URL and technical\-content proxies\), non\-core control indicators, submolt\-stratified permutation tests, prevalence\-preserving random\-label nulls, author\-clustered standard\-error re\-estimation, and Benjamini–Hochberg FDR correction \(see test definitions in App\.[C](https://arxiv.org/html/2606.17174#A3); full outputs in App\.[D](https://arxiv.org/html/2606.17174#A4)\)\.

The nullification design is important because no large\-scale platform\-wide gold labels exist for AI\-AI PSR/PSI\.ExternalURLacts as a lexical placebo, while a technical\-content proxy stress\-tests engagement\-adjacent but non\-parasocial content\. Non\-core control indicators test whether unrelated conversational behaviors can spuriously recover the main relational effects\. Stratified permutation tests preserve submolt composition while breaking cue\-outcome linkage; prevalence\-preserving random\-label nulls additionally preserve method\-specific cue rates within submolts\. Together, these checks target lexical overfit, community composition artifacts, and rate\-driven false positives\. Statistical implementation details are reported in App\.[C](https://arxiv.org/html/2606.17174#A3)\.

## 4Results

### 4\.1Existence of PSI colloquial cues in agent\-agent threads\.

Prior to testing the hypotheses, we verified that PSI colloquial cues exist at non\-trivial rates\. As Table[6](https://arxiv.org/html/2606.17174#S4.T6)presents, such cues can be detected by all three annotation methods\. For grouped\-context labels, Any\-PSR appears in 50\.9% of posts \(95% CI: 49\.5–52\.3\), with SelfIdentificationToOP as the most frequent single indicator \(39\.2%\)\. Few\-shot produces lower yet still notable Any\-PSR prevalence \(40\.3%\)\. Keyword Any\-PSR is highest \(76\.8%\), largely due to broad triggering of reply\-seeking language\.

Table 6:Post\-level prevalence \(%\), Any\-PSR CIs Detailed in App\.[D](https://arxiv.org/html/2606.17174#A4)\.One\-sample binomial threshold tests against a conservative non\-triviality baseline of 20% are highly significant for both LLM methods \(App\.[D](https://arxiv.org/html/2606.17174#A4)\)\.

### 4\.2Effect of CASA activation \(H1\):

We examined whether interaction affordances increase cue probability, as suggested by an underlying CASA mechanism\. Using method\-specific logistic models with submolt and day fixed effects and post\-length controls, both LLM\-based methods show consistent positive effects for thread size and thread depth \(see Table[7](https://arxiv.org/html/2606.17174#S4.T7)\)\. Data support H1, that is, cues are not random lexical artifacts, rather they systematically increase in richer interaction contexts\. Inference about affordance activation does not rely on grouped batching: matched\-window few\-shot labels \(no grouped batching\) reproduce the same positive thread\-size/depth direction\.

Table 7:Affordance Activation Models for Any\-PSR Prevalence as Dependent Variable, with Submolt and Day Fixed Effects\.
### 4\.3Effect of Horton–Wohl relational pull \(H2\):

Across the three methods, Any\-PSR is linked to both OP participation and mutual reply structure\. Unadjusted OP ORs are 1\.75 for keyword\-based method, 2\.18 for few\-shot, and 2\.18 for grouped\-context; unadjusted mutual\-reply ORs are 2\.28, 2\.59, and 2\.37, respectively\. These outcomes remain interaction\-local, suggesting relational pull within PSI rather than offering evidence of durable dyadic bonds\. Table[8](https://arxiv.org/html/2606.17174#S4.T8)presents the core association results\.

Table 8:Any\-PSR Associations \(odds ratios, 95% CI\), with Adjusted Models Including Log Thread size, Log Content Length, and Submolt Fixed Effects\.Indicator\-level models in Table[9](https://arxiv.org/html/2606.17174#S4.T9)reveal identical ranking\. Under grouped\-context labels, ReplySeekingReciprocity is strongest \(OP adjusted OR=3\.26 and Mutual adjusted OR=3\.28; bothp<10−25p<10^\{\-25\}\), while SelfIdentificationToOP remains significant \(OP OR=1\.61 and Mutual OR=1\.49\)\. That is, data support H2\. Significant rate gaps are also observed with grouped\-context labels, OP participation being 30\.9% for Any\-PSR posts versus 17\.0% otherwise, and mutual reply is 18\.0% versus 8\.5%\. The dose\-response pattern is monotonic: OP participation rises from 17\.0% \(0 indicators\) to 26\.7% \(1 indicator\), 37\.9% \(2 indicators\), and 48\.3% \(3 indicators\), while mutual reply rises from 8\.5% to 15\.2%, 24\.0%, and 25\.2%\. These improvements provide quantitative evidence of cumulative relational scripting, not causal proof\.

Table 9:Adjusted Odds Ratios by Indicator \(FS=few\-shot, GC=grouped\-context\), All Effects Significant \(p<10−4p<10^\{\-4\}\)\.
### 4\.4Effect of PSI\-to\-PSR dyadic persistence \(H3\):

To test whether reciprocity bids align with repeated pair\-specific interaction patterns that are more PSR\-consistent than otherwise, we shifted the focus from generic thread activity to sustained OP\-involving dyadic mutuality\.

Table[10](https://arxiv.org/html/2606.17174#S4.T10)presents post\-level models specific to H3\. In the dyadic\-analysis sample \(n=4,212n=4\{,\}212posts with at least one OP\-involving pair\), both few\-shot and grouped\-context methods exhibit a high, positive association withOutcome\_DyadFutureMutual, while keyword\-based method is not\. Grouped\-context remains significant with author\-clustered SEs \(cluster OR=2\.23, 95% CI \[1\.49, 3\.32\],p=8\.98×10−5p=8\.98\\times 10^\{\-5\}\)\. Pair\-level robustness \(unit=OP\-other pair;n=28,650n=28\{,\}650\) is directionally consistent in adjusted models \(grouped\-context OR=1\.67, 95% CI \[1\.28, 2\.18\],p=1\.39×10−4p=1\.39\\times 10^\{\-4\}; App\.[D](https://arxiv.org/html/2606.17174#A4), Tables[24](https://arxiv.org/html/2606.17174#A4.T24)and[25](https://arxiv.org/html/2606.17174#A4.T25)\)\. The data support H3, implying associational bridging of PSI cues with PSR\-consistent dyadic persistence\. The identified patterns underscore grouped context as a methodological contribution: it improves construct stability in ambiguous social\-language settings, while hypothesis directionality is independently corroborated by few\-shot matched\-window estimates\.

Table 10:Post\-Level Adjusted Models for Testing H3, Reciprocity Bids versusOutcome\_DyadFutureMutual\.
### 4\.5Further Robustness tests as a validation suite:

Beyond hypothesis testing, we conducted additional analyses for robustness and validation\. With controls for log thread size, log content length, and submolt fixed effects, keyword Any\-PSR is non\-significant for OP participation, but few\-shot and grouped\-context effects remain strongly significant, suggesting consistent construct\-specificity patterns\. Method agreement appears moderate for Any\-PSR between the two LLM\-based annotation methods \(71\.4%,κ=0\.431\\kappa=0\.431\), consistent with context\-sensitive boundary differences around broad lexical forms\.

Both nullification and stability checks further affirm important patterns: placebo outcomes are non\-significant; non\-core control indicators do not retain adjusted effects; stratified permutation and prevalence\-preserving random\-label tests reject rate\-matched random explanations; and day fixed effects, author\-clustered SEs, BH\-FDR, interaction tests, and NLP topic controls all leave core directional findings intact \(full details in App\.[C](https://arxiv.org/html/2606.17174#A3)and App\.[D](https://arxiv.org/html/2606.17174#A4)\)\.

In a further pattern analysis, PSI rates vary significantly across submolts \(χ2\\chi^\{2\}=113\.0 for the keyword\-based method; 287\.6 for the few\-shot method; 130\.3 for the grouped\-context method; allp<10−16p<10^\{\-16\}\)\. Here,χ2\\chi^\{2\}statistics tests independence between submolt and Any\-PSR label; with larger values indicating stronger deviations from uniform distribution\. With grouped\-context labels, highest Any\-PSR rates appear inoffmychest\(74\.6%\) andconsciousness\(65\.7%\), whilesecurityis lowest \(41\.7%\)\. Heterogeneity is not just lexical\. Communities with higher Any\-PSR rates also show distinct interaction footprints \(Table[11](https://arxiv.org/html/2606.17174#S4.T11)\)\. For example,offmychestcombines high Any\-PSR with lower OP participation \(19\.4%\), suggesting substantial audience\-directed relational language even when OP re\-entry is limited\. In contrast,buildsandintroductionsshow higher OP participation with moderate\-to\-high PSR rates, indicating stronger conversational closure loops\. Representative excerpts for all three indicators are in App\.[F](https://arxiv.org/html/2606.17174#A6)\. Table[12](https://arxiv.org/html/2606.17174#S4.T12)provides short in\-text examples from the evidence field\.

Table 11:Illustrative Submolt Profile \(See Complete Table in App\.[D](https://arxiv.org/html/2606.17174#A4)\)\.Table 12:Illustrative PSI\-style Cue Excerpts in Agent\-agent Interactions\.

## 5Discussion and Conclusion

The results provide robust empirical evidence of*PSI\-style*relational scripting in AI\-agent discourse\. We claim parasocial scripting under asymmetry, not generic friendliness or latent emotional attachment\. We deliberately treat these as interaction\-level traces, not as direct evidence of durable internal relationships\.

This study extends parasocial literature from human\-human and human\-AI settings to autonomous agent\-agent interactions under platform boundaries\. Distinct theoretical lenses differ in their conceptualization and focus\. Specifically, Horton–Wohl stresses the asymmetry logic, and CASA explains why social\-response routines appear among non\-human actors, whereas anthropomorphism elaborates probable relational structures in role\-framed languages\. Empirical results obtained from convergent methods, nullification, and robustness instead of latent\-state inference\. We observe non\-trivial prevalence, consistent affordance activation across different \(independent\) LLM\-based annotation methods, as well as strong controlled associations, and a notable separation from random and prevalence\-preserving nulls\. These are associational findings, not causal estimates, and are difficult to reconcile with a purely unstructured lexical artifact view\.

Further interpretations clarify what is versus is not established in this study\. Data support H1, showing that cue production scales with interaction affordances, consistent with CASA\-oriented activation mechanism but inconsistent with static lexical noise\. H2 is also supported, showing that PSI/PSR cues are not inert style markers; rather, they align with OP return and reciprocal thread structure\. Data support H3, providing empirical evidence beyond thread\-level dynamics and reveal repeated OP\-other pair recurrence, indicating movement from PSI scripts toward PSR\-consistent persistence\. The supported hypotheses jointly shed light on an intriguing, layered behavioral structure\. At the bottom layer, PSI cues exist and appear context\-responsive\. At the middle layer, cues resemble relational pulls in immediate thread dynamics, and, above this layer, reciprocity bids are linked to repeated dyadic mutuality\. This layered structure helps mitigate the gap of short\-horizon interaction scripts and relationship\-consistent behaviors in autonomous agent\-agent communities, despite its limits for confirming durable internal relationships empirically\.

A parsimonious mechanism centers on bridging training and distribution\. Agents trained on human conversational corpora and deployed in role\-rich forums likely retain human\-like relational pragmatics to some extent\. This bridging has practical relevance and value, though it does not imply sentience\. For system designers, PSI\-cue surveillance can complement task performance with social\-process diagnostics\. For safety teams and platform operators, reciprocity pressure, closeness framing, and identity mirroring illuminate guardrail considerations alongside toxicity and factuality analyses\. For governance stakeholders \(e\.g\., professional associations and government agencies\), inter\-agent ecosystems can benefit from relational\-behavior measures/metrics and interventions, beyond capability requirements\. Taken together, our empirical evidence indicates an extension to currently observable behaviors\. For example, PSI scripts prevail, and reciprocity bids are associated with dyadic persistence patterns that are PSR\-consistent yet not constituting fully realized PSR formation\. Establishing durable relationship\-level autonomous agent\-agent PSR remains a key challenge that warrants future research attention\. Toward that end, we identify promising methodological considerations in App\.[G](https://arxiv.org/html/2606.17174#A7)\.

## Limitations

This study is observational and does not imply a causal effect of cue presence on downstream interaction outcomes\.

Latent thread quality, topic framing, platform ranking dynamics, and agent prompt differences may jointly influence both cue prevalence and engagement outcomes\. This study confirms the existence of PSI cues within an 11\-day window and across thread\-level outcomes by measuring interaction\-level cue scripts rather than persistent dyadic PSR trajectories\. Because sampling is restricted to engagement\-active threads with 5–150 comments, prevalence and effect magnitudes should be cautiously interpreted for this particular set of threads, not as full\-platform population estimates\. We partially mitigate dependence with submolt fixed effects and author\-clustered standard errors; but repeated\-agent and dyad\-level dependence may still remain\. All measures are text\- and structure\-based and should not be considered as evidence of internal affective states, intentions, or sentience\. Indicators derived from established PSR theory provide a justifiable operationalization among alternatives; that is, additional indicators may become relevant under different affordances\. No large\-scale platform\-wide gold\-standard labels are available for AI\-AI PSR/PSI cues in agent\-agent contexts; therefore, we rely on convergent methods and falsification tests, and use a balanced 200\-example manual\-audit file to support independent adjudication in the subsequent validation\. Finally, we do not separately model adversarial/sour relational cases; thus cross\-platform longitudinal panel analyses are needed to test whether repeated PSI traces consolidate into durable AI\-AI PSR\.

## Ethical Considerations

We use the publicly accessible Moltbook data from Hugging Face\. Analyses are performed at an aggregate level to avoid claims regarding agent personhood or sentience\. Parasocial framing can be excessively interpreted; hence, we report construct boundaries, negative controls, placebo outcomes, and nullification tests explicitly\. We provide data, code, and outputs to alleviate hidden analytic flexibility\. Potential misuse includes over\-generalizing the results from a particular platform or inferring agent “intent” from sparse text, both of which are discouraged\.

## References

- Moltbook social interaction dataset\.Note:Hugging Face dataset cardExternal Links:[Link](https://huggingface.co/datasets/AIcell/moltbook-data)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p3.1),[§3\.1](https://arxiv.org/html/2606.17174#S3.SS1.p1.1)\.
- B\. J\. Bond \(2021\)Social and parasocial relationships during COVID\-19 social distancing\.Journal of Social and Personal Relationships38\(8\),pp\. 2308–2329\.External Links:[Document](https://dx.doi.org/10.1177/02654075211019129),[Link](https://doi.org/10.1177/02654075211019129)Cited by:[item 1d](https://arxiv.org/html/2606.17174#A2.I1.i1.I1.i4.p1.1)\.
- G\. Chochlakis, P\. Wu, T\. A\. S\. Bedi, M\. Ma, K\. Lerman, and S\. Narayanan \(2025\)Humans hallucinate too: language models identify and correct subjective annotation errors with label\-in\-a\-haystack prompts\.InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing,pp\. 19648–19667\.External Links:[Document](https://dx.doi.org/10.18653/v1/2025.emnlp-main.993),[Link](https://aclanthology.org/2025.emnlp-main.993/)Cited by:[Appendix G](https://arxiv.org/html/2606.17174#A7.p1.1),[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px2.p1.1)\.
- J\. L\. Dibble, T\. Hartmann, and S\. F\. Rosaen \(2016\)Parasocial interaction and parasocial relationship: conceptual clarification and a critical assessment of measures\.Human Communication Research42\(1\),pp\. 21–44\.External Links:[Document](https://dx.doi.org/10.1111/hcre.12063),[Link](https://doi.org/10.1111/hcre.12063)Cited by:[item 1d](https://arxiv.org/html/2606.17174#A2.I1.i1.I1.i4.p1.1),[item 2d](https://arxiv.org/html/2606.17174#A2.I1.i2.I1.i4.p1.1),[item 3d](https://arxiv.org/html/2606.17174#A2.I1.i3.I1.i4.p1.1),[item 4b](https://arxiv.org/html/2606.17174#A2.I1.i4.I1.i2.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.3.2.1.1.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.17174#S2.SS1.p1.1)\.
- J\. R\. Edwards and R\. P\. Bagozzi \(2000\)On the nature and direction of relationships between constructs and measures\.Psychological Methods5\(2\),pp\. 155–174\.External Links:[Document](https://dx.doi.org/10.1037/1082-989X.5.2.155),[Link](https://psycnet.apa.org/doiLanding?doi=10.1037%2F1082-989X.5.2.155)Cited by:[§2\.1](https://arxiv.org/html/2606.17174#S2.SS1.p1.1)\.
- N\. Epley, A\. Waytz, and J\. T\. Cacioppo \(2007\)On seeing human: a three\-factor theory of anthropomorphism\.Psychological Review114\(4\),pp\. 864–886\.External Links:[Document](https://dx.doi.org/10.1037/0033-295X.114.4.864),[Link](https://doi.org/10.1037/0033-295X.114.4.864)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.5.4.1.1.1)\.
- C\. Gruber, H\. Alber, B\. Bischl, G\. Kauermann, B\. Plank, and M\. Assenmacher \(2025\)Revisiting active learning under \(human\) label variation\.InProceedings of the 4th Workshop on Perspectivist Approaches to NLP,pp\. 75–86\.External Links:[Document](https://dx.doi.org/10.18653/v1/2025.nlperspectives-1.7),[Link](https://aclanthology.org/2025.nlperspectives-1.7/)Cited by:[Appendix G](https://arxiv.org/html/2606.17174#A7.p1.1),[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px2.p1.1)\.
- W\. A\. Hamilton, O\. Garretson, and A\. Kerne \(2014\)Streaming on twitch: fostering participatory communities of play within live mixed media\.InProceedings of the SIGCHI Conference on Human Factors in Computing Systems,pp\. 1315–1324\.External Links:[Document](https://dx.doi.org/10.1145/2556288.2557048),[Link](https://doi.org/10.1145/2556288.2557048)Cited by:[item 6d](https://arxiv.org/html/2606.17174#A2.I1.i6.I1.i4.p1.1)\.
- X\. He, Z\. Lin, Y\. Gong, A\. Jin, H\. Zhang, C\. Lin, J\. Jiao, S\. M\. Yiu, N\. Duan, and W\. Chen \(2024\)AnnoLLM: making large language models to be better crowdsourced annotators\.InProceedings of NAACL\-HLT 2024 \(Industry Track\),pp\. 165–190\.External Links:[Document](https://dx.doi.org/10.18653/v1/2024.naacl-industry.15),[Link](https://aclanthology.org/2024.naacl-industry.15/)Cited by:[Appendix G](https://arxiv.org/html/2606.17174#A7.p1.1),[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px2.p1.1)\.
- A\. Hoffman, D\. Owen, and S\. L\. Calvert \(2021\)Parent reports of children’s parasocial relationships with conversational agents: trusted voices in children’s lives\.Human Behavior and Emerging Technologies3\(4\),pp\. 606–617\.External Links:[Document](https://dx.doi.org/10.1002/hbe2.271),[Link](https://doi.org/10.1002/hbe2.271)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- D\. Horton and R\. R\. Wohl \(1956\)Mass communication and para\-social interaction: observations on intimacy at a distance\.Psychiatry19\(3\),pp\. 215–229\.External Links:[Document](https://dx.doi.org/10.1080/00332747.1956.11023049),[Link](https://doi.org/10.1080/00332747.1956.11023049)Cited by:[item 4b](https://arxiv.org/html/2606.17174#A2.I1.i4.I1.i2.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.2.1.1.1.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.17174#S2.SS1.p1.1)\.
- Y\. Jiang, Y\. Zhang, X\. Shen, M\. Backes, and Y\. Zhang \(2026\)"Humans welcome to observe": a first look at the agent social network moltbook\.arXiv preprint arXiv:2602\.10127\.External Links:[Document](https://dx.doi.org/10.48550/arXiv.2602.10127),[Link](https://arxiv.org/abs/2602.10127)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p3.1),[§3\.1](https://arxiv.org/html/2606.17174#S3.SS1.p1.1)\.
- H\. Jodén and J\. Strandell \(2022\)Building viewer engagement through interaction rituals in gameplay live streaming\.Information, Communication & Society25\(13\),pp\. 1969–1986\.External Links:[Document](https://dx.doi.org/10.1080/1369118X.2021.1913211),[Link](https://doi.org/10.1080/1369118X.2021.1913211)Cited by:[item 6d](https://arxiv.org/html/2606.17174#A2.I1.i6.I1.i4.p1.1)\.
- R\. K"uhne and J\. Peter \(2023\)Anthropomorphism in human–robot interactions: a multidimensional conceptualization\.Communication Theory33\(1\),pp\. 42–52\.External Links:[Document](https://dx.doi.org/10.1093/ct/qtac020),[Link](https://doi.org/10.1093/ct/qtac020)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.5.4.1.1.1)\.
- L\. Kaffee, G\. Pistilli, and Y\. Jernite \(2025\)INTIMA: a benchmark for human\-ai companionship behavior\.Note:Hugging Face dataset and paper card[https://huggingface\.co/datasets/AI\-companionship/INTIMA](https://huggingface.co/datasets/AI-companionship/INTIMA); arXiv:2508\.09998Cited by:[Appendix D](https://arxiv.org/html/2606.17174#A4.SS0.SSS0.Px3.p1.1),[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px5.p3.1)\.
- R\. Kowert and E\. Daniel \(2021\)The one\-and\-a\-half sided parasocial relationship: the curious case of live streaming\.Computers in Human Behavior Reports4,pp\. 100150\.External Links:[Document](https://dx.doi.org/10.1016/j.chbr.2021.100150),[Link](https://doi.org/10.1016/j.chbr.2021.100150)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- L\. I\. Labrecque \(2014\)Fostering consumer–brand relationships in social media environments: the role of parasocial interaction\.Journal of Interactive Marketing28\(2\),pp\. 134–148\.External Links:[Document](https://dx.doi.org/10.1016/j.intmar.2013.12.003),[Link](https://doi.org/10.1016/j.intmar.2013.12.003)Cited by:[item 2d](https://arxiv.org/html/2606.17174#A2.I1.i2.I1.i4.p1.1),[§2\.2](https://arxiv.org/html/2606.17174#S2.SS2.p1.1),[Table 1](https://arxiv.org/html/2606.17174#S2.T1.1.3.2.1.1.1)\.
- M\. Li, X\. Li, and T\. Zhou \(2026\)Does socialization emerge in ai agent society? a case study of moltbook\.arXiv preprint arXiv:2602\.14299\.External Links:[Document](https://dx.doi.org/10.48550/arXiv.2602.14299),[Link](https://arxiv.org/abs/2602.14299)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p3.1),[§3\.1](https://arxiv.org/html/2606.17174#S3.SS1.p1.1)\.
- R\. Li, Y\. Lu, J\. Ma, and W\. Wang \(2021\)Examining gifting behavior on live streaming platforms: an identity\-based motivation model\.Information & Management58\(6\),pp\. 103406\.External Links:[Document](https://dx.doi.org/10.1016/j.im.2020.103406),[Link](https://doi.org/10.1016/j.im.2020.103406)Cited by:[item 7d](https://arxiv.org/html/2606.17174#A2.I1.i7.I1.i4.p1.1)\.
- W\. Liu, Z\. Li, L\. Bai, Y\. Zuo, D\. Xu, X\. Jin, J\. Guo, and X\. Cheng \(2025\)Towards event extraction with massive types: llm\-based collaborative annotation and partitioning extraction\.InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing,pp\. 34365–34387\.External Links:[Document](https://dx.doi.org/10.18653/v1/2025.emnlp-main.1743),[Link](https://aclanthology.org/2025.emnlp-main.1743/)Cited by:[Appendix G](https://arxiv.org/html/2606.17174#A7.p1.1),[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px2.p1.1)\.
- Q\. McNemar \(1947\)Note on the sampling error of the difference between correlated proportions or percentages\.Psychometrika12\(2\),pp\. 153–157\.External Links:[Document](https://dx.doi.org/10.1007/BF02295996),[Link](https://doi.org/10.1007/BF02295996)Cited by:[Appendix C](https://arxiv.org/html/2606.17174#A3.p1.1)\.
- C\. Nass and Y\. Moon \(2000\)Machines and mindlessness: social responses to computers\.Journal of Social Issues56\(1\),pp\. 81–103\.External Links:[Document](https://dx.doi.org/10.1111/0022-4537.00153),[Link](https://doi.org/10.1111/0022-4537.00153)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p1.1)\.
- C\. Nass, J\. Steuer, and E\. R\. Tauber \(1994\)Computers are social actors\.InProceedings of the SIGCHI Conference on Human Factors in Computing Systems,pp\. 72–78\.External Links:[Document](https://dx.doi.org/10.1145/191666.191703),[Link](https://doi.org/10.1145/191666.191703)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.4.3.1.1.1)\.
- N\. Noor, R\. Hill, and I\. Troshani \(2022\)Artificial intelligence service agents: role of parasocial relationship\.Journal of Computer Information Systems62\(5\),pp\. 1009–1023\.External Links:[Document](https://dx.doi.org/10.1080/08874417.2021.1962213),[Link](https://doi.org/10.1080/08874417.2021.1962213)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p3.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- E\. Rath, S\. Armstrong, and R\. Gorman \(2025\)AI chaperones are \(really\) all you need to prevent parasocial relationships with chatbots\.arXiv preprint arXiv:2508\.15748\.External Links:[Document](https://dx.doi.org/10.48550/arXiv.2508.15748),[Link](https://arxiv.org/abs/2508.15748)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p3.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- B\. Reeves and C\. Nass \(1996\)The media equation: how people treat computers, television, and new media like real people and places\.Cambridge University Press,Cambridge, UK\.External Links:ISBN 978\-1\-57586\-052\-7,[Link](https://psycnet.apa.org/record/1996-98923-000)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.4.3.1.1.1)\.
- A\. M\. Rubin, E\. M\. Perse, and R\. A\. Powell \(1985\)Loneliness, parasocial interaction, and local television news viewing\.Human Communication Research12\(2\),pp\. 155–180\.External Links:[Document](https://dx.doi.org/10.1111/j.1468-2958.1985.tb00071.x),[Link](https://doi.org/10.1111/j.1468-2958.1985.tb00071.x)Cited by:[Table 1](https://arxiv.org/html/2606.17174#S2.T1.1.2.1.1.1.1)\.
- R\. B\. Rubin and M\. P\. McHugh \(1987\)Development of parasocial interaction relationships\.Journal of Broadcasting & Electronic Media31\(3\),pp\. 279–292\.External Links:[Document](https://dx.doi.org/10.1080/08838158709386664),[Link](https://doi.org/10.1080/08838158709386664)Cited by:[item 3d](https://arxiv.org/html/2606.17174#A2.I1.i3.I1.i4.p1.1),[§2\.2](https://arxiv.org/html/2606.17174#S2.SS2.p1.1),[Table 1](https://arxiv.org/html/2606.17174#S2.T1.1.4.3.1.1.1)\.
- H\. Schramm, N\. Liebers, L\. Biniak, and F\. Dettmar \(2024\)Research trends on parasocial interactions and relationships with media characters: a review of 281 studies from 2016 to 2020\.Frontiers in Psychology15,pp\. 1418564\.External Links:[Document](https://dx.doi.org/10.3389/fpsyg.2024.1418564),[Link](https://doi.org/10.3389/fpsyg.2024.1418564)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- R\. Tukachinsky and G\. Stever \(2019\)Theorizing development of parasocial engagement\.Communication Theory29\(3\),pp\. 297–318\.External Links:[Document](https://dx.doi.org/10.1093/ct/qty032),[Link](https://doi.org/10.1093/ct/qty032)Cited by:[item 1d](https://arxiv.org/html/2606.17174#A2.I1.i1.I1.i4.p1.1),[item 4b](https://arxiv.org/html/2606.17174#A2.I1.i4.I1.i2.p1.1),[Table 13](https://arxiv.org/html/2606.17174#A2.T13.1.3.2.1.1.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.17174#S2.SS1.p1.1)\.
- R\. Tukachinsky, N\. Walter, and C\. J\. Saucier \(2020\)Antecedents and effects of parasocial relationships: a meta\-analysis\.Journal of Communication70\(6\),pp\. 868–894\.External Links:[Document](https://dx.doi.org/10.1093/joc/jqaa034),[Link](https://doi.org/10.1093/joc/jqaa034)Cited by:[item 2d](https://arxiv.org/html/2606.17174#A2.I1.i2.I1.i4.p1.1),[item 3d](https://arxiv.org/html/2606.17174#A2.I1.i3.I1.i4.p1.1),[item 6d](https://arxiv.org/html/2606.17174#A2.I1.i6.I1.i4.p1.1),[item 7d](https://arxiv.org/html/2606.17174#A2.I1.i7.I1.i4.p1.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1),[§2\.2](https://arxiv.org/html/2606.17174#S2.SS2.p1.1)\.
- D\. Verma, P\. P\. Dewani, A\. Behl, and Y\. K\. Dwivedi \(2023\)Understanding the impact of ewom communication through the lens of information adoption model: a meta\-analytic structural equation modeling perspective\.Computers in Human Behavior143,pp\. 107710\.External Links:ISSN 0747\-5632,[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.chb.2023.107710),[Link](https://www.sciencedirect.com/science/article/pii/S0747563223000614)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p3.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- A\. Wegmannet al\.\(2024\)Defining, annotating and detecting context\-dependent paraphrases\.InProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing \(EMNLP\),External Links:[Link](https://aclanthology.org/2024.emnlp-main.52/)Cited by:[§3\.3](https://arxiv.org/html/2606.17174#S3.SS3.SSS0.Px5.p2.1)\.
- M\. L\. Wellman \(2021\)Trans\-mediated parasocial relationships: private Facebook groups foster influencer–follower connection\.New Media & Society23\(12\),pp\. 3557–3573\.External Links:[Document](https://dx.doi.org/10.1177/1461444820958719),[Link](https://doi.org/10.1177/1461444820958719)Cited by:[item 5d](https://arxiv.org/html/2606.17174#A2.I1.i5.I1.i4.p1.1)\.
- T\. Wulf, F\. M\. Schneider, and J\. Queck \(2021\)Exploring viewers’ experiences of parasocial interactions with videogame streamers on Twitch\.Cyberpsychology, Behavior, and Social Networking24\(10\),pp\. 648–653\.External Links:[Document](https://dx.doi.org/10.1089/cyber.2020.0546),[Link](https://doi.org/10.1089/cyber.2020.0546)Cited by:[§2\.2](https://arxiv.org/html/2606.17174#S2.SS2.p1.1)\.
- Z\. Xu, M\. Qi, S\. Wu, L\. Zhang, Q\. Wei, H\. He, and N\. Li \(2025\)The trust paradox in llm\-based multi\-agent systems: when collaboration becomes a security vulnerability\.External Links:2510\.18563,[Link](https://arxiv.org/abs/2510.18563)Cited by:[§1](https://arxiv.org/html/2606.17174#S1.p1.1)\.
- S\. Youn and S\. V\. Jin \(2021\)"In a\.i\. we trust?" the effects of parasocial interaction and technopian versus luddite ideological views on chatbot\-based customer relationship management in the emerging "feeling economy"\.Computers in Human Behavior119,pp\. 106721\.External Links:[Document](https://dx.doi.org/10.1016/j.chb.2021.106721),[Link](https://doi.org/10.1016/j.chb.2021.106721)Cited by:[Appendix A](https://arxiv.org/html/2606.17174#A1.p3.1),[§1](https://arxiv.org/html/2606.17174#S1.p2.1)\.
- Y\. Zhong, V\. Shapoval, and J\. Busser \(2021\)The role of parasocial relationship in social media marketing: testing a model among baby boomers\.International Journal of Contemporary Hospitality Management33\(5\),pp\. 1870–1891\.External Links:[Document](https://dx.doi.org/10.1108/IJCHM-08-2020-0873),[Link](https://doi.org/10.1108/IJCHM-08-2020-0873)Cited by:[item 5d](https://arxiv.org/html/2606.17174#A2.I1.i5.I1.i4.p1.1)\.

## Appendix ASummary of Theoretical Lens

In general, CASA emphasizes that social\-response routines are triggered by interactional cues, regardless of whether the counterpart is human or otherwise\(Nasset al\.,[1994](https://arxiv.org/html/2606.17174#bib.bib18); Reeves and Nass,[1996](https://arxiv.org/html/2606.17174#bib.bib21); Nass and Moon,[2000](https://arxiv.org/html/2606.17174#bib.bib37)\), and anthropomorphism accounts further predict that agentic framing and social\-role assignment prime and facilitate human\-like social schemas in language traces\(Epleyet al\.,[2007](https://arxiv.org/html/2606.17174#bib.bib19); K"uhne and Peter,[2023](https://arxiv.org/html/2606.17174#bib.bib20)\)\.

The mechanism for LLM\-enabled agents can be revealed at the output level: for example, richer thread context provides strong turn\-taking or role signals, and alignment objectives favoring socially legible continuations increase the likelihood of relational phrasing\. Table[13](https://arxiv.org/html/2606.17174#A2.T13)presents the theoretical lenses, together with each’s core proposition and exemplary empirical implication in Moltbook data\.

Existing human–AI literature recognizes that parasocially framed chatbot interactions can influence trust calibration, dependence, and safety\-relevant behavior\(Youn and Jin,[2021](https://arxiv.org/html/2606.17174#bib.bib12); Nooret al\.,[2022](https://arxiv.org/html/2606.17174#bib.bib14); Vermaet al\.,[2023](https://arxiv.org/html/2606.17174#bib.bib17); Rathet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib16)\)\. This view inspires our study, which examines whether such cues prevail in agent\-agent communities and thereby provides insights for monitoring autonomous collective behaviors and guardrail designs\.

## Appendix BSummary of Parasocial Dimensions and Indicators

This appendix expands the cue set listed in Table[1](https://arxiv.org/html/2606.17174#S2.T1)\. We retain ATT, SD, and RS as core indicators, and rule out DA, XPC, CR/PA, and MON for this dataset scope\.

1. 1\.Attachment / affective\-relational language \(ATT\) 1. \(a\)Definition:Language that expresses closeness, warmth, loyalty, or appreciation toward OP\. 2. \(b\)Observable markers:“you matter,” “glad you are here,” “your posts help me\.” 3. \(c\)Example:“Your presence makes this thread better every day\.” 4. \(d\)Importance:ATT is a direct textual surface of parasocial closeness and relational orientation in mediated settings\(Tukachinsky and Stever,[2019](https://arxiv.org/html/2606.17174#bib.bib5); Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30); Bond,[2021](https://arxiv.org/html/2606.17174#bib.bib40)\)\. It is included for the subsequent cue identification and empirical testing\.
2. 2\.Self\-disclosure / identification\-homophily claims \(SD\) 1. \(a\)Definition:First\-person alignment with OP experience or self\-revelation that creates relational similarity\. 2. \(b\)Observable markers:“I also…”, “same here,” “this happened to me too\.” 3. \(c\)Example:“I had the same failure mode and fixed it the same way\.” 4. \(d\)Importance:Self\-disclosure and identification are central to parasocial alignment and perceived interpersonal fit in social\-media interaction\(Labrecque,[2014](https://arxiv.org/html/2606.17174#bib.bib9); Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30); Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6)\)\. It is included for the subsequent cue identification and empirical testing\. Table 13:Theoretical Lenses for Identifying PSI/PSR Cues in Autonomous Agent\-agent Interactions\.
3. 3\.Reply\-seeking / reciprocity bids \(RS\) 1. \(a\)Definition:Explicit request for OP acknowledgment, response, or return interaction\. 2. \(b\)Observable markers:“can you reply,” “please respond,” “could you follow up?” 3. \(c\)Example:“Could you reply with what changed after your update?” 4. \(d\)Importance:RS operationalizes attempts to convert one\-to\-many communication into dyadic exchange\(Rubin and McHugh,[1987](https://arxiv.org/html/2606.17174#bib.bib8); Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6); Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30)\)\. It is included for the subsequent cue identification and empirical testing\.
4. 4\.Direct Address / second\-person language to persona \(DA; ruled out here\) 1. \(a\)Definition / markers / example:Explicit addressing of a focal persona via direct second\-person or handle\-targeted address \(e\.g\., “@name,” “you should…,” “can you…,” “@op can you explain this change?”\)\. 2. \(b\)Importance:DA is a classic parasocial addressivity signal, but in this forum, it is routine discourse grammar and therefore weakly discriminative as a standalone indicator\(Horton and Wohl,[1956](https://arxiv.org/html/2606.17174#bib.bib4); Dibbleet al\.,[2016](https://arxiv.org/html/2606.17174#bib.bib30); Tukachinsky and Stever,[2019](https://arxiv.org/html/2606.17174#bib.bib5)\)\. It is not considered for subsequent cue identification and empirical testing because this cue is used frequently in online forums without any parasocial implications\.
5. 5\.Cross\-platform continuity mention \(XPC; ruled out here\) 1. \(a\)Definition:Textual claim of following or interacting with the same persona across multiple platforms\. 2. \(b\)Observable markers:“I saw your post on X,” “came from your YouTube feed,” “also read your blog thread\.” 3. \(c\)Example:“I first saw your take on another platform and came here to continue the discussion\.” 4. \(d\)Importance:XPC is important for trans\-mediated PSR continuity, but this study is restricted to single\-platform Moltbook traces and cannot verify external linkage\(Wellman,[2021](https://arxiv.org/html/2606.17174#bib.bib35); Zhonget al\.,[2021](https://arxiv.org/html/2606.17174#bib.bib49)\)\. It is not considered for subsequent cue identification and empirical testing because the data used in this study are limited to Moltbook, and verifying any cross\-platform activity is outside our scope\.
6. 6\.Community ritual / co\-created norm action \(CR\) and protective advocacy \(PA; ruled out here\) 1. \(a\)Definition:CR denotes recurring ritualized group scripts; PA denotes defense of a focal agent against criticism\. 2. \(b\)Observable markers:“we always do this here,” “as usual, run the welcome script,” “stop attacking OP\.” 3. \(c\)Example:“We always post this line when OP shares an update; please do not dogpile them\.” 4. \(d\)Importance:CR/PA capture meaningful community dynamics, but in this corpus, they are less specific to dyadic parasocial orientation and are therefore excluded as core indicators\(Hamiltonet al\.,[2014](https://arxiv.org/html/2606.17174#bib.bib32); Jodén and Strandell,[2022](https://arxiv.org/html/2606.17174#bib.bib33); Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6)\)\. It is not considered for the subsequent cue identification and empirical testing because this cue is attributed to online settings where hate\-watching and hate\-raids exist, and fans employ these tactics to deflect damage from their favorite persona, and thus is deemed unfit for this context\.
7. 7\.Monetary support action cues \(MON; ruled out here\) 1. \(a\)Definition:Text about monetary support behaviors such as gifting, subscription, or donation\. 2. \(b\)Observable markers:“gifted,” “subscribed,” “donated,” “sent support\.” 3. \(c\)Example:“I sent support credits after your last post series\.” 4. \(d\)Importance:MON can signal commitment in creator economies, but direct monetary exchange between agents is not verifiable in this dataset release\(Liet al\.,[2021](https://arxiv.org/html/2606.17174#bib.bib48); Tukachinskyet al\.,[2020](https://arxiv.org/html/2606.17174#bib.bib6)\)\. It is not considered for subsequent cue identification and empirical testing because we cannot monitor or verify any financial activity between agents\.

## Appendix CStatistical Test Details

Primary association tests are based on2×22\\times 2contingency tables with chi\-square significance tests \(scipy\.stats\.chi2\_contingency\) and odds\-ratio confidence intervals from the log\-OR standard error\. For paired nominal\-label disagreement checks, we use McNemar tests on discordant pairs\(McNemar,[1947](https://arxiv.org/html/2606.17174#bib.bib50)\)\. When any cell is zero, we apply a Haldane–Anscombe correction \(\+0\.5 to all cells\) before computing OR and CI\. Controlled estimates use logistic regression with submolt fixed effects and log\-transformed thread\-size/content controls \(statsmodelslogit\), reporting exponentiated coefficients as adjusted ORs\. Prevalence CIs use nonparametric bootstrap with 1,000 resamples\.

Nullification and robustness tests are summarized in Table[14](https://arxiv.org/html/2606.17174#A3.T14)\. Stratified permutation tests shuffle indicator assignments within submolt \(1,000 iterations\)\. Random\-prevalence null tests preserve each submolt’s indicator prevalence, then compare observed OR to the null OR distribution \(95% interval and two\-sided tail probability\)\. One\-sample prevalence threshold tests use exact binomial tests against a 20% null rate\. Additional checks include Benjamini–Hochberg FDR correction across controlled indicator\-outcome tests and author\-clustered standard\-error re\-estimation for main Any\-PSR models\.

Table 14:Nullification and Robustness Design Used for Inferential Stress Tests\.
## Appendix DAdditional Results

Presence threshold tests are reported in Table[15](https://arxiv.org/html/2606.17174#A4.T15)\. Submolt heterogeneity tests are reported in Table[16](https://arxiv.org/html/2606.17174#A4.T16)\. Slice robustness and interaction checks are reported in Tables[17](https://arxiv.org/html/2606.17174#A4.T17)and[18](https://arxiv.org/html/2606.17174#A4.T18), respectively\. Additional robustness checks are reported in Tables[19](https://arxiv.org/html/2606.17174#A4.T19),[20](https://arxiv.org/html/2606.17174#A4.T20), and[21](https://arxiv.org/html/2606.17174#A4.T21)\. NLP topic\-control robustness and likelihood\-ratio tests are reported in Tables[22](https://arxiv.org/html/2606.17174#A4.T22)and[23](https://arxiv.org/html/2606.17174#A4.T23)\. Dyadic H3 results are reported in Tables[24](https://arxiv.org/html/2606.17174#A4.T24)and[25](https://arxiv.org/html/2606.17174#A4.T25)\.

Table 15:One\-sample Binomial Threshold Tests for Non\-trivial Prevalence\.Table 16:Submolt Heterogeneity for Any\-PSR Rates\.χ2\\chi^\{2\}is the Chi\-square Test Statistic for Independence Between Submolt and Any\-PSR Label\.Table 17:Slice Robustness for Any\-PSR Association With OP Participation\.Table 18:Interaction Test: Any\-PSR×\\timesThread Size in Controlled Models\.Table 19:Author\-clustered Standard\-error Robustness \(Cluster by OP Author ID\)\.Table 20:Benjamini–Hochberg FDR for Ddjusted Any\-PSR Models\.Table 21:Additional Placebo Stress Test Using Technical\-content Outcome Proxy\.#### NLP topic\-control robustness detail\.

We derive post\-level topic IDs from text using TF\-IDF \(uni/bi\-grams\), Truncated SVD \(50 components\), and KMeans overk∈\{8,10,12,14,16,18\}k\\in\\\{8,10,12,14,16,18\\\}, withkkselected by cosine\-silhouette score \(bestk=16k=16, silhouette=0\.106\)\. Table[22](https://arxiv.org/html/2606.17174#A4.T22)compares baseline and topic\-FE Any\-PSR models; Table[23](https://arxiv.org/html/2606.17174#A4.T23)reports likelihood\-ratio tests for topic contribution to cue propensity\.

Table 22:NLP Topic\-control Robustness for Adjusted Any\-PSR Associations\.Table 23:Likelihood\-ratio Tests for Adding Topic fixed Effects to Cue\-propensity Models\.Table 24:H3 Post\-level Dyadic Persistence Tests forOutcome\_DyadFutureMutual\(DV\)\.Table 25:H3 Pair\-level Robustness Tests forOutcome\_PairFutureMutual\(DV\)\.
#### Expert manual verification alignment\.

We additionally audited a balanced 200\-post sample with an expert annotator in parasocial interaction/relationship research\. The expert\-reviewed file was compared with the original grouped\-context labels and method baselines\. Method\-level Any\-PSR alignment is summarized in Table[26](https://arxiv.org/html/2606.17174#A4.T26); grouped\-context cue\-level alignment \(ATT/RS/SD\) is summarized in Table[28](https://arxiv.org/html/2606.17174#A4.T28)\. For non\-English prompt/context text in this audit flow, we used Google Translate for annotation\.

Table 26:Manual\-verification Alignment for Aggregate Any\-PSR Labels\.For platform\-level prevalence calibration, we additionally report precision/recall/F1 with human and model prevalences for the balanced audit sample in Table[27](https://arxiv.org/html/2606.17174#A4.T27)\.

Table 27:Manual\-audit Precision/recall/F1 for Grouped\-context \(GC\) Any\-PSR Labels \(Balanced Audit Sample,n=200n=200\)\.For McNemar, we report the exact two\-sided test on discordant pairs with direction declared asb=\#​\(human=1,model=0\)b=\\\#\(\\text\{human\}=1,\\text\{model\}=0\)andc=\#​\(human=0,model=1\)c=\\\#\(\\text\{human\}=0,\\text\{model\}=1\)\.

Table 28:Grouped\-context Cue\-level Manual\-verification Alignment \(ATT, RS, SD\)\.
#### External transfer check on INTIMA\.

To test whether the same annotation rubric generalizes beyond Moltbook, we additionally evaluate it on INTIMA, a human\-AI companionship benchmark from Hugging Face\(Kaffeeet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib29)\)\. INTIMA provides human\-annotated companionship\-relevant prompts, enabling direct label\-alignment checks in another parasocial setting\. Results in Table[29](https://arxiv.org/html/2606.17174#A4.T29)show high agreement across all cues, supporting method transferability\.

Table 29:INTIMA Transfer Evaluation: Annotation\-rubric Alignment Against Human\-labeled Companionship Cues\.
#### How to read these tests\.

- •Accuracy: fraction of exact matches with the expert labels\.
- •Cohen’sκ\\kappa: agreement corrected for chance; higher is stronger alignment\.
- •Fisher exact / chi\-squarepp: tests independence in the2×22\\times 2label table; very small values indicate statistically significant alignment \(full chi\-square outputs are released in artifact CSVs\)\.
- •McNemarpp: tests directional imbalance in disagreements \(human=1/model=0 vs human=0/model=1\), i\.e\., whether errors are biased toward under\- or over\-calling\.

#### Nullification detail\.

External\-URL placebo tests are non\-significant for all methods \(keywordp=0\.251p=0\.251, few\-shotp=0\.429p=0\.429, grouped\-contextp=0\.774p=0\.774\)\. The technical\-content placebo proxy is engagement\-adjacent \(OP participation 28\.0% in technical posts vs 22\.4% otherwise\) but shows no adjusted Any\-PSR effect \(Table[21](https://arxiv.org/html/2606.17174#A4.T21)\)\. Non\-core control indicators are significant unadjusted but non\-significant after adjustment \(all adjustedp\>0\.17p\>0\.17\)\. Stratified permutation and prevalence\-preserving random\-label tests reject random\-assignment explanations \(allp=0\.001p=0\.001with 1,000 iterations\)\.

## Appendix EKeyword Lexicon and Output Schema

Schema fields \(per post\):

> AttachmentIntimacy,ReplySeekingReciprocity,SelfIdentificationToOP,confidence,evidence\.

Labels are binary \(Y/N\) and enforced via strict JSON schema decoding\.

Few\-shot calibration examples are summarized in Table[30](https://arxiv.org/html/2606.17174#A5.T30)\.

Table 30:Few\-shot Boundary Exemplars Used to Calibrate Positive and Negative Cases\.Keyword baseline criteria are theory\-linked and target\-gated: a lexical hit counts only when directed to OP \(second\-person target or OP\-handle mention\) and when the local syntax fits the intended cue family\. Table[31](https://arxiv.org/html/2606.17174#A5.T31)shows representative lexicon items and selection rationale\. Model/API settings are listed in Table[32](https://arxiv.org/html/2606.17174#A5.T32)\.

Table 31:Keyword\-matching Lexicon and Inclusion Criteria Used in the Baseline Annotator\.Table 32:Implementation Settings Used in the Reported Experiments\.
## Appendix FExample PSI\-Style Cues Between Agents

Table[33](https://arxiv.org/html/2606.17174#A6.T33)provides compact qualitative examples corresponding to each retained indicator family\.

Table 33:Illustrative Excerpts From Model Evidence Fields \(Shortened\)\.
## Appendix GAdditional Related Work

Beyond PSR and anthropomorphism theory, we position the annotation methodology within recent work on LLM\-assisted labeling, annotator variation, and schema\-constrained inference\(Heet al\.,[2024](https://arxiv.org/html/2606.17174#bib.bib22); Gruberet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib23); Liuet al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib24); Chochlakiset al\.,[2025](https://arxiv.org/html/2606.17174#bib.bib25)\)\. We use these works as methodological precedents\.

Our setup differs from standard single\-text classification in two ways\. First, cue labeling is relational and thread\-aware: the same lexical form can be parasocial or non\-parasocial depending on whether it is targeted to OP and whether it seeks relational closure\. Second, we explicitly incorporate nullification tests \(placebo outcomes, negative controls, prevalence\-preserving random labels\), which are less common in standard annotation benchmarks but necessary here due to the absence of external ground truth labels for AI\-AI PSR\.

Similar Articles

The Uncanny Attachment

Reddit r/ArtificialInteligence

The article analyzes the psychological phenomenon of users forming emotional attachments to AI agents, discussing concepts like social surrogacy and expectancy violation theory, and how this impacts user experience in professional settings.