computational-social-science

#computational-social-science

The cognitive, affective, and behavioral expression of self-stigma among people who use drugs in online substance use communities

arXiv cs.CL ↗ · 2026-06-25 Cached

This paper develops a codebook for self-stigma among people who use drugs and analyzes 72,115 Reddit posts to examine prevalence, co-occurrence, and temporal patterns of cognitive, affective, and behavioral stigma indicators, finding that self-stigma is expressed as an integrated phenomenon with behavioral indicators often preceding core indicators.

0 favorites 0 likes

#computational-social-science

Quantifying Media Representation Dynamics Across 25 Years of News Reporting on Policing-related Deaths

arXiv cs.CL ↗ · 2026-06-08 Cached

This paper presents the largest computational analysis of Canadian news coverage of police-involved deaths over 25 years, introducing a novel model (PerspectiveGap) that quantifies the dominance of state bureaucrat perspectives compared to civilian voices in media narratives.

0 favorites 0 likes

#computational-social-science

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

arXiv cs.CL ↗ · 2026-06-03 Cached

This paper introduces conditional hypothesis generation, a framework that incorporates researcher-specified covariates to steer LLM-based text analysis toward discovering meaningful subgroup differences while addressing confounds like stratum imbalance and sign reversal.

0 favorites 0 likes

#computational-social-science

Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities

arXiv cs.CL ↗ · 2026-06-02 Cached

This paper discusses the need for multilingual LLMs that are epistemically grounded and responsible for applications in computational social science and humanities.

0 favorites 0 likes

#computational-social-science

Slogans or Stance? A Label-Light Diagnostic for Entrepreneurial-Discourse Measurement on Chinese SOE Speeches

arXiv cs.CL ↗ · 2026-05-29 Cached

This paper proposes a label-light measurement diagnostic to evaluate whether popular text analysis methods (dictionaries, topic models, embeddings, LLMs) capture substantive stance versus symbolic rhetoric in entrepreneurial-discourse measurement, using a corpus of 80 Chinese SOE speeches and a natural experiment with same-company different-speaker pairs. The authors find that zero-shot LLMs show higher sensitivity but a significant portion of the effect may be due to speaker idiolect rather than substantive stance.

0 favorites 0 likes

#computational-social-science

Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus

arXiv cs.CL ↗ · 2026-05-22 Cached

This paper presents the Arabic Women and Society Corpus, a ten-year collection of over 250,000 Arabic Facebook posts related to women's empowerment and social wellbeing, with engagement metrics for analyzing gender discourse and sentiment.

0 favorites 0 likes

#computational-social-science

The Proxy Presumption: From Semantic Embeddings to Valid Social Measures

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper critiques the 'Proxy Presumption' in NLP, where geometric embedding properties are incorrectly equated with social constructs. It introduces the Construct Validity Protocol and Counterfactual Neutralization methods to ensure rigorous validation of social measures derived from semantic embeddings.

0 favorites 0 likes

computational-social-science

The cognitive, affective, and behavioral expression of self-stigma among people who use drugs in online substance use communities

Quantifying Media Representation Dynamics Across 25 Years of News Reporting on Policing-related Deaths

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities

Slogans or Stance? A Label-Light Diagnostic for Entrepreneurial-Discourse Measurement on Chinese SOE Speeches

Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus

The Proxy Presumption: From Semantic Embeddings to Valid Social Measures

Submit Feedback