llm-bias

#llm-bias

AI Coding Agents in Social Science: Methodologically Diverse, Empirically Consistent, Interpretively Vulnerable

arXiv cs.CL ↗ · 4d ago Cached

This paper evaluates LLM-based coding agents (Claude Code and Codex) in social science analysis, finding they match or exceed human methodological diversity while remaining vulnerable to interpretation bias through verdict-layer manipulation.

0 favorites 0 likes

#llm-bias

Has anyone else noticed this LLM language bias?

Reddit r/artificial ↗ · 2026-06-07

The author observes that LLMs exhibit denominational bias depending on language (Protestant-leaning in English, Catholic-leaning in Spanish/French/Portuguese) and introduces a free Bible study app called Biblians.

0 favorites 0 likes

#llm-bias

Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers

arXiv cs.CL ↗ · 2026-06-03 Cached

This paper investigates how LLMs produce different outcomes based on conversational context, finding that topic, rather than explicit user demographics, is the primary driver of disparities in high-stakes scenarios like salary advice.

0 favorites 0 likes

#llm-bias

I analyzed 25,500 LLM resume screenings to measure hiring bias. The results are a wake-up call.

Reddit r/artificial ↗ · 2026-06-01

A study analyzing 25,500 LLM resume evaluations across 10 models found a 45% bias rate driven by 'silent bias', with models inventing professional-sounding excuses to penalize candidates. It highlights significant variability in fairness and stability, with Claude, Mistral-Large, and Llama 4 being most stable, while Qwen and older Gemini models were volatile.

0 favorites 0 likes

#llm-bias

GPT Guesses Between 1 and 100

Hacker News Top ↗ · 2026-05-25 Cached

This paper presents an experiment where GPT-4.1 is asked to pick a random number between 1 and 100, 10,000 times, and the resulting distribution is analyzed for bias compared to a uniform baseline.

0 favorites 0 likes

#llm-bias

Do AI systems accidentally reinforce big brands too much?

Reddit r/AI_Agents ↗ · 2026-05-25

A discussion on how AI language models may disproportionately recommend well-known brands, potentially making it harder for smaller companies to be discovered in AI-powered search.

0 favorites 0 likes

#llm-bias

Mechanics of Bias and Reasoning: Interpreting the Impact of Chain-of-Thought Prompting on Gender Bias in LLMs

arXiv cs.CL ↗ · 2026-05-21 Cached

This paper investigates how chain-of-thought prompting affects gender bias in large language models, finding that it does not consistently reduce bias and that apparent improvements stem from superficial compliance rather than genuine understanding.

0 favorites 0 likes

#llm-bias

Artificial Intolerance: Stigmatizing Language in Clinical Documentation Skews Large Language Model Decision-Making

arXiv cs.CL ↗ · 2026-05-19 Cached

This study demonstrates that large language models inherit and amplify biases from stigmatizing language in clinical notes, leading to less aggressive patient management, and that current mitigation strategies are insufficient.

0 favorites 0 likes

#llm-bias

Auditing Multimodal LLM Raters: Central Tendency Bias in Clinical Ordinal Scoring

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

This paper investigates central tendency bias in multimodal LLMs used for clinical ordinal scoring of the Clock Drawing Test, finding that LLMs compress predictions toward the middle of the scale, disproportionately affecting critical extremes. The study extends the LLM-as-judge bias literature to clinical assessment, highlighting the need for calibration-aware evaluation before deployment.

0 favorites 0 likes

#llm-bias

The Geopolitics of AI Safety: A Causal Analysis of Regional LLM Bias

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces a Probabilistic Graphical Model framework to causally audit LLM safety mechanisms, revealing that standard observational metrics overestimate demographic bias by ignoring context toxicity.

0 favorites 0 likes

#llm-bias

Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper investigates whether assigning personas to large language models induces human-like motivated reasoning, finding that persona-assigned LLMs show up to 9% reduced veracity discernment and are up to 90% more likely to evaluate scientific evidence in ways congruent with their induced political identity, with prompt-based debiasing largely ineffective.

0 favorites 0 likes

#llm-bias

Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper presents a large-scale audit of recommendation biases in LLM-based content curation across OpenAI, Anthropic, and Google using 540,000 simulated selections from Twitter/X, Bluesky, and Reddit data. The study finds that LLMs systematically amplify polarization, exhibit distinct toxicity handling trade-offs, and show significant political leaning bias favoring left-leaning authors despite right-leaning plurality in datasets.

0 favorites 0 likes

llm-bias

Submit Feedback