gender-bias

#gender-bias

Explaining GAND: A Resource on Gender-Ambiguous Natural Data & Contrastive Attribution

arXiv cs.CL ↗ · 2d ago Cached

This paper introduces GAND, a benchmarking resource of gender-ambiguous natural English sentences for analyzing gender bias in machine translation, and presents an interpretability analysis using contrastive translations to reveal source words influencing gender assignment.

0 favorites 0 likes

#gender-bias

Evaluated 6 frontier LLMs (GPT-5.4, Claude Sonnet 4.6, Claude Opus 4.7, Gemini Pro/Flash, Grok 4.3) on political, gender, and racial bias across 8 benchmarks (~20,600 examples) [R]

Reddit r/MachineLearning ↗ · 2d ago

A solo evaluation of six frontier LLMs on 8 bias benchmarks finds that most models lean left politically, and Grok's self-reported right-leaning stance is inconsistent with its left-leaning behavior. Refusal rates vary, with GPT-5.4 refusing 20% of race-related questions.

0 favorites 0 likes

#gender-bias

Can LLMs Hire Fairly? Racial Bias in Resume Screening

arXiv cs.CL ↗ · 2026-06-30 Cached

This paper audits 14 large language models for hiring discrimination using a paired-resume methodology, finding that older models exhibit pro-White bias while newer models show null or pro-Black bias, indicating a reversal in algorithmic hiring bias across model generations.

0 favorites 0 likes

#gender-bias

Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios

arXiv cs.CL ↗ · 2026-06-15 Cached

This paper introduces GAMA-Bench, a benchmark of 1,298 gender-mirrored conflict scenarios, and finds that LLMs consistently apply harsher punitive and blame-centered framing to male actors while giving female actors more empathetic and therapeutic responses for the same misconduct.

0 favorites 0 likes

#gender-bias

Anchoring LLM Gender Bias to Human Baselines: A Cross-Lingual Audit

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper audits six large language models for gender stereotyping across English, Korean, Chinese, and Japanese, anchoring against human baselines. It finds that LLM stereotyping often exceeds human cross-country variation and can compound across languages, introducing a four-pattern framework to characterize such behaviors.

0 favorites 0 likes

#gender-bias

Neuron-Level Interventions for Gendered and Gender-Neutral Generation in Language Models

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper proposes a neuron-level intervention method to identify gender-specific neurons in language models (feminine, masculine, gender-neutral) and steer sentence generation toward a target gender form while preserving meaning, with experiments showing precise control and bias mitigation.

0 favorites 0 likes

#gender-bias

Your Multimodal Speech Model Says I Have a Face for Radio

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper presents the first bias evaluation of multimodal speech recognition models, finding significant accuracy differences across gender and ethnicity when pairing faces with audio, with implications for fairness in AI systems.

0 favorites 0 likes

#gender-bias

EquiSumm : A Gender Bias-Aware Framework for Inclusive Tweet Summarization

arXiv cs.CL ↗ · 2026-05-25 Cached

Proposes EquiSumm, a gender bias-aware framework for inclusive tweet summarization that ensures representation of opinions from different gender groups, addressing demographic fairness in automated summarization.

0 favorites 0 likes

#gender-bias

Mechanics of Bias and Reasoning: Interpreting the Impact of Chain-of-Thought Prompting on Gender Bias in LLMs

arXiv cs.CL ↗ · 2026-05-21 Cached

This paper investigates how chain-of-thought prompting affects gender bias in large language models, finding that it does not consistently reduce bias and that apparent improvements stem from superficial compliance rather than genuine understanding.

0 favorites 0 likes

#gender-bias

AI generated identical resumes for a man and a woman: Hers was more likely to be labeled "weak," while his got a 97% approval rating

Reddit r/ArtificialInteligence ↗ · 2026-05-11 Cached

A study found that identical AI-generated resumes for a man and a woman received significantly different evaluations, with the woman's CV more likely to be doubted for competence and trustworthiness. This reflects broader gender biases in AI usage perceptions and may exacerbate the AI adoption gap.

0 favorites 0 likes

gender-bias

Submit Feedback