fairness

#fairness

Polar: A Benchmark for Evaluating Political Bias in LLMs

arXiv cs.CL ↗ · 2d ago Cached

Polar is a 4,026-instance multiple-choice benchmark for evaluating political bias in LLMs across U.S. and South Korean political contexts, measuring bias through option-level likelihoods. Experiments on 38 LLMs show systematic bias patterns varying by political context, issue category, and presentation language.

0 favorites 0 likes

#fairness

Toward Calibrated, Fair, and accurate Deepfake Detection

arXiv cs.LG ↗ · 4d ago Cached

Introduces Face-Fairness (FF), a plug-and-play framework for bias mitigation in deepfake detection, featuring Face-Feature Tuning (FFT) as the first demographic label-free fairness method that improves group accuracy and reduces performance gaps across demographics.

0 favorites 0 likes

#fairness

Speaker Group Encoding in Self-supervised Speech Recognition Models

arXiv cs.CL ↗ · 4d ago Cached

Investigates how self-supervised speech recognition models encode speaker group information (gender, age, dialect, ethnicity, native speaker status) across layers, and how finetuning for tasks like ASR or speaker identification affects this encoding.

0 favorites 0 likes

#fairness

Pareto-Guided Teacher Alignment for Fair Personalized Text Generation

arXiv cs.CL ↗ · 4d ago Cached

This paper introduces a Pareto-guided teacher alignment method for fair personalized text generation, aiming to balance multiple objectives in language model outputs.

0 favorites 0 likes

#fairness

PAFO: Pareto Fairness Optimization for Personalized Reward Modeling

arXiv cs.AI ↗ · 5d ago Cached

This paper proposes PAFO, a Pareto fairness optimization framework to mitigate personalized reward bias in reward models for LLMs, improving accuracy for minority user groups without harming majority groups.

0 favorites 0 likes

#fairness

Stress-testing medical large language models reveals latent safety pathology beyond benchmark accuracy

arXiv cs.AI ↗ · 5d ago Cached

This paper introduces AI-MASLD, a stress-audit framework for medical LLMs that reveals how benchmark accuracy can hide serious safety failures, and demonstrates that open-weight models can match or exceed proprietary ones on safety dimensions.

0 favorites 0 likes

#fairness

Aquifer: Bounded Queues, Fairness, and Dynamic Pacing for AI Workloads

Reddit r/AI_Agents ↗ · 5d ago

Aquifer is an MCP runtime that provides bounded queues, fairness controls, and dynamic pacing to handle rate limits and traffic spikes in AI agent systems. It also introduces the Aqueduct Protocol for dynamic flow state communication.

0 favorites 0 likes

#fairness

Detecting and Mitigating Bias by Treating Fairness as a Symmetry Operation

arXiv cs.AI ↗ · 6d ago Cached

The paper proposes treating fairness as a symmetry operation in machine learning classifiers, implementing loss-based regularization to enforce invariance under swapping of sensitive attributes while holding merit features fixed. The framework achieves over 90% bias reduction with minimal accuracy loss and requires no causal graph knowledge.

0 favorites 0 likes

#fairness

Algorithmic Monocultures in Hiring

Hacker News Top ↗ · 6d ago Cached

This large-scale study of 3.4 million job applicants across 156 employers reveals that algorithmic monocultures in hiring algorithms from a single vendor cause racial disparities and systemic rejections, with 25.87% of Black applicants and 14.74% of Asian applicants adversely impacted.

0 favorites 0 likes

#fairness

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

Researchers from the University of Amsterdam propose a tabular reinforcement learning approach to the Metro Network Expansion Problem, showing it achieves comparable performance to Deep RL while reducing training episodes by 18x and carbon emissions by 12x on average. The method also incorporates social equity criteria and is evaluated on real-world metro networks in Xi'an and Amsterdam.

0 favorites 0 likes

#fairness

Effect of Demographic Bias on Skin Lesion Classification

arXiv cs.AI ↗ · 2026-06-03 Cached

This paper investigates the impact of demographic bias (sex and age) on skin lesion classification using ResNet models, finding that sex biases stem from data imbalances while age biases consistently favor younger groups, and evaluating multi-task and adversarial learning mitigation strategies.

0 favorites 0 likes

#fairness

Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers

arXiv cs.CL ↗ · 2026-06-03 Cached

This paper investigates how LLMs produce different outcomes based on conversational context, finding that topic, rather than explicit user demographics, is the primary driver of disparities in high-stakes scenarios like salary advice.

0 favorites 0 likes

#fairness

A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models

arXiv cs.CL ↗ · 2026-06-02 Cached

This paper presents a multi-domain red teaming framework for evaluating safety, robustness, and fairness of medical LLMs across 690 clinically grounded scenarios. Results show that high aggregate accuracy can mask critical failures, and hybrid evaluation with clinician oversight is necessary for credible safety assessment.

0 favorites 0 likes

#fairness

TrustLDM: Benchmarking Trustworthiness in Language Diffusion Models

arXiv cs.CL ↗ · 2026-06-02 Cached

Introduces TrustLDM, a comprehensive benchmark for evaluating safety, privacy, and fairness of Language Diffusion Models, revealing that their alignment degrades with malicious post contexts. Proposes an automatic evaluation framework, TrustLDM-Auto, to identify vulnerable configurations.

0 favorites 0 likes

#fairness

Neuron-Level Interventions for Gendered and Gender-Neutral Generation in Language Models

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper proposes a neuron-level intervention method to identify gender-specific neurons in language models (feminine, masculine, gender-neutral) and steer sentence generation toward a target gender form while preserving meaning, with experiments showing precise control and bias mitigation.

0 favorites 0 likes

#fairness

COFT: Counterfactual-Conformal Decoding for Fair Chain-of-Thought Reasoning in Large Language Models

arXiv cs.CL ↗ · 2026-06-01 Cached

COFT is a training-free decoding method that applies token-level fairness control and conformal calibration to reduce bias in chain-of-thought reasoning of large language models, achieving 30-55% bias reduction with minimal computational overhead.

0 favorites 0 likes

#fairness

Your Multimodal Speech Model Says I Have a Face for Radio

arXiv cs.CL ↗ · 2026-06-01 Cached

This paper presents the first bias evaluation of multimodal speech recognition models, finding significant accuracy differences across gender and ethnicity when pairing faces with audio, with implications for fairness in AI systems.

0 favorites 0 likes

#fairness

GPF-LiveNews: A Streaming Evaluation Protocol for Group-Conditioned Framing in Large Language Models

arXiv cs.CL ↗ · 2026-05-29 Cached

This paper introduces GPF-LiveNews, a streaming evaluation protocol for auditing how large language models frame live news events differently for various demographic groups, using semantic sensitivity and sentiment disparity measures across 42 identity labels and seven prompt families.

0 favorites 0 likes

#fairness

@WGOV: Algorithmic Monocultures in Hiring Rishi Bommasani, Sarah H. Bana, Kathleen A. Creel, Dan Jurafsky, Percy Liang https:/…

X AI KOLs Timeline ↗ · 2026-05-28 Cached

A research paper analyzing how algorithmic monoculture in hiring—where many employers use the same vendor's screening algorithms—leads to systematic rejection of the same individuals and racial groups, using a dataset of 3 million applicants.

0 favorites 0 likes

#fairness

Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

This paper presents a two-level autoresearch framework where an outer-loop AI agent autonomously optimizes inner-loop LLM policy-synthesis pipelines for multi-agent sequential social dilemmas, achieving superior performance and discovering objective-specific mechanisms like fairness under a maximin welfare objective.

0 favorites 0 likes

fairness

Submit Feedback