cultural-competence

#cultural-competence

CCBENCH: Assessing LLM Cultural Competence via Implicitly Signaled Norms using Health Queries

arXiv cs.CL · 2026-07-08

Introduces CCBench, a framework for evaluating LLMs' cultural competence via health queries with personas across six cultures. It finds that only 20-30% of responses from even top models are culturally appropriate.

#cultural-competence

LLMs Infer Cultural Context but Fail to Apply It When Responding

arXiv cs.CL · 2026-06-17

This paper introduces CAPRI, a dataset to evaluate whether LLMs can infer a user's cultural background from conversational cues and adapt their responses (e.g., using appropriate measurement units). Experiments show LLMs can infer cultural context but often fail to apply it unless explicitly prompted.

#cultural-competence

CulturALL: Benchmarking Multilingual and Multicultural Competence of LLMs on Grounded Tasks

arXiv cs.CL · 2026-04-22

CulturALL is a 2,610-sample benchmark spanning 14 languages and 51 regions that evaluates LLMs on real-world, culturally grounded tasks. The top model scores only 44.48%, leaving substantial room for improvement.

#cultural-competence

x1: Learning to Think Adaptively Across Languages and Cultures

arXiv cs.CL · 2026-04-21

Researchers introduce x1, a family of reasoning models that adaptively select optimal languages for reasoning on a per-instance basis, demonstrating that language choice impacts reasoning quality in multilingual and cultural tasks.
