construct-validity

#construct-validity

Coordinates of Capability: A Unified MTMM-Geometric Framework for LLM Evaluation

arXiv cs.CL ↗ · 6d ago Cached

This Systematization of Knowledge paper proposes a unified Multi-Trait Multi-Method (MTMM) geometric framework for evaluating Large Language Models, unifying disparate metrics into a shared latent coordinate space to address construct validity issues in current benchmarks.

0 favorites 0 likes

#construct-validity

The Proxy Presumption: From Semantic Embeddings to Valid Social Measures

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper critiques the 'Proxy Presumption' in NLP, where geometric embedding properties are incorrectly equated with social constructs. It introduces the Construct Validity Protocol and Counterfactual Neutralization methods to ensure rigorous validation of social measures derived from semantic embeddings.

0 favorites 0 likes

construct-validity

Coordinates of Capability: A Unified MTMM-Geometric Framework for LLM Evaluation

The Proxy Presumption: From Semantic Embeddings to Valid Social Measures

Submit Feedback