Tag
This critical survey examines the Annotation Scarcity Paradox in low-resource NLP evaluation, where rapid model scaling outpaces the human infrastructure needed for authentic evaluation, and discusses emerging responses with equity and validity trade-offs.