Tag
This paper investigates how LLMs' internal priors affect zero-shot annotation performance, finding that nearly two-thirds of errors resist prompt-based correction and introducing Definition-Specific Familiarity as a better predictor than memorization metrics.
A reflective post on how quickly opinions in AI change, cautioning against rigid beliefs and emphasizing adaptability over perfect prediction.