Tag
This paper proposes RoPoLL, a robust panel of LLM judges that replaces standard averaging with geometric median aggregation to handle biased contamination from individual judges, providing theoretical guarantees and empirical gains over standard PoLL.