asymmetry

#asymmetry

Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

arXiv cs.CL ↗ · yesterday Cached

This paper tests the assumption that LLMs judge better than they generate in in-context QA, finding generation accuracy exceeds self-evaluation on most benchmarks, with evaluation attending less to context. The findings challenge core assumptions in self-evaluation pipelines.

0 favorites 0 likes

#asymmetry

Hubness, Not Anisotropy, Drives Cross-Lingual Retrieval Asymmetry in Multilingual Embedding Models

arXiv cs.CL ↗ · 2026-05-27 Cached

This paper investigates the cause of cross-lingual retrieval asymmetry in multilingual embedding models. The authors propose and test the hub-mediation hypothesis, finding that hubness, not anisotropy, is the dominant cause, and recommend using CSLS instead of cosine similarity.

0 favorites 0 likes

asymmetry

Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

Hubness, Not Anisotropy, Drives Cross-Lingual Retrieval Asymmetry in Multilingual Embedding Models

Submit Feedback