semantic-similarity

#semantic-similarity

On the Persistent Effects of Lexicality in Large Language Mod

arXiv cs.CL ↗ · 2d ago Cached

This paper investigates how lexical overlap, rather than semantic content, influences LLM representations across layers and architectures, and demonstrates that this lexical effect persists even in models trained for semantic similarity, leading to degraded performance on downstream tasks.

0 favorites 0 likes

#semantic-similarity

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation

arXiv cs.LG ↗ · 4d ago Cached

This paper introduces bounded behavioral indistinguishability, a formal framework for evaluating black-box LLM distillation beyond semantic similarity. Experiments on Qwen and Llama models show that distillation reduces but does not eliminate adversarial distinguishability, highlighting the need for category-aware evaluation.

0 favorites 0 likes

#semantic-similarity

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Hugging Face Daily Papers ↗ · 5d ago Cached

OmniOPD introduces a logit-free on-policy distillation method that uses chunk-level semantic similarity and speculative verification to train student models with black-box teachers, achieving up to +28.64% improvement on math benchmarks over standard OPD.

0 favorites 0 likes

#semantic-similarity

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring

arXiv cs.CL ↗ · 2026-04-22 Cached

Researchers from PNNL and Washington University introduce a systematic framework to test how five LLMs detect subtle semantic changes in documents, revealing positional bias, context coherence effects, and model-specific scoring fingerprints.

0 favorites 0 likes

semantic-similarity

On the Persistent Effects of Lexicality in Large Language Mod

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring

Submit Feedback