unsupervised

#unsupervised

Layer-Resolved Optimal Transport for Hallucination Detection in NMT and Abstractive Summarization

arXiv cs.CL ↗ · 4d ago Cached

This paper extends optimal transport-based hallucination detection to all decoder layers in NMT and abstractive summarization, finding that detection is concentrated in early layers and that the geometric signal transfers poorly to summarization due to faithfulness failures not detectable via attention concentration.

0 favorites 0 likes

#unsupervised

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

arXiv cs.LG ↗ · 6d ago Cached

This paper proposes Global-Local Uncertainty (GLU), an unsupervised single-pass score that fuses token-level local entropy with hidden-state geometric global entropy for uncertainty quantification in LLMs, showing that the two are near-orthogonal and together capture confident-but-wrong failures.

0 favorites 0 likes

#unsupervised

Unsupervised Process Reward Models

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

This paper proposes unsupervised Process Reward Models (uPRM) that eliminate the need for human annotations by using LLM next-token probabilities to identify erroneous reasoning steps, achieving up to 15% accuracy improvements over LLM-as-a-Judge and performing comparably to supervised PRMs as verifiers and reward signals.

0 favorites 0 likes

#unsupervised

Logic-Regularized Verifier Elicits Reasoning from LLMs

arXiv cs.CL ↗ · 2026-05-08 Cached

Introduces LoVer, an unsupervised verifier that uses logical rules (negation consistency, intra-group and inter-group consistency) to improve LLM reasoning without labeled data, achieving performance close to supervised verifiers on reasoning benchmarks.

0 favorites 0 likes

#unsupervised

My Unsupervised Compliance Layer Project

Reddit r/artificial ↗ · 2026-04-22

A developer built an unsupervised multi-agent pipeline that lets Claude and GPT-4 autonomously prep and host a podcast, including scouting topics, planning episodes, and conversing for 10 rounds before text-to-speech output.

0 favorites 0 likes

unsupervised

Layer-Resolved Optimal Transport for Hallucination Detection in NMT and Abstractive Summarization

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

Unsupervised Process Reward Models

Logic-Regularized Verifier Elicits Reasoning from LLMs

My Unsupervised Compliance Layer Project

Submit Feedback