token-level

#token-level

ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate

arXiv cs.LG · 2d ago

This paper identifies a structural failure mode in token-level credit assignment for LLM reinforcement learning with LoRA: the intrinsic token signals degenerate. It proposes Adapter-Residual Credit Assignment (ARCA), which derives token salience from adapter hidden-state residuals and remains competitive with baselines.


#token-level

RAGognizer: Hallucination-Aware Fine-Tuning via Detection Head Integration

arXiv cs.CL · 2026-04-20

RAGognizer is a hallucination-aware fine-tuning approach that integrates a lightweight detection head into LLMs and jointly optimizes language modeling and hallucination detection in RAG systems. The paper also presents RAGognize, a dataset of naturally occurring closed-domain hallucinations with token-level annotations, and reports state-of-the-art hallucination detection along with reduced hallucination rates and no degradation in language quality.

