DeepRefine: Agent-Compiled Knowledge Refinement via Reinforcement Learning

Hugging Face Daily Papers

Summary

DeepRefine is a research paper introducing an LLM-based reasoning model that refines agent-compiled knowledge bases using reinforcement learning and multi-turn interactions to improve downstream task performance.

Agent-compiled knowledge bases provide persistent external knowledge for large language model (LLM) agents in open-ended, knowledge-intensive downstream tasks. Yet their quality is systematically limited by incompleteness, incorrectness, and redundancy, manifested as missing evidence or cross-document links, low-confidence or imprecise claims, and ambiguity or coreference-resolution issues. Such defects compound under iterative use, degrading retrieval fidelity and downstream task performance. We present DeepRefine, a general LLM-based reasoning model for agent-compiled knowledge refinement that improves the quality of any pre-constructed knowledge base using user queries, making it more suitable for downstream tasks. DeepRefine performs multi-turn interactions with the knowledge base, conducts abductive diagnosis over the interaction history, localizes likely defects, and executes targeted refinement actions for incremental knowledge base updates. To optimize DeepRefine's refinement policies without gold references, we introduce a Gain-Beyond-Draft (GBD) reward and train the reasoning process end-to-end via reinforcement learning. Extensive experiments demonstrate consistent downstream gains over strong baselines.
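The loop the abstract describes — multi-turn interaction with the knowledge base, abductive diagnosis over the interaction history, targeted refinement actions, and a Gain-Beyond-Draft reward computed against the unrefined draft — can be sketched as a toy example. This is a minimal illustration under assumed interfaces: `diagnose`, `refine`, `score`, and the defect types are hypothetical stand-ins, not the paper's actual API; in the real system the policy is an LLM trained with reinforcement learning, and `score` would be a downstream task evaluator.

```python
# Toy sketch of a DeepRefine-style refine loop and GBD reward.
# All names below are illustrative assumptions, not the paper's API.

def diagnose(kb, query, history):
    """Abductive diagnosis: pick the defect that best explains a failure."""
    if query not in kb:
        return ("add_missing", query)          # incompleteness
    claims = kb[query]
    if len(set(claims)) < len(claims):
        return ("deduplicate", query)          # redundancy
    return (None, None)

def refine(kb, action, key, evidence):
    """Targeted, incremental knowledge base update."""
    kb = dict(kb)  # keep updates incremental; leave the input KB intact
    if action == "add_missing":
        kb[key] = [evidence]
    elif action == "deduplicate":
        kb[key] = list(dict.fromkeys(kb[key]))  # drop duplicate claims
    return kb

def score(kb, queries):
    """Downstream proxy: fraction of queries the KB can answer."""
    return sum(q in kb for q in queries) / len(queries)

def gain_beyond_draft(draft_kb, refined_kb, queries):
    """GBD-style reward: downstream gain of the refined KB over the draft."""
    return score(refined_kb, queries) - score(draft_kb, queries)

# A draft KB with one redundant entry and one missing entry.
draft = {"capital_of_france": ["Paris", "Paris"]}
queries = ["capital_of_france", "capital_of_japan"]

kb, history = dict(draft), []
for turn in range(2):                # multi-turn interaction budget
    for q in queries:
        action, key = diagnose(kb, q, history)
        if action:
            kb = refine(kb, action, key, evidence=f"evidence for {key}")
            history.append((turn, q, action))

reward = gain_beyond_draft(draft, kb, queries)
print(reward)  # → 0.5 (coverage rose from 1/2 to 2/2)
```

The key design point the sketch mirrors is that the reward needs no gold reference: it only compares downstream performance of the refined knowledge base against the draft it started from, so any measurable downstream gain can drive the RL signal.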

Cached at: 05/13/26, 12:20 AM


Source: https://huggingface.co/papers/2605.10488



Similar Articles

Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs

arXiv cs.CL

This paper investigates whether reinforcement learning can improve the direct recall of parametric knowledge in LLMs beyond reasoning tasks. It demonstrates that RL with binary rewards yields significant gains in factual QA benchmarks by redistributing probability mass to unlock latent knowledge rather than acquiring new facts.

Deep Reasoning in General Purpose Agents via Structured Meta-Cognition

arXiv cs.CL

This paper introduces Deep Reasoning, an inference-time approach that uses structured meta-reasoning to construct task-specific scaffolds for general-purpose agents. The proposed agent, Dolores, outperforms existing methods by distributing cognition across lower-load reasoning threads, reducing hallucinations and improving performance across multiple benchmarks.

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Hugging Face Daily Papers

This paper introduces RubricEM, a reinforcement learning framework that uses rubric-guided policy decomposition and reflection-based meta-policy evolution to train deep research agents for long-form tasks. The resulting RubricEM-8B model demonstrates strong performance on long-form research benchmarks by leveraging stage-aware planning and denser semantic feedback.