parametric-knowledge

#parametric-knowledge

Decoupled Mixture-of-Experts for Parametric Knowledge Injection

arXiv cs.CL ↗ · 3d ago Cached

Decoupled Mixture-of-Experts (DMoE) proposes a modular architecture for parametric knowledge injection, decoupling experts and router from the base model to enable efficient auto-regressive inference and mitigate catastrophic forgetting.

0 favorites 0 likes

#parametric-knowledge

ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs

arXiv cs.AI ↗ · 6d ago Cached

ToolSense is an open-source diagnostic framework that generates three benchmarks (realistic retrieval, MCQ probing, QA probing) to audit LLMs' parametric tool knowledge, revealing a knowledge-retrieval dissociation where strong retrieval performance can coexist with poor factual understanding.

0 favorites 0 likes

#parametric-knowledge

Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper investigates whether reinforcement learning can improve the direct recall of parametric knowledge in LLMs beyond reasoning tasks. It demonstrates that RL with binary rewards yields significant gains in factual QA benchmarks by redistributing probability mass to unlock latent knowledge rather than acquiring new facts.

0 favorites 0 likes

parametric-knowledge

Decoupled Mixture-of-Experts for Parametric Knowledge Injection

ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs

Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs

Submit Feedback