large-reasoning-models

#large-reasoning-models

PolitNuggets: Benchmarking Agentic Discovery of Long-Tail Political Facts

arXiv cs.AI ↗ · 2d ago Cached

PolitNuggets is a multilingual benchmark for evaluating large reasoning models within agentic frameworks on their ability to discover and synthesize long-tail political facts by constructing biographies for 400 global elites. The benchmark introduces evaluation protocols like FactNet and reveals that current systems struggle with fine-grained details and efficiency.

0 favorites 0 likes

#large-reasoning-models

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper investigates safety failures in Large Reasoning Models where harmful content appears in reasoning traces despite safe final answers, proposing an adaptive multi-principle steering method to mitigate these risks.

0 favorites 0 likes

#large-reasoning-models

CiPO: Counterfactual Unlearning for Large Reasoning Models through Iterative Preference Optimization

arXiv cs.CL ↗ · 2026-04-20 Cached

CiPO is a novel framework for machine unlearning in Large Reasoning Models that uses iterative preference optimization with counterfactual reasoning traces to selectively remove unwanted knowledge while preserving reasoning abilities. The method addresses the challenge of unlearning in models that rely on chain-of-thought reasoning by generating logically valid alternative reasoning paths during training.

0 favorites 0 likes

large-reasoning-models

PolitNuggets: Benchmarking Agentic Discovery of Long-Tail Political Facts

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

CiPO: Counterfactual Unlearning for Large Reasoning Models through Iterative Preference Optimization

Submit Feedback