negative-samples

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Hugging Face Daily Papers · 2026-05-01

This paper introduces ResRL, a method for improving LLM reasoning that uses negative sample projection to decouple the semantic distributions of positive and negative responses. It aims to preserve generation diversity while improving performance across multiple benchmarks.
