negative-samples

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Hugging Face Daily Papers · 2026-05-01

This paper introduces ResRL, a method to boost LLM reasoning by decoupling semantic distributions between positive and negative responses through negative sample projection. It aims to maintain generation diversity while improving performance on various benchmarks.
