wikipedia-co-occurrence

Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering

Hugging Face Daily Papers · 2026-05-28

CorVer is a lightweight, corpus-grounded reward mechanism for reinforcement learning in factual question answering. It uses Wikipedia co-occurrence statistics to provide efficient sentence-level feedback, outperforming neural verifiers while training 4.8x to 8.4x faster.
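To make the idea of a co-occurrence-based sentence-level reward concrete, here is a minimal toy sketch in Python. It is not the CorVer implementation: the summary above does not specify how co-occurrence statistics are computed or turned into a reward, so the scoring rule (the fraction of entity pairs in a sentence that also appear together in some corpus passage), the substring-based entity matching, the period-based sentence splitting, and all function names (`find_entities`, `build_cooccurrence`, `sentence_reward`, `answer_rewards`) are assumptions made purely for illustration.

```python
from itertools import combinations


def find_entities(text, entities):
    """Return the known entities mentioned in text, in sorted order.

    Assumption: entities are detected by case-insensitive substring match,
    a stand-in for whatever entity linking a real system would use.
    """
    lowered = text.lower()
    return sorted(e for e in entities if e.lower() in lowered)


def build_cooccurrence(corpus, entities):
    """Count how many corpus passages mention each unordered entity pair."""
    counts = {}
    for passage in corpus:
        # find_entities returns a sorted list, so each pair has a stable order.
        for pair in combinations(find_entities(passage, entities), 2):
            counts[pair] = counts.get(pair, 0) + 1
    return counts


def sentence_reward(sentence, counts, entities):
    """Fraction of entity pairs in the sentence that co-occur in the corpus."""
    pairs = list(combinations(find_entities(sentence, entities), 2))
    if not pairs:
        # No checkable pair: return 0.0 (an arbitrary choice for this sketch).
        return 0.0
    supported = sum(1 for pair in pairs if counts.get(pair, 0) > 0)
    return supported / len(pairs)


def answer_rewards(answer, counts, entities):
    """Score each sentence of an answer separately (naive split on periods)."""
    sentences = [s.strip() for s in answer.split(".") if s.strip()]
    return [sentence_reward(s, counts, entities) for s in sentences]
```

With a hypothetical two-passage corpus such as `["Paris is the capital of France.", "Berlin is the capital of Germany."]` and the entity list `["Paris", "France", "Berlin", "Germany"]`, the answer `"Paris is in France. Berlin is in France"` would receive one reward per sentence, so a reinforcement learning loop could credit the supported sentence and penalize the unsupported one. A lookup of this kind needs only precomputed counts rather than a forward pass through a verifier model, which is one plausible reading of why a corpus-grounded reward could be cheaper than a neural verifier.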
