co-evolutionary-training

Self-Evolving Deep Research via Joint Generation and Evaluation

arXiv cs.CL · 6d ago

Researchers from HKUST, ByteDance, and UCL propose SCORE, a co-evolutionary training framework that jointly trains an LLM as both a deep research report generator and an evaluator, using a meta-harness to dynamically adjust evaluation difficulty and prevent reward saturation. Experiments show consistent improvement in open-ended research report quality.
