evidence-selection

Tag

Cards List
#evidence-selection

Reinforcing Recursive Language Models (18 minute read)

TLDR AI · 6d ago Cached

The article explores reinforcement learning fine-tuning of small (4B) recursive language models (RLMs) to perform evidence selection from scientific documents, showing that RL-trained 4B models match Claude Sonnet 4.6 performance at a fraction of the size and cost.

0 favorites 0 likes
#evidence-selection

AdaGATE: Adaptive Gap-Aware Token-Efficient Evidence Assembly for Multi-Hop Retrieval-Augmented Generation

arXiv cs.CL · 2026-05-08 Cached

AdaGATE is a training-free evidence controller for multi-hop RAG that uses entity-centric gap tracking, micro-query generation, and utility-based selection to improve robustness under noisy retrieval, achieving state-of-the-art evidence F1 with fewer input tokens.

0 favorites 0 likes
← Back to home

Submit Feedback