active-reasoning

#active-reasoning

Active-GRPO: Adaptive Imitation and Self-Improving Reasoning for Molecular Optimization

arXiv cs.LG ↗ · 3d ago Cached

Active-GRPO introduces an adaptive imitation and self-improving reasoning framework that dynamically decides when to imitate references and when to reinforce the model's own discoveries for molecular optimization, achieving statistically significant improvements over previous methods on the TOMG-Bench-MolOpt benchmark.

0 favorites 0 likes

#active-reasoning

BALAR : A Bayesian Agentic Loop for Active Reasoning

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces BALAR, a training-free Bayesian agentic loop algorithm that enables large language models to actively reason and ask clarifying questions in multi-turn interactions. It demonstrates significant performance improvements over baselines on detective, puzzle, and clinical diagnosis benchmarks.

0 favorites 0 likes

active-reasoning

Active-GRPO: Adaptive Imitation and Self-Improving Reasoning for Molecular Optimization

BALAR : A Bayesian Agentic Loop for Active Reasoning

Submit Feedback