Deliberate Evolution: Agentic Reasoning for Sample-Efficient Symbolic Regression with LLMs

arXiv cs.CL Papers

Summary

Deliberate Evolution (DE) is an agentic framework that improves LLM-based symbolic regression by decoupling candidate generation from search control, using adaptive operators, structural diagnosis tools, and reflective memory to achieve better results with only 40% of the standard sample budget.

arXiv:2606.04360v1 Announce Type: new Abstract: Symbolic regression (SR) discovers compact mathematical expressions from data, yet recent LLM-based evolutionary methods remain sample-inefficient because they rely mainly on scalar feedback such as MSE. We identify a core limitation: existing methods conflate candidate proposal with search guidance, requiring the LLM to infer how to evolve an expression, diagnose its errors, and reuse past experience from a single score. To address this, we propose Deliberate Evolution (DE), an agentic framework that decouples symbolic generation from search control. DE guides LLM proposals with adaptive operators for search direction, analytical tools for structural diagnosis, and reflective memory for trajectory-level experience. Experiments on LLM-SRBench show that DE consistently outperforms representative LLM-based SR baselines across diverse scientific domains while using only 40% of the standard sample budget.
Original Article
View Cached Full Text

Cached at: 06/05/26, 02:13 AM

# Deliberate Evolution: Agentic Reasoning for Sample-Efficient Symbolic Regression with LLMs
Source: [https://arxiv.org/abs/2606.04360](https://arxiv.org/abs/2606.04360)
[View PDF](https://arxiv.org/pdf/2606.04360)

> Abstract:Symbolic regression \(SR\) discovers compact mathematical expressions from data, yet recent LLM\-based evolutionary methods remain sample\-inefficient because they rely mainly on scalar feedback such as MSE\. We identify a core limitation: existing methods conflate candidate proposal with search guidance, requiring the LLM to infer how to evolve an expression, diagnose its errors, and reuse past experience from a single score\. To address this, we propose Deliberate Evolution \(DE\), an agentic framework that decouples symbolic generation from search control\. DE guides LLM proposals with adaptive operators for search direction, analytical tools for structural diagnosis, and reflective memory for trajectory\-level experience\. Experiments on LLM\-SRBench show that DE consistently outperforms representative LLM\-based SR baselines across diverse scientific domains while using only 40% of the standard sample budget\.

## Submission history

From: Xinyu Pang \[[view email](https://arxiv.org/show-email/4ae232aa/2606.04360)\] **\[v1\]**Wed, 3 Jun 2026 02:22:16 UTC \(4,845 KB\)

Similar Articles

CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution

arXiv cs.CL

CoEvolve proposes an agent-data mutual evolution framework for training LLM agents through closed-loop, interaction-driven learning that adapts both the agent and its training data distribution. The method extracts feedback signals from rollout trajectories to guide LLM-based task synthesis, demonstrating significant improvements (15-19% absolute gains) across multiple Qwen models on AppWorld and BFCL benchmarks.