TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

arXiv cs.AI 06/11/26, 04:00 AM Papers

deep-search tree-structured trial-and-error branch-and-return web-search agent

Summary

TreeSeeker is an inference-time framework that organizes deep search as branch-and-return over tree-structured states, using textual UCB signals to balance exploitation, exploration, and pruning. It outperforms strong baselines on deep search benchmarks, showing that explicit branch-and-return control improves multi-step web search.

arXiv:2606.11662v1 Announce Type: new Abstract: Deep search requires agents to answer complex questions through multi-step web search, browsing, evidence comparison, and synthesis. A central challenge is deciding how to search when several directions look plausible but only some will later lead to reliable evidence. If an agent greedily follows the current best-looking direction, it may keep extending a weak continuation. If it explores without discipline, it may waste budget on disconnected trials. We propose TreeSeeker, an inference-time framework for controlled trial-and-error in deep search. TreeSeeker organizes search as branch-and-return search over tree-structured states, where each branch is a tentative direction for a sub-goal. At each round, TreeSearch reads all sub-goal trees, identifies active goals, and uses textual UCB signals of value, uncertainty, and risk to select among exploiting a promising branch, exploring an uncertain alternative, or pruning an unproductive continuation and returning to an earlier branch point. TreeMem supports this control loop by keeping evidence, uncertainty, conflicts, progress, and failure cues attached to the branches that produced them, so trial outcomes can guide later decisions. Experiments on XBench-DeepSearch, BrowseComp, and BrowseComp-ZH show that TreeSeeker consistently outperforms strong open-source baselines, suggesting that explicit branch-and-return control complements stronger reasoning and tool execution.

Original Article

View Cached Full Text

Cached at: 06/11/26, 01:48 PM

# TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search
Source: [https://arxiv.org/abs/2606.11662](https://arxiv.org/abs/2606.11662)
Authors:[Zhuofan Shi](https://arxiv.org/search/cs?searchtype=author&query=Shi,+Z),[Mingzhe Ma](https://arxiv.org/search/cs?searchtype=author&query=Ma,+M),[Lu Wang](https://arxiv.org/search/cs?searchtype=author&query=Wang,+L),[Fangkai Yang](https://arxiv.org/search/cs?searchtype=author&query=Yang,+F),[Pu Zhao](https://arxiv.org/search/cs?searchtype=author&query=Zhao,+P),[Yiming Guan](https://arxiv.org/search/cs?searchtype=author&query=Guan,+Y),[Youling Huang](https://arxiv.org/search/cs?searchtype=author&query=Huang,+Y),[Wei Zhang](https://arxiv.org/search/cs?searchtype=author&query=Zhang,+W),[Qingwei Lin](https://arxiv.org/search/cs?searchtype=author&query=Lin,+Q),[Dongmei Zhang](https://arxiv.org/search/cs?searchtype=author&query=Zhang,+D),[Saravan Rajmohan](https://arxiv.org/search/cs?searchtype=author&query=Rajmohan,+S)

[View PDF](https://arxiv.org/pdf/2606.11662)

> Abstract:Deep search requires agents to answer complex questions through multi\-step web search, browsing, evidence comparison, and synthesis\. A central challenge is deciding how to search when several directions look plausible but only some will later lead to reliable evidence\. If an agent greedily follows the current best\-looking direction, it may keep extending a weak continuation\. If it explores without discipline, it may waste budget on disconnected trials\. We propose TreeSeeker, an inference\-time framework for controlled trial\-and\-error in deep search\. TreeSeeker organizes search as branch\-and\-return search over tree\-structured states, where each branch is a tentative direction for a sub\-goal\. At each round, TreeSearch reads all sub\-goal trees, identifies active goals, and uses textual UCB signals of value, uncertainty, and risk to select among exploiting a promising branch, exploring an uncertain alternative, or pruning an unproductive continuation and returning to an earlier branch point\. TreeMem supports this control loop by keeping evidence, uncertainty, conflicts, progress, and failure cues attached to the branches that produced them, so trial outcomes can guide later decisions\. Experiments on XBench\-DeepSearch, BrowseComp, and BrowseComp\-ZH show that TreeSeeker consistently outperforms strong open\-source baselines, suggesting that explicit branch\-and\-return control complements stronger reasoning and tool execution\.

## Submission history

From: Mingzhe Ma \[[view email](https://arxiv.org/show-email/758d8abd/2606.11662)\] **\[v1\]**Wed, 10 Jun 2026 05:10:19 UTC \(951 KB\)

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

Similar Articles

Structure-Induced Information for Rerooting Levin Tree Search

LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories

Arbor: Tree Search as a Cognition Layer for Autonomous Agents

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

@tom_doerr: Trains deep search agents from knowledge graphs https://github.com/THUDM/DeepDive

Submit Feedback

Similar Articles

Structure-Induced Information for Rerooting Levin Tree Search

LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories

Arbor: Tree Search as a Cognition Layer for Autonomous Agents

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

@tom_doerr: Trains deep search agents from knowledge graphs https://github.com/THUDM/DeepDive