deep-search

#deep-search

Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning

arXiv cs.AI · 2026-06-16

Visual-Seeker proposes a visual-native multimodal deep search agent that actively reasons over fine-grained visual details and synthesizes multimodal evidence, achieving state-of-the-art performance on five challenging multimodal search benchmarks.

#deep-search

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

arXiv cs.AI · 2026-06-11

TreeSeeker is an inference-time framework that organizes deep search as branch-and-return over tree-structured states, using textual UCB signals to balance exploitation, exploration, and pruning. It outperforms strong baselines on deep search benchmarks, showing that explicit branch-and-return control improves multi-step web search.
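
To make the exploitation/exploration trade-off mentioned above concrete, here is a minimal sketch of the classic numeric UCB1 rule. It is not TreeSeeker's method: the summary describes textual UCB signals, whose exact form is not given here. The branch names, the reward values, and the helpers `ucb1` and `select_branch` are hypothetical.

```python
import math


def ucb1(mean_reward, visits, parent_visits, c=math.sqrt(2)):
    """Classic UCB1 score: average reward plus an exploration bonus.

    Unvisited branches get infinite priority so each is tried at least once.
    """
    if visits == 0:
        return math.inf
    return mean_reward + c * math.sqrt(math.log(parent_visits) / visits)


def select_branch(branches, parent_visits):
    """Return the name of the branch with the highest UCB1 score."""
    return max(
        branches,
        key=lambda name: ucb1(
            branches[name]["mean"], branches[name]["visits"], parent_visits
        ),
    )


# Hypothetical search-tree state: two candidate next steps under one parent.
branches = {
    "refine_query": {"mean": 0.6, "visits": 8},
    "open_new_source": {"mean": 0.4, "visits": 1},
}
print(select_branch(branches, parent_visits=9))  # prints "open_new_source"
```

The rarely visited branch wins despite its lower average reward, because its exploration bonus is still large. The bonus shrinks as the visit count grows.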

#deep-search

@DanKornas: DeepDive is a pattern for deep search agents: synthesize QA from knowledge graphs, then train multi-turn browsing with …

X AI KOLs Timeline · 2026-05-16

DeepDive is a pattern for building deep search agents that synthesizes QA from knowledge graphs and trains multi-turn browsing with reinforcement learning (GRPO). It includes entity obfuscation and test-time scaling with tool calls.
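
GRPO scores each sampled rollout relative to the other rollouts drawn for the same prompt. The sketch below shows only that group-relative advantage step, under assumptions: the reward values are made up, `group_relative_advantages` is a hypothetical helper, and the population standard deviation is used (implementations differ on this). It is not DeepDive's training code.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each rollout's reward by its group's mean and std deviation.

    `eps` avoids division by zero when every rollout gets the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; some code uses sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four browsing rollouts on the same question
# (1.0 = final answer correct, 0.0 = incorrect).
rewards = [1.0, 0.0, 0.0, 1.0]
print(group_relative_advantages(rewards))
```

Rollouts that beat the group average get a positive advantage and the rest get a negative one, so no separate value model is needed to form a baseline.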

#deep-search

@tom_doerr: Trains deep search agents from knowledge graphs https://github.com/THUDM/DeepDive

X AI KOLs Timeline · 2026-05-16

DeepDive presents an automated approach to training deep search agents using knowledge graphs for data synthesis and multi-turn reinforcement learning, enabling complex multi-step reasoning and web browsing.

#deep-search

Scaling Retrieval-Augmented Reasoning with Parallel Search and Explicit Merging

arXiv cs.AI · 2026-05-14

This paper introduces MultiSearch, an RL-based framework that generates multiple queries at each reasoning step and explicitly merges the retrieved information to improve signal-to-noise ratio and reasoning accuracy in question-answering tasks.
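
The general shape of issuing several queries in one step and merging the results can be sketched as follows. This is an illustration under stated assumptions, not MultiSearch itself: the in-memory `CORPUS`, the `toy_search` retriever, and the vote-count merge are all hypothetical stand-ins, and the paper's merging is produced by the RL-trained model, not by a fixed heuristic.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical three-document corpus standing in for a real search backend.
CORPUS = {
    "d1": "marie curie won the nobel prize in physics in 1903",
    "d2": "marie curie won the nobel prize in chemistry in 1911",
    "d3": "the eiffel tower is in paris",
}


def toy_search(query):
    """Hypothetical retriever: ids of documents containing every query term."""
    terms = query.lower().split()
    return [
        doc_id
        for doc_id, text in CORPUS.items()
        if all(term in text.split() for term in terms)
    ]


def parallel_search_and_merge(queries):
    """Run all queries concurrently, then rank documents by query agreement."""
    with ThreadPoolExecutor() as pool:
        result_lists = list(pool.map(toy_search, queries))
    # Count how many distinct queries returned each document.
    votes = Counter(doc_id for results in result_lists for doc_id in set(results))
    # Most-agreed-upon documents first; ties broken by id for determinism.
    return sorted(votes, key=lambda doc_id: (-votes[doc_id], doc_id))


print(parallel_search_and_merge(["curie nobel", "curie chemistry", "nobel 1911"]))
# prints ['d2', 'd1']
```

Documents returned by several queries rise to the top, and documents returned by none are dropped. This is one simple way to raise the signal-to-noise ratio of the merged context.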

#deep-search

The new AI-powered Google Finance is expanding to Europe.

Google AI Blog · 2026-05-11

Google is expanding its new AI-powered Google Finance service to Europe, featuring enhanced AI research, advanced charting visualizations, and live earnings insights with local language support.

#deep-search

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Hugging Face Daily Papers · 2026-05-11

This paper introduces On-Policy Data Evolution (ODE) and a visual-native agent harness to improve multimodal deep search agents. By enabling reusable visual evidence and closed-loop data generation, ODE significantly boosts the performance of Qwen3-VL agents across multiple benchmarks, surpassing Gemini 2.5 Pro.

#deep-search

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Hugging Face Daily Papers · 2026-05-06

OpenSearch-VL is an open-source recipe, released with an accompanying paper, for training frontier multimodal search agents with reinforcement learning. It features specialized data curation and a novel training algorithm.
