deep-research

#deep-research

@TowardMu: https://x.com/TowardMu/status/2069194694228431273

X AI KOLs Timeline ↗ · yesterday Cached

Introducing Apodex, a self-evolving heavy-duty solver that uses a verification-centric agent team architecture for in-depth research. It supports self-solving, evidence chain verification, and more. Currently in early access and completely free.

0 favorites 0 likes

#deep-research

ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research

arXiv cs.AI ↗ · 4d ago Cached

ScaffoldAgent introduces a utility-guided dynamic outline optimization framework for open-ended deep research, using expansion, contraction, and revision operations to improve long-form report generation and factual grounding.

0 favorites 0 likes

#deep-research

MetaResearcher: Scaling Deep Research via Self-Reflective Reinforcement Learning in Adversarial Virtual Environments

arXiv cs.AI ↗ · 4d ago Cached

MetaResearcher proposes a framework for training deep research agents using self-reflective reinforcement learning in adversarial virtual environments, addressing limitations of static environments and fact-retrieval-only tasks.

0 favorites 0 likes

#deep-research

Researchers trained a Deep Research agent with 32 H100s and open-sourced everything

Reddit r/LocalLLaMA ↗ · 5d ago

Researchers trained a Deep Research agent using 32 H100 GPUs and open-sourced all components, enabling community access and further development.

0 favorites 0 likes

#deep-research

MosaicLeaks: Can your research agent keep a secret?

Hugging Face Blog ↗ · 5d ago Cached

MosaicLeaks introduces a new benchmark for measuring privacy leakage in deep-research AI agents, showing that agents often leak private information through external queries and proposing a training method (PA-DR) to reduce leakage while improving task performance.

0 favorites 0 likes

#deep-research

Using AI to help physicians diagnose rare genetic diseases affecting children

Reddit r/singularity ↗ · 5d ago Cached

Researchers from Boston Children's Hospital, Harvard, and OpenAI used the OpenAI o3 Deep Research reasoning model to reanalyze 376 unsolved rare disease cases, leading to diagnoses in 18 additional cases (4.8% yield) after expert review and clinical confirmation. The study, published in NEJM AI, demonstrates how AI-assisted workflows can help experts revisit difficult cases as scientific knowledge evolves.

0 favorites 0 likes

#deep-research

@OpenAI: Rare disease diagnosis is challenging, as sequencing can surface millions of variants, and medical knowledge changes co…

X AI KOLs ↗ · 5d ago Cached

OpenAI highlights how o3 Deep Research can aid rare disease diagnosis by integrating clinical features, inheritance patterns, variant evidence, and scientific literature into actionable hypotheses for specialists.

0 favorites 0 likes

#deep-research

@sheriyuo: Best-of-N, rejection sampling, and rubric-based ranking all assume you already have a reliable way to evaluate candidat…

X AI KOLs Timeline ↗ · 6d ago Cached

Apodex releases Apodex-1.0, a deep-research model that uses a heavy-duty agent team with global verification, achieving state-of-the-art results on multiple benchmarks including BrowseComp, DeepSearchQA, and HLE.

0 favorites 0 likes

#deep-research

@Zesee: https://x.com/Zesee/status/2067512488665522216

X AI KOLs Timeline ↗ · 6d ago Cached

The article analyzes the problem of AI-generated writing that often appears correct but actually contains errors, and introduces a workflow using Deep Research tools (such as Apodex) to break down problems, find evidence, check risks, and finally write, helping creators improve content quality.

0 favorites 0 likes

#deep-research

@Ex0byt: A must bookmark.. tiny cracked team, 4 H100 nodes, open source 3 stage recipe, trained on 8k synthetic rubric tasks, fu…

X AI KOLs Timeline ↗ · 6d ago Cached

A small team trained a frontier-level Deep Research Agent on an academic budget using only 32 H100s and 8K synthetic samples, releasing fully open weights, code, and paper for models from 2B to 35B that match or beat closed frontier agents on key benchmarks.

0 favorites 0 likes

#deep-research

@KaiZhang_CS: Check out one of the best open-source search agents trained by @jianxie_ !! glad to see early experience methods work o…

X AI KOLs Timeline ↗ · 6d ago Cached

Yu Su's team trained a frontier Deep Research Agent on an academic budget using 8K synthetic samples and RL, releasing fully open training infrastructure and models from 2B to 35B parameters.

0 favorites 0 likes

#deep-research

@heyshrutimishra: Apodex 1.0 dropped and the architecture is genuinely different. It's post-trained on Qwen3.5 as a self-evolving system:…

X AI KOLs Following ↗ · 6d ago Cached

Apodex 1.0 is a self-evolving AI system post-trained on Qwen3.5, achieving SOTA on BrowseComp, DeepSearchQA, and HLE-text. Its 4B mini model outperforms 30B-class models, with an AgentOS runtime for task orchestration. Open weights available.

0 favorites 0 likes

#deep-research

Deep Research in Physical Sciences: A Multi-Agent Framework and Comprehensive Benchmark

Hugging Face Daily Papers ↗ · 2026-06-17 Cached

This paper introduces PhySciBench, a benchmark of 200 expert-curated questions for physical sciences, and DelveAgent, a multi-agent framework that improves accuracy and reduces inference costs compared to baselines like Gemini Deep Research.

0 favorites 0 likes

#deep-research

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

arXiv cs.CL ↗ · 2026-06-16 Cached

Introduces XBCP (Cross-lingual BrowseComp-Plus), a benchmark for evaluating deep research agents and retrievers in cross-lingual and multilingual settings. Results show significant performance degradation when evidence is in a different language from the query, highlighting both retrieval failures and agent-side difficulty in integrating language-mismatched evidence.

0 favorites 0 likes

#deep-research

S1-DeepResearch: Beyond Search, Toward Real-World Long-Horizon Research Agents

arXiv cs.AI ↗ · 2026-06-16 Cached

This paper introduces S1-DeepResearch-32B, an open-source model and 15K trajectory dataset for deep research agents, achieving state-of-the-art performance across 20 benchmarks by jointly modeling information acquisition, knowledge synthesis, and planning.

0 favorites 0 likes

#deep-research

Hybrid Open-Ended Tri-Evolution Makes Better Deep Researcher

arXiv cs.AI ↗ · 2026-06-15 Cached

This paper proposes the Hybrid Open-Ended Tri-Evolution (HOTE) framework, which uses hybrid-mode reinforcement learning to evolve a proposer, solver, and judge collaboratively for deep research tasks, achieving state-of-the-art results with an 8B model surpassing larger static models.

0 favorites 0 likes

#deep-research

Started vetting library health with a deep research agent, the signal that mattered was which one flags when its sources disagree

Reddit r/AI_Agents ↗ · 2026-06-12

The author shares an approach to vetting library health using a deep research agent, discovering that the most valuable signal is when the agent flags disagreements among its sources rather than producing polished, false-confidence summaries. Apodex notably surfaced contradictions clearly, making it easier to adjudicate trust.

0 favorites 0 likes

#deep-research

@tavilyai: Tavily Deep Research is a single API endpoint that runs multi-step research end-to-end, returning a structured, source-…

X AI KOLs Following ↗ · 2026-06-08 Cached

Tavily announces its Deep Research API, a single endpoint that performs multi-step research end-to-end and returns structured, source-cited reports. The API supports custom files, output schemas, and configurable research modes.

0 favorites 0 likes

#deep-research

@Apodex_AI: Dive in Blog: https://apodex.com/blog/apodex-1.0 Tech report: http://apodex.com/pdf/20260608 Github: https://github.com…

X AI KOLs Following ↗ · 2026-06-08 Cached

ApodexAI releases Apodex-1.0, a deep-research model that operates as a tool-using ReAct agent. Its heavy-duty mode, Apodex-1.0-H, uses an asynchronous agent team with up to 150 sub-agents and achieves new state-of-the-art results on deep-research benchmarks including BrowseComp, DeepSearchQA, HLE, and FrontierScience, surpassing models like GPT-5.5-pro and Claude-Opus-4.8.

0 favorites 0 likes

#deep-research

@Apodex_AI: Meet 𝗔𝗽𝗼𝗱𝗲𝘅 𝟭.𝟬 — a heavy-duty agent team for deep research, which sets the SOTA! The team searches the web, re…

X AI KOLs Timeline ↗ · 2026-06-08 Cached

Apodex 1.0 is a heavy-duty AI agent team for deep research that achieves state-of-the-art performance by searching the web, reasoning over evidence, and producing reports with verifiable evidence chains.

0 favorites 0 likes

deep-research

Submit Feedback