Tag
An open-source project providing an Opencode Skill that automatically generates in-depth research reports comparable to those from brokerages/research institutions through a four-stage pipeline (outline → data collection → parallel writing → review and assembly). Cost is less than 0.6 yuan, takes 10–20 minutes, supports output in 19 languages, suitable for independent developers and researchers.
Researchers from HKUST, ByteDance, and UCL propose SCORE, a co-evolutionary training framework that jointly trains an LLM as both a deep research report generator and an evaluator, using a meta-harness to dynamically adjust evaluation difficulty and prevent reward saturation. Experiments show consistent improvement in open-ended research report quality.
Introduces TVIR, a benchmark and hierarchical multi-agent framework for generating text-visual interleaved reports, evaluating factual reliability and visual alignment in automated report generation.
This paper presents Ptah, a multi-agent harness for generating verifiable multimodal deep research reports by interleaving textual and visual evidence through specialized agents and verification mechanisms. It introduces PtahEval for evaluation.
The article demonstrates how to use Sense Nova Skills, an AI tool, to generate a full global EV industry research report from a single prompt, with links to the GitHub repo and plugin.
AnchorDiff proposes a topology-aware masked diffusion framework for radiology report generation, integrating RadGraph-derived clinical anchors and confidence-based rewriting to achieve state-of-the-art results on MIMIC-CXR and MIMIC-RG4 benchmarks.
Google DeepMind launched Deep Research and Deep Research Max, autonomous agents using Gemini 3.1 Pro to browse web and custom data for professional, fully-cited reports.
DR³-Eval is a benchmark for evaluating deep research agents on multimodal, multi-file report generation with a realistic web environment simulation and comprehensive evaluation framework measuring information recall, factual accuracy, citation coverage, instruction following, and depth quality.
The demo showcases Codex combined with data analysis plugins as an intelligent data analyst, capable of collecting context across systems, generating business reports, and supporting real-time editing, chart adjustments, and exporting to Google Slides, providing a one-stop data analysis experience.