@geekbb: Opencode Deep Research Report Generation Skill
Summary
An open-source project providing an Opencode Skill that automatically generates in-depth research reports comparable to those from brokerages/research institutions through a four-stage pipeline (outline → data collection → parallel writing → review and assembly). Cost is less than 0.6 yuan, takes 10–20 minutes, supports output in 19 languages, suitable for independent developers and researchers.
View Cached Full Text
Cached at: 06/08/26, 03:28 PM
Opencode Deep Research Report Generation Skill
Traditional deep research is expensive, industry reports cost $50–500+ each, making them unaffordable for individual researchers and indie developers. AI-driven research is often too shallow (a few summaries then a conclusion) or fabricates numbers. This project uses a 4-stage pipeline (outline → data collection → parallel writing → validation and assembly) to produce deep research reports matching the standards of brokerage firms and third-party research institutes, taking ~10–20 minutes per report at a cost of < ¥0.6.
https://github.com/hoolulu/deep-research…
hoolulu/deep-research
Source: https://github.com/hoolulu/deep-research
deep-research Skill
🇨🇳 中文 · 🇬🇧 English
Deep Research Report Generation Skill · Supports 19 languages output
Current version: See updates (https://github.com/hoolulu/deep-research/commits/main)
✨ One Minute Overview
| 🎯 One command | /research <your topic> → fully automated research, no manual intervention |
| ⏱ Report in ten minutes | quick mode ~8–12 min, standard ~10–15 min |
| 🌍 19 languages | The report will be written in the language of the topic, automatically detected |
| 🔧 Not exclusive to OpenCode | Works with Claude Code, Cursor, Codex CLI, Windsurf, Cline, etc. |
| Command | Description |
|---|---|
/research Current status of China’s new energy vehicle industry | Chinese report |
/research Competitive landscape of AI cloud computing | English report |
/research Анализ рынка нефти и газа в России | Report in Russian |
/research 日本のアニメ産業のグローバル市場戦略 | Report in Japanese |
/research 한국 반도체 산업의 글로벌 경쟁력 분석 | Report in Korean |
The entire interaction and search for materials is conducted in the specified language—not merely a translation of the output.
I. Why You Need This
If you’ve tried getting AI to do research, you’ve probably run into these pitfalls:
Search + summary → too shallow, only a few snippets with no depth.
Industry reports priced per copy $50–500+ → too expensive for individuals.
Overseas tools → can’t search domestic resources such as Baidu Baike, Zhihu, 199IT, iResearch.
AI inventing numbers → looks plausible but has no source.
This skill completes a 4-layer process before delivering the report. It doesn’t just search and output; it follows: analyze → search & verify → write → verify.
II. Who Should Use It
Indie developers, independent researchers, small teams.
Anyone who needs professional-grade research capabilities but doesn’t want to rely on paid databases or research institutions.
III. Output of One Standard Mode Research
| Metric | Data (standard mode example) |
|---|---|
| Report length | 500–700 lines / approx. 12,000–18,000 characters |
| Data tables | 15–25 tables covering market size, competitive landscape, technical parameters, etc. |
| Analysis paragraphs | 80–120 paragraphs (each with conclusion + data + causality + judgment) |
| Cited independent sources | 15–25 (CAICT, iResearch, National Bureau of Statistics, Baidu Baike, Zhihu, 36Kr, The Paper, etc.) |
| Opposing viewpoints | 3–8 points, at least one controversy or counterargument per chapter |
| Data collection time | ~1–3 minutes |
| Report generation time | ~8–15 minutes |
| Total time | ~10–20 minutes |
The above figures are typical ranges for standard mode; actual results may vary depending on topic complexity, data availability, search engine responsiveness, etc.
📂 View all generated reports → — Browse all generated research reports here; click to open and read.
IV. How It Works
The entire process consists of 4 stages, executed automatically in sequence:
1 Analyze outline — analyze the topic, generate research framework and search plan
↓
2 Collect data — SearXNG / Exa cascading search → Scrapling batch scraping → data pool extraction → data quality check
↓
3 Parallel writing — multiple chapters written simultaneously, facts embedded directly in prompts, no tool calls
↓
4 Validate and assemble — batch validate → assemble-report → convert-citations → qa-report
V. Search Pipeline & Built-in Resources
All tools are built-in—no additional purchases needed. The system uses a three-layer cascading search strategy:
SearXNG (author-deployed meta search engine, 70+ engines including Baidu/Google/Brave) → Exa (OMO built-in cold backup) → over ten free search engines + domestic data sources (final fallback). Each layer checks if it’s reachable before proceeding; if available, it’s used immediately without probing lower layers, ensuring fast response. The entire flow:
Layer 1 — SearXNG (author-deployed, 70+ engines including Baidu/Google/Brave, ready to use out of the box)
↓
If unavailable: Layer 2 — Exa (OMO built-in cold backup, zero cost)
↓
If unavailable: Layer 3 — Free source reinforcement (fallback)
├─ Search reinforcement line
│ Known source reinforcement line
├─ DuckDuckGo
│ Baidu Baike / Wikipedia
├─ Bing China
│ Zhihu / 36Kr / The Paper
├─ Brave / Mojeek
│ 199IT / iResearch / East Money
├─ Semantic Scholar
│ National Bureau of Statistics / Weibo / CSDN
└─ GDELT / arXiv
Douban / Huxiu
The search reinforcement line can dynamically discover any other websites, not limited to the list above. All source URLs are eventually batch-scraped for full text by Scrapling.
VI. Unique Report Highlights
| Dimension | Description |
|---|---|
| Professional multilingual writing | Automatically detects the topic’s language and writes the report directly in 19 languages—not a translation mode |
| Every number has a source | Inline citations (N) are clickable references; a list of sources is provided at the end. Numbers without a source are not written |
| Pro & con viewpoints coexist | Each chapter presents controversies and opposing perspectives, avoiding one-sidedness |
| Confidence level classification | Summary table at the end (High/Medium/Low) shows what is reliable and what is disputed |
| Data anti-error mechanism | Automatically identifies common data errors—unit mix-ups, fabricated data, misattributed figures—to prevent problematic data from entering the report |
| Paragraphs over lines | Each chapter’s core is 8–12 substantive paragraphs; tables and blank lines cannot pad the content |
VII. Three Depth Levels
| Command | Purpose | Min. chapters | Min. paragraphs/chapter | Max characters | Approx. time |
|---|---|---|---|---|---|
/research <topic> | standard (default) | 8 | ≥ 5 | ≤ 12,000 | ~10–15 min |
/research <topic> -quick | Quick insight | 5 | ≥ 4 | ≤ 8,000 | ~8–12 min |
/research <topic> -deep | Extreme depth | 10 | ≥ 6 | ≤ 25,000 | ~15–25 min |
Parameters are in
profiles.json; modify and restart to apply. Character count excludes spaces and Markdown syntax.
VIII. Screenshots
(Placeholder for screenshots)
IX. Installation
🧠 Method 1: AI Easy Install (Recommended)
Copy the following prompt into the OpenCode chat and send it—AI will do everything automatically:
Please research the project at https://github.com/hoolulu/deep-research, and follow the documentation requirements in order:
1. Install prerequisites (determine installation method based on Scrapling official docs and your OS)
2. Register the Scrapling MCP Server, ensure it works after CLI restart
3. Register the /research and /research-update commands
After each step, confirm the result. After completion, read VERSION to confirm the version number and summarize the installation status.
The AI will read the project docs → understand the system type → install each item → verify usability. No manual commands required.
🔧 Method 2: Non-OpenCode Users (Claude Code / Codex CLI / Cursor, etc.)
Paste this prompt into your AI coding tool:
Please research the project at https://github.com/hoolulu/deep-research, automatically install prerequisites and adapt it to the current CLI tool:
1. Install Python and Scrapling (see Scrapling official docs and your system for method)
2. Register the Scrapling MCP Server, restart to activate
3. Register custom commands equivalent to /research and /research-update
4. Translate the Task chain architecture into an equivalent implementation for the current tool
After each step, confirm the result. After completion, read VERSION to confirm the version number and summarize the status.
Adaptation points for different tools: multi-agent orchestration needs to map to each tool’s native mechanisms (Claude Code’s sub-agent, Codex CLI’s multi-file tasks, Cursor’s agent mode, etc.). The search and scraping logic (python-scrapling + search API) can be reused as-is.
Prerequisites
| Component | Purpose | How to get |
|---|---|---|
| OpenCode (core) | AI coding agent runtime | Visit https://opencode.ai/ |
| oh-my-openagent (required) | Provides analysis/search sub-agents + auto-configures MCP | Ask AI to install it for you using oh-my-openagent official docs |
| Scrapling (required) | Full-text web scraping | Ask AI to install and register MCP |
| SearXNG | Web search (primary, author deployed 70+ engines including Google/Baidu/Brave) | Skill includes default endpoint, ready to use out of the box—no need to install |
| Exa MCP | Web search (cold backup) | Included in OMO, no need to install |
This skill depends on the sub-agent provided by the oh-my-openagent plugin. Without it, the
/researchcommand cannot execute. Other programming tools have their own multi-agent frameworks, so it may not be required.
X. Usage
After installation and restart of OpenCode, type in the chat:
| Command | Description | Approx. time |
|---|---|---|
/research <your topic> | standard mode | ~10–15 min |
/research <your topic> -quick | quick mode | ~8–12 min |
/research <your topic> -deep | deep mode | ~15–25 min |
/research-update | Check for updates | — |
What happens after you send it
The entire process runs automatically—you don’t need to do anything:
1 Analyze outline — analyze the topic, generate research framework and search plan
2 Collect data — SearXNG / Exa cascading search → Scrapling batch scraping → data pool extraction → data quality check
3 Parallel writing — multiple chapters written simultaneously, facts embedded directly in prompts, no additional tool calls
4 Assemble and validate — batch validate → assemble-report → convert-citations → qa-report
Total time ~10–20 minutes. Complex topics may take longer, simple ones may be faster.
Output file
The report is saved in Markdown format to the reports/ folder inside the skill directory, with a date-time stamp in the filename:
~/.opencode/skills/deep-research/reports/
Open it with any Markdown reader (Typora / Obsidian / VS Code, etc.).
You can also specify a custom storage path by asking AI to modify it.
👉 Browse all generated reports — Click to browse all generated research reports; you can click to open and read.
XI. Cost
| Component | Cost |
|---|---|
| LLM (the one you already use) | DeepSeek v4 Flash baseline: quick ~100k–200k tokens / < ¥0.15, standard ~150k–350k / < ¥0.3, deep ~300k–600k / < ¥0.6 |
| SearXNG search (author deployed) | Deployed on VPS, zero cost, unlimited usage |
| Exa search | Built into OpenCode, zero additional cost (cold backup engine) |
| Scrapling scraping | Runs locally, zero cost |
| Domestic sources (Baidu Baike/Wikipedia/Zhihu/36Kr/The Paper/199IT/iResearch/East Money/NBS, etc.) | Direct connection, zero cost, no proxy needed |
| OpenCode runtime | MIT open source, zero cost |
Estimates based on DeepSeek v4 Flash; actual figures may vary depending on model and topic complexity.
XII. FAQ
1. Search quota? How to ensure uninterrupted search?
The system uses a three-layer cascading search architecture. Each layer’s engine is independent; if the upper layer fails, it automatically falls back to the lower layer:
- Layer 1 — SearXNG (author deployed): A meta search engine deployed by the author on a VPS, aggregating 70+ search engines (including Baidu/Google/Brave), covering both Chinese and English. Built-in default endpoint, ready to use out of the box—unlimited, unthrottled, no quota.
- Layer 2 — Exa (cold backup): Built-in search engine in OpenCode, automatically configured by the OMO plugin, zero cost. If rate limited, automatically falls to Layer 3.
- Layer 3 — Free source reinforcement (final fallback): DuckDuckGo / Bing / Brave / Mojeek / Semantic Scholar / GDELT / arXiv + Baidu Baike / Zhihu / 199IT / iResearch / 36Kr / The Paper / East Money / Weibo / CSDN / Huxiu / Douban, 20+ sources. No API key required, always available.
2. How to generate a report using local materials?
You can try sending these three prompts to AI after installing the skill, adjusted to your needs. However, it’s best to first ensure your LLM can parse PDF and DOCX content; plain text and Markdown files can be read directly.
- Scenario 1: Local materials + online supplement (recommended for the most complete research)
Please use the deep-research skill. Based on the local materials in D:\My Notes\ProjectA, generate a research report about XX (quick mode). Prefer materials from the local files; if insufficient, supplement by searching online.
- Scenario 2: Local materials only, no online search (when materials are sufficient and you worry online search might distract)
Please use the deep-research skill. Based on the local materials in D:\My Notes\ProjectA, generate a research report about XX (quick mode). Only use local materials; do not search online.
- Scenario 3: Purely local, no skill, lightest (when you don’t need a professional report format, just a simple collation)
Based on the materials in D:\My Notes\ProjectA, help me organize them into a structured research report with a table of contents and chapter headings.
3. How to update to the latest version?
Version strategy: The main branch always contains the latest code; minor changes are pushed directly. GitHub Releases are only used for milestone version markers (e.g., v2.1.0 → v2.2.0). There’s no need to wait for a new Release to update.
OpenCode users:
- Automatic: Type
/research-update, AI will automatically rungit pullto fetch the latest. - Manual:
cd ~/.opencode/skills/deep-research && git pull
Check the version number with cat ~/.opencode/skills/deep-research/VERSION.
4. Can non-OpenCode users update automatically?
You can simply ask AI to perform version comparison and update adaptation. Paste the following prompt into your AI coding tool:
Please compare the differences between the latest version of https://github.com/hoolulu/deep-research and your local version,
identify new features and fixes from upstream, and apply them one by one to your local adapted version, preserving platform-specific modifications.
5. Is the data secure?
All processing is done locally. No user data is collected or uploaded.
License
MIT
This project is licensed under the MIT License. MIT was chosen over stricter licenses (GPL, CC, etc.) because the project’s core is a portable methodology and pipeline design, not a copyrighted library of finished works. MIT allows it to be reused and modified across different platforms and toolchains to the greatest extent, consistent with the “not exclusive to OpenCode” positioning.
Created by hoolulu (https://github.com/hoolulu) · Project repository: github.com/hoolulu/deep-research (https://github.com/hoolulu/deep-research)
Community discussion: LINUX DO (https://linux.do/t/topic/2312664)
Similar Articles
@neil_xbt: https://x.com/neil_xbt/status/2058536136901902728
Presents an AI-powered research service that produces executive-level briefs in hours instead of weeks, priced at $350-$500 per report to generate $7,000 per month, targeting senior leaders who lack time for manual research.
@aronhouyu: A market research artifact!!!! It can do: Input a topic, automatically conduct research across more than a dozen platforms like Reddit, X, YouTube, Hacker News, Polymarket, etc., and synthesize the results into a well-supported summary report. It's quite awesome!!!!! I'm off to play with it first. https://gi…
Introduces a search engine called /last30days driven by an AI agent that can simultaneously search multiple platforms like Reddit, X, YouTube, Hacker News, Polymarket, etc., and generate summary reports.
@gaoren7716: Writing a thesis? It might get rewritten by a systematic AI system. Not just polishing or rewriting a sentence – from topic selection onward, there are 13 agents collaborating. Feature list: Deep Research (13-agent research team), Systematic Review (P…
Introduces an open-source AI tool called Academic Research Skills that automates the entire academic research process from topic selection to writing and peer review through collaboration of 13 agents. It can be installed as a Claude Code plugin, turning academic research into a standardized production line.
@IndieDevHailey: A blessing for researchers! This open-source tool helps you break through the sea of literature and manage the entire academic workflow with one click. Still struggling with slow literature research, writer's block, improper citations, and harsh peer reviews? Check out this open-source repository: academic-research-skills. It's not an AI ghostwriting tool, but a reliable human-AI collaboration framework—…
Recommends the open-source repository academic-research-skills, which provides a set of human-AI collaborative tools for the entire academic research workflow, including in-depth literature research, paper writing, peer review simulation, and citation audit. It supports AI assistance while keeping the user in control, suitable for graduate students and researchers.
@tychozzz: I strongly recommend you try out the scheduled task feature of OpenAI Codex. It automatically produces a US stock financial daily report at 8 AM every morning. Compared to Claude Code, it's simpler — no code configuration required — and more affordable at just $20 per month. A week ago, I moved the US stock daily report from Claude...
Recommend using the scheduled task feature of OpenAI Codex. For just $20 per month, it automatically generates professional US stock financial daily reports without any code configuration, and includes a detailed prompt template.