Show HN: Dari-docs – Optimize your docs using parallel coding agents

Hacker News Top Tools

Summary

dari-docs is a CLI tool that tests documentation quality by simulating AI agents performing tasks, identifying where agents get stuck, and optionally generating proposed edits to improve doc clarity.

It’s well known at this point that documentation needs to be optimized for AI agents - we’re all pointing our Claude Code &#x2F; Codex &#x2F; Pi agents at documentation, and expecting the models to figure out how to implement a product.<p>This, however, changes the entire optimization problem when writing documentation. Good documentation now becomes more objective - you are solving the very concrete problem: can a dumb harness running the dumbest model implement this reliably?<p>Humans can typically compensate for inconsistent terminology or scattered context across pages, but for agents, this often will waste time (or even just completely confuse the agent).<p>We’ve been building a small project around this called dari-docs: users can upload their documentation via website or CLI and run agents across different providers to see where they falter. You can upload your documentation, feed a list of tasks, and ask agents with varying intelligence &#x2F; cost levels to complete those tasks in parallel. When a run is complete, you get back a list feedback markdown files from each agent run and can apply changes based on agent feedback.<p>Managed service: <a href="https:&#x2F;&#x2F;optimize.dari.dev&#x2F;">https:&#x2F;&#x2F;optimize.dari.dev&#x2F;</a>, repo link: <a href="https:&#x2F;&#x2F;github.com&#x2F;mupt-ai&#x2F;dari-docs" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;mupt-ai&#x2F;dari-docs</a><p>The agents actually try to use the product end-to-end. They search through the docs, follow instructions, run commands, try examples, and attempt to debug failures. Importantly, this is not a static LLM review of the documentation. The agents are actually attempting the integration.<p>You can also enable live verification with test credentials so the agents can actually verify workflows against real APIs:<p><pre><code> dari-docs check . --live-verify --secret-env DARI_TEST_API_KEY --task &quot;Create a checkout session&quot; </code></pre> If you’re building a CLI, API, MCP server, or SDK and actively maintaining docs for humans or agents, we’d love to work with you and test this on real workflows!
Original Article
View Cached Full Text

Cached at: 05/20/26, 11:30 PM

mupt-ai/dari-docs

Source: https://github.com/mupt-ai/dari-docs

dari-docs

Make your docs so good even the dumbest agent can ship.

dari-docs is a CLI for testing whether your documentation is clear enough for agents to use. It sends your docs to simulated developer agents, asks them to complete real tasks, reports where they get stuck, and can generate proposed docs edits from that feedback.

Use it to turn documentation quality from “seems understandable” into “an agent can actually complete the task.”

Why dari-docs?

Good docs used to mean “a developer can eventually figure this out.” That is no longer enough.

When the reader is an agent, ambiguity becomes measurable. Inconsistent terminology, hidden assumptions, scattered context, and missing setup steps all increase the chance that the agent fails the task or wastes context trying to infer what the docs meant.

dari-docs gives you a repeatable feedback loop for agent-readable documentation: define the task, run simulated users, inspect failures, and optionally pull back edited docs.

What it does

  • Tests docs with simulated developers — agents attempt concrete tasks using only the docs you provide.
  • Finds task-blocking ambiguity — reports missing context, unclear setup, inconsistent terms, and places where the agent had to guess.
  • Generates proposed fixesoptimize turns tester feedback into edited documentation you can review locally.
  • Runs managed or self-managed — use the hosted dari.dev Docs service, or run against agents in your own dari.dev org.
  • Uses normal agent projects — the tester and editor are just folders of prompts, skills, setup scripts, and a dari.yml manifest.

Install

Install the latest dari-docs binary with the install script, then verify the CLI is available:

curl -fsSL https://raw.githubusercontent.com/mupt-ai/dari-docs/main/install.sh | bash
dari-docs --help

Quickstart

Managed mode uses the hosted dari.dev Docs service and a separate dari.dev Docs credit balance. New accounts start with five dollars worth of free credits.

From your docs repo:

dari-docs auth login

Run a docs check:

dari-docs check . \
  --managed \
  --task "Install the SDK and make a first API call"

The command submits a managed run and prints the run ID. To wait for completion in the same command, add --wait.

Generate proposed docs edits:

dari-docs optimize . \
  --managed \
  --wait \
  --task "Install the SDK and make a first API call"

With --wait, edited files are downloaded into .dari-docs/updated/ without changing your repo. Review that folder and copy changes into your repo when ready.

How it works

  1. You point dari-docs at a docs directory or public docs URL and give it one or more tasks.
  2. The CLI bundles your local docs, or passes public docs URLs for checker agents to inspect with internet access, then submits a run to hosted Dari Docs agents.
  3. Tester agents try to complete the task and report where the docs blocked progress.
  4. Use dari-docs runs wait, dari-docs runs download, or pass --wait to collect local run artifacts.
  5. If you run optimize, an editor agent proposes documentation changes.
  6. Proposed edits can be downloaded to .dari-docs/updated/ for review.

The simulated users are plain dari.dev agents. Managed mode uses the hosted Dari Docs tester and editor agents automatically. If you want to customize the agent prompts, skills, setup scripts, or dari.yml, use self-managed mode.

Managed vs self-managed

ModeUse whenRequires
ManagedYou want the fastest setup and hosted execution.dari-docs auth login
Self-managedYou want runs in your own dari.dev org.A dari.dev API key and deployed agents

Most users should start with managed mode.

Documentation

Similar Articles

Designing the hf CLI as an agent-optimized way to work with the Hub

Hugging Face Blog

Hugging Face redesigned its `hf` CLI to be optimized for both human users and AI coding agents like Claude Code and Codex, with agent-aware output rendering and benchmarking showing up to 6× token savings versus no-CLI baselines on complex tasks.