ai-docs-indexer

Generate compressed documentation indexes for AI agent context files (AGENTS.md, CLAUDE.md, etc.).

Why This Approach?

Based on Vercel's research on AGENTS.md, embedding a compressed documentation index directly in agent context dramatically outperforms other approaches:

Approach	Pass Rate
Baseline (no docs)	53%
Skills (tool-based retrieval)	53%
Skills with explicit instructions	79%
AGENTS.md (passive context)	100%

The key insight: rather than requiring agents to decide when to invoke documentation tools, embedding a compressed index (~8KB, 80% reduction from full docs) makes the information always available. This eliminates decision friction and sequencing problems that cause agents to skip retrieval.

Installation

pip install ai-docs-indexer

Or with uv:

uv pip install ai-docs-indexer

Usage

Running with uv

If you have uv installed, you can run the tool directly without manual environment setup:

uv run ai-docs-indexer scan ./docs --output AGENTS.md

This ensures the command runs in the expected Python environment with all dependencies.

Basic usage

ai-docs-indexer scan ./docs --output AGENTS.md

With options

ai-docs-indexer scan ./docs \
  --output CLAUDE.md \
  --format pipe \
  --name "Project Docs" \
  --root ./.docs \
  --extensions .md,.mdx,.rst \
  --instruction "Prefer retrieval-led reasoning"

Multiple formats

ai-docs-indexer scan ./docs --format pipe --format json

Output to stdout

ai-docs-indexer scan ./docs --format json --stdout

Compressed output

Use --compress to output on a single line without newlines:

ai-docs-indexer scan ./docs -n "Wordpress CLI Abilities" -i "Use retrieval-led reasoning" --compress

Output:

[mcp-adapter docs]|root: ./mcp-adapter|IMPORTANT: Use retrieval-led reasoning|.:{README.md}|architecture:{overview.md}|getting-started:{README.md,basic-examples.md,installation.md}|guides:{cli-usage.md,creating-abilities.md,custom-transports.md,default-server.md,error-handling.md,observability.md,testing.md,transport-permissions.md}|migration:{v0.3.0.md}|troubleshooting:{common-issues.md}

Each entry like architecture:{overview.md} represents a subfolder and its files. The .:{README.md} entry contains files in the root of the scanned directory.

Output Formats

Pipe format (default)

Compact, AGENTS.md-style format:

[Project Docs Index]|root: ./.docs
|IMPORTANT: Prefer retrieval-led reasoning
|01-getting-started:{01-install.mdx,02-config.mdx}

JSON format

{
  "name": "Project Docs Index",
  "root": "./.docs",
  "instruction": "Prefer retrieval-led reasoning",
  "directories": {
    "01-getting-started": ["01-install.mdx", "02-config.mdx"]
  }
}

YAML format

name: Project Docs Index
root: ./.docs
instruction: Prefer retrieval-led reasoning
directories:
  01-getting-started:
    - 01-install.mdx
    - 02-config.mdx

CLI Reference

ai-docs-indexer scan [OPTIONS] PATH

Arguments:
  PATH  Directory to scan for documentation files

Options:
  -o, --output PATH           Output file path
  -f, --format [pipe|json|yaml]  Output format (can specify multiple)
  -n, --name TEXT             Name for the index
  -r, --root TEXT             Root path in output
  -e, --extensions TEXT       Comma-separated extensions (.md,.mdx)
  -i, --instruction TEXT      Instruction for AI agents
  --include-hidden            Include hidden files/directories
  --follow-symlinks           Follow symbolic links
  --stdout                    Force output to stdout
  -q, --quiet                 Suppress status messages
  -c, --compress              Output on a single line without newlines

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src/ai_docs_indexer		src/ai_docs_indexer
tests		tests
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ai-docs-indexer

Why This Approach?

Installation

Usage

Running with uv

Basic usage

With options

Multiple formats

Output to stdout

Compressed output

Output Formats

Pipe format (default)

JSON format

YAML format

CLI Reference

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ai-docs-indexer

Why This Approach?

Installation

Usage

Running with uv

Basic usage

With options

Multiple formats

Output to stdout

Compressed output

Output Formats

Pipe format (default)

JSON format

YAML format

CLI Reference

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages