doc-extractor
v0.1.4
Doc extraction UI + Claude Code integration — crawl any docs site and query it with Claude
doc-extractor
Crawl documentation sites and pipe them into your Claude Code terminal.
You point it at a docs URL, it extracts every page as markdown, and one click sends the content into your active Claude Code session so you can code against it in your own project.
Install
npx doc-extractor

This starts a local UI on http://localhost:3000 and installs three hooks into ~/.claude/settings.json so Claude Code can receive extractions.
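As a rough illustration of what installing a hook into ~/.claude/settings.json could involve, here is a minimal TypeScript sketch that adds a command hook to a settings object without registering it twice. The nested `hooks` layout follows the structure Claude Code documents for hook configuration; the `addHook` helper, the script path, and the `existingKey` placeholder are assumptions made for this example and are not taken from the package.

```typescript
// Hypothetical shapes: only the fields this sketch touches.
type HookCommand = { type: "command"; command: string };
type HookGroup = { matcher?: string; hooks: HookCommand[] };
type Settings = { hooks?: Record<string, HookGroup[]>; [key: string]: unknown };

// Add a command hook for `event` unless the same command is already registered.
function addHook(settings: Settings, event: string, command: string): Settings {
  const hooks: Record<string, HookGroup[]> = { ...settings.hooks };
  const groups = hooks[event] ?? [];
  const exists = groups.some((g) => g.hooks.some((h) => h.command === command));
  if (!exists) {
    hooks[event] = [...groups, { hooks: [{ type: "command", command }] }];
  }
  // Unrelated settings keys are carried over untouched.
  return { ...settings, hooks };
}

// The script path below is an assumption, not the package's real file name.
const hookCommand = "bash ~/.claude/doc-extractor-hook.sh";
const before: Settings = { existingKey: "kept" };
const after = addHook(before, "UserPromptSubmit", hookCommand);
const again = addHook(after, "UserPromptSubmit", hookCommand); // second call is a no-op

console.log(JSON.stringify(again, null, 2));
```

Making the merge idempotent means running the installer more than once should not stack duplicate hook entries, and copying the other keys avoids clobbering settings you already have.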
Use
- Open your project and start Claude Code (claude) in its terminal.
- Open http://localhost:3000 and paste a docs URL.
- When the crawl finishes, click Send to Claude on any page (or the whole extraction).
- Type anything in your Claude terminal — the extracted docs are injected into your prompt and Claude answers grounded in them, with access to your project files.
How it works
┌─────────────┐    writes queue    ┌──────────────┐
│ UI (:3000)  │ ─────────────────> │ ~/.claude/   │
└─────────────┘                    │ doc-extractor│
                                   │ -queue/      │
                                   └──────┬───────┘
                                          │ read by UserPromptSubmit hook
                                          ▼
                                   ┌──────────────┐
                                   │ Claude Code  │
                                   │ (your term)  │
                                   └──────────────┘

The crawler uses cheerio + turndown and runs entirely on your machine. Nothing is sent to a third-party service.
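The queue handoff in the diagram can be sketched as two small functions: one that the UI side might call to write an extraction into the queue directory, and one that a hook might call to read and clear it. This is only a sketch under stated assumptions. The file naming scheme, the markdown layout, and the `enqueue` and `drainQueue` names are hypothetical, and the package's real hooks are presumably shell scripts (given the bash requirement) rather than TypeScript. The demo uses a temporary directory instead of the real queue under ~/.claude/.

```typescript
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

// UI side: drop one extraction into the queue as its own file.
function enqueue(queueDir: string, title: string, markdown: string): string {
  fs.mkdirSync(queueDir, { recursive: true });
  // Timestamp-plus-random file name keeps entries unique (assumed scheme).
  const name = `${Date.now()}-${Math.random().toString(36).slice(2, 8)}.md`;
  const file = path.join(queueDir, name);
  fs.writeFileSync(file, `# ${title}\n\n${markdown}\n`, "utf8");
  return file;
}

// Hook side: read every queued file, delete it, and return the combined text.
function drainQueue(queueDir: string): string {
  if (!fs.existsSync(queueDir)) return "";
  const parts: string[] = [];
  for (const name of fs.readdirSync(queueDir).sort()) {
    const file = path.join(queueDir, name);
    parts.push(fs.readFileSync(file, "utf8"));
    fs.rmSync(file); // each extraction is delivered once
  }
  return parts.join("\n");
}

// Demo against a throwaway directory rather than the real queue.
const demoDir = fs.mkdtempSync(path.join(os.tmpdir(), "doc-queue-"));
enqueue(demoDir, "Getting started", "Install with `npm i example`.");
const injected = drainQueue(demoDir);
process.stdout.write(injected);
```

Claude Code adds whatever a UserPromptSubmit hook prints to stdout to the context for that prompt, which is why the drain step ends by writing the combined text to stdout. Deleting each file after reading it keeps the same extraction from being injected into every later prompt.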
Requirements
- Node 18+
- Claude Code installed and on PATH
- bash available (Git Bash on Windows, built-in on macOS/Linux)
Uninstall
npx doc-extractor uninstall

This removes the hooks, scripts, and the queue directory. Your extraction history is kept in ~/.claude/doc-extractor-history/.
License
MIT
