mintlify-parser
v1.0.1
Parse Mintlify llms-full.txt files and extract English-only documentation
Mintlify Parser
A Node.js tool to parse Mintlify's llms-full.txt files and extract English-only documentation by filtering out non-English content.
Features
- Fetches llms-full.txt files from any Mintlify documentation site
- Automatically filters out non-English content (removes URLs with language codes like /es/, /fr/, etc.)
- Outputs a clean Markdown file named after the source domain
- Shows detailed statistics about parsed and filtered content
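As an illustration of the language-code filtering described above, here is a minimal sketch of how such a check could look. It is not taken from the package's source: the helper name `isNonEnglishUrl`, the list of codes, and the choice to inspect only the first path segment are all assumptions.

```javascript
// Hypothetical helper, not mintlify-parser's actual implementation.
// ASSUMPTION: an illustrative, incomplete list of language codes.
// English ("en") is deliberately left out so /en/ pages are kept.
const NON_ENGLISH_CODES = new Set([
  "es", "fr", "de", "it", "pt", "ja", "ko", "zh", "ru",
]);

function isNonEnglishUrl(url) {
  // Only the path is inspected, so a host like "fr.example.com" is not matched.
  const [first] = new URL(url).pathname.split("/").filter(Boolean);
  // Accept a bare two-letter code ("es") or a region variant ("pt-BR").
  if (!first || !/^[a-z]{2}(-[a-z]{2})?$/i.test(first)) return false;
  return NON_ENGLISH_CODES.has(first.slice(0, 2).toLowerCase());
}
```

Checking only the first segment keeps false positives low (a page such as /guide/it is not flagged), at the cost of missing sites that nest the language code deeper in the path.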
Installation
Global Installation
npm install -g mintlify-parser
Local Usage
npm install
Usage
As a Global Command
mintlify-parser https://docs.firecrawl.dev/llms-full.txt
With Node
node mintlify-parser.js https://docs.firecrawl.dev/llms-full.txt
With npm script
npm start https://docs.firecrawl.dev/llms-full.txt
Output
The script will:
- Fetch the content from the provided URL
- Parse and filter out non-English sections
- Save the result as {domain}.md (e.g., docs.firecrawl.dev.md)
- Display statistics about the parsing process
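The steps above can be sketched roughly as follows. This is a hedged illustration rather than the package's real code: it assumes each page in llms-full.txt begins with a `# Title` line immediately followed by a `Source: <url>` line, and the names `splitSections`, `filterSections`, and `outputFileName` are hypothetical. The language check is passed in as a predicate so the sketch stays independent of how non-English URLs are detected.

```javascript
// Hypothetical sketch, not mintlify-parser's actual implementation.
// ASSUMPTION: a section starts at a "# Title" line whose next line is "Source: <url>".
// Any text before the first such header is ignored.
function splitSections(text) {
  const lines = text.split("\n");
  const sections = [];
  let current = null;
  for (let i = 0; i < lines.length; i++) {
    const next = lines[i + 1] || "";
    if (lines[i].startsWith("# ") && next.startsWith("Source: ")) {
      current = { url: next.slice("Source: ".length).trim(), lines: [] };
      sections.push(current);
    }
    if (current) current.lines.push(lines[i]);
  }
  return sections.map((s) => ({ url: s.url, body: s.lines.join("\n") }));
}

// Drop sections whose source URL the predicate flags, and collect statistics.
function filterSections(text, isNonEnglish) {
  const sections = splitSections(text);
  const kept = sections.filter((s) => !isNonEnglish(s.url));
  return {
    total: sections.length,
    removed: sections.length - kept.length,
    kept: kept.length,
    markdown: kept.map((s) => s.body).join("\n"),
  };
}

// Name the output file after the source domain, e.g. docs.firecrawl.dev.md.
function outputFileName(sourceUrl) {
  return `${new URL(sourceUrl).hostname}.md`;
}
```

The `total`, `removed`, and `kept` fields correspond to the three counts reported in the statistics.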
Example output:
⬇️ Fetching data from: https://docs.firecrawl.dev/llms-full.txt
⚙️ Parsing content...
🗑️ Removed: https://docs.firecrawl.dev/es/guide/intro
✅ Success!
📊 Total sections found: 45
🗑️ Non-English sections removed: 12
📝 English sections kept: 33
💾 Output saved to: docs.firecrawl.dev.md
Requirements
- Node.js >= 18.0.0 (for native fetch API support)
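To show why the Node.js version matters, here is a small sketch of a download step built on the native fetch API. The helper `fetchText` and its injectable `fetchImpl` parameter are hypothetical and only illustrate the idea; the package itself may structure this differently.

```javascript
// Hypothetical helper, not mintlify-parser's actual implementation.
// fetchImpl defaults to the global fetch that Node.js >= 18 provides.
async function fetchText(url, fetchImpl = globalThis.fetch) {
  if (typeof fetchImpl !== "function") {
    // On older Node.js versions there is no global fetch to fall back on.
    throw new Error("Native fetch is unavailable; Node.js >= 18.0.0 is required.");
  }
  const response = await fetchImpl(url);
  if (!response.ok) {
    throw new Error(`Request failed: ${response.status} ${response.statusText}`);
  }
  return response.text();
}
```

Accepting the fetch function as a parameter is a design choice made here so the helper can be exercised without network access.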
License
MIT
