defuddle-cli
v0.6.6
Published
Command line interface for Defuddle - extract article content from web pages
Readme
Defuddle CLI
Command line interface for Defuddle. Extract clean HTML or Markdown from pages.
Installation
npm install -g defuddle-cliUsage
defuddle parse <source> [options]Arguments
source: HTML file path or URL to parse
Options
-o, --output <file>: Output file path (default: stdout)-m, --markdown, --md: Convert content to markdown-j, --json: Output as JSON with both HTML and markdown content-p, --property <name>: Extract a specific property (e.g., title, description, domain)--debug: Enable debug mode-h, --help: Display help for command
Examples
Parse a local HTML file (outputs HTML):
defuddle parse article.htmlParse a URL and convert to markdown:
defuddle parse https://example.com/article --mdParse and get the full JSON response from Defuddle:
defuddle parse article.html --jsonSave markdown output to a file:
defuddle parse article.html --md -o output.mdExtract specific properties:
# Get just the title
defuddle parse article.html --property title
# Get the description
defuddle parse article.html -p description
# Get the domain
defuddle parse article.html --property domainDevelopment
# Install dependencies
npm install
# Build
npm run build
# Run in development mode
npm run dev