get_web_contents
v1.0.0
CLI tool to fetch web content and convert to markdown
npm_get_web_contents
A CLI tool to extract main content from web pages as structured JSON, given any free-form text containing URLs.
Usage
npx get_web_contents <text including URLs>
- You can input any sentence or paragraph containing one or more URLs.
- The tool will extract all URLs from the input, fetch each web page, and output an array of JSON objects with the main content and metadata.
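The URL extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the function name `extractUrls`, the regular expression, and the trailing-punctuation trimming are all assumptions made for the example.

```typescript
// Hypothetical helper: pull http(s) URLs out of free-form text.
// The pattern and the cleanup rules are assumptions, not the tool's real logic.
function extractUrls(text: string): string[] {
  // Match runs of non-whitespace characters that start with http:// or https://
  const matches: string[] = text.match(/https?:\/\/[^\s<>"']+/g) || [];
  // Drop punctuation that usually belongs to the sentence, not the URL
  const cleaned = matches.map((m) => m.replace(/[.,;:!?)\]]+$/, ""));
  // Remove duplicates while keeping first-seen order
  return cleaned.filter((url, i) => cleaned.indexOf(url) === i);
}

const input =
  "Check this: https://example.com and also https://another.com for more info.";
console.log(extractUrls(input));
```

A simple pattern like this will miss unusual URL forms; it is only meant to show why free-form text is enough as input.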
Example
npx get_web_contents "Check this: https://example.com and also https://another.com for more info."
Output
The output is a JSON array. Each element contains:
- uri: The URL
- title: Page title
- description: Short description or excerpt
- keywords: (currently always empty)
- language: Detected language
- content: Main content in Markdown
- token: Character count of content
- images: List of images (title, src)
- links: List of links (title, href)
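For consumers of the JSON output, the fields listed above could be typed along these lines. The field names come from this README; the TypeScript types, the interface names, and the `totalCharacters` helper are assumptions for illustration, and the type of `keywords` in particular is a guess, since it is currently always empty.

```typescript
// Assumed shapes for the image and link entries (title, src / title, href).
interface ImageRef {
  title: string;
  src: string;
}

interface LinkRef {
  title: string;
  href: string;
}

// One element of the output array. Types are assumptions; names follow the README.
interface PageResult {
  uri: string;
  title: string;
  description: string;
  keywords: string[]; // assumed type; the README says this is currently always empty
  language: string;
  content: string; // main content in Markdown
  token: number; // character count of content, per the README
  images: ImageRef[];
  links: LinkRef[];
}

// Hypothetical helper: parse the CLI's stdout and sum the reported character counts.
function totalCharacters(stdout: string): number {
  const pages = JSON.parse(stdout) as PageResult[];
  return pages.reduce((sum, page) => sum + page.token, 0);
}
```

The cast after `JSON.parse` does no runtime validation, so a stricter consumer may want to check each field before relying on it.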
No API key or setup required. Just run with npx!
