@sisu-ai/tool-web-fetch
v9.0.0
Published
Fetch web pages and return structured text, HTML, or JSON for LLM consumption.
Maintainers
Readme
@sisu-ai/tool-web-fetch
Fetch web pages and return structured text, HTML, or JSON for LLM consumption.
Install
npm i @sisu-ai/tool-web-fetchEnvironment / Flags
WEB_FETCH_USER_AGENTorHTTP_USER_AGENT(flag:--web-fetch-user-agent)WEB_FETCH_MAX_BYTES(flag:--web-fetch-max-bytes) — default 500kBWEB_FETCH_RESPECT_ROBOTS(flag:--web-fetch-respect-robots) —1/true(default) to honor robots.txt; set0/falseto disable
Tool
- Name:
webFetch - Args:
{ url: string; format?: 'text'|'html'|'json'; maxBytes?: number } - Returns:
{ url, finalUrl?, status, contentType?, title?, text?, html?, json? }
Behavior
- Respects robots.txt by default for the provided User-Agent.
- Follows redirects and reads up to
maxBytesto avoid huge pages. - If
format: 'text'(default) and page is HTML, strips tags (removes script/style) and decodes basic entities; includestitle. - If
format: 'html', returns raw HTML andtitle. - If server returns JSON or
format: 'json', parses intojson. - Non-OK responses return status and a short text body snippet for debugging.
Notes
- This is a minimal fetcher to empower summarization / extraction workflows. For deeper crawling, add queueing, URL normalization, and robots.txt handling in upstream middleware.
Community & Support
Discover what you can do through examples or documentation. Check it out at https://github.com/finger-gun/sisu. Example projects live under examples/ in the repo.
Documentation
Core — Package docs · Error types
Adapters — OpenAI · Anthropic · Ollama
- @sisu-ai/mw-agent-run-api
- @sisu-ai/mw-context-compressor
- @sisu-ai/mw-control-flow
- @sisu-ai/mw-conversation-buffer
- @sisu-ai/mw-cors
- @sisu-ai/mw-error-boundary
- @sisu-ai/mw-guardrails
- @sisu-ai/mw-invariants
- @sisu-ai/mw-orchestration
- @sisu-ai/mw-rag
- @sisu-ai/mw-react-parser
- @sisu-ai/mw-register-tools
- @sisu-ai/mw-tool-calling
- @sisu-ai/mw-trace-viewer
- @sisu-ai/mw-usage-tracker
- @sisu-ai/tool-aws-s3
- @sisu-ai/tool-azure-blob
- @sisu-ai/tool-extract-urls
- @sisu-ai/tool-github-projects
- @sisu-ai/tool-rag
- @sisu-ai/tool-summarize-text
- @sisu-ai/tool-terminal
- @sisu-ai/tool-web-fetch
- @sisu-ai/tool-web-search-duckduckgo
- @sisu-ai/tool-web-search-google
- @sisu-ai/tool-web-search-openai
- @sisu-ai/tool-wikipedia
Anthropic — hello · control-flow · stream · weather
Ollama — hello · stream · vision · weather · web-search
OpenAI — hello · weather · stream · vision · reasoning · react · control-flow · branch · parallel · graph · orchestration · orchestration-adaptive · guardrails · error-handling · rag-chroma · web-search · web-fetch · wikipedia · terminal · github-projects · server · aws-s3 · azure-blob
Contributing
We build Sisu in the open. Contributions welcome.
Contributing Guide · Report a Bug · Request a Feature · Code of Conduct
Star on GitHub if Sisu helps you build better agents.
Quiet, determined, relentlessly useful.
