n8n-nodes-simplescraper
v0.2.0
n8n community node for Simplescraper — web scraping API
n8n-nodes-simplescraper
n8n community node for Simplescraper. Scrape any URL, run saved scrape recipes, extract structured data with AI, capture screenshots, and discover URLs from sitemaps - without writing scraping code.
Install
In n8n: Settings → Community Nodes → Install → enter n8n-nodes-simplescraper and confirm.
Self-hosted instances can also run `npm install n8n-nodes-simplescraper` in their custom-nodes directory. The package follows the standard n8n community node convention.
Credentials
Get your API key from your Simplescraper account page. In n8n: Credentials → New → Simplescraper API, paste the key, save. The credential test hits GET /recipes - a green check means the key is valid.
Operations
Recipe
Work with saved Simplescraper recipes.
| Operation | What it does |
|---|---|
| Run | Runs a saved recipe. Optionally override the recipe's saved URL, or pass a list of URLs to scrape many pages through the recipe in one async run. Returns a results_id for long-running scrapes. |
| Get Latest Results | Fetches the most recent scrape results for a recipe - one item per scraped row. |
| Get History | Lists the last 100 runs of a recipe - one item per run. |
| Get Results by ID | Fetches a run's results by its results_id (from an async Run or the New Results trigger) - one item per scraped row. |
Page
Scrape any URL without a saved recipe.
| Operation | What it does |
|---|---|
| Extract Data | One-shot scrape of any URL. Optionally returns Markdown, raw HTML, and a screenshot URL. |
| AI Extract | Uses AI to extract structured data from a URL given a schema (e.g. name, price, availability). |
| Screenshot | Captures a pixel-perfect screenshot of any URL (1 credit). Returns a hosted URL by default. |
URL
| Operation | What it does |
|---|---|
| Extract URLs | Discovers all URLs from a website's sitemap - one item per URL. No credits consumed. |
Example: scrape every blog post on a site
Combine operations to discover blog URLs and extract their content as clean markdown:
- URL > Extract URLs with `https://simplescraper.io` - returns one item per URL in the site's sitemap. No credits consumed.
- Filter (n8n core node) keeping items where `{{ $json.url }}` contains `"blog"`.
- Page > Extract Data with Include Markdown enabled and URL set to `{{ $json.url }}` - pulls clean markdown from each filtered blog URL.
Useful for content audits, SEO inventories, or building a RAG corpus from a public site's blog. The Extract URLs step is free; each blog scrape costs 2 credits.
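The Filter step and the credit arithmetic can be sketched outside n8n as plain Python. This is an illustration only, not the node's implementation: the helper names `filter_blog_urls` and `estimate_credits` and the `example.com` items are hypothetical, and the match here is case-sensitive, which may differ from how your Filter node is configured. The `url` field name and the 2-credit cost per scrape come from the text above.

```python
from typing import Iterable

# Per the text above: Extract URLs is free, each page scrape costs 2 credits.
CREDITS_PER_PAGE_SCRAPE = 2


def filter_blog_urls(items: Iterable[dict], needle: str = "blog") -> list[dict]:
    """Keep items whose "url" field contains `needle` (case-sensitive)."""
    return [item for item in items if needle in item.get("url", "")]


def estimate_credits(items: list[dict]) -> int:
    """Count only the per-page scrapes, since URL discovery costs nothing."""
    return len(items) * CREDITS_PER_PAGE_SCRAPE


# Hypothetical items in the one-item-per-URL shape described for Extract URLs.
sitemap_items = [
    {"url": "https://example.com/"},
    {"url": "https://example.com/blog/first-post"},
    {"url": "https://example.com/pricing"},
    {"url": "https://example.com/blog/second-post"},
]

blog_items = filter_blog_urls(sitemap_items)
print(len(blog_items), "blog URLs,", estimate_credits(blog_items), "credits")
# prints: 2 blog URLs, 4 credits
```

Running the estimate on the filtered list before wiring up the Extract Data step gives a rough upper bound on what the workflow will spend.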
Long-running scrapes
For recipes that take more than ~30 seconds, the Run operation uses async mode by default and returns immediately with a results_id and status: "running". Use the Get Results by ID operation with that results_id to fetch the data once status becomes completed.
AI Extract follows the same pattern - if the response status is running, poll GET /v1/smart-extract/{extract_uid}.
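The polling pattern described in this section can be sketched as a small loop. This is a minimal sketch under stated assumptions, not the node's code: `poll_until_completed` and `fake_fetch` are hypothetical names, the fetch function is injected so that no real endpoint, base URL, or auth header is assumed, and the `data` field in the fake response is made up. Only the `running` and `completed` status values come from the text above; treating any other status as an error is an assumption. The same loop would apply to AI Extract if the fetch function is keyed by `extract_uid`.

```python
import time
from typing import Callable


def poll_until_completed(
    fetch_results: Callable[[str], dict],
    results_id: str,
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_results(results_id) until its status is "completed"."""
    for attempt in range(max_attempts):
        response = fetch_results(results_id)
        status = response.get("status")
        if status == "completed":
            return response
        if status != "running":
            # Assumption: any status other than running/completed is a failure.
            raise RuntimeError(f"unexpected status: {status!r}")
        if attempt < max_attempts - 1:
            sleep(interval_seconds)  # wait before the next poll
    raise TimeoutError(f"{results_id} still running after {max_attempts} polls")


# Stand-in for a real API call: reports "running" twice, then "completed".
_responses = iter([
    {"status": "running"},
    {"status": "running"},
    {"status": "completed", "data": [{"title": "Example"}]},  # hypothetical payload
])


def fake_fetch(results_id: str) -> dict:
    return next(_responses)


result = poll_until_completed(fake_fetch, "abc123", sleep=lambda seconds: None)
print(result["status"])  # prints: completed
```

Injecting `sleep` keeps the loop testable without real delays. In an n8n workflow the equivalent is typically a Wait node feeding back into Get Results by ID until the status changes.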
Links
- Simplescraper - the service
- API documentation - endpoint reference
- Report a bug or request a feature
- npm package
Changelog
0.2.0
- Get Results by ID operation - fetch a run's results by `results_id`.
- URLs field on the Run a Recipe operation - scrape many pages through one recipe in a single async run.
- Get Results by ID, Get Latest Results, and Extract URLs return one item per record (per scraped row, per URL).
- Renamed the Extract URLs action to "Extract all URLs from a website".
0.1.4
- Packaging and verification fixes for n8n community-node verification (provenance, icon, listing metadata). No functional changes.
0.1.0
- Initial release: Recipe, Page, and URL operations.
