n8n-nodes-simplescraper

v0.2.0

Published 7 days ago

n8n community node for Simplescraper — web scraping API

Vulnerabilities: 0 High, 0 Medium, 0 Low

simplescraperapp

n8n-community-node-package n8n simplescraper scraping web-scraping automation

n8n-nodes-simplescraper

n8n community node for Simplescraper. Scrape any URL, run saved scrape recipes, extract structured data with AI, capture screenshots, and discover URLs from sitemaps - without writing scraping code.

Install

In n8n: Settings → Community Nodes → Install → enter n8n-nodes-simplescraper and confirm.

Self-hosted instances can also run npm install n8n-nodes-simplescraper in their custom-nodes directory. The package follows the standard n8n community node convention.

Credentials

Get your API key from your Simplescraper account page. In n8n: Credentials → New → Simplescraper API, paste the key, and save. The credential test calls GET /recipes - a green check means the key is valid.

Operations

Recipe

Work with saved Simplescraper recipes.

| Operation | What it does |
|---|---|
| Run | Runs a saved recipe. Optionally override the recipe's saved URL, or pass a list of URLs to scrape many pages through the recipe in one async run. Returns a results_id for long-running scrapes. |
| Get Latest Results | Fetches the most recent scrape results for a recipe - one item per scraped row. |
| Get History | Lists the last 100 runs of a recipe - one item per run. |
| Get Results by ID | Fetches a run's results by its results_id (from an async Run or the New Results trigger) - one item per scraped row. |

Page

Scrape any URL without a saved recipe.

| Operation | What it does |
|---|---|
| Extract Data | One-shot scrape of any URL. Optionally returns Markdown, raw HTML, and a screenshot URL. |
| AI Extract | Uses AI to extract structured data from a URL given a schema (e.g. name, price, availability). |
| Screenshot | Captures a pixel-perfect screenshot of any URL (1 credit). Returns a hosted URL by default. |

URL

| Operation | What it does |
|---|---|
| Extract URLs | Discovers all URLs from a website's sitemap - one item per URL. No credits consumed. |

Example: scrape every blog post on a site

Combine operations to discover blog URLs and extract their content as clean markdown:

1. URL > Extract URLs with https://simplescraper.io - returns one item per URL in the site's sitemap. No credits consumed.
2. Filter (n8n core node) keeping items where {{ $json.url }} contains "blog".
3. Page > Extract Data with Include Markdown enabled and URL set to {{ $json.url }} - pulls clean markdown from each filtered blog URL.

Useful for content audits, SEO inventories, or building a RAG corpus from a public site's blog. The Extract URLs step is free; each blog scrape costs 2 credits.
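
To estimate the cost of this workflow before running it, the filter step and the credit arithmetic can be sketched outside n8n. This is a minimal illustration, not part of the node: the sample URLs and the helper names keep_blog_urls and estimate_credits are hypothetical, and the 2-credit figure is taken from the paragraph above.

```python
from typing import Iterable

# Credits per Page > Extract Data call, as quoted in this README's example.
CREDITS_PER_PAGE = 2


def keep_blog_urls(items: Iterable[dict], needle: str = "blog") -> list[dict]:
    """Mimic the Filter step: keep items whose "url" field contains `needle`."""
    return [item for item in items if needle in item.get("url", "")]


def estimate_credits(items: list[dict]) -> int:
    """Extract URLs is free, so only the per-page scrapes are counted."""
    return len(items) * CREDITS_PER_PAGE


# Hypothetical Extract URLs output: one item per sitemap URL.
sitemap_items = [
    {"url": "https://simplescraper.io/"},
    {"url": "https://simplescraper.io/blog/first-post"},
    {"url": "https://simplescraper.io/blog/second-post"},
    {"url": "https://simplescraper.io/pricing"},
]

blog_items = keep_blog_urls(sitemap_items)
print(len(blog_items), estimate_credits(blog_items))  # 2 4
```

Note that a plain substring match also keeps any URL that merely contains "blog" somewhere in its path, so a stricter condition such as "/blog/" may be preferable on some sites.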

Long-running scrapes

For recipes that take more than ~30 seconds, the Run operation uses async mode by default and returns immediately with a results_id and status: "running". Use the Get Results by ID operation with that results_id to fetch the data once status becomes completed.
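
The wait-then-fetch pattern described above can be sketched as a small polling loop. This is an illustration under stated assumptions rather than the node's implementation: fetch is a hypothetical stand-in for a Get Results by ID call and is assumed to return a dict with a "status" key, and the interval and attempt limit are arbitrary.

```python
import time
from typing import Callable


def wait_for_results(
    fetch: Callable[[], dict],
    interval_s: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call `fetch` until the returned status is no longer "running".

    `fetch` is a hypothetical stand-in for Get Results by ID; the response
    is assumed to be a dict carrying a "status" key.
    """
    for attempt in range(max_attempts):
        response = fetch()
        if response.get("status") != "running":
            return response
        # Pause between polls, but not after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"still running after {max_attempts} attempts")


# Simulated run: two "running" responses, then a completed one.
_responses = iter([
    {"status": "running"},
    {"status": "running"},
    {"status": "completed", "data": [{"title": "Example"}]},
])
result = wait_for_results(lambda: next(_responses), sleep=lambda _s: None)
print(result["status"])  # completed
```

Inside n8n the same idea is usually built from a Wait node and an If node looping back to Get Results by ID; the bounded attempt count here guards against polling forever if a run never finishes.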

AI Extract follows the same pattern - if the response status is running, poll GET /v1/smart-extract/{extract_uid}.
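
For callers polling the endpoint directly, a small helper can build the path and decide whether another poll is needed. This is a sketch only: the function names and the sample ID "abc123" are hypothetical, the API host and authentication are deliberately left out because they are not stated here, and the response is assumed to be a dict with a "status" key.

```python
from urllib.parse import quote


def smart_extract_status_path(extract_uid: str) -> str:
    """Build the polling path for GET /v1/smart-extract/{extract_uid}."""
    if not extract_uid:
        raise ValueError("extract_uid must not be empty")
    # Percent-encode the ID so unexpected characters cannot alter the path.
    return f"/v1/smart-extract/{quote(extract_uid, safe='')}"


def should_poll(response: dict) -> bool:
    """True while the extract reports status "running"."""
    return response.get("status") == "running"


print(smart_extract_status_path("abc123"))  # /v1/smart-extract/abc123
print(should_poll({"status": "running"}))   # True
```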

Changelog

0.2.0

- Get Results by ID operation - fetch a run's results by results_id.
- URLs field on the Run a Recipe operation - scrape many pages through one recipe in a single async run.
- Get Results by ID, Get Latest Results, and Extract URLs return one item per record (per scraped row, per URL).
- Renamed the Extract URLs action to "Extract all URLs from a website".

0.1.4

Packaging and verification fixes for n8n community-node verification (provenance, icon, listing metadata). No functional changes.

0.1.0

Initial release: Recipe, Page, and URL operations.

License

MIT
