n8n-nodes-scrape-html
v1.0.1
Published
Fetch fully-rendered HTML from any webpage including SPAs using PDF API Hub
Downloads
249
Maintainers
Readme
n8n-nodes-scrape-html
Fetch the fully-rendered HTML of any webpage using a headless browser — perfect for SPAs and JavaScript-heavy sites.
This is an n8n community node powered by PDF API Hub.
🚀 Install
- Go to Settings → Community Nodes in n8n
- Enter
n8n-nodes-scrape-html - Click Install
🔑 Setup
Sign up at pdfapihub.com → copy your API key → add to n8n credentials.
✨ Features
| Parameter | Description |
|-----------|-------------|
| URL | Any public webpage |
| Wait Until | Fully Loaded, DOM Ready, Network Quiet, or First Response |
| Wait for Element | CSS selector to wait for before capturing (e.g. #main-content) |
| Extra Delay | Additional milliseconds for lazy content |
| Viewport | Desktop, Laptop, Mobile, Tablet, or Custom |
Advanced Options
| Option | Description | |--------|-------------| | Navigation Timeout | Max milliseconds to wait for page load | | User Agent | Custom user-agent string for mobile/bot versions | | Extra HTTP Headers | Additional headers sent with every request |
💡 Use Cases
- Web scraping — get rendered HTML from React/Vue/Angular SPAs
- Content monitoring — track changes on dynamic websites
- SEO analysis — fetch rendered HTML for SEO auditing
- Data extraction — scrape JS-rendered content that simple HTTP can't reach
