n8n-nodes-cheerio
v1.0.7
Published
n8n node to parse HTML using Cheerio
Maintainers
Readme
n8n-nodes-cheerio
An n8n community node for parsing and extracting data from HTML using Cheerio.
What is n8n?
n8n is a fair-code licensed workflow automation platform that allows you to connect various services and automate tasks.
Installation
Follow the installation guide in the n8n documentation to install this community node.
Operations
The Cheerio node allows you to:
- Extract text content using CSS selectors
- Extract HTML content using CSS selectors
- Extract element attributes
- Remove unwanted elements before parsing
- Process multiple selectors in a single operation
- Return single items or arrays of matched elements
Configuration
Inputs
- HTML: The HTML content to parse
- Selectors: One or more selector configurations
- Name: Identifier for the extracted data
- CSS Selector: Standard CSS selector syntax
- Attribute: (Optional) Specific attribute to extract
- Trim Text: Remove whitespace from results
- Return Single Item: Get only the first match
- Return HTML: Get HTML instead of text content
- Remove Elements: (Optional) CSS selectors for elements to remove before parsing
Output
Returns a JSON object with:
results: Extracted data organized by selector namestotalElements: Count of all matched elementsselectors: Number of selectors processed
Usage Example
{
"html": "<div class='product'><h2>Product Name</h2><span class='price'>$99</span></div>",
"selectors": [
{
"name": "productName",
"selector": "h2",
"singleItem": true
},
{
"name": "price",
"selector": ".price",
"singleItem": true
}
]
}Result:
{
"results": {
"productName": "Product Name",
"price": "$99"
},
"totalElements": 2,
"selectors": 2
}Resources
Development
# Install dependencies
npm install
# Build the node
npm run build
# Run tests
npm test