n8n-nodes-docupipe
v0.1.0
Published
n8n community node for DocuPipe - automate document processing, extraction, and classification
Downloads
102
Readme
n8n-nodes-docupipe
This is an n8n community node for DocuPipe - an intelligent document processing platform. It lets you automate document uploads, data extraction, classification, and more directly from your n8n workflows.
What is DocuPipe? DocuPipe processes documents (PDFs, images, Word files) using AI to extract structured data, classify documents into categories, and split/merge multi-page files. You define schemas (what fields to extract) and classes (document categories) in the DocuPipe dashboard, then use this node to automate the processing.
n8n is a fair-code licensed workflow automation platform.
Installation | Credentials | Operations | Triggers | Examples | Compatibility | Resources
Installation
Follow the installation guide in the n8n community nodes documentation.
Credentials
You need a DocuPipe API key to use this node:
- Sign up for free or log in at app.docupipe.ai
- Go to Settings → General
- Copy your API Key
- In n8n, create a new DocuPipe API credential and paste your key
- Click Test to verify the connection
Operations
Document
- Upload - Upload a document from a URL or binary file (e.g. email attachment)
- Get - Retrieve a document's details by its ID
- Split - Split a multi-page document into individual documents
- Merge - Merge multiple documents into one
Extraction
- Extract - Extract structured data from a document using a schema you defined in DocuPipe
- Get Result - Retrieve the extracted data after extraction completes
- Upload and Extract - Upload a document and extract data in one step
Classification
- Classify - Classify a document into categories you defined in DocuPipe
Schema / Class
- List Schemas - List all extraction schemas in your account
- List Classes - List all document classes in your account
Triggers
The DocuPipe Trigger node automatically starts a workflow when DocuPipe finishes processing. No polling needed - DocuPipe sends data to n8n in real-time via webhooks.
| Event | What it returns | |-------|-----------------| | Document Processed | documentId, filename, text, numPages | | Extraction Complete | standardizationId, documentId, schemaId, extracted data | | Classification Complete | documentId, assigned classIds | | Split Complete | parentDocumentId, childDocumentIds | | Merge Complete | new documentId, source documentIds | | Document Error | documentId, errorMessage | | Extraction Error | standardizationId, documentId, errorMessage | | Classification Error | documentId, errorMessage | | Split Error | documentId, errorMessage | | Merge Error | documentIds, errorMessage |
Examples
Example 1: Upload a file from URL and extract data
The simplest workflow - upload a document and extract structured data from it.
- Add a Manual Trigger node (click to run)
- Add a DocuPipe node → Resource: Extraction → Operation: Upload and Extract
- Set Input Mode to URL
- Enter the file URL (e.g.
https://example.com/invoice.pdf) and filename - Select a Schema from the dropdown (create one first at app.docupipe.ai → Schemas)
- Run the workflow - you'll get back a
documentIdandstandardizationId - To get the extracted data, add a DocuPipe Trigger (Extraction Complete) in a separate workflow
Example 2: Process email attachments automatically
Automatically extract data from every email attachment you receive.
- Add a Gmail Trigger node → Event: Message Received
- Add a DocuPipe node → Resource: Document → Operation: Upload
- Set Input Mode to Binary File
- Leave Binary Property as
data(Gmail outputs attachments as "data") - Connect Gmail → DocuPipe
Now every email attachment is automatically uploaded to DocuPipe. Add a DocuPipe Trigger (Document Processed) in another workflow to process them after upload.
Example 3: Extract data from Google Drive files
Process documents saved in Google Drive.
- Add a Google Drive Trigger node → Event: File Created
- Add a Google Drive node → Operation: Download (to get the file as binary)
- Add a DocuPipe node → Resource: Extraction → Operation: Upload and Extract
- Set Input Mode to Binary File
- Leave Binary Property as
data - Select your Schema
- Connect: Google Drive Trigger → Google Drive Download → DocuPipe
Example 4: Extract data and send results to Google Sheets
End-to-end automation: extract invoice data and write it to a spreadsheet.
Workflow 1 - Kick off extraction:
- Gmail Trigger (Message Received)
- DocuPipe → Upload and Extract (Binary File mode, select your invoice schema)
Workflow 2 - Process results:
- DocuPipe Trigger → Event: Extraction Complete (receives standardizationId and extracted data)
- Google Sheets → Append Row (map the extracted fields to spreadsheet columns)
Example 5: Classify and route documents
Automatically classify documents and take different actions based on the type.
Workflow 1 - Upload and classify:
- HTTP Request node → Download a file (set Response Format to File)
- DocuPipe → Upload (Binary File mode)
- DocuPipe → Classify (use the documentId from upload, select your classes)
Workflow 2 - Route based on classification:
- DocuPipe Trigger → Event: Classification Complete (returns documentId and classIds)
- IF node → Check which class was assigned
- Route to different actions (e.g. invoices → accounting software, contracts → legal team)
Example 6: Upload a document from base64 (database or API)
If your documents are stored as base64 strings (e.g. in a database or returned by an API):
- Add a node that outputs base64 content (e.g. a Postgres node querying a
documentstable, or a Code node) - Add a DocuPipe node → Resource: Document → Operation: Upload
- Set Input Mode to Base64
- Set Base64 Content to reference the field from the previous node (e.g.
{{ $json.fileContent }}) - Set Filename (e.g.
invoice.pdf) - DocuPipe needs this to know the file format
Example 7: Split a document and process each part
Split a multi-page PDF into individual documents, then extract data from each one.
Workflow 1 - Upload and split:
- Manual Trigger
- DocuPipe → Upload (URL or Binary File)
- DocuPipe → Split (use the documentId from upload)
Workflow 2 - Process each child document:
- DocuPipe Trigger → Event: Split Complete (returns parentDocumentId and childDocumentIds)
- Split In Batches node → loop through each childDocumentId
- DocuPipe → Extract (use each childDocumentId + select a schema)
How Binary File mode works
Many n8n nodes output files as "binary data" (Gmail attachments, HTTP downloads, Google Drive files, Dropbox, etc.). DocuPipe's Binary File mode reads these files automatically - no manual conversion needed.
Steps:
- Add a node that outputs a file (e.g. HTTP Request with Response Format: File)
- Connect it to a DocuPipe node
- Set Input Mode to Binary File
- Leave Binary Property as data (this is the default for most nodes)
- Run it - DocuPipe automatically reads the file and uploads it
How to check the binary property name: If "data" doesn't work, run the previous node first and click on the Binary tab in the output. The property name is shown there (e.g. "data", "attachment_0", "file").
AI Agent Support
This node supports usableAsTool, making it available as a tool in n8n's AI Agent workflows. Self-hosted users need to set the environment variable:
N8N_COMMUNITY_PACKAGES_ALLOW_TOOL_USAGE=trueCompatibility
- Tested with n8n version 2.12.3
- Requires Node.js 18 or later
Resources
- DocuPipe Dashboard - Create schemas, classes, and manage documents
- DocuPipe Documentation - API reference and guides
- DocuPipe Website - Sign up for a free account
- n8n Community Nodes Documentation
