n8n-nodes-docxtotext
v0.2.1
Published
n8n node to convert DOCX files to text using mammoth
Maintainers
Readme
DOCX to Text Node for n8n
An n8n node that converts DOCX files to plain text using the mammoth library.
Features
- Convert DOCX files to formatted text
- Preserve basic formatting (bold, italics, etc.)
- Option to include the original filename in the output
- Simple and easy to use
Installation
In n8n
- Go to Settings > Community Nodes
- Click on Install
- Enter the npm package name:
n8n-nodes-docxtotext - Click Install
- Restart n8n
Manual Installation
- In your n8n root directory, run:
npm install n8n-nodes-docxtotext - Restart n8n
Usage
- Add the DOCX to Text node to your workflow
- Connect it to a node that provides a DOCX file (e.g., Read Binary File or HTTP Request)
- Configure the node:
- Binary Property: The name of the binary property containing the DOCX file (default:
data) - Options > Include Filename: Whether to include the original filename in the output
- Binary Property: The name of the binary property containing the DOCX file (default:
- The node will output an object containing:
text: The extracted text contentmessages: Any messages from the conversion processfilename: The filename without the extension (if enabled in options)fileExtension: The file extension in lowercase (if available and enabled in options)
Example
Input
A DOCX file with the following content:
Hello World!
This is a test document.Output
{
"text": "Hello World!\nThis is a test document.",
"messages": [],
"filename": "example",
"fileExtension": "docx"
}Development
Prerequisites
- Node.js (v14.x, v16.x, or v18.x)
- npm (v6.x or later)
- n8n (v0.198.0 or later)
Setup
- Clone the repository
- Install dependencies:
npm install - Build the project:
npm run build - Link the package for local development:
npm link - In your n8n directory, link the package:
npm link n8n-nodes-docxtotext - Restart n8n
Testing
- Run the linter:
npm run lint - Build the project:
npm run build
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- mammoth.js - Used for DOCX to text conversion
- n8n - Workflow automation tool
