ccxc-ocr
v0.0.39
Published
CCxC ocr to scan different types of documents
Readme
CCxC OCR
Library to scan documents such as invoices, state of results, bank movements, credit card histories, etc.
Usage
const {scanInvoice} = require('ccxc-ocr');
const {
dates,
names,
nits,
results
totals,
} = await scanInvoice('PATH_OF_THE_FILE', options);Params
- PATH_OF_THE_FILE: path to the file, can be a relative path or an url
Supported extensions: jpg, jpeg, png, pdf.
Troubleshooting
Help
tesseract --helptesseract --help-extra
Example commands
tesseract IMAGE_NAME FILENAME OUTPUT_FORMAT_TYPEStesseract sample2.jpg out hocrtesseract sample2.jpg out pdf
Dependencies:
- python 2.7
- node -v x64 8.0 or greater and npm.
- gcc --version 4.8.1 or greater.
- g++ -v 4.8.1 or greater.
- java -version x64 JRE 1.8
Additional Info
- Package: https://github.com/zapolnoch/node-tesseract-ocr
- Manual: https://github.com/tesseract-ocr/tesseract/blob/master/doc/tesseract.1.asc
- Configs: https://github.com/tesseract-ocr/tesseract/tree/master/tessdata/configs
