ccxc-ocr

CCxC ocr to scan different types of documents

Usage no npm install needed!

<script type="module">
  import ccxcOcr from 'https://cdn.skypack.dev/ccxc-ocr';
</script>

README

CCxC OCR

Library to scan documents such as invoices, state of results, bank movements, credit card histories, etc.

Usage

const {scanInvoice} = require('ccxc-ocr');

const {
    dates, 
    names, 
    nits, 
    results
    totals, 
} = await scanInvoice('PATH_OF_THE_FILE', options);

Params

  • PATH_OF_THE_FILE: path to the file, can be a relative path or an url

Supported extensions: jpg, jpeg, png, pdf.

Troubleshooting

Help

  • tesseract --help
  • tesseract --help-extra

Example commands

tesseract IMAGE_NAME FILENAME OUTPUT_FORMAT_TYPES
  • tesseract sample2.jpg out hocr
  • tesseract sample2.jpg out pdf

Dependencies:

  • python 2.7
  • node -v x64 8.0 or greater and npm.
  • gcc --version 4.8.1 or greater.
  • g++ -v 4.8.1 or greater.
  • java -version x64 JRE 1.8

Additional Info