OCR Node

The OCR node adds recognized text to scanned documents or images so that they can be searched or marked up like any other text document.

What is OCR?

Optical character recognition (OCR) is the mechanical or electronic conversion of images of typed or printed text into machine-encoded searchable text data.

 

General Settings

Language - Select the language used to OCR the documents.

DPI Resolution - The resolution used when performing OCR. In our testing, 300 dpi produces good OCR results for most images. For scans that contain noise, try a lower dpi setting to reduce the noise and obtain better OCR results.

Discard Invisible Text - Removes any previous OCR text that has been added to the page.

Auto Deskew Images - When checked, if the document’s text or images are slanted too far in one direction or are misaligned, PDF Automation Server (PAS) will attempt to rotate the document automatically to correct the alignment.

Rotate Pages - Automatically rotates pages that were scanned upside down.
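
To get a feel for what the DPI Resolution setting changes, the short Python sketch below computes how many pixels a page occupies at a few resolutions. It is only an illustration of the arithmetic and is not part of PAS; the helper name pixel_size and the US Letter page size are assumptions chosen for the example.

```python
def pixel_size(width_in: float, height_in: float, dpi: int) -> tuple[int, int]:
    """Return the (width, height) in pixels of a page rasterized at `dpi`."""
    # Pixels along each side = physical size in inches * dots per inch
    return round(width_in * dpi), round(height_in * dpi)


# Hypothetical example: a US Letter page (8.5 x 11 inches)
for dpi in (150, 200, 300):
    w, h = pixel_size(8.5, 11, dpi)
    print(f"{dpi} dpi -> {w} x {h} px ({w * h / 1e6:.1f} megapixels)")
```

Halving the dpi cuts the pixel count to a quarter, so a lower setting gives the OCR engine a coarser image to work with. That may help on noisy scans, as noted above, but it also leaves fewer pixels per character, so results are worth checking on a sample document.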

 


Qoppa Software's PDF Automation Server for Windows, Linux, Unix, and macOS

Automate PDF Document Workflows through RESTful Web Services & Folder Watching

Copyright © 2002-Present Qoppa Software. All rights reserved.