OCR plugin configuration

Top  Previous  Next

The OCR plugin converts images of text into real characters that can be used to supply text to EscapeE in the same way that a database supplies values to fields. To configure the OCR plugin:

1.Sweep out the part of the page containing the image of the text and select New field... from the pop-up menu to define a field as usual.
2.Select OCR plugin from the list on the 'Advanced' page then click Configure. The "Include text when exporting?" dialogue pops up.
3.Choose either:
Yes to include the "recognized" text in the document. It will be hidden under the image so that it is available for use (e.g. searching) without showing the actual characters. Or
No to exclude recognized text from the exported document.
4.The "Leave blank if OCR failure?" dialogue pops up. Choose:
No to show "OCR failed" message whenever no text can be recognized, or
Yes when, for example, it is likely that there is no text to be found in the field area on some of the pages. In this case, the warning would be superfluous.
5.Click OK.


The OCR plugin makes use of Microsoft Office Document Imaging. If MODI is not installed on your system, you may simulate it by means of a further dialogue that pops up automatically:

6.In "Microsoft Office Document Imaging - not installed. Simulate OCR?" dialogue, choose Yes.

Choosing Yes exports dummy text where OCR text would occur in the document. Up to 5 lines are generated so as to fill the field area at 1/6" line spacing. For example, a field named ADDRESS might appear as:

Line 1 of field ADDRESS
Line 2 of field ADDRESS

Choosing No instead just causes the field to be blanked or to contain the error message according to the configuration set in step 4 above.

Choosing Cancel causes an error message box each time OCR is attempted.

See also

Examples using plugins