
Extract Text From PDF With OCR Action

Description:

This Action extracts text from a PDF file using OCR.

Properties:

General Tab's Properties:

OCR Engine:

Enter the OCR Engine instance you want to work with, or choose it from the drop-down menu.

PDF File:

Enter or choose the PDF file whose text will be extracted. This can be a file path, or a variable containing either a file or a text path.

Page(s) To Extract:

Choose which pages will have their text extracted using OCR: All, Single, or Range.

Single Page Number:

Set the number of the page you want to extract text from using OCR.

The following two options are available only if you chose Range for the "Page(s) To Extract" property:

From Page Number:

Set the first page number of the range from which text will be extracted using OCR.

To Page Number:

Set the last page number of the range from which text will be extracted using OCR.
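
As an illustration only, the three "Page(s) To Extract" choices can be thought of as producing a list of page numbers to process. The Python sketch below is not the Action's actual implementation; the function name resolve_pages and its parameters are hypothetical, and it assumes page numbers start at 1 and that the Range bounds are inclusive.

```python
def resolve_pages(mode, total_pages, single=None, first=None, last=None):
    """Return the 1-based page numbers selected by a page mode.

    Hypothetical helper: assumes 1-based numbering and an inclusive range.
    """
    if mode == "All":
        return list(range(1, total_pages + 1))
    if mode == "Single":
        # "Single Page Number" must point at an existing page.
        if single is None or not 1 <= single <= total_pages:
            raise ValueError("Single Page Number is missing or out of bounds")
        return [single]
    if mode == "Range":
        # "From Page Number" must not exceed "To Page Number".
        if first is None or last is None or not 1 <= first <= last <= total_pages:
            raise ValueError("From/To Page Number is missing or out of bounds")
        return list(range(first, last + 1))
    raise ValueError(f"Unknown page mode: {mode!r}")


if __name__ == "__main__":
    print(resolve_pages("All", 3))
    print(resolve_pages("Single", 5, single=2))
    print(resolve_pages("Range", 10, first=3, last=5))
```

Under these assumptions, a Range from 3 to 5 selects pages 3, 4 and 5, and a From value larger than the To value is rejected.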

Store Extracted Text into:

Enter the name of the variable that will store the text extracted using OCR.

Advanced Tab's Properties:

Use Password:

Choose whether the PDF file you are working with is password protected.

Enter PDF Password:

Choose how to supply the password: 'directly' or 'as variable'. If you choose 'directly', the password you type in the Password field is hidden. If you choose 'as variable', you must enter a variable containing the password; in this mode the '%' character is treated as a variable indicator, not as part of the password.

Password (Directly):

Enter the password here. The password will be hidden.

Password (as Variable):

Enter a variable containing the password here.
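
The difference between the two password modes can be sketched as follows. This is a conceptual illustration rather than the product's code: the function resolve_password, the variables dictionary, and the %Name% notation for referencing a variable are all assumptions made for the example.

```python
def resolve_password(mode, value, variables):
    """Return the password to use for the PDF.

    Hypothetical helper: 'variables' maps variable names to their values,
    and a variable is assumed to be written as %Name%.
    """
    if mode == "directly":
        # The text is used literally, so '%' stays part of the password.
        return value
    if mode == "as variable":
        # '%' marks a variable reference; strip it and look the name up.
        name = value.strip().strip("%")
        if name not in variables:
            raise KeyError(f"Variable {name!r} is not defined")
        return variables[name]
    raise ValueError(f"Unknown password mode: {mode!r}")


if __name__ == "__main__":
    store = {"PdfPassword": "s3cret"}
    print(resolve_password("directly", "50%off", store))
    print(resolve_password("as variable", "%PdfPassword%", store))
```

In this sketch a password that itself contains '%' can only be supplied literally in the 'directly' mode, which matches the distinction described above.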