Batch Extract Text from PDF

Batch extract text from PDF allows you to extract text from multiple PDF documents. For each document the batch process will output a separate text file with the text contents of that document.

Note: If the document does not contain text (for example: scanned documents or images) it will need to be OCR'd first so that there is text to extract.

How to Extract Text from a Batch of PDFs

  1. On the menu bar go to Batch > Document > Extract Text from PDF
  2. Set the options for the batch process. Additional details for each of the settings are available below.
  3. Once all of the settings are complete, click on Start... to begin the batch process.  

Batch Extract Text Settings

File List

Add Files - Displays a file chooser to add individual files to the list.

Add Folder - Displays a file chooser that adds the contents of a directory to the list.

- Removes the selected file(s) from the list.

- Moves the selected file(s) up the list.

- Moves the selected file(s) down the list.

- Moves the selected file(s) to the top of the list.

- Moves the selected file(s) to the bottom of the list.

Set Default Batch Directory - When checked, all files from the default batch directory will be added to the File List each time a batch dialog is opened.

Save Files To

Destination Folder - This option allows you to set a destination folder to place all of the processed files. You can type the destination manually or click on the "..." button to open a directory chooser to set the destination folder

Overwrite Files - When checked, if a file with the same name as the file being processed is located in the destination folder it will be overwritten with the newly processed file. If this option is not checked, a number will be appended to the output file name, to avoid duplicate file names.

Passwords to try when opening documents

To set a password click in the password field or on the Edit button. Then enter the password you want to be used. Do this for up to four passwords to try on password protected PDFs during the batch process.

Note: The passwords entered here will only be used for this batch process and will not be stored anywhere else. Passwords will have to be entered for each new batch process.