PDF
Automation
PDF automation though UiPath means extracting text from PDF. This can be reading the entire PDF text or the required portion of text from the PDF.
UiPath uses
the following activities to perform PDF automation
For
Searchable PDF’s use “Read PDF text” activity
For
scanned PDF’s use “Read PDF with OCR”
activity
These activities if they are not visible by default in the activities panel, then goto manage packages and install PDF activities.
Read PDF text
- This activity extracts text from searchable PDF and takes PDF file path as input and returns PDF text as output.
- If the PDF is password protected provide the password in the properties for this action.
- If you want to read a specific range then mention the range in the Range property where "All" means it reads all pages, "1-2" means it reads 1 to 2 page, "2-4" means it reads 2 to 4 pages.
Read PDF with OCR
- This activity extracts text from Scanned PDF and takes PDF file path as input and returns PDF text as output.
- If the PDF is password protected provide the password in the properties for this action.
- If you want to read a specific range then mention the range in the Range property where "All" means it reads all pages, "1-2" means it reads 1 to 2 page, "2-4" means it reads 2 to 4 pages.
- Additionally to this activity you need to provide OCR engine with the help of which it will extract text from the scanned PDF.
The following are the list of OCR engines available in UiPath
- Microsoft Cloud OCR
- Google Cloud OCR
- Abbyy Cloud OCR
- Google OCR
- Microsoft OCR
how can convert data set to data table.I extracted the data by using intelligent ocr activities,variable is in data set form how can i write the data in single excel sheet .
ReplyDelete