Robotic Process Automation (RPA)

 View Only
Expand all | Collapse all

Get PDF Text by OCR output option "success" field

  • 1.  Get PDF Text by OCR output option "success" field

    Posted Sat November 20, 2021 04:56 AM
    Hi,

    Kindly advise how the RPA determine the operation extractPdfText is success is True base on what counter value "confidence level" etc. I am referring to below link.

    Can we manually set such certain fields extraction need to achieve over 90% confidence level only return success True?

    Thanks.

    ------------------------------
    Sonny Siah
    Infra365 Sdn Bhd
    ------------------------------


  • 2.  RE: Get PDF Text by OCR output option "success" field

    Posted Fri December 17, 2021 12:54 PM
    Edited by Joba Diniz Fri December 17, 2021 12:55 PM
    The confidence level comes from the underlying provider, which can be Abbyy, Google Cloud Vision, or Tesseract, depending on which provider you chose.
    The success is true when the command does not fail. Success will be false when the command fails, for example, when connecting to Google Vision there's no internet, or when connecting to Abbyy there's an error Abbyy returns, that is, it fails if the underlying provider fails to respond. Success does not have a relation to the extracted OCR content.

    "Can we manually set such certain fields extraction need to achieve over 90% confidence level only return success True?"

    The command does not provide this behavior, but you can achieve it using conditions (if) right after the command is executed.

    ------------------------------
    Joba Diniz
    Product Manager
    IBM
    ------------------------------