Robotic Process Automation (RPA)

 View Only
  • 1.  Compare excel data with pdf

    Posted Wed August 10, 2022 09:25 AM
    Hello Members,

    Greetings..

    I am new to IBM RPA and learning RPA. I have a scenario that i need to compare data between excel sheet and pdf document. Report need to be generated for comparison results. Is it possible IBM RPA? Appreciate members guidance on this. Thanks in advance.


    ------------------------------
    Vivek
    ------------------------------


  • 2.  RE: Compare excel data with pdf

    IBM Champion
    Posted Wed August 10, 2022 09:31 AM
    Good morning Vivek,

    To be able to compare a pdf and data within an excel sheet, you will need to convert your pdf to a format that is comparable with the excel data. Please look into the attached link towards utilizing the built-in ocr tool to extract text from the pdf.
    https://www.ibm.com/docs/en/rpa/21.0?topic=ocr-using-pdf-text-extractor

    Then from the excel side, you can grab specific values to compare using the office get text, or any other way you feel comfortable grabbing the value.

    From there, you can look to compare specific fields from the pdf with those within excel. Hope this helps!


  • 3.  RE: Compare excel data with pdf

    Posted Thu August 11, 2022 05:39 AM
    Thanks Martin for quick response. 

    I have tried as you mentioned and could read pdf value using OCR with pdf text extractor. But the problem i am facing is, i want to read all row and column data of table in pdf. For ex: read all data from monthly sales report and compare with buyers data.

    Any idea whether it is possible in IBM RPA as i could see it is possible in UiPath?

    Thank you.


    ------------------------------
    Vivek
    ------------------------------



  • 4.  RE: Compare excel data with pdf

    Posted Thu August 11, 2022 09:15 AM
    When you read an PDF, it will return the content as text. In this case, you can try to use some commands, like the Split Text to split the text and store each data in a collection, so you can read this collection and compare with the data from the Excel file.

    Check these learning paths, I think they can help you:

    • Data table and Office pack with IBM Robotic Process Automation
    • Excel basics with IBM Robotic Process Automation
    • IBM Robotic Process Automation - Basic II (it has a PDF module)


    ------------------------------
    Giulianele Alves Barbosa
    ------------------------------