Reading the pdf which is having tabular data in ITX

  • 1.  Reading the pdf which is having tabular data in ITX

    Posted Sat October 03, 2020 10:02 AM
    Hi, has anyone worked on a scenario that converts a PDF containing tabular data to CSV in ITX? When I tried the PDF adapter it did not work; according to the Infocenter, it supports only AcroForm or template PDF files.


    ------------------------------
    sunil kumar chellu
    WTX DEVELOPER
    AIDIN
    vizag
    ------------------------------

    #B2BIntegration
    #SupplyChain


  • 2.  RE: Reading the pdf which is having tabular data in ITX

    Posted Mon October 05, 2020 02:53 AM
    Hi Sunil,

    Please check these blog posts to learn more about reading PDF files.

    https://community.ibm.com/community/user/supplychain/communities/community-home/digestviewer/viewthread?MessageKey=e1cdba79-36ff-4b93-94a7-30077a9c81e9&CommunityKey=de289f68-d664-45ae-b718-baf57628fc8c&tab=digestviewer#bme1cdba79-36ff-4b93-94a7-30077a9c81e9

    https://community.ibm.com/community/user/supplychain/communities/community-home/digestviewer/viewthread?MessageKey=aa52978a-1e91-464a-949d-4d8428fc9dc6&CommunityKey=de289f68-d664-45ae-b718-baf57628fc8c&tab=digestviewer#bmaa52978a-1e91-464a-949d-4d8428fc9dc6

    Thanks


    ------------------------------
    Manoj Bansal
    ------------------------------



  • 3.  RE: Reading the pdf which is having tabular data in ITX

    Posted Mon October 05, 2020 02:02 PM
    Hi Manoj, thank you for sharing the links. The PDF I am working with is not an AcroForm, though; its data is laid out in columns, which need to be captured.

    ------------------------------
    sunil kumar chellu
    WTX DEVELOPER
    AIDIN
    vizag
    ------------------------------



  • 4.  RE: Reading the pdf which is having tabular data in ITX

    IBM Champion
    Posted Tue October 20, 2020 04:13 AM
    When you read the data as a text blob, are there any identifiable characters between the column data, so that you can split it out?

    ------------------------------
    Paul Brett
     
    IBM Sterling Transformation Extender (ITX) Client Support
    ------------------------------



  • 5.  RE: Reading the pdf which is having tabular data in ITX

    Posted Mon October 26, 2020 12:14 PM
    Edited by System Wed March 22, 2023 11:53 AM
    Hi Paul, after reading the data, only spaces appear between the columns, but the data within each column also contains spaces.

    I have tried the PDF adapter command below:

    -U inputfile.pdf -PAGE 1-2 -T 

    I am able to capture the data, but if the data in a column wraps onto two lines, the second line is captured as a separate row.

    Ex:
     Column1 Data       Column2 Data    
      Ac1000                   Account                
                                      123456
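
One possible way to fold such a wrapped line back into its row, outside ITX, is sketched below in Python. It is only an illustration under assumptions that are not confirmed in this thread: cells are separated by runs of two or more spaces, and a deeply indented line is the wrapped remainder of the previous row's last cell. The function name and the `indent_threshold` value are hypothetical.

```python
import re

def merge_wrapped_rows(lines, indent_threshold=10):
    """Fold continuation lines into the preceding row.

    Assumptions (not from the thread): cells are separated by runs of two
    or more spaces, and a line indented by at least `indent_threshold`
    spaces continues the last cell of the previous row.
    """
    rows = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        indent = len(line) - len(line.lstrip(" "))
        cells = re.split(r" {2,}", line.strip())
        if rows and indent >= indent_threshold:
            # Continuation: append the text to the previous row's last cell.
            rows[-1][-1] += " " + " ".join(cells)
        else:
            rows.append(cells)
    return rows

# The example from this post, copied with its original spacing.
sample = [
    " Column1 Data       Column2 Data    ",
    "  Ac1000                   Account                ",
    "                                  123456",
]
for row in merge_wrapped_rows(sample):
    print(row)
```

The heuristic breaks down if a wrapped value belongs to a column other than the last one, or if a cell itself contains two consecutive spaces.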

    ------------------------------
    sunil kumar chellu
    WTX DEVELOPER
    ------------------------------



  • 6.  RE: Reading the pdf which is having tabular data in ITX

    IBM Champion
    Posted Thu October 29, 2020 04:17 AM
    With row data spread across multiple lines, I don't see how you can use a 'normal' typetree.
    You would need a special typetree to read in the values, doing a split at a specific number of entries, inserting a line break, to make the data readable by a more traditional typetree.

    This would only work if you had a set number of columns, and EVERY column was populated in EVERY row.
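
The regrouping idea can be illustrated outside ITX with a short Python sketch. It assumes the cell values have already been read one per entry into a flat list; the function name and the sample values are made up for illustration and do not come from the attached PDF.

```python
def regroup(values, columns):
    """Chunk a flat list of cell values into rows of `columns` cells each."""
    if columns <= 0:
        raise ValueError("columns must be positive")
    if len(values) % columns != 0:
        # An unpopulated cell anywhere shifts every later value, so refuse
        # to guess when the count does not divide evenly.
        raise ValueError("value count is not a multiple of the column count")
    return [values[i:i + columns] for i in range(0, len(values), columns)]

# Hypothetical values for a two-column table.
values = ["Ac1000", "Account 123456", "Ac2000", "Account 654321"]
for row in regroup(values, 2):
    print(",".join(row))
```

The length check catches only some cases of missing cells; a row with an empty cell that happens to leave the total divisible by the column count would still be regrouped incorrectly, which is why every column must be populated in every row.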

    ------------------------------
    Paul Brett
     
    IBM Sterling Transformation Extender (ITX) Client Support
    ------------------------------



  • 7.  RE: Reading the pdf which is having tabular data in ITX

    IBM Champion
    Posted Thu October 29, 2020 04:19 AM
    Ignore the fact that the data is in a PDF for now.  Can you even design a Typetree that can read the data if it was in a .TXT file?

    ------------------------------
    Paul Brett
     
    IBM Sterling Transformation Extender (ITX) Client Support
    ------------------------------



  • 8.  RE: Reading the pdf which is having tabular data in ITX

    Posted Wed November 11, 2020 05:52 AM
    Edited by System Wed March 22, 2023 11:48 AM
    Hi Paul, please find attached the test PDF file for your reference.

    ------------------------------
    sunil kumar chellu
    WTX DEVELOPER

    ------------------------------

    Attachment(s)

    pdf
    Test Report-converted.pdf   109 KB 1 version


  • 9.  RE: Reading the pdf which is having tabular data in ITX

    IBM Champion
    Posted Wed November 11, 2020 12:39 PM
    Perhaps the attached 2 maps (Stage1 and Stage2) will give some clues on how to get to your solution.

    Output:

    100 Nov 01,2020 03:59:51 PM ET sunil001@gmail.com Users Grp – Pay
    200 Nov 02,2020 04:23:33 PM ET sunil002@gmail.com Users Grp – Pay
    300 Nov 03,2020 06:50:16 PM ET sunil003@gmail.com Users Grp – Pay
    400 Nov 04,2020 02:04:15 PM ET Sunil004@gmail.com Users Grp – Pay
    500 Nov 01,2020 03:59:51 PM ET sunil005@gmail.com Users Grp – Pay
    600 Nov 02,2020 04:23:33 PM ET sunil006@gmail.com Users Grp – Pay
    700 Nov 03,2020 06:50:16 PM ET sunil007@gmail.com Users Grp – Pay
    800 Nov 04,2020 02:04:15 PM ET Sunil008@gmail.com Users Grp – Pay
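
Since the original question asked for CSV, lines like these could be split into fields with a small script outside ITX. The Python sketch below is not what the attached maps do; the column boundaries (record number, date, time with zone, e-mail, remaining text) are guessed from the sample output, and all names are hypothetical.

```python
import csv
import io
import re

# Column meanings are assumed from the sample output; the thread does not name them.
LINE_RE = re.compile(
    r"^(\d+) "                          # record number
    r"(\w{3} \d{2},\d{4}) "             # date, e.g. Nov 01,2020
    r"(\d{2}:\d{2}:\d{2} [AP]M \w+) "   # time with AM/PM and zone
    r"(\S+@\S+) "                       # e-mail address
    r"(.+)$"                            # remaining text, kept as one field
)

def line_to_fields(line):
    """Split one output line into its assumed fields."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised line: {line!r}")
    return list(match.groups())

def lines_to_csv(lines):
    """Render the lines as CSV text; fields containing commas are quoted."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for line in lines:
        writer.writerow(line_to_fields(line))
    return buffer.getvalue()

sample = [
    "100 Nov 01,2020 03:59:51 PM ET sunil001@gmail.com Users Grp – Pay",
    "200 Nov 02,2020 04:23:33 PM ET sunil002@gmail.com Users Grp – Pay",
]
print(lines_to_csv(sample), end="")
```

The date field contains a comma, so the `csv` module wraps it in quotes rather than splitting it into two columns.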

    ------------------------------
    Paul Brett
     
    IBM Sterling Transformation Extender (ITX) Client Support
    ------------------------------

    Attachment(s)

    zip
    ReadPDF.zip   1 KB 1 version