Content Management and Capture

Expand all | Collapse all

PDF to PDF conversion bypassing tiff conversion

  • 1.  PDF to PDF conversion bypassing tiff conversion

    Posted Thu October 08, 2020 10:10 PM
    Hello all,
    We would like to convert our PDF documents into Pdf again before starting the OCR process.  Is there a Datacap action which achieves the same purpose as "printoPDF" or "SaveAs->PDF" functionality? The input and output will both be PDF. I am trying to avoid writing custom actions using third party libraries, an existing action within Datacap would be much easier. Thanks.

    ------------------------------
    Kteegala
    ------------------------------


  • 2.  RE: PDF to PDF conversion bypassing tiff conversion

    Posted Fri October 09, 2020 04:41 PM
    Would you mind explaining the purpose behind this? I'm a little confused as to why you might want this.

    ------------------------------
    Scott Sumner-Moore
    ------------------------------



  • 3.  RE: PDF to PDF conversion bypassing tiff conversion

    Posted Fri October 09, 2020 05:46 PM
    Hi Scott,
    We have several PDF invoices which are created by different PDF softwares which seem incompatible with Datacap. Some have embedded images, content copying disabled, or have high resolution mobile images converted as PDF documents. 

    Some of these get aborted during Tiff conversion, or produce large tiffs which result in long running batches later during page identification or extraction. All of these issues are resolved when we manually recreate PDFs (using Save As->PDF or Print PDF) and reprocess the batch. 
    We can probably address each issue individually, but it would be easier to just recreate the PDF using a Datacap action and address all issues in one go. 


    Thanks

    ------------------------------
    Kteegala
    ------------------------------