Content Management and Capture

  • 1.  COLD document compression

    Posted Tue May 10, 2022 11:00 AM
    I have an Image Services system with some COLD documents that have Enable Page Compression checked. I want to know how FileNet compresses COLD document content. I tried zlib and gzip and was not able to decompress the content. This must be a very difficult question to answer, and I do not know if there is anyone out there in the world who can shed any light on it.

    ------------------------------
    Kevin Sheng
    ------------------------------


  • 2.  RE: COLD document compression

    IBM Champion
    Posted Wed May 11, 2022 04:42 AM
    Hi,

    I can save you a lot of time... the compression is proprietary and none of the standard tools will work (as you noticed). Only 3 tools can decompress COLD pages:

    1. IDM Desktop (or one of its libraries)
    2. Image Services Toolkit
    3. Daeja Viewer

    We converted > 100 million COLD documents to PDF, and we downloaded the pages using several IDM Desktops in parallel. You could try to decompile the Daeja applet, but IBM frowns on this endeavor as it violates the licensing agreement.
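One way to picture the "several IDM Desktops in parallel" approach is to hand each workstation its own slice of the document IDs. The sketch below is only an illustration under assumptions: it assumes the documents to export form one contiguous numeric ID range, and `split_range` is a hypothetical helper, not part of IDM Desktop or ISTK. It does not call any FileNet API.

```python
def split_range(first_id: int, last_id: int, workers: int) -> list[tuple[int, int]]:
    """Split the inclusive ID range [first_id, last_id] into at most
    `workers` contiguous, non-overlapping chunks of near-equal size.

    Assumption: document IDs are contiguous integers. Each returned
    (start, end) pair would be given to one export workstation.
    """
    if workers < 1:
        raise ValueError("workers must be at least 1")
    if last_id < first_id:
        return []

    total = last_id - first_id + 1
    base, extra = divmod(total, workers)

    chunks = []
    start = first_id
    for i in range(workers):
        # The first `extra` workers take one additional ID each.
        size = base + (1 if i < extra else 0)
        if size == 0:
            continue  # more workers than documents
        chunks.append((start, start + size - 1))
        start += size
    return chunks


if __name__ == "__main__":
    for chunk in split_range(1, 10, 3):
        print(chunk)
```

Each workstation would then loop over its own chunk with whatever export mechanism it uses, so no two machines request the same document.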

    BUT: When you have the decompressed pages, they will be in p-code (specs date back to 1987, the last millennium!). What are you going to do with the p-code encoded pages?

    Hope this helps,

    /Gerold

    ------------------------------
    Gerold Krommer
    ------------------------------



  • 3.  RE: COLD document compression

    Posted Wed May 11, 2022 04:41 PM
    Thank you, Gerold. You made a very good point. As a matter of fact, I did not know I would need to decode the pages even after successful decompression. This makes things even harder, doesn't it?

    I did try decompiling the Daeja Viewer code to search for a clue, but without success. There is a zip package using GZip, but when I tried GZip on the compressed bytes I got an error message: not a GZip. I know I am asking for a lot of help, so my guess is, as you said, that it is proprietary compression no one can get to.

    I am thinking of using ISTK to write some C code, but I cannot find any example of using ISTK. Everything becomes searching for a needle in a haystack.

    I may have to go back to using IDM Desktop even though it requires an IS connection, but it will be a hellishly slow process, unfortunately.

    If you can think of anything I should try, I would really appreciate your guidance.
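For anyone wanting to repeat the "is it gzip or zlib?" check quickly, a minimal Python sketch follows. It only tests the standard formats: the gzip magic bytes (0x1f 0x8b), the zlib header rule from RFC 1950, and a headerless raw deflate stream. The function names `sniff_compression` and `try_standard` are made up for this example, and the sketch assumes the compressed page bytes have already been extracted into a buffer. It says nothing about the proprietary COLD format itself; it only confirms that a buffer is not one of the standard ones.

```python
import gzip
import zlib


def sniff_compression(data: bytes) -> str:
    """Guess the container format from the first two bytes only."""
    if len(data) < 2:
        return "unknown"
    # gzip members start with the magic bytes 0x1f 0x8b.
    if data[0] == 0x1F and data[1] == 0x8B:
        return "gzip"
    # zlib (RFC 1950): compression method 8 in the low nibble, window
    # size code <= 7 in the high nibble, and the 16-bit big-endian
    # header value must be a multiple of 31.
    cmf, flg = data[0], data[1]
    if (cmf & 0x0F) == 8 and (cmf >> 4) <= 7 and ((cmf << 8) | flg) % 31 == 0:
        return "zlib"
    return "unknown"


def try_standard(data: bytes):
    """Try gzip, zlib, and raw deflate in turn.

    Returns (format_name, decompressed_bytes) for the first decoder
    that succeeds, or None if all of them reject the data.
    """
    attempts = (
        ("gzip", lambda d: gzip.decompress(d)),
        ("zlib", lambda d: zlib.decompress(d)),
        ("raw-deflate", lambda d: zlib.decompress(d, -15)),
    )
    for name, decode in attempts:
        try:
            return name, decode(data)
        except (OSError, EOFError, zlib.error):
            continue
    return None


if __name__ == "__main__":
    sample = zlib.compress(b"hello")
    print(sniff_compression(sample), try_standard(sample))
```

If both functions come back with "unknown" and `None` for a COLD page, that is consistent with the compression being something other than gzip, zlib, or raw deflate.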


    ------------------------------
    Kevin Sheng
    ------------------------------



  • 4.  RE: COLD document compression

    IBM Champion
    Posted Wed May 11, 2022 05:06 PM
    Kevin,

    It would probably help if you told us your requirements :-)... it makes advising much more accurate.

    When you install the ISTK there will be a bunch of C samples in a subdirectory, but who wants to do the ** pointer-to-pointer C magic nowadays... brrr!

    I have written some programs that actually PRODUCED p-code documents for IS (using ISTK, by the way), so it could well be that I'm the only living veteran who speaks p-code (hurry up, I'm 59). We converted our last customer from IS to P8 in 01/21... (the one with the > 100 million COLD documents)

    Feel free to ask more,

    /Gerold

    ------------------------------
    Gerold Krommer
    ------------------------------