Thank you Gerold. You made a very good point. As a matter of fact, i do not know i need to decode even after successful decompression. This makes things even harder, doesn't it?
I did try de-compiling Daeja Viewer code to search for a clue but without success. There is a zip package using GZip but i tried Gzip on the compressed bytes and i got an error message: not a GZip. I know I am asking too much for help, so my guess is as you said, it is proprietary compression no one can get to.
I am thinking of using ISTK to write some C code but I cannot find any example of using ISTK. Everything becomes a searching needle in a haystack.
I may have to go back to using IDM desktop even if it requires IS connection, but it will be a hell slow process, unfortunately.
if you can think of anything i should try, i will be really appreciative for your guidance.
------------------------------
Kevin Sheng
------------------------------
Original Message:
Sent: Wed May 11, 2022 04:42 AM
From: Gerold Krommer
Subject: COLD document compression
Hi,
I can save you a lot of time...the compression is proprietary and non of the standard tools will work (as you noticed). Only 3 tools can decompress COLD pages:
1. IDM Desktop (or one of its libraries)
2. Image Services Toolkit
3. Daeja Viewer
We converted > 100 Million COLD documents to PDF and we downloaded the pages using several IDM Desktops in parallel. You could try to decompile the Daeja Applet, but IBM frowns this endeavor as this violates the licensing agreement.
BUT: When you have the decompressed page, they will be in P-code (specs date back to 1987, the last millenium!). What are you going to to with the pcode encoded pages?
Hope this helps,
/Gerold
------------------------------
Gerold Krommer
Original Message:
Sent: Mon May 09, 2022 04:17 PM
From: Kevin Sheng
Subject: COLD document compression
I have an image services system which has some COLD documents with Enable Page Compression checked on. I want to know how FileNet compresses COLD document content. I tried zlib and gzip and not able to decompress the content. This must be a very difficult question to answer and i do not know if there is anyone out there in the world who can shed any lights on it.
------------------------------
Kevin Sheng
------------------------------