Thank you for pointing that out. I will have to choose between design-time and runtime lineage.
Original Message:
Sent: Tue December 19, 2023 09:30 AM
From: Udo Neumann
Subject: Searching documentation about DataStage export and DSX or ISX
You could analyse the lineage in Information Server if you have IGC licensed. You also have to differentiate between design-time and runtime lineage. The maintenance task could also be automated via scripts.
------------------------------
Udo Neumann
Original Message:
Sent: Tue December 19, 2023 09:04 AM
From: Mark Hickok
Subject: Searching documentation about DataStage export and DSX or ISX
You might also look at doing the export in XML format (as opposed to DSX), because it could then be read by Manta (the lineage company IBM purchased a couple of months ago), which would create the lineage for you.
But a better solution would be to look at modernizing the platform to Cloud Pak for Data and DataStage NextGen. That way the DataStage job would reside in a project, and you could schedule the lineage job via the onboard scheduler or even run things via a third-party scheduler. All of that could run inside the environment (including the lineage!).
------------------------------
Mark Hickok
Original Message:
Sent: Mon December 18, 2023 06:55 AM
From: Jérôme Mainaud
Subject: Searching documentation about DataStage export and DSX or ISX
Hi John,
Thank you for your idea. I will look at these reports.
My goal is to automatically import the lineage in a data catalog every night. Therefore I need to get a parseable result without requiring human action.
The report can be exported in CSV format, so it is at least somewhat parseable.
However I see two limitations for now:
- I see no automated way to trigger this.
- "The default maximum number of nodes that are displayed in a lineage report is 500. If more than this number of nodes is present in your lineage report, the report is truncated."
If I can find a solution for both of them, it could be cheaper.
I see this is a part of the InfoSphere Information Governance Catalog which offers a REST API. This API seems to be focused on assets, but maybe there is information about lineage.
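To illustrate the CSV route, here is a minimal Python sketch of what the nightly parsing step could look like. The column names `source` and `target` are hypothetical, since I have not yet checked the actual header of the exported report, and the truncation check simply compares the number of distinct nodes against the documented default limit of 500.

```python
import csv
import io

MAX_NODES = 500  # default display limit quoted from the documentation


def load_edges(csv_text, source_col="source", target_col="target"):
    """Return (edges, possibly_truncated) from lineage report CSV text.

    The column names are hypothetical placeholders; adjust them to the
    header of the real report.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    edges = [(row[source_col], row[target_col]) for row in reader]
    # Collect every distinct node that appears on either side of an edge.
    nodes = {node for edge in edges for node in edge}
    # Reaching the limit suggests the report may have been cut off.
    return edges, len(nodes) >= MAX_NODES


# Made-up sample data, only to show the expected shape of the input.
sample = (
    "source,target\n"
    "orders_db.orders,stg.orders\n"
    "stg.orders,dwh.fact_orders\n"
)
edges, truncated = load_edges(sample)
```

If `truncated` comes back true, the nightly import should probably fail loudly rather than load a partial graph into the catalog.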
------------------------------
Jérôme Mainaud
Original Message:
Sent: Mon December 18, 2023 04:33 AM
From: John McKeever
Subject: Searching documentation about DataStage export and DSX or ISX
Hi Jérôme,
Is it not possible for your client to run the data lineage reports themselves directly on their platform?
It'd certainly be quicker, easier, cheaper and more accurate than asking you to import their entire DataStage landscape into your environment and running the reports there.
John
------------------------------
John McKeever
Original Message:
Sent: Thu December 14, 2023 01:56 PM
From: Jérôme Mainaud
Subject: Searching documentation about DataStage export and DSX or ISX
Hello,
A client asked me to extract dataset lineage information from DataStage.
They sent me an export of their process as a DSX file and showed me how to export it from their desktop GUI application.
They use an on-premise DataStage server.
Now I'm looking for a way to automate this export: by calling an API if possible, otherwise by calling a non-interactive command-line client.
I'm also looking for documentation, or a specification, of the DSX file format.
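For context, this is roughly how I am reading the DSX file at the moment. It is only a sketch based on the layout I observe in the export (nested `BEGIN DSJOB` / `END DSJOB` blocks with an `Identifier "..."` line), not on any official specification, so the assumptions about the structure may well be wrong. The function name `list_jobs` is my own.

```python
import re

# Matches lines such as:   Identifier "LoadOrders"
IDENTIFIER = re.compile(r'^Identifier\s+"([^"]*)"')


def list_jobs(dsx_text):
    """Return the job names found in DSX text.

    Assumes each job is a BEGIN DSJOB block whose first Identifier line
    (before any nested BEGIN) carries the job name.
    """
    jobs = []
    expect_name = False
    for line in dsx_text.splitlines():
        stripped = line.strip()
        if stripped == "BEGIN DSJOB":
            expect_name = True
            continue
        if not expect_name:
            continue
        match = IDENTIFIER.match(stripped)
        if match:
            jobs.append(match.group(1))
            expect_name = False
        elif stripped.startswith("BEGIN ") or stripped == "END DSJOB":
            # A nested block or the end of the job came first: no name here.
            expect_name = False
    return jobs


# Hand-written fragment imitating the observed layout, not a real export.
sample = '''BEGIN HEADER
   CharacterSet "CP1252"
END HEADER
BEGIN DSJOB
   Identifier "LoadOrders"
   BEGIN DSRECORD
      Identifier "ROOT"
   END DSRECORD
END DSJOB
BEGIN DSJOB
   Identifier "LoadCustomers"
END DSJOB
'''
job_names = list_jobs(sample)
```

Extracting the actual lineage (stages, links, tables) would need a real description of the record types, which is why I am asking for a specification.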
During my first search, I've heard about an ISX export format that is said to be more recent. So I'm also looking for information about its format or any API to produce it. I've already found an istool export command.
However my client seemed reluctant to use this format for some reason.
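If the ISX route turns out to be acceptable, I imagine the automation could be a small wrapper like the sketch below. The option names (`-domain`, `-username`, `-password`, `-archive`, `-datastage`) are what I understood from the istool documentation and still need to be verified against the installed version; the host names, project name, paths and asset pattern are placeholders, and the executable may be `istool.sh` on Linux.

```python
import subprocess


def build_istool_export(domain, user, password, archive, asset_pattern,
                        executable="istool"):
    """Build the argument list for a non-interactive istool export.

    Option names are assumptions taken from my reading of the istool
    documentation; verify them before relying on this.
    """
    return [
        executable, "export",
        "-domain", domain,
        "-username", user,
        "-password", password,
        "-archive", archive,
        "-datastage", asset_pattern,
    ]


def run_export(cmd):
    """Run the export and raise if istool exits with a non-zero status."""
    subprocess.run(cmd, check=True)


# Placeholder values only; nothing is executed here.
cmd = build_istool_export(
    domain="services-host:9080",
    user="dsadm",
    password="secret",
    archive="/tmp/nightly.isx",
    asset_pattern="engine-host/MyProject/*/*.*",
)
# run_export(cmd)  # to be called on a machine where istool is installed
```

Passing the arguments as a list avoids shell quoting issues with the `*` wildcards, although the exact quoting istool expects for the asset pattern is something I still have to test.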
Any link to relevant documentation would be appreciated.
Thank you,
------------------------------
Jérôme Mainaud
------------------------------