I am happy to announce that Watson Knowledge Catalog now provides the ability for users to gain insights on their business and technical metadata through an external database for reporting. Using BI tools of their choice, they can connect to the WKC reporting database and run queries and create reports to answer questions such as:
- Which are the assets that are missing business terms?
- How many assets are in the catalog by type?
- What is the trend of data quality score of a data asset over time?
- Are there terms or other governance artifacts missing descriptions?
This feature is currently available only in Watson Knowledge Catalog in IBM Cloud (Saas) with the Professional and Enterprise plans. It is targeted to be available in Cloud Pak for Data in early 2022.
Here are some key things to know:
- The database to store WKC reporting data will be provided and managed by the customer.
- The supported source systems for the WKC reporting database are Db2 or Db2 on Cloud.
- Scope of data available for reporting in this initial phase are: published governance artifacts and categories, project assets, and catalog assets including their published data quality scores and term assignments.
- An administrator can configure which catalogs, projects and top-level categories to be included in reporting.
- Once reporting has been configured and started, metadata from WKC will start flowing to the WKC reporting database automatically.
- Users will connect to the WKC reporting database through their own BI tools.
- WKC data stored in the reporting database will have temporal awareness to enable users to create time-based queries to understand trends.
To learn more, check out this external blog
and our documentation
This is just the beginning. In future phases, we will be adding more data to be supported for reporting, including data curation details, data quality analysis results, data quality exception reporting and workflows. We also plan to have more database choices to use as the database for WKC reporting.