Cloud Pak for Data

 View Only
Expand all | Collapse all

Trying to understand any relationship between quality data rules and data classes

  • 1.  Trying to understand any relationship between quality data rules and data classes

    Posted Fri July 16, 2021 05:13 PM

    I've got CP 3.5.2 and have been able to add a few tables, analyze columns, and assign Data Classes.  What I'm trying to understand is how data rules related (if they do) to data classes.  For example if I've got a table individual_rtab which has a DOB (date of birth) column which is a SQL date type and I assign the built in "Date or Birth" data class to it, what actual data quality checks get applied to it just from that?  The edit page says its backed by the com.ibm.infosphere.classification.impl.DOBClassifier Java class, but as far as I can tell there's no other documentation on what that class actually does.  At that point I can run a data quality analysis on the table and I do see some OB violations for unreasonable values.

    But then if I go into rules -> add rule -> create a data rule -> published rules -> 01 personal identity -> date birth I see two rules:

    DobReasonableRangeNumeric and DOBReasonableRangeString

    So can someone explain whether Data Classes imply Data/Quality Rules, or are they independent things assigned separately?



    ------------------------------
    Steve Prior
    ------------------------------

    #CloudPakforDataGroup


  • 2.  RE: Trying to understand any relationship between quality data rules and data classes

    Posted Fri July 30, 2021 02:50 PM
    Me too!!

    ------------------------------
    Edgar F Delgado Z
    ------------------------------