ITNM ROOT CAUSE ANALYSIS DOWNSTREAM CORRELATION
IBM Tivoli Network Manager (ITNM) is in use around the world, providing IP network discovery and polling capabilities. Other features include downstream root-cause analysis (RCA) and event correlation based on its discovered topology. In Netcool/OMNIbus WebGUI, this is visualised in the event list via a Relationship. The Relationship links the root-cause event to the symptomatic events by writing the Serial
of the root-cause event to the NmosSerial
field of the symptom events. When the ITNM Relationship is selected in the WebGUI view, the parent-child relationships are visible, enabling an Operator to see problems in the context of the root-cause events, effectively suppressing the symptomatic ones. Operators can also opt to filter out Symptom events.
This blog post outlines a way to generically migrate correlation in Netcool to AIOps. The approach essentially identifies a common attribute among the members of the Netcool event grouping, then passes the common attribute to AIOps as a custom attribute, which is then grouped on using AIOps scope-based grouping.
This approach could be extended to ITNM RCA correlation also, by identifying a common attribute among a root-cause event and its children, then passing it to AIOps in a similar manner. This blog post outlines a suggested approach that can co-exist with any other correlations that are in use in the existing AIOps system.
A real advantage in AIOps over traditional Netcool is the ability for alerts to be a member of multiple groups at the same time, and all these correlations to co-exist and be visualised together. This was especially not the case with ITNM root-cause correlation which needed a separate view in WebGUI. With AIOps however, the ITNM correlations can be combined with the other correlation mechanisms and fully contribute to super-grouping.
ITNM PARENT CHILD RELATIONSHIP IN NETCOOL
In Netcool, the ITNM root-cause alert is linked to its corresponding symptomatic alerts. This is done by copying the Serial
field value of the root-cause event to the NmosSerial
field of each of the child events. WebGUI then offers this as a Relationship option in the Event viewer View configuration:
Applying this View to your Event Viewer reveals ITNM RCA correlation:
MAP THE CUSTOM CORRELATION FIELD
Just like in the other blog post, the first step to bringing a correlation over to AIOps is to identify the common attribute among the group members and set them to be a custom correlation attribute in AIOps. An AIOps scope-based grouping policy can then be created to cause groups to be formed based on this common attribute.
In the case of ITNM RCA correlation events, the common attribute tying the events together is the Serial
of the root-cause alert. The root-cause alert (NmosCauseType
= 1) holds this value in the Serial
field and the symptom events (NmosCauseType
= 2) hold the same value in the NmosSerial
field.
Hence we can map this value to our custom correlation field in the AIOps Netcool Connector mapping by using the following:
In this example, we are defining a sub-attribute of details
called itnmCorrelation
based on the value of NmosCauseType
:
For the latter case, we would simply add the condition that the policy should fire only if our ITNM correlation attribute changes:
For the condition sets, our criteria is simply when the contents of our ITNM correlation attribute is not empty, then use its value to group on:
Finally, set a time window for the correlation and choose a type:
Scroll back up to the top of the Policy definition window, and check your settings, then click Save to save your new Policy.
HIGHLIGHT THE ROOT-CAUSE ALERTS
You should then see the ITNM RCA correlations being replicated in AIOps, with the root-cause alert highlighted:
Note that the root-cause alert is correlated together with the symptoms in AIOps since AIOps does not support real alerts being a parent event in the view. It is however highlighted as an ITNM root-cause alert, via its Summary field.
PRIME THE PROBABLE CAUSE ANALYSIS ENGINE
A final optional step is to prime the AIOps probable-cause analysis engine to increase the probable-cause score for ITNM root-cause alerts.
Care should be taken if customising the probable-cause engine, since ITNM RCA correlation is not the only type of correlation that AIOps leverages to correlate alerts together and hence there may be other alerts with a higher probable-cause score due to the keywords contained in their Summary
fields, for example. It would be reasonable however to give ITNM root-cause event probable cause scores a boost, since a correlated network root-cause event is highly likely to be the probable cause of any ongoing incident.
To prime the probable-cause analysis engine for the appearance of ITNM root-cause alerts, do the following:
EXAMPLE:
Log into your cluster:
Run the following to set up your environment parameters:
Run the following curl command to extract the current word list: