AIX

AIX

Connect with fellow AIX users and experts to gain knowledge, share insights, and solve problems.


#Power
#Power
 View Only
  • 1.  NIM thread blocked across multiple clusters at exactly the same time

    Posted Sun April 15, 2007 07:50 PM

    Originally posted by: SystemAdmin


    I noticed the other day that there were some NIM thread blocked errors in errpt on one of our servers. Curious to see whether this was specific to this machine I ran the following command from our csm master:

    dsh "errpt | grep "NIM thread blocked"

    The result I got back was that at exactly the same time all servers reported this error. Looking at the detail of the errpt entry it says that the affected item is the disk heartbeat device which sits on the SAN.

    My question is this: Is it as I suspect that there must have been some issue on the SAN that caused this, perhaps some zoning going on or something similar. Could I have reduced the possibility of this error by alerting the syncd setting?

    AIX 5.3 ML2 HACMP 5.2

    Thanks in advance.

    #AIX-Forum


  • 2.  Re: NIM thread blocked across multiple clusters at exactly the same time

    Posted Mon April 16, 2007 04:52 AM

    Originally posted by: steevojb


    Hi there,

    I had a similar issue which was occuring during some flashcopy processing. The resolution was a patch, can't remember whether it was an hacmp or rsct update though.

    HTH

    Steve
    #AIX-Forum


  • 3.  Re: NIM thread blocked across multiple clusters at exactly the same time

    Posted Mon April 16, 2007 05:37 PM

    Originally posted by: SystemAdmin


    Thanks Steve, I'll take a look and see what fixes are available.

    I think I'll also approach this from the other end and see if there's anything on an EMC Web page/forum that relates to this issue too.

    It seems reasonable to me that our SAN team may be able to take a look at any logs that the Symmetrix has to list what was happening on the device at that time. I'll ask them too.
    #AIX-Forum


  • 4.  Re: NIM thread blocked across multiple clusters at exactly the same time

    Posted Sun April 13, 2008 11:39 PM

    Originally posted by: SystemAdmin


    I have seen a similar issue with three clusters, however they are on two different SAN devices and fabrics. In our case it appears to be related to small changes in time due to xntpd corrections... see APARs IZ02759 and IZ03716.
    #AIX-Forum


  • 5.  Re: NIM thread blocked across multiple clusters at exactly the same time

    Posted Wed April 16, 2008 02:29 AM

    Originally posted by: vinodn


    Hi ,

    I faced the similar issue in my clustered nodes, and the storage logs(ds4700) shows some warning messages at that time... As u stated before, this error normaly occurs when the disk heardbeat got disconnected for sometime. So normaly have to start checking with SAN/Storage people .

    Thanks
    #AIX-Forum