Automation with Power

Power Business Continuity and Automation

Connect, learn, and share your experiences using the business continuity and automation technologies and practices designed to ensure uninterrupted operations and rapid recovery for workloads running on IBM Power systems. 


#Power
#TechXchangeConferenceLab

 View Only
  • 1.  HACMP 5.4 Sync and Discovery Failure

    Posted Tue April 13, 2010 02:11 PM

    Originally posted by: SystemAdmin


    I have a pb in Discovery HACMP related Information and Synchronization. I'm in AIX 5.3TL8 and HACMP 5.4.1.7

    Discovery Error:

    IP Network Discovery completed normally

    Discovering Volume Group Configuration

    Initializing..

    Gathering cluster information, which may take a few minutes...

    crmpr01b: cexec47: cllsvgdata: not found

    crmpr01b: cl_rsh had exit code = 127, see cspoc.log and/or clcomd.log for mor
    nformation

    Processing...

    Storing the following information in file

    /usr/es/sbin/cluster/etc/config/clvg_config cl_cw_harvest_vg:

    Hdisk: Errorgatheringclusterinformation.

    PVID:

    VGname:

    VGmajor:

    Conc-capable: Yes

    VGactive: No
    Synchronization Error :

    WARNING: No entries in /etc/hosts on node: crmpr01b.

    A corrective action is available for the condition reported below:
    ERROR: Node: crmpr01b is missing entry '192.168.140.1 pr1aboot4' in the /etc/hos
    ts configuration file.
    The two /etc/hosts file are OK on the 2 nodes.

    Please Help !!
    #PowerHA-(Formerly-known-as-HACMP)-Technical-Forum
    #PowerHAforAIX


  • 2.  Re: HACMP 5.4 Sync and Discovery Failure

    Posted Wed April 14, 2010 03:48 PM

    Originally posted by: Casey_B


    Could be alot of things.

    First on my mind would be a communication error between the two nodes.
    Sometimes that would affect clcomd's ability to send information for discovery, or verify from one node
    to the other.

    clcomd is built to be robust in case of individual connections between the two nodes in the clusters.
    To do that, it may communicate on any ip address that is configured for HA.

    So, if you have an odd route, or some other configuration problem...It is possible for clcomd to fail
    intermittently but the other parts of the cluster work just fine.

    But it is hard to tell from a description. Usually the quickest way to solve the problem is through reviewing
    a snap -e. And the quickest person to do that is your local friendly IBM support person. :)

    Hope this helps,
    Casey
    #PowerHA-(Formerly-known-as-HACMP)-Technical-Forum
    #PowerHAforAIX


  • 3.  Re: HACMP 5.4 Sync and Discovery Failure

    Posted Sat April 17, 2010 05:27 AM

    Originally posted by: SystemAdmin


    I fixed the problem by updating AIX to last fix 5.3 TL11 SP3. and HACMP 5.4.1.3 to the latest fix 5.4.1.7
    #PowerHAforAIX
    #PowerHA-(Formerly-known-as-HACMP)-Technical-Forum