Automation with Power

Power Business Continuity and Automation

Connect, learn, and share your experiences using the business continuity and automation technologies and practices designed to ensure uninterrupted operations and rapid recovery for workloads running on IBM Power systems. 


#Power
#TechXchangeConferenceLab

 View Only

PowerHA Question : Failure Detection Rate Value

  • 1.  PowerHA Question : Failure Detection Rate Value

    Posted Fri May 29, 2015 04:12 AM

    Originally posted by: hwangryeyoun


    Hello~ Dear?

    I have a question about FDR(Failure Detection Rate) in HACMP V5.4.1

    Our Customer had a performance issue, so all of network's heartbeat didn't sent

    I was check nim_topsvcs.en3.xxx.cluster file in /var/ha/log/

    05/28 17:19:58.639: Heartbeat was NOT received. Missed HBs: 1. Limit: 10
    05/28 17:20:00.649: Heartbeat was NOT received. Missed HBs: 2. Limit: 10
    05/28 17:20:00.649: Starting sending ICMP ECHOs.
    05/28 17:20:00.649: Invoking netmon to find status of local adapter.
    05/28 17:20:02.649: Heartbeat was NOT received. Missed HBs: 3. Limit: 10
    05/28 17:20:04.409: netmon response: Adapter is up
    05/28 17:20:04.652: Heartbeat was NOT received. Missed HBs: 4. Limit: 10
    05/28 17:20:04.652: Invoking netmon to find status of local adapter.
    05/28 17:20:05.452: netmon response: Adapter is up
    05/28 17:20:06.652: Heartbeat was NOT received. Missed HBs: 5. Limit: 10
    05/28 17:20:06.652: Invoking netmon to find status of local adapter.
    05/28 17:20:07.453: netmon response: Adapter is up
    05/28 17:20:08.658: Heartbeat was NOT received. Missed HBs: 6. Limit: 10
    05/28 17:20:08.658: Invoking netmon to find status of local adapter.
    05/28 17:20:09.459: netmon response: Adapter is up
    05/28 17:20:10.659: Heartbeat was NOT received. Missed HBs: 7. Limit: 10
    05/28 17:20:10.659: Invoking netmon to find status of local adapter.
    05/28 17:20:11.459: netmon response: Adapter is up
    05/28 17:20:12.659: Heartbeat was NOT received. Missed HBs: 8. Limit: 10
    05/28 17:20:12.659: Invoking netmon to find status of local adapter.
    05/28 17:20:13.460: netmon response: Adapter is up
    05/28 17:20:14.659: Heartbeat was NOT received. Missed HBs: 9. Limit: 10
    05/28 17:20:14.659: Invoking netmon to find status of local adapter.
    05/28 17:20:15.460: netmon response: Adapter is up
    05/28 17:20:16.665: Heartbeat was NOT received. 
    Missed HBs: 10. Limit: 10
    05/28 17:20:16.665: Invoking netmon to find status of local adapter.
    05/28 17:20:16.665: Receive thread seems to be blocked. 
    Setting sensitivity to 30.
    05/28 17:20:17.466: netmon response: Adapter is up
    05/28 17:20:18.666: Heartbeat was NOT received. 
    Missed HBs: 11. Limit: 30
    05/28 17:20:18.666: Invoking netmon to find status of local adapter.
    05/28 17:20:19.467: netmon response: Adapter is up
    
    

    as you can see above log...

    Missed HBs value was change 10 to 30.

    I'm wonder how can changed FDR value autometically

    Please give your comment.

    Thanks and Regards

     


    #PowerHAforAIX
    #PowerHA-(Formerly-known-as-HACMP)-Technical-Forum