HMC

  • 1.  Where to find power supply status, hardware sensor data?

    Posted 20 days ago
    I've asked this before in other places, but the topic came up again today. Where can I find the current power supply status, line in status, CPU temp, fan speed, and other sensor data common to a high end system?

    The HMC does a great job of notifying when a power leg is lost, calling home, setting alarm led, etc. However there's no way to tell when it's fixed.

    Back in AIX 4 we could use uesensor to get limited information, but that's no longer supported.

    machstat gives very poor information for rc.powerfail, so that's terribly incomplete.

    Where else can we look for this information?

    ------------------------------
    Russell Adams
    ------------------------------


  • 2.  RE: Where to find power supply status, hardware sensor data?

    Posted 8 days ago

    Russell
    I don´t know if you check it this before. With HMC API, theorically is possible.
    Regards

    Below is the HMC Knowledge Center documentation that gets updated periodically to reflect the changes in HMC REST Interfaces.

    https://www.ibm.com/support/knowledgecenter/POWER8/p8ehl/concepts/ApiOverview.htm


    Below is the HMC API that provides Power Supplies and Fans related information along with Status of each of these devices.

    https://<hmcip>/rest/api/uom/ManagedSystem/<UUID>?group=None&hwinventory=true

    In order to see these fields (power supply and fans), the following levels are needed on the HMCs and / or managed servers:

    HMC - 8.6 SP2 with PTF MH01716

    Server firmware - 860_103



    ------------------------------
    Humberto Sosa
    ------------------------------



  • 3.  RE: Where to find power supply status, hardware sensor data?

    Posted 7 days ago
    Do you have an example? I've looked through the API pages and it doesn't document anything about fans or power.

    I did see a python example for querying some data: https://www.ibm.com/support/pages/power8-watts-temp-ssp-io-serverlpar-stats-hmc-rest-api-version-10


    ------------------------------
    Russell Adams
    ------------------------------



  • 4.  RE: Where to find power supply status, hardware sensor data?

    Posted 6 days ago
    Hi Russell,

    I haven't seen metrics for fan speeds in the HMC performance metrics data, but there are metrics for temperatures and power consumption.

    You can see an example of the output here:
    https://bitbucket.org/mnellemann/hmci/src/master/src/test/resources/pcm-data-energy.json


    Best regards,

    ------------------------------
    Mark Nellemann
    ------------------------------



  • 5.  RE: Where to find power supply status, hardware sensor data?

    Posted 6 days ago
    On Fri, Nov 26, 2021 at 08:54:10AM +0000, Mark Nellemann via IBM Community wrote:
    > I haven't seen metrics for fan speeds in the HMC performance metrics
    > data, but there are metrics for temperatures and power consumption.
    >
    > You can see an example of the output here:
    > https://bitbucket.org/mnellemann/hmci/src/master/src/test/resources/pcm-data-energy.json

    This is a great example of temperature. I think temp and power
    consumption are addressed, but not power health.

    ------------------------------------------------------------------
    Russell Adams Russell.Adams@AdamsSystems.nl
    Principal Consultant Adams Systems Consultancy
    http://adamssystems.nl/




  • 6.  RE: Where to find power supply status, hardware sensor data?

    Posted 6 days ago
    Hello Russell

    Definitely, the sensors exists but for some reason are hidden.

    Energy Monitoring
    Maybe in the RAW metrics the fan and psu status are gathered.


    Another way may be configure and manage your system by using the Intelligent Platform Management Interface (IPMI).
    ipmitool -I lanplus -H myserver.example.com -P mypass sdr list Lists status of all sensors.
    ipmitool -I lanplus -H myserver.example.com -P mypass chassis status Checks the server status.
    Sensors

    IBM® Power Systems servers use a baseboard management controller (BMC) for system service management, monitoring, maintenance, and control. The BMC also provides access to the system event log files (SEL). The BMC is a specialized service processor that monitors the physical state of the system by using sensors. 
    Managing the system by using OpenBMC-based HMC (7063-CR2)


    ------------------------------
    Humberto Sosa
    ------------------------------



  • 7.  RE: Where to find power supply status, hardware sensor data?

    Posted 2 days ago

    Have a look at Nigel's stuff

    AIXpert Blog from Nigel Griffiths (@mr_nmon)



    nextract Plus for HMC REST API Performance Statistics






    ------------------------------
    Bryan Dietz
    ------------------------------