IBM FlashSystem

  • 1.  IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Tue September 10, 2024 04:24 PM

    A canister node failed about a month ago and we removed it from the system.

    We replaced the node with another V5010 canister (PN: 01AC370). We moved the original RAM and cache battery into it to carry the service IP over from the old canister to the new one.

    I can log in to the new canister. I was able to remove the system information it previously contained, and it now shows the correct WWNNs and serial number.

    The active node, however, does not see the new node, which is currently in "candidate" state. When I run svcinfo lsnodecandidate, nothing is returned.

    Active node information:

    Node ID: 3
    Node Name: node1
    Node Status: Active
    Part Identity: 11S01AC367YM12BG65B01K
    Node FRU: 01AC370
    Configuration Node: Yes
    Model: T5L
    System: SAN02P04
    Site Name:
    System Software Build: 125.0.1605241053000
    Software Version: 7.6.1.4
    Software Build: 125.0.1605241053000
    Console IP: 10.8.89.252:443
    Has File Module Key: No

    Canister Location: 1
    Enclosure ID: 11S00MJ082YM12BG65E002
    Machine Type and Model: 2078-124
    Serial Number: 781C582
    Detected Hardware Same as Configured: Yes
    Detected Hardware Valid: Yes
    Able to provide additional IO Ports? No
    Battery
    Estimated Start Time: 0
    Battery Charge: 100

    Canister Location: 1 1
    Machine Type and Model: 2078-124 2078-124
    Serial Number: 781C582 781C582
    WWNN 1: 500507680d005dbc 500507680d005dbc
    WWNN 2: 500507680d005dbd 500507680d005dbd
    System ID: e020400c86 --
    Next System ID: e020600c86 --

    Here is info on the new canister:

    Node ID:
    Node Name:
    Node Status: Candidate
    Part Identity: 11S01AC367YM12BG71901Y
    Node FRU: 01AC370
    Configuration Node: No
    Model: T5L
    System:
    Site Name:
    System Software Build:
    Software Version: 7.8.1.4
    Software Build: 135.5.1712071656000
    Console IP:
    Has File Module Key: No

    Canister Location: 2
    Enclosure ID: 11S00MJ082YM12BG65E002
    Machine Type and Model: 2078-124
    Serial Number: 781C582
    Detected Hardware Same as Configured: Yes
    Detected Hardware Valid: Yes
    Able to provide additional IO Ports? No
    Battery
    Estimated Start Time: 0
    Battery Charge: 100

    Canister Location: 2 2
    Machine Type and Model: 2078-124 2078-124
    Serial Number: 781C582 781C582
    WWNN 1: 500507680d005dbc 500507680d005dbc
    WWNN 2: 500507680d005dbd 500507680d005dbd
    System ID: e020400c86 --
    Next System ID: e020600c86 --
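
For readers comparing the two listings above, here is a minimal Python sketch that diffs a few "Key: Value" fields between the active and candidate canisters. The helper names (`parse_node_info`, `differing_fields`) and the choice of fields are illustrative assumptions, not part of any IBM tool; the values are copied from the listings in this post. It only surfaces differences. Whether a given difference explains the missing candidate is not established here, and a later post in this thread reports that installing matching code did not restore communication.

```python
def parse_node_info(text):
    """Turn 'Key: Value' lines into a dict; the first occurrence of a key wins."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as "10.8.89.252:443" stay intact.
        key, sep, value = line.partition(":")
        if sep and key.strip() not in info:
            info[key.strip()] = value.strip()
    return info


def differing_fields(a, b, fields):
    """Return {field: (a_value, b_value)} for the listed fields whose values differ."""
    return {
        f: (a.get(f, ""), b.get(f, ""))
        for f in fields
        if a.get(f, "") != b.get(f, "")
    }


# Values copied from the two listings in this post (trimmed to a few fields).
ACTIVE = """\
Node Status: Active
Node FRU: 01AC370
Model: T5L
Software Version: 7.6.1.4
Software Build: 125.0.1605241053000
Enclosure ID: 11S00MJ082YM12BG65E002
Serial Number: 781C582
"""

CANDIDATE = """\
Node Status: Candidate
Node FRU: 01AC370
Model: T5L
Software Version: 7.8.1.4
Software Build: 135.5.1712071656000
Enclosure ID: 11S00MJ082YM12BG65E002
Serial Number: 781C582
"""

# Fields that would normally be expected to match within one enclosure (an assumption).
FIELDS = ["Node FRU", "Model", "Software Version", "Software Build",
          "Enclosure ID", "Serial Number"]

diff = differing_fields(parse_node_info(ACTIVE), parse_node_info(CANDIDATE), FIELDS)
for field, (active, candidate) in diff.items():
    print(f"{field}: active={active} candidate={candidate}")
```

With the values above, only the software version and build differ (7.6.1.4 on the active node, 7.8.1.4 on the candidate); FRU, model, enclosure ID, and serial number match.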

    Am I missing something? Thank you in advance.



    ------------------------------
    Edstrom IT
    ------------------------------


  • 2.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Wed September 11, 2024 01:25 AM

    Hi,
    Use 'lsnodecanister' when logged in to the cluster IP address.

    You should see that you have an offline node. Take note of the leftmost number in its output line, which is the ID assigned to that offline node,
    then issue 'rmnodecanister x', where x is the ID you noted from the previous output.

    Check 'lsnodecanister' again; you should now have only the one node that is online.
    Then open the Service Assistant at "http://<cluster IP address>/service",
    select the candidate node, choose 'reboot' from the pull-down menu, and click Go.
    *** Ensure you do not block pop-ups. If you get a pop-up blocked notification,
    enable pop-ups and redo the reboot request.

    Once the node reboots, it should automatically be added to your cluster.
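
The "take note of the ID of the offline node" step can be sketched in Python as follows. This is a minimal illustration under stated assumptions: it expects comma-delimited output, such as the CLI's `-delim` option is understood to produce (verify this on your code level); the function name `offline_node_ids` is hypothetical; and the sample text is made up for illustration, with an invented offline `node2` row and only a subset of the real columns. It does not run any command against the system. It only picks out the IDs you would then pass to 'rmnodecanister'.

```python
import csv
import io


def offline_node_ids(delimited_output, delimiter=","):
    """Return the 'id' of every row whose 'status' column is not 'online'."""
    reader = csv.DictReader(io.StringIO(delimited_output), delimiter=delimiter)
    return [row["id"] for row in reader if row["status"] != "online"]


# Hypothetical sample: a trimmed column set, with an invented offline node2 row.
SAMPLE = """\
id,name,status,IO_group_id,config_node
3,node1,online,0,yes
4,node2,offline,0,no
"""

for node_id in offline_node_ids(SAMPLE):
    # Print the command for review instead of executing anything.
    print(f"rmnodecanister {node_id}")
```

If the list comes back empty, there is no offline node entry left to remove, and the steps above continue with the reboot from the Service Assistant.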



    ------------------------------
    GLEN ROUTLEY
    ------------------------------



  • 3.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Thu September 19, 2024 10:23 AM

    Thank you for responding, Glen.

    Below is the output of lsnodecanister:

    IBM_Storwize:SAN02P04:superuser>lsnodecanister

    id name  UPS_serial_number WWNN             status IO_group_id IO_group_name config_node UPS_unique_id hardware iscsi_name                              iscsi_alias panel_name enclosure_id canister_id enclosure_serial_number site_id site_name

    3  node1                   500507680D005DBC online 0           io_grp0       yes                       T5L      iqn.1986-03.com.ibm:2145.san02p04.node1             01-1       1            1           781C582        

    Only one shows up.

    I've tried installing several canisters in the second slot with the original RAM and battery. They all come online and show the correct serial number and WWNNs, but the canisters don't see each other, and I get error 734 1 1 0. The system does not show anything connected to the second slot with any canister I try. I have been able to log in to the second canister and even install the matching code version, but there is still no communication between the system and the second canister. Does this mean there may be an issue with the chassis midplane that interconnects the two canisters?



    ------------------------------
    Edstrom IT
    ------------------------------



  • 4.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Thu September 19, 2024 08:20 PM

    Hello Edstrom IT,
    Indeed, the '734' error states that the PCIe link is not communicating, so the node cannot see the other node.
    While the midplane is a possible cause, it is more likely that the active canister is at fault.

    If this V5010 has Fibre Channel interfaces, I suggest connecting 2 ports per node to a switch and zoning them all in 1 zone so they can see each other.
    The nodes will then communicate over the Fibre Channel paths, and the replaced node can be added to the cluster that way.
    Once you have a 2-node cluster, try rebooting, reseating, or replacing node1 to resolve the '734' problem.



    ------------------------------
    GLEN ROUTLEY
    ------------------------------



  • 5.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Fri September 20, 2024 09:00 AM

    These canisters don't have Fibre Channel, but they do have SFP+ ports.

    There's one free SFP+ port on each canister. Is it possible to do this without Fibre Channel?



    ------------------------------
    Edstrom IT
    ------------------------------



  • 6.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Sun September 22, 2024 11:02 PM

    Hi Edstrom,
    An "SFP+" is merely a transceiver for an interface, which can be either 10GbE (Ethernet) or FC (Fibre Channel).
    Do the other interfaces connect directly to hosts or via a switch,
    and is that switch an Ethernet (LAN) switch or a Fibre Channel (SAN) switch?

    If the latter, you can zone both nodes' interfaces to see each other on the SAN switch(es).
    However, if you are on Ethernet, unfortunately this method will not work.

    You would need an outage, and given the state of things, the storage likely will not come back up from a power cycle.
    So I recommend opening a case with IBM to assist you further.



    ------------------------------
    GLEN ROUTLEY
    ------------------------------



  • 7.  RE: IBM Storwize v5010 canister node not displaying in lscandidatenodes

    Posted Mon September 23, 2024 10:58 AM

    More specifically, the canisters contain 4-port 10GbE interfaces connected to a pair of Ethernet switches.

    Thank you for the reply and the help, Glen.



    ------------------------------
    Edstrom IT
    ------------------------------