Maximo

Nodes going down

1. Nodes going down

Chaitanya Kumar R
Posted 04/02/24 12:46 PM

Hi All,

We have a clustered Maximo 7.6.1 environment with 3 nodes. All of the nodes are shutting down frequently, and we couldn't get any information from the logs.

Can someone help me solve this issue?

------------------------------
C K K
------------------------------
2. RE: Nodes going down

Jason VenHuizen

IBM Champion
Posted 04/02/24 12:53 PM

That sounds like you are having an out of memory issue. If you look in the IBM/WebSphere/AppServer/profiles/ctgAppSrv01/ folder (yours may be slightly different) you will see javacore* files and phd files. These files have details about the memory usage at the time when the JVM ran out of memory. Here is the utility to do the analysis: https://www.ibm.com/support/pages/ibm-heapanalyzer
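
As a rough sketch of that first step, the Python script below lists the javacore and phd files in a profile folder and prints the line in each javacore that records why the dump was taken. It assumes the usual IBM JVM naming (`javacore*`, `*.phd`) and that the trigger appears on a line starting with `1TISIGINFO`; the function names are made up for illustration, and the folder is passed on the command line because yours may differ.

```python
import sys
from pathlib import Path

# Assumed file-name patterns for IBM JVM diagnostic files; adjust if yours differ.
DUMP_PATTERNS = ("javacore*", "*.phd")


def find_dump_files(profile_dir):
    """Return diagnostic files directly inside profile_dir, oldest first."""
    root = Path(profile_dir)
    if not root.is_dir():
        return []
    found = set()
    for pattern in DUMP_PATTERNS:
        found.update(p for p in root.glob(pattern) if p.is_file())
    return sorted(found, key=lambda p: (p.stat().st_mtime, p.name))


def dump_reasons(javacore_path):
    """Return the text of each 1TISIGINFO line (assumed to name the dump trigger)."""
    reasons = []
    with open(javacore_path, errors="replace") as handle:
        for line in handle:
            if line.startswith("1TISIGINFO"):
                parts = line.split(None, 1)
                if len(parts) == 2:
                    reasons.append(parts[1].strip())
    return reasons


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python find_dumps.py <path to the profile folder>
    for path in find_dump_files(sys.argv[1]):
        size_mb = path.stat().st_size / (1024 * 1024)
        print(f"{path.name}  {size_mb:.1f} MB")
        if path.name.startswith("javacore"):
            for reason in dump_reasons(path):
                flag = "OUT OF MEMORY" if "OutOfMemoryError" in reason else "other"
                print(f"    [{flag}] {reason}")
```

If the javacores do mention `OutOfMemoryError`, that supports the out-of-memory theory and the matching phd files are the ones worth loading into the analysis utility.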

You may also want to check that verbose garbage collection is turned off for the JVMs, as one of the WebSphere 9.0.5.x versions turned verbose GC on by default, and it absolutely kills performance when it kicks in. It is great for getting information about garbage collection issues, but for production it should be turned off. You should also check that you have all the various JVM settings in place, which are documented here: https://www.ibm.com/support/pages/maximo-76-performance-best-practice-guide
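
One way to check the verbose GC setting without clicking through the console is to read each server's `server.xml`. The sketch below assumes the JVM settings sit on `jvmEntries` elements with a `verboseModeGarbageCollection` attribute and a `genericJvmArguments` attribute; treat those names as assumptions to confirm against your own file. The function name is hypothetical.

```python
import sys
import xml.etree.ElementTree as ET

# Assumption: these are the attribute and element names used in server.xml.
VERBOSE_GC_ATTR = "verboseModeGarbageCollection"
JVM_ELEMENT = "jvmEntries"
# Command-line switches that also turn on verbose GC output.
GC_FLAGS = ("-verbose:gc", "-Xverbosegclog")


def verbose_gc_findings(server_xml_text):
    """Return a note for every sign that verbose GC is enabled."""
    findings = []
    root = ET.fromstring(server_xml_text)
    for jvm in root.iter(JVM_ELEMENT):
        if jvm.get(VERBOSE_GC_ATTR, "false").lower() == "true":
            findings.append(f'{VERBOSE_GC_ATTR}="true"')
        args = jvm.get("genericJvmArguments", "")
        for flag in GC_FLAGS:
            if flag in args:
                findings.append(f"generic JVM arguments contain {flag}")
    return findings


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_verbose_gc.py <path to server.xml>
    with open(sys.argv[1], encoding="utf-8") as handle:
        notes = verbose_gc_findings(handle.read())
    if notes:
        for note in notes:
            print("verbose GC appears to be ON:", note)
    else:
        print("no sign of verbose GC in this file")
```

An empty result only means this file shows no sign of verbose GC; the setting should still be confirmed in the WebSphere console for each JVM in the cluster.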

Regards,

Jason
3. RE: Nodes going down

Chaitanya Kumar R
Posted 04/03/24 03:42 PM

Hey Jason,

Thank you for the response. After you described this as a memory issue, I went to the server and looked at the node's process in Task Manager. I observed a lot of memory consumption, as well as CPU usage of around 65%. I am not sure what the next step is.

Could you please advise on what needs to be done for resolving this?

Thanks

4. RE: Nodes going down

Jeff Tippett
Posted 04/03/24 02:04 PM

Hi Chaitanya,

We've seen clients experience this when they have Sentinel virus software running with a clustered 7.6.1.3 environment, so this might be the same for you if you are running Sentinel virus software. The solution was to add exclusions to Sentinel for the \IBM\WebSphere folders on the servers where WebSphere is installed. You may have to restart the servers for this to take effect.

------------------------------
Jeff Tippett
M.G.I. Maintenance Group Inc.
Halifax NS
------------------------------
5. RE: Nodes going down

Chaitanya Kumar R
Posted 04/03/24 03:45 PM

Hi Jeff,

Thanks for the response.

We are not using the Sentinel virus software, either for any application or on the server, so we are still not sure why the nodes are shutting down.

6. RE: Nodes going down

Mark Robbins

IBM Champion
Posted 04/04/24 04:19 AM
Edited by Mark Robbins 04/04/24 04:21 AM

As Jason says, this sounds like at least one of your JVMs is running out of memory and then having to disconnect from the mesh **. When the JVM disconnects from the mesh, there will be messages like the ones that you asked about in your post of 8th March in the WebSphere forum.

You say that you have checked the memory usage but I suspect you have been checking that at the operating system level.

Maximo will grab as much memory as it is allowed and then manage that internally. If a thread asks for memory that isn't available, then the dump files that Jason refers to will be created. At an operating system level it will appear that the JVM is still using the same memory that it was, and it won't be asking for more than it's allowed.

You need to understand what is using your memory.

There are two main ways to check this:

Checking the MBO counts - this will give you an idea about the Maximo Business Objects that are in memory. See this article for an introduction: https://www.linkedin.com/pulse/seeing-which-mbosetsmbos-using-memory-mark-robbins/

Analysing the heapdump - the JVM would need to be configured to generate heapdumps when the memory is exhausted. It is easy to implement, but it can lead to 6GB files being generated on the server, and that can cause the overall server to fail because it has run out of disk space.
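
Before turning heapdumps on, it may be worth checking how many dumps the target drive could hold. The Python sketch below works that out from the free space reported by `shutil.disk_usage`, using the 6GB figure mentioned above as the assumed dump size; the 10 GiB reserve and the function names are hypothetical values chosen for illustration.

```python
import shutil
import sys

GIB = 1024 ** 3


def dumps_that_fit(free_bytes, dump_bytes, reserve_bytes=0):
    """Return how many dumps of dump_bytes fit while keeping reserve_bytes free."""
    usable = max(free_bytes - reserve_bytes, 0)
    return usable // dump_bytes


def check_dump_volume(path, dump_gib=6, reserve_gib=10):
    """Return how many dumps fit on the volume that contains path."""
    free = shutil.disk_usage(path).free
    return dumps_that_fit(free, dump_gib * GIB, reserve_gib * GIB)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python dump_space.py <folder where the JVM writes its dumps>
    count = check_dump_volume(sys.argv[1])
    print(f"room for about {count} heapdump(s) of 6 GiB with 10 GiB kept free")
    if count < 3:
        print("warning: a few out-of-memory events could fill this disk")
```

With three JVMs failing frequently, several dumps can be written in a short time, so a small result here is a reason to point the dumps at a larger volume or clear old ones first.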

The whole area is quite complex so I would be surprised if you get a lot of free advice about how to investigate / resolve this.

Some organisations, like my employer Cohesive, offer support consultancy and we can definitely help you with this.

I have investigated similar situations in the past so I know that we can help. Message me privately if your organisation would like to buy some consultancy time.

** I mentioned the mesh earlier... you may find this article interesting as I talk about the mesh...

------------------------------
Mark Robbins
Support Lead/Technical Design Authority / IBM Champion 2017 - 2023
Cohesive (previously Vetasi Limited)
https://www.linkedin.com/pulse/maximo-support-advice-from-non-ibm-engineer-article-mark-robbins/
------------------------------
7. RE: Nodes going down

Sankar Ganesh V S

IBM Champion
Posted 04/04/24 04:35 AM

Did you check the logs in the ffdc folder inside the NodeAgent profile?
Are the other JVMs running correctly?

------------------------------
Sankar Ganesh
------------------------------
8. RE: Nodes going down

Bradley Downing
Posted 04/04/24 07:07 PM

Hi!

Here is an oldie but a goodie: https://www.ibm.com/support/pages/maximo-76-performance-best-practice-guide. This was written by one of the best in the business, @Darlene Nerden. Pair the info in this doc with what @Jason VenHuizen and @Mark Robbins have said.

Hope this helps.

------------------------------
Bradley K. Downing , MBA
Senior Solution Architect
IBM
Bakersfield CA
------------------------------
