Originally posted by: SystemAdmin
Hi,
We are running multiple AIX 5.3.7 and AS/400 instances on a P6 570 system. On one AIX lpar, We recently upgraded Informix 9.4 to version 11.50, and two days later the whole system locked up. Users were unable to continue work, or logon; but, jobs continued to run from crontabs, and it was also possible to run commands from other lpars using ssh. Tests indicated that CPU usage was 100%, paging zero, and there was no change in memory utilisation. Initially the major users of the CPU were GIL and the database, but after a few minutes these were joined by runaway Korn shells. Using ssh we were able to shutdown the databases and reboot the system but two days later the problem reappeared; so we have since rolled-back the Informix upgade.
Can anyone suggest why/how a problem would lockup users and prevent logins, but still permit processes to run from remote systems or from crontabs?
Second question, how does one tell if a system has run out of ports or ptys?
There was no indication of any problem in the error or system logs. The DBA could not find errors output by the database, and the application itself did not record any errors.
Any suggestions,
Spook