Understood. It seems LSF command behavior is not consistent, suggest to create a case with Support.
Original Message:
Sent: Wed January 17, 2024 11:17 PM
From: t4tkq t4tkq
Subject: The suspended jobs does not display suspending reasons with bjobs customized output
I have already tried it, according to command output, will filter PEND job not [SUP]SUSP
lsfadmin ~ $ bstop 1033Job <1033> is being stoppedlsfadmin ~ $ bjobs -p1 -o 'jobid stat pend_reason suspend_reason'No pending job foundlsfadmin ~ $ bjobs -s -o 'jobid stat pend_reason suspend_reason'JOBID STAT PEND_REASON SUSPEND_REASON1033 USUSP - -
------------------------------
t4tkq t4tkq
Original Message:
Sent: Wed January 17, 2024 07:00 PM
From: YI SUN
Subject: The suspended jobs does not display suspending reasons with bjobs customized output
Could you try bjobs -p1 -o 'jobid stat pend_reason suspend_reason'?
------------------------------
YI SUN
Original Message:
Sent: Mon January 15, 2024 09:23 AM
From: t4tkq t4tkq
Subject: The suspended jobs does not display suspending reasons with bjobs customized output
I have downloaded and completed the installation LSF Community Edition 10.1.0.12, Jun 10 2021 in CentOS 8
lsfadmin ~ $ lsidIBM Spectrum LSF Community Edition 10.1.0.12, Jun 10 2021Copyright IBM Corp. 1992, 2016. All rights reserved.
I have found behaves differently on `bjobs -l` and `bjobs -o 'jobid stat pend_reason suspend_reason'` for suspended jobs(SSUSP, USUSP and PSUSP), specific fields output does not display suspended reason
lsfadmin ~ $ bsub sleep 1000Job <102> is submitted to default queue <normal>.lsfadmin ~ $ bjobs -o 'jobid stat'JOBID STAT102 RUNlsfadmin ~ $ bstop 102Job <102> is being stoppedlsfadmin ~ $ bjobs -l 102Job <102>, User <lsfadmin>, Project <default>, Status <USUSP>, Queue <normal>, Command <sleep 1000>, Share group charged </lsfadmin>... ... SUSPENDING REASONS: Job was suspended by an administrator or root;... ...lsfadmin ~ $ bjobs -o 'jobid stat pend_reason suspend_reason'JOBID STAT PEND_REASON SUSPEND_REASON102 USUSP - -
`bstop -C '...' <jobId>` will display customized reason, but I would like to get information about `bjobs -l` reason. Because sometimes some administrators will only `bstop <jobId>`(no -C). Common normal user does not also display reason for suspending job.
lsfadmin ~ $ bstop -C 'testreason' 103Job <103> is being stoppedlsfadmin ~ $ bjobs -l 103Job <103>, User <lsfadmin>, Project <default>, Status <USUSP>, Queue <normal>,... ... SUSPENDING REASONS: Job was suspended by an administrator or root;... ...lsfadmin ~ $ bjobs -o 'jobid stat pend_reason suspend_reason'JOBID STAT PEND_REASON SUSPEND_REASON103 USUSP - testreason
For PEND job(resource requirements are not met, e.g.: ` -R affinity[core(100)]' `) will display same reason message for `bjob -l` and `bjob -o ...`
I have below questions:
1) Is this a bug or a feature for bjobs customized output with suspended job, is there any documentation to further explain this behavior
2) How to display similar `bjobs -l` suspending reason with specific field for bjobs
Thanks
------------------------------
t4tkq t4tkq
------------------------------