Just tested, that did not go over too well:
[lsfadmin@vmhost6 configdir]$ bhist -l 101 | grep "Running with" | wc -l
80
But as soon as I set the max value in the queue, the job went suspended. I would suggest you talk to the LSF admin and suggest they add that setting as it will prevent the requeue loop for other users.
------------------------------
Larry Adams
------------------------------
Original Message:
Sent: Fri August 27, 2021 03:42 AM
From: Romain Bouquet
Subject: LSF bsub -Q maximal number of automatic requeue
Hi @Larry Adams,
Maybe there is something I do not understand,
according to this link it seems to me one has to be administrator to change the lsb.queues file. Am I wrong?
Sadly I am just a regular user.
Would there be a way to use MAX_JOB_REQUEUE for a regular user?
For instance something like (or similar syntax to retry jobs only twice)
export MAX_JOB_REQUEUE=2
bsub -Q "all ~0" <myjob>
So that the LSF/bsub command takes into account the value MAX_JOB_REQUEUE ?
Thanks in advance,
Best,
------------------------------
Romain Bouquet
Original Message:
Sent: Thu August 26, 2021 11:29 AM
From: Larry Adams
Subject: LSF bsub -Q maximal number of automatic requeue
Roman,
It's simple to do. Here is a section from the man pages for lsb.queues
MAX_JOB_REQUEUE
Specifies the maximum number of times to requeue a job automati‐
cally.
Syntax
MAX_JOB_REQUEUE=integer
Valid values
0 < MAX_JOB_REQUEUE < INFINIT_INT
INFINIT_INT is defined in lsf.h.
Default
Not defined. The number of requeue times is unlimited
------------------------------
Larry Adams
Original Message:
Sent: Thu August 26, 2021 03:21 AM
From: Romain Bouquet
Subject: LSF bsub -Q maximal number of automatic requeue
Hi @YI SUN,
Alright it is unfortunate but thanks a lot for your help!
Best,
------------------------------
Romain Bouquet
Original Message:
Sent: Tue August 24, 2021 06:19 PM
From: YI SUN
Subject: LSF bsub -Q maximal number of automatic requeue
It doesn't seem LSF has job level control on this. You will have to ask your admin to add this for you.
------------------------------
YI SUN
Original Message:
Sent: Tue August 24, 2021 05:32 AM
From: Romain Bouquet
Subject: LSF bsub -Q maximal number of automatic requeue
Hi @YI SUN,
Thanks for your reply, I had a look but it seems to me that one has to be administrator to create an application profile [link].
Unfortunately I am just a regular user.
Would there be a solution to create an application profile being a regular user ?
Thanks again,
Best
------------------------------
Romain Bouquet
Original Message:
Sent: Mon August 23, 2021 11:02 AM
From: YI SUN
Subject: LSF bsub -Q maximal number of automatic requeue
As the document says, you also can configure MAX_JOB_REQUEUE in lsb.applications for an application profile, e.g.
Begin Application
NAME = myrequeujob
DESCRIPTION = only requeue two times
MAX_JOB_REQUEUE = 2
End Application
Then submit your job like this, bsub -Q "all ~0" -app myrequeuejob <myjob>
------------------------------
YI SUN
Original Message:
Sent: Mon August 23, 2021 06:15 AM
From: Romain Bouquet
Subject: LSF bsub -Q maximal number of automatic requeue
Hi all,
I would like to know how to set the maximal number of automatic requeue for a job when using bsub -Q.
I saw here [link] that "Specifying a job-level exit value using bsub -Q overrides all MAX_JOB_REQUEUE settings."
I am using the following command to submit jobs
bsub -Q "all ~0" MyCommand
It there a way to set that MAX_JOB_REQUEUE to 2 for example for a specific job ?
Many thanks in advance for your help!
Best,
------------------------------
Romain Bouquet
------------------------------
#SpectrumComputingGroup