Htc cheat sheet: Difference between revisions
No edit summary |
|||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
=== List of online workernodes === | === List of online workernodes === | ||
condor_status | condor_status -server | ||
=== To put a node in draining === | === To put a node in draining === | ||
Line 6: | Line 6: | ||
Add the "-cancel" option to stop the draining. | Add the "-cancel" option to stop the draining. | ||
=== Summary of the users and their jobs === | |||
condor_status -submitters | |||
=== Get a list of jobs and where they are running === | === Get a list of jobs and where they are running === | ||
Line 64: | Line 67: | ||
=== Why a job stays in idle status? === | === Why a job stays in idle status? === | ||
condor_q -better-analyze <jobid> | condor_q -better-analyze <jobid> <br> | ||
condor_q -better-analyze -reverse <jobid> |
Latest revision as of 09:57, 23 September 2021
List of online workernodes
condor_status -server
To put a node in draining
condor_drain <name_of_workernode>
Add the "-cancel" option to stop the draining.
Summary of the users and their jobs
condor_status -submitters
Get a list of jobs and where they are running
condor_q -run
Note: you must be on the CE to get the workernodes
Get the list of jobs on a given worker
List of jobs on node19-7.wn.iihe.ac.be:
condor_q -constraint 'regexp(".*@node19-7.wn.iihe.ac.be", RemoteHost)'
Get the list of jobs in a given status
If you want to see jobs in run status (JobStatus == 2):
condor_q -constraint "JobStatus == 2" -nobatch
(-nobatch is to split the list when you submit a batch of jobs with queue > 1)
Here is the list of job statuses:
0 | Unexpanded |
1 | Idle |
2 | Running |
3 | Removed |
4 | Completed |
5 | Held |
6 | Submission err |
View details about a job
condor_q -l <jobid>
Get info on a finished job
condor_history [-long] <jobid>
Example:
condor_history -constraint 'Owner == "sgerard" && JobStatus ==4' -limit 10
->you'll get the list, limited to 10, of completed jobs of user sgerard
Why a job is on hold?
condor_q -hold -af HoldReason <jobid>
Why a job stays in idle status?
condor_q -better-analyze <jobid>
condor_q -better-analyze -reverse <jobid>