Checking Job Status
Job Status can be checked by bjobs
command. For more details on bjobs , go to jobs
Some example on jobs:
1. bjobs
It will display all the LSF jobs
for example
[bhatiag@ds-cmgpu-04 ~]$ bjobs JOBID USER STAT QUEUE FROM_HOST EXEC_HOST JOB_NAME SUBMIT_TIME 5970 bhatiag RUN gpu ds-lg-01 ds-cmgpu-03 bash Dec 4 10:34 5971 bhatiag RUN gpu ds-cmgpu-03 ds-cmgpu-04 bash Dec 4 10:55
2. bjobs jobid
Using jobid with jobs will display the status for that specific job
for example
[bhatiag@ds-cmgpu-04 ~]$ bjobs 5970 JOBID USER STAT QUEUE FROM_HOST EXEC_HOST JOB_NAME SUBMIT_TIME 5970 bhatiag RUN gpu ds-lg-01 ds-cmgpu-03 bash Dec 4 10:34
[bhatiag@ds-cmgpu-04 ~]$ bjobs 5970 5971 JOBID USER STAT QUEUE FROM_HOST EXEC_HOST JOB_NAME SUBMIT_TIME 5970 bhatiag RUN gpu ds-lg-01 ds-cmgpu-03 bash Dec 4 10:34 5971 bhatiag RUN gpu ds-cmgpu-03 ds-cmgpu-04 bash Dec 4 10:55
3 bjobs -l -gpu jobid
Using -gpu with jobs will display the gpu jobs
for example
[bhatiag@ds-cmgpu-04 ~]$ bjobs -l -gpu 5970 Job <5970>, User <bhatiag>, Project <default>, Status <RUN>, Queue <gpu>, Inter active pseudo-terminal shell mode, Command <bash>, Share g roup charged </bhatiag> Fri Dec 4 10:34:08: Submitted from host <ds-lg-01>, CWD <$HOME>, Requested GPU ; Fri Dec 4 10:34:08: Started 1 Task(s) on Host(s) <ds-cmgpu-03>, Allocated 1 Sl ot(s) on Host(s) <ds-cmgpu-03>; Fri Dec 4 11:01:33: Resource usage collected. The CPU time used is 1 seconds. MEM: 14 Mbytes; SWAP: 0 Mbytes; NTHREAD: 6 PGID: 49226; PIDs: 49226 PGID: 49240; PIDs: 49240 PGID: 49242; PIDs: 49242 PGID: 51520; PIDs: 51520 RUNLIMIT 10080.0 min MEMORY USAGE: MAX MEM: 14 Mbytes; AVG MEM: 8 Mbytes SCHEDULING PARAMETERS: r15s r1m r15m ut pg io ls it tmp swp mem loadSched - - - - - - - - - - - loadStop - - - - - - - - - - - EXTERNAL MESSAGES: MSG_ID FROM POST_TIME MESSAGE ATTACHMENT 0 bhatiag Dec 4 10:34 ds-cmgpu-03:gpus=1; N RESOURCE REQUIREMENT DETAILS: Combined: select[(type == any ) && (ngpus>0)] order[r15s:pg] rusage[ngpus_phys ical=1.00] Effective: select[(type == any ) && (ngpus>0)] order[r15s:pg] rusage[ngpus_phy sical=1.00] GPU REQUIREMENT DETAILS: Combined: num=1:mode=exclusive_process:mps=yes:j_exclusive=yes Effective: num=1:mode=exclusive_process:mps=yes:j_exclusive=yes GPU_ALLOCATION: HOST TASK ID MODEL MTOTAL FACTOR MRSV SOCKET NVLINK ds-cmgpu-03 0 1 TeslaP100_SX 15.8G 6.0 0 1 -
Note: bkill, bresume, and bstop are some of the useful commands and users should read the manual page of them.Click Here