[an error occurred while processing this directive] [an error occurred while processing this directive] [an error occurred while processing this directive]

showq


showq [ -g ] [ -c | -i | -r] [-p PARTITION] [ -v ]

Purpose

Displays information about active, eligible, blocked, and/or recently completed jobs.

Permissions

This command can be run by any user.  However, the -c, -i, and -r flags can only be used by level 1, 2, or 3 scheduler administrators.

Parameters

PARTITION partition for which job information should be displayed

Flags

-c display extended details about recently completed jobs.
-g display grid job and system id's for all jobs.
-i display extended details about idle jobs.
-p display only jobs assigned to the specified partition.
-r display extended details about active (running) jobs.
-v display extended date information including year.

Description

Since LoadLeveler is not actually scheduling jobs, the job ordering it displays is no longer valid. The showq command displays the actual job ordering under the Maui Scheduler. When used without flags, this command displays all jobs in active, idle, and non-queued states.

Example 1

% showq

ACTIVE JOBS--------------------
         JOBNAME USERNAME      STATE  PROC  REMAINING            STARTTIME

   fr28n13.709.0 dsheppar    Running     1    0:55:09  Fri Aug 29 13:27:36
  fr28n07.2303.0 dsheppar    Running     1    0:55:10  Fri Aug 29 13:27:37
  fr17n08.1349.0 dsheppar    Running     1    1:02:29  Fri Aug 29 13:34:56
  fr28n15.4355.0 dsheppar    Running     1    1:03:08  Fri Aug 29 13:35:35
  fr28n05.2098.0 ebylaska    Running    16    1:25:17  Fri Aug 29 11:57:45
  fr28n05.2095.0    kossi    Running     1    1:26:24  Fri Aug 29 03:58:51
   fr28n13.683.0   xztang    Running     8    2:23:01  Thu Aug 28 17:52:08
  fr28n15.4354.0  moorejt    Running    16    3:41:06  Fri Aug 29 12:18:33
  fr17n08.1341.0    mukho    Running     8    3:41:48  Thu Aug 28 18:24:15
  fr17n05.1393.0    zhong    Running     8    4:01:47  Fri Aug 29 04:39:14
  fr28n05.2097.0    zhong    Running     8    4:50:03  Fri Aug 29 05:27:30
  fr28n11.3080.0    mukho    Running     8    5:12:21  Thu Aug 28 19:54:48
   fr28n13.682.0   wengel    Running    32    5:23:51  Thu Aug 28 19:56:58
  fr28n05.2064.0   vertex    Running     1    6:29:55  Thu Aug 28 23:02:22
  fr28n11.3037.0   vertex    Running     1    6:29:55  Thu Aug 28 23:02:22
    fr28n09.26.0    rampi    Running     1    8:37:27  Thu Aug 28 11:09:54
  fr17n08.1328.0   vertex    Running     1    9:29:49  Fri Aug 29 02:02:16
  fr17n10.1467.0    kossi    Running     1   10:27:10  Fri Aug 29 12:59:37
    fr28n09.49.0 holdzkom    Running     8   13:13:08  Fri Aug 29 11:45:35
  fr17n07.1498.0    jpark   Starting    16   14:10:05  Fri Aug 29 04:42:32
  fr17n05.1384.0    zhong    Running     8   18:45:27  Fri Aug 29 14:22:54
  fr28n07.2300.0  jimenez    Running    16   18:54:12  Fri Aug 29 09:26:39
   fr17n09.529.0   vertex    Running     1   19:03:49  Fri Aug 29 11:36:16
  fr28n01.1851.0   vertex    Running     1   19:09:49  Fri Aug 29 11:42:16
  fr17n11.1380.0   vertex    Running     1   19:41:22  Fri Aug 29 12:13:49
  fr17n16.1533.0   vertex    Running     1   20:04:32  Fri Aug 29 12:36:59
  fr17n06.1502.0   vertex    Running     1   20:16:24  Fri Aug 29 12:48:51
  fr17n10.1466.0   wengel    Running    32   20:24:04  Fri Aug 29 10:58:11
   fr28n13.701.0     kudo    Running     8   20:25:46  Fri Aug 29 10:58:13
  fr28n03.1689.0   vertex    Running     1   20:50:31  Fri Aug 29 13:22:58
   fr28n13.631.0   vertex    Running     1   21:17:40  Fri Aug 29 13:50:07
   fr28n13.708.0     yshi    Running     8   22:49:10  Fri Aug 29 13:21:37
  fr17n05.1395.0     yshi    Running     8   23:36:36  Fri Aug 29 14:09:03
  fr17n11.1388.0 jshoemak    Running    24   23:51:10  Fri Aug 29 14:23:37
  fr28n07.2304.0  rich001    Running     1   26:09:44  Fri Aug 29 13:42:11
  fr28n11.3091.0    rampi    Running     1   26:57:00  Fri Aug 29 05:29:27

36 Active Jobs   251 of 254 Processors Active   (Efficiency:    98.82)

ELIGIBLE JOBS----------------------
         JOBNAME USERNAME      STATE  PROC   CPULIMIT            QUEUETIME

  fr28n03.1718.0  ozturan       Idle    64    0:16:40  Thu Aug 28 22:25:48
  fr17n03.1430.0    jason       Idle   128    2:00:00  Wed Aug 27 00:56:49
  fr17n08.1331.0    jason       Idle   128    2:00:00  Wed Aug 27 00:56:21
  fr17n15.1393.0  moraiti       Idle   128    3:20:00  Fri Aug 29 09:58:56
   fr17n09.534.0  kdeacon       Idle    64    1:00:00  Fri Aug 29 04:38:48
   fr28n13.697.0    jpark       Idle    16   24:00:00  Fri Aug 29 03:44:45
  fr17n07.1499.0    jpark       Idle    16   24:00:00  Fri Aug 29 04:42:31
  fr17n06.1517.0   cholik       Idle    16   24:00:00  Fri Aug 29 06:45:46
   fr28n13.706.0  moorejt       Idle    16    5:55:00  Fri Aug 29 10:53:53
  fr17n16.1550.0  moorejt       Idle    16    7:55:00  Fri Aug 29 10:53:54
  fr17n12.1528.0 ebylaska       Idle    16    3:59:59  Fri Aug 29 12:11:30
  fr28n15.4356.0 dsheppar       Idle    16    3:00:00  Fri Aug 29 14:01:42
    fr28n09.50.0 dsheppar       Idle    16    3:00:00  Fri Aug 29 14:01:59
    fr28n09.51.0    zhong       Idle     8   13:55:00  Fri Aug 29 14:07:16
  fr17n16.1551.0    jacob       Idle     4    4:00:00  Fri Aug 29 12:51:19

15 Idle Job(s)

BLOCKED JOBS----------------
         JOBNAME USERNAME      STATE  PROC   CPULIMIT            QUEUETIME

  fr17n02.1476.0   vertex       Idle     1   22:00:00  Thu Aug 28 23:48:16
  fr17n05.1392.0   vertex SystemHold     1   22:00:00  Thu Aug 28 23:49:51
  fr17n10.1449.0   vertex       Idle     1   22:00:00  Tue Aug 26 23:49:51
  fr28n03.1674.0    maxia   UserHold     8   23:56:00  Mon Aug 25 16:22:10
  fr28n05.1581.0     sidt   UserHold     1    1:00:00  Sun Jul 27 12:46:17
  fr28n05.2092.0   vertex       Idle     1   22:00:00  Thu Aug 28 23:48:40
   fr28n13.705.2     gigi  NotQueued    32   15:58:00  Fri Aug 29 10:49:01
   fr28n13.705.3     gigi  NotQueued    32   13:58:00  Fri Aug 29 10:49:01
  fr17n08.1349.7 dsheppar  BatchHold     1    2:00:00  Fri Aug 29 13:34:44
  fr28n15.4355.1 dsheppar       Idle     1    2:00:00  Fri Aug 29 13:35:04
  fr28n15.4355.2 dsheppar   Deferred     1    2:00:00  Fri Aug 29 13:35:04
  fr28n15.4355.3 dsheppar       Idle     1    2:00:00  Fri Aug 29 13:35:04

Total Jobs: 63   Active Jobs: 36   Eligible Jobs: 15   Blocked Jobs: 12

The output of this command is divided into three parts, Active Jobs, Eligible Jobs, and Blocked Jobs.

Active jobs are those that are Running or Starting and consuming CPU resources. Displayed are the job name, the job's owner, and the job state. Also displayed are the number of processors allocated to the job, the amount of time remaining until the job completes (given in HH:MM:SS notation), and the time the job started. All active jobs are sorted in "Earliest Completion Time First" order.

* NOTE: job id's may be marked with a single character to to specify the following conditions:

Character Description
_ (underbar) job violates usage limit
* (asterisk) job is backfilled AND is preemptible
+ (plus) job is backfilled AND is NOT preemptible
- (hyphen) job is NOT backfilled AND is preemptible

Eligible Jobs are those that are queued and eligible to be scheduled. They are all in the Idle job state and do not violate any fairness policies or have any job holds in place. The jobs in the Idle section display the same information as the Active Jobs section except that the wall clock CPULIMIT is specified rather than job time REMAINING, and job QUEUETIME is displayed rather than job STARTTIME. The jobs in this section are ordered by job priority. Jobs in this queue are considered eligible for both scheduling and backfilling.

Blocked jobs are those that are ineligible to be run or queued. Jobs listed here could be in a number of states for the following reasons:

Idle Job violates a fairness policy. Use diagnose -q for more information.
UserHold A user hold is in place.
SystemHold An administrative or system hold is in place.
BatchHold A scheduler batch hold is in place (used when the job cannot be run because the requested resources are not available in the system or because the resource manager has repeatedly failed in attempts to start the job).
Deferred A scheduler defer hold is in place (a temporary hold used when a job has been unable to start after a specified number of attempts. This hold is automatically removed after a short period of time).
NotQueued Job is in the resource manager state NQ (indicating the job's controlling scheduling daemon in unavailable).

A summary of the job queue's status is provided at the end of the output.

Example 2

% showq -r

           JobName S  Pa Effic  XFactor  Q      User    Group  Nodes  Remaining            StartTime

     fr28n13.709.0 R   1 99.37      1.0  0  dsheppar      daf      1    0:55:50  Fri Aug 29 13:27:36
    fr28n07.2303.0 R   2 98.57      1.0  0  dsheppar      daf      1    0:55:51  Fri Aug 29 13:27:37
    fr17n08.1349.0 R   1 97.94      1.0  0  dsheppar      daf      1    1:03:10  Fri Aug 29 13:34:56
    fr28n15.4355.0 R   3 98.91      1.0  0  dsheppar      daf     64    1:03:49  Fri Aug 29 13:35:35
    fr28n05.2098.0 R   1 94.26      1.3  0  ebylaska    dnavy     16    1:25:58  Fri Aug 29 11:57:45
    fr28n05.2095.0 R   1 98.56      1.0  0     kossi      daf      1    1:27:05  Fri Aug 29 03:58:51
     fr28n13.683.0 R   1 99.75      1.0  0    xztang      daf      8    2:23:42  Thu Aug 28 17:52:08
    fr28n15.4354.0 R   3 98.90      1.2  0   moorejt      daf     16    3:41:47  Fri Aug 29 12:18:33
    fr17n08.1341.0 R   2 99.67      1.0  0     mukho    dnavy      8    3:42:29  Thu Aug 28 18:24:15
    fr28n05.2097.0 R   1 99.70      1.0  0     zhong     govt      8    4:50:44  Fri Aug 29 05:27:30
     fr28n13.682.0 R   3 99.83      1.1  0    wengel     univ     32    5:24:32  Thu Aug 28 19:56:58
    fr17n08.1328.0 R   3 99.69      2.5  0    vertex     univ      1    9:30:30  Fri Aug 29 02:02:16
    fr17n10.1467.0 R   3 98.12      1.0  0     kossi      daf      1   10:27:51  Fri Aug 29 12:59:37
    fr28n07.2300.0 R   1 97.60      1.1  0   jimenez    dnavy     16   18:54:53  Fri Aug 29 09:26:39
     fr17n09.529.0 R   1 99.10      2.9  0    vertex     univ      1   19:04:30  Fri Aug 29 11:36:16
    fr28n01.1851.0 R   1 98.01      2.9  0    vertex     univ      1   19:10:30  Fri Aug 29 11:42:16
    fr17n10.1466.0 R   1 99.51      1.2  0    wengel     univ     32   20:24:45  Fri Aug 29 10:58:11
     fr28n13.701.0 R   3 98.91      1.2  0      kudo      daf      8   20:26:27  Fri Aug 29 10:58:13
     fr28n13.631.0 R   1 99.89      3.0  0    vertex     univ      1   21:18:21  Fri Aug 29 13:50:07
    fr17n05.1395.0 R   2 95.29      1.0  0      yshi     univ      8   23:37:17  Fri Aug 29 14:09:03
    fr17n11.1388.0 R   2 63.46      1.4  0  jshoemak      daf     24   23:51:51  Fri Aug 29 14:23:37
    fr28n07.2304.0 R   1 97.62      1.0  0   rich001      daf      1   26:10:25  Fri Aug 29 13:42:11
    fr28n11.3091.0 R   1 98.87      1.0  0     rampi     univ      1   26:57:41  Fri Aug 29 05:29:27

23 Jobs   251 of 254 Processors Active   (Efficiency:    98.82)

The fields are as follows:

JobName Name of active job.
S Job State. Either "R" for Running or "S" for Starting.
Pa Partition in which job is running.
Effic CPU efficiency of job.
XFactor Current expansion factor of job, where XFactor = (QueueTime + WallClockLimit) / WallClockLimit
Q Quality Of Service specified for job.
User User owning job.
Group Primary group of job owner.
Nodes Number of processors being used by the job.
Remaining Time the job has until it has reached its wall clock limit. Time specified in HH:MM:SS notation.
StartTime Time job started running.

After displaying the running jobs, a summary is provided indicating the number of jobs, the number of allocated processors, and the system utilization.

Example 3

% showq -i

           JobName    Priority  XFactor  Q      User    Group  Nodes   WCLimit     Class      SystemQueueTime

    fr28n03.1718.0*   97615272     59.0  0   ozturan     govt     64   0:16:40     batch  Thu Aug 28 22:25:48
    fr17n03.1430.0      125372     11.0  0     jason      asp    128   2:00:00    medium  Thu Aug 28 18:29:26
     fr28n13.634.0      125365     11.0  0     jason      asp    128   2:00:00    medium  Thu Aug 28 18:30:04
      fr28n09.32.0      118071      7.0  0   moraiti     univ    128   3:20:00     batch  Thu Aug 28 18:32:58
    fr17n15.1393.0      110712      2.4  0   moraiti     univ    128   3:20:00     batch  Fri Aug 29 09:58:56
     fr17n09.534.0       68841     10.9  0   kdeacon      pdc     64   1:00:00     batch  Fri Aug 29 04:38:48
     fr28n13.697.0       21102      1.4  0     jpark    dnavy     16  24:00:00     batch  Fri Aug 29 03:44:45
    fr17n07.1499.0       20906      1.4  0     jpark    dnavy     16  24:00:00     batch  Fri Aug 29 04:42:31
    fr17n06.1517.0       20604      1.3  0    cholik     univ     16  24:00:00     batch  Fri Aug 29 06:45:46
     fr28n13.706.0       20180      1.6  0   moorejt      daf     16   5:55:00     batch  Fri Aug 29 10:53:53
    fr17n16.1550.0       20024      1.5  0   moorejt      daf     16   7:55:00     batch  Fri Aug 29 10:53:54
    fr17n12.1528.0       19916      1.6  0  ebylaska    dnavy     16   3:59:59     batch  Fri Aug 29 12:11:30
      fr28n09.50.0       19097      1.2  0  dsheppar      daf     16   3:00:00     batch  Fri Aug 29 14:01:59
      fr28n09.51.0       12547      1.0  0     zhong     govt      8  13:55:00     batch  Fri Aug 29 14:07:16
    fr17n16.1551.0        9390      1.0  0     jacob     univ      4   4:00:00     batch  Fri Aug 29 14:22:09

Jobs: 15  Total BackLog:  6434 Node Hours  ( 25.33 Hours)

The fields are as follows:

JobName Name of job.
Priority Calculated job priority.
XFactor Current expansion factor of job, where XFactor = (QueueTime + WallClockLimit) / WallClockLimit
Q Quality Of Service specified for job.
User User owning job.
Group Primary group of job owner.
Nodes Minimum number of processors required to run job.
WCLimit Wall clock limit specified for job. Time specified in HH:MM:SS notation.
Class Class requested by job.
SystemQueueTime Time job was admitted into the system queue.

An asterisk at the end of a job (job fr28n03.1718.0* in this example) indicates that the job has a job reservation created for it. The details of this reservation can be displayed using the checkjob command.

After displaying the job listing, the command summarizes the workload in the idle queue and indicates the total workload backlog in proc-hours. The value in parenthesis indicates the minimum amount of time required to run this workload using the currently available nodes on the system.

Related Commands

Use the showbf command to display resource availability.

Use the diagnose command to display partition configuration.

Use the checkjob command to check the status of a particular job. [an error occurred while processing this directive] [an error occurred while processing this directive]