Moab provides numerous interfaces allowing it to monitor and manage most services and resources. It also possesses flexible interfaces to allow it to interact with peer services and applications as both a broker and an information service. This appendix is designed to provide a general overview and links to more detailed interface documentation.
Query resource availability, submit, modify, and manage jobs, and query the status of active and completed jobs.
Moab interfaces to systems providing various services and using various protocols. This appendix is designed to assist users who want to enable Moab in new environments using one of the existing interfaces. It does not cover the steps required to create a new interface.
H.1 Query and Control APIs
The Moab Cluster and Grid Suites provide a (Moab) workload manager server that supports a broad array of client services. These services can either be directly accessed via Moab client commands or by way of a number of APIs. Which approach is best will depend on the particular use case.
The Moab C API provides access to all Moab information and services. This API consists of both easy to use high-level calls and powerful and flexible low-level calls.
Host: (char *) optional server hostname or NULL Port: (int) optional server port or 0 C: (void **) opaque interface handle
Returns 0 on success or -1 on failure
This call initializes the interface between the client and the server. It must be called once before attempting to issue any Moab client calls.
MCCJobCancel(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to cancel (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will cancel the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobCheckpoint(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to cancel (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will checkpoint the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobGetNumAllocRes(C,JID,Count,EMsg)
C: (void **) address of interface handle JID: (char *) job id to query (optional) Count: (int *) output value (number of allocated hosts) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will return the number of allocated hosts for the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobGetRemainingTime(C,JID,Time,EMsg)
C: (void **) address of interface handle JID: (char *) job id to query (optional) Time: (long *) output value (remaining time in seconds) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will return the remaining wallclock duration/time for the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
NOTE: This call can be made to cache data for performance reasons by setting the MCCCACHETIME #define to the desired cache duration (in seconds) at build time.
This call will return a list of allocated hosts for the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobMigrate(C,JID,Dest,EMsg)
C: (void **) address of interface handle JID: (char *) job id to migrate (optional) Dest: (char *) destination cluster or list of hosts (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will migrate the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobModify(C,JID,Attr,Val,Op,EMsg)
C: (void **) address of interface handle JID: (char *) id of job to modify (optional - if not specified, modify job in current context) Attr: (char *) job attribute to modify (see mjobctl) Val:(char *) value to use when modifying specified attribute Op:(char *) operation to perform on attribute (one of set, unset, incr, or decr) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will modify the attributes of either the specified job (if JID is populated) or the job in the current context (if JID is empty). If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobQuery(C,JID,OBuf,EMsg)
C: (void **) address of interface handle JID: (char *) job id to query (optional - if not specified, report all jobs) OBuf: (char **) output response string (reported in XML) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will suspend the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobRequeue(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to requeue (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will requeue the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobResume(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to resume (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will resume the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobSignal(C,JID,Signo,EMsg)
C: (void **) address of interface handle JID: (char *) job id to resume (optional) Signo: (int) job signal to send to job EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will signal the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobStart(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to start/run (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will immediately run/execute the specified job ignoring policies and dependencies provided that adequate idle resources are available. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCJobSubmit(C,JDesc,JID,EMsg)
C: (void **) address of interface handle JDesc: (char *) XML job description (details.doc, detail.html) JID: (char **) job id of new job (JID is populated on success only) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will submit the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message. On success, memory is allocated for JID. JID is then populated with a new job's ID (which can then be used in calls such as MCCJobQuery and MCCJobCancel). The client is responsible for freeing the space allocated for JID.
MCCJobSuspend(C,JID,EMsg)
C: (void **) address of interface handle JID: (char *) job id to suspend (optional) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will suspend the specified job. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCNodeModify(C,NID,Attr,Val,Op,EMsg)
C: (void **) address of interface handle NID: (char *) id of node to modify (optional - if not specified, modify job in current context) Attr: (char *) node attribute to modify (see mnodectl) Val:(char *) value to use when modifying specified attribute Op:(char *) operation to perform on attribute (one of set, unset, incr, or decr) EMsg: (char *) optional error message
Returns 0 on success or -1 on failure
This call will modify the attributes of either the specified node (if NID is populated) or the local node (if NID is empty). If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
MCCFree(C)
C: (void **) address of interface handle
Returns 0 on success or -1 on failure
This call will free allocated interface handle memory
Example 1 - Interfacing to an Explicitly Specified Moab Server
Example 2 - Interface to Monitor Remaining Time w/in a Batch Job
Low-Level C API
The Moab low-level C API allows direct access to all Moab client services. This API uses the MCCExecute routine and takes as an argument an XML request string. This interface provides a high-speed, low-latency interface directly into the server and provides very broad and flexible cluster, workload, and resource control. This routine creates a direct socket connection to provides the following services:
socket creation
client authentication
server discovery via built-in, configfile, and environment settings
command request and response encryption
marshalling and de-marshalling of requests and data
Requests are sent using native XML request strings and responses are returned in unformatted native XML. Use of Cluster Resources Consulting Services is recommended for organizations using this interface.
Function
Args
Return Code
Description
MCCInitialize(Host,Port,&C)
Host: (char *) optional server hostname or NULL Port: (int) optional server port or 0 C: (void **) opaque interface handle
Returns 0 on success or -1 on failure
This call initializes the interface between the client and the server. It should be called once before attempting to issue any Moab client calls if connecting to a server at a non-default location. If connecting to the default server, client commands will automatically initialize the interface handle.
This call executes the requested command specified by the RBuf string using the client command specified by RCmd. Command output is placed in the OBuf buffer. This buffer is allocated and must be freed after use. The EMsg parameter is a 1024 character array and is optional. If failures occur and EMsg is provided, this buffer will be populated with a human-readable error message.
H.1.2 CLI (Command Line Interface) XML API
All Moab client commands can report results in XML format to allow the information to be easily integrated into peer services, portals, databases, and other applications. To request that a client command report its output in XML, specify the --format=xml flag as in the following example:
Moab accounting interfaces allow Moab to export local utilization statistics, events, and accounting information directly to databases and peer services. Moab interfaces can also import global and remote accounting information for use in local decision making.
Moab provides interfaces to allow interaction with various grid brokers and services. These interfaces allow Moab to provide services as well as utilize services.
Services Utilized
Information Services (import and utilize information service data in making scheduling decisions)
Moab can import and export key event information regarding workload, cluster resources, cluster and grid services, and other components of hardware and software infrastructure.
By default, these clients communicate with the scheduler using the U.S. Department of Energy (DOE) Scalable Systems Software socket and wire protocols. These protocols are largely HTML- and XML-based, using PKI, 3DES, MD5, challenge, and other security protocols and are documented within the SSS project pages.
As part of this initiative, the scheduler/client protocol has been extended to support multiple socket level protocol standards in communicating with its clients and peer services. These include SingleUseTCP, SSS-HALF, and HTTP. The client socket protocol can be specified by setting the MCSOCKETPROTOCOL parameter to SUTCP, SSS-HALF, or HTTP. Further protocols are being defined and standardized over time and backwards compatibility will be maintained. Documentation on the SSS-HALF implementation can be found within the DOE's SSS Project Notebooks.
H.7 Job Submission and Management Interface
Moab provides interfaces to enable the following services:
Resource Availability Query
Determine quantity, state, and configuration of configured resources (idle, busy, and down nodes)
Determine quantity and configuration of all available resources (idle nodes)
Determine resources available subject now and in the future for potential job
Determine best target cluster destination for potential job
Determine largest/longest job which could start immediately
Determine estimated start time for potential job
Determine earliest guaranteed start time for potential job
Reserve Resources
Reserve specific resources for desired time frame
Submit Job
Submit job to specific cluster
Submit job to global job queue
Manage Job
Hold job
Adjust job priority
Modify job executable, args, data requirements, job dependencies, duration, hostcount, or other attributes
Suspend/resume job
Checkpoint/requeue job
Cancel job
Migrate job
Adjust job quality of service (QoS)
Query Job
Determine job state, utilization, or output results for idle, active, or completed job