[gold-users] Gold rejects reservation attempt from Moab made with setres -A

Scott Jackson scottmo at adaptivecomputing.com
Tue Jan 25 13:09:27 MST 2011


Chris,

This is a bug in Moab that I have seen before. Moab should be making a call to Account Withdraw or Job Charge, but not to Job WithDrawal. I would recommend opening a support ticket with Moab support.

Thanks,

Scott
 

----- Original Message -----
> From: "Christopher Samuel" <samuel at unimelb.edu.au>
> To: "Gold Users Mailing List" <gold-users at supercluster.org>
> Sent: Monday, January 24, 2011 8:18:46 PM
> Subject: [gold-users] Gold rejects reservation attempt from Moab made with setres -A
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Hiya,
> 
> We're trying to create a reservation in Moab 5.4.2 with
> setres and the manual page says that there is a handy -A
> option to charge a project for the unused time.
> 
> However, when I try and set one up I get an error saying:
> 
> tambo-m:~ # setres -u samuel -A VLSCI -s 15:30_02/01 -d 00:30 ALL
> ERROR: cannot create requested reservation - failed allocation charge
> 
> Looking at the Moab logs it states:
> 
> 01/25 14:14:52 ALERT: request failed with status code 740
> (WithDrawal is not a valid action for a Job object)
> 
> Taken from this excerpt:
> 
> 01/25 14:14:51 INFO: received service request from host 'tambo-m'
> 01/25 14:14:51 MSURecvData(,5000000,TRUE,SC,EMsg)
> 01/25 14:14:51 MUIRsvCtl(S,162,root)
> 01/25 14:14:51 MSysRegEvent(RESERVATIONCREATED: samuel.29 User
> samuel.29 1295925291 1296534600 1296534630 0
> ,0,0,1)
> 01/25 14:14:51 MSysLaunchAction(ASList,)
> 01/25 14:14:51 WARNING: partial rsv samuel.29 reserved 8192 of 4 procs
> in partition 'ALL' to start in 7:01:15:09 at (1296534600) Tue Feb 1
> 15:30:00
> 01/25 14:14:51 MS3DoCommand(allocation-manager,NULL,OBuf,ODE,SC,EMsg)
> 01/25 14:14:51 MSUSendData(S,30000000,FALSE,FALSE,SC,NULL)
> 01/25 14:14:51 INFO: packet sent (466 bytes of 466)
> 01/25 14:14:51 INFO: command sent to server
> 01/25 14:14:51 MSURecvData(,30000000,FALSE,SC,EMsg)
> 01/25 14:14:52 ALERT: request failed with status code 740
> (WithDrawal is not a valid action for a Job object)
> 01/25 14:14:52 MSUDisconnect(13)
> 01/25 14:14:52 ERROR: cannot receive response from
> allocation-manager server 'gold.vlsci.unimelb.edu.au':7112
> 01/25 14:14:52 MSysRegEvent(FAILURE: cannot receive response from
> allocation-manager server gold.vlsci.unimelb.edu.au:7112 (cmd:
> '<XML>')
> ,0,0,1)
> 01/25 14:14:52 MSysLaunchAction(ASList,)
> 01/25 14:14:52 ALERT: cannot debit allocation for rsv
> 01/25 14:14:52 MRsvDestroyCredLock(samuel.29)
> 01/25 14:14:52 MSysRegEvent(RESERVATIONDESTROYED: samuel.29 User
> 1295925291 1296534600 1296534630 0
> ,0,0,1)
> 01/25 14:14:52 MSysLaunchAction(ASList,)
> 01/25 14:14:52 MSUSendData(S,5000000,TRUE,TRUE,SC,NULL)
> 01/25 14:14:52 INFO: packet sent (446 bytes of 446)
> 01/25 14:14:52 MSUDisconnect(12)
> 
> The goldd.log says:
> 
> 2011-01-25 14:12:21.165 INFO Gold::Message::receiveChunk Read message
> payload (361, <Envelope component="ClusterScheduler" count="1"
> name="moab" version="5.4.2"><Signature><DigestValue>9peicJB2Lc
> RG1i/YbzFrraFNKgY=</DigestValue><SignatureValue>eULY/7TlllOzICQi2840LXaJS8w=</SignatureValue></Signature><Body
> actor="root" timestamp="1295925291"><Request action="WithDrawal"
> actor="root"><Object>
> Job</Object><Data></Data></Request></Body></Envelope>).
> 2011-01-25 14:12:21.165 TRACE Gold::Message::unmarshallChunk invoked
> with arguments: (<Envelope component="ClusterScheduler" count="1"
> name="moab" version="5.4.2"><Signature><DigestValue>9peicJB2L
> cRG1i/YbzFrraFNKgY=</DigestValue><SignatureValue>eULY/7TlllOzICQi2840LXaJS8w=</SignatureValue></Signature><Body
> actor="root" timestamp="1295925291"><Request action="WithDrawal"
> actor="root"><Object
> >Job</Object><Data></Data></Request></Body></Envelope>)
> 2011-01-25 14:12:21.166 TRACE Gold::Chunk::authenticate invoked with
> arguments: (XML::LibXML::Document=SCALAR(0xa668db8))
> 2011-01-25 14:12:21.167 DEBUG Gold::Chunk::authenticate The security
> token type is (Symmetric).
> 2011-01-25 14:12:21.167 DEBUG Gold::Chunk::authenticate The
> canonicalized body text is (<Body actor="root"
> timestamp="1295925291"><Request action="WithDrawal"
> actor="root"><Object>Job</Object><Data></Data></Request></Body>).
> 2011-01-25 14:12:21.167 DEBUG Gold::Chunk::authenticate The
> base64-encoded message digest is (9peicJB2LcRG1i/YbzFrraFNKgY=).
> 2011-01-25 14:12:21.168 DEBUG Gold::Chunk::authenticate The
> base64-encoded MAC is (eULY/7TlllOzICQi2840LXaJS8w=).
> 2011-01-25 14:12:21.168 TRACE Gold::Request::new invoked with
> arguments: ()
> 2011-01-25 14:12:21.169 DEBUG Gold::Message::unmarshallChunk Extracted
> the request ((WithDrawal, root, [(Job, , )], [], [], [], [], [], 0, 0,
> 100000, )).
> 2011-01-25 14:12:21.169 DEBUG Gold::Message::receiveChunk Read message
> tail ().
> 2011-01-25 14:12:21.170 TRACE Gold::Proxy::new invoked with arguments:
> (request => Gold::Request=HASH(0x9ce0a48))
> 2011-01-25 14:12:21.170 TRACE Gold::Database::new invoked with
> arguments: ()
> 2011-01-25 14:12:21.172 TRACE Gold::Database::nextId invoked with
> arguments: (Request)
> 2011-01-25 14:12:21.173 TRACE Gold::Database::nextId generated new id
> 1828771 for Request
> 2011-01-25 14:12:21.174 TRACE Gold::Proxy::prepare invoked with
> arguments: (Gold::Request=HASH(0x9ce0a48))
> 2011-01-25 14:12:21.174 TRACE Gold::Cache::objectExists invoked with
> arguments: (Job)
> 2011-01-25 14:12:21.176 TRACE Gold::Cache::actionExists invoked with
> arguments: (Job, WithDrawal)
> 2011-01-25 14:12:21.179 ERROR Gold::Exception::new WithDrawal is not a
> valid action for a Job object at /usr/local/gold/lib/Gold/Proxy.pm
> line 337
> 2011-01-25 14:12:21.181 ERROR main::__ANON__ Gold server error (740):
> WithDrawal is not a valid action for a Job object.
> 2011-01-25 14:12:21.181 TRACE Gold::Response::new invoked with
> arguments: ()
> 2011-01-25 14:12:21.181 TRACE Gold::Response::failure invoked with
> arguments: (740, WithDrawal is not a valid action for a Job object)
> 2011-01-25 14:12:21.182 TRACE Gold::Reply::new invoked with arguments:
> (connection => IO::Socket::INET=GLOB(0xa66d6c0))
> 2011-01-25 14:12:21.182 TRACE Gold::Reply::sendChunk invoked with
> arguments: (Gold::Chunk=HASH(0xa64dee8))
> 2011-01-25 14:12:21.182 TRACE Gold::Reply::marshallChunk invoked with
> arguments: (Gold::Chunk=HASH(0xa64dee8))
> 2011-01-25 14:12:21.183 DEBUG Gold::Reply::sendChunk Writing reply
> header (HTTP/1.1 200 OK
> Content-Type: text/xml; charset="utf-8"
> Transfer-Encoding: chunked
> ).
> 2011-01-25 14:12:21.183 INFO Gold::Reply::sendChunk Writing reply
> payload (231, <?xml version="1.0" encoding="UTF-8"?>
> <Envelope><Body><Response
> actor="root"><Status><Value>Failure</Value><Code>740</Code><Message>WithDrawal
> is not a valid action for a Job
> object</Message></Status></Response></Body></Envelope>
> ).
> 
> 
> Any idea which end the error is at ?
> 
> cheers!
> Chris
> - --
> Christopher Samuel - Senior Systems Administrator
> VLSCI - Victorian Life Sciences Computation Initiative
> Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
> http://www.vlsci.unimelb.edu.au/
> 
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.10 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
> 
> iEYEARECAAYFAk0+QRYACgkQO2KABBYQAh/DYwCaAoPcgiByWof0mRvWPX4z9jSZ
> vaQAnA+hXyIq9kOx9qIHG0rJzKsshObU
> =1jdO
> -----END PGP SIGNATURE-----
> _______________________________________________
> gold-users mailing list
> gold-users at supercluster.org
> http://www.supercluster.org/mailman/listinfo/gold-users


More information about the gold-users mailing list