[torqueusers] pbs_sched problem in 4.2.5

Ken Nielson knielson at adaptivecomputing.com
Tue Sep 17 10:09:54 MDT 2013


Josh,

You are right. We need to fix pbs_sched.

ken


On Tue, Sep 17, 2013 at 9:41 AM, Trutwin, Joshua <JTRUTWIN at csbsju.edu> wrote:

> Yes it is running.
>
> # qmgr -c 'p s'
> #
> # Create queues and set their attributes.
> #
> #
> # Create and define queue batch
> #
> create queue batch
> set queue batch queue_type = Execution
> set queue batch resources_default.nodes = 1
> set queue batch resources_default.walltime = 01:00:00
> set queue batch enabled = True
> set queue batch started = True
> #
> # Set server attributes.
> #
> set server scheduling = True
> set server acl_hosts = torque.csbsju.edu
> set server managers = root at torque.csbsju.edu
> set server operators = root at torque.csbsju.edu
> set server default_queue = batch
> set server log_events = 511
> set server mail_from = adm
> set server scheduler_iteration = 600
> set server node_check_rate = 150
> set server tcp_timeout = 300
> set server job_stat_rate = 45
> set server poll_jobs = True
> set server log_level = 4
> set server disable_server_id_check = True
> set server mom_job_sync = True
> set server mail_domain = csbsju.edu
> set server keep_completed = 300
> set server submit_hosts = lincl[1-17]
> set server submit_hosts += lin[1-24]
> set server submit_hosts += lincsb[1-3]
> set server submit_hosts += linhab[1-2]
> set server submit_hosts += linfac[1-6]
> set server submit_hosts += linmath[1-4]
> set server submit_hosts += linphys[1-9]
> set server submit_hosts += linphysfac[1-4]
> set server submit_hosts += nx
> set server allow_node_submit = True
> set server allow_proxy_user = True
> set server auto_node_np = True
> set server next_job_number = 16
> set server record_job_info = True
> set server record_job_script = True
> set server moab_array_compatible = True
>
> I installed Maui and things are working well for me now, but it would be
> nice if pbs_sched worked as well.
>
> Thanks,
>
> Josh
>
> From: torqueusers-bounces at supercluster.org
> [mailto:torqueusers-bounces at supercluster.org] On Behalf Of Ken Nielson
> Sent: Friday, September 13, 2013 11:30 AM
> To: Torque Users Mailing List
> Subject: Re: [torqueusers] pbs_sched problem in 4.2.5
>
> Do you have trqauthd running?
>
> What does your qmgr -c 'p s' output look like?
>
> Thanks
>
> On Thu, Sep 12, 2013 at 6:19 PM, Trutwin, Joshua <JTRUTWIN at csbsju.edu>
> wrote:
>
> Hi,
>
> I think I’m running into a known issue but wanted to confirm.
>
> I set up a simple Torque environment using 4.2.5. I have a single compute
> node, and when I try to submit a test job it winds up getting stuck in the
> queue until I run qrun to force it.  I ran the scheduler like so:
>
> export PBSDEBUG=1
> export PBSLOGLEVEL=3
> /opt/torque-4.2.5/sbin/pbs_sched
>
> When I submit the job, this shows up in the console:
>
> pbs_statserver failed: 15033
> Problem with creating server data structure
>
> Looking up this error, I see these two posts about it:
>
> http://comments.gmane.org/gmane.comp.clustering.torque.user/13273
> http://comments.gmane.org/gmane.comp.clustering.torque.user/13058
>
> Is there a fix or do I have to switch to Maui?
>
> Thanks,
>
> Josh
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
>
>
> --
> Ken Nielson
> +1 801.717.3700 office +1 801.717.3738 fax
> 1712 S. East Bay Blvd, Suite 300  Provo, UT  84606
> www.adaptivecomputing.com
>


-- 
Ken Nielson
+1 801.717.3700 office +1 801.717.3738 fax
1712 S. East Bay Blvd, Suite 300  Provo, UT  84606
www.adaptivecomputing.com