[torqueusers] pbs_mom segfaulting

Ken Nielson knielson at clusterresources.com
Wed Jan 28 09:15:51 MST 2009


A core or at least the output of a stack trace would be nice. I would like to know if it is the same issue Joshua has described and if his fix might address the seg fault you have.


----- Original Message -----
From: "Joshua Bernstein" <jbernstein at penguincomputing.com>
To: "Jan Lindheim" <lindheim at cacr.caltech.edu>
Cc: torqueusers at supercluster.org
Sent: Tuesday, January 27, 2009 5:49:29 PM GMT -07:00 US/Canada Mountain
Subject: Re: [torqueusers] pbs_mom segfaulting

Hi Jan,

Jan Lindheim wrote:
> After upgrading to the torque 2.3.6 recently, we have seen pbs_mom
> segfaulting and jobs getting stuck.  This is on an Opteron system, running
> SLES9.1.  Has anybody else reported instability with pbs_mom lately?

I've personally had problems with 2.3.6 and other versions producing a SEGV. You 
might want to read through the thread here:


I have an RPM of version of 2.4.0 I can send you that contains the fix I 
proposed in the post aforementioned. I'd be curious to see if that fixes your 
issue. Ping me off list and I'd be happy to send you the RPM.

-Joshua Bernstein
Senior Software Engineer
Penguin Computing
torqueusers mailing list
torqueusers at supercluster.org

More information about the torqueusers mailing list