[torqueusers] BUG: MOM segfaults

Wightman wightman at clusterresources.com
Fri Feb 4 11:21:41 MST 2005


Marc,

  TORQUE 1.2.0p0 now contains code that prevents the crash by properly
handling the fact that the srj structure is optional.  This has been
rolled in for both unicos and irix.  However, whether the whole
set_globid() call is required at all in the current release has not
been addressed.  Thanks to everyone in the community who is currently
looking at this.
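
For anyone following along, the kind of guard involved can be sketched
roughly as below.  This is only an illustration, not the code that went
into 1.2.0p0: the struct layouts and the field names (sj_session,
ji_globid, ji_globid_set) are hypothetical stand-ins, and the real
irix6array and unicos implementations do platform-specific work that is
left out here.  The only point is that the second argument is checked
before it is dereferenced.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, stripped-down stand-ins for the real TORQUE types. */
struct startjob_rtn {
    long sj_session;     /* hypothetical: id reported back by the child */
};

typedef struct job {
    long ji_globid;      /* hypothetical: global id recorded for the job */
    int  ji_globid_set;  /* hypothetical: nonzero once ji_globid is valid */
} job;

/*
 * Sketch of a NULL-tolerant set_globid().  srj is optional: a caller
 * such as init_abort_job() may pass NULL, so it must be checked before
 * any field is read.
 */
void set_globid(job *pj, struct startjob_rtn *srj)
{
    assert(pj != NULL);  /* the job itself is assumed to be mandatory */

    if (srj == NULL) {
        /* Nothing to copy; leave the job untouched rather than crash. */
        return;
    }

    pj->ji_globid = srj->sj_session;
    pj->ji_globid_set = 1;
}
```

With a guard like this, set_globid(pj,NULL) simply returns, while a
call with a populated structure still records the id as before.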

Doug



On Tue, 2005-02-01 at 10:50 -0700, Marc Aurele La France wrote:
> Hi.
> 
> init_abort_job() in src/resmom/catch_child.c contains a call to 
> set_globid(pj,NULL).  Consequently, it behooves all set_globid() 
> implementations in the various src/resmom/*/mom_start.c's to be able to deal 
> with a NULL struct startjob_rtn pointer.
> 
> But this turns out not to be the case for the irix6array and unicos8 
> implementations.
> 
> This bug report applies to all torque versions since the introduction of 
> set_globid(), including all recent snapshots.
> 
> Marc.
> 
> +----------------------------------+-----------------------------------+
> |  Marc Aurele La France           |  work:   1-780-492-9310           |
> |  Computing and Network Services  |  fax:    1-780-492-1729           |
> |  352 General Services Building   |  email:  tsi at ualberta.ca          |
> |  University of Alberta           +-----------------------------------+
> |  Edmonton, Alberta               |                                   |
> |  T6G 2H1                         |     Standard disclaimers apply    |
> |  CANADA                          |                                   |
> +----------------------------------+-----------------------------------+
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://supercluster.org/mailman/listinfo/torqueusers


