[Mauiusers] Maui unresponsive while writing strange lines unto log file

Tom Rudwick tomr at intrinsity.com
Thu Dec 13 10:38:14 MST 2007


It is debug printing that happens when it detects corruption in
the reservation table. (Why that happens is another issue...)

I've attached a patch that we use to turn off this printing unless
the log level is turned up.

Tom


Manuel Reiter wrote:
> Hi,
> 
> I'm running maui 3.2.6p14 and torque 2.0.0p8 on a ~250 node Opteron  
> cluster. While scheduling works fine, maui is often unresponsive to  
> commands like showq, showres and so on. While this is the case, maui  
> seems to write many lines of the form
> 
> 12/13 13:45:23 INFO:     R1[109]  S: 1197730705  E: 1197730718  T:  170  
> N: 92
> 
> into the log file, although I have
> 
> LOGLEVEL              0
> 
> and, experimentally, even
> 
> LOGFACILITY             fLL
> 
> in my maui config file.
> 
> Today alone, maui has written about 200.000 of these lines in about 5  
> hours. The pattern is that the index after R1 goes from 0 to 254,  
> followed by two lines like the above but with R1[n] replaced by R2[0]  
> and R2[1] and then things start over. Between these bursts, I have  
> hours when none of these lines appear in the log and maui is quite  
> responsive.
> 
> Can anybody tell me what these lines actually mean and why maui is  
> spitting out so many of them? Or provide any other insight into my  
> problem os unresponsiveness? I have put
> 
> RMPOLLINTERVAL        300
> NODEPOLLFREQUENCY       20
> JOBAGGREGATIONTIME 60
> 
> in the maui config in the hopes that this would make things better,  but 
> this didn't change things.
> 
> On another cluster I run (same maui, torque 2.0.0p4) no similar lines  
> appear in the maui logs at all.
> 
> Any help would be greatly appreciated.
> 
> Thanks and best regards,
> 
>   Manuel
> 
> ------------------------------------------------------------------------ 
> ------
> Manuel Reiter                      |         reiter at th.physik.uni- 
> frankfurt.de
> Center for Scientific Computing    |
> J.W.Goethe Universität             |
> D-60054 Frankfurt am Main          |
> Germany                            | 
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers
> 

-------------- next part --------------
--- MRes.c~	2007-03-28 15:05:39.000000000 -0500
+++ MRes.c	2007-10-30 16:14:05.334437000 -0500
@@ -5973,7 +5973,6 @@
           C[index1].EndTime);
         }
       }
-    }
 
   if (IsCorrupt == TRUE)
     {
@@ -5997,6 +5996,7 @@
         R2[index1].NodeCount);
       }  /* END for (index1) */
     }    /* END if (IsCorrupt == TRUE) */
+    }
 
   memcpy(R1,C,sizeof(mrange_t) * (cindex + 1));
 


More information about the mauiusers mailing list