[torqueusers] Cluster Resources Newsletter - June 2005

amber webb awebb at clusterresources.com
Tue Jun 28 11:31:05 MDT 2005


Cluster Resources Newsletter - June 2005
Optimizing your Compute Resources
__________________________________________________
 
www.ClusterResources.com 
__________________________________________________
 
NEWS/ANNOUNCEMENTS 
 
1. ANNOUNCING CLUSTER BUILDER
2. TORQUE 1.2.0p5
3. MOAB GRID SUITE UPDATES
4. DID YOU KNOW: AUTOMATED RESERVATION POLICIES
 
******************************************************
ANNOUNCING CLUSTER BUILDER
 
Cluster Resources, Inc. and LinuxHPC.org announced the 
release of Cluster Builder (www.clusterbuilder.org) - a new Web 
site highlighting a broad spectrum of technologies and high 
performance computing (HPC) information. 
 
Cluster Builder acts as an organized repository of first- and 
third-party content relating to supercomputing technologies 
that helps alleviate the research burden of sifting through 
millions of search engine results. The Web site is organized 
into software categories such as operating systems, resource 
managers, compilers, etc., as well as hardware categories 
such as processors, switches and interconnect standards. 
Cluster Builder includes both open source and commercial 
products. 
 
To further aid in the research, Cluster Builder directs evaluators 
to LinuxHPC's free Request-For-Quote service. Purchasers can 
use this service to complete one quote form, which is sent to 
multiple vendors, instead of completing a form for each individual 
vendor.
 
******************************************************
TORQUE 1.2.0p5
 
TORQUE* Resource Manager's user community, in cooperation 
with Cluster Resources, contributed several fixes in the last 
month to create TORQUE 1.2.0p5.
 
One important fix stops the problem of qstat hanging while 
pbs_server is waiting for an update from an offline MOM. 
The problem occurred when a MOM in the "down, job-exclusive" 
state was marked offline using the "pbsnodes -o" command. 
 
A patch submitted by the University of Maine fixes a communication 
error during job cancellation. When Cluster Resources tested the 
patch, they found it increased the performance of qdel in certain 
circumstances.
 
On behalf of the user community we would also like to thank all 
those not mentioned above who helped find and fix problems 
and test new patches. To learn more about TORQUE visit:
 http://www.clusterresources.com/products/torque/. To join 
TORQUE's user community go to:
 http://www.clusterresources.com/mailing.shtml.
 
******************************************************
MOAB GRID SUITE UPDATES
 
The Alpha version of the new Moab Grid Suite continues to 
receive valuable updates. The new version will include true 
peer-to-peer (P2P) capabilities, so local clusters can set their 
own policies, instead of relying on an administrator to set 
policies. An efficient multi-threaded I/O protocol ensures that 
Moab commands and P2P communication is fast and doesn't 
suffer from unnecessary blocking.
 
Another new feature in Moab Grid Suite now protects users 
from losing information due to the Internet's sometimes unreliable 
connections and bandwidth or a downed head-node. Through 
a new robust check-pointing feature, grid users can get 
up-to-date information on their grid workloads, following 
connectivity failure or failure on any of the peer head nodes.
 
Moab Grid Suite will be available later this summer. To learn more 
about the product, and its capabilities, visit 
http://www.clusterresources.com/products/moabgridsuite.shtml.
 
******************************************************
DID YOU KNOW: AUTOMATED RESERVATION POLICIES
 
Reservations can be created in Moab to ensure users are allowed 
the resources they need to complete critical tasks. However, a 
reservation can become
his or her plans, or reserves more 
resources than necessary. This leaves a set of resources that no 
one else can use, thereby decreasing the efficiency of cluster 
resources. 
 
To combat this problem, Moab Workload Manager can be set to 
cancel reservations if the resources are not used within a certain 
time frame after the reservation begins. If a certain percentage of 
the resources are not used, Moab Workload Manager can also be 
set to reduce the amount of reserved resources. This allows others 
to use the resources, rather than letting them sit idle for the full 
reservation time alloted. The user who reserved the resources may 
still use them after the reservation has been canceled if they are 
available, but the availability is no longer guaranteed.  
 
This feature is available through Moab Workload Manager's event 
policy engine. The event policy engine allows users to automate
 reservation policies, as well as virtually all other management
policies. 
To learn more about reservation policies and other automated policies,
visit 
http://www.clusterresources.com/products/mwm/docs/20.1triggers.shtml. 
Users can test Moab Workload Manager by downloading the Moab Cluster 
Suite evaluation at http://www.clusterresources.com/products/eval/.
 ______________________________________________________
 
Contact Us: 
Product Feedback, Technical Support, Consulting and Custom Development

queries can be sent to info at clusterresources.com or call us at +1 (801)
873-3400
 
Moab Workload ManagerTM, Moab Cluster ManagerTM and MonitorTM, and 
Moab Access PortalTM are trademarks or registered trademarks of Cluster

Resources IncorporatedTM.
 
* This product includes software developed by NASA Ames Research
Center, Lawrence 
Livermore National Laboratory, and Veridian Information Solutions, Inc.
Visit www.OpenPBS.org 
for OpenPBS software support, products, and information. TORQUE is
neither endorsed by nor 
affiliated with Altair Grid Solutions, Inc. 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20050628/b03fc4b5/attachment.html


More information about the torqueusers mailing list