|
|||
17.19 Case Study: Grid OverflowOverviewNCSA owns a 512-processor cluster, part of which they want to make available to clients for use when needed. They want to create a protected area of their cluster, containing 64 processors, to which clients have priority access. When those resources are not in use by clients, they would like to use them for internal workload. However, any client workload that shows up would be able to preempt NCSA internal workload on those processors. NCSA wants to establish a service level agreement with each client, including a billing rate. Each client should have control over when NCSA resources are used. Tracking the actual resource usage by each client should be as automated as possible. NCSA wants to maintain a high level of security by maintaining tight control over access to their shared resources and encrypting communication to and from each client. They also want to minimize the overhead of providing access to clients to avoid affecting scheduling performance for their clients and themselves. GoalsThe goals of establishing a relationship with a peer site in order to share extra capacity include the following:
AnalysisThe best way to avoid any user-visible changes is to allow Moab to migrate jobs transparently by creating a peer relationship with another instance of Moab. This allows users to continue to submit jobs normally, while experiencing the increased turn-around time gained by using the additional resources. Moab has flexible and convenient mechanisms for handling authorization and mapping user credentials on one system to a different set of credentials on a peer system. It also supports encryption and authentication using secret keys or certificates. Moab is designed to handle large numbers of jobs during scheduling. The additional work of evaluating jobs for migration to a peer instance of Moab adds only a minor amount of computation. Moab can immediately begin migrating jobs when a preconfigured backlog threshold is reached. Client (Job Source) ConfigurationNSCA (Job Destination) ConfigurationMonitoringThe mdiag -R command will allow both NCSA and their clients to monitor how many jobs flow over to NCSA's grid sandbox. Detailed information about what types of jobs are migrating can be obtained using job profiles. ConclusionCreating peer relationships between clusters allows great flexibility in how sites can work with each other to increase their overall efficiency and meet their own goals. Creating a unidirectional peer relationship from each of the clients allows NCSA to provide an overflow service that is beneficial to both parties. Clients can choose to use NCSA's resources to reduce their workflow backlog, and NCSA can provide overflow capacity to their clients and bill for usage.
|
|||
| © 2001-2008 Cluster Resources, Incorporated | |||