METHOD AND SYSTEM FOR AN IMPROVED WORK-LOAD BALANCING WITHIN A CLUSTER

Information

  • Patent Application
  • 20070233843
  • Publication Number
    20070233843
  • Date Filed
    March 23, 2007
    17 years ago
  • Date Published
    October 04, 2007
    17 years ago
Abstract
The present invention provides a method and system for an improved workload-balancing in a cluster which is characterized by a new extrapolation process which is based on a modified workload query process. The extrapolation process is automatically initiated for each node each time a start decision of a resource within the cluster is being made and is characterized by the steps of: accessing exclusively said actual workload data of each node stored in the workload data workload-data history repository without initiating a new workload query,accessing information how many resources are actually active and are to be intended active on each node,calculating the expected workload of all resources which are intended to be active on each node based on said previous accessing steps,calculating the expected free capacity of each node,providing expected free capacity of each node to the CM,starting said resource at that node which provides the highest amount of free capacity, andupdating said workload data history repository for said node accordingly.
Description

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example and is not limited by the shape of the figures of the drawings in which:



FIG. 1 A shows prior art cluster architecture,



FIG. 1 B-C show prior art methods of incorporating workload data into the CMs decision process of starting an resource within the cluster,



FIG. 2 A shows the prior art cluster architecture extended by the inventive components, and



FIG. 2 B-D show the inventive method carried out by the inventive components.


Claims
  • 1. Method for an improved work-load balancing within a cluster, wherein said cluster consists of nodes which provide resources, wherein each resource is member of a resource group that ensures that at least one instance of a resource is active at a given time, wherein said resource group is controlled by a cluster manager (CM) which decides to start or stop a resource at a certain node, wherein said method is characterized by the steps of: querying workload data for each node in time intervals selected such that the query overhead is reduced to a minimum,storing said workload data in a workload data history repository which provides at least the total capacity per node, and the used capacity per node,automatically starting for each node an extrapolation process at each time a start decision of a resource within said cluster is being initiated comprising the steps of:accessing exclusively said actual workload data of each node stored in said data workload data history repository without initiating a new workload query,accessing information how many resources are actually active and are intended to be active on each node,calculating the expected workload of all resources which are intended to be active on each node based on said previous accessing steps,calculating the expected free capacity of each node,providing expected free capacity of each node to the CM,starting said resource at that node which provides the highest amount of free capacity, andupdating said workload data history repository for said node accordingly.
  • 2. Method according to claim 1, further including the step: automatically starting for each node an extrapolation process at each time a stop decision of a resource within said cluster is being initiated resulting in a update of said workload data history repository.
  • 3. Method according to claim 1, wherein said workload data stored in said workload data history repository representing a rolling average, and said time intervals are selected not shorter than half of the interval represented by said rolling average.
  • 4. Method according to claim 1, wherein said workload data stored in said workload data history repository includes the actual workload of said resources.
  • 5. Method according to claim 1, wherein said cluster manager makes the decision to start a plurality of resources further including the steps of: sorting said resources according their actual workload,assigning said resource with the highest actual workload to that node with the highest amount of free capacity, andrepeating said previous steps for each resource.
  • 6. System for an improved work-load balancing within a cluster, wherein said cluster consists of nodes, a local resource manager (RM), a local workload manager (WM), and at least one resource is assigned each node, wherein each resource is member of a resource group that ensures that at least one instance of a resource is active at a given time, wherein said resource group is controlled by a cluster manager (CM) which decides to start or stop a resource at a certain node, wherein said system is characterized by the further function components: a workload query function component for querying workload data for each node in time intervals selected such that the query overhead is reduced to a minimum, wherein said workload query component uses an interface provided by said workload manager for accessing workload data,a workload data history repository for storing said workload data which provides at least the total capacity per node, and the used capacity per node,an extrapolation function component for automatically starting for each node an extrapolation process at each time a start decision of a pre-installed resource within said cluster is being initiated comprising the means of:means for accessing exclusively said actual workload data of each node stored in said workload data history repository without initiating a new workload query,means for accessing information how many resources are actually active and are intended to be active on each node,means for calculating the expected workload of all resources which are intended to be active on each node based on said previous accessing steps,means for calculating the expected free capacity of each node,means for providing expected free capacity of each node to said cluster manager,means for starting said resource at that node which provides the most free capacity, andmeans for updating said workload data history repository for said node accordingly.
  • 7. System according to claim 6, wherein said workload query function component is part of the cluster manager or provides an interface that the cluster manager may use.
  • 8. System according to claim 6, wherein said workload data is provided by the workload manager in a representation required by said cluster manager.
  • 9. System according to claim 6, wherein said work load query function component transforms said workload data in said required representation.
  • 10. System according to claim 6, wherein said extrapolation process function component is part of the cluster manager or provides an interface that said cluster manager may use.
  • 11. A Computer program product in a computer usable medium comprising computer readable program means for causing the computer to perform a method for workload balancing, when said computer program product is executed on computer, the method comprising the steps of: querying workload data for each node in time intervals selected such that the query overhead is reduced to a minimum,storing said workload data in a workload data history repository which provides at least the total capacity per node, and the used capacity per node,automatically starting for each node an extrapolation process at each time a start decision of a resource within said cluster is being initiated comprising the steps of:accessing exclusively said actual workload data of each node stored in said data workload data history repository without initiating a new workload query,accessing information how many resources are actually active and are intended to be active on each node,calculating the expected workload of all resources which are intended to be active on each node based on said previous accessing steps,calculating the expected free capacity of each node,providing expected free capacity of each node to the CM,starting said resource at that node which provides the highest amount of free capacity andupdating said workload data history repository for said node accordingly.
Priority Claims (1)
Number Date Country Kind
06111995.4 Mar 2006 EP regional