The present invention relates to a system and method for the estimation of computer resource usage by transaction types and specifically, but not exclusively, to a more computationally efficient method for estimation of resource usage by transaction types.
In a previous patent application filed by the applicant of the present patent application, a system and method were disclosed which enabled the estimation of computer resource usage by individual transaction types in a transaction processing system. In one embodiment of the invention disclosed in application PCT/09/110,000 filed on Mar. 14, 2002 at the United States Patent Office, there was disclosed a method which used a linear least squares algorithm to obtain resource usage estimates from log data. The log data included information on the number of transaction types and the amount of computer resource usage.
The method disclosed by the above mentioned patent application is comprised, in one embodiment, of the following steps:
The method outlined above is commonly used offline (i.e. not in a “real world” situation), because the matrix equation can be computationally intensive and therefore may degrade performance in real world situations. This is caused, in part, by the large size of the matrices X and Y.
The size of the X and Y matrices will depend on the application and resource usage estimation requirements. The number of columns in the X array is equal to the number of transaction types. In real world systems, it is not uncommon for larger enterprise systems to have hundreds of transaction types. For example, an Enterprise Application Environment application such as UNISURE™ has over 1000 transaction types.
Similarly, the Y array represents the number of system characteristics, and generally, many large enterprise systems will contain hundreds of measurable system characteristics. These characteristics may include, for example, CPU time, disk seek time, number of interrupts, network packet throughput, the number of processes running on the system, etc. These characteristics may measure characteristics of the hardware (such as CPU time) or characteristics of the software (such as the number of processes running on the system).
Therefore, to obtain estimates of resource usage by transaction type, it is necessary to solve matrix equations with large matrices. Typically, such matrices may have thousands of rows and hundreds of columns, and in extreme cases, tens of thousands of rows and thousands of columns.
This may potentially require a large amount of system memory and a large amount of processor time, making the application of the technique not practical for certain “online” monitoring situations.
In a first aspect, the present invention provides a method of estimating computing system resource usage for each transaction type, comprising the steps of,
An advantage of the present invention is that the linear least squares method is preferably computed in a manner that reduces the number of operations required to provide a result.
Preferably, the method comprises the further step of dividing the computation into separate sub-parts.
A resource will be understood to mean any hardware or software component of a computing system which carries out a function. This may include a central processing unit (CPU), a disk drive or other persistent storage mechanism, or volatile memory. The preceding examples are given to illustrate some types of computing resources and should not be construed as limiting.
An embodiment of the present invention may be used in many situations where computing resource usage is scarce. For example, the method may be used in an “online” situation where a system administrator wishes to run an analysis whilst other transactions are being processed (i.e. a “live” situation).
As the cross-product computation is performed at every defined time interval, it is important to minimise the amount of computing time spent calculating cross-products.
It is only necessary to compute the cross-products of each resource counter with each transaction counter and each transaction counter with each other transaction counter.
This results in an appreciable “saving” of time and computing resources, as the number of operations that are performed is reduced. The magnitude of the saving depends on the ratio of the number of system counters to the number of transaction counters. This may be considerable if, for example, there are 100 transaction types but 500 system counters.
A transaction counter will be understood to mean any type of software application or hardware device arranged to count the total number of transactions executed or processed within a given interval of time.
Analogously, a system counter will be understood to mean any type of software application or hardware device or module arranged to “count” the resource utilisation of any hardware device or software application or module in a computing system.
Many commercial packages exist to perform such a function. For example, the Windows™ operating system includes a software module termed “perfmon”, which may be used to count or log the resource usage for any number of hardware or software sub-systems.
Preferably, the resource usage data includes a resource utilisation value for each resource type for each given time interval, and the transaction count data includes the total number of transactions executed for each transaction type in each given time interval.
Preferably, the number of calculations required to produce an estimate of resource usage by transaction type are reduced by computing a select number of cross-products from the total number of possible cross-products in the matrix.
Preferably, the mathematical algorithm is the Cholesky method.
In a second aspect, the present invention provides a system for estimating computing system resource usage for each transaction type, comprising,
In a third embodiment, the present invention provides a computing program arranged, when loaded on a computing system, to control the computing system to implement a method in accordance with the first aspect of the invention.
In a fourth aspect, the present invention provides a computer readable medium providing a computer program in accordance with a third aspect of the invention.
Features of the present invention will be presented in the description of an embodiment thereof, by way of example, with reference to the accompanying drawings, in which:
At
It will be understood that the computing system described in the preceding paragraphs is illustrative only, and that an embodiment of the present invention may be executed on any suitable computing system, with any suitable hardware and/or software.
In one embodiment, the present invention is implemented as a software module 30, the module forming part of a larger suite of proprietary testing applications 32 that are provided under the name “NOFRTE”. NOFRTE is a computing system testing application, capable of testing Enterprise Application Environment applications on various different computing systems, including Windows™ and Unix™ platforms.
Examples of computing systems which can execute the NOFRTE package include the ES7000 and ES5000 range of Unisys servers. However, it will be understood that the present invention may be incorporated into any appropriate testing application, or any other software application, and may be arranged to be executed on any suitable computing system.
A method in accordance with one embodiment of the present invention will now be described. The embodiment hereinafter described utilises the same data as described in the applicant's earlier submission PCT/09/110,000, which is incorporated herein by reference.
The embodiment, termed the “computationally effective” method comprises the following steps:
The above mentioned equation is a standard least linear squares algorithm, which may be found in many standard textbooks on statistics. One example is [Johnson et. al. Applied Multivariate Statistical Analysis 4th edition, Prentice Hall, 1998].
In this equation, X represents a matrix containing the transaction count data, namely the number of transactions of type “j” executed in the time interval “i”, and Y represents a matrix containing the resource counter data, namely a variable such as, for example, the CPU time used in a given time interval.
The result, B, represents the average resource used per transaction type. For example, the average CPU time used by a particular transaction type.
The superscript ‘T’ represents the transpose of the matrix (i.e. interchanging the rows and columns of the matrix), and the superscript ‘−1’, represents the inverse of the matrix. Such terminology is known to a person skilled in the art.
The number of operations required to perform the preceding step (i.e. the application of the least squares algorithm to obtain resource usage estimates by transaction types) is reduced by employing the following intermediate steps when performing step 3 in the preceding sequence:
The term “crossproduct” will be understood to mean a mathematical operation performed on a matrix. The term “crossproduct” is a term known in the art. See, for example, [Kreyszig, E., Advanced Engineering Mathematics, 7th Ed. (1993, John Wiley & Sons, Inc.) Singapore)]
These steps are now described in more detail.
The module reserves a square matrix CV size Nc by Nc (where Nc in an arbitrary number representing the number of rows and columns in the square matrix CV) in memory, which may be volatile memory such as RAM, or virtual memory, such as disk space reserved by a operating system on a hard drive or other permanent memory device. The matrix stores all data required for future solutions of the equation. The matrix further collects the sums of cross-products of all data rows, (that is, the data rows themselves are not be stored).
The size of the storage matrix depends only on the number of transaction types and system variables (resources) that are measured.
Cross-product computation is performed at every defined time interval. In order to minimise the time cost associated with cross-product computation, at each time interval a row of data, Nc is delivered to the module in vector V. The vector, Nc is the sum of the number of transaction types and the number of system characteristics (for example, CPU time).
Vector V contains two types of values describing the system in the current interval:
The software module performs the following operation (note that the following extract is pseudocode):
Note that, since V[r]*V[c]==V[c]*V[r], the matrix CV is symmetric. So, it is only necessary to compute the values in the upper triangular matrix only (in line 2 of the pseudocode r ranges from 1 to c, not to Nc). As only the upper triangular matrix is computed, preferably an appreciable amount of computing time is conserved.
The cost of cross-product computation may be reduced further. As stated above, the data vector V is comprised of two types of values, namely transaction counts and system resource counters. The applicant has discovered that it is not necessary to compute every cross-product.
For example, it is not necessary to compute cross-products between resource counters, as the cross-products between resource counters are not required for computation of the solution.
It is only necessary to compute the cross-products of each resource counter with each transaction counter. This results in an appreciable “saving” of time and computing resources, as the number of operations that must be performed is reduced. The magnitude of the saving depends on the ratio of the number of system counters to the number of transaction counters. This may be considerable if, for example, there are 100 transaction types but 500 system counters. Such values are reasonably common in modern enterprise transaction processing systems.
As described earlier, the equation to compute Linear Least Squares solution commonly used is
B=(XT*X)−1*(XT*Y)
The CV matrix contains all the data required on the right hand side of this equation.
Selecting from the CV matrix a submatrix CVX, containing cross-products of transaction counts, the equivalent of the (XT*X) matrix is obtained. Selecting from the CV matrix a row containing cross-products for, say, CPU time (i.e. CVY), the exact equivalent of the (XT*Y) matrix is obtained. Thus, it becomes necessary to solve the equation
B=CVX−1*CVY
Since CVX is a symmetric matrix, the Cholesky decomposition may be used to obtain a solution to the linear least squares equation with fewer operations than using algorithms designed for general matrices.
In addition, it is possible to code the Cholesky algorithm in such a way that it will use only the lower triangular part of the CV matrix. This leads to memory savings, as the CV matrix can be used for both storing data and computing the solution.
As stated, in one embodiment the computationally efficient method comprises two principal steps or stages:
In this example, the data set will be comprised of CPU utilization statistics (hereinafter described by the variable name ‘CPU’) and three transaction types, namely “NewOrder” (NO), “StockLevel” (SL), and “Delivery” (DE). In each time period (at sampling time) there is collected a vector of length “four”—the first element contains the amount of CPU time used in the last interval, and the remaining three elements contain the number of transactions executed in that given time interval. A sample data file is shown below in Table I:
(Note that the numbers presented in square brackets are for convenience only, they do not form part of the data).
Memory is reserved for a matrix CV, and all entries in the matrix are set to the value 0. Matrix CV collects the sum of the cross-products of data rows. Initially CV contains the following contents, as shown in Table II below:
The cross-products of input data vectors are computed using the following algorithm (which is presented in pseudo-code):
where:
V is the row-vector of freshly arrived data. Thus, after the arrival of the first vector, the matrix CV contains the following data, as shown in table III:
Note that the entries below the diagonal are computed, since the matrix is symmetric. Therefore, no additional information is derived from computing the lower half of the matrix.
After arrival of the second vector, it's cross-product with itself is added to the matrix CV resulting in the following content, shown in Table IV:
This process is continued, such that after arrival of the tenth vector, the matrix CV contains the elements shown in Table V below:
The final step in the process involves solving the equation by applying the Cholensky method. This operation is performed either ‘on request’ or periodically, after the arrival of a given number of data samples (in this example, after first 10 data samples/rows).
The CV array contains two types of data:
a) the first row, on positions 2 to 4 contains sums of cross-products of transaction counts with CPU usage—this is the vector CVY
b) rows 2 to 4 contain, on positions 2 to 4 cross products of transaction counts—this is the matrix CVX:
To solve that system we need to compute the equation
B=CVX−1*CVY
Computation of CVX−1 can be done using Cholesky method as the matrix CVX is symmetric (see table VI below):
Multiplying B=CVX−1*CVY, the sought solution expressing CPU usage by each transaction type (in seconds in this example) is obtained (See table VII below).
The example given above presents a step-by-step solution using a highly simplified embodiment of the present invention. The example is merely illustrative of the two main steps necessary to reduce the number of operations required to apply the mathematical method. Further refinements to the basic method present in this embodiment are possible. These refinements will now be presented in more detail. The example given in the preceding paragraphs highlighted the fact that sections of the matrices remained unused. The primary motivation for not using the entire matrix was the reduction in processing time. It is not necessary to compute the entire matrix if only a portion of the matrix is required to compute the desired solution. However, this also provides memory savings, in addition to computational time savings.
Since the CV matrix is symmetric, it can be used for both data collection (the upper triangular part) and solution computation (lower triangular part). Note that the ‘conceptual’ CVX and CVY matrices are effectively submatrices of CV so no separate storage is required.
Hence all the operations may be performed using only the CV matrix with appropriate indexing. (Approach identical to that of standard pivoting can be used—by remapping indexes). In other words, there is no requirement to extract data from the CV matrix to the CVX and CVY matrices. It is sufficient to provide two additional index vectors, containing the indices of the X and Y data in the Cv matrix. These index vectors are then utilised when manipulation of the contents of the CV matrix is required. This approach is analogous to the approach used in the mathematical branch of numerical methods, such as the Gaussian method of solving a system of equations with pivoting. [see, Kreyszig, E., Advanced (Engineering Mathematics, 7th ed. 1993, John Wiley and Sons, Inc., Singapore)].
The only overlap between data and solution matrices occurs on the diagonal. To avoid losing data a temporary store for the diagonal values must be implemented while the solution is computed. An additional vector for holding the diagonal can be declared and the diagonal data values can be copied to the additional vector before computation of the solution and copied back to the data matrix afterwards. Such an approach minimizes memory usage at the cost of additional memory copy operations. Note that the overhead of such operations is minimal, as it occurs only when the solution is computed.
When memory is not that critical, separate matrices can be used for data collection and solution, thus simplifying the coding of the algorithm. Note that the main memory gain is achieved by not keeping in memory the whole data matrix (containing thousands of rows), but just the CV matrix—all other memory gains are secondary. In addition, there is no need to compute explicitly the inverted matrix CVX−1—as for all standard linear algebra algorithms it is possible to compute only the matrix factorization and solve the system by back substitution. Examples of this process are available in many undergraduate textbooks. For example, see [Strang, G., Introduction to Applied Mathematics, Wellesley-Cambridge Press, 1986].
Therefore, the present invention, in at least one embodiment, provides a method and system for on-line estimation of resource usage by transaction type. That is, one advantage of the invention is the ability to provide instantaneous statistics on resource usage by transaction type. This is useful because it may allow a system administrator or programmer to study comparting system behaviour in real time, and potentially make changes to the computing system “on the fly”. In the prior art, such analysis was generally performed “off-line”, as the amount of computing resources required to compute a solution did not make the technique practical for on-line use, especially where high load demands from other transaction requests are concurrently placed on the transaction processing system.
Moreover, in at least one embodiment, the present invention utilises less CPU resources than prior art methodologies, and also requires less volatile memory than prior art methodologies. Both these characteristics contribute to the usefulness of the technique in “on-line” computing systems, where such resources are generally scarce, especially in transaction processing systems which process a large amount of transactions. For example, a 5% increase in total system load may be deemed as unacceptable by a system administrator, especially, if the load is caused by what is deemed to be a “non-essential” software application. However, a smaller 1% increase in total system load may be acceptable in certain circumstances. In some embodiments (depending on the type of hardware and software that comprises a computing system), such small increase in load are achievable with an embodiment of the present invention.
Modifications and variations as would be apparent to a skilled addressee are deemed to be within the scope of the present invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US02/41622 | 12/27/2002 | WO | 4/19/2006 |