Claims
- 1. A method for generating an approximate histogram of a data set, the method comprising the steps of:
applying a quantile function on the data set to create a computational result; selecting a subset of the data set in response to the computational result; determining a condensed quantile function from the subset of the data set; and rendering the approximate histogram in response to the condensed quantile function.
- 2. The method of claim 1 wherein the subset of the data set includes at least one of a minimum value of the data set, a median value of the data set, and a maximum value of the data set.
- 3. The method of claim 1 wherein the step of determining the condensed quantile function further comprises the step of interpolating between a plurality of values in the data set.
- 4. The method of claim 1 wherein the interpolating step comprises at least one of linear interpolation and polynomial interpolation.
- 5. The method of claim 1 wherein the step of rendering the approximate histogram further comprises the step of calculating an expected sample count based at least in part on an inverse of the condensed quantile function.
- 6. The method of claim 1 wherein the approximate histogram comprises a plurality of bins, the rendering step further comprising the steps of:
reducing the plurality of bins in the approximate histogram into a plurality of adjacent cells; and applying an indicium to each cell in response to a percentage of the data set within each cell.
- 7. The method of claim 6 wherein the indicium is visually discernable.
- 8. A method of merging a plurality of data sets with reduced data storage requirements, the method comprising the steps of:
calculating a condensed quantile function for each data set, each condensed quantile function supplying quantile values; interleaving the quantile values; calculating an inverse of each condensed quantile function at each interleaved quantile value; calculating an average of the inverse of the condensed quantile functions at each interleaved quantile value; and defining a merged quantile function as an inverse of the average of the inverse of the condensed quantile functions at each interleaved quantile value.
- 9. The method of claim 8 wherein the plurality of data sets comprise data associated with a single variable from adjacent time intervals.
- 10. The method of claim 8 wherein the plurality of data sets comprise data associated with a plurality of instances of a single variable from a single time interval.
- 11. The method of claim 8 further comprising the step of rendering a merged histogram in response to the merged quantile function.
- 12. The method of claim 11 wherein the step of rendering the merged histogram further comprises the step of calculating an expected sample count based at least in part on an inverse of the merged quantile function.
- 13. The method of claim 11 wherein the step of rendering the merged histogram further comprises the steps of:
reducing a plurality of bins in the merged histogram into a plurality of adjacent cells; and applying an indicium to each cell in response to a percentage of the data set within each cell.
- 14. The method of claim 13 wherein the indicium is visually discernable.
- 15. A method of optimizing performance in a distributed transaction system, the method comprising the steps of:
collecting data associated with at least one system performance metric; applying a quantile function on the data to create a computational result; selecting a subset of the data in response to the computational result; determining a condensed quantile function from the subset of the data; rendering at least one approximate histogram in response to the condensed quantile function; identifying at least one trend in the at least one approximate histogram; and adjusting, on an as needed basis, operation of the distributed transaction system to modify the at least one trend.
- 16. An apparatus for optimizing performance in a distributed transaction system comprising:
a data collection apparatus for collecting transaction system performance data; logic to apply a quantile function to the transaction system performance data, creating a computational result; logic to select a subset of the transaction system performance data in response to the computational result; logic to determine a condensed quantile function from the subset of the transaction system performance data; logic to render an approximate histogram in response to the condensed quantile function; and a display to render the approximate histogram.
- 17. An article of manufacture comprising a program storage medium having computer readable program code embodied therein for causing the generation of an approximate histogram of a data set, the computer readable program code in the article of manufacture including:
computer readable code for causing a computer to apply a quantile function on the data set to create a computational result; computer readable code for causing a computer to select a subset of the data set in response to the computational result; computer readable code for causing a computer to determine a condensed quantile function from the subset of the data set; and computer readable code for causing a computer to render the approximate histogram in response to the condensed quantile function, so as to generate the approximate histogram.
- 18. A program storage medium readable by a computer, tangibly embodying a program of instructions executable by the computer to perform method steps for generating an approximate histogram of a data set, the method steps comprising:
applying a quantile function on the data set to create a computational result; selecting a subset of the data set in response to the computational result; determining a condensed quantile function from the subset of the data set; and rendering the approximate histogram in response to the condensed quantile function, so as to generate the approximate histogram.
CROSS-REFERENCE TO RELATED CASES
[0001] This application claims priority to and the benefit of, and incorporates herein by reference, in its entirety, provisional U.S. patent application serial No. 60/322,021, filed Sep. 13, 2001. This application also incorporates herein by reference, in their entirety, U.S. patent application Ser. No. 10/186,401, filed Jul. 1, 2002, and Ser. No. 10/198,689, filed Jul. 18, 2002.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60322021 |
Sep 2001 |
US |