Traditional scatter plots have been widely used to display correlation or association between two variables. A scatter plot is a chart that uses Cartesian coordinates (e.g., x-axis or y-axis coordinates) to display values for the two variables. The data displayed in the scatter plot is a collection of points, each having one coordinate on the horizontal axis and one on the vertical axis. An example of a scatter plot is depicted in
Various points are plotted in the scatter plot of
From the scatter plot of
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
Some embodiments of the invention are described, by way of example, with respect to the following figures:
In accordance with some embodiments, a visualization technique or mechanism is provided to allow for representation of data records of a scatter plot without overlapping of data records (non-overlapping visualization of the data records). The visualization technique or mechanism provides a visualization screen that has rows and columns containing cells representing respective data records of the scatter plot. The rows correspond to value ranges of a first attribute of the data records, and the columns correspond to value ranges of a second, different attribute of the data records. The value ranges for the rows and columns can be automatically generated from incoming data of the data records. A “value range” of an attribute refers to a range of values of the attribute, where the values can be numerical values. Also, the cells can be ordered according to a third attribute. Ordering the cells refers to arranging the cells according to values of an ordering attribute (such as arranging cells from left to right and/or from bottom to top according to the values of the ordering attribute).
The cells can have the same size or different sizes.
In addition, colors (or other visual indicators) can be assigned to the cells according to values of a fourth attribute. An example of another visual indicator includes different patterns. Effectively, the visualization screen according to some embodiments provides for a multi-dimensional representation of the data records, including a first dimension corresponding to the first attribute, a second dimension corresponding to the second attribute, a third dimension corresponding to the third attribute, and a fourth dimension corresponding to the fourth attribute.
As used here, the term “scatter plot” refers to either a traditional scatter plot or a scatter plot represented with a visualization screen according to some embodiments. A traditional scatter plot uses Cartesian coordinates (e.g., a horizontal or x-axis and a vertical or a y-axis), with data points plotted against the values of the variables in the Cartesian coordinate system to provide the scatter plot. On the other hand, a scatter plot that is represented by a visualization screen according to some embodiments refers to a representation of data records in rows and columns that have cells that represent respective data records.
In some embodiments, each row of cells runs generally in a horizontal direction of the visualization screen, while each column of cells runs generally in a vertical direction of the visualization screen. In such embodiments, each row is confined between a corresponding first horizontal line and corresponding second horizontal line spaced apart from the first horizontal line. In this manner, the rows do not intrude into neighboring rows—in other words, each of the rows does not intrude into neighboring rows. Similarly, in such embodiments, each column is confined between a corresponding first vertical line and corresponding second vertical line, such that each of the columns does not intrude into neighboring columns.
The arrangement of rows and columns of cells defines a regular array of blocks of cells, where each block is at the intersection of a corresponding row and column. A “regular array” of blocks means that within each particular row, the blocks of the particular rows are of equal height, and within each particular column, the blocks of the particular column have equal width. Note, however, that the blocks in different rows can have different heights (due to different value ranges of the first attribute), and the blocks in different columns can have different widths (due to different value ranges of the second attribute).
The visualization screen 200 also includes columns 204A, 204B, . . . , 204J (collectively referred to as “columns 204”) that correspond to value ranges of the CPU busy attribute (second attribute) (e.g., 0-10, 11-20, 21-30, 31-40, and so forth). The intersections of the rows 202 and columns 204 define corresponding blocks 205.
Cells 206 are schematically illustrated as enlarged circles in the block 205 in column 204B of row 202A. The cells 206 represent respective data records. Note that the cells 206 are enlarged in the diagram to allow for ease of viewing in
In the example of
In row 202B, note that as the CPU busy values increase (in columns 204C and forward), the number of data records appearing in these successive columns increase. This is due to the fact that as CPU busy values increase (indicating that loading on the CPU becomes heavier), the queue length also increases (indicating that the number of jobs waiting for execution is increasing). This pattern is also present in the other combinations of rows and columns.
Note that a section 210 of the visualization screen 200 corresponds to the section 102 of
What is depicted in
On the other hand, with the visualization screen 200 of
Generally, to allow for distinct data points associated with a scatter plot to be viewed in the visualization screen 200, first value groups and second value groups are defined to represent the horizontal axis and vertical axis values of a scatter plot. The first value groups define corresponding columns 204 in
A further feature provided by some embodiments is that user interaction is enabled such that a user can perform zooming or other actions to view portions of a scatter plot or to perform other interactions in the visualization screen.
Note that in
As further depicted in
As further depicted in
The value groups (defining rows and columns) of the visualization screen are filled (at 406) with corresponding cells. The cells are then ordered (at 408) from left to right and bottom to top according to the values of the ordering attribute(s). Also, cells are assigned colors (at 410) according to values of the coloring attribute.
All distinct data records are then rendered (at 412) in the visualization screen according to the tasks performed at 406-410.
Thereafter, the procedure repeats based on receiving (at 414) a user interaction or receiving (at 402) further input data records. The user interaction with the visualization screen, can be a selection, a move, or a rubber-banding operation. In response to the user interaction (414), the definitions at 404 can be modified to depict fewer value groups, such as in response to selection of a sub-region in the visualization screen, and/or other actions. Receipt of further input data records can also cause definitions (404) to change, since the further data records can be associated with new values (of the row and column attributes) that may trigger new value groups to be defined, or existing value groups to be modified (value groups may be defined to have more values or less values to enlarge or reduce the respective ranges).
The tasks of
The columns of the visualization screen 504 correspond to different value ranges of the CPU busy attribute, and the rows of the visualization screen 504 correspond to different values ranges of the Disk usage attribute. The color assigned to cells within blocks 506 of the visualization screen 504 represent values of Queue length (the coloring attribute). The cells can be ordered by time stamp (the ordering attribute) in the example of
In the example of
As depicted in
Also, note that there is low disk usage when CPU busy is above 60%. Note that
The computer 604 also includes a display device 606 in which visualization screen 608 (such as the visualization screens depicted in
By using the visualization technique or mechanism according to some embodiments, various benefits may be provided. First, overlapping data records that traditionally are present in conventional scatter plots can be avoided in visualization screens according to some embodiments. In this way, the true volume of data records within each value group can be shown to provide a complete view of data value distribution. Also, multiple dimensions can be depicted using various attributes, including attributes to define rows and columns, coloring attributes, and ordering attributes.
Instructions of software described above (including the visualization software 600 of
Data and instructions (of the software) are stored in respective storage devices, which are implemented as one or more computer-readable or computer-usable storage media. The storage media include different forms of memory including semiconductor memory devices such as dynamic or static random access memories (DRAMs or SRAMs), erasable and programmable read-only memories (EPROMs), electrically erasable and programmable read-only memories (EEPROMs) and flash memories; magnetic disks such as fixed, floppy and removable disks; other magnetic media including tape; and optical media such as compact disks (CDs) or digital video disks (DVDs). Note that the instructions of the software discussed above can be provided on one computer-readable or computer-usable storage medium, or alternatively, can be provided on multiple computer-readable or computer-usable storage media distributed in a large system having possibly plural nodes. Such computer-readable or computer-usable storage medium or media is (are) considered to be part of an article (or article of manufacture). An article or article of manufacture can refer to any manufactured single component or multiple components.
In the foregoing description, numerous details are set forth to provide an understanding of the present invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these details. While the invention has been disclosed with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover such modifications and variations as fall within the true spirit and scope of the invention.
Number | Name | Date | Kind |
---|---|---|---|
3487308 | Johnson | Dec 1969 | A |
5581797 | Baker | Dec 1996 | A |
5608904 | Chaudhuri et al. | Mar 1997 | A |
5623590 | Becker et al. | Apr 1997 | A |
5623598 | Voigt et al. | Apr 1997 | A |
5634133 | Kelley | May 1997 | A |
5659768 | Forbes et al. | Aug 1997 | A |
5694591 | Du et al. | Dec 1997 | A |
5757356 | Takeasaki et al. | May 1998 | A |
5801688 | Mead et al. | Sep 1998 | A |
5878206 | Chen et al. | Mar 1999 | A |
5903891 | Chen et al. | May 1999 | A |
5924103 | Ahmed et al. | Jul 1999 | A |
5929863 | Tabei et al. | Jul 1999 | A |
5940839 | Chen et al. | Aug 1999 | A |
5986673 | Martz | Nov 1999 | A |
5999193 | Conley, Jr. et al. | Dec 1999 | A |
6052890 | Malagrino, Jr. et al. | Apr 2000 | A |
6144379 | Bertram et al. | Nov 2000 | A |
6188403 | Sacerdoti et al. | Feb 2001 | B1 |
6211880 | Impink, Jr. | Apr 2001 | B1 |
6211887 | Meier et al. | Apr 2001 | B1 |
6269325 | Lee et al. | Jul 2001 | B1 |
6384847 | Rabenhorst | May 2002 | B1 |
6400366 | Davies et al. | Jun 2002 | B1 |
6429868 | Dehner, Jr. et al. | Aug 2002 | B1 |
6466946 | Mishra et al. | Oct 2002 | B1 |
6502091 | Chundi et al. | Dec 2002 | B1 |
6584433 | Zhang et al. | Jun 2003 | B1 |
6590577 | Yonts | Jul 2003 | B1 |
6603477 | Tittle | Aug 2003 | B1 |
6658358 | Hao et al. | Dec 2003 | B2 |
6684206 | Chen et al. | Jan 2004 | B2 |
6727926 | Utsuki et al. | Apr 2004 | B1 |
6934578 | Ramseth | Aug 2005 | B2 |
7020869 | Abrari et al. | Mar 2006 | B2 |
7202868 | Hao | Apr 2007 | B2 |
7221474 | Hao et al. | May 2007 | B2 |
7313533 | Chang et al. | Dec 2007 | B2 |
7567250 | Hao et al. | Jul 2009 | B2 |
7714876 | Hao | May 2010 | B1 |
7800613 | Hanrahan et al. | Sep 2010 | B2 |
7844926 | Russell | Nov 2010 | B1 |
7924283 | Hao | Apr 2011 | B1 |
20020118193 | Halstead, Jr. | Aug 2002 | A1 |
20030030637 | Grinstein et al. | Feb 2003 | A1 |
20030065546 | Goruer et al. | Apr 2003 | A1 |
20030071815 | Hao et al. | Apr 2003 | A1 |
20030128212 | Pitkow | Jul 2003 | A1 |
20030221005 | Betge-Brezetz et al. | Nov 2003 | A1 |
20040051721 | Ramseth | Mar 2004 | A1 |
20040054294 | Ramseth | Mar 2004 | A1 |
20040054295 | Ramseth | Mar 2004 | A1 |
20040095349 | Bito et al. | May 2004 | A1 |
20040183799 | Hao et al. | Sep 2004 | A1 |
20040201588 | Meanor | Oct 2004 | A1 |
20040210540 | Israel et al. | Oct 2004 | A1 |
20050066026 | Chen et al. | Mar 2005 | A1 |
20050119932 | Hao | Jun 2005 | A1 |
20050219262 | Hao et al. | Oct 2005 | A1 |
20060095858 | Hao et al. | May 2006 | A1 |
20060221077 | Wright et al. | Oct 2006 | A1 |
20070225986 | Bowe, Jr. et al. | Sep 2007 | A1 |
20080033790 | Nickerson et al. | Feb 2008 | A1 |
20090033664 | Hao et al. | Feb 2009 | A1 |
Number | Date | Country |
---|---|---|
0778001 | Nov 1996 | EP |
Entry |
---|
M. Trutschl et al., “Intelligently Resolving Point Occlusion,” Information Visualization, 2003, INFOVIS 2003, IEEE Symposium, Published Oct. 2003, Abstract only. |
T. Buring et al., “User Interaction with Scatterplots on Small Screens—A Comparative Evaluation of Geometric-Semantic Zoom and Fisheye Distortion,” Visualization and Computer Graphics, IEEE Transactions, Published Sep.-Oct. 2006, pp. 1-8. |
Deun et al., Multidimensional Scaling, Open and Distance Learning, Jan. 12, 2000 (pp. 1-16). |
http://www.pavis.org/essay/multidimensional—scaling.html, 2001 Wojciech Basalaj, (pp. 1-30). |
D. Keim et al Pixel Bar Charts: A New Technique for Visualization Large Multi-Attribute Data Sets with Aggregation:, HP Technical Report, Apr. 2001, pp. 1-10. |
M. Ankerst et al “Towards an effective cooperation of the computer and the computer user for classification, Proc. 6th Int. Conf. on Knowledge Discovery and Data Mining ,” (KDD'2000), Aug. 20-23, 2000, Boston, MA, 2000, pp. 1-10. |
M.C. Hao et al “Visual Mining of E-customer Behavior Using Pixel Bar Charts,”, HP Technical Report, Jun. 20, 2001, pp. 1-7. |
B. Shneiderman, “Tree Visualization with Treemaps: a 2-D Space-Filling Approach”, pp. 1-10, Jun. 1991. |
Daniel Keim et al “Designing Pixel-Orientated Visualization Techniques: Theory and Applications” IEEE Transactions on Visualization and Computer Graphics, vol. 6, No. 1, Jan.-Mar. 2000, pp. 59-78. |
Jessica Lin, Eamonn Keogh, Stefano Lonardi, Jeffrey P. Lankford, Donna M. Nystrom; Visually Mining and Monitoring Massive Time Series; 2004; International Conference on Knowledge Discovery and Data Mining archive, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents; pp. 460-469. |
Eamonn Keogh, Harry Hochheiser, and Ben Shneiderman; An Augmented Visual Query Mechanism for Finding Patterns in Time Series Data; 2002; Lecture Notes in Computer Science, Proceedings of the 5th International Conference on Flexible Query Answering Systems; Springer-Verlag; vol. 252212002; pp. 240-250. |
Chris Stolte et al., “Polaris: A System for Query, Analysis and Visualiztion of Multidimensional Relational Databases,” IEEE Transactions on Visualization and ComputerGraphics, vol. 8, No. 1, pp. 1-14 (Jan.-Mar. 2002). |
Daniel A. Keim et al., “VisDB: Database Exploration Using Multidimensional Visualization,” IEEE Graphics and Applications, vol. 14, No. 5, pp. 40-49 (1994). |
Matthew O. Ward, “XmdvTool: Integrating Multiple Methods for Visualizing Multivariate Data,” Proc. Visualization, pp. 326-331 (Oct. 1994). |
H. Hochheiser et al., “Dynamic Query Tools for Time Series Data Sets: Timebox Widgets for Interactive Exploration,” Information Visualization, vol. 3, pp. 1-18 (2004. |
P. Buono et al., “Technical Research Report, Interactive Pattern Search in Time Series,” Institute for Systems Research, TR 2005-57, pp. 1-11 (2004. |
J. Yang et al., “Visual Hierarchical Dimension Reduction for Exploration of High Dimensional Datasets,” Joint Eurographics/IEEE TCVG Symposium on Visualization, pp. 19-28 (May 2003). |
J. Fekete et al., “Interactive Information Visualization of a Million Items,” Information Visualization, 2002, INFOVIS 2002, IEEE Symposium, Published Oct. 2002, pp. 1-8. |
UCLA Academic Technology Services, Stat Computing, SPSS FAQ—How do I interpret the results from crosstabs? dated at least as early as Sep. 2008 (2 pages). |
Number | Date | Country | |
---|---|---|---|
20100103176 A1 | Apr 2010 | US |