Claims
- 1. A method for displaying and manipulating data to facilitate identification, trends, correlations or other useful relationships among the data, the method comprising the steps of:
inputting, into a memory of a processing system, data items associated with entities to be observed, wherein the entities are arranged in an n×m matrix, n equaling the number of columns in the matrix and m equaling the number of rows in the matrix; identifying a data item with respect to each entity that is to be represented in a display matrix, wherein each identified data item represents the same characteristic of each respective entity, although the value of each identified data item may vary; converting the identified data items to determine graphical representations of the identified data items to be displayed, wherein the graphical representations graphically represent variations in the values of the identified data items; displaying the first c×d graphical representations of the identified data items in a c×d matrix, where c<n and d<m, with each graphical representation from the first c×d representations occupying a corresponding cell in the c×d matrix displayed; selecting a row or column in the displayed c×d matrix; sorting the order of arrangement of the entities in the n×m matrix based on a comparison of the values of the identified data items in the row or column of the n×m matrix corresponding to the row or column selected from the displayed c×d matrix; reordering the order of arrangement of the entities in the n×m matrix based on the sort performed; and displaying the first c×d graphical representations of the identified data items from the reordered n×m matrix for observation by a user, for visual identification of changes in the relationships between the graphical representations.
- 2. The method of claim 1, wherein a row is selected and the sorting comprises rank ordering the identified items 1 through n which are assigned to the selected row, and wherein upon reordering, the column in which the first ranked identified data item is located is reordered to the first column position of the n×m matrix, the column in which the second ranked identified data item is located is reordered to the second column position of the n×m matrix and successive columns are reordered according to the rank of the identified data item contained therein which also belongs to the selected row, wherein the first c columns of the reordered m×n matrix are then displayed.
- 3. The method of claim 1, wherein a column is selected and the sorting comprises rank ordering the identified items 1 through m which are assigned to the selected column, and wherein upon reordering, the row in which the first ranked identified data item is located is reordered to the first row position of the n×m matrix, the row in which the second ranked identified data item is located is reordered to the second row position of the n×m matrix and successive rows are reordered according to the rank of the identified data item contained therein which also belongs to the selected column, wherein the first d rows of the reordered m×n matrix are then displayed.
- 4. The method of claim 1, wherein a row is selected and the sorting comprises similarity sorting the rows of the n×m matrix, wherein the selected row is reordered as row one of the reordered n×m matrix and all other rows are processed to determine a relative similarity value to the selected row, and wherein upon reordering, all other rows are repositioned in descending order from row one, based on ranking by the relative similarity values, and wherein the first d rows of the reordered m×n matrix are then displayed.
- 5. The method of claim 4, wherein a relative similarity value is determined by calculating a distance value between the selected row and a row to be assigned the similarity value, wherein each cell of the selected row is compared with a respective cell in the row to be assigned the similarity value.
- 6. The method of claim 5, wherein the distance value is determined by calculating a squared Euclidean distance between the two rows.
- 7. The method of claim 5, wherein the distance value is determined by calculation of the Pearson correlation coefficient.
- 8. The method of claim 1, wherein a column is selected and the sorting comprises similarity sorting the columns of the n×m matrix, wherein the selected column is reordered as column one of the reordered n×m matrix and all other columns are processed to determine a relative similarity value to the selected column, and wherein upon reordering, all other columns are repositioned in descending order from column one, based on ranking by the relative similarity values, and wherein the first c columns of the reordered m×n matrix are then displayed.
- 9. The method of claim 8, wherein a relative similarity value is determined by calculating a distance value between the selected column and a column to be assigned the similarity value, wherein each cell of the selected column is compared with a respective cell in the column to be assigned the similarity value.
- 10. The method of claim 9, wherein the distance value is determined by calculating a squared Euclidean distance between the two columns.
- 11. The method of claim 1, wherein a row is selected and the sorting comprises nearest neighbor sorting the rows of the n×m matrix, wherein the selected row is reordered as row one of the reordered n×m matrix, all other rows are processed to determine a relative similarity value to the selected row, the row having the nearest relative similarity value is reordered as row two of the reordered n×m matrix, all remaining rows which have not been reordered are processed to determine a relative similarity value to reordered row two, the row having the nearest relative similarity to row two is reordered as row three, and this process is reiterated until all rows have been reordered by nearest neighbor, wherein the first d rows of the reordered m×n matrix are then displayed.
- 12. The method of claim 1, wherein a column is selected and the sorting comprises nearest neighbor sorting the columns of the n×m matrix, wherein the selected column is reordered as column one of the reordered n×m matrix, all other columns are processed to determine a relative similarity value to the selected column, the column having the nearest relative similarity value is reordered as column two of the reordered n×m matrix, all remaining columns which have not been reordered are processed to determine a relative similarity value to reordered column two, the column having the nearest relative similarity to column two is reordered as column three, and this process is reiterated until all columns have been reordered by nearest neighbor, wherein the first c columns of the reordered m×n matrix are then displayed.
- 13. The method of claim 1, further comprising the steps of:
identifying a second data item, other than the data item which is graphically represented on the display, with respect to each entity in the n×m matrix, wherein each identified second data item represents the same characteristic of each respective entity, although the value of each identified second data item may vary; selecting a row or column in the displayed c×d matrix; sorting the order of arrangement of the entities in the n×m matrix based on a comparison of the values of the identified second data items in the row or column of the n×m matrix corresponding to the row or column selected from the displayed c×d matrix; reordering the order of arrangement of the entities in the n×m matrix based on the sort performed on the second data items; and displaying the first c×d graphical representations of the identified data items from the reordered n×m matrix for observation by a user, for visual identification of changes in the relationships between the graphical representations.
- 14. The method of claim 1, further comprising the steps of:
selecting a displayed item e in a selected column wherein item e is displayed in a row other than the first row of the selected column; sub-sorting the n×m matrix by rank ordering the identified items e through m in the selected column and reordering items e through m according to rank, wherein upon reordering items e through m, the rows in which the respective items e through m are located are reordered to the respective rank positions in the n×m matrix; and displaying the first c×d graphical representations of the identified data items from the reordered n×m matrix for observation by a user, for visual identification of changes in the relationships between the graphical representations.
- 15. The method of claim 1, further comprising the steps of:
selecting a displayed item f in a selected row wherein item f is displayed in a column other than the first column of the selected row; sub-sorting the n×m matrix by rank ordering the identified items f through n in the selected row and reordering items f through n according to rank, wherein upon reordering items f through n, the columns in which the respective items f through n are located are reordered to the respective rank positions in the n×m matrix; and displaying the first c×d graphical representations of the identified data items from the reordered n×m matrix for observation by a user, for visual identification of changes in the relationships between the graphical representations.
- 16. The method of claim 1, further comprising the steps of:
selecting a row g other than the first row of the reordered, displayed c×d matrix; sub-sorting rows g through m by similarity sorting the rows of the n×m matrix from row g to row m, wherein the selected row is maintained as row position g in the reordered n×m matrix and all rows below row g are processed to determine a relative similarity value to selected row g, and wherein upon reordering, all other rows below row g are repositioned in descending order from row g, based on ranking by the relative similarity values to row g, and wherein the first d rows of the reordered m×n matrix are then displayed.
- 17. The method of claim 1, further comprising the steps of:
selecting a column h other than the first column of the reordered, displayed c×d matrix; sub-sorting columns h through n by similarity sorting the columns of the n×m matrix from column h to column n, wherein the selected column h is maintained as column position h in the reordered n×m matrix and all columns from h+1 to n are processed to determine a relative similarity value to selected column h, and wherein upon reordering, columns h+1 to n are repositioned in descending order from column h, based on ranking by the relative similarity values to column h, and wherein the first c columns of the reordered m×n matrix are then displayed.
- 18. The method of claim 1, further comprising the steps of:
selecting a row g other than the first row of the reordered, displayed c×d matrix; sub-sorting rows g through m by nearest neighbor sorting the rows of the n×m matrix from row g to row m, wherein the row g is maintained as row position g in the reordered n×m matrix and rows g+1 to m are processed to determine a relative similarity to row g, the row having the nearest relative similarity value to row g is reordered as row g+1 of the reordered n×m matrix, all remaining rows which have not been reordered are then processed to determine a relative similarity value to reordered row g+1, the row having the nearest relative similarity to row g+1 is reordered as row g+2, and this process is reiterated until all rows g to m have been sub-sorted by nearest neighbor, wherein the first d rows of the reordered m×n matrix are then displayed.
- 19. The method of claim 1, further comprising the steps of:
selecting a column h other than the first column of the reordered, displayed c×d matrix; sub-sorting columns h through n by nearest neighbor sorting the columns of the n×m matrix from column h to column n, wherein the column h is maintained as column position h in the reordered n×m matrix and columns h+1 to n are processed to determine a relative similarity to column h, the column having the nearest relative similarity value to column h is reordered as column h+1 of the reordered n×m matrix, all remaining columns which have not been reordered are then processed to determine a relative similarity value to reordered column h+1, the column having the nearest relative similarity to column h+1 is reordered as column h+2, and this process is reiterated until all columns h to n have been sub-sorted by nearest neighbor, wherein the first c columns of the reordered m×n matrix are then displayed.
- 20. The method of claim 1, further comprising manually repositioning at least one column or row of the displayed c×d matrix, wherein such manually repositioning repositions the corresponding rows and columns of the entire n×m matrix.
- 21. The method of claim 20, wherein the manual repositioning is carried out by dragging and dropping the row or column.
- 22. The method of claim 1, further comprising the steps of:
inputting additional data into the memory of the processing system, wherein the additional data corresponds to the entities in the n×m matrix and is arranged in an n×y matrix; linking the column addresses (n values) of the n×y matrix with those of the n×m matrix; and displaying the data from the n×y matrix together with the first c×d−y graphical representations of the identified data items in a c×d matrix; wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a column of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding column in the n×y matrix is reordered by its linking address to maintain the same relative display position to the reordered column of the n×m matrix when displayed in the c×d matrix on the display.
- 23. The method of claim 1, further comprising the steps of:
inputting additional data into the memory of the processing system, wherein the additional data corresponds to the entities in the n×m matrix and is arranged in a z×m matrix; linking the row addresses (m values) of the z×m matrix with those of the n×m matrix; and displaying the data from the z×m matrix together with the first c−z×d graphical representations of the identified data items in a c×d matrix; wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a row of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding row in the z×m matrix is reordered by its linking address to maintain the same relative display position to the reordered row of the n×m matrix when displayed in the c×d matrix on the display.
- 24. The method of claim 22, further comprising the steps of:
inputting further additional data into the memory of the processing system, wherein the further additional data corresponds to the entities in the n×m matrix and is arranged in a z×m matrix; linking the row addresses (m values) of the z×m matrix with those of the n×m matrix; and displaying the data from the z×m matrix together with data from the n×y matrix and the first c−z×d−y graphical representations of the identified data items in a c×d matrix; wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a row of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding row in the z×m matrix is reordered by its linking address to maintain the same relative display position to the reordered row of the n×m matrix when displayed in the c×d matrix on the display.
- 25. The method of claim 1, wherein the data items are experimental data.
- 26. The method of claim 1, wherein the entities are derived from microarray experiments, the identified data items are gene expression ratios and the graphical representations of the identified data items are color-coded graphical representations typically used in heat maps.
- 27. The method of claim 22, wherein the data items comprise experimental data and the additional data is selected from the group consisting of clinical data, patient data, and a combination of clinical and patient data.
- 28. The method of claim 1, wherein the graphical representations are color coded with a first color corresponding to an identified data item having a relatively neutral value, a second color corresponding to an identified item having a higher than neutral value, and a third color having a lower than neutral value.
- 29. The method of claim 28, wherein the color-coding further comprises shades of the second and third colors to represent degrees of values higher and lower than the neutral value, respectively.
- 30. The method of claim 29, wherein the graphical representations are further variably sized to represent degrees of values higher and lower than the neutral value, and wherein a completely neutral value is represented by a blank cell in the displayed matrix.
- 31. The method of claim 30, wherein the graphical representations are variably sized colored circles.
- 32. The method of claim 31, wherein a graphical representation appears as a colored rectangle or square, completely filling a cell of the displayed matrix when a colored circle is calculated to have a diameter greater than or equal to a width or height of the cell in which it is to be displayed.
- 33. The method of claim 30, wherein the graphical representations are variably sized vertical bars.
- 34. The method of claim 30, wherein the graphical representations are variably sized horizontal bars.
- 35. The method of claim 1, wherein the graphical representations are variably sized to represent degrees of values higher and lower than a relative neutral value of the identified data items, and wherein a completely neutral value is represented by a blank cell in the displayed c×d matrix.
- 36. The method of claim 35, wherein the graphical representations are variably sized circles.
- 37. The method of claim 36, wherein a graphical representation appears as a rectangle or square, completely filling a cell of the displayed c×d matrix when a circle is calculated to have a diameter greater than or equal to a width or height of the cell in which it is to be displayed.
- 38. The method of claim 35, wherein the graphical representations are variably sized vertical bars.
- 39. The method of claim 35, wherein the graphical representations are variably sized horizontal bars.
- 40. A method for displaying and manipulating data to facilitate identification, trends, correlations or other useful relationships among the data, the method comprising the steps of:
inputting, into a memory of a processing system, experimental data items associated with biological entities to be observed, wherein the entities are arranged in an n×m matrix, n equaling the number of columns in the matrix and m equaling the number of rows in the matrix; inputting additional data into the memory of the processing system, wherein the additional data is non-experimental data corresponding to the entities in the n×m matrix and is arranged in an n×y matrix; linking the column addresses (n values) of the n×y matrix with those of the n×m matrix; identifying an experimental data item with respect to each entity that is to be represented in a display matrix, wherein each identified experimental data item represents the same characteristic of each respective biological entity, although the value of each identified data item may vary; converting the identified experimental data items to determine graphical representations of the identified data items to be displayed, wherein the graphical representations graphically represent variations in the values of the identified data items; displaying at least a portion of the non-experimental data and the graphical representations of the identified data items in a c×d matrix, where c<n and d<(m+y); selecting a row or column in the displayed c×d matrix; sorting the order of arrangement of the entities in the n×m matrix based on a comparison of the values of the identified experimental data items in the row or column of the n×m matrix corresponding to the row or column selected from the displayed c×d matrix; reordering the order of arrangement of the entities in the n×m matrix based on the sort performed, wherein, when a column of experimental data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding column in the n×y matrix is reordered by its linking address to maintain the same relative display position to the reordered column of the n×m matrix; and displaying the first c×d non-experimental data items and graphical representations of the experimental data items in a c×d matrix display.
- 41. The method of claim 40, further comprising the steps of:
inputting additional non-experimental data into the memory of the processing system, wherein the additional non-experimental data corresponds to the entities in the n×m matrix and is arranged in a z×m matrix; linking the row addresses (m values) of the z×m matrix with those of the n×m matrix; and displaying at least a portion of the data from the z×m matrix together with at least a portion of the data from the n×y matrix and graphical representations of at least a portion of the identified experimental data items in a c×d matrix; wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a row of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding row in the z×m matrix is reordered by its linking address to maintain the same relative display position to the reordered row of the n×m matrix when displayed in the c×d matrix on the display.
- 42. The method of claim 40, wherein the entities are derived from microarray experiments, the identified experimental data items are gene expression ratios and the graphical representations of the identified data items are color-coded graphical representations typically used in heat maps.
- 43. The method of claim 40, wherein the graphical representations are variably sized to represent relative values of the experimental data.
- 44. The method of claim 43, wherein the graphical representations are variably sized circles.
- 45. The method of claim 44, wherein a graphical representation appears as a rectangle or square, completely filling a cell of the displayed matrix when a circle is calculated to have a diameter greater than or equal to a width or height of the cell in which it is to be displayed.
- 46. The method of claim 43, wherein the graphical representations are variably sized vertical bars.
- 47. The method of claim 43, wherein the graphical representations are variably sized horizontal bars.
- 48. The method of claim 43, wherein the graphical representations are biased toward discovering trends in the horizontal directions along which the rows extend.
- 49. The method of claim 43, wherein the graphical representations are biased toward discovering trends in the vertical directions along which the columns extend.
- 50. The method of claim 40, further comprising linking the entities in the n×m matrix with an additional source of informational data associated with the entities and stored outside of the memory of the processing system; selecting a graphical representation of an identified experimental data item; accessing and searching the additional source of informational data, based upon identifying information of the entity associated with the selected graphical representation; retrieving additional informational data pertaining to the entity associated with the selected graphical representation; and displaying the additional informational data over the display of the c×d matrix, in a manner that it is viewable in conjunction with the selected graphical representation.
- 51. A method for displaying and manipulating data to facilitate identification, trends, correlations or other useful relationships among the data, the method comprising the steps of:
inputting, into a memory of a processing system, data items associated with entities to be observed, wherein the entities are arranged in an n×m matrix, n equaling the number of columns in the matrix and m equaling the number of rows in the matrix; identifying a data item with respect to each entity that is to be represented in a display matrix, wherein each identified data item represents the same characteristic of each respective biological entity, although the value of each identified data item may vary; converting the identified data items to determine graphical representations of the identified data items to be displayed, wherein the graphical representations graphically represent variations in the values of the identified data items; displaying a compressed view of an entire n×m matrix of the graphical representations of the identified data items; displaying a non-compressed view of a selected c×d portion of the compressed n×m matrix; selecting a row or column in the displayed c×d matrix; sorting the order of arrangement of the entities in the n×m matrix based on a comparison of the values of the identified data items in the row or column of the n×m matrix corresponding to the row or column selected from the displayed c×d matrix; reordering the order of arrangement of the entities in the n×m matrix based on the sort performed; displaying the reordered arrangement of the graphical representations in the compressed n×m matrix view; and displaying a non-compressed view of the reordered, compressed n×m display taken from the same selection location of the compressed view that the originally displayed non-compressed c×d matrix was selected from.
- 52. The method of claim 51, further comprising the steps of:
inputting additional data into the memory of the processing system, wherein the additional data corresponds to the entities in the n×m matrix and is arranged in an n×y matrix; linking the column addresses (n values) of the n×y matrix with those of the n×m matrix; and displaying at least a portion of the data from the n×y matrix together with the graphical representations of the identified data items displayed in the non-compressed c×d matrix; wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a column of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding column in the n×y matrix is reordered by its linking address to maintain the same relative display position to the reordered column of the n×m matrix when displayed in the c×d matrix on the display, so that only columns of additional data corresponding to the columns of identified data displayed in the c×d matrix are displayed.
- 53. The method of claim 51, further comprising the steps of:
inputting additional data into the memory of the processing system, wherein the additional data corresponds to the entities in the n×m matrix and is arranged in a z×m matrix; linking the row addresses (m values) of the z×m matrix with those of the n×m matrix; and displaying at least a portion of the data from the z×m matrix together with the graphical representations displayed in the non-compressed c×d display, wherein, upon selecting a row or column in the displayed c×d matrix for sorting, only data items from the n×m matrix are considered for sorting procedures, and wherein, when a row of data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding row in the z×m matrix is reordered by its linking address to maintain the same relative display position to the reordered row of the n×m matrix when displayed in the c×d matrix on the display, so that only data from the z×m matrix that corresponds to the rows of graphical representations displayed in the c×d matrix are displayed.
- 54. A method for displaying and manipulating data sets containing disparate types of data, the method comprising the steps of:
inputting, into a memory of a processing system, experimental data items associated with entities to be observed, wherein the entities are arranged in an n×m matrix, n equaling the number of columns in the matrix and m equaling the number of rows in the matrix; inputting additional data into the memory of the processing system, wherein the additional data is non-experimental data corresponding to the entities in the n×m matrix and is arranged in a matrix selected from the group including a z×m matrix and an n×y matrix; identifying an experimental data item with respect to each entity that is to be represented in a display matrix, wherein each identified experimental data item represents the same characteristic of each respective biological entity, although the value of each identified data item may vary; converting the identified experimental data items to determine graphical representations of the identified data items to be displayed, wherein the graphical representations graphically represent variations in the values of the identified data items; displaying at least a portion of the graphical representations of the identified data items in a c×d matrix, where c≦n and d≦m; displaying at least a portion of the additional data in a z×d matrix or a c×y matrix; moving the matrix display of the additional data in proximity with the c×d matrix display, wherein if the additional data is displayed in a z×d matrix, the z×d matrix is approximated with a side of the c×d matrix and if the additional data is displayed in a c×y matrix, the c×y matrix is approximated with a top or bottom of the c×d matrix; and docking the matrix display of the additional data with the c×d matrix display, thereby synchronizing the matrices.
- 55. The method of claim 54, further comprising the steps of:
selecting a row or column in the displayed c×d matrix; sorting the order of arrangement of the entities in the n×m matrix based on a comparison of the values of the identified experimental data items in the row or column of the n×m matrix corresponding to the row or column selected from the displayed c×d matrix; reordering the order of arrangement of the entities in the n×m matrix based on the sort performed, wherein, when a column of experimental data is reordered in the n×m matrix according to the results of a sorting procedure, a corresponding column in the n×y matrix or row in the z×m matrix is reordered synchronously to maintain the same relative display position to the reordered column of the n×m matrix; and displaying the first c×d graphical representations in the c×d matrix and displaying the additional data corresponding to the displayed graphical representations in the docked z×d or c×y matrix.
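The sorting modes recited in the claims above can be illustrated with a minimal Python sketch. It is not taken from the specification: every function name, the row-major list-of-lists layout, the choice of highest-value-first ranking, and the reading of "most similar" as "smallest squared Euclidean distance" are assumptions made only for illustration. The sketch shows rank ordering columns by a selected row (claim 2), similarity sorting and nearest neighbor sorting of rows (claims 4 to 6 and 11), reordering a linked n×y annotation matrix in step with the data columns (claim 22), and taking the first c×d window for display.

```python
from typing import List, Sequence


def squared_euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two rows (one option, per claim 6)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def rank_sort_columns(matrix: List[list], row: int) -> List[int]:
    """Column order ranked by the values in the selected row.
    Assumption: the highest value is ranked first."""
    return sorted(range(len(matrix[row])), key=lambda j: matrix[row][j], reverse=True)


def similarity_sort_rows(matrix: List[list], selected: int) -> List[int]:
    """Selected row first, then all other rows by increasing distance to it."""
    others = [i for i in range(len(matrix)) if i != selected]
    others.sort(key=lambda i: squared_euclidean(matrix[selected], matrix[i]))
    return [selected] + others


def nearest_neighbor_sort_rows(matrix: List[list], selected: int) -> List[int]:
    """Greedy chain: each next row is the closest remaining row to the
    row placed most recently, starting from the selected row."""
    order = [selected]
    remaining = [i for i in range(len(matrix)) if i != selected]
    while remaining:
        last = matrix[order[-1]]
        nearest = min(remaining, key=lambda i: squared_euclidean(last, matrix[i]))
        order.append(nearest)
        remaining.remove(nearest)
    return order


def reorder_columns(matrix: List[list], order: Sequence[int]) -> List[list]:
    """Apply a column order to every row of a matrix."""
    return [[r[j] for j in order] for r in matrix]


def window(matrix: List[list], c: int, d: int) -> List[list]:
    """First c columns and first d rows, i.e. the displayed c x d portion."""
    return [r[:c] for r in matrix[:d]]


# Hypothetical data: m = 4 rows, n = 3 columns.
data = [[0, 2, 1], [10, 12, 11], [-15, -13, -14], [30, 32, 31]]
# Hypothetical linked annotation matrix: y = 1 row, n = 3 columns.
annotations = [["s0", "s1", "s2"]]

col_order = rank_sort_columns(data, row=0)
sorted_data = reorder_columns(data, col_order)
# The linked annotations follow the same column order as the data.
sorted_annotations = reorder_columns(annotations, col_order)
visible = window(sorted_data, c=2, d=2)

sim_order = similarity_sort_rows(data, selected=0)
nn_order = nearest_neighbor_sort_rows(data, selected=0)
```

With this sample data the two row orderings differ: similarity sorting compares every row to the selected row only, whereas nearest neighbor sorting compares each remaining row to the row placed last, so a row that is far from the selected row can still be placed early if it is close to its predecessor in the chain.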
CROSS-REFERENCE
[0001] This application claims the benefit of U.S. Provisional Application No. 60/402,566, filed Aug. 8, 2002, which application is incorporated herein, in its entirety, by reference thereto.
Provisional Applications (1)
| Number | Date | Country |
| 60402566 | Aug 2002 | US |