Claims
- 1. A method of data analysis comprising,
employing data comprising, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories, providing a multidimensional array having intersecting first and second axes, assigning each of said attributes as a one dimensional vector aligned along said first axis, assigning each of said records as a one dimensional vector aligned along said second axis, displaying a graphical indication of at least one of an occurrence and a value of each said attribute for each said record at an intersection of each said record vector with each said attribute vector, and manipulating at least one of said record vectors and said attribute vectors to produce a graphical pattern representative of said at least two categories.
- 2. The method of claim 1, wherein said manipulating step comprises transposing vectors in at least one of a pair of said record vectors and a pair of said attribute vectors to produce said graphical pattern representative of said at least two categories.
- 3. The method of claim 1 further comprising, determining, from said graphical pattern, a result-effective subset of attributes that is sufficient to divide said records into said at least two categories.
- 4. The method of claim 3, wherein said result-effective subset is a minimum subset.
- 5. The method of claim 4, wherein said minimum subset is unique.
- 6. The method of claim 2 further comprising, repeating said transposing step until said graphical pattern is produced.
- 7. The method of claim 1, wherein said graphical indication has at least two states.
- 8. The method of claim 7, wherein each of said states is represented by a color.
- 9. The method of claim 7, wherein each of said states is represented by a symbol.
- 10. The method of claim 7, wherein each of said states is represented by an integer.
- 11. The method of claim 7, wherein said each of said states is represented by a shade of gray
- 12. The method of claim 1, wherein said data comprises training data obtained from control specimens.
- 13. The method of claim 1, wherein said manipulating step comprises employing a principal uncorrelated record set algorithm.
- 14. The method of claim 1, wherein said records represent cells and said attributes are properties of said cells.
- 15. The method of claim 1, wherein said records represent mammals and said attributes are characteristics of said mammals.
- 16. The method of claim 1, wherein said records represent a sample from a mammal and said attributes are biological markers.
- 17. The method of claim 16, wherein said biological marker is a gene product.
- 18. The method of claim 16, wherein said biological marker is at least one of a protein and an mRNA.
- 19. The method of claim 1, wherein at least one of said at least two categories represents a predisposition to contract a disease.
- 20. The method of claim 19, wherein said disease is leukemia.
- 21. The method of claim 1, wherein at least one of said at least two categories represents a predisposition to a medical treatment efficacy.
- 22. The method of claim 1, wherein a first category represents a mammal having a first phenotype and a second category represents a mammal having a second, different phenotype.
- 23. The method of claim 22, wherein the first phenotype is a disease affected phenotype.
- 24. The method of claim 22, wherein the second phenotype is a non-disease affected phenotype.
- 25. The method of claim 22, wherein the disease is a cancer.
- 26. The method of claim 12, wherein said control specimens are mammals having a non-disease affected phenotype.
- 27. The method of claim 12, wherein said control specimens are mammals having a disease affected phenotype.
- 28. A system for data analysis, comprising;
a processor adapted for,
providing data including, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories, providing a two dimensional array having intersecting first and second axes, assigning each of said attributes as a one dimensional vector aligned along said first axis, assigning each of said records as a one dimensional vector aligned along said second axis, and manipulating at least one of said record vectors and said attribute vectors to produce a graphical pattern representative of said at least two categories; and a display adapted to display a graphical indication of at least one of an occurrence and a value of each said attribute for each said record at an intersection of each said record vector with each said attribute vector.
- 29. The system of claim 28, wherein said manipulating comprises transposing vectors in at least one of a pair of said record vectors and a pair of said attribute vectors to produce said graphical pattern representative of said at least two categories.
- 30. The system of claim 28, wherein said processor is further adapted for determining, from said graphical pattern, a result-effective subset of attributes that is sufficient to divide said records into said at least two categories.
- 31. The system of claim 30, wherein said result-effective subset is a minimum subset.
- 32. The system of claim 30, wherein said minimum subset is unique.
- 33. The system of claim 29, wherein said processor is further adapted for repeating said transposing step unit said graphical pattern is produced.
- 34. The system of claim 28, wherein said graphical indication has at least two states.
- 35. The system of claim 34, wherein each of said states is represented by a color.
- 36. The system of claim 34, wherein each of said states is represented by a symbol.
- 37. The system of claim 34, wherein each of said states is represented by an integer.
- 38. The system of claim 34, wherein said each of said states is represented by a shade of gray.
- 39. The system of claim 28, wherein said data comprises training data obtained from control specimens.
- 40. The system of claim 28, wherein said manipulating comprises employing a principal uncorrelated record set algorithm.
- 41. The system of claim 28, wherein said records represent cells and said attributes are properties of said cells.
- 42. The system of claim 28, wherein said records represent mammals and said attributes are characteristics of said mammals.
- 43. The system of claim 28, wherein said records represent a sample from a mammal and said attributes are biological markers.
- 44. The system of claim 43, wherein said biological marker is a gene product.
- 45. The system of claim 43, wherein said biological marker is at least one of a protein and an mRNA.
- 46. The system of claim 28, wherein at least one of said at least two categories represents a predisposition to contract a disease.
- 47. The system of claim 46, wherein said disease is leukemia.
- 48. The system of claim 28, wherein at least one of said at least two categories represents a predisposition to a medical treatment efficacy.
- 49. The system of claim 28, wherein a first category represents a mammal having a first phenotype and a second category represents a mammal having a second, different phenotype.
- 50. The system of claim 49, wherein the first phenotype is a disease affected phenotype.
- 51. The system of claim 49, wherein the second phenotype is a non-disease affected phenotype.
- 52. The system of claim 50, wherein the disease is a cancer.
- 53. The system of claim 39, wherein said control specimens are mammals having a non-disease affected phenotype.
- 54. The system of claim 39, wherein said control specimens are mammals having a disease affected phenotype.
- 55. A computer program recorded on a computer-readable medium for graphical data analysis, said computer program when operating performing said steps of,
accepting data comprising, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories,
constructing a multidimensional array having intersecting first and second axes, assigning each of said attributes as a one dimensional vector aligned along said first axis, assigning each of said records as a one dimensional vector aligned along said second axis, displaying a graphical indication of at least one of an occurrence and a value of said attribute for each said record at an intersection of each said record vector with each said attribute vector, and manipulating at least one of said record vectors and said attribute vectors to produce a graphical pattern representative of said at least two categories.
- 56. The computer program of claim 55, wherein said manipulating step comprises transposing vectors in at least one of a pair of said record vectors and a pair of said attribute vectors to produce said graphical pattern representative of said at least two categories.
- 57. The computer program of claim 55, when operating, further comprising, determining, form said graphical pattern, a result-effective subset of attributes that is sufficient to divide said records into said at least two categories.
REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. provisional patent application Serial No. 60/285,385, filed Apr. 20, 2001, U.S. provisional patent application Serial No. 60/285,945, filed Apr. 23, 2001, U.S. provisional patent application Serial No. 60/322,771, filed Sep. 17, 2001, and U.S. provisional application identified by Attorney Docket Code ANV-003PR, entitled Multi-Dimensional Interactive Data Visualization Applied To Small Molecule Research, filed Jan. 15, 2002, all of which applications are incorporated herein in their entirety by reference.
[0002] This application is related to U.S. patent application identified by Attorney Docket Code ANV-002, entitled “Method And System For Data Analysis” and to U.S. patent application identified by Attorney Docket Code ANV-004, and entitled “Method And System For Data Analysis”, both of which are filed on even date herewith and incorporated herein in their entirety by reference.
Provisional Applications (3)
|
Number |
Date |
Country |
|
60285385 |
Apr 2001 |
US |
|
60285954 |
Apr 2001 |
US |
|
60322771 |
Sep 2001 |
US |