Claims
- 1. A method of data analysis comprising,
employing data comprising, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories, assigning an attribute position to each of said plurality of attributes on a locus defined on a multi-dimensional representation, assigning a record position on said multi-dimensional representation to each of said plurality of records, said record position being dependent at least in part on at least one of, an occurrence and a value, of at least one of said plurality of attributes, and reassigning said attribute position of at least one of said plurality of attributes on said locus to divide said plurality of records into said at least two categories.
- 2. The method of claim 1, wherein said assigning step comprises assigning said record position at least in part in dependence on said attribute position assigned to each of said plurality of attributes.
- 3. The method of claim 1 further comprising, reassigning said record position of each of said plurality of records in response to said reassigning said attribute position step.
- 4. The method of claim 1 further comprising, repeating said reassigning said attribute position step until said plurality of records divide into said at least two categories.
- 5. The method of claim 1, wherein said assigning a record position comprises,
defining a vector associated with each of said plurality of attributes for at least one of said plurality of records, defining a magnitude for each said vector corresponding to a value of said associated attribute for said particular one of said plurality of records, determining a relationship between each said vector for said particular one of said plurality of records, and assigning said record location in dependence of said relationship.
- 6. The method of claim 5, wherein said relationship comprises a vector sum.
- 7. The method of claim 5, wherein said relationship is based at least in part on Hooke's law.
- 8. The method of claim 1, wherein said locus defined on said multi-dimensional representation is a periphery of said multi-dimensional representation.
- 9. The method of claim 1, wherein reassigning said attribute position comprises exchanging said attribute position of at least two of said attributes.
- 10. The method of claim 1, wherein reassigning said attribute position comprises shifting said attribute position of said at least one attribute.
- 11. The method of claim 1, wherein assigning an attribute position to each of said attributes on a periphery of a multi-dimensional representation comprises assigning said attribute position to be equidistant from an attribute on either side of said attribute.
- 12. The method of claim 1, wherein said associated plurality of attributes is identical for each of said plurality of records.
- 13. The method of claim 1, wherein said data comprises data recorded from a plurality of test specimens about which category information is unknown.
- 14. The method of claim 1, wherein said data comprises control data recorded from a plurality of control specimens about which category information is known.
- 15. The method of claim 14, further comprising employing said control data to calibrate said method of claim 1.
- 16. The method of claim 15, further comprising employing test data, about which category information is unknown, for said data, subsequent to said calibration.
- 17. The method of claim 1 further comprising, displaying said record position for at least one of said records to an operator to indicate into which of said at least two categories said at least one record divides.
- 18. The method of claim 1 wherein said multi-dimensional representation has more than two dimensions.
- 19. The method of claim 1, wherein said multi-dimensional representation comprises a polygon.
- 20. The method of claim 1, wherein said multi-dimensional representation comprises a conic section.
- 21. The method of claim 1, wherein said records represent cells and said attributes are properties of said cells.
- 22. The method of claim 1, wherein said records represent mammals and said attributes are characteristics of said mammals.
- 23. The method of claim 1, wherein said records represent a sample from a mammal and said attributes are biological markers.
- 24. The method of claim 23, wherein said biological marker is a gene product.
- 25. The method of claim 23, wherein said biological marker is at least one of a protein and an mRNA.
- 26. The method of claim 1, wherein at least one of said at least two categories represents a predisposition to contract a disease.
- 27. The method of claim 26, wherein said disease is leukemia.
- 28. The method of claim 1, wherein at least one of said at least two categories represents a predisposition to a medical treatment efficacy.
- 29. The method of claim 1, wherein a first category represents a mammal having a first phenotype and a second category represents a mammal having a second, different phenotype.
- 30. The method of claim 29, wherein the first phenotype is a disease affected phenotype.
- 31. The method of claim 30, wherein the second phenotype is a non-disease affected phenotype.
- 32. The method of claim 30, wherein the disease is a cancer.
- 33. The method of claim 14, wherein said control specimens are mammals having a non-disease affected phenotype.
- 34. The method of claim 14, wherein said control specimens are mammals having a disease affected phenotype.
- 35. The method of claim 13, wherein said test specimen is a mammal of unknown phenotypic disposition.
- 36. A system for data analysis comprising,
a processor adapted for,
employing data comprising, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories, assigning an attribute position to each of said plurality of attributes on a locus defined on a multi-dimensional representation, assigning a record position on said multi-dimensional representation to each of said plurality of records, said record position being dependent at least in part on at least one of, an occurrence and a value, of at least one of said plurality of attributes, and reassigning said attribute position of at least one of said plurality of attributes on said locus to divide said plurality of records into said at least two categories.
- 37. The system of claim 36, wherein said assigning comprises assigning said record position at least in part in dependence on said attribute position assigned to each of said plurality of attributes.
- 38. The system of claim 36, wherein said processor is further adapted for reassigning said record position of each of said plurality of records in response to said reassigning said attribute position step.
- 39. The system of claim 36, wherein said processor is further adapted for, repeating said reassigning said attribute position step until said plurality of records divide into said at least two categories.
- 40. The system of claim 36, wherein said assigning a record position comprises,
defining a vector associated with each of said plurality of attributes for at least one of said plurality of records, defining a magnitude for each said vector corresponding to a value of said associated attribute for said particular one of said plurality of records, determining a relationship between each said vector for said particular one of said plurality of records, and assigning said record location in dependence of said relationship.
- 41. The system of claim 40, wherein said relationship comprises a vector sum.
- 42. The system of claim 40, wherein said relationship is based at least in part on Hooke's law.
- 43. The system of claim 36, wherein said locus defined on said multi-dimensional representation is a periphery of said multi-dimensional representation.
- 44. The system of claim 36, wherein reassigning said attribute position comprises exchanging said attribute position of at least two of said attributes.
- 45. The system of claim 36, wherein reassigning said attribute position comprises shifting said attribute position of said at least one attribute.
- 46. The system of claim 36, wherein assigning an attribute position to each of said attributes on a periphery of a multi-dimensional representation comprises assigning said attribute position to be equidistant from an attribute on either side of said attribute.
- 47. The system of claim 36, wherein said associated plurality of attributes is identical for each of said plurality of records.
- 48. The system of claim 36, wherein said data comprises data recorded from a plurality of test specimens about which category information is unknown.
- 49. The system of claim 36, wherein said data comprises control data recorded from a plurality of control specimens about which category information is known.
- 50. The system of claim 49, wherein said processor is further adapted for employing said control data to calibrate said system of claim 1.
- 51. The system of claim 49, wherein said processor is further adapted for employing test data, about which category information is unknown, for said data, subsequent to said calibration.
- 52. The system of claim 36 further comprising, a display device adapted for displaying said record position for at least one of said records to an operator to indicate into which of said at least two categories said at least one record divides.
- 53. The system of claim 36, wherein said multi-dimensional representation has more than two dimensions.
- 54. The system of claim 36, wherein said multi-dimensional representation comprises a polygon.
- 55. The system of claim 36, wherein said multi-dimensional representation comprises a conic section.
- 56. The system of claim 36, wherein said records represent cells and said attributes are properties of said cells.
- 57. The system of claim 36, wherein said records represent mammals and said attributes are characteristics of said mammals.
- 58. The system of claim 36, wherein said records represent a sample from a mammal and said attributes are biological markers.
- 59. The system of claim 36, wherein said biological marker is a gene product.
- 60. The system of claim 59, wherein said biological marker is at least one of a protein and an mRNA.
- 61. The system of claim 36, wherein at least one of said at least two categories represents a predisposition to contract a disease.
- 62. The system of claim 61, wherein said disease is a cancer.
- 63. The system of claim 36, wherein at least one of said at least two categories represents a predisposition to a medical treatment efficacy.
- 64. The system of claim 36, wherein a first category represents a mammal having a first phenotype and a second category represents a mammal having a second, different phenotype.
- 65. The system of claim 36, wherein the first phenotype is a disease affected phenotype.
- 66. The system of claim 36, wherein the second phenotype is a non-disease affected phenotype.
- 67. The system of claim 65, wherein the disease is a cancer.
- 68. The system of claim 49, wherein said control specimens are mammals having a non-disease affected phenotype.
- 69. The system of claim 49, wherein said control specimens are mammals having a disease affected phenotype.
- 70. The system of claim 49, wherein said test specimen is a mammal of unknown phenotypic disposition.
- 71. A computer program for data analysis recorded on a computer-readable medium, the computer program when operating performing the steps of,
employing data comprising, a plurality of records, each of said records having an associated plurality of attributes, said plurality of records being divisible into at least two categories, assigning an attribute position to each of said plurality of attributes on a locus defined on a multi-dimensional representation, assigning a record position on said multi-dimensional representation to each of said plurality of records, said record position being dependent at least in part on at least one of, an occurrence and a value, of at least one of said plurality of attributes, and reassigning said attribute position of at least one of said plurality of attributes on said locus to divide said plurality of records into said at least two categories.
- 72. The computer program of claim 71, wherein said assigning comprises assigning said record position at least in part in dependence on said attribute position assigned to each of said plurality of attributes.
- 73. The computer program of claim 71, when operating, further comprising, reassigning said record position of each of said plurality of records in response to said reassigning said attribute position step.
- 74. The computer program of claim 71, when operating, further comprising, repeating said reassigning said attribute position step until said plurality of records divide into said at least two categories.
- 75. The computer program of claim 71, wherein said assigning a record position comprises,
defining a vector associated with each of said plurality of attributes for at least one of said plurality of records, defining a magnitude for each said vector corresponding to a value of said associated attribute for said particular one of said plurality of records, determining a relationship between each said vector for said particular one of said plurality of records, and assigning said record location in dependence of said relationship.
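By way of non-limiting illustration only, the steps recited in the claims above might be sketched as follows. The sketch assumes a two-dimensional representation whose locus is the periphery of a unit circle, non-negative attribute values, exactly two categories, and a Hooke's-law relationship in which each attribute pulls the record toward its attribute position with a stiffness equal to the attribute value. All function names, the `separation_gap` score, the greedy exchange search, and the sample data are hypothetical and are not taken from the specification.

```python
import itertools
import math


def anchor_positions(n):
    """Place n attribute positions equidistantly on the unit circle (the locus)."""
    return [(math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
            for i in range(n)]


def record_position(values, anchors):
    """Spring equilibrium under Hooke's law: the rest point is the
    value-weighted mean of the attribute positions."""
    total = sum(values)
    if total == 0:
        return (0.0, 0.0)  # no attribute pulls: rest at the center
    x = sum(v * ax for v, (ax, _) in zip(values, anchors)) / total
    y = sum(v * ay for v, (_, ay) in zip(values, anchors)) / total
    return (x, y)


def separation_gap(positions, labels):
    """Hypothetical score for exactly two categories: project records onto the
    axis joining the category centroids; a positive gap means a straight line
    perpendicular to that axis divides the two categories."""
    cats = sorted(set(labels))
    groups = [[p for p, lab in zip(positions, labels) if lab == c] for c in cats]
    c0, c1 = [(sum(x for x, _ in g) / len(g), sum(y for _, y in g) / len(g))
              for g in groups]
    dx, dy = c1[0] - c0[0], c1[1] - c0[1]

    def proj(p):
        return p[0] * dx + p[1] * dy

    return min(proj(p) for p in groups[1]) - max(proj(p) for p in groups[0])


def layout_gap(records, labels, order):
    """Score a layout; order[j] is the slot on the locus given to attribute j."""
    slots = anchor_positions(len(order))
    anchors = [slots[s] for s in order]
    return separation_gap([record_position(r, anchors) for r in records], labels)


def divide(records, labels, max_rounds=20):
    """Repeatedly exchange the positions of two attributes, keeping an exchange
    only if it improves the gap, until the records divide or nothing improves."""
    n = len(records[0])
    order = list(range(n))
    best = layout_gap(records, labels, order)
    for _ in range(max_rounds):
        if best > 0:
            break  # the two categories are divided
        improved = False
        for i, j in itertools.combinations(range(n), 2):
            order[i], order[j] = order[j], order[i]
            gap = layout_gap(records, labels, order)
            if gap > best + 1e-12:
                best, improved = gap, True
            else:
                order[i], order[j] = order[j], order[i]  # undo the exchange
        if not improved:
            break
    return order, best


# Made-up control data: category "A" is high in attributes 0 and 2,
# category "B" is high in attributes 1 and 3.
records = [[5, 1, 5, 1], [4, 0, 6, 1], [6, 1, 4, 0],
           [1, 5, 1, 5], [0, 4, 1, 6], [1, 6, 0, 4]]
labels = ["A", "A", "A", "B", "B", "B"]
order, gap = divide(records, labels)
print(order, gap)
```

In the initial layout of this made-up data, the attributes that characterize each category sit on opposite sides of the circle, so their pulls cancel and both categories collapse toward the center. Exchanging attribute positions places the characteristic attributes of a category next to one another, and the search ends with a positive gap. Other relationships, scores, and search strategies could equally be substituted.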
REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. provisional patent application Serial No. 60/285,385, filed Apr. 20, 2001, U.S. provisional patent application Serial No. 60/285,945, filed Apr. 23, 2001, U.S. provisional patent application Serial No. 60/322,771, filed Sep. 17, 2001, and U.S. provisional application identified by Attorney Docket Code ANV-003PR, entitled Multi-Dimensional Interactive Data Visualization Applied To Small Molecule Research, filed Jan. 15, 2002, all of which applications are incorporated herein in their entirety by reference.
[0002] This application is related to U.S. patent application identified by Attorney Docket Code ANV-001, entitled “Method And System For Data Analysis” and to U.S. patent application identified by Attorney Docket Code ANV-004, and entitled “Method And System For Data Analysis”, both of which are filed on even date herewith and incorporated herein in their entirety by reference.
Provisional Applications (3)

| Number | Date | Country |
|---|---|---|
| 60285385 | Apr 2001 | US |
| 60285945 | Apr 2001 | US |
| 60322771 | Sep 2001 | US |