Claims
- 1. A method for determining one or more candidate genes, or their encoded RNAs or proteins, responsible for a phenotype of interest displayed by a cell type or organism, comprising:
(a) determining measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism to create a first profile;
(b) comparing said first profile, or a predicted profile derived therefrom, to a database comprising a plurality of landmark profiles to determine the one or more landmark profiles most similar to said first or predicted profile, each landmark profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein,
wherein the genes, or their encoded RNAs or proteins, perturbed in the one or more landmark profiles determined in step (b) are those candidate genes responsible for the phenotype of interest.
- 2. A method for determining one or more candidate genes, or their encoded RNAs or proteins, responsible for a phenotype of interest displayed by a cell type or organism, comprising:
comparing a first profile or a predicted profile derived therefrom to a database comprising a plurality of landmark profiles to determine the one or more landmark profiles most similar to said first or predicted profile; wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism; wherein each landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein; and wherein the genes, or their encoded RNAs or proteins, perturbed in the one or more landmark profiles determined to be most similar are those candidate genes responsible for the phenotype of interest.
- 3. A method for relating the phenotype of a cell type or organism to a genotype, said method comprising:
(a) determining the measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism exhibiting a phenotype to create a first profile;
(b) determining the measured amounts of a plurality of cellular constituents in a second cell of said cell type or of said organism having a genetic perturbation to a known gene to create a landmark profile; and
(c) determining the degree of similarity between said first profile and said landmark profile by comparing said degree of similarity between the measured amounts of said pluralities of cellular constituents,
wherein said degree of similarity between said first profile and said landmark profile indicates the degree of similarity between the genotype resulting in the phenotype of said first cell or organism and the known mutant genotype of said second cell or organism, thereby relating the phenotype of said first cell or organism to the genotype of said second cell or organism.
- 4. A method for relating the phenotype of a cell type or organism to a genotype, said method comprising:
determining the degree of similarity between a first profile and a landmark profile by comparing the degree of similarity between measured amounts of pluralities of cellular constituents, wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism exhibiting a phenotype, and wherein said landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or of said organism having a genetic perturbation to a known gene, wherein said degree of similarity between said first profile and said landmark profile indicates the degree of similarity between the genotype resulting in the phenotype of said first cell or organism and the known mutant genotype of said second cell or organism, thereby relating the phenotype of said first cell or organism to the genotype of said second cell or organism.
- 5. The method of claim 1 or 2, wherein the database comprises landmark profiles for perturbations to at least 100 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 6. The method of claim 5, wherein the database comprises landmark profiles for perturbations to at least 250 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 7. The method of claim 6, wherein the database comprises landmark profiles for perturbations to at least 500 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 8. The method of claim 7, wherein the database comprises landmark profiles for perturbations to at least 5,000 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 9. The method of claim 8, wherein the database comprises landmark profiles for perturbations to at least 50,000 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 10. The method of claim 9, wherein the database comprises landmark profiles for perturbations to at least 100,000 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 11. The method of claim 1 or 2, wherein the database comprises landmark profiles for perturbations to at least ¼ of the genes, or their encoded RNAs or proteins, in the genome of a human, a livestock animal or a plant.
- 12. The method of claim 11, wherein the database comprises landmark profiles for perturbations to at least ½ of the genes, or their encoded RNAs or proteins, in the genome of a human, a livestock animal or a plant.
- 13. The method of claim 12, wherein the database comprises landmark profiles for perturbations to at least ¾ of the genes, or their encoded RNAs or proteins, in the genome of a human, a livestock animal or a plant.
- 14. The method of claim 1 or 2, wherein the database comprises landmark profiles for perturbations to at least 2% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 15. The method of claim 14, wherein the database comprises landmark profiles for perturbations to at least 5% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 16. The method of claim 15, wherein the database comprises landmark profiles for perturbations to at least 15% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 17. The method of claim 16, wherein the database comprises landmark profiles for perturbations to at least 40% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 18. The method of claim 17, wherein the database comprises landmark profiles for perturbations to at least 75% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 19. The method of claim 1 or 2, wherein said predicted profile is compared to said database, and said first profile is at a first developmental stage or first condition and said predicted profile is at a second, different developmental stage or condition more similar to the developmental stage or condition of said second cell than said first cell.
- 20. The method of claim 1 or 2, wherein said first profile comprises measured amounts of at least 1,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 1,000 cellular constituents.
- 21. The method of claim 20, wherein said first profile comprises measured amounts of at least 10,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 10,000 cellular constituents.
- 22. The method of claim 21, wherein said first profile comprises measured amounts of at least 100,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 100,000 cellular constituents.
- 23. The method of claim 22, wherein said first profile comprises measured amounts of at least 500,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 500,000 cellular constituents.
- 24. The method of claim 1, 2, 3 or 4, wherein the measured amounts of the pluralities of cellular constituents are determined by a method comprising converting expression data into expression values of a plurality of sets of co-varying genes.
- 25. The method of claim 1, 2, 3 or 4, wherein said measured amounts of the plurality of cellular constituents comprise abundances of a plurality of RNA species present in said first cell type or organism.
- 26. The method of claim 25, wherein the abundances of said plurality of RNA species are measured by a method comprising contacting a gene transcript array with RNA from said first cell of said cell type or organism, or with cDNA derived therefrom, wherein a gene transcript array comprises a surface with attached nucleic acids or nucleic acid mimics, said nucleic acids or nucleic acid mimics capable of hybridizing with said plurality of RNA species, or with cDNA derived therefrom.
- 27. The method of claim 1, 2, 3 or 4, wherein the measured amounts of the plurality of cellular constituents comprise abundances of a plurality of protein species present in said first cell type or organism.
- 28. The method of claim 27, wherein the abundances of said plurality of protein species are measured by a method comprising contacting an antibody array with proteins from said first cell of said cell type or organism, wherein said antibody array comprises a surface with attached antibodies, said antibodies capable of binding with said plurality of protein species.
- 29. The method of claim 1, 2, 3 or 4, wherein the measured amounts of the plurality of cellular constituents comprise activities of a plurality of protein species present in said cell type.
- 30. The method of claim 1, 2, 3 or 4, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism, and wherein the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism.
- 31. The method of claim 1, 2, 3 or 4, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism and the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are absolute amounts of the pluralities of cellular constituents.
- 32. A method of determining if a genotype associated with a phenotype of interest is present in a cell type or organism, comprising:
(a) determining measured amounts of a plurality of cellular constituents in a first cell of said cell type or organism to create a first profile; and
(b) comparing said first profile to a database comprising a plurality of landmark profiles to determine whether one or more landmark profiles known to be indicative of the presence or absence of a genotype associated with the phenotype of interest is similar to said first profile, each landmark profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein,
wherein determining that the landmark profiles known to be indicative of the presence of said genotype are similar to said first profile, is indicative of the presence of said genotype associated with the phenotype of interest in the cell type or organism; and
wherein determining that the landmark profiles known to be indicative of the absence of said genotype are similar to said first profile, is indicative of the absence of said genotype associated with the phenotype of interest in the cell type or organism.
- 33. A method of determining if a genotype associated with a phenotype of interest is present in a cell type or organism, comprising:
comparing a first profile or a predicted profile derived therefrom to a database comprising a plurality of landmark profiles to determine whether one or more landmark profiles known to be indicative of the presence or absence of a genotype associated with the phenotype of interest is similar to said first or predicted profile; wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism; wherein each landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein; and wherein determining that the landmark profiles known to be indicative of the presence of said genotype are similar to said first or predicted profile, is indicative of the presence of said genotype associated with the phenotype of interest in the cell type or organism; and wherein determining that the landmark profiles known to be indicative of the absence of said genotype are similar to said first or predicted profile, is indicative of the absence of said genotype associated with the phenotype of interest in the cell type or organism.
- 34. The method of claim 32 or 33, wherein the phenotype is desirable.
- 35. The method of claim 32 or 33, wherein the phenotype is undesirable.
- 36. The method of claim 32 or 33, wherein the database comprises landmark profiles for perturbations to at least 100 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 37. The method of claim 36, wherein the database comprises landmark profiles for perturbations to at least 250 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 38. The method of claim 37, wherein the database comprises landmark profiles for perturbations to at least 5,000 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 39. The method of claim 38, wherein the database comprises landmark profiles for perturbations to at least 100,000 genes, or their encoded RNAs or proteins, in the genome of said cell type or organism.
- 40. The method of claim 32 or 33, wherein the database comprises landmark profiles for perturbations to at least ½ of the genes, or their encoded RNAs or proteins, in the genome of a human, a livestock animal or a plant.
- 41. The method of claim 40, wherein the database comprises landmark profiles for perturbations to at least ¾ of the genes, or their encoded RNAs or proteins, in the genome of a human, a livestock animal or a plant.
- 42. The method of claim 32 or 33, wherein the database comprises landmark profiles for perturbations to at least 2% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 43. The method of claim 42, wherein the database comprises landmark profiles for perturbations to at least 5% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 44. The method of claim 43, wherein the database comprises landmark profiles for perturbations to at least 15% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 45. The method of claim 44, wherein the database comprises landmark profiles for perturbations to at least 75% of the genes, or their encoded RNAs or proteins, in a genome of said cell type or organism.
- 46. The method of claim 33, wherein said predicted profile is compared to said database, and said first profile is at a first developmental stage or first condition and said predicted profile is at a second, different developmental stage or condition more similar to the developmental stage or condition of said second cell than said first cell.
- 47. The method of claim 32 or 33, wherein said first profile comprises measured amounts of at least 1,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 1,000 cellular constituents.
- 48. The method of claim 47, wherein said first profile comprises measured amounts of at least 100,000 cellular constituents, and said landmark profiles each comprise measured amounts of at least 100,000 cellular constituents.
- 49. The method of claim 32 or 33, wherein the measured amounts of the pluralities of cellular constituents are determined by a method comprising converting expression data into expression values of a plurality of sets of co-varying genes.
- 50. The method of claim 32 or 33, wherein the measured amounts of the plurality of cellular constituents comprise abundances of a plurality of RNA species present in said first cell type or organism.
- 51. The method of claim 50, wherein the abundances of said plurality of RNA species are measured by a method comprising contacting a gene transcript array with RNA from said first cell of said cell type or organism, or with cDNA derived therefrom, wherein a gene transcript array comprises a surface with attached nucleic acids or nucleic acid mimics, said nucleic acids or nucleic acid mimics capable of hybridizing with said plurality of RNA species, or with cDNA derived therefrom.
- 52. The method of claim 32 or 33, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism, and wherein the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism.
- 53. The method of claim 32 or 33, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism and the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are absolute amounts of the pluralities of cellular constituents.
- 54. A system for determining one or more candidate genes, or their encoded RNAs or proteins, responsible for a phenotype of interest displayed by a cell or organism, said system comprising:
(a) one or more memory units; and (b) one or more processor units interconnected with the one or more memory units, wherein the one or more memory units encodes one or more programs causing the one or more processor units to perform a method comprising comparing a first profile or a predicted profile derived therefrom to a database comprising a plurality of landmark profiles to determine the one or more landmark profiles most similar to said first or predicted profile; wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism; wherein each landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein; and wherein the genes perturbed in the one or more landmark profiles determined to be most similar are those candidate genes responsible for the phenotype of interest.
- 55. The system of claim 54, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism, and wherein the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism.
- 56. The system of claim 54, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism and the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are absolute amounts of the pluralities of cellular constituents.
- 57. A system for relating the phenotype of a cell type or organism to a genotype, said system comprising:
(a) one or more memory units; and (b) one or more processor units interconnected with the one or more memory units, wherein the one or more memory units encodes one or more programs causing the one or more processor units to perform a method comprising determining the degree of similarity between a first profile of measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism exhibiting a phenotype and a landmark profile of measured amounts of a plurality of cellular constituents in a second cell of said cell type or of said organism having a genetic perturbation to a known gene by comparing said degree of similarity between the measured amounts of said pluralities of cellular constituents, wherein said degree of similarity between said first profile and said landmark profile indicates the degree of similarity between the genotype resulting in the phenotype of said first cell or organism and the known mutant genotype of said second cell or organism, thereby relating the phenotype of said first cell or organism to the genotype of said second cell or organism.
- 58. The system of claim 57, wherein the memory encodes one or more programs causing the one or more processor units to further perform the steps of
inputting measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism exhibiting a phenotype that is a first profile; and inputting measured amounts of a plurality of cellular constituents in a second cell of said cell type or of said organism having a genetic perturbation that is a landmark profile before the step of determining the degree of similarity between said first profile and said landmark profile.
- 59. A system for determining if a genotype associated with a phenotype of interest is present in a cell type or organism, said system comprising:
(a) one or more memory units; and (b) one or more processor units interconnected with the one or more memory units, wherein the one or more memory units encodes one or more programs causing the one or more processor units to perform a method comprising comparing a first profile or a predicted profile derived therefrom to a database comprising a plurality of landmark profiles to determine whether one or more landmark profiles known to be indicative of the presence or absence of a genotype associated with the phenotype of interest is similar to said first or predicted profile; wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism; wherein each landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein; and wherein determining that the landmark profiles known to be indicative of the presence of said genotype are similar to said first or predicted profile, is indicative of the presence of said genotype associated with the phenotype of interest in the cell type or organism; and wherein determining that the landmark profiles known to be indicative of the absence of said genotype are similar to said first or predicted profile, is indicative of the absence of said genotype associated with the phenotype of interest in the cell type or organism.
- 60. The system of claim 59, wherein the phenotype is desirable.
- 61. The system of claim 59, wherein the phenotype is undesirable.
- 62. The system of claim 54, 57, or 59, wherein said programs further cause the one or more processor units to perform a step of converting expression data into expression values of a plurality of sets of co-varying genes.
- 63. The system of claim 54, 57, or 59, wherein said programs further cause the one or more processor units to perform a step of predicting said predicted profile, and wherein said first profile is at a first developmental stage or first condition and said predicted profile is at a second, different developmental stage or condition more similar to the developmental stage or condition of said second cell than said first cell.
- 64. The system of claim 59, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism, and wherein the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism.
- 65. The system of claim 59, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism and the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are absolute amounts of the pluralities of the cellular constituents.
- 66. A computer program product for use in conjunction with a computer having one or more memory units and one or more processor units, the computer program product comprising a computer readable storage medium having a computer program mechanism encoded thereon, wherein said computer program mechanism may be loaded into the one or more memory units of a computer and cause the one or more processor units of the computer to execute the step of:
comparing a first profile or a predicted profile derived therefrom to a database comprising a plurality of landmark profiles to determine whether one or more landmark profiles known to be indicative of the presence or absence of a genotype associated with the phenotype of interest is similar to said first or predicted profile; wherein said first profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism; wherein each landmark profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a known gene or its encoded RNA or protein; and wherein determining that the landmark profiles known to be indicative of the presence of said genotype are similar to said first or predicted profile, is indicative of the presence of said genotype associated with the phenotype of interest in the cell type or organism; and wherein determining that the landmark profiles known to be indicative of the absence of said genotype are similar to said first or predicted profile, is indicative of the absence of said genotype associated with the phenotype of interest in the cell type or organism.
- 67. The computer program product of claim 66, further comprising the step of converting expression data into expression values of a plurality of sets of co-varying genes.
- 68. The computer program product of claim 66, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism, and wherein the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are determined in comparison to a wild-type cell of said cell type or of said organism.
- 69. The computer program product of claim 66, wherein the measured amounts of the plurality of cellular constituents in said first cell of said cell type or of said organism and the measured amounts of the plurality of cellular constituents in said second cell of said cell type or of said organism are absolute amounts of the pluralities of cellular constituents.
- 70. The method of claim 1, 2, 3 or 4, wherein said one or more landmark profiles determined to be most similar to said first or predicted profile is a consensus profile associated with a perturbation to said known gene.
- 71. The method of claim 32 or 33, wherein said one or more landmark profiles known to be indicative of the presence or absence of a genotype associated with a phenotype of interest is a consensus profile associated with the presence or absence of said genotype associated with said phenotype of interest.
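
For illustration only, the comparing step recited in claims 1 and 2 can be sketched as a ranking of landmark profiles by their similarity to a first profile. The claims do not prescribe a similarity measure; the Pearson correlation used here, the function names `pearson` and `rank_landmarks`, the gene labels, and all numeric values are assumptions introduced for this sketch and are not taken from the claims.

```python
import math

def pearson(x, y):
    """Pearson correlation between two equal-length profiles.

    Assumption: correlation is one possible similarity measure;
    the claims do not specify which measure is used.
    """
    if len(x) != len(y):
        raise ValueError("profiles must cover the same cellular constituents")
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    xc = [v - mx for v in x]
    yc = [v - my for v in y]
    denom = math.sqrt(sum(v * v for v in xc)) * math.sqrt(sum(v * v for v in yc))
    if denom == 0.0:
        return 0.0  # a constant profile carries no pattern to compare
    return sum(a * b for a, b in zip(xc, yc)) / denom

def rank_landmarks(first_profile, landmark_db, top_k=1):
    """Return the top_k (perturbed_gene, similarity) pairs, most similar first."""
    scored = [(gene, pearson(first_profile, profile))
              for gene, profile in landmark_db.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

# Hypothetical database: each key names the known perturbed gene, each value
# holds measured amounts of four cellular constituents (made-up numbers,
# e.g. log ratios relative to a wild-type cell).
landmark_db = {
    "geneA": [2.0, -1.0, 0.0, 1.0],
    "geneB": [-1.0, 2.0, 1.0, 0.0],
    "geneC": [0.0, 0.0, 1.0, -1.0],
}

# Hypothetical first profile from the cell displaying the phenotype of interest.
first_profile = [1.8, -0.9, 0.1, 1.2]

candidates = rank_landmarks(first_profile, landmark_db, top_k=2)
for gene, similarity in candidates:
    print(gene, round(similarity, 3))
```

In this sketch the genes attached to the highest-ranked landmark profiles play the role of the candidate genes. A practical implementation would also need to decide how many landmark profiles count as "most similar", for example with a similarity threshold, which the claims leave open.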
Parent Case Info
[0001] This application claims benefit of provisional U.S. Patent Application Serial No. 60/215,935 filed Jul. 5, 2000, which is incorporated by reference herein in its entirety.
PCT Information
| Filing Document | Filing Date | Country | Kind |
| --- | --- | --- | --- |
| PCT/US01/20931 | 7/2/2001 | WO | |