Claims
- 1. A method for producing a collection of responder cells, comprising:
a) obtaining an expression profile of a genome or a transcriptome exposed to a perturbation; b) identifying genes that are differentially expressed under the perturbation compared to the absence of the perturbation; c) identifying and isolating regulatory regions from one or more of the genes that are differentially expressed; d) operatively linking each regulatory region to nucleic acid encoding a reporter to produce a reporter construct; and e) introducing each reporter construct into an addressable collection to cells to produce an addressable collection of responder cells.
- 2. The method of claim 1, wherein a plurality of regulatory regions that respond to a perturbation are identified.
- 3. The method of claim 1, wherein the regulatory region comprises a promoter.
- 4. The method of claim 1, wherein the regulatory regions comprise robust responders.
- 5. The method of claim 1, wherein the perturbation comprises exposure to a test compound or plurality thereof.
- 6. The method of claim 5, wherein the test compound is a biopolypmer, a small organic molecule or a natural product.
- 7. The method of claim 6, wherein the test compound is a nucleic acid molecule or a polypeptide.
- 8. The method of claim 6, wherein the test compound is an antibody, a member of a combinatorial library, an antibody or binding fragment thereof, or antisense molecule.
- 9. The method of claim 1, wherein the genome is eukaryotic genome.
- 10. The method of claim 1, wherein the genome is an an animal insect, plant or yeast genome.
- 11. The method of claim 1, wherein the genome is a mammalian genome.
- 12. The method of claim 10, wherein the animal is a human.
- 13. The method of claim 1, wherein the transcriptome is from a tissue or organ.
- 14. The method of claim 1, wherein the perturbation is a disease state in the organism and expression is compared to its absence.
- 15. The method of claim 1, wherein the transcriptome is from a cancerous tissue or organ.
- 16. The method of claim 1, wherein expression of genes operatively linked to the regulatory regions is repressed and/or increased under the perturbation.
- 17. An addressable collection of responder cells produced by the method of claim 1, wherein the collection contains a plurality of sets of cells; and each set contains a different reporter construct.
- 18. The collection of claim 17, wherein each set is in a well in a high density microtiter plate.
- 19. The collection of claim 18, wherein the microtiter plate contains at least 384 wells.
- 20. A method for identifying a regulatory region of a robust responder gene among a plurality of genes comprising:
a) exposing the cell to a test perturbation; b) determining expression of a plurality of genes in the cell in the presence of the perturbation compared to the absence thereof; c) identifying at least one gene whose expression is increased or decreased at least 3-fold in the presence of perturbation compared to the absence thereof; and d) identifying a regulatory region of a gene that confers increased or decreased expression in response the perturbation.
- 21. The method of claim 20, wherein the perturbation is a substance or change in intra-cellular or extra-cellular condition.
- 22. The method of claim 20, wherein at least one gene whose expression is decreased at least 6-fold in the presence of the perturbation is identified.
- 23. The method of claim 20, wherein the regulatory region comprises a promoter or an enhancer.
- 24. The method of claim 20, wherein the cell comprises a tissue or organ or a sample thereof.
- 25. The method of claim 20, wherein the cell is eukaryotic or prokaryotic.
- 26. The method of claim 20, wherein the eukaryotic cell is mammalian, insect, plant or yeast.
- 27. The method of claim 26, wherein the mammalian cell is human.
- 28. The method of claim 20, wherein the perturbation comprises exposure to a drug, a hormone, an extract, a protein, a nucleic acid, a lipid, a carbohydrate or a fat.
- 29. The method of claim 1, wherein the perturbation comprises exposure to a drug, a hormone, an extract, a protein, a nucleic acid, a lipid, a carbohydrate or a fat.
- 30. The method of claim 1, wherein the perturbation comprises increased or decreased temperature, exposure to ultraviolet light, a change in pH, a change in a salt or ion concentration, exposure to or a decrease in oxygen.
- 31. The method of claim 20, wherein the perturbation comprises increased or decreased temperature, exposure to ultraviolet light, a change in pH, a change in a salt or ion concentration, exposure to or a decrease in oxygen.
- 32. The method of claim 20, further comprising:
e) operatively linking a sequence comprising a 5′ untranslated region extending upstream of the translation initiation site of the selected gene to a reporter gene to a produce a reporter gene construct.
- 33. The method of claim 32, further comprising:
f) determining reporter expression in the presence of the perturbation.
- 34. The method of claim 32, wherein the 5′ untranslated region extends 25, 50, 75, 100, 250, 500, 1000, 2500, 5000, 7500, or 10,000 or more nucleotides upstream of the translation initiation site of the selected gene.
- 35. The method of claim 32, wherein the reporter gene construct comprises an expression vector.
- 36. The method of claim 35, wherein the expression vector comprises a viral vector.
- 37. The method of claim 35, wherein the viral vector is a retroviral vector.
- 38. The method of claim 35, wherein the viral vector contains a unidirectional transcriptional blocker.
- 39. The method of claim 35, wherein the viral vector contains a scaffold attachment region.
- 40. The method of claim 35, wherein the viral vector contains a selectable or detectable marker.
- 41. The method of claim 1, wherein step d) is performed by comparison of the selected gene to a sequence database containing at least one genomic sequence.
- 42. The method of claim 41, wherein the comparison identifies a 5′ untranslated region extending upstream of the translation initiation site of the selected gene.
- 43. The method of claim 42, wherein the 5′ untranslated region extends 25, 50, 75, 100, 250, 500, 1000, 2500, 5000, 7500, or 10,000 or more nucleotides upstream from the translation initiation site of the selected gene.
- 44. The method of claim 41, wherein the comparison is performed by a computer system or program, wherein the system or program includes computer readable instructions directing a processor to compare one or more gene sequences to a sequence database.
- 45. The method of claim 41, wherein the sequence database comprises a mammalian, human, yeast, drosophila, C. elegans or plant database.
- 46. The method of claim 41, wherein the sequence database comprises a genomic sequence database.
- 47. The method of claim 44, wherein the computer system or program further comprises computer readable instructions that direct a processor to select a primer set appropriate for amplification of the regulatory region.
- 48. The method of claim 1, further comprising ranking the genes identified in step c) according to their relative increase or decrease in expression.
- 49. The method of claim 48, wherein the ranking is carried out by a computer system or program comprising computer readable instructions directing a processor to rank gene expression according to increase or decrease in response to the perturbation.
- 50. The method of claim 1, wherein expression of a differentially expressed gene is increased to a greater extent than increased expression of one or more other genes among the plurality of genes.
- 51. The method of claim 1, wherein expression genes that are differentially expressed are among the top 20, 10, 5 or 2 genes whose expression is altered among a plurality of genes.
- 52. The method of claim 1, wherein expression of a gene that is differentially expressed is increased to a greater extent than increased expression of any other gene among a plurality of genes whose expression is increased.
- 53. The method of claim 1, wherein expression of a gene that is differentially expressed is decreased to a greater extent than increased expression of any other gene among a plurality of genes whose expression is decreased.
- 54. The method of claim 20, wherein in step c) genes whose expression is increased or decreased are among the top 20, 10, 5 or 2 genes whose expression is altered among a plurality of genes.
- 55. The method of claim 20, wherein in step c) a gene whose expression is increased is increased to a greater extent than increased expression of any other gene among a plurality of genes whose expression is increased.
- 56. The method of claim 20, wherein in step c) a gene whose expression is decreased is decreased to a greater extent than decreased expression of any other gene among a plurality of genes whose expression is decreased.
- 57. The method of claim 20, wherein step b) is performed by hybridization of transcripts of the genes to an array comprising a plurality of oligonucleotides at addressable loci on a substrate.
- 58. The method of claim 57, wherein the transcripts or nucleic acid molecules derived from the transcripts are detectably labeled.
- 59. The method of claim 58, wherein the label comprises a fluorophore, a radioisotope or a chemiluminescent moiety.
- 60. The method of claim 57, wherein one or more of the oligonucleotides represents a known gene, mutant or truncated form of a gene.
- 61. The method of claim 20, wherein step b) is performed by subtractive hybridization, differential display or representational difference analysis.
- 62. The method of claim 20, wherein the plurality of genes comprises all of a genome or a transcriptome.
- 63. The method of claim 20, wherein any of steps a) to e) are controlled by a program comprising computer readable instructions for directing a processor to carry out any of steps a) to d).
- 64. The method of claim 20, wherein any of steps a) to d) are performed by a system comprising:
a processor element; and a computer program comprising computer readable instructions that direct the processor to perform any of steps a) to d).
- 65. The method of claim 32, further comprising introducing the each expression construct into a cell to produce a collection of cells, wherein each cell is a responder cell that comprises the expression construct.
- 66. A collection of cells produced by the method of claim 65.
- 67. A collection of cells, wherein each cell comprises a nucleic acid encoding a robust responder regulatory region operatively linked to a nucleic acid encoding a reporter gene.
- 68. The collection of claim 71, wherein robust responder regulatory regions are obtained from genes whose expression is increased or decreased at least 3-fold in the presence of perturbation compared to the absence of the perturbation.
- 69. The collection of claim 72, wherein genes whose expression is decreased the decrease in expression is at least 6-fold.
- 70. The collection of claim 71, wherein the regulatory region comprises a promoter, a silencer or an enhancer.
- 71. The collection of responder cells of claim 71 that comprises an addressable array.
- 72. A collection of responder cells, comprising a plurality of sets of cells, wherein each set is in an addressable location and the cells of each set comprise a different promoter operably linked to a reporter nucleic acid.
- 73. The collection of claim 72, wherein the collection comprises at least 300 sets of cells.
- 74. The collection of claim 72, wherein the collection comprises at least 1000 sets of cells.
- 75. The collection of claim 72, wherein the collection comprises at least 10,000 sets of cells.
- 76. The collection of claim 72, wherein the different promoters are each robust responders to a particular peturbation of interest.
- 77. The collection of claim 5, wherein the peturbation is exposure to a substance or a change in extracellular or intracellular condition.
- 78. The collection of claim 72, wherein the perturbation comprises exposure to a drug, a hormone, an extract, a protein, a nucleic acid, a lipid, a carbohydrate or a fat.
- 79. The collection of claim 72, wherein the perturbation increased or decreased temperature, exposure to ultraviolet light, a change in pH, a change in a salt or ion concentration, exposure to or a decrease in oxygen.
- 80. A method of characterizing a perturbation, the method comprising:
exposition a collection of responder cells of claim 72 with the substance to obtain a response profile for the substance; and comparing the response profile for the substance with a response profile obtained by contacting the collection of responder cells with a characterized substance to thereby characterize the perturbation.
- 81. The method of claim 80, wherein the response profile for the perturbation is stored in a database.
- 82. The method of claim 80, wherein the perturbation comprises exposure to a drug, a hormone, an extract, a protein, a nucleic acid, a lipid, a carbohydrate or a fat.
- 83. The method of claim 80, wherein the perturbation increased or decreased temperature, exposure to ultraviolet light, a change in pH, a change in a salt or ion concentration, exposure to or a decrease in oxygen.
- 84. A database that comprises response profiles for a plurality of peturbations, wherein the response profiles are obtained by subjecting a collection of responder cells to each peturbation to obtain a response profile for the peturbations.
- 85. The database of claim 84, wherein the peturbations are exposure to a substance.
- 86. A system for identifying a regulatory region of a robust responder gene among a plurality of genes comprising:
a processor element; and a computer program comprising computer readable instructions that direct the processor to:
determine expression of a plurality of genes in a cell in the presence of a perturbation compared to in the absence of the perturbation; identify at least one gene whose expression is increased or decreased at least 3-fold or at least 6-fold; and select the regulatory region of the gene that confers increased or decreased expression in response to the perturbation.
- 87. The system of claim 79, wherein the decrease in expression is at least 6-fold.
- 88. A method, comprising:
exposing each member of an addressable collection of responder cells to a known perturbation; and determining the profile of changes in cellular reporter activity affected by perturbations.
- 89. The method of claim 88, further comprising:
storing the patterns in a computer readable medium to create a database, wherein each profile is identified by the perturbation giving rise to the profile.
- 90. The method of claim 88, further comprising:
treating the addressable collection with a test perturbation; comparing the resulting profile to the known profiles; and identifying profiles that are similar or that match to thereby determine targets of the test perturbation or the activity of the test perturbation.
- 91. A database produced by the method of claim 89.
- 92. The database of claim 91 that is a relational database.
- 93. A method for producing a collection of reporter cells comprising:
(a) identifying a plurality of protein coding sequences from a database of DNA sequences of an organism; (b) designing primers for amplifying untranslated sequences upstream of the protein coding sequences from genomic DNA of the organism, wherein the untranslated sequences each comprise a promoter; (c) amplifying the untranslated sequences using the primers, thereby obtaining a plurality of promoters; (d) producing a plurality of reporter constructs, each of the reporter constructs comprising a promoter operably linked to a DNA sequence encoding a detectable marker; (e) introducing the plurality of reporter constructs into cells to produce a plurality of reporter cells, each reporter cell comprising one of the reporter constructs to thereby produce a collection of cells.
- 94. The method of claim 93, wherein the collection is addressable.
- 95. The method of claim 94, wherein the addressable collection comprises an array.
- 96. The method of claim 88, wherein the array contains at least 300 reporter cells, each reporter cell comprising a different promoter.
- 97. An addressable array produced by the method of claim 88.
- 98. A method of determining the effect of a molecule on a cell comprising:
(a) providing a plurality of reporter cells, each reporter cell comprising a reporter construct that comprises a promoter that is expressible in the reporter cell; (b) contacting the plurality of reporter cells with the molecule; and (c) determining levels of promoter activity in each of the plurality of reporter cells.
- 99. The method of claim 98, wherein the reporter construct comprises a promoter operably linked to a gene encoding a marker, the method comprising determining levels of promoter activity in each of the plurality of reporter cells by determining levels of the marker in of the plurality of reporter cells.
- 100. The method of claim 98, wherein the plurality of reporter cells is a two dimensional array comprising at least 96 reporter cells, each of the reporter cells comprising a different promoter.
- 101. An isolated nucleic acid molecule, comprising a sequence of nucleotides set forth-in any of SEQ ID Nos. 1-12.
- 102. A collection of nucleic acid molecules, comprising the nucleic acid molecules of claim 101.
- 103. An isolated nucleic acid molecule of claim 101, further comprising a nucleic acid molecule encoding a reporter molecule.
- 104. A collection of nucleic acid molecules, comprising nucleic acid molecules of claim 103.
- 105. A vector, comprising a nucleic acid molecule of claim 10
- 106. A vector, comprising a nucleic acid molecule of claim 103.
- 107. A collection of vectors, comprising nucleic acid molecules of claim 104.
- 108. A cell, comprising a nucleic acid molecule of claim 101.
- 109. A collection of cells, each cell comprising a nucleic acid molecule of claim 101.
- 110. A collection of cells, each cell comprising a vector of claim 105.
- 111. The collection of cells of claim 110 that comprises an addressable array.
- 112. A collection of cells comprising regulatory regions from genes involved in osteogenic/osteoporotic regulation.
- 113. A method for generating a signature for a compound, comprising:
a) providing an addressable collection of responder cells; b) exposing the cells to a characterized perturbation; c) identifying cells in the collection that exhibit an altered phenotype responsive to the exposing; d) recording the identity of the identified cells.
- 114. The method of claim 113, wherein the perturbation is a known modulator of a cellular activity.
- 115. The method of claim 113, wherein the perturbation is a compound.
- 116. The method of claim 113, wherein:
the altered phenotype is exhibited as the generation of electromagnetic radiation by the cell; the identities of the identified cells are recorded as an image obtained by scanning the collection after step b), wherein the image represent a signature for the compound.
- 117. The method of claim 113, wherein:
the identities of the identified cells are recorded in a database.
- 118. A database produced by the method of claim 117.
- 119. The method of claim 116, further comprising storing the recorded images in a database.
- 120. A database produced by the method of claim 119.
- 121. A method, comprising:
selecting the cells in claim 113 that exhibit the altered phenotype and preparing a sub-collection.
- 122. The method of claim 118, further comprising treating the sub-collection with test perturbations to identify perturbations that alter the phenotype of one or more of the cells in the sub-collection.
- 123. The method of claim 119, wherein the perturbation is a compound.
- 124. A method for identifying the targets of a test perturbation, comprising:
exposing an addressable collection of responder cells to the perturbation; identifying the cells that exhibit an altered phenotype responsive to the the exposing; and comparing the response to a database of claim 118.
- 125. A method for identifying the targets of a test perturbation, comprising:
exposing an addressable collection of responder cells to the perturbation, wherein the responder cells that exhibit a response emit electromagnetic radiation; imaging the collection; and comparing the response to a database of claim 120.
RELATED APPLICATIONS
[0001] Benefit of priority under 35 U.S.C. §119(e) is claimed to the following applications: U.S. provisional application Ser. No. 60/275,148, filed Mar. 12, 2001, by Jeremy S. Caldwell, entitled, “Chemical and Combinatorial Biology Strategies for High-Throughput Gene Functionalization;” U.S. provisional application Ser. No. 60/274,979, filed Mar. 12, 2001, by Jeremy S. Caldwell, entitled, “Cellular Reporter Arrays;” and U.S. provisional application Ser. No. 60/275,070, filed Mar. 12, 2001, by Andrew Su, John B. Hogenesch, Sumit Chanda and Jeremy S. Caldwell, entitled, “Genomics-driven high speed cellular assay development.” This application is related to U.S. provisional application Ser. No. 60/275,266, filed Mar. 12, 2001, by Jeremy S. Caldwell, entitled, “Identification of cellular targets for biologically active molecules”. The subject matter of each application is herein incorporated by reference in its entirety.
Provisional Applications (3)
|
Number |
Date |
Country |
|
60275148 |
Mar 2001 |
US |
|
60274979 |
Mar 2001 |
US |
|
60275070 |
Mar 2001 |
US |