The present invention concerns gene expression, and is related to the detection of differential gene expression following exposure of cells to ionizing radiation.
Ionizing radiation has many medical, industrial and military uses. Although ionizing radiation can be used in the therapy of diseases such as cancer, exposure to biologically significant levels of such radiation can also cause genotoxic stress. Similarly, many industrial processes (such as the production of nuclear power) and military uses (such as nuclear weapons) can expose individuals to hazardous levels of ionizing radiation. Such radiation can elicit a variety of cellular responses, ranging from cell-cycle arrest to mutation, malignant transformation, or cell death. Many of the responses (such as genotoxicity) are often subtle, and exposed persons may be unaware or unsure if they have been exposed. Moreover, it may require years to evince an untoward effect (such as the development of a malignancy) caused by the exposure.
Many of the assumptions about low dose effects have been based on extrapolations from effects measured at high doses. Transcriptional responses to doses of ionizing radiation with relatively little effect on cell survival have not been as well investigated, although small variations in expression levels of several isolated genes have been detected. A dose of 50 cGy reportedly reduced expression of β- and γ-actin (Woloschakand Chang-Liu, Int. J. Radial. Biol. 59:1173–83, 1991) and induced RB-1 and H4 histone (Woloschak and Chang-Liu, Cancer Lett. 97:169–75, 1995) in Syrian Hamster Embryo cells, while a decrease in c-myc and increase in c-jun was detected in these cells following a dose as low as 6 cGy (Woloschak and Chang-Liu, Cancer Lett. 97:169–75, 1995). In a transformed human lymphoblast cell line, activation of NF-κB has been reported with as little as 10 cGy of radiation (Prasad et al., Radial. Res. 138:367–72, 1994), along with induction of c-FOS, c-JUN, c-MYC and c-Ha-RAS in the 25–200 cGy range (Prasad et al., Radial. Res. 143:263–72, 1995). The induction by 25 cGy of PBP74, a member of the heat shock 70 gene family, has also been reported in two human cancer cell lines (Sadekova et al., Int. J. Radial. Biol. 72:653–60, 1997).
It would be advantageous to have a method for detecting exposure of organisms to biologically significant or hazardous amounts of ionizing radiation. Although small variations in expression levels of several isolated genes in cell lines have been detected at lower doses, none of these studies have demonstrated a dose-response relationship for gene induction at low radiation doses, and overall qualitative and/or quantitative patterns of differential expression have not been investigated. The present invention uses nucleic acid microarray hybridization to evaluate biological effects, such as patterns of expression of genes after radiation exposure. Using these methods, numerous genes have been found which are responsive to radiation exposure in a variety of cell lines, and microarrays have been constructed which are capable of detecting biological responses (such as patterns of expression) to radiation exposure with great sensitivity and specificity.
The present invention includes a method of identifying cells that have been exposed to radiation induced biological stress. The method further includes providing a probe set that includes nucleic acid molecules representing genes that are differentially expressed in cells that have been exposed to a biologically significant amount of ionizing radiation. The probe set is exposed to a labeled nucleic acid composition from a test cell which specifically hybridizes to members of the probe set when the cell has been exposed to a biologically significant amount of ionizing radiation. Whether the nucleic acid composition hybridizes to the nucleic acid molecules representing genes that are differentially expressed is determined.
The probe set may be nucleic acid molecules (such as cDNAs or oligonucleotides) bound in an array to a surface, wherein the nucleotides specifically hybridize to sequences in the nucleic acid composition from the test cell. In one example, the nucleic acid composition includes cDNA reverse transcribed from mRNA in the test cell, and labeled with a fluorophore that detects hybridization of the cDNA to the probe set. In another example, the method also includes exposing the probe set to a labeled nucleic acid composition from a control cell which has not been exposed to a biologically significant amount of ionizing radiation. Genes which are expressed in the absence of radiation exposure will therefore produce mRNA from which labeled cDNA is made that specifically hybridizes to some members of the probe set. The nucleic acids from the test cell and control cell can be labeled with different signals (such as red and green colors) to indicate differential (either increased or decreased) expression of genes in the test cell as compared to the control cell.
The probe set may include probes that specifically hybridize to the labeled nucleic acid composition from specimens obtained more than four hours after exposure to the biologically significant amount of ionizing radiation, and/or less than 24 hours after exposure. In another embodiment of the invention, the probe set includes probes that specifically hybridize to the labeled nucleic acid composition from specimens obtained more than 24 or 48 hours after exposure to the biologically significant amount of ionizing radiation. Probes which detect such late effect exposures may be used to screen for radiation exposure when such screening is not done until one or two days following potential radiation exposure, when a subject is examined in a medical or laboratory facility.
In yet other embodiments of the invention, the probe set includes probes that specifically hybridize to the labeled nucleic acid composition from specimens which have been exposed to less than about 25 cGy of ionizing radiation. The probe set may also include genes that are differentially expressed by at least 1.5-fold or 2-fold following exposure to a biologically significant amount of ionizing radiation. The probe set may include at least 10%, 30% 40%, 50%, 75%, 80%, 90%, 95%, or 99% of the probes identified in Tables 9, 10, 11, 12, 13 or 14 or the entire probe set shown in any of those Tables. The probe set may also include at least 10 or 20 of the probes identified in Tables 9, 10, 11, 12, 13 or 14. Examples of probes that represent such late effects include those listed in Tables 9–12. In another embodiment, the probe set includes nucleic acid sequences that are selected for having differential expression following exposure to a biologically significant amount of ionizing radiation. The probe set may be at least 50%, 75%, 80%, 90%, 95%, 99%, or consist essentially of nucleic acid sequences that are differentially expressed following exposure to a biologically significant amount of ionizing radiation. In yet another embodiment, the probe set includes nucleic acid sequences that are selected for having a differential expression of at least 1.5- or 2-fold following exposure to a biologically significant amount of ionizing radiation.
In yet other embodiments of the method, a plurality of nucleic acid probe elements are bound to a surface, for example in an array, wherein the nucleic acid represents a gene product (including a protein or a nucleic acid such as RNA) that is differentially expressed by a cell following radiation induced biological stress. The plurality of probe elements are contacted with a plurality of gene products from a test cell, under conditions that allow the gene products (such as the nucleic acid sequences) to specifically hybridize to one of more of the probe elements, and provide a signal which indicates differential expression of one or more genes in a test cell has been exposed to biologically significant levels of ionizing radiation, and detecting the presence or absence of the signal. The probe elements may be selected from a set of nucleic acids that specifically hybridize to nucleic acids obtained from cells exposed to ionizing radiation. For example, the target elements are nucleic acid sequences that are differentially expressed by a cell more than 4, 24 or even 48 hours after exposure to the ionizing radiation. The probe elements may also include, or be limited to, nucleic acid sequences that are differentially expressed by at least 1.5-fold or 2-fold following exposure to a biologically significant amount of ionizing radiation.
The target elements may be one or more of the clones listed in Tables 9, 10, 11, 12, 13, or 14, for example Image ID clones 39993, 47475, 109123, 120362, 136114, 195365, 202549, 209340, 221846, 232837, 241412, 244227, 251516, 260619, 280386, 297442, 308588, 549146, 753418, 841278, 51699, 417226 and 28116. The probe nucleic acids may be DNA, such as cDNA, and cDNA obtained from mRNA expressed by the test cell. When the probe is reverse transcribed from cellular RNA, it may average about 1000–2000 nucleotides in length, but may in some instances be as long as 10,000 nucleotides. The probe nucleic acids of the probe set may be about as short as 8 or 10 nucleotides in length, but may also be as long as about 1000 to 1,000,000 nucleotides in length.
The method can also include contacting the probe elements with a plurality of control nucleic acids obtained from mRNA (for example by reverse transcription) of a control cell that has not been exposed to biologically significant levels of ionizing radiation and determining whether the nucleic acids from mRNA of the control cells hybridize differentially to the probe elements than the nucleic acid composition from the test cell. The test nucleic acid sequences are labeled with a first label that detects hybridization of the test nucleic acid sequences to the probe sequences, and the control nucleic acid sequences are labeled with a second label that detects hybridization of the control nucleic acid sequences to the probe sequences. The first and second labels interact to indicate whether expression of a nucleic acid sequence in the test cell has increased or decreased, relative to a baseline level. The first and second labels may be fluorophores of different colors. The nucleic acids from the control cells may, for example, be labeled with a green fluorophore, and the nucleic acids from the test cells may be labeled with a red fluorophore. Hence target elements for which differential gene expression does not occur will appear yellow, while underexpressed (decreased) gene expression will be indicated by green and overexpression (increased expression) by red.
The test cells may be animal cells, such as human cells, for example human peripheral blood cells, for example peripheral blood mononuclear cells, such as lymphocytes. In addition, the cells may be microbial or plant cells, such as microbes or cells from plants in the vicinity of a suspected environmental exposure to ionizing radiation.
In view of the set of stress response genes which have been identified, and may be identified using the present methods, the invention also includes methods of making microarrays for identifying cells that have actually or potentially been exposed to a biologically significant amount of ionizing radiation, by identifying genes that are differentially expressed by a cell following exposure to biologically significant amounts of ionizing radiation. A probe set is then provided, each element of the set including a nucleic acid sequence from a gene that is identified as differentially expressed by a cell following radiation induced biological stress. The target nucleic acid sequence is capable of hybridizing to a nucleic acid sequence which is differentially expressed by the cell following exposure to the biologically significant amount of ionizing radiation. In other embodiments, the genes that are differentially expressed by a cell are identified by exposing the cell to a biologically significant amount of ionizing radiation, obtaining mRNA expressed by the cell, reverse transcribing the mRNA into cDNA, labeling the cDNA, and hybridizing the labeled cDNA to a probe set that represents potential genes that may be differentially expressed and identifying members of the probe set that hybridize with the labeled cDNA. Any high throughput genomic analysis may be used to analyze the differential expression of the stress response genes, as may more standard molecular biology techniques such as dot-blot hybridization. The genes may include p53 regulated genes.
In another embodiment, the method further includes determining a dose response relationship between radiation exposure and differential expression of one or more genes, for example to determine a probable radiation dose in cells that have actually or potentially been exposed to the ionizing radiation. In yet another embodiment, identifying genes that are differentially expressed, for making a microarray, includes identifying genes that are differentially expressed in a cell type that is to be obtained from a subject for testing. The microarray may be used to measure a biological response to potential radiation exposure in the subject, for example in a cell type. The invention also includes microarrays which are made by this method.
The cell type may be peripheral blood cells, for example peripheral blood mononuclear cells, such as lymphocytes. In addition, the cell type may be any microbial or plant cell.
The invention also includes a method of diagnosing biologically significant radiation exposure in a subject, by obtaining a biological specimen from the subject, synthesizing cDNA from mRNA expressed in one or more cells of the biological specimen, and labeling the mRNA with a detectable label. The labeled mRNA is exposed to a probe set which represents genes that are differentially expressed in the biological specimen following exposure to the radiation. A determination is then made whether the labeled mRNA selectively hybridizes to one or more probes of the probe set that are associated with the radiation exposure, or hybridizes in a pattern that is associated with radiation exposure. Particular patterns of hybridization can also be associated with specified exposure doses, or time periods following exposure. The probe set may be one or more of the probes listed in any of Tables 9, 10, 11, 12, 13 or 14, or a probe set that includes at least 10%, 30% 40%, 50%, 75%, 80%, 90%, 95%, or 99% of the probes listed in any of Tables 9, 10, 11, 12, 13 or 14. In another embodiment, the method detects patterns of differential expression associated with biologically significant radiation exposure.
The invention also includes use of the microarrays of the present invention for measuring a biological response to in a subject, by obtaining a biological sample from the subject, synthesizing cDNA from mRNA expressed in one or more cells of the biological sample, labeling the mRNA with a detectable label, and exposing the labeled mRNA to a probe set which represents genes that are differentially expressed in the biological sample following radiation exposure, and determining if the labeled mRNA selectively hybridizes to one or more probes of the probe set that are associated with radiation exposure. The subject may be undergoing radiotherapy (or a candidate for radiotherapy) for the treatment of cancer, and the microarray can used to monitor or predict the subject's biological response to the radiotherapy.
Also included in the invention are the probe sets that provide information about exposure to biologically significant doses of ionizing radiation, for example probe sets including the DNA probe sets shown in any of Tables 9, 10, 11, 12, 13 or 14, or subsets of the probe sets of Tables 13 or Table 14. Such subsets may include sets having at least 10%, 30% 40%, 50%, 75%, 80%, 90%, 95%, or 99% of the probes sets shown in any of Tables 9, 10, 11, 12, 13 or 14.
SEQ ID NO: 1 is the nucleic acid sequence of Image ID Number 39993.
SEQ ID NO: 2 is the nucleic acid sequence of Image ID Number 47475.
SEQ ID NO: 3 is the nucleic acid sequence of Image ID Number 260619.
SEQ ID NO: 4 is the nucleic acid sequence of Image ID Number 753418.
SEQ ID NO: 5 is the nucleic acid sequence of Image ID Number 51699.
SEQ ID NO: 6 is the nucleic acid sequence of Image ID Number 701112.
SEQ ID NO: 7 is the nucleic acid sequence of Image ID Number 753447.
SEQ ID NO: 8 is the nucleic acid sequence of Image ID Number 547058.
SEQ ID NO: 9 is the nucleic acid sequence of Image ID Number 203132.
SEQ ID NO: 10 is the nucleic acid sequence of Image ID Number 1493160.
SEQ ID NO: 11 is the nucleic acid sequence of Image ID Number 109123.
SEQ ID NO: 12 is the nucleic acid sequence of Image ID Number 120362.
SEQ ID NO: 13 is the nucleic acid sequence of Image ID Number 136114.
SEQ ID NO: 14 is the nucleic acid sequence of Image ID Number 195365.
SEQ ID NO: 15 is the nucleic acid sequence of Image ID Number 202549.
SEQ ID NO: 16 is the nucleic acid sequence of Image ID Number 209340.
SEQ ID NO: 17 is the nucleic acid sequence of Image ID Number 221846.
SEQ ID NO: 18 is the nucleic acid sequence of Image ID Number 232837.
SEQ ID NO: 19 is the nucleic acid sequence of Image ID Number 241412.
SEQ ID NO: 20 is the nucleic acid sequence of Image ID Number 244227.
SEQ ID NO: 21 is the nucleic acid sequence of Image ID Number 251516.
SEQ ID NO: 22 is the nucleic acid sequence of Image ID Number 280386.
SEQ ID NO: 23 is the nucleic acid sequence of Image ID Number 297442.
SEQ ID NO: 24 is the nucleic acid sequence of Image ID Number 308588.
SEQ ID NO: 25 is the nucleic acid sequence of Image ID Number 549146.
SEQ ID NO: 26 is the nucleic acid sequence of Image ID Number 841278.
SEQ ID NO: 27 is the nucleic acid sequence of Image ID Number 28116.
SEQ ID NO: 28 is the nucleic acid sequence of Image ID Number 50615.
The following definitions and methods are provided to better define the present invention and to guide those of ordinary skill in the art in the practice of the present invention. It must be noted that as used herein and in the appended claims, the singular forms “a” or “an” or “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a protein” includes a plurality of such proteins and reference to “the array” includes reference to one or more arrays and equivalents thereof known to those skilled in the art, and so forth.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. The materials, methods, and examples are illustrative only and are not intended to be limiting.
Animal Cells: Cells obtained from multicellular vertebrate organisms, a category which includes, for example: mammals, primates, rodents, veterinary subjects, and birds. The cells can be obtained from any source, for example peripheral blood, urine, saliva, tissue biopsy, surgical specimen, amniocentesis samples and autopsy material. From these cells, genomic DNA, cDNA, mRNA, RNA, or protein can be isolated.
Biologically significant radiation exposure: An amount of radiation exposure sufficient to cause differential expression of stress genes in cells after they are exposed to the radiation.
Biological Specimen/Sample: One or more cells obtained from an animal or plant.
Cancer: malignant neoplasm that has undergone characteristic anaplasia with loss of differentiation, increased rate of growth, invasion of surrounding tissue, and is capable of metastasis.
cDNA (complementary DNA): A piece of DNA lacking internal, non-coding segments (introns) and regulatory sequences which determine transcription. cDNA may be synthesized in the laboratory by reverse transcription from messenger RNA extracted from cells.
Control Cells: Cells which have not been exposed to biologically significant levels of ionizing radiation.
DNA: Deoxyribonucleic acid. DNA is a long chain polymer which comprises the genetic material of most living organisms (some viruses have genes comprising ribonucleic acid, RNA). The repeating units in DNA polymers are four different nucleotides, each of which comprises one of the four bases, adenine, guanine, cytosine and thymine bound to a deoxyribose sugar to which a phosphate group is attached. Triplets of nucleotides, referred to as codons, in DNA molecules code for amino acid in a polypeptide. The term codon is also used for the corresponding (and complementary) sequences of three nucleotides in the mRNA into which the DNA sequence is transcribed.
DNA chip: A DNA array in which multiple DNA molecules (such as cDNAs) of known DNA sequences are arrayed on a substrate, usually in a microarray, so that the DNA molecules can hybridize with nucleic acids (such as cDNA or RNA) from a specimen of interest. DNA chips are further described in Ramsay (Nature Biotech. 16:40–44, 1998).
Differential expression of a gene: This refers to either an increased or decreased expression of a gene, or any other change from the normal expression of a gene.
EST (Expressed Sequence Tag): This refers to a partial DNA or cDNA sequence, typically of between 1000 and 2000 sequential nucleotides, obtained from a genomic or cDNA library, prepared from a selected cell, cell type, tissue or tissue type, organ or organism, which corresponds to an mRNA of a gene found in that library. An EST is generally a DNA molecule sequenced from and shorter than the cDNA from which it is obtained.
Fluorophore: A chemical compound, which when excited by exposure to a particular wavelength of light, emits light (i.e., fluoresces), for example at a different wavelength. Fluorophores can be described in terms of their emission profile, or “color.” Green fluorophores, for example Cy-3, FITC, and Oregon Green, are characterized by their emission at wavelengths generally in the range of 515–540 λ. Red fluorophores, for example Texas Red, Cy-5 and tetramethylrhodamine, are characterized by their emission at wavelengths generally in the range of 590–690 λ.
Examples of fluorophores that may be used in the present invention are provided in U.S. Pat. No. 5,866,366 to Nazarenko et al.: 4-acetamido-4′-isothiocyanatostilbene-2,2′disulfonic acid, acridine and derivatives such as acridine and acridine isothiocyanate, 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS), 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate (Lucifer Yellow VS), N-(4-anilino-1-naphthyl)maleimide, anthranilamide, Brilliant Yellow, coumarin and derivatives such as coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4-trifluoromethylcouluarin (Coumaran 151); cyanosine; 4′,6-diaminidino-2-phenylindole (DAPI); 5′,5″-dibromopyrogallol-sulfonephthalein (Bromopyrogallol Red); 7-diethylamino-3-(4′-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4′-diisothiocyanatodihydro-stilbene-2,2′-disulfonic acid; 4,4′-diisothiocyanatostilbene-2,2′-disulfonic acid; 5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansyl chloride); 4-(4′-dimethylaminophenylazo)benzoic acid (DABCYL); 4-dimethylaminophenylazophenyl-4′-isothiocyanate (DABITC); eosin and derivatives such as eosin and eosin isothiocyanate; erythrosin and derivatives such as erythrosin B and erythrosin isothiocyanate; ethidium; fluorescein and derivatives such as 5-carboxyfluorescein (FAM), 5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF), 2′7′-dimethoxy-4′5′-dichloro-6-carboxyfluorescein (JOE), fluorescein, fluorescein isothiocyanate (FITC), and QFITC (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferone; ortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives such as pyrene, pyrene butyrate and succinimidyl 1-pyrene butyrate; Reactive Red 4 (Cibacron .RTM. Brilliant Red 3B-A); rhodamine and derivatives such as 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride, rhodamine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101 and sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid and terbium chelate derivatives.
Other suitable fluorophores include GFP, Lissamine™, diethylaminocoumarin, fluorescein chlorotriazinyl, naphthofluorescein, 4,7-dichlororhodamine and xanthene and derivatives thereof. Other fluorophores known to those skilled in the art may also be used.
Gene expression microarrays: Microscopic arrays of cDNAs printed on a substrate, which serve as a high density hybridization target for mRNA probes, for example as described in Schena (BioEssays 18:427–431, 1996).
Genomic target sequence: A sequence of nucleotides located in a particular region in the human genome that corresponds to one or more specific loci.
High throughput genomics: This refers to application of genomic or genetic data or analysis techniques that use microarrays or other genomic technologies to rapidly identify large numbers of genes or proteins, or distinguish their structure, expression or function from normal or abnormal cells or tissues.
Human Cells: Cells obtained from Homo sapiens. The cells can be obtained from any source, for example peripheral blood, urine, saliva, tissue biopsy, surgical specimen, amniocentesis samples and autopsy material. From these cells, genomic DNA, cDNA, mRNA, RNA, or protein can be isolated.
Ionizing radiation (IR): An amount of radiation sufficient to separate orbiting electrons from an atomic nucleus. Ionizing radiation includes photons and accelerated particles. Photons (also called gamma rays) are given off in many types of nuclear decay. Ionizing rays (x-rays) occur when an electron is stopped in a dense material. Accelerated particles include protons from solar radiation, heavy nuclei in cosmic rays, and beta and alpha particles given up in nuclear decay.
Isolated: An “isolated” biological component (such as a nucleic acid, peptide, protein, or organelle) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA, organelles, and proteins. Nucleic acids, peptides and proteins which have been “isolated” thus include nucleic acids and proteins purified by standard purification methods. The term also embraces nucleic acids, peptides and proteins prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids.
Label: Detectable marker or reporter molecules, which can be attached to nucleic acids, for example probes. Typical labels include fluorophores, radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are discussed, e.g., in Sambrook et al., in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press (1989) and Ausubel et al., in Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley-Intersciences (1987).
Malignant: cells which have the properties of anaplasia, invasion and metastasis.
Messenger RNA (mRNA): RNA that is translated into protein. cDNA can be reverse transcribed from mRNA using standard molecular biology methods.
Microarray: An array that is miniaturized so as to require microscopic examination for visual evaluation.
Microbes: Any microorganism. Includes for example, viruses and members of the Monera, Protista or Fungi Kingdoms, for example bacteria and parasites.
Neoplasm: abnormal growth of cells
Normal cells: Non-tumor, non-malignant cells.
Nucleic acid array: An arrangement of nucleic acids (such as DNA or RNA) in assigned locations on a matrix, such as that found in cDNA or CGH arrays.
Nucleic acid: A deoxyribonucleotide or ribonucleotide polymer in either single or double stranded form, including known analogs of natural nucleotides.
Nucleic acid molecules representing genes: Any nucleic acid, for example DNA, cDNA or RNA, of any length suitable for use as a probe that is informative about the genes.
Oligonucleotide: A linear single-stranded polynucleotide sequence ranging in length from 2 to about 1,000,000 bases, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for example at least 15, 50, 100, 200, 1,000, 10,000 or even 1,000,000 nucleotides long. Oligonucleotides are often synthetic but can also be produced from naturally occurring polynucleotides.
Plant Cells: Cells obtained from any member of the Plantae Kingdom, a category which includes, for example, trees, flowering and non flowering plants, grasses, and Arabidopsis. The cells can be obtained from any part of the plant, for example roots, leaves, stems, or any flower part. From these cells, nucleic acid or protein can be isolated.
Probes and primers: A probe is an oligonucleotide or isolated nucleic acid capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing with hydrogen bond formation. Oligonucleotide probes are often 10–50 or 15–30 bases long, and can be as long as about 1,000,000 bases. An oligonucleotide probe may include natural (A, T, C, G) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in an oligonucleotide probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, oligonucleotide probes may be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
Detectable label or reporter molecules can be attached to probes. Typical labels include fluorophores, radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are discussed, e.g., in Sambrook et al., in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press (1989) and Ausubel et al., in Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley-Intersciences (1987).
Primers are short nucleic acids, such as DNA oligonucleotides 15 nucleotides or more in length. Primers may be annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, and then extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art.
Methods for preparing and using probes and primers are described, for example, in Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989), Ausubel et al., in Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley-Intersciences (1987), and Innis et al., PCR Protocols, A Guide to Methods and Applications, 1990, Innis et al. (eds.), 21–27, Academic Press, Inc.; San Diego, Calif. PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5,© 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.). One of skill in the art will appreciate that the specificity of a particular probe or primer increases with its length. Thus, for example, a primer comprising 20 consecutive nucleotides of a cDNA or gene will anneal to a target sequence contained within a cDNA or genomic DNA library with a higher specificity than a corresponding primer of only 15 nucleotides. Thus, in order to obtain greater specificity, probes and primers may be selected that comprise 20, 25, 30, 35, 40, 50 or more consecutive nucleotides of a cDNA or gene sequence.
Thus isolated nucleic acid molecules that comprise specified lengths of a nucleic acid sequence can be used in the present invention. Such molecules may comprise at least 20, 40, 50, 100, 1000, 10,000, or even 1,000,000 or more consecutive nucleotides of a nucleic acid sequence and may be obtained from any region of a nucleic acid sequence.
Probe Element: A nucleic acid sequence from a gene which is represented in a probe set. For example, the gene may be differentially expressed by a cell following radiation induced biological stress. In addition, the gene may be unaffected in its expression following radiation induced biological stress, for example control genes.
Probe set: A population of two or more probes which represent genes that are differentially expressed in cells that have been exposed to a biologically significant amount of ionizing radiation. The probe set may be nucleic acids, for example RNA, cDNAs, and/or oligonucleotides that specifically hybridize to complementary sequences in a nucleic acid composition, for example from test cells which may have been exposed to ionizing radiation. For example, the probe set may contain any of the probes shown in Tables 4 or 5. In addition, the probe set can be bound in an array to a surface.
Purified: the term purified does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified protein preparation is one in which the protein is more pure than the protein in its natural environment within a cell. In one embodiment, a preparation of a protein is purified such that the protein represents at least 50% of the total protein content of the preparation.
Radiation dose: Defined in terms of energy deposition. The basic unit is the gray (Gy), equal to 1 joule per kilogram.
Radiation induced biological stress: The induction of differential expression of stress genes in cells after they are exposed to radiation. Examples include, but are not limited to: CIP1/WAF1, GADD45, MDM2, BCL2, FOS, JUN, REL-B, ATF3, BAX.
Sequence identity: The similarity between two nucleic acid sequences, or two amino acid sequences, is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are. Homologs or orthologs of nucleic acid or amino acid sequences will possess a relatively high degree of sequence identity when aligned using standard methods. This homology will be more significant when the orthologous proteins or nucleic acids are derived from species which are more closely related (e.g., human and chimpanzee sequences), compared to species more distantly related (e.g., human and C. elegans sequences). Typically, orthologs are at least 50% identical at the nucleotide level and at least 50% identical at the amino acid level when comparing human orthologous sequences.
Methods of alignment of sequences for comparison are well known in the art. Various programs and alignment algorithms are described in: Smith & Waterman, Adv. Appl. Math. 2:482, 1981; Needleman & Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237–44, 1988; Higgins & Sharp, CABIOS 5:151–3, 1989; Corpet et al., Nuc. Acids Res. 16:10881–90, 1988; Huang et al. Computer Appls. in the Biosciences 8, 155–65, 1992; and Pearson et al., Meth. Mol. Bio. 24:307–31, 1994. Altschul et al, J. Mol. Biol. 215:403–10, 1990, presents a detailed consideration of sequence alignment methods and homology calculations.
The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al. J. Mol. Biol. 215:403–10, 1990) is available from several sources, including the National Center for Biotechnology Information (NCBI, Bethesda, Md.) and on the Internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. A description of how to determine sequence identity using this program is available at the NCBI website.
Homologs are typically characterized by possession of at least 60%, 70%, 75%, 80%, 90%, 95% or at least 98% sequence identity counted over the full length alignment with a sequence using the NCBI Blast 2.0, gapped blastp set to default parameters. Queries searched with the blastn program are filtered with DUST (Hancock, and Armstrong, 1994, Comput. Appl. Biosci. 10:67–70). Other programs use SEG.
One of ordinary skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant homologs could be obtained that fall outside of the ranges provided. The present invention provides not only the peptide homologs described above, but also nucleic acid molecules that encode such homologs.
One indication that two nucleic acid sequences are substantially identical is that the polypeptide which the first nucleic acid encodes is immunologically cross reactive with the polypeptide encoded by the second nucleic acid.
Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that all encode substantially the same protein.
An alternative indication that two nucleic acid molecules are closely related is that the two molecules hybridize to each other under stringent conditions, as described under “specific hybridization.”
Specific hybridization: Specific hybridization refers to the binding, duplexing, or hybridizing of a molecule only or substantially only to a particular nucleotide sequence when that sequence is present in a complex mixture (e.g. total cellular DNA or RNA). Specific hybridization may also occur under conditions of varying stringency.
Hybridization conditions resulting in particular degrees of stringency will vary depending upon the nature of the hybridization method of choice and the composition and length of the hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength (especially the Na+ concentration) of the hybridization buffer will determine the stringency of hybridization. Calculations regarding hybridization conditions required for attaining particular degrees of stringency are discussed by Sambrook et al. (In: Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., 1989 ch. 9 and 11). By way of illustration only, a hybridization experiment may be performed by hybridization of a DNA molecule to a target DNA molecule which has been electrophoresed in an agarose gel and transferred to a nitrocellulose membrane by Southern blotting (Southern, J. Mol. Biol. 98:503, 1975), a technique well known in the art and described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., 1989).
Hybridization with a target probe labeled with [32P]-dCTP is generally carried out in a solution of high ionic strength such as 6×SSC at a temperature that is 20–25° C. below the melting temperature, Tm, described below. For such Southern hybridization experiments where the target DNA molecule on the Southern blot contains 10 ng of DNA or more, hybridization is typically carried out for 6–8 hours using 1–2 ng/ml radiolabeled probe (of specific activity equal to 109 CPM/μg or greater). Following hybridization, the nitrocellulose filter is washed to remove background hybridization. The washing conditions should be as stringent as possible to remove background hybridization but to retain a specific hybridization signal.
The term Tm represents the temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equilibrium. Because the target sequences are generally present in excess, at Tm 50% of the probes are occupied at equilibrium. The Tm of such a hybrid molecule may be estimated from the following equation (Bolton and McCarthy, Proc. Natl. Acad. Sci. USA 48:1390, 1962): Tm=81.5° C.−16.6(log10[Na+])+0.41(% G+C)−0.63(% formamide)−(600/l); where l=the length of the hybrid in base pairs.
This equation is valid for concentrations of Na+ in the range of 0.01 M to 0.4 M, and it is less accurate for calculations of Tm in solutions of higher [Na+]. The equation is also primarily valid for DNAs whose G+C content is in the range of 30% to 75%, and it applies to hybrids greater than 100 nucleotides in length (the behavior of oligonucleotide probes is described in detail in Ch. 11 of Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., 1989).
Thus, by way of example, for a 150 base pair DNA probe derived from a cDNA (with a hypothetical % GC=45%), a calculation of hybridization conditions required to give particular stringencies may be made as follows: For this example, it is assumed that the filter will be washed in 0.3×SSC solution following hybridization, thereby: [Na+]=0.045 M; % GC=45%; Formamide concentration=0; 1=150 base pairs; Tm=81.5−16.6(log10[Na+])+(0.41×45)−(600/150); and so Tm=74.4° C.
The Tm of double-stranded DNA decreases by 1–1.5° C. with every 1% decrease in homology (Bonner et al., J. Mol. Biol. 81:123, 1973). Therefore, for this given example, washing the filter in 0.3×SSC at 59.4–64.4° C. will produce a stringency of hybridization equivalent to 90%; that is, DNA molecules with more than 10% sequence variation relative to the target cDNA will not hybridize. Alternatively, washing the hybridized filter in 0.3×SSC at a temperature of 65.4–68.4° C. will yield a hybridization stringency of 94%; that is, DNA molecules with more than 6% sequence variation relative to the target cDNA molecule will not hybridize. The above example is given entirely by way of theoretical illustration. One skilled in the art will appreciate that other hybridization techniques may be utilized and that variations in experimental conditions will necessitate alternative calculations for stringency.
In the present invention, stringent conditions may be defined as those under which DNA molecules with more than 25%, 15%, 10%, 6% or 2% sequence variation (also termed “mismatch”) will not hybridize. Stringent conditions are sequence dependent and are different in different circumstances. Longer sequences hybridize specifically at higher temperatures. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point Tm for the specific sequence at a defined ionic strength and pH. An example of stringent conditions is a salt concentration of at least about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and a temperature of at least about 30° C. for short probes (e.g. 10 to 50 nucleotides). Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. For example, conditions of 5×SSPE (750 mM NaCl, 50 mM Na Phosphate, 5 mM EDTA, pH 7.4) and a temperature of 25–30° C. are suitable for allele-specific probe hybridizations.
A perfectly matched probe has a sequence perfectly complementary to a particular target sequence. The test probe is typically perfectly complementary to a portion (subsequence) of the target sequence. The term “mismatch probe” refers to probes whose sequence is deliberately selected not to be perfectly complementary to a particular target sequence.
Transcription levels can be quantitated absolutely or relatively. Absolute quantitation can be accomplished by inclusion of known concentrations of one or more target nucleic acids (for example control nucleic acids such as Bio B or with a known amount the target nucleic acids themselves) and referencing the hybridization intensity of unknowns with the known target nucleic acids (for example by generation of a standard curve).
Subject: Living multicellular vertebrate organisms, a category which includes, both human and veterinary subjects for example, mammals, birds and primates.
Test Cell: A cell which has been or may have been exposed to biologically significant levels of ionizing radiation. Using the method of the present invention, whether a test cell has been exposed to biologically significant levels of ionizing radiation can be determined. Test cells can be from any origin, including for example plant and animal cells. In particular examples, the test cells are peripheral blood cells.
Tumor: a neoplasm
Additional definitions of terms commonly used in molecular genetics can be found in Benjamin Lewin, Genes V published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8).
This example describes experiments in which the percent of cells undergoing apoptosis following different levels of irradiation was calculated.
ML-1 cells, a human myeloid leukemia cell line, were grown in RPMI 1640 medium supplemented with 10% heat-inactivated (56° C. for 45 minutes) fetal calf serum and 100 U/ml penicillin and 100 μg/ml streptomycin in a humidified, 5% CO2 atmosphere in a 37° incubator. Cells were irradiated at approximately 5.1 cGy/minute to total doses between 2 and 2000 cGy using a Mark 1-68 137Cs source (J.L. Shepherd and Associates, Inc., San Fernando, Calif.) with lead attenuators in place. The dosimetry of the source was confirmed by exposing TLD monitors (Landauer Inc., Glenwood, Ill.) in the same configuration used for cellular irradiations to the range of doses used. Even at the lowest doses, the calculated absorbed dose (Landauer special dosimetry services) varied by less than 3% from the dose expected. Due to the nature of sparsely ionizing radiation such as γ-rays, it is highly unlikely that cells in the irradiated population will remain unexposed at even the lowest doses used.
To determine if cells underwent apoptosis following irradiation, the cells were irradiated and then incubated for 1, 2, or 3 days. The cells were subsequently fixed in methanol, and stained with DAPI solution (50 ng/ml final concentration). An Olympus fluorescent microscope was used to score nuclei exhibiting characteristic morphological features of apoptosis, and results were expressed as the number of apoptotic nuclei over the total number of nuclei counted. Flow cytometry was used to measure the effects of irradiation on the cell cycle. In this method, cells were fixed in 70% ethanol 0, 8, 10, 12 and 24 hours after irradiation, treated with RNase (100 ug/ml) at 37° C., then stained with propidium iodide. Samples were analyzed using a Becton-Dickenson FacScan and cell cycle distributions were fitted using the Cell Quest data analysis program.
ML-1 cells irradiated with 137Cs γ-rays showed a survival response similar to that of other human cell lines of myeloid or lymphoid lineage (
Although previous studies demonstrated that high doses of ionizing radiation efficiently induce rapid apoptosis in ML-1 cells (Zhan et al. Oncogene 9:3743–51, 1994), at lower, relatively non-toxic doses (cells irradiated with 25 cGy or less), very little apoptosis was measurable. A more sensitive method for detecting apoptotic cells was achieved by scoring morphology of DAPI stained cells. As shown in
In contrast to their effect on apoptosis, doses below 25 cGy significantly and reproducibly perturbed cell cycle progression in a dose-dependent manner (
These results demonstrate that low-levels of non-toxic irradiation can be effectively administered, and that even at these low doses, progression of the cell cycle is perturbed.
This example describes experiments conducted to identify genes which are differentially expressed in cells exposed to radiation, such as low level radiation, which were used to construct cDNA microarrays. Before a probe set can be constructed which identifies differential expression of genes in radiation exposed cells, probes are selected for the set. This selection can be achieved by using cDNA arrays having a general sampling of ESTs, for example from human, mouse, or plant genes. The array of ESTs can be exposed to nucleic acid compositions from test cells that have been exposed to a dose of ionizing radiation sufficient to induce differential expression of stress genes in the test cells. The nucleic acids can then be labeled (for example by reverse transcription from cellular mRNA to labeled cDNA), and exposed to the array. Specific hybridization to an EST in the array, or differential hybridization by a cDNA from a cell that has been irradiated, identifies a potential probe for the probe set. Subsequent confirmation of differential expression can be achieved by northen blot analysis, or other techniques.
The microarray used in this example included a general sampling of human genes (622 ESTs) plus another set of genes (616 ESTs) which were chosen on the basis of their roles in cancer or lymphoid biology. The genes represented in the 1.2K array are shown in Table 1. The sequences are identified in Table 1 by Image Consortium Clone number (hereinafter “Image No.”). This number can be searched on the ATCC Image Consortium Clones database at the ATCC (Manassas, Va.) website, with links to the ATCC accession number of clones from which the sequences can be obtained, and links to Genbank sequence listings which correspond to the Image Number. The array was constructed with ESTs informative for the exposure of interest, namely radiation exposure. A panel of housekeeping genes and other internal controls was also included on the array. The selection of this panel has been described (DeRisi et al., Nat. Genet. 14:457–460, 1996).
The human myeloid leukemia cell line ML-1 is available from the National Cancer Institute's Anti-Neoplastic Drug Screen Panel (NCI-ADS). The members of this panel are listed in Table 2, and are described in many publications, including Monks et al. (J. Nat. Cancer Institut. 83:757–66, 1991), and on the NCI Web page. ML-1 was selected as the test cell in which radiation induced differential expression of genes would be measured. A characteristic of this cell that made it particularly useful for this purpose was that it contains endogenous wild-type p53, which is a cellular stress response mediator following exposure to ionizing radiation. ML-1 contains physiologic levels of p53, rather than the unnatural and often highly overexpressed levels often seen in artificially engineered systems. Moreover, ML-1 is a myeloid cell line, which is prone to undergo rapid apoptosis following genotoxic stress. This characteristic results in the induction of genes specifically associated with rapid apoptosis, such as BAX, MCLI, GADD34 and BCL-X (Zhan et al., Oncogene 9:3743–51, 1994 and Oncogene 14:1031–9, 1997). In addition to ML-1 cells, other cells that can be used to assess the induction of differential expression of genes in radiation exposed cells include other lymphoid cell lines, such as those with wild-type levels of p53, such as Molt4, WMN, TK6 or SR, which may induce genes not found to be regulated in ML-1. Cell lines derived from blood cells (such as these lines) are selected to provide responses more characteristic of peripheral blood responses than would cell lines from other sites, such as colon or breast cell lines (for example RKO or MCF7, respectively). Gene induction can also be measured directly in ex vivo irradiated human peripheral blood cells, as a particularly relevant model system, for methods in which peripheral blood specimens are analyzed to clinically screen for biologically significant radiation exposure (see EXAMPLE 9).
ML-1 cells were grown and irradiated at approximately 3.1 Gy/minute to total doses of 0.25, 2 or 20 Gy using a Mark 1-68 137Cs source, as described in EXAMPLE 1. EST targets identified in Table 1 were prepared by PCR amplification and arrayed on poly-L-lysine coated glass slides by high speed robotic printing as previously described (DeRisi et al., Nat. Genet. 14:457–460, 1996).
A complex cDNA probe was prepared from whole-cell RNA by a single round of reverse transcription using SuperscriptII (Gibco-BRL, Grand Island, N.Y.) for two hours at 42° C. according to the manufacturer's instructions in the presence of fluorescent dNTP (Cy3 dUTP or Cy5 dUTP, Amersham Pharmacia Biotech, Piscataway, N.J.). Probes were hybridized to the slides for 16 hours in 3×SSC (8.77 g/L NaCl, 4.4 g sodium citrate at pH 7.4) at 65° C. in the presence of blockers (8 μg Poly d A; 4 μg yeast tRNA; and 10 μg human Cot 1 per 10 μl hybridization). Hybridized slides were washed at room temperature (RT) in 0.5×SSC, 0.01% SDS, then in 0.06×SSC. The two fluorescent intensities (Cy3 and Cy5) were scanned separately using a laser confocal microscope, and the DeArray program was then used to identify target sites by image segmentation, calibrate relative ratios, and to develop confidence intervals for testing the significance of the ratios obtained (Chen et al., J. Biomed. Optics 2:364–374, 1997).
Local background was calculated for each target location. A normalization factor was estimated from a set of 88 internal control targets (De Risi et al., Nat. Genet, 14:457–60, 1996) with a theoretical ratio of 1.0, and the confidence interval for the array was estimated from the variance of these 88 control ratios from the expected value of 1.0. The ratios for all the targets on the array were then calibrated using the normalization factor, and ratios outside the 99% confidence interval (less than 0.54 or greater than 2.37) were determined to be significantly changed by the radiation treatment.
In the hybridized microarray, induced transcripts hybridized with more of the probe from the IR-treated sample (labeled with a red fluorophore), resulting in red spots, such as that observed at the CIP1/WAF1 target. A transcript down-regulated by IR, such as c-MYC, instead produced a green spot. Intermediate induction ratios result in a gradation of color, such as MBP-1. A schematic drawing showing the hybridized microarray is shown in
Transcripts significantly changed by radiation treatment are shown in Table 3 which also gives the mean intensities of hybridization to the unirradiated control on the microarray. This measure correlates with transcript abundance (Schena et al., Science 270:467–70, 1995; Schena et al., Proc. Natl. Acad. Sci. USA 93:10614–9, 1996), and demonstrates identification of IR modulated genes over three orders of magnitude of basal expression. Many of the stress regulated transcripts identified in Table 3 are known to be expressed at very low levels in ML-1 cells, consistent with their relative hybridization intensities on the array. For example, GADD45 and CIP1/WAF1 represent approximately 1/105 transcripts in unirradiated cells.
e
e
aRatios of relative induction by γ-rays compared to basal levels in ML-1 cells. See also http://rex.nci.nih.gov/RESEARCH/basic/lbc/fornace.htm.
bFluorescence intensity of untreated control on the microarray.
cMicroarray measurement confirmed by quantitative dot blot hybridization where expression varied by less than 2-fold.
dQuantitation varied by more than 2-fold.
eInserts for BAK (Chittenden et al., Nature 374:733–6, 1995) and BCL-XL (Boise et al., Cell 74:597–608, 1993) from clones other than Image Consortium ESTs.
A subset of the IR-responsive genes indicated by the microarray with a range of relative ratios were chosen for further study. This subset of probes included the following: Image ID clones 268652, 428248, 129632, 52681, 52530, 310141, 110503, 236422, 153355, 52753, 51363, 147075, 291503, 41452, 243143, 233194, 248032, 28116, 51699. Probes were obtained from the same plasmids used as targets on the array, and γ-ray induction of these genes was confirmed in independent experiments by northern blot hybridization. Estimates of induction or repression as measured by the microarray were compared to quantitative hybridization with single labeled probes. As indicated in Table 3, estimated expression varied by less than 2-fold for many transcripts. However, all tested sequences that were identified on the microarray as induced showed an appreciable (>2-fold) induction by the quantitative hybridization approach, with the exception of MRC-OX, which showed only 1.5-fold induction. In the case of genes showing less than 2.4-fold induction by the microarray, useful data may still be obtainable. For example, MCL-1 showed 1.9-fold induction by the microarray and 2.5-fold induction by quantitative hybridization.
The time course of induction for nine of these genes was examined in ML-1 cells. The response over time of Rag cohort 1 (RCH1) (Cuomo et al., Proc. Natl. Acad. Sci. USA 91:6156–60, 1994), a newly recognized IR-down-regulated gene, was very similar to the response of TOPOII. Both repressed genes showed a similar rapid decrease of mRNA levels following irradiation, and remain maximally repressed 24 hours after treatment. The levels of most of the newly-identified IR-induced genes rose rapidly following treatment, peaked by four hours, and declined again to near the original levels by 24 hours after treatment, following the pattern of rapid response typical of many stress-induced immediate-early genes (Fornace, Ann. Rev. Genet. 26:507–26, 1992; Smith and Fornace, Mutation Res. 340:109–24, 1996). By analogy, the genes described here appear to have roles in acute cellular responses to damage and may share some regulatory mechanisms with previously characterized IR-response genes.
Induction of nine of the newly-identified stress-response genes was next measured in a panel of human cancer cell lines to determine the scope of their IR-response, and to monitor for induction by exposure to two DNA-base-damaging agents, the alkylating agent methyl methanesulfonate (MMS) and ultraviolet (UV) radiation. The full list of cell lines from this panel is shown in Table 2.
The cell lines used in this comparison included six cell lines of myeloid-lymphoid lineage [ML-1 (myeloid), Molt4 (lymphoid), SR (lymphoid), CCRF-CEM (lymphoid), HL60 (myeloid) and K562 (myeloid)], two lung cancer lines (A549 and H1299), two breast carcinoma lines (MCF7 and T47D), and the colon cancer line RKO, along with its derivative transfected with E6 (RKO/E6) (Zhan et al., Mol. Cell. Biol. 13:4242–50, 1993). Relative induction of particular genes was measured four hours after treatment with a 20 Gy dose of ionizing radiation (
A summary of the results of these induction experiments is shown in
While all of the newly-identified genes respond to IR in at least one cell line in addition to ML-1, two of the cell lines, K562 and A549, did not show γ-ray regulation of any of the newly-defined IR-responsive genes, and only ATF3 was induced by any stress agent tested in these two lines (
This example illustrates that a single cell type may provide a less optimal model for cellular response to genotoxic stress, such as IR. However, using the techniques in this example, genes can be chosen which are often or usually induced in response to a genotoxic stress, such as radiation exposure. Such identified genes can be incorporated into a probe set of a microarray.
In addition, particular cells to be sampled (such as peripheral blood lymphocytes, see EXAMPLE 9) can be studied to determine the genes that are differentially expressed in those cells in response to irradiation. Those genes identified as being differentially expressed in response to irradiation can be incorporated into a probe set of a microarray, which would be specific for the particular cell type to be tested. For example, a microarray can be developed which contains probes which are differentially expressed in a specific type of cell (for example peripheral blood mononuclear cells such as lymphocytes) in response to irradiation. Such a microarray can then be used to determine/monitor radiation exposure in that cell type.
The large number of probes in the probe set also overcomes the problem of using induction in a single cell line as evidence of radiation exposure induction. Hence a review of the results shown in
This example describes methods used to generate a dose response curve which can be used to determine the time period during which a gene is differentially expressed following exposure to ionizing radiation. Such information is useful in determining which probes to incorporate into the probe set, particularly if the probe set is prepared for analysis of potential biological damage at a given time period after a known or suspected exposure. The genes studied in this example, CIP1/WAF1, GADD45, MDM2, ATF3 and BAX, were chosen since they were shown in the previous example to be differentially regulated in response to irradiation.
ML-1 cells were grown and exposed to various doses of radiation at approximately 5.1 cGy/min, to total doses of 2–50 cGy using the Mark 1-68 137Cs source as described in EXAMPLE 1. Gene induction was measured by incubating the irradiated cells at 37° C. for a predetermined number of hours (such as 1–4, or 24–48 hours following irradiation) followed by RNA extraction using a modified guanidine thiocyanate method (as detailed in Chomczynski and Sacchi, Anal. Biochem. 162:156–9, 1987). Gene expression was measured by quantitative dot-blot hybridization. Serial dilutions of RNA were immobilized on nylon membranes, hybridized with cDNA probes at 55° C. in a buffer containing 50% formamide (Hybrisol I, Oncor, Intergen, Purchase, N.Y.), and washed under standard conditions. Hybridization is quantitated on a phosphorimager (Molecular Dynamics, Piscataway, N.J.), and relative signal levels, normalized to the polyA content of each sample, determined using the RNA-Think program. The values for relative RNA levels are directly proportional to RNA abundance, and differences of 1.5-fold or more can reliably be measured.
Following treatment with 20 Gy γ-rays, the CIP1/WAF1, GADD45, MDM2, ATF3 and BAX genes reached maximal induction four hours after irradiation, then declined rapidly until they reached basal levels by 24 hours. The induction of CIP1/WAF1 (
Relative induction of expression can be correlated to dosage, as shown in
A broader dose range was next tested for the induction of CIP1/WAF1 to determine the point at which the induction response begins to saturate. Induction continued to increase in a dose-dependent manner up to approximately 250 cGy (
The extrapolation of data gathered at high doses to predict effects at low doses can present difficulties. In particular, it cannot necessarily be assumed that the dose-response relationship observed at high doses applies to the entire dose spectrum. For example, very low doses of ionizing radiation may be more toxic per cGy than higher doses. Therefore, extrapolation from survival at high doses may not predict the low-dose hypersensitivity revealed by the more accurate methods. This response may reflect an induction of radio-resistance, perhaps through inducible DNA repair, which requires a certain threshold dose to be triggered in some cell lines. Understanding the mechanistic basis for such induced resistance could have broad implications in areas from risk assessment to cancer treatment.
Low-dose hypersensitivity and induced radio-resistance may be related to the adaptive response to ionizing radiation, another potentially important physiological effect of low dose exposures. Exposure to a “priming” or “adapting” dose, usually in the range of 1–25 cGy, reduces the effects of a subsequent higher “challenge” dose (Stecca and Gerber, Biochem. Pharmacol. 55:941–51, 1998).
The phenomena of low-dose hypersensitivity and radio-adaptation raise the question of the requirement for a minimum threshold dose to induce a transcriptional response. The threshold effect is a factor to be considered for modeling low dose effects from results gathered at high doses, as the existence of a threshold implies that a dose can be identified below which exposure carries no risk of response (at least for the endpoint under consideration). The experiments showing low dose hypersensitivity have been interpreted to indicate a threshold, usually in the neighborhood of 25–30 cGy, which is required to activate inducible repair. Doses below this threshold are proportionately more toxic than doses which trigger the putative repair system. Radio-adaptive protection against cytogenetic aberrations or cell killing, however, has been shown to occur following doses as low as 1–2 cGy, indicating a much lower or even absent threshold for induction of this effect, and perhaps a distinct mechanism of action for the two phenomena.
While the genes studied in this example have not been implicated in induced repair, changes in mRNA levels in response to radiation in the dose range relevant to these phenomena have been observed. The data reveal no indication of a threshold for the induction of the genes studied. CIP1/WAF1 and GADD45 were significantly induced by as little as 2 cGy with a linear dose-response through 50 cGy (
Although no evidence of a minimum threshold for induction of CIP1/WAF1 was observed, there was an upper limit to the linear increase in induction, as the induction by 50 cGy was about half that previously observed by 20 Gy (
The contribution of cells which survive the treatment to the gene inductions measured at low doses would also be consistent with the disturbances in cell cycle progression observed at all doses tested. Even with 2 cGy, there was a transient decrease of S-phase cells in the population. This decrease corresponded to a larger proportion of the population than the predicted non-surviving fraction. A transient G1 delay following a consistent trend in both dose-response and temporal kinetics was observed (
Measurable differential gene expression effects were observed at relatively low, biologically relevant doses, such as 2 cGy. In radiotherapy the standard daily dose is about 2 Gy. Therefore, the results of these experiments indicate that the expression of many genes is altered at this dose. The probe sets of the present invention is capable of measuring exposure to relatively low doses of ionizing radiation (for example less than 2 Gy or 1 Gy) that may be considered non-toxic because of the absence of acute effects, but which may have short term effects that induce differential gene expression, and may have long term effects (such as carcinogenesis).
This example illustrates that probe sets that measure not only the fact of exposure to radiation, but also a probable dose of exposure to ionizing radiation, can be created by identifying genes that are differentially expressed at certain radiation doses. For example, probe sets can be designed which contain genes which are only differentially expressed at higher doses of irradiation (for example 20 Gy), or genes which are differentially expressed at lower doses of irradiation (for example 1–50 cGy). Furthermore, the methods for generating dose response curves described in this example can be used to select probes for the probe set, based on the probable time after radiation exposure occurred. Curves that show sustained or maximal expression 12, 24 or 48 hours after irradiation would identify probes that are useful for evaluation of genotoxic stress from exposure to ionizing radiation at times when clinical evaluations are likely to occur. In addition, patterns of probe set hybridization can also be associated with specified times following probable irradiation. After identifying the genes of interest, they can be incorporated into a probe set of a microarray.
This example demonstrates how to determine if the IR-induction of FRA-1 and ATF3 involve a p53 regulatory component. FRA-1 was induced by IR in some of the wt p53 lines, but was not induced in any of the p53 mutant lines studied (
To further examine the extent of dependence of ATF3 IR-induction on p53 status, in vivo induction was examined in wild-type and p53−/− (knockout) mice (Donehower et al., Nature 356:215–21, 1992) using 5 Gy whole-body γ-irradiation. While ATF3 was well induced by two hours after irradiation in the thymus of wild-type mice, there was no significant induction in the p53−/− mouse (
ATF3 is a member of the activating transcription factor/cAMP response element binding protein (ATF/CREB) family which homodimerizes to repress transcription from promoters with ATF sites. An alternatively spliced form of the ATF3 transcript, which lacks DNA binding activity, is also expressed in cells, but this form promotes transcription (Chen et al., J. Biol. Chem. 269:15819–26, 1994). The sizes of ATF3 transcripts hybridizing on the northern blot demonstrated that the smaller alternatively spliced form was the major transcript expressed in untreated ML-1 cells, whereas the IR-induced transcript was predominantly of the full length form.
The induction of ATF3 by the DNA-damaging agents MMS and UV radiation in all 12 cell lines examined (
Although FRA-1 was not IR-inducible in RKO cells, a similar comparison was possible using the MCF7 and MCF7/E6 cell lines. The human breast carcinoma cell line MCF7 has wild-type p53, while MCF7/E6 has lacked appreciable wild-type p53 function. The reduced IR-induction of FRA-1 in MCF7/E6 compared to MCF7 supports a role for p53 in the IR-induction of this gene (
MBP-1 represented another potentially p53 regulated gene, showing a pattern similar to that seen for BAX or BCL-X, in that it was induced only in p53-wild-type cell lines of lymphoid or myeloid lineage. The induction of the murine homolog of MBP-1 in the tissues of wild-type and p53−/− mice was observed to have marginal to absent expression in liver and thymus, but strong expression in spleen. Treatment with ionizing radiation resulted in a 2-fold induction of this gene in the spleens of both p53 wild-type and p53−/− mice, suggesting that this gene does not require p53 function for its induction, but that its expression and induction are both limited to a subset of cell types. This would be consistent with a role for MBP-1 in tissue specific p53-independent stress responses.
The finding that only two of nine of these genes examined in this cell line panel showed a recognizable p53 component to their regulation belies the recent focus of stress-gene studies primarily on p53-regulated genes. In light of the loss of functional p53 in the majority of tumors, the non-p53-dependent stress response genes appears to also be an important consideration in cancer treatment.
An advantage o the present invention is its ability to examine both p53-regulated and non-p53 regulated genes, or other combinations of gene types (for example genes which are recognized oncogenes or genes which are not recognized oncogenes). Quantitative functional genomics approaches, such as cDNA microarray hybridization, can also be used in combination with the radiation sensor array to unravel the inter-relationships of the molecular response pathways involved. Although radioactive-probe hybridization to nylon filter arrays provides a useful method to screen for potential genes of interest which differ in expression levels between two samples, differential screening has its own limitations (Fargnoli et al., Anal Biochem 187:364–73, 1990) some of which are avoided by the use of probes labeled with different fluorochromes co-hybridized to the same microarray. Other methods for identification of differentially expressed mRNAs, such as differential display, subtractive library hybridization and serial analysis of gene expression (SAGE) (Velculescu et al., Science 270:484–7, 1995), can be biased toward detection of highly-expressed and/or strongly-induced transcripts.
With the microarray approach of the present invention, quantitative results over a wide dynamic range were obtained for many genes. The application and further refinement of quantitative fluorescent cDNA microarray hybridization have the potential to advance our understanding of the fields of stress gene response and radiation biology, and to extend this technology beyond simple pair-wise comparisons to applications such as tumor typing, pharmacological screening, biomonitoring, and rapid carcinogen screening.
This example combined with the results from EXAMPLE 2 illustrate that probe sets that measure differential expression in response to a biologically significant amount of radiation for specific tumors can be generated. Such probe sets can be used to monitor a subject who is undergoing radiotherapy for the treatment of a tumor. For example, probe sets can be designed which contain genes from a specific type of tumor, which are differentially expressed in response to irradiation of the tumor cells. This probe set would then be used to monitor a subject's response to the radiotherapy. Further details are provided in EXAMPLE 10. As an extension of this probe set, the present method can be used in “tumor profiling,” wherein gene expression profiles are used to predict the most effective treatment for each individual subject. In this method, cells would be isolated from the tumor in a subject, and the differential expression of the genes in that tumor in response to irradiation measured as described in the examples herein. A pattern of differential expression found to be associated with a particular therapeutic response to radiation therapy can then be used as a factor when considering whether to include radiation therapy in the treatment of the tumor. For example, a pattern of differential expression associated with a good response to radiation therapy could be used to indicate that radiation therapy should be instituted.
Using the ML-1 cancer cell line and the cDNA microarray hybridization technology described in this specification, 30 sequences from ML-1 were identified that were not previously known to be radiation-responsive. These sequences are Image ID Nos. 428248, 129632, 52681, 52530, 110503, 236422, 153355, 52753, 51363, 161023, 35326, 487130, 291503, 41452, 23464, 243143, 205633, 21420, 48677, 43060, 233194, 295093, 48085, 26541, 28089, 359119, 28116, 80549, 485963, 24927. Arrays can be made that incorporate these sequences, or probe fragments thereof, along with controls and/or other sequences that are already known to be informative about radiation exposure, and/or dosage of radiation exposure.
This example describes detection of differential gene expression in irradiated ML-1 cells using different microarrays. Although the specific example discloses expression in ML-1 cells, this method can be used to detect differential gene expression in any cell type.
ML-1 cells were grown and irradiated as described in EXAMPLE 1. Labeled cDNA subsequently prepared from the irradiated cells was exposed to various arrays as described in EXAMPLE 2. cDNA microarray hybridization studies were performed with the 1.2K chip described in EXAMPLE 2 which contained 1,200 probes in a probe set (Table 1), a 5K chip which contained 5,000 probes in a probe set, and a 7K chip which contained 7,000 probes in a probe set. The 5K and 7K chips contained representative verified expressed ESTs from the human genome, and were designed to permit as much as possible of the human genome to be screened using irradiated ML-1 cells. The arrays used represent the largest sequence verified clone sets available to print at the time of the experiments. As more clones become available, they are added to new prints to be screened. The 1.2K array was a smaller set of ESTs designed for the same purpose (EXAMPLE 2).
Table 4 shows the clones used to make the 1.2K array along with data showing the relative expression detected at each position of the array. The clones used to make the 5K array are identified in Table 5 and for the 7K array in Table 6. Although these particular arrays were used in these experiments, other arrays may be used. For example, as additional arrays become available that include even more ESTs, additional biomarkers of radiation exposure can be found and added to the probe sets disclosed in Tables 9–13. Moreover, the 1.2K, 5K, and 7K arrays can also be used with different cells (such as human peripheral lymphocytes) to screen for other markers that would be even more informative when testing those cells.
The disclosed microarrays can be constructed using the Image ID number provided in these tables, and other available information. In particular, the Image ID number can be searched on the ATCC website, with hypertext links available to ATCC deposit numbers for deposited clones corresponding to the Image ID numbers. Hypertext links are also available to Genbank entries, which disclose sequence information about the nucleic acid at each array site corresponding to Image ID numbers.
Tables 7 and 8 summarize the results for exposure of cells to ionizing radiation using the 1.2K, 5K and 7K microarrays. Table 7 shows the results of different radiation doses (2 and 20 Gy), and hypoxia (which triggers p53 induction) using the 1.2K and 5K microarrays. As shown in Table 7, a very large fraction of the genes tested showed altered expression. As shown in the last two rows of Table 7, a substantial number of genes responded both acutely (3 hours) and at a later time (24 hours) after irradiation. The genes which showed expression at 24 hours were considered particularly suitable for a clinical biodetector that would detect the effects of radiation exposure during a time period when laboratory investigations of potential radiation exposure are likely to occur. However the genes which respond acutely (at three hours) could also be placed in an array to measure even more immediate exposures. Other arrays that include both acute (e.g. three hours) and later (e.g. 24 hours) response can be included in a single array. Differences between response of these two subsets can be used to help determine a probable time of radiation exposure. For example, if the three hour responders are positive but the 24 hours responders are not, then the time of exposure would have been at least three but less than 24 hours before the test. The results shown in Table 8 focus on lower doses and longer timepoints than the studies shown in Table 7.
1Results are summarized for microarray hybridizations conducted in ML-1 cells. Many of the results for row 1 and 3 have been verified by quantitative single-probe hybridization.
2Two different microarrays (chips) used contained only limited overlap in the genes represented.
3The number induced refers to cDNA clones showing significant induction (≧99% confidence).
4For cDNA clones showing a significant reduction, values are shown for those having at least a 2-fold reduction in expression compared to untreated cells; all these values exceeded 99% confidence.
5Values represent range of relative expression of irradiated sample compared to untreated control; e.g. only targets showing an increase in the relative mRNA level of 2.37-fold or more were scored as induced in the first row.
aNumber of Image ID clones that were upregulated in response the irradition.
bNumber of Image ID clones that were downregulated in response the irradition.
Tables 9–12 lists the clones (by Image ID number) for which there was 99.9% confidence that the genes were differentially expressed in ML-1 cells with the 7K array at: 24 hours after 200 cGy of irradiation (Table 9); 24 hours after 20 cGy of irradiation (Table 10); 24 hours after 2 cGy of irradiation (Table 11) and; 48 hours after 200 cGy of irradiation (Table 12). Late responding genes were found by using RNA harvested at the timepoint of interest, for example 24 or 48 hours after exposure to ionizing radiation. A more detailed timecourse of expression (such as that disclosed in EXAMPLE 3) can be used to determine if expression peaks at the time of identification (24 or 48 hours), or if there is a sustained elevation of expression. To identify genes for other timepoints post-exposure, or for different exposures, the cells would be treated with the agent/dose of interest, and the RNA would be harvested at the appropriate time.
Genes with sustained elevated expression (for example 24–48 hours following the exposure) may be suitable for inclusion in probe sets for clinical tests which would not be performed for several days after potential exposure to ionizing radiation. Using dose response information, different probe sets can be designed for different clinical situations, tailored to detection of exposure a certain number of hours following potential exposure.
Probe sets can also be designed that detect certain subsets of genes that are differentially expressed at a particular post-exposure time and level of expression. The levels of differential expression can be correlated to the dose of ionizing radiation to which the subject was exposed, such that the probe set can also be informative about the dose of exposure. This information can provide helpful prognositc information, such as the likelihood of carcinogenesis brought about by the exposure. Alternatively, levels of differential expression can be used to determine a subjects' response to radiation therapy for a tumor, as described in EXAMPLE 10.
Nucleic acid hybridization technologies may be used to survey gene expression patterns in organisms or cells that have been exposed to ionizing radiation. Such technologies are not necessarily limited to nucleic acid arrays. By way of example, northern blot and/or dot blot techniques (see EXAMPLE 3) may also be used to determine the quantitative and qualitative expression patterns of some or all of the disclosed radiation responsive sequences.
While more conventional nucleic acid hybridization techniques (such as northern and dot blots) have been used for many years, nucleic acid array technology is now widely used for monitoring and analyzing gene expression patterns. This array technology may be used in a number of forms, including microarrays. Microarrays typically comprise a large number of nucleic acid probes spotted at high density onto a surface. Descriptions of nucleic acid array and microarray technology may be found in the scientific literature, including, for example, in Chee et al., Science 274:610–4 (1996); Lockhart et al., Nature Biotechnol. 14:1675–80 (1996); Lipshutz et al. Biotechniques 19:4427 (1995); Southern et al., Trends Genetics 12:110–5 (1996); Soares et al., Curr. Op. in Biotech. 8:542–6 (1997); Ramsay et al., Nature Biotech. 16:40-4 (1997); Schena et al., Science 270:467–70 (1995); Schena et al., BioEssays 18:427–31 (1996); DeRisi et al., Science 278:680 (1998), and Iyer et al., Science 283:83–7 (1999). Detailed technical descriptions of various forms of this technology can also be found in the patent literature, including in the following patent documents:
cDNA arrays can be formed on non-porous surfaces (such as glass) by in situ synthesis of oligodeoxynucleotides on a chemically sensitized glass surface (as in WO 92/10588 and WO 95/11995), or by robotic micropipetting of nanoliter quantities of DNA to predetermined positions on a non-porous glass surface (as in Schema et al., Science 270:467–470, 1995, and WO 95/35505). DNA may be coupled to the solid support by electrostatic interactions with a coating film of a polycationic polymer such as poly-L-lysine (WO 95/35505), or covalently bound to the solid support.
Nucleic acid arrays employ conventional nucleic acid hybridization methods that have been used for decades to identify and quantify nucleic acids in biological samples (such conventional methods include southern and northern blots, colony hybridizations and dot blots). However, whereas such conventional techniques typically employ one or two hybridization probes to obtain information on the expression patterns of one or a few genes, array techniques typically employ very large probe sets (for example at least 100, 1000 or even 5000 or 10,000 probes) to obtain data on the expression of a vast number of genes simultaneously.
The basic principle underlying the array technology is the hybridization of a sample nucleic acid composition with a defined set of nucleic acid probes, followed by detection of specific hybridization of the sample to one of more of these probes. The hybridization pattern obtained is then analyzed and compared to hybridization patterns obtained with control nucleic acid samples.
Most arrays comprise a defined set of nucleic acid probes immobilized on a fixed surface in an ordered and known sequence, forming an array of discrete spots of nucleic acid material. A number of substrates may be used to form the fixed surface, including silica-based chips, nylon membranes, microtiter plates and glass slides. Each probe within the set is typically produced by polymerase chain reaction amplification (alternative methods include purification from cloning vectors and, for oligonucleotide probes, chemical synthesis). Each amplification product (probe) is then typically spotted onto the fixed surface using a mechanized means (such as a robotic arm) to form the array. The sample nucleic acid composition (for example, labeled cDNA produced by RT-PCR from mRNA extracted from a tissue sample) is labeled with a detectable marker (e.g., a fluorescent label) prior to hybridization to the array to permit detection of specific hybridization of the sample to a particular probe. Following hybridization, the hybridization pattern is detected and an image of the array is produced for analysis of gene expression. The use of multiple different fluorescent labels to label the samples permits multiple different samples to be hybridized to a single array. Typically however, no more than two samples are hybridized to a single array.
Alternatively, the arrangement may be reversed. The fixed array may include a number of nucleic acid test samples and the probes may be hybridized to a number of duplicates of the array. In this situation, a large number of test sample nucleic acid compositions are attached to the surface to form the array. Subsets of the nucleic acid probe set are subsequently hybridized to duplicates of the array, and hybridization of the probes to individual spots of the array is detected. The use of multiple different fluorophores to label the probes permits multiple different probes from the probe set to be hybridized to a single array.
By way of illustration, a microarray may be produced and employed using the following techniques:
Each probe in the microarray probe set is produced by PCR amplification. This may be achieved by using primers that are specific for the sequence of the differentially expressed gene (as disclosed herein) in conjunction with genomic DNA or cDNA as a template. The amplified PCR products are purified to remove excess primer and template, for example by column chromatography, or by simple sodium acetate or ammonium acetate precipitation, followed by either isopropanol or ethanol washing, and drying. In certain instances, the probe may be used without purification. Following purification, the PCR-amplified probes are resuspended in 10 μl of a salt solution (3×SSC) and transferred into 96-well or 384-well microtiter plates (if they were not already in a microtiter plate from previous steps). The samples are mechanically spotted onto coated glass slides, using a robotic arm containing pins that transfer an amount of each probe from the microtiter plate onto the coated glass slide. Several hundred or several thousand probe spots are spotted onto the array such that each probe is represented by a discrete spot with a center to center distance between spots of about 250 μm to about 500 μm. The glass slides are then processed to cross-link the probe set onto the glass surface.
The nucleic acid sample to be hybridized to the array is typically cDNA obtained by RT-PCR of mRNA extracted from a biological sample (e.g., plant or animal cells) (see EXAMPLE 2). The detectable label (e.g., the fluorophore) may be incorporated during the RT-PCR amplification step. Typical experiments involve either single-color fluorescence hybridization to measure the absolute levels of gene expression in a single sample, or two-color fluorescence hybridization to examine the relative expression of genes in two different samples.
For single-color fluorescence hybridization experiments, mRNA is isolated from a sample of interest and used as a template to produce cDNA. The cDNA is labeled, for example using a fluorescent dye such as Cy-3 or Cy-5 (Amersham Pharmacia Biotech, Piscataway, N.J.), or any other fluorophore or label. The label can be incorporated directly during the reverse transcription step. The sample is then hybridized to the array. Following washing to remove non-specifically bound sample, the array is scanned for fluorescent emission following laser excitation, and the intensity of each fluorescing spot is measured. The intensity of each spot is approximately proportional to the expression of the gene (corresponding to the probe) in the sample examined. This data provides an indication of the expression of a particular gene in the tissues from which the sample nucleic acid was prepared.
For two-color fluorescence hybridization experiments, RNA is isolated from two samples of interest and labeled as described above, except that each sample is labeled with a different fluorescent label, each of which fluoresces at a different wavelength (for example, one sample may be labeled with Cy-3 and the other with Cy-5). After the two sample preparations are labeled, they are mixed together and hybridized to a single microarray. After washing, the microarray slide is scanned in two fluorescence channels. Because the two fluorescent labels are selected such that their emission spectra do not overlap, the signal of each of the two fluors can measured for each of the probes of the array. The absolute levels of intensity for each probe in an array is approximately proportional to the expression of the gene in the sample examined, and the ratio of the two fluor intensities indicates the relative expression of a gene in the two different samples.
Each probe includes at least one copy (and more typically many copies) of an isolated nucleic acid molecule. Typically, the nucleic acid molecule is in substantially pure form, i.e., while there may be small amounts of other nucleic acid molecules in the probe preparation (such as degradation products), the selected nucleic acid molecule will be the predominant nucleic acid molecule present. In addition, a probe is generally of known sequence. A “probe set” comprises a collection of two or more such probes (the individual probes within the probe set not being co-mingled). A probe set will include probes selected for a particular purpose (such as detection of acute or semi-acute exposure to ionizing radiation), but may also include controls or other probes for different purposes. In some examples, at least 10% of the probes in the probe set are selected for the particular purpose of determining an actual or potential biological response to an actual or potential radiation exposure. In other examples, at least 5, 10 25, or 50 of the probes of an array are selected for this purpose.
A hybridization probe for use in an array produced according to the present invention may be referred to as a sequence “representing” a particular gene product. A sequence “representing” a particular gene product is one that will specifically hybridize to a nucleic acid molecule encoding that gene product, thereby permitting identification of that gene product. A sequence representing a particular gene product may comprise an entire cDNA sequence (or the corresponding genomic gene sequence) or less than an entire cDNA sequence. For example, the probe may comprise an oligonucleotide comprising a minimum specified number of consecutive bases of a selected gene that is differentially expressed following irradiation. Oligonucleotides as short as 8–10 consecutive bases of a cDNA will be effective to produce meaningful gene expression data using microarray technology. For example, a nine base oligonucleotide can distinguish 262,144 transcripts (49). However, for enhanced specificity of hybridization, longer oligonucleotides may be employed, such as at least 10, 15, 20, 25, 30, 50, 50 or more consecutive bases of a cDNA. Furthermore, a probe “representing” a particular gene product need not be a complete match. While probes that share 100% sequence identity over their entire length to the corresponding cDNA sequence will typically provide enhanced specificity of hybridization, probes that share less than 100% sequence identity may also be useful in such microarray applications. Typically, such probes will share at least 70% sequence identity with the corresponding cDNA, but probes sharing at least 75%, 80%, 85%, 90%, 95%, 97%, 98%, and 99% sequence identity may be utilized to achieve enhanced specificity.
An examples of a probe sets that has been used to make a cDNA array that detects differential expression of genes following irradiation is shown in Tables 13. The clones on this chips include all the positive hits from studies with larger arrays (5K and 7K), in addition to probes for other genes that have been reported in the literature as being stress-induced.
A microarray was assembled which contained each clone in Table 13 printed only once on the array. However the arrays can routinely include duplicates of each position as the stress-array, to provide an additional degree of certainty for positive “hits.” In addition, internal controls of the type known in the art can be used on each array.
The probe set for a cDNA array can be made using a probe set comprising nucleic acid molecules representing a portion of the cDNAs disclosed in Tables 9–13. Alternatively, such probe sets may be used to analyze gene expression patterns using other hybridization techniques, such as northern and dot blots (see EXAMPLE 3). However, useful information may also be obtained using arrays or other techniques that employ probe sets representing less than all of the disclosed cDNAs in the array of Tables 9–13. For example, arrays that employ probe sets representing between 10% and 99% of the disclosed cDNA sequences may be employed. Thus, arrays employing probe sets representing at least 25%, 50%, 80% or 90% of the disclosed cDNAs depicted in any of Tables 9–13 may be employed. Alternatively the probe set may represent at least 5, 10, 25, or 50 of the disclosed cDNAs depicted in any of Tables 9–13. Each probe can “represent” the cDNA by having sufficient contiguous nucleotide (or variant thereof) that hybridizes to a target nucleic acid of interest. Alternatively, an array can be made using probes that detect differential expression of genes within a certain time period. An example would be an array consisting of the probes of any of Tables 9–12 or combinations thereof.
The sets of disclosed probes may also be used in nucleic acid hybridization techniques. Such probe sets are particularly useful for assessing differential gene expression in cells that have been exposed to radiation. The probe sets comprise nucleic acid probes representing specified percentages of the differentially expressed cDNAs disclosed herein. These probe sets may be used with any nucleic acid hybridization technique that can be used to analyze patterns of gene expression, including but not limited to: northern blotting, dot blotting, arrays and microarrays.
In addition, the invention also provides methods for analyzing expression of multiple genes in a biological sample. A biological sample is provided which includes nucleic acids, and the biological sample is hybridized to a nucleic acid probe set arrayed on a surface. Typically, the nucleic acids of the biological sample are labeled with a detectable label such as a fluorescent label, to permit easy detection of hybridization between the sample and a probe. The nucleic acid probe set arrayed on the surface includes nucleic acid probes representing, for example at least 2%, 5%, 10%, 25%, 50% or more of the radiation induced cDNA sequences disclosed in any of Tables 9–13, such as subsets of probes for genes that have sustained expression at 24 or 48 hours, or expression at a level that corresponds to clinical exposure to a certain dose of ionizing radiation.
The present invention also includes providing a plurality of duplicate arrays of test sample nucleic acid compositions (such as samples taken from one or more subjects) immobilized on a fixed surface. These arrays are then hybridized with a nucleic acid probe set (typically labeled with a detectable label). Hybridization using the complete set of probes is typically achieved by hybridizing a small subset of the probes (typically 2–5 probes) to one of the array duplicates, and then repeating the procedure by hybridizing additional subsets of the probe set to additional duplicates of the array. To facilitate detection of hybridization, each of the probes within a particular subset is typically conjugated to a different detectable label. The probe set used in this method comprises nucleic acid probes representing at least 2%, 5%, 10%, 25%, 50% or more of the cDNA sequences disclosed in any of Tables 9–13, or the complete probe sets disclosed in those Tables, other subsets thereof (such as at least 5, 10, 25, or 50 of the disclosed sequences), or probe sets that are larger than any of these sets and include additional informative markers that have been or are discovered.
Arrays of polynucleotide DNA probes immobilized on solid supports have been used to study the composition of complex mixtures of DNA by hybridization techniques. For example, a complex mixture of labeled cDNA is hybridized to the DNA array under conditions of appropriate stringency, and unbound material is washed away. The array is then scanned using a detector, such as a scanning fluorescent microscope, capable of sensing the remaining bound labeled cDNA. The intensity of the detected signal at any given element is a measure of the concentration of the corresponding cDNA in the original complex mixture. See Schema et al., Science 270:467–470, 1995 and WO 96/17958.
This example demonstrates that cDNA microarrays can be used to assess the effects on gene expression of irradiating isolated human peripheral blood lymphocytes or alternatively can detect altered gene expression to assess potential radiation exposure. Peripheral blood lymphocytes provide an easily accessible source of radiation sensitive tissue from a subject who has potentially been exposed to ionizing radiation. To determine whether the subject has experienced a biologically significant radiation exposure, approximately 30 ml may be obtained by conventional phlebotomy. This amount of blood provides sufficient mRNA for hybridization to a large array (such as a 5K array or 7K array), or several smaller arrays (such as 1.2K arrays), during efforts to determine which genes are differentially expressed. However, much smaller probe sets can be used diagnostically, with corresponding smaller amounts of blood required.
Although this example describes methods which were used to analyze peripheral blood lymphocytes, one skilled in the art will recognize that similar methods can be used to analyze any biological specimen. Biological specimens can be obtained from subjects who have potentially been, or are known to have been, exposed to ionizing radiation. A variety of biological specimens can be used in the present invention. Examples include, but are not limited to: peripheral blood, urine, saliva, tissue biopsy, surgical specimen, amniocentesis samples and autopsy material.
Blood may be drawn from potentially exposed individuals, RNA extracted immediately, and tested against an established baseline to determine relative levels of a number of genes, then compared to profiles for various qualities or quantities of radiation, or other environmental agents, to determine likelihood of exposure. Approximate dose and time since exposure can also be determined.
Human blood from normal healthy donors were obtained from the NIH blood bank (Department of Transfusion Medicine) and within 30–60 minutes of drawing, the components were separated by centrifugation on a Lymphoprep (Nycomed, Oslo, Norway) density gradient according to the manufacturer's instructions. The buffy coat layers were recovered, washed in phosphate buffered saline and resuspended at a density of 0.5–1×106 cells per ml in RPMI 1640 medium supplemented with 10% heat-inactivated (56° C. for 45 minutes) fetal calf serum and 100 U/ml penicillin and 100 μg/ml streptomycin at 37° C. in a humidified, 5% CO2 atmosphere. Peripheral blood lymphocytes (PBLs) were allowed to equilibrate to culture conditions for 45–60 minutes, then irradiated at approximately 60 cGy/min. to total doses of 20–200 cGy using a Mark 1-68 137Cs source (J.L. Shepherd and Associates, Inc., San Fernando, Calif.) with lead attenuators in place.
RNA was harvested at 0, (unirradiated) 4, 24, 48 or 72 hours after irradiation using an acid guanidinium thiocyanate-phenol-chloroform mixture, as described by Chomczynski et al. (Anal. Biochem. 162:156–9, 1987). RNA harvested 24 hours after irradiation was hybridized to the 7K microarray (Table 6) and to the array shown in Table 13 and analyzed as described in EXAMPLE 2.
The results shown in Table 14 show the Image ID of PBL genes which were differentially expressed 24 hours following 2 Gy of irradiation. These results indicate that cDNA microarrays, such as the 7K microarray shown in Table 6, can be used to identify genes which are differentially expressed in response to irradiation. Such information can be used to identify genes to include in a probe set (such as those shown in Table 14) which will be useful for assessing potential radiation exposure in a cell from an animal or plant subject.
The alteration in gene expression following γ-irradiation was examined in PBLs for five genes identified as being differentially expressed in response to irradiation (Table 14). As shown in
A timecourse of induction following 2 Gy γ-rays delivered ex vivo to PBLs was also examined as described in EXAMPLE 3. RNA was harvested from lymphocytes following one, two, or three days of incubation. Following irradiation, the DDB2, CIP1/WAF1, and XP-C genes reached maximal induction 24 hours after irradiation, then declined (
The relative induction of expression was correlated to dosage, as shown in
In conclusion, this example demonstrates that biological specimens can be used to determine if a cell has been exposed to a biologically significant amount of ionizing radiation, and approximately how much time has elapsed since the cell was exposed. In addition, the relative induction of radiosensitive genes can be correlated to the dosage of irradiation received by the cell.
This example describes methods which can be used to analyze organisms which have been, or may have been, exposed to ionizing radiation.
Analysis of Irradiated Animals
The analysis of animals which have potentially been exposed to, or are known to have been exposed to, ionizing irradiation can be performed. As test subjects, rodents such as mice or rats are whole body irradiated, for example using a Mark 1-68 137Cs source (JL Shepherd and Associates, San Fernando, Calif.) to deliver total doses between 2 and 500 cGy as described in the above examples. Following irradiation, biological samples are harvested from the animals. Examples of biological samples include, but are not limited to, those listed above in EXAMPLE 9. Tissues are flash frozen in GTC (guanidine thiocyanate solution, used above), and the RNA is isolated and reverse transcribed incorporating a label, as described above. The labeled nucleic acids are then exposed to an array, for example using mouse equivalents of the genes shown in any of Tables 9–13. A test array can be generated, by selecting for probe sets as described herein, for example probe sets designed to detect radiation exposure in a specimen that is to be tested, such as peripheral blood cells for example leukocytes. Patterns or hybridization to the array can then be observed to determine the likelihood of exposure, and/or a dose of exposure in subsequent test subjects.
Analysis of Subjects Undergoing Radiation Therapy
The disclosed arrays can also be used to evaluate and/or select patients who are undergoing (or are candidates for) radiation therapy, for the treatment of cancer or a tumor. For example, subjects can be monitored during radiotherapy to detect gene expression induced by irradiation. Differential gene expression provides diagnostic or prognostic information, for example, it may indicate whether sufficient doses of radiation are being administered to cause regression of the cancer. An indication that a sufficient dose of radiation has been used may be a pattern of differential gene expression associated with a successful outcome of treatment of tumors of a particular type. In addition, subjects can be monitored to determine if the dose of radiation is causing harmful effects, for example iatrogenic carcinogenesis or suppression of the immune system by determining whether a pattern of differential expression has occurred that is associated with either of these outcomes.
The RNA from subjects is isolated, reverse-transcribed, and labeled as described above in EXAMPLE 2. The labeled nucleic acids are then exposed to an array, for example an array which contains a probe set which has been selected for genes which are differentially expressed at the radiation dose which the subject is being treated with. For example, the array may include genes which have been shown using the methods described in the above examples to be differentially expressed at 2 Gy, a typical daily dose which a subject might receive in radiotherapy. In addition, an array can be selected based on the tumor which is being treated. For example, the array can include or consist of genes which have been shown (using the methods described in the above examples) to be differentially expressed at the dose which the subject is receiving, in the type of tumor the subject suffers from. Such specific probe sets provide useful diagnostic information regarding the subject's progress in the radiotherapy.
Analysis of Irradiated Plants or Microbes
In addition to humans and other animals, plants or microbes which have been, or may have been, exposed to radiation can be analyzed using the present invention. In one example, following a nuclear accident, if there are no animals present to examine to determine the amount of exposure, the plant or microbial flora present in the area can be analyzed. Nucleic acids, for example RNA is isolated using standard methods. Nucleic acids are isolated from any part of the plant, for example roots, stems, leaves or flowers. The RNA is isolated, reverse-transcribed, and labeled as described above. The labeled plant nucleic acids can then be exposed to an array, for example using plant equivalents of the genes shown in Tables 9–13, or other plant genes that are found to be differentially expressed following radiation exposure.
Having illustrated and described the principles of assessing whether an organism has been exposed to biologically significant levels of ionizing radiation by detecting differential cellular expression of genes that are differentially expressed following such exposure, it should be apparent to one skilled in the art that the invention can be modified in arrangement and detail without departing from such principles. In view of the many possible embodiments to which the principles of our invention may be applied, it should be recognized that the illustrated embodiments are only examples of the invention and should not be taken as a limitation on the scope of the invention. Rather, the scope of the invention is in accord with the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.
This application is a U.S. national stage § 371 application of PCT/US00/04897 filed Feb. 25, 2000, which was published in English under PCT Article 21(2), which claims the benefit of U.S. provisional application No. 60/121,756 filed Feb. 26, 1999.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US00/04897 | 2/25/2000 | WO | 00 | 8/8/2001 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO00/50643 | 8/31/2000 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5563060 | Hozier | Oct 1996 | A |
5691157 | Gong et al. | Nov 1997 | A |
5707807 | Kato | Jan 1998 | A |
5744305 | Fodor et al. | Apr 1998 | A |
5770722 | Lockhart et al. | Jun 1998 | A |
5830645 | Pinkel et al. | Nov 1998 | A |
5843655 | McGall | Dec 1998 | A |
5866330 | Kinzler et al. | Feb 1999 | A |
5877215 | McClay et al. | Mar 1999 | A |
5910309 | Ullrich | Jun 1999 | A |
5965352 | Stoughton et al. | Oct 1999 | A |
5998136 | Kamb | Dec 1999 | A |
Number | Date | Country |
---|---|---|
WO 9730179 | Aug 1997 | WO |
WO 9818961 | May 1998 | WO |
WO 9856954 | Dec 1998 | WO |
WO 9955913 | Nov 1999 | WO |
Number | Date | Country | |
---|---|---|---|
60121756 | Feb 1999 | US |