Nucleic acid detection assay control genes

Information

  • Patent Application
  • 20040048297
  • Publication Number
    20040048297
  • Date Filed
    July 30, 2003
    21 years ago
  • Date Published
    March 11, 2004
    21 years ago
Abstract
The present invention includes methods of normalizing quantitative and non-quantitative nucleic acid detection assays by identifying genes whose expression level is invariant among cell or tissue types. The methods of the invention can be used in the diagnosis of disease, in quality control in evaluating external data or databases, and in normalization of external data for comparative purposes. The genes of the invention can be used to produce microarrays that generate data with improved reliability.
Description


FIELD OF THE INVENTION

[0002] The invention relates generally to control genes that maybe utilized for normalizing hybridization and/or amplification reactions, as well as methods of identifying these genes that may be used in toxicology studies and in analyzing gene expression data sets for quality and compatibility with other data sets.



BACKGROUND OF THE INVENTION

[0003] Nucleic acid hybridization and other quantitative nucleic acid detection assays are routinely used in medical and biotechnological research and development, diagnostic testing, drug development and forensics. Such technologies have been used to identify genes which are up- or down-regulated in various disease or physiological states, to analyze the roles of the members of cellular signaling cascades and to identify drugable targets for various disease and pathology states.


[0004] Examples of technologies commonly used for the detection and/or quantification of nucleic acids include Northern blotting (Krumlauf (1994), Mol Biotechnol 2: 227-242), in situ hybridization (Parker & Barnes (1999), Methods Mol Biol 106: 247-283), RNAse protection assays (Hod (1992), Biotechniques 13: 852-854; Saccomanno et al. (1992), Biotechniques 13: 846-850), microarrays, and reverse transcription polymerase chain reaction (RT-PCR) (see Bustin (2000), J Mol Endocrin 25: 169-193).


[0005] The reliability of these nucleic acid detection methods depend on the availability of accurate means for accounting for variations between analyses. For example, variations in hybridization conditions, label intensity, reading and detector efficiency, sample concentration and quality, background effects, and image processing effects each contribute to signal heterogeneity (Hegde et al. (2000), Biotechniques 29: 548-562; Berger et al. (2000), WO 00/04188). Normalization procedures used to overcome these variations often rely on control hybridizations to housekeeping genes such as P-actin, glyceraldehyde-3-phosphate dehydrogenase (GADPH), and the transferrin receptor gene (Eickhoff et al. (1999), Nucl Acids Res 27:e33; Spiess et al. (1999), Biotechniques 26: 46-50. These methods, however, generally do not provide the signal linearity sufficient to detect small but significant changes in transcription or gene expression (Spiess et al.(1999), Biotechniques 26:46-50). In addition, the steady state levels of many housekeeping genes are susceptible to alterations in expression levels that are dependent on cell differentiation, nutritional state, specific experimental and stimulation protocols (Eickhoff et al. (1999), Nucl Acids Res 27:e33; Spiess et al. (1999), Biotechniques 26:46-50; Hegde et al. (2000), Biotechniques 29:548-562; and Berger et al. (2000), WO 00/04188). Consequently, there exists a need for the identification and use of additional genes that may serve as effective controls in nucleic acid detection assays.



SUMMARY OF THE INVENTION

[0006] The present invention includes methods of identifying at least one gene that is consistently expressed across different cell or tissue types in an organism, comprising: preparing gene expression profiles for different cell or tissue types from the organism; calculating a coefficient of variation for at least one gene in each of the profiles across the different cell or tissue types; and selecting any gene whose coefficient of variation indicates that the gene is consistently expressed across the different cell or tissue types. The coefficient of variation may be less than about 40% and the methods may comprise creating gene expression profiles for about 10, 25, 50, 100 or more different cell or tissue types. The gene expression profiles may be prepared be querying a gene expression database.


[0007] The invention also includes a set of probes comprising at least two probes that specifically hybridize to a control gene identified by the methods of the invention. Such sets of probes may comprise probes that specifically hybridize to at least about 10, 25, 50 or 100 control genes. In some formats, the sets of probes are attached to a solid substrate such as a microarray or chip.


[0008] The invention also includes methods of normalizing the data from a nucleic acid detection assay comprising: detecting the expression level for at least one gene in a nucleic acid sample; and normalizing the expression of said at least one gene with the detected expression of at least one control gene identified by the method of the invention. The number of control genes used to normalize gene expression data may comprise about 10, 25, 50, 100 or more of the control genes herein identified.


[0009] In another embodiment, the invention includes a set of probes comprising at least two probes that specifically hybridize to a gene of Table 1. The set may comprise at least about 10, 25, 50, 100 or more the control genes of Table 1. The sets of probes may or may not be attached to a solid substrate such as a chip.


[0010] The invention, in another embodiment, includes methods of normalizing the data from a nucleic acid detection assay comprising: detecting the expression level for at least one gene in a nucleic acid sample; and normalizing the expression of said at least one gene with the detected expression of at least one control gene of Table 1. The number of control genes used to normalize gene expression data may comprise about 10, 25, 50, 100, 500 or more of the control genes herein identified.



DETAILED DESCRIPTION

[0011] The present Inventors have identified rat control genes that may be monitored in nucleic acid detection assays and whose expression levels may be used to normalize gene expression data or evaluate the suitability of test data to compare to or to include in a database of like data. Normalization of gene expression data from a cell or tissue sample with the expression level(s) of the identified control genes allows the accurate assessment of the expression level(s) for genes that are differentially regulated between samples, tissues, treatment conditions, et. These control genes may be used across a broad spectrum of assay formats, but are particularly useful in microarray or hybridization based assay formats.


[0012] A. Nucleic Acid Detection Assay Controls


[0013] 1. Selection of Control Genes


[0014] As used herein, the genes selected by the disclosed methods as well as the rat genes and nucleic acids of Table 1 are referred to as “invariant” or “control genes.” Control genes of the invention may be produced by a method comprising preparing gene expression profiles (a representation of the expression level for at least one gene, preferably 10, 25, 50, 100, 500 or more, or, most preferably, nearly all or all expressed genes in a sample) from at least two (or a variety) of cell or tissue types, or from a set of samples of at least one cell or tissue type in which the set contains normal samples (from healthy animals), disease state samples, toxin-exposed samples, etc., measuring the level of expression for at least one gene in each of the gene expression profiles to produce gene expression data, calculating a coefficient of variation in the expression level from the gene expression data for each gene (% CV) and selecting genes whose coefficient of variation indicates that the gene is consistently expressed at about the same level in the different cell or tissue types. In one embodiment, such genes that are expressed at about the same level, or are invariantly expressed, are those genes that have a coefficient of variation (expressed as a percentage) of less than or equal to about 40%.


[0015] In the methods of the invention, gene expression profiles maybe produced by any means of quantifying gene expression for at least one gene in the tissue or cell sample. In preferred methods, gene expression is quantified by a method selected from the group consisting of a hybridization assay or an amplification assay. Hybridization assays may be based on any assay format that relies on the hybridization of a probe or primer to a nucleic acid molecule in the sample. Such formats include, but are not limited to, differential display formats and microarray hybridization, including microarrays produced in chip format. Amplification assays include, but are not limited to, quantitative PCR, semiquantitative PCR and assays that rely on amplification of nucleic acids subsequent to the hybridization of the nucleic acid to a probe or primer. Such assays include the amplification of nucleic acid molecules from a sample that are bound to a microarray or chip.


[0016] In other circumstances, gene expression profiles may be produced by querying a gene expression database comprising expression results for genes from various cell or tissue samples. The gene expression results in the database may be produced by any available method, such as differential display methods and micro array-based hybridization methods. The gene expression profile is typically produced by the step of querying the database with the identity of a specific cell or tissue type for the genes that are expressed in the cell or tissue type and/or the genes that are differentially regulated compared to a control cell or tissue sample. Available databases include, but are not limited to, the Gene Logic Gene Express™ database, the Gene Expression Omnibus gene expression and hybridization array repository available through NCBI (www.ncbi.nlm.nih.gov/entrez) and the SAGE™ gene expression database.


[0017] In preferred embodiments, the statistical measure referred to herein as the coefficient of variation (% CV) is calculated on a gene by gene basis across a number of samples or across a reference database to find the least variant genes with respect to a number of cell or tissue types or sample treatments.


[0018] Further, the statistical methods of the invention are particularly useful for determining the compatibility of a test sample to an entire set of samples, or an existing database derived from those samples. For instance, a % CV value for genes that have been shown to be the most resistant to variability is calculated for all samples within a test group or test database. These % CV values are then compared to those from a standard reference database. Accordingly, a closeness distribution of all individual samples in the test database to the reference database as a whole can be generated to evaluate the compatibility of new samples. The genes identified in Table I show invariant patterns of expression and can be used to assess compatibility and reliability of gene expression experiments and predictive modeling experiments. These genes show low variability both in control groups from many different experiments and in studies of disruptions of gene expression, such as those occurring in disease states. As a result, these genes can be used as an internal standard for comparing gene expression data. Measurements of expression levels of these genes are used to determine the extent of compatibility of data from different sources and the need, or lack thereof, for normalization or further quality control and adjustments. These measurements also provide an internal standard that supplies a reference point for highly disrupted patterns of gene expression. These genes are also of critical importance for determining relative expression if small numbers of markers are used in custom microarrays.


[0019] The cell or tissue sample that reduced to prepare gene expression profiles may include any cell or tissue sample available. Such samples include, but are not limited to, tissues removed as surgical samples, diseased or normal tissues, in vitro or in vivo grown cells, and cell cultures and cells or tissues from animals exposed to an agent such as a toxin. The number of samples that may be used to calculate a coefficient of variation is variable, but may include about 3, 10, 25, 50, 100, 200, 500 or more cell or tissue samples. The cell or tissue samples may be derived from an animal or plant, preferably a mammal, most preferably a rat. In some instances, the cell or tissue samples may be human, canine (dog), mouse or rat in origin.


[0020] In some embodiments of the invention, the coefficient of variation maybe calculated from raw expression data or from data that has been normalized to control for the mechanics of hybridization, such as data normalized or controlled for background noise due to non-specific hybridization. Such data typically includes, but is not limited to, fluorescence readings from microarray based hybridizations, densitometry readings produced from assays that rely on radiological labels to detect and quantify gene expression and data produced from quantitative or semi-quantitative amplification assays.


[0021] The coefficient of variation (CV) is typically calculated by calculating a mean value for the expression level of a given gene across a number of samples and calculating the standard deviation (SD) from that mean. The CV may be calculated by the following equation: CV=SD/Mean and may or may not be presented as a percentile value. Genes with a CV of less than about 40% may be selected as control genes or are considered as genes that are consistently expressed across the different cell or tissue types tested.


[0022] As used herein, “background” refers to signals associated with non-specific binding (cross-hybridization). In addition to cross-hybridization, background may also be produced by intrinsic fluorescence of the hybridization format components themselves.


[0023] “Bind(s) substantially” refers to complementary hybridization between an oligonucleotide probe and a nucleic acid sample and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the nucleic acid sample.


[0024] The phrase “hybridizing specifically to” refers to the binding, duplexing or hybridizing of a molecule substantially to or only to a particular nucleotide sequence or sequences under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA.


[0025] 2. Preparation of Controls Genes, Probes and Primers


[0026] The control genes listed in Table I may be obtained from a variety of natural sources such as organisms, organs, tissues and cells. The sequences of known genes are in the public databases. The GenBank Accession Number corresponding to the Normalization Control Genes can be found in Table 1. The sequences of the genes in GenBank (http://www.ncbi.nlm.nih.gov/) are herein incorporated by reference in their entirety as of the priority date of this application.


[0027] Probes or primers for the nucleic acid detection assays described herein that specifically hybridize to a control gene may be produced by any available means. For instance, probe sequences may be prepared by cleaving DNA molecules produced by standard procedures with commercially available restriction endonucleases or other cleaving agents. Following isolation and purification, these resultant normalization control gene fragments can be used directly, amplified by PCR methods or amplified by replication or expression from a vector.


[0028] Control genes and control gene probes or primers (i.e., synthetic oligonucleotides and polynucleotides) are most easily synthesized by chemical techniques, for example, the phosphoramidite method of Matteucci et al. ((1981) J Am Chem Soc 103:3185-3191) or using automated synthesis methods using the GenBank sequences disclosed in Table 1. Probes for attachment to microarrays or for use as primers in amplification assays may be produced from the sequences of the genes identified herein using any available software, including, for instance, software available from Molecular Biology Insights, Olympus Optical Co. and Premier Biosoft International.


[0029] In addition, larger nucleic acids can readily be prepared by well known methods, such as synthesis of a group of oligonucleotides that define various modular segments of the normalization control genes and normalization control gene segments, followed by ligation of oligonucleotides to build the complete nucleic acid molecule.


[0030] B. Normalization Methods


[0031] Gene expression data produced from the control genes in a given sample or samples may be used to normalize the gene expression data from other genes using any available arithmatic or calculative means. In particular, gene expression data from the control genes in Table 1 are useful to normalize gene expression data for toxicology testing or modeling in an animal model, preferably in a rat. Such methods include, but are not limited, methods of data analysis described by Hegde et al. (2000), Biotechniques 29:548-562; Winzeller et al. (1999), Meth Enzymol 306:3-18; Tkatchenko et al. (2000), Biochimica et Biophysica Acta 1500:17-30; Berger et al. (2000), WO 00/04188; Schuchhardt et al. (2000), Nucl Acids Res 28:e47; Eickhoffet al. (1999), Nucl Acids Res 27:e33. Micro-array data analysis and image processing software packages and protocols, including normalization methods, are also available from BioDiscovery (http://www.biodiscovery.com), Silicon Graphics (http://www.sigenetics.com), Spotfire (http://www.spotfire.com), Stanford University (http://rana.Stanford.EDU/software), National Human Genome Research Institute (http://www.nhgri.nih.gov/DIR/LCG/15K/HTML/img_analysis.html), TIGR (http://www.tigr.org/softlab), and Affymetrix (affy and maffy packages), among others.


[0032] C. Assay or Hybridization Formats


[0033] The control genes of the present invention may be used in any nucleic acid detection assay format, including solution-based and solid support-based assay formats. As used herein, “hybridization assay format(s)” refer to the organization of the oligonucleotide probes relative to the nucleic acid sample. The hybridization assay formats that may be used with the control genes and methods of the present invention include assays where the nucleic acid sample is labeled with one or more detectable labels, assays where the probes are labeled with one or more detectable labels, and assays where the sample or the probes are immobilized. Hybridization assay formats include but are not limited to: Northern blots, Southern blots, dot blots, solution-based assays, branched DNA assays, PCR, RT-PCR, quantitative or semi-quantitative RT-PCR, microarrays and biochips.


[0034] As used herein, “nucleic acid hybridization” simply involves contacting a probe and nucleic acid sample under conditions where the probe and its complementary target can form stable hybrid duplexes through complementary base pairing (see Lockhart et al., (1999) WO 99/32660). The nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be detected, typically through detection of an attached detectable label.


[0035] It is generally recognized that nucleic acids are denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids. Under low stringency conditions (e.g., low temperature and/or high salt) hybrid duplexes (e.g., DNA-DNA, RNA-RNA or RNA-DNA) will form even where the annealed sequences are not perfectly complementary. Thus, specificity of hybridization is reduced at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower salt) successful hybridization requires fewer mismatches. One of skill in the art will appreciate that hybridization conditions may be selected to provide any degree of stringency. In a preferred embodiment, hybridization is performed at low stringency, in this case in 6×SSPE-T at 37° C. (0.005% Triton X-100) to ensure hybridization, and then subsequent washes are performed at higher stringency (e.g., 1×SSPE-T at 37° C.) to eliminate mismatched hybrid duplexes. Successive washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25×SSPE-T at 37° C. to 50° C. until a desired level of hybridization specificity is obtained. Stringency can also be increased by addition of agents such as formamide. Hybridization specificity may be evaluated by comparison of hybridization to the test probes with hybridization to the various controls that can be present (e.g., expression level control, normalization control, mismatch controls, etc.).


[0036] As used herein, the term “stringent conditions” refers to conditions under which a probe will hybridize to a complementary control nucleic acid, but with only insubstantial hybridization to other sequences. Stringent conditions are sequence-dependent and will be different under different circumstances. Longer sequences hybridize specifically at higher temperatures. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.


[0037] Typically, stringent conditions will be those in which the salt concentration is at least about 0.01 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.


[0038] In general, there is a tradeoff between hybridization specificity (stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed at the highest stringency that produces consistent results and that provides a signal intensity greater than approximately 10% of the background intensity. Thus, in a preferred embodiment, the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above that the hybridization pattern is not appreciably altered and which provides adequate signal for the particular oligonucleotide probes of interest.


[0039] The “percentage of sequence identity” or “sequence identity” is determined by comparing two optimally aligned sequences or subsequences over a comparison window or span, wherein the portion of the polynucleotide sequence in the comparison window may optionally comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical residue (e.g., nucleic acid base or amino acid residue) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Percentage sequence identity when calculated using the programs GAP or BESTFIT (see below) is calculated using default gap weights. Sequences corresponding to the control genes of the invention may comprise at least about 70% sequence identity to those sequences identified by GenBank Accession Nos. in Table 1, preferably about 75%, 80% or 85% sequence identity, or more preferably, about 90%, 95% or more sequence identity.


[0040] Homology or identity is determined by BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed by the programs blastp, blastn, blastx, tblastn and tblastx (Karlin et al. (1990), Proc Natl Acad Sci USA 87:2264-2268 and Altschul (1993), J Mol Evol 36:290-300, fully incorporated by reference) which are tailored for sequence similarity searching. The approach used by the BLAST program is first to consider similar segments between a query sequence and a database sequence, then to evaluate the statistical significance of all matches that are identified and finally to summarize only those matches which satisfy a preselected threshold of significance. For a discussion of basic issues in similarity searching of sequence databases, see Altschul et al. (1994), Nat Genet 6: 119-129, which is fully incorporated by reference. The search parameters for histogram, descriptions, alignments, expect (i.e., the statistical significance threshold for reporting matches against database sequences), cutoff, matrix and filter are at the default settings. The default scoring matrix used by blastp, blastx, tblastn, and tblastx is the BLOSUM62 matrix (Henikoff et al. (1992), Proc Natl Acad Sci USA 89:10915-10919, fully incorporated by reference). Four blastn parameters were adjusted as follows Q=10 (gap creation penalty) R=10 (gap extension penalty); wink=1 (generates word hits at every winkth position along the query); and gapw=16 (sets the window width within which gapped alignments are generated). The equivalent Blastp parameter settings were Q=9; R=2; wink=1; and gapw=32. A Bestfit comparison between sequences, available in the GCG package version 10.0, uses DNA parameters GAP=50 (gap creation penalty) and LEN=3 (gap extension penalty) and the equivalent settings in protein comparisons are GAP=8 and LEN=2.


[0041] As used herein, a “probe” or “oligonucleotide probe” is defined as a nucleic acid, capable of binding to a nucleic acid sample or complementary control gene nucleic acid through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, U, C or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in probes may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, probes may be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.


[0042] Probe arrays may contain at least two or more oligonucleotides that are complementary to or hybridize to one or more of the control genes described herein. Such arrays may also contain oligonucleotides that are complementary or hybridize to at least about 2, 3, 5, 7, 10, 50, 100 or more the genes described herein. Any solid surface to which oligonucleotides or nucleic acid sample can be bound, either directly or indirectly, either covalently or non-covalently, can be used. For example, solid supports for various hybridization assay formats can be filters, polyvinyl chloride dishes, silicon or glass based chips, etc. Glass-based solid supports, for example, are widely available, as well as associated hybridization protocols (see, e.g., Beattie, WO 95/11755).


[0043] A preferred solid support is a high density array or DNA chip. This contains an oligonucleotide probe of a particular nucleotide sequence at a particular location on the array. Each particular location may contain more than one molecule of the probe, but each molecule within the particular location has an identical sequence. Such particular locations are termed features. There may be, for example, 2, 10, 100, 1000, 10,000, 100,000, 400,000, 1,000,000 or more such features on a single solid support. The solid support, or more specifically, the area wherein the probes are attached, may be on the order of a square centimeter.


[0044] 1. Dot Blots


[0045] The control genes listed in Table I and methods of the present invention may be utilized in numerous hybridization formats such as dot blots, dipstick, branched DNA sandwich and ELISA assays. Dot blot hybridization assays provide a convenient and efficient method of rapidly analyzing nucleic acid samples in a sensitive manner. Dot blots are generally as sensitive as enzyme-linked immunoassays. Dot blot hybridization analyses are well known in the art and detailed methods of conducting and optimizing these assays are detailed in U.S. Pat. Nos. 6,130,042 and 6,129,828, and Tkatchenko et al. (2000), Biochimica et Biophysica Acta 1500:17-30. Specifically, a labeled or unlabeled nucleic acid sample is denatured, bound to a membrane (i.e., nitrocellulose) and then contacted with unlabeled or labeled oligonucleotide probes. Buffer and temperature conditions can be adjusted to vary the degree of identity between the oligonucleotide probes and nucleic acid sample necessary for hybridization.


[0046] Several modifications of the basic dot blot hybridization format have been devised. For example, reverse dot blot analyses employ the same strategy as the dot blot method, except that the oligonucleotide probes are bound to the membrane and the nucleic acid sample is applied and hybridized to the bound probes. Similarly, the dot blot hybridization format can be modified to include formats where either the nucleic acid sample or the oligonucleotide probe is applied to microtiter plates, microbeads or other solid substrates.


[0047] 2. Membrane-Based Formats


[0048] Although each membrane-based format is essentially a variation of the dot blot hybridization format, several types of these formats are preferred. Specifically, the methods of the present invention may be used in Northern and Southern blot hybridization assays. Although the methods of the present invention are generally used in quantitative nucleic acid hybridization assays, these methods may be used in qualitative or semi-quantitative assays such as Southern blots, in order to facilitate comparison of blots. Southern blot hybridization, for example, involves cleavage of either genomic or cDNA with restriction endonucleases followed by separation of the resultant fragments on a polyacrylamide or agarose gel and transfer of the nucleic acid fragments to a membrane filter. Labeled oligonucleotide probes are then hybridized to the membrane-bound nucleic acid fragments. In addition, intact cDNA molecules may also be used, separated by electrophoresis, transferred to a membrane and analyzed by hybridization to labeled probes. Northern analyses, similarly, are conducted on nucleic acids, either intact or fragmented, that are bound to a membrane. The nucleic acids in Northern analyses, however, are generally RNA.


[0049] 3. Arrays


[0050] Any microarray platform or technology maybe used to produce gene expression data that may be normalized with the control genes and methods of the invention. Oligonucleotide probe arrays can be made and used according to any techniques known in the art (see for example, Lockhart et al., (1996), Nat Biotechnol 14: 1675-1680; McGall et al. (1996), Proc Natl Acad Sci USA 93:13555-13460). Such probe arrays may contain at least one or more oligonucleotides that are complementary to or hybridize to one or more of the nucleic acids of the nucleic acid sample and/or the control genes of Table 1. Such arrays may also contain oligonucleotides that are complementary or hybridize to at least about 2, 3, 5, 7, 10, 25, 50, 100, 500 or more of the control genes listed in Table 1.


[0051] Control oligonucleotide probes of the invention are preferably of sufficient length to specifically hybridize only to appropriate, complementary genes or transcripts. Typically the oligonucleotide probes will be at least about 10, 12, 14, 16, 18, 20 or 25 nucleotides in length. In some cases longer probes of at least 30, 40, or 50 nucleotides will be desirable. The oligonucleotide probes of high density array chips include oligonucleotides that range from about 5 to about 45, or 5 to about 500 nucleotides, more preferably from about 10 to about 40 nucleotides, and most preferably from about 15 to about 40 nucleotides in length. In other particularly preferred embodiments, the probes are 20 or 25 nucleotides in length. In another preferred embodiment, probes are double- or single-stranded DNA sequences. The oligonucleotide probes are capable of specifically hybridizing to the control gene nucleic acids in a sample.


[0052] One of skill in the art will appreciate that an enormous number of array designs comprising control probes of the invention are suitable for the practice of this invention. The high density array will typically include a number of probes that specifically hybridize to each control gene nucleic acid, e.g. mRNA or cRNA (see WO 99/32660 for methods of producing probes for a given gene or genes). Assays and methods comprising control probes of the invention may utilize available formats to simultaneously screen at least about 100, preferably about 1000, more preferably about 10,000 and most preferably about 500,000 or 1,000,000 different nucleic acid hybridizations.


[0053] The methods and control genes of this invention may also be used to normalize gene expression data produced using commercially available oligonucleotide arrays that contain or are modified to contain control gene probes of the invention. A preferred oligonucleotide array may be selected from the Affymetrix, Inc. GeneChipg series of arrays which include the Human Genome Focus Array, Human Genome U133 Set, Human Genome U95 Set, HuGeneFL Array, Human Cancer Array, HuSNP Mapping Array, GenFlex Tag Array, p53 Assay Array, CYP450 Assay Array, Rat Genome U34 Set, Rat Neurobiology U34 Array, Rat Toxicology U34 Array, Murine Genome U74v2, Murine 11K Set, Yeast Genome S98 Array, E. coli Antisense Genome Array, E. coli Genome Array (Sense), Arabidopsis ATH1 Genome Array, Arabidopsis Genome Array, P. aeruginosa Genome Array and B. subtilis Genome Array. In another embodiment, an oligonucleotide array may be selected from the Motorola Life Sciences and Amersham Pharmaceuticals CodeLink Bioarray System microarrays, including the UniSet Human 20K I, Uniset Human I, ADME-Rat, UniSet Rat I and UniSet Mouse I, or from the Motorola Life Sciences eSensor™ series of microarrays.


[0054] 4. RT-PCR


[0055] The control genes and methods of the invention may be used in any type of polymerase chain reaction. A preferred PCR format is reverse transciptase polymerase chain reaction (RT-PCR), an in vitro method for enzymatically amplifying defined sequences of RNA (Rappolee et al. (1988), Science 241: 708-712) permitting the analysis of different samples from as little as one cell in the same experiment (see “RT-PCR: The Basics,” Ambion, www.ambion.com/techlib/basics/rtpcr/index.html; PCR, M. J. McPherson and S. G. Moller, BIOS Scientific Publishers, Oxfordshire, England, 2000; and PCR Primer: A Laboratory Manual, Dieffenbach et al., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1995, for review). One of ordinary skill in the art may appreciate the enormous number of variations in RT-PCR platforms that are suitable for the practice of the invention, including complex variations aimed at increasing sensitivity such as semi-nested (Wasserman et al. (1999), Mol Diag 4:21-28), nested (Israeli et al. (1994), Cancer Res 54:6303-6310; Soeth et al. (1996), Int J Cancer 69:278-282), and even three-step nested (Funaki et al. (1997), Life Sci 60:643-652; Funaki et al. (1998), Brit J Cancer 77:1327-1332).


[0056] In one embodiment of the invention, separate enzymes are used for reverse transcription and PCR amplification Two commonly used reverse transcriptases, for example, are avian myeloblastosis virus and Moloney murine leukaemia virus. For amplification, a number of thermostable DNA-dependent DNA polymerases are currently available, although they differ in processivity, fidelity, thermal stability and ability to read modified triphosphates such as deoxyuridine and deoxyinosine in the template strand (Adams et al. (1994), Bioorg Med Chem 2:659-667; Perler et al. (1996), Adv Prot Chem 48:377-435). The most commonly used enzyme, Taq DNA polymerase, has a 5′-3′ nuclease activity but lacks a 3′-5′ proofreading exonuclease activity. When fidelity is required, proofreading exonucleases such as Vent and Deep Vent (New England Biolabs) or Pfu (Stratagene) may be used (Cline et al. (1996), Nucl Acids Res 24:3456-3551). In another embodiment of the invention, a single enzyme approach maybe used involving a DNA polymerase with intrinsic reverse transcriptase activity, such as Thermus thermophilus (Tth) polymerase (Bustin (2000), J Mol Endo 25:169-193). A skilled artisan may appreciate the variety of enzymes available for use in the present invention.


[0057] The methodologies and control gene primers of the present invention may be used, for example, in any kinetic RT-PCR methodology, including those that combine fluorescence techniques with instrumentation capable of combining amplification, detection and quantification (Orlando et al. (1998), Clin Chem Lab Med 36:255-269). The choice of instrumentation is particularly important in multiplex RT-PCR, wherein multiple primer sets are used to amplify multiple specific targets simultaneously. This requires simultaneous detection of multiple fluorescent dyes. Accurate quantitation while maintaining a broad dynamic range of sensitivity across mRNA levels is the focus of upcoming technologies, any of which are applicable for use in the present invention. Preferred instrumentation may be selected from the ABI Prism 7700 (Perkin-Elmer Applied Biosystems), the Lightcycler (Roche Molecular Biochemicals) and icycler Thermal Cycler. Featured aspects of these products include high-throughput capacities or unique photodetection devices.


[0058] Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, practice the methods and use the control genes of the present invention. The following examples therefore, specifically point out the preferred embodiments of the present invention, and are not to be construed as limiting in any way the remainder of the disclosure.







EXAMPLES


Example 1


Selection of Control Genes

[0059] The control genes were selected by querying a Gene Logic rat tissue database to create expression profiles from a variety of rat cell and tissue samples.


[0060] This database was produced from data derived from screening various cell or tissue samples using an Affymetrix rat GeneChip® set. The rat cell and tissue samples that were analyzed include those that were not treated at all and that can be referred to as “normal,” as they represent the laboratory rat population that has not been manipulated outside of normal daily activity within that setting. In general, tissue and cell samples were processed following the Affymetrix GeneChip® Expression Analysis Manual. Frozen tissue or cells were ground to a powder using a Spex Certiprep 6800 Freezer Mill. Total RNA was extracted with Trizol (GibcoBRL), according to the manufacturer's protocol. The total RNA yield for each sample was 200-500 μg per 300 mg cells. mRNA was isolated using the Oligotex mRNA Midi kit (Qiagen) followed by ethanol precipitation. Double stranded cDNA was generated from mRNA using the SuperScript Choice system (GibcoBRL). First strand cDNA synthesis was primed with a T7-(dT24) oligonucleotide. The cDNA was phenol-chloroform extracted and ethanol precipitated to a final concentration of 1 μg/ml. From 2 μg of cDNA, cRNA was synthesized using Ambion's T7 MegaScript in vitro Transcription Kit.


[0061] To biotin label the cRNA, nucleotides Bio-11-CTP and Bio-16-UTP (Enzo Diagnostics) were added to the reaction. Following a 37° C. incubation for six hours, impurities were removed from the labeled cRNA following the RNeasy Mini kit protocol (Qiagen). cRNA was fragmented (fragmentation buffer consisting of 200 mM Tris-acetate, pH 8.1, 500 mM KOAc, 150 mM MgOAc) for thirty-five minutes at 94° C. Following the Affymetrix protocol, 55 μg of fragmented cRNA was hybridized on an Affymetrix Rat Genome U34 array set for twenty-four hours at 60 rpm in a 45° C. hybridization oven. The chips were washed and stained with Streptavidin Phycoerythrin (SAPE) (Molecular Probes) in Affymetrix fluidics stations. To amplify staining, SAPE solution was added twice with an anti-streptavidin biotinylated antibody (Vector Laboratories) staining step in between. Hybridization to the probe arrays was detected by fluorometric scanning (Hewlett Packard Gene Array Scanner). Following hybridization and scanning, the chips were analyzed for quality control, looking for major chip defects or abnormalities in hybridization signal. After the chips passed quality control, data were analyzed using Affymetrix GeneChip® version 3.0 and Expression Data Mining Tool (EDMT) software (version 1.0), S-Plus, and the GeneExpresss software system. Microarrays were scanned on a high photomultiplier tube (PMT) settings.


[0062] To prepare tissue samples from animals, e.g., rats, sterile instruments were used to sacrifice the animals, and fresh and sterile disposable instruments were used to collect tissues. Gloves were worn at all times when handling tissues or vials. All tissues were collected and frozen within approximately 5 minutes of the animal's death. The liver sections and kidneys were frozen within approximately 3-5 minutes of the animal's death. The time of euthanasia, an interim time point at freezing of liver sections and kidneys, and time at completion of necropsy were recorded. Tissues were stored at approximately −80° C. or perserved in 10% neutral buffered formalin. Tissues were collected and processed as follows.


[0063] Liver


[0064] 1. Right medial lobe—snap frozen in liquid nitrogen and stored at −80° C.


[0065] 2. Left medial lobe—Preserved in 10% neutral-buffered formalin (NBF) and evaluated for gross and microscopic pathology.


[0066] 3. Left lateral lobe—snap frozen in liquid nitrogen and stored at −80° C.


[0067] Heart—A sagittal cross-section containing portions of the two atria and of the two ventricles was preserved in 10% NBF. The remaining heart was frozen in liquid nitrogen and stored at −80° C.


[0068] Kidneys (both)


[0069] 1. Left—Hemi-dissected; half was preserved in 10% NBF and the remaining half was frozen liquid nitrogen and stored at −80° C.


[0070] 2. Right—Hemi-dissected; half was preserved in 10% NBF and the remaining half frozen in liquid nitrogen and stored at −80° C.


[0071] Testes (both)—A sagittal cross-section of each testis was preserved in 10% NBF. The remaining testes were frozen together in liquid nitrogen and stored at −80° C.


[0072] Brain (whole)—A cross-section of the cerebral hemispheres and of the diencephalon was preserved in 10% NBF, and the rest of the brain was frozen in liquid nitrogen and stored at −80° C.


[0073] Gene expression data were then analyzed to identify those genes that were consistently expressed across a set of about 5,000 different tissue samples, e.g., being called Present more than 95% of the time. For each of these samples, the mean average difference, standard deviation and CV were determined for each Affymetrix fragment on the rat U34 GeneChip®. The data were sorted by CV, and those gene fragments with values less than 40% were chosen for further analysis. Table 1 provides a list of approximately 858 genes with a coefficient of variation less than 0.44 and whose expression is considered not to vary across the normal and treated samples studied. For each gene listed, Table 1 also provides a GenBank Accession No., a Present frequency value, a mean expression level value and a coefficient of variation, expressed as CV. The GenBank Accession Nos. can be used to locate the publicly available sequences, each of which is herein incorporated by reference in its entirety as of the priority date of this application (Jul. 30, 2002).



Example 2


Quantitative PCR Analysis of Expression Levels Using the Control Genes

[0074] The expression levels of one or more genes listed in Table 1 may be used to normalize gene expression data produced using quantitative PCR analysis. For example, the sequences may be used as Taqman® probes, along with the forward and reverse primers for a gene in Table 1. Real time PCR detection may be accomplished by the use of the ABI PRISM 7700 Sequence Detection System. The 7700 measures the fluorescence intensity of the sample each cycle and is able to detect the presence of specific amplicons within the PCR reaction. The TaqMan® assay provided by Perkin Elmer may be used to assay quantities of RNA. The primers may be designed from each of the genes identified in Table 1 using Primer Express, a program developed by PE to efficiently find primers and probes for specific sequences. These primers may be used in conjunction with SYBR green (Molecular Probes), a nonspecific double-stranded DNA dye, to measure the expression level of mRNA corresponding to the expression levels of each gene. This gene expression data may then be used to normalize gene expression data of other test genes.


[0075] Although the present invention has been described in detail with reference to examples above, it is understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the following claims. All cited patents and publications referred to in this application are herein incorporated by reference in their entirety.
1TABLE 1PresentGenBank No.FrequencyAdjusted MeanCVNM_0571410.9621353.73029490.394573166AA8003640.9921538.74772020.35144586 AA8005010.9874200.4644310.271392863AA8010510.9946594.99342880.327732429AA8014420.9937472.45989820.314834757AA8482380.9516341.6609870.288832086AA8492620.9846210.5203060.37721446 AA9441270.9522178.1057280.384011401NM_0319810.994 328.16891550.389666675NM_0319810.998 384.21809160.31917883 NM_0193520.9964577.42455020.233685612AB0088070.9906375.35831610.389113636NM_0192130.9959167.71055570.33117565 NM_0313310.9928299.56686870.388714827NM_0191910.972567.901779150.330844052NM_0535270.9608151.05020460.316326745AF0039260.9947331.1514050.243708921NM_0316560.962470.88581160.349709856NM_0535530.9656136.67856340.375165396NM_0535560.9816193.24948120.365428046NM_0192010.9993800.76651290.383779573NM_0225360.992 637.69989970.349019258NM_0317490.9833260.9692880.379520142AF0931390.9955164.58112470.291070695NM_0534670.9967645.0327580.312688813NM_0192220.9959241.17154550.306923163NM_0537070.9954278.29688870.363812568NM_0571430.9969491.57011440.377464693NM_0571410.99 334.70991860.372595428NM_0172840.9991469.01154610.37489808 NM_0537430.9986599.92537540.318539549NM_0316030.9961531.17138730.3948861310379340.9794212.29952690.267710366NM_0225980.9898150.31468040.392695104NM_0225980.9978410.43866980.355948912NM_0130760.0736249.87574240.364628324NM_0193170.973483.519737770.373249306NM_0172360.9867671.58919640.360897244H329780.997 435.23988280.300833768NM_0310900.961970.315285750.399983405NM_0572090.9864257.50447480.332226154NM_0125000.9895150.65228090.302162035K028160.9981382.64213880.334260892NM_0225180.99 575.22874930.267122194NM_0311290.9986672.6868730.268407165NM_0126390.9592157.94594250.307400434NM_0319740.9978616.42787390.361748118NM_0131770.9988787.16411470.381937146NM_0171010.99751067.8965410.347227639M577280.9729108.39733580.368600327AA6846410.9692132.91067690.312063584AA7992790.9991839.30571420.325266583AA7992790.9947568.15834620.347844769AA7995420.9924273.58361870.370208871AA7995500.9973470.93840470.370333288AA7996090.9912134.12953180.334268614AA7996410.9966276.41441250.307718893AA7996540.9981296.19417250.351166278AA7996670.9908248.92779670.291789627AA7997210.9629114.48385340.373755794AA7997350.9644126.84777160.292430382AA7997350.9813137.50324870.318140687AA7998220.9906162.75686310.360262563NM_0330960.9941225.05464610.319505767AA8000150.9972384.35361350.289608893AA8000390.9906354.19010130.287620068AA8000530.9898129.52136750.37530915 AA8001700.967575.96290530.355273922AA8001980.9639159.21055780.295644976AA8002100.9821105.03303790.370992795NM_0130060.9898237.30416360.391423327AA8002680.976 166.47686230.340868372AA8006510.9912400.53747770.330167434AA8006690.9949426.01645270.393889597AA8007870.9874149.01040150.379600998AA8008140.9525109.0585370.389571008AA8011300.9957263.92455320.38007823 AA8011760.989 325.55645120.295890207AA8012300.9972567.30711480.389999402NM_0320570.9955179.39529180.296133732AA8177690.9941185.05900310.323946319AA8178450.9951380.10190290.242937824NM_0536820.9941267.50287470.372604104AA8178920.9828305.83213610.370745167AA8179070.9916285.46641540.323851183AA8179450.99791077.8473090.363411979AA8179670.9943296.898260.323986488AA8181180.9951324.90411450.349051053PA8181290.99 187.6227710.301614267NM_1304050.9909169.66950170.325635921AA8182030.9931173.38890320.372430825AA8182460.9878423.07002330.395146083AA8185340.9927259.40590440.283803337AA8185680.977279.083537030.299639563AA8186690.9928330.93127210.317467573AA8186970.9979645.18753120.274054292AA8187780.9964324.61273550.343055107AA8187880.988 128.40936710.374028119NM_0199070.993 178.41233380.361138088AA8190570.9974559.60635820.246931544AA8191190.962191.251277070.380140187AA8192240.9812135.23269290.383447682NM_0317450.9853165.14177530.394406392AA8193180.9527212.48317190.361885928AA8193620.9862154.09220990.361381365AA8193640.9933282.81860950.260938603AA8193670.991 135.74153990.34405043 AA8194000.9886135.31392070.345055159AA8194680.9986320.30624920.334844865AA8194710.9678102.55599070.342984629AA8194870.9736138.43499050.345244474AA8196910.9941431.89446480.382341107AA8196940.9714103.40995250.372870205AA8197290.9931289.96160280.32509288 AA8197610.987 240.3288630.353298909AA8197980.9961412.85911640.38513809 AA8197980.9977543.38937270.333884343AA8484040.9812470.9737660.353247324NM_1333200.9901374.01955780.370573918AA8486740.9709198.57099180.268430905AA8486960.9584113.96254720.393992922AA8489670.9551371.10029020.378302744AA8490920.9941265.20553440.361330207AA8493120.9896261.65607290.302612314AA8495310.9977452.39762720.258409286AA8497150.9955427.40129340.324344933AA8497210.9759429.62554160.3397144 AA8497570.991 728.75501030.385766928AA8497660.9905338.28372760.386484508AA8497670.9892686.4737220.370408924AA8497740.9543199.01783660.290368511AA8497880.9899277.09791590.302937581AA8498090.9847213.06999480.395797677AA8499520.9836249.68298940.317677787AA8499540.952386.970702670.337457286AA8499650.9908209.86707760.346431239AA8501170.9611228.53061010.38524339 AA8504510.9922427.55170040.379543087AA8504800.9932408.67636310.386935606AA8505250.9702335.29640340.260647679AA8505290.9771383.94247290.372557101AA8505350.994 484.84632650.306355053AA8505500.9955538.42476950.220155894AA8505690.9969539.46611920.369801777AA8506240.966 114.07408150.388810835AA8506660.9933372.25468010.294655078AA8507540.9731100.3076080.376838798AA8508940.9917442.46653580.273072964AA8509070.9935365.95637430.326461416AA8511610.9968448.01185510.319626206AA8512020.9808200.02562890.355469087AA8512140.9914523.35494560.295434486AA8512510.996 296.89623870.303746  AA8513470.9982438.40968150.299495219AA8513760.9982499.5526330.311653073AA8513970.9651325.9605270.365291903AA8514050.9773114.68403330.327907652AA8514390.962 229.27051150.326726191AA8514640.9918267.89016250.284156685AA8516410.979 179.84678660.376196046NM_1333240.9766178.86895840.271247953AA8516860.9954300.54529930.237099425AA8517010.9737134.33445690.344149947AA8517280.9785252.46679120.323521957AA8517390.9553206.43289050.300390557AA8517650.97 828.68590180.305562042AA8518730.9721329.86495680.359666828AA8518830.977 185.26280580.329071104AA8519090.9906206.17244230.31357168 AA8519200.9779208.76727390.374968241AA8519380.9938392.31785410.326446676AA8584570.9843168.4267780.309690174NM_0311530.9815289.80762990.3760569 AA8585510.9893210.23600920.362121119AA8586600.9962252.29576350.34707555 AA8587180.99771028.9843490.399132237AA8588330.9793159.88259660.393184202AA8588670.9648136.17265280.331092316AA8589900.9965870.36279490.365647982AA8591000.9772136.27732690.356567976AA8592010.9978275.6831280.306043339AA8597960.953471.011431760.389965008AA8599190.9881142.76745150.333076793AA8599190.9988667.15373310.292027993AA8663640.9931138.12451740.38210287 AA8663710.9633142.69122040.320447204NM_0243940.9617151.40199160.387696691AA8754310.9929254.1616460.360714718AA8754700.9605291.35697930.273123402AA8754700.9683127.73325420.328086924AA8755520.9933196.75450270.275253333AA8756610.985374.477966320.335756096NM_0537390.9971223.26432350.343236121AA8915460.972 65.305020260.399473194AA8917170.9819135.79703980.291717192AA8917420.9844120.16134090.361978153AA8917460.9946310.88279480.393391849AA8918100.9808328.07278330.30475391 AA8918100.9858152.51907590.339434397AA8919020.970251.591422230.349786011AA8919350.988 233.00247190.270449187AA8921200.979160.978007310.373214916AA8923130.9913107.21596270.389027092AA8923940.9971221.33452670.386151078AA8923940.9952129.19946870.397846098AA8924220.975 185.01943160.252043297AA8925050.9941256.16747940.272080589AA8925500.9702118.63619730.377480727AA8927890.986 236.30267270.297799259AA8927910.9855175.63112430.325854158AA8927960.9973621.51109270.313521756AA8928140.9959421.71652880.32340475 AA8932240.9918129.29661350.319716751AA8933530.9939289.29382760.343012426AA8935150.9965268.16931340.293905313AA8936410.9655124.6315130.343387848AA8936830.983 87.448821960.347534606AA8937410.9921192.25337240.269508199AA8938110.973684.603012160.398222929AA8940990.9813285.1426340.328324892AA8941010.9559102.44714780.332265391AA8941010.9824111.45609650.350773318AA8941310.9766106.3876690.383457001AA8942340.9841236.12649940.284552291AA8942590.9966449.04547630.312420033AA8995460.982 220.59715060.309976207AA8996720.99791656.0758950.319960146AA8996910.9924195.40727330.34078252 AA8997430.99741071.9737860.330354696AA8999110.9976522.94541350.252669377AA8999590.9952492.04461420.395266737AA9000780.9565188.06063340.387115384AA9001560.9857761.52827340.223294586AA9001870.998 471.53019480.257859064AA9003430.9773209.16046360.322304252AA9003480.9502212.75030910.340891055AA9003640.9652162.57034420.284958346AA9004220.9604404.27149950.356451649AA9008600.9869167.39215330.383802956AA9008910.9524149.09806690.315160786AA9009750.9978868.18861080.37200699 AA9012220.9956480.16161030.309579403AA9013650.9995890.26130040.32691327 AA9239920.9576114.25716220.396496378AA9239980.9659225.95032350.280597467AA9240300.989 162.91277270.392729941AA9240790.9821205.18772840.379629317AA9240920.9888318.99073960.228510957AA9241690.9927288.07505620.272550639AA9243170.9867190.50129750.224003653AA9243390.999 1652.6700330.282834255AA9243690.9936365.65616140.261149514AA9245320.9632344.4931830.317913577NM_0310200.960462.910209690.397017329AA9246040.9938318.26244760.354029504AA9246090.9984461.80110670.378571404AA9246540.9809211.28290820.332834334NM_0535550.9507311.27216590.318866213AA9247650.9972324.89285590.224049302AA9247680.9896599.75436730.309926225AA9247870.9896481.05251570.333960186AA9248710.9647148.89776460.393740956AA9251230.9984850.76644420.264758138AA9251520.9946699.03552910.239167455AA9251600.9823190.19166310.373116063AA9252120.9973611.80572860.294395182AA9253040.9902302.28402470.272196777AA9253050.9837411.70955140.285568444AA9253380.9803298.87744640.290626881AA9253400.9991521.84900520.283743847AA9253410.9669241.64363920.360237623AA9254320.9735225.79881510.350901777AA9254730.9959622.43780440.399479916AA9254780.9819341.74127590.372375386AA9256770.9608201.00995720.335202855AA9258540.9842201.38934230.31172844 AA9259790.9878458.1652570.346902976AA9259830.999 599.53841680.391862852AA9260130.981 193.65580060.228102661AA9260980.9965755.1400210.247890637AA9262790.9703258.18515090.330058893AA9263310.9658142.92197080.389925982AA9331580.9771163.61473590.267516634AA9429470.9693175.69909630.375870721AA9430150.9726170.03380350.360085756AA9430150.9858388.24292040.341776907AA9431220.9762419.82067020.320372184AA9432400.9636203.39591790.34869085 AA9432810.9691305.02889640.378822839NM_0129130.9813258.92362460.388849176AA9434210.9632209.31950090.370589173AA9435000.9708209.9193680.281165988AA9435530.9872519.97514250.382535341AA9435530.9966665.56112150.379984839AA9436450.9933581.04113380.381146596AA9437380.9859137.09176460.271120535AA9437660.9957400.04609910.370001453NM_0809090.9928872.53155770.389923883AA9442030.9914400.62445670.350853364AA9443350.9976866.02892080.267386275AA9443470.9626161.30677470.318180188AA9444450.9781222.59781060.373397302AA9444510.9971599.8703250.323030108AA9445280.9954425.86514940.22460926 AA9446350.9801322.84496190.312266733AA9448420.9861299.65227490.237903089AA9450890.9941914.59957690.375028263NM_1332970.994 683.54830470.367321409AA9457400.9612124.64957110.329660581AA9457460.9956408.44845760.33612417 AA9457460.9839255.55244340.348157071NM_1319070.9962370.08782640.339124904AA9458690.9792179.28684310.327315212AA9460040.983 142.70659780.295987651AA9460180.9978786.55923170.233292925AA9460380.9888281.40389760.278216507AA9462050.9976828.25923250.35788087 AA9464320.9924545.6602410.259340094AA9464400.9982570.56885940.245347075AA9551120.9962291.17391640.212582952AA9552400.9973647.71297130.274411246NM_0173590.996 231.46996750.329048264AA9553960.9879243.10468850.255401546AA9555060.9877242.18713790.383670644AA9555360.9823187.91859850.269361535AA9561140.9955143.38421050.398204182AA9561400.9938776.63221840.395628638AA9561850.9823250.42147370.341205084AA9564600.9955399.36038110.336899504AA9569830.9853295.15968610.34098573 AA9569920.9928417.40062810.269624667AA9569920.9976491.74709750.243689773AA9570630.9941391.77478520.319424296AA9574910.988 180.25769660.351832394AA9576490.9924423.46767890.338824147AA9576760.9592429.43188470.396293319AA9577770.9866112.74349870.363054879AA9630720.9709134.52709450.333884657AA9630940.9977606.83338540.328724125AA9631700.987 118.57221270.275144443AA9633670.9977817.72377170.369667036AA9638080.998 669.60772620.382254088AA9640540.9949405.65774010.347892099AA9640640.9888375.26864330.37526759 AA9640820.9956680.46976450.272604335AA9641140.9831753.73154940.29001828 AA9643620.9869145.35353340.33101453 AA9643660.9774316.68699350.283982561AA9646070.9923499.92874890.320740471AA9646240.9538125.80569870.280619534AA9646300.993 389.01629410.283601586AA9646420.9755372.57171090.358580387AA9650730.9802626.02646830.341947106AA9963980.9501142.98545770.356277705AA9965760.9856411.23722920.313642878AA9967970.9889366.8729340.310438476AA9969390.994 369.23566920.339669675AA9969740.9859172.26695720.330914903AA9970520.9625159.38976590.386386551AA9971840.9801277.35374970.296759505NM_0534940.982 300.82105030.361139323AA9979290.9888166.26362070.279152327AA9981580.9521297.13765120.280727643AA9984350.9989878.29226890.231761767AA9985230.9678182.55206590.383305234AA9985560.9802157.50361880.323268276AA9988430.9713200.14155240.351313361AA9988930.9938218.64517180.356607867AI0077430.9885280.31878980.397692815AI0077500.9766224.58122170.29763571 AI0079200.9603128.78925640.362125165AI0079870.9945391.05970950.286018393AI0083720.996 586.57424990.29223874 AI0084230.9933197.72693510.251485242AI0086830.9969590.72945250.250069145AI0087400.9606182.27444270.352124819AI0087740.995 226.91169640.278990202AI0087840.99681245.1909370.28202077 AI0089310.9798214.34023790.369067812AI0089580.99561352.4860470.283940464AI0090790.9974826.13520410.383871133AI0091570.9919213.28222680.364389086AI0092000.9967440.71375780.313553376AI0093500.9998711.41699290.316543066NM_0534160.99882751.8428390.354727984AI0095910.9601115.06637010.297851506AI0096500.9861290.35706160.313940175AI0096550.9884408.82570280.386816933AI0096930.9945704.52081260.38652766 AI0097410.9983502.04419420.347210566AI0097720.9982661.49971570.357510104AI0098190.9833385.9745120.30650183 AI0099360.9982482.98521020.382801193AI0100340.9869175.13258830.357119903AI0103420.9757160.97450040.341899927AI0103620.9879325.14169670.367946604AI0104220.9766133.90967680.348370485AI0104520.99951364.3991470.323158828AI0105180.9897523.38352780.251025801AI0107580.9791159.57854870.342965544AI0109440.9655198.9318890.348891534AI0111480.9888509.75778220.297487627AI0111900.9848244.25959090.263968956AI0113060.9735186.93378550.299628693AI0113390.9878339.33328710.295719531AI0113440.9985808.97081860.242918026AI0115560.9933307.16236570.361656015AI0115710.9913231.10398660.349951837AI0117540.9601357.23055610.259066046AI0117560.9845505.68623630.335393314AI0120270.9905498.94159090.330533351AI0120770.9685183.53046070.28972612 AI0122580.9627232.93191680.397616077AI0122770.9815224.01569560.315069042AI0122850.9969455.21129470.251368311AI0124670.9755247.99795520.393536674AI0125620.9819247.00556480.399176309AI0125670.994 614.01830820.300419775AI0126360.9774244.79548810.343753177AI0126410.99541107.580010.232410874AI0129370.9873231.10425780.299231915AI0129470.9972750.6343750.34167501 AI0129510.9969589.21399830.348304745AI0130240.9941972.69819930.377775478AI0130900.9848294.81006320.307498244AI0130970.9949470.594740.279198054AI0132040.9984974.70281810.387451663AI0133500.9844259.91489550.232030679AI0133630.9974517.43919680.277623342AI0135550.9335.56521870.285524016AI0135640.9686181.89700450.347588318AI0136970.9987911.75142720.330567711NM_0317210.9825322.66696710.384108704AI0138160.9654635.76367970.31640618 AI0138700.9903267.85690070.324199211AI0139460.9985532.85618330.216157143AI0140590.9806391.82866440.306162721AI0289970.9851246.39106010.293805901AI0291100.9941292.51321620.268267155AI0294210.9787293.34962990.204319564AI0294840.9977367.16806930.227864083AI0297330.99871950.5413120.296475897AI0297370.9916520.4388010.25582686 AI0301470.9894307.98555380.350690338AI0301920.9753234.13263850.387161574AI0302480.9969701.84595090.250558161AI0304300.9955511.35441520.329914711AI0307510.9565233.32355680.269114645AI0307990.9962490.6489010.386351868AI0309070.9939322.8011280.367771191AI0310350.9978352.46111730.295253508AI0441120.9956355.45961710.308803476NM_0538640.9816329.95274690.325582005AI0447270.9941399.15415490.24747723 NM_0225950.9749558.64766660.376778078AI0448630.9792231.34004970.328631874AI0448720.9738126.35529630.319073417AI0450030.9781302.16758060.367003326NM_0533470.9792425.67071060.262323344AI0454580.9987834.24788720.281073528AI0455970.9795236.78723390.362316181AI0457740.9984597.52324020.361381584Al0458100.9758119.84427760.37446961 AI0583730.9714166.53056510.310968224AI0589630.9795226.90158040.318390827AI0589720.9639163.47534350.314357814AI0594280.9631337.53264090.343346037AI0597620.9628101.85332710.350501189AI0601320.9923869.12052180.296469343AI0601960.9987652.58302420.247507825AI0602220.9954274.20122570.299764113AI0700700.9762237.88184110.32422811 AI0701530.9524241.70173980.274003857AI0701760.9983508.3105410.347627491AI0703990.9921240.27387810.320172095AI0712430.9775164.56411090.375047226AI0717730.9948176.26875150.362438965AI0719460.9964224.31850870.299033252AI0720810.9983581.69990550.319003258AI0721210.9872250.89231040.328781873AI0725550.9637103.77730540.269538712AI0726660.9947470.24364460.322438521AI0726750.9957324.39572170.378467368AI0728850.9812128.31807670.328993951AI0730300.997 529.55993980.226307567AI0731180.9816132.81957890.330656304AI0731930.9983387.05324340.314540172AI0732150.9879422.90323970.327263785AI0732600.999 990.06588940.383685551NM_0535690.9896203.35125670.226791767AI1011810.974197.179894130.28235925 AI1012220.993 312.1496850.31926142 AI1013750.9933415.47878770.391617213AI1013950.9725160.73637460.388039479AI1014380.970964.452396290.38983282 AI1014600.9974399.51437450.355650986AI1016590.9988627.05230450.329943  AI1018640.99831112.0104610.276584797AI1019340.973 160.60407660.331698393AI1020460.9785187.08086490.28794593 AI1020800.9882291.2394550.268077727AI1021910.9768160.22398910.276862462AI1022520.9936186.99710170.289805898NM_0534360.973 227.22971470.385866717AI1024380.9956321.38972370.394869448AI1026120.9975571.38001410.289213412AI1027340.9938530.66402010.385875553AI1029350.9884324.81648740.3850101 AI1029780.9903147.71012050.381554081AI1029910.998 389.64949340.211087424AI1030940.9979873.94862490.282128391AI1031290.9774740.6123510.322658411NM_0311460.9852438.01552620.279684593AI1033770.9844226.96374360.305656574AI1033790.9981528.3628490.191923868AI1034280.9832141.45272730.375933874AI1035210.9907384.70842690.310270528AI1037170.95 277.6749250.301083424AI1037180.998 991.30564250.220129724AI1038480.9836146.29105910.304856826AI1039500.968485.78136180.38997044 AI1039540.9965345.23011780.370824066AI1042310.9752132.44946520.325845881AI1042340.9786374.74823620.356449801AI1042390.979 125.32047020.332461387AI1042470.9691212.50647120.340422775AI1042500.9641219.03547550.325898084AI1042830.9922289.71607160.266096751AI1043200.9906303.23739490.248088041AI1043880.9505156.66735080.329438846AI1044880.9672117.80884650.229489312AI1045360.99861025.5362720.284465579NM_0225180.9936711.89998910.304763664AI1046000.9521122.28634590.349242431AI1047530.9524326.70323460.399036819AI1048640.9868376.82587890.356488841AI1048780.9972438.5059440.358693351AI1049140.9956199.50442510.268591505NM_0807810.9663107.75132810.335662126AI1050720.9972395.97830730.387243663NM_0572050.9978283.31319560.293441255AI1050870.9933515.11040670.352504039AI1051490.9983911.53926650.263156658AI1052650.9538217.28377410.341799777AI1053450.9861155.84607450.364213518AI1053520.9938141.90751670.361489718AI1054310.998 356.68413750.378020827AI1116830.9915184.20536280.241117848AI1119750.999 192.35601480.3351647 AI1120920.9954269.30739340.349380313AI1122500.9968653.29353030.378593708AI1125120.959875.402702660.386823108AI1130200.9844232.08388050.311528728AI1362310.9537132.46051030.376730451AI1365640.9761278.63183780.35005281 AI1366690.9958518.51940870.351974463AI1372320.9988469.61584460.366380187AI1372980.9923230.65094960.270580577AI1375820.9799135.34982940.397497765AI1380020.9942219.68291040.24859447 AI1446570.9923129.44799510.312677319AI1446680.9909297.12736830.314893774AI1449560.9904207.75463160.264521312AI1453320.9538129.02425130.37788731 AI1453620.9719120.38021370.339227369AI1453680.9917254.0511090.364636436AIl456140.9969441.54372870.356577697AI1456270.9917327.56087570.302478299AI1458530.9823133.55295820.319436187AI1460340.9925154.62752290.324965874AI1460370.9941168.1346470.275451237AI1460900.9967305.29872010.284411247AI1461700.9944210.50299250.228534495AI1689330.9927219.30205580.278034578AI1689500.99 318.0105110.362293659AI1689740.9754214.40039910.377276222AI1689790.9801212.27298090.331980427AI1689860.9746156.79797160.360198438AI1690630.9964290.67802520.385981295AI1691540.9968336.52422840.307023873AI1691700.9979769.78785410.398738752AI1692690.977 137.3571140.35503002 AI1692720.969680.831402520.339825829AI1693430.9727166.9897640.268576719AI1693770.9889180.02980190.390844085AI1694610.9973866.20810390.336846289AI1696110.9986503.46381090.36365031 AI1696150.9985663.36042150.326765274AI1696410.9962363.8283760.277445228AI1696420.9856143.42586460.275093398AI1702470.9568102.16255180.31850578 AI1702650.9961361.18794510.39819102 AI1703570.9719133.70806410.281598384AI1703880.9935162.56940810.354744306AI1704000.950875.045340080.377930524AI1704140.9824281.12404320.292176861AI1705320.9979325.76233780.256357803AI1706630.9919340.06257680.39841173 NM_0320790.9912212.79653040.383477936AI1707800.9978403.73548890.31491612 AI1707970.9898362.01049560.367765173AI1708070.9943244.36975280.250841845AI1708210.9835115.65151350.35887866 AI1712120.9978775.90226830.275974842AI1712300.971969.496217620.348752498AI1712320.9996746.09040060.390223181AI1712720.9961584.79448740.273294529AI1712730.9838409.65692960.341742937AI1713140.9894690.41767350.399646269AI1713450.9857121.90088990.300431813NM_0308360.9942222.05176030.339780512AI1715610.9974913.08788630.23190465 NM_0192080.9812125.35004820.338940828AI1716610.9675108.26016240.280616401AI1717370.9904251.80428640.37508985 AI1717640.9973487.41624730.277969318AI1717680.9941333.69682050.399552284AI1717810.9916195.23292850.325496907AI1717830.9935277.37220530.390225646AI1717980.951196.822129970.357869848AI1718090.9786121.09323390.375796802AI1718700.9849205.06614440.333133061AI1718820.9965253.51763120.301825594AI1719510.991 200.01564820.247113526AI1719520.9979575.45561910.295443738AI1719530.9927553.61069970.357140612AI1720010.982 118.91826180.358781789AI1720690.957955.271895980.301195  AI1720740.9837135.63361790.35943329 AI1720920.9622108.33226890.317645185AI1721050.9964431.88046550.3638466 AI1721060.955984.18573010.340208075AI1721960.9848219.35750940.331715935AI1722140.9946416.0722140.309679658AI1722180.9678136.64342570.298583323AI1723010.9895280.46774980.327001975AI1723580.9609229.837190.287010264AI1724720.9882178.88986370.356223766AI1725370.9762126.37434110.365833038AI1750010.965961.521598270.398114591AI1750080.9927259.70408260.362558835AI1750440.9575219.38012030.389920735AI1752660.9973335.30953110.26393186 AI1753660.9878219.40677530.316098431AI1754670.99741050.9531110.364080843AI1754770.9975658.99957810.339519262AI1755120.999 1013.0506730.248961012AI1755470.959986.619516320.316920617NM_0539690.9975342.2075060.23220273 AI1759910.973593.669911740.324717316AI1760160.9896118.34078240.35524637 AI1761210.99841070.601590.328798698AI1761400.99851167.5680180.301694674AI1763040.9927123.81672390.335661085AI1763080.9965366.19480250.338165832AI1763090.954286.007379840.344481111AI1763560.9946109.38216590.389383607AI1764010.9844124.77465690.350696016AI1764200.9925201.3971610.350564698AI1764910.9919403.62173640.372574341AI1765110.9689113.83076920.39779294 AI1765810.9974319.33646590.297959615NM_0316030.9949216.35616190.362669512AI1766800.9875447.79280970.319707122AI1767000.9947219.78530670.396439967AI1767240.9903209.97254550.302455154AI1770250.998 610.22107840.281843657AI1771040.9826112.37180130.36436637 NM_1308230.98671029.213640.382011417AI1772750.9552151.49796720.387740506AI1772850.992 464.79127680.382919686NM_0533230.99881040.3201820.36489234 AI1774910.9963259.44597050.30966825 AI1775130.9925309.62268660.340314119AI1775900.9662136.19938350.305817502AI1775930.9972754.54785230.336073841NM_0537980.9932228.77765680.379213177NM_0225930.9964231.74725960.352931313AI1777650.9947242.14441790.376897727AI1778660.9944235.30416120.359760014AI1778710.9978493.02047810.36528958 AI1778730.9732144.57823930.323229199AI1778750.9749169.14419770.327244948AI1778940.9978381.74931320.277235768AI1779020.978 266.84743970.364559195AI1779190.9746156.46552330.307400879AI1779210.9989357.89007520.231216519AI1780520.9942210.99198050.3269212 AI1782390.9946593.09480350.316485994AI1783780.974 113.35974990.35910838 AI1784410.9693123.9779610.3713985 AI1785030.9805161.85753860.330391311AI1785260.9886237.41700530.351527825AI1786440.9698137.67299670.319376532AI1787630.9953470.49687980.293156364AI1788300.9803224.3832540.374635776AI1789550.9978647.28121590.338554719AI1792390.992 158.96631520.393554903AI1792430.958488.447746930.35176644 AI1793270.9979769.05048480.342140031AI1793290.9616154.420710.265469568AI1793350.999 516.30692020.397506405AI1793550.9974440.01640120.302809917AI1793560.999 561.17869910.297285533AI1793800.9927471.03444430.399527454AI1794780.9899388.02927760.311100554AI1795870.9609181.11078770.27873237 AI1796200.961 115.47299150.386630951AI1796360.9952340.98614320.253360334AI1796400.9733101.34701660.317864614AI1797110.9917161.21687470.308572322AI1798330.9978601.02367640.205199054AI1798400.972 274.16030070.324552252AI1798650.9841437.93567530.284891811AI1799010.9957309.4291260.297663083AI1800150.9994614.85816580.333863576AI1800810.9738389.83847120.311918656AI1801080.9864284.63409160.330955649AI1802240.9959277.06155620.26693795 AI1802590.9973740.0073840.269317518AI1802830.9766336.66240440.383017026AI1804000.9968483.03356270.38594082 NM_1333240.964108.3656820.321158744AI1804260.9917193.42110320.399608831AI1804410.9793177.71277880.221639307AI2276120.9872131.4115930.379405748AI2277050.9973373.90455360.308933462AI2277430.99 197.53167190.39445111 AI2278840.99811267.1802430.223517344AI2278870.9987690.31968350.344995772AI2278940.9914150.31450560.266321451AI2279620.9693138.72349680.332020291AI2281120.9991577.4938510.329771829AI2281650.981 245.90519050.288970498AI2282490.9917429.4995320.295305029AI2283830.9684118.39062520.308243873AI2284550.9592252.54913090.259258531AI2285820.9931244.07812780.299501991AI2291040.9973418.45194950.234220274AI2292510.99811138.3374590.262304468AI2294410.9967720.18474760.286485755AI2294870.9972326.65849510.278494869AI2295950.9949334.93990220.360754811AI2297020.9886220.65319920.314985308NM_0313420.9864412.80778370.286440425AI2300690.9884252.02369870.315987791AI2300730.9973396.20826140.258575264AI2301920.9968592.22031670.261516543AI2302480.9949420.02257970.337867921AI2302780.9967324.01603670.337131197AI2303080.9803180.54014760.350570361AI2305030.9844135.09258650.332488962AI2306350.9949280.96658140.247155413AI2307780.9804107.59290710.343369569AI2309120.9954200.58725430.353036114AI2310170.9914198.3007420.381906854AI2310380.9956250.06825230.277155064AI2310500.9943410.80505460.253822599AI2310710.997 393.03359390.198604907AI2312010.9983408.01264230.261904403AI2314710.9956346.90781640.371698402AI2314910.9912191.23106610.37822703 AI2317730.9964604.78768540.27563454 AI2317850.9978823.17250470.304029365AI2318120.982 210.75453640.282818931AI2318860.9978443.32050680.368330282AI2320300.9805402.22722120.353433208AI2320330.9926258.57492250.304872148AI2320600.9826129.65649710.320998742AI2321010.9941610.11229630.275151496AI2321120.973 208.46957250.342289609AI2321290.9874161.86771310.257738135AI2321590.9844248.3317370.349563487AI2321630.9791499.47242540.367022262NM_0305860.9845162.70745770.354117606AI2322740.9944229.19659470.309473807AI2322960.9575375.51567370.332664579AI2323210.976595.628597610.357723537AI2323540.9963322.44665060.305484765AI2325100.9636204.82720790.384522622AI2326390.953 114.85332350.370466228AI2327310.9661207.3390480.371840978AI2327340.9981379.62752840.307581158AI2328000.9504193.94822790.347453565AI2328070.983 197.11203360.309983248AI2328410.9903307.25661210.312565038AI2328870.9942204.75729490.363005514AI2329740.9922259.91913330.365849096AI2329790.9901232.3041060.320933431AI2330960.9573188.6697310.368765567AI2332040.99561010.0907550.339942787AI2332220.9993768.20226980.307770797AI2332670.9768115.11845040.370933612AI2333080.9935151.08165920.349681314AI2333160.9941301.93568290.36245711 AI2333500.9919228.1842420.363111901AI2333700.9859189.33101940.376119729AI2336980.9969198.63854870.322074038AI2337280.9612143.85048270.392284441AI2339150.9968440.82593170.348972124AI2340080.9763144.39220010.323216362AI2340400.9959214.8890630.282330926AI2341490.9894147.29863780.346303317AI2342230.9943155.58557920.295689739AI2342370.9898128.06041130.365310901AI2342920.9666132.03033640.371450337AI2343360.9606108.08726250.348978236AI2348720.9933342.83429840.348857143AI2349330.9735437.55976370.362073729AI2350540.9805158.12141420.341417567AI2352190.9903372.89950330.398319082AI2352380.9975828.10633820.269155653AI2352710.9859210.16747840.269465284AI2353970.9937263.1355930.33163326 AI2354030.9927295.66608060.249674383AI2355020.9674227.33193450.366379508AI2355080.9741166.2839590.274104484AI2355100.99811041.8710280.289468985NM_0225180.9893485.72827130.364274933AI2358850.9861143.73815020.330077747AI2359010.9788116.03773020.33953459 AI2359620.9923170.2564460.228940909AI2360030.9911116.78520260.368833607AI2363070.9931647.07450830.3535376 AI2363180.9905145.02242180.368534279AI2365200.9859230.03841210.310186585AI2365230.969379.937627670.334615459AI2365290.9893267.96886130.215091202AI2365700.99721503.5929590.295288772AI2366810.9979434.04897090.388139121AI2366910.9938329.13110410.374653742AI2367040.984787.825027540.361266423AI2367450.9936232.08043620.235151157AI2367630.9736114.09718410.323485716AI2367830.9988405.58827130.270457401AI2368000.9588130.72852040.364611624AI2369280.9889249.38959550.311710968AI2371990.950596.753301120.385809363AI2373110.9975994.90340910.307945865NM_0539890.9855152.33147110.348698125AI2377000.9899259.41714990.318929992NM_0313260.994 181.35145180.358704788AI2378610.9915252.74346160.257047104AI2378720.9856177.63812870.287293468AI6394250.983469.090787650.309131529NM_0570970.9897196.92134070.392392697S708030.9906176.5645580.296587753NM_0225880.958673.3087820.367974984NM_0132210.9839101.66910630.364721399NM_0225950.9955303.9157920.34309984 NM_0537990.9948362.9742160.304148568U538590.9911598.59763370.357330309NM_0130500.9878220.09144370.370223521NM_0533310.9996556.65651580.26197082 U753920.9967514.17697390.263076873NM_0217650.975 119.88552620.279960896NM_0172760.9834371.49168060.349680389Y133360.9959552.66616810.270873633


Claims
  • 1. A method of identifying at least one gene that is consistently expressed across different cell or tissue types in an organism, comprising: (a) preparing gene expression profiles for different cell or tissue types from the organism; (b) calculating a coefficient of variation for at least one gene in each of the profiles across the different cell or tissue types; and (c) selecting any gene whose coefficient of variation indicates that the gene is consistently expressed across the different cell or tissue types.
  • 2. A method of claim 1, wherein step (c) comprises identifying at least one gene with a coefficient of variation of less than about 40%.
  • 3. A method of claim 1, wherein the different cell or tissue types comprise greater than about 10 different cell or tissue types.
  • 4. A method of claim 1, wherein the different cell or tissue types comprise greater than about 25 different cell or tissue types.
  • 5. A method of claim 1, wherein the different cell or tissue types comprise greater than about 50 different cell or tissue types.
  • 6. A method of claim 3, wherein the cell or tissue types comprise normal and diseased cell or tissue types.
  • 7. A method of claim 1, wherein the cell or tissues have been exposed to a test agent.
  • 8. A method of claim 7, wherein the agent is a toxin.
  • 9. A method of claim 8, wherein the expression profiles are generated by querying a gene expression database for the expression level of at least one gene in different cell or tissue types from the organism or from a cell line.
  • 10. A set of probes comprising at least two probes that specifically hybridize to a gene identified by the method of claim 1.
  • 11. A set of probes according to claim 10, wherein the set comprises probes that specifically hybridize to at least about 10 genes.
  • 12. A set of probes according to claim 10, wherein the set comprises probes that specifically hybridize to at least about 25 genes.
  • 13. A set of probes according to claim 10, wherein the set comprises probes that specifically hybridize to at least about 50 genes.
  • 14. A set of probes according to claim 10, wherein the set comprises probes that specifically hybridize to at least about 100 genes.
  • 15. A set of probes according to claim 10, wherein the probes are attached to a single solid substrate.
  • 16. A set of probes of claim 15, wherein the solid substrate is a chip.
  • 17. A method of normalizing the data from a nucleic acid detection assay comprising: (a) detecting the expression level for at least one gene in a nucleic acid sample; and (b) normalizing the expression of said at least one gene with the detected expression level of a control gene identified by the method of claim 1.
  • 18. A method of claim 17, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 10 control genes.
  • 19. A method of claim 17, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 25 control genes.
  • 20. A method of claim 17, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 50 control genes.
  • 21. A method of claim 17, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 100 control genes.
  • 22. A method of claim 17, wherein the assay is quantitative.
  • 23. A method of claim 17, wherein the assay is a hybridization reaction conducted on a solid substrate.
  • 24. A method of claim 23, wherein the solid substrate is an oligonucleotide array.
  • 25. A method of claim 24, wherein the array comprises oligonucleotide probes that are complementary to the control genes.
  • 26. A method of claim 17, wherein the assay is a polymerase chain reaction.
  • 27. A set of probes comprising at least two probes that specifically hybridize to a gene of Table 1.
  • 28. A set of probes of claim 27, comprising probes that specifically hybridize to at least about 10 genes of Table 1.
  • 29. A set of probes of claim 27, comprising probes that specifically hybridize to at least about 25 genes of Table 1.
  • 30. A set of probes of claim 27, comprising probes that specifically hybridize to at least about 50 genes of Table 1.
  • 31. A set of probes of claim 27, comprising probes that specifically hybridize to at least about 100 genes of Table 1.
  • 32. A set of probes of claim 27, wherein the probes are attached to a single solid substrate.
  • 33. A set of probes of claim 32, wherein the solid substrate is a chip.
  • 34. A method of normalizing the data from a nucleic acid detection assay comprising: (a) detecting the expression level for at least one gene in a nucleic acid sample; and (b) normalizing the expression of said at least one gene with the detected expression of a control gene of Table 1.
  • 35. A method of claim 34, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 10 control genes of Table 1.
  • 36. A method of claim 34, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 25 control genes of Table 1.
  • 37. A method of claim 34, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 50 control genes of Table 1.
  • 38. A method of claim 34, wherein step (b) comprises normalizing the expression level of said at least one gene with the expression levels of at least about 100 control genes of Table 1.
  • 39. A method of claim 34, wherein the assay is quantitative.
  • 40. A method of claim 34, wherein the assay is a hybridization reaction conducted on a solid substrate.
  • 41. A method of claim 40, wherein the solid substrate is an oligonucleotide array.
  • 42. A method of claim 41, wherein the array comprises oligonucleotide probes that are complementary to the control genes.
  • 43. A method of claim 34, wherein the nucleic acid sample is from a rat cell or tissue sample that has been exposed to a test agent.
  • 44. A method of claim 43, wherein the test agent is a potential toxin.
  • 45. A method of claim 17, wherein the normalizing of step (b) comprises dividing the expression level for said at least one gene by the detected expression level of said control gene.
  • 46. A method of identifying at least one gene that is consistently expressed across different rat cell or tissue types, comprising: (a) querying a gene expression database for the expression level of at least one gene in different cell or tissue types from a rat population or cell line; (b) calculating a coefficient of variation for said at least one gene across the different cell or tissue types or cell lines; and (c) identifying at least one gene whose coefficient of variation indicates that the gene is consistently expressed across the different cell or tissue types or cell lines.
  • 47. A method of claim 46, wherein step (c) comprises identifying at least one gene with a coefficient of variation of less than about 40%.
  • 48. A method of claim 47, wherein the different cell or tissue types comprise greater than about 10 different cell or tissue types.
  • 49. A method of claim 47, wherein the different cell or tissue types comprise greater than about 25 different cell or tissue types.
  • 50. A method of claim 47, wherein the different cell or tissue types comprise greater than about 50 different cell or tissue types.
  • 51. A method of claim 46, wherein the cell or tissue types comprise normal and diseased cell or tissue types.
  • 52. A method of claim 51, wherein the cell or tissue types are exposed to a test agent.
  • 53. A method of claim 52, wherein the agent is a toxin.
  • 54. A method of identifying a nucleic acid molecule whose level of expression is invariant across two or more cell or tissue samples, comprising: (a) determining the variation in the expression level of the nucleic acid molecule as a coefficient of variation (% CV) from two or more cell or tissue samples; (b) comparing the coefficient of variation for the nucleic acid molecule to a threshold value, wherein the expression level of the nucleic acid molecule is considered to be invariant if the coefficient of variation is less than the threshold value; and (c) identifying a nucleic acid molecule whose level of expression is invariant across two or more cell or tissue samples.
  • 55. A method of normalizing data from a nucleic acid detection assay comprising: (a) detecting the expression level for at least one gene in a nucleic acid sample; and (b) normalizing the expression level of said at least one gene with the detected expression level of an invariant gene identified by the method of claim 54.
RELATED APPLICATIONS

[0001] This application claims priority under 35 U.S.C. §119(e) to U.S. Provisional Application No. 60/399,158, filed Jul. 30, 2002, which is herein incorporated by reference in its entirety.

Provisional Applications (1)
Number Date Country
60399158 Jul 2002 US