The present disclosure relates to a plurality of genes modulated by estrogen or other agents, such as hormones or combinations of hormones, in various types of tissue. In particular, one embodiment of the disclosure relates to a plurality of genes which demonstrates certain patterns of expression differing qualitatively or, quantitatively, with and without exposure to estrogen and/or other hormone compositions. The disclosure further relates to the methods of using these genes in identifying agents that exert at least some of the biological effects of estrogen and/or other agents, and to pharmaceuticals and related therapies. The disclosure further relates to the use of the plurality of genes in methods of monitoring, in gene chips and in kits.
Estrogens exert biological effects in numerous organs throughout the body. The role of estrogens in reproductive biology, the prevention of postmenopausal hot flashes, and the prevention of postmenopausal osteoporosis are well established. Many observational studies have suggested estrogens also reduce the risk of development of cardiovascular disease(1), at least in part by estrogens reducing LDL cholesterol levels and elevating HDL cholesterol levels(2,3). More recently, estrogens have been suggested to inhibit the development of colon cancer(4), inhibit the development of Alzheimer's disease(5), and inhibit development of cataracts (6). The multitude of estrogen responses matches the widespread distribution of estrogen receptors (ER) throughout numerous organs, with ERα expression highest in uterus, pituitary, kidney and adrenal gland and ERβ expression highest in ovary, uterus, bladder and lung(7). While various estrogens have been profiled for biological activity, little is known regarding the patterns of gene expression which are responsible for these diverse activities.
Thus, a need exists for the systemic analysis of the regulation by estrogen and/or other hormonal compositions of gene expression in various tissues and the identification of the plurality of differentially expressed genes. The identification of candidate agents that at least partially exert the same differential expression and development of pharmaceuticals and new treatment methods based on such agents is highly desirable. There also exists a need for methods of monitoring conditions and for diagnostic products, including gene chips and kits, which may be used in the above-described analyses.
The embodiments provided herein relate generally to a plurality of genes, particularly a plurality of genes that are modulated by estrogen and/or other hormonal compositions in various organs, such as the uterus, kidney and pituitary gland. Such differentially expressed genes are useful in screening assays to examine the effects of a candidate agent on the expression of genes that are responsive to estrogen. A candidate agent that induces, in a given tissue, a gene expression profile that exhibits one or more similarities to the gene expression profile of estrogen and/or other hormonal compositions, can be identified for possible use in pharmaceuticals. The invention also relates to the identification of estrogen responsive genes that are known to be associated with the inhibition of certain conditions, such as shock, post-menopausal calcium deficiencies, cardiovascular diseases, and conditions where there is decreased renal blood flow, such as those caused by diuretics or congestive heart failure.
FIGS. 8A-BC show a series of relative expression levels for various genes in different types of mice. This figure also presents a model for AF1 or AF2 activation for each gene. Ovariectomized WT mice, ERαERβKO mice (expressing only ΔAF1-ERα) or ERαKO mice (expressing ΔAF1-ERα along with ERβ) were treated for 6 weeks with vehicle, 10 μg/kg/day E2, 10 μg/kg/day E2+5 mg/kg/day ICI182780, or 5 mg/kg/day tamoxifen. Kidney gene expression values were determined by real-time PCR for each individual animal and normalized for GAPDH expression. The mean expression level in vehicle-treated WT mice was defined as 1 for each gene. *p<0.01 for comparison to vehicle treated animals. A model for the requirement of AF1 or AF2 for activation of each gene is shown below each graph. The change in ER shape with tamoxifen Cr) bound denotes the alternate helix 12 conformation induced by tamoxifen compared to E2. CA denotes coactivators.
One embodiment of the disclosure relates to a plurality of genes, each of whom is differentially expressed in tissue cells exposed to estrogen and/or other hormones or combination of hormones and tissue cells without said exposure, which plurality comprises a first group and a second group, wherein each gene in said first group is differentially expressed at a higher level in said tissue cells exposed to estrogen and/or a hormone or combinations of hormones than in said tissue cells without said exposure, wherein each gene in said second group is differentially expressed at a lower level in said tissue cells exposed to estrogen and/or a hormone or combinations of hormones than in said tissue cells without said exposure. Confirmation of such expression is confirmed by real-time PCR. Such cells preferably are from the kidney, pituitary or uterus. Exposure to estrogen and/or the other hormones is in vivo or in vitro. The higher level and lower levels are assessed using a predetermined statistical significance standard based on measurements of expression levels. The measurements can obtained using nucleotide arrays or nucleotide filters.
Another embodiment relates to a method for identifying an agent having the biological effect of estrogen and/or other hormones or combination of hormones, on gene expression in a given tissue, wherein said desired effect represents a first plurality of genes differentially expressed at various levels, which method comprises:
Another embodiment is directed to an agent identified by the above method.
Another embodiment is a pharmaceutical composition comprising this agent and a pharmaceutically acceptable excipient.
Another embodiment relates to a method for identifying an agent capable of maintaining vascular volume in septic shock, which method comprises:
Another embodiment relates to a method of identifying an agent capable of enhancing calcium uptake in post-menopausal women, which method comprises:
Another embodiment relates to a method for identifying an agent for treating cardiovascular disorders, which method comprises:
Another embodiment relates to agents identified by any of the above methods and pharmaceutical agents comprising such agents and a pharmaceutically acceptable excipient.
Another embodiment relates to a solid substrate comprising any of the above described plurality of genes.
Another embodiment relates to a kit comprising any of the above plurality of genes.
The embodiments herein provide a plurality of genes modulated by estrogen and/or other hormonal compositions of interest in various types of tissue and the use of such a plurality of differentially expressed genes in screening for agents that exert at least some of the biological effects of estrogen and other hormonal compositions of interest. Such identified agents can be used in pharmaceuticals and in related new therapeutic methods. The plurality of genes can be used in methods of monitoring.
Definitions:
In general, “a gene” is a region on the genome that is capable of being transcribed to an RNA that either has a regulatory function, a catalytic function and/or encodes a protein. A gene typically has introns and exons, which may organize to produce different RNA splice variants that encode alternative versions of a mature protein. “Gene” contemplates fragments of genes that may or may not represent a functional domain.
A “plurality of genes” as used herein refers to a group of identified or isolated genes whose levels of expression vary in different tissues, cells or under different conditions or biological states. The different conditions may be caused by exposure to certain agent(s)—whether exogenous or endogenous—which include hormones, receptor ligands, chemical compounds, etc. The expression of a plurality of genes demonstrates certain patterns. That is, each gene in the plurality is expressed differently in different conditions or with or without exposure to a certain endogenous or exogenous agents. The extent or level of differential expression of each gene may vary in the plurality and may be determined qualitatively and/or quantitatively according to this invention. A gene expression profile, as used herein, refers to a plurality of genes that are differentially expressed at different levels, which constitutes a “pattern” or a “profile.” As used herein, the term “expression profile,” “profile,” “expression pattern,” “pattern,” “gene expression profile,” and “gene expression pattern” are used interchangeably.
An “agent that exerts at least some of the biological effects of estrogen,” as used herein refers to any factor, agent, compound whether endogenous or exogenous in origin, which is capable of binding and interacting with estrogen receptors and thereby eliciting certain biological effects of extrogen. The skilled artisan would know that, for instance, one of the biological effects of estrogen is to promote the development of the female reproductive system. Other biological effects of estrogen are well documented and discussed, infra.
Gene expression profiles may be measured, according to this invention, by using nucleotide or microarrays. These arrays allow tens of thousands of genes to be surveyed at the same time.
“Hormones or combinations of hormones” include for instance, combinations of estrogens or other hormones that are known to exert biological effects of estrogen.
As used herein, the term “microarray” refers to nucleotide arrays that can be used to detect biomolecules, for instance to measure gene expression. “Array,” “slide,” and “chip” are used interchangeably in this disclosure. Various kinds of arrays are made in research and manufacturing facilities worldwide, some of which are available commercially. There are, for example, two main kinds of nucleotide arrays that differ in the manner in which the nucleic acid materials are placed onto the array substrate: spotted arrays and in situ synthesized arrays. One of the most widely used oligonucleotide arrays is GeneChip™ made by Affymetrix, Inc. The oligonucleotide probes that are 20- or 25-base long are synthesized in silico on the array substrate. These arrays tend to achieve high densities (e.g., more than 40,000 genes per cm2). The spotted arrays, on the other hand, tend to have lower densities, but the probes, typically partial cDNA molecules, usually are much longer than 20- or 25-mers. A representative type of spotted cDNA array is LifeArray made by Incyte Genomics. Pre-synthesized and amplified cDNA sequences are attached to the substrate of these kinds of arrays.
In one embodiment, the nucleotide is an array (i.e., a matrix) in which each position represents a discrete binding site for a product encoded by a gene (e.g., a protein or RNA), and in which binding sites are present for products of most or almost all of the genes in the organism's genome. In one embodiment, the “binding site” (hereinafter, “site”) is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA can specifically hybridize. The nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length cDNA, a less-than full length cDNA, or a gene fragment.
Although the microarray may contain binding sites for products of all or almost all genes in the target organism's genome, such comprehensiveness is not necessarily required. Usually the microarray will have binding sites corresponding to at least about 50% of the genes in the genome, often at least about 75%, more often at least about 85%, even more often more than about 90%, and most often at least about 99%. Preferably, the microarray has binding sites for genes relevant to the action of the gene expression modulating agent of interest or in a biological pathway of interest.
The nucleic acid or analogue are attached to a “solid support,” which may be made from glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, or other materials. A preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is described generally by Schena et al., 1995, Quantitative monitoring of gene expression patterns with a complementary DNA microarray, Science 270:467-470. This method is especially useful for preparing microarrays of cDNA. See also DeRisi et al., 1996, Use of a cDNA microarray to analyze gene expression patterns in human cancer, Nature Genetics 14:457-460; Shalon et al., 1996, A DNA microarray system for analyzing complex DNA samples using two-color fluorescent probe hybridization, Genome Res. 6:639-645; and Schena et al., 1995, Parallel human genome analysis; microarray-based expression of 1000 genes, Proc. Natl. Acad. Sci. USA 93:10539-11286.
In a preferred embodiment, the microarray is a high-density oligonucleotide array, as described above. In a particularly preferred embodiment, the nucleotide arrays are the MG_U74 and MG_U74v2 arrays from Affymetrix.
“Polymerase Chain Reaction” or “PCR” is an amplification-based assay used to measure the copy number of the gene. In such assays, the corresponding nucleic acid sequences act as a template in an amplification reaction. In a quantitative amplification, the amount of amplification product will be proportional to the amount of template in the original sample. Comparison to appropriate controls provides a measure of the copy number of the gene, corresponding to the specific probe used, according to the principle discussed above. Methods of “real-time quantitative PCR” using Taqman probes are well known in the art. Detailed protocols for real-time quantitative PCR are provided, for example, for RNA in: Gibson et al., 1996, A novel method for real time quantitative RT-PCR. Genome Res. 10:995-1001; and for DNA in: Heid et al., 1996, Real time quantitative PCR. Genome Res. 10:986-994.
A TaqMan-based assay can also be used to quantify polynucleotides. TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5′ fluorescent dye and a 3′ quenching agent. The probe hybridizes to a PCR product, but cannot itself be extended due to a blocking agent at the 3′ end. When the PCR product is amplified in subsequent cycles, the 5′ nuclease activity of the polymerase, for example, AmpliTaq, results in the cleavage of the TaqMan probe. This cleavage separates the 5′ fluorescent dye and the 3′ quenching agent, thereby resulting in an increase in fluorescence as a function of amplification (see, for example, http://www2.perkin-elmer.com).
Other suitable amplification methods include, but are not limited to, ligase chain reaction (LCR) (see, Wu and Wallace, 1989, Genomics 4: 560; Landegren et al., 1988 Science 241: 1077; and Barringer et al., 1990, Gene 89: 117), transcription amplification (Kwoh et al., 1989, Proc. Natl. Acad. Sci. USA 86: 1173), self-sustained sequence replication (Guatelli et al., 1990, Proc. Nat. Acad. Sci. USA 87: 1874), dot PCR, and linker adapter PCR, etc.
The “level of mRNA” in a biological sample refers to the amount of mRNA transcribed from a given gene that is present in a cell or a biological sample. One aspect of the biological state of a biological sample (e.g. a cell or cell culture) usefully measured in the present invention is its transcriptional state. The transcriptional state of a biological sample includes the identities and abundances of the constituent RNA species, especially mRNAs, in the cell under a given set of conditions. Preferably, a substantial fraction of all constituent RNA species in the biological sample are measured, but at least a sufficient fraction is measured to characterize the action of an agent or gene modulator of interest. The level of mRNA may be quantified by methods described herein or may be simply detected, by visual detection by a human, with or without comparison to a level from a control sample or a level expected of a control sample.
A “biological sample,” as used herein refers to any sample taken from a biological subject, in vivo or in situ. A biological sample may be a sample of biological tissue, or cells or a biological fluid. Biological samples may be taken, according to this invention, from any kind of biological species, any types of tissues, and any types of cells, among other things. Cell samples may be isolated cells, primary cell cultures, or cultured cell lines according to this invention. Biological samples may be combined or pooled as needed in various embodimets. Preferred tissues are from the uterus, kidney, pituitary glands, breast, brain and adipose tissue.
“Modulation of gene expression,” as this term is used herein, refers to the induction or inhibition of expression of a gene. Such modulation may be assessed or measured by assays. Typically, modulation of gene expression may be caused by endogenous or exogenous factors or agents. The effect of a given compound can be measured by any means known to those skilled in the art. For example, expression levels may be measured by PCR, Northern blotting, Primer Extension, Differential Display techniques, etc.
“Induction of expression” as used herein refers to any observable or measurable increase in the levels of expression of a particular gene, either qualitatively or quantitatively. The measurement of levels of expression may be carried out according to this invention using any techniques that are capable of measuring RNA transcripts in a biological sample. Examples of these techniques include, as discussed above, PCR, TaqMan, Primer Extension, Differential display and nucleotide arrays, among other things.
“Repression of expression.” “Repression” or “inhibition” of expression, are used interchangeably according to this disclosure. It refers to any observable or measurable decrease in the levels of expression of a particular gene, either qualitatively or quantitatively. The measurement of levels of expression may be carried out using any techniques that are capable of measuring RNA transcripts in a biological sample. The examples of these techniques include, as discussed above, PCR, TaqMan, Primer Extension, Differential Display, and nucleotide arrays, among other things.”
A “gene chip” or “DNA chip” is described, for instance, in U.S. Pat. Nos. 5,412,087, 5,445,934 and 5,744,305 and is useful for screening gene expression at the mRNA level. Gene chips are commercially available.
A “kit” is one or more of containers or packages, containing at least one “plurality of genes,” as described above, on a solid support. Such kit also may contain various reagents or solutions, as well as instructions for use and labels.
A “detectable label” or a “detectable moiety” is a composition that when linked with a nucleic acid or a protein molecule of interest renders the latter detectable, via spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include radioactive isotopes, magnetic beads, metallic beads, colloidal particles, fluorescent dyes, electron-dense reagents, enzymes, biotin, digoxigenenin or haptens. A “labeled nucleic acid or oligonucleotide probe” is one that is bound, either covalently, through a linker or a chemical bond, or noncovalently through ionic, vander Waals, electrostatic, hydrophobic interactions, or hydrogen bonds, to a label such that the presence of the nucleic acid or probe may be detected by detecting the presence of the label bound to the nucleic acid or probe.
A “nucleic acid probe” is a nucleic acid capable of binding to a target nucleic acid or complementary sequence through one or more types of chemical bond, usually through complementary base pairing usually through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, or T or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. It will be understood by one of skill in the art that probes may bind target sequences that lack complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. The probes are preferably directly labeled with isotopes, for example, chromophores, luminphores, chromogens, or indirectly labeled with biotin to which a strepavidin complex may later bind. By assaying the presence or absence of the probe, one can detect the presence or absence of a target gene of interest.
“In situ hybridization” is a methodology for determining the presence of or the copy number of a gene in a sample, for example, fluorescence in situ hybridization (FISH) (see Angerer, 1987 Meth. Enzymol 152: 649). Generally, in situ hybridization comprises the following major steps: (1) fixation of tissue or biological structure to be analyzed; (2) prehybridization treatment of the biological structure to increase accessibility of target nucleic acid, and to reduce nonspecific binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not bound in the hybridization, and (5) detection of the hybridized nucleic acid fragments. The probes used in such applications are typically labeled, for example, with radioisotopes or fluorescent reporters. Preferred probes are sufficiently long, for example, from about 50, 100, or 200 nucleotides to about 1000 or more nucleotides, to enable specific hybridization with the target nucleic acid(s) under stringent conditions.
Hybridization protocols suitable for use with the methods of the invention are described, for example, in Albertson (1984) EMBO J. 3:1227-1234; Pinkel (1988) Proc. Natl. Acad. Sci. USA 85:9138-9142; EPO Pub. No. 430:402; Methods in Molecular Biology, Vol. 33: In Situ Hybridization Protocols, Choo, ed., Humana Press, Totowa, N.J. (1994); etc.
“A predetermined statistical significance standard based on measurements of expression levels” is a confidence score based upon the assessment of four factors. Specifically, a score is assigned to each gene that reflects the confidence of the change. The score is based on four criteria: Fold Change, p-value (T-test), Present Calls, Frequency Value. For example, see the Table below:
“Data” refers to information obtained that relates to the expression of genes in response to exposure to estrogen or an agent of unknown biological effect. The plurality of genes identified by the disclosed methods are examples of such information. The information is stored in electronic or paper formats. Electronic format can be selected from the group consisting of electronic mail, disk, compact disk (CD) digital versatile disk (DVD), memory card, memory chip, ROM or RAM, magnetic optical disk, tape, video, video clip, microfilm, internet, shared network, shared server and the like; wherein data is displayed, transmitted or analyzed via electronic transmission, video display, telecommunication, or by using any of the above stored formats; wherein data is compared and compiled at the site of sampling specimens or at a location where the data is transported following a process as described above.
“Genes modulated by estrogen.” Genes regulated by estrogen and/or hormonal compositions and identified according to the disclosed methods, are listed in Tables II, III and IV. Relevant Unigene codes or Genbank accession numbers are provided.
Identification of Genes Modulated by Estrogen
A. Biological Sample and Assay
One embodiment disclosed herein relates to a plurality of genes, each of which is differentially expressed in kidney cells exposed to estrogen or a candidate agent and kidney cells without exposure to estrogen or a candidate agent, which plurality comprises a first group and a second group, wherein each gene in said first group is differentially expressed at a higher level in said kidney cells exposed to estrogen or a candidate agent than in said kidney cells without said exposure, wherein each gene in the second group is differentially expressed at a lower level in said kidney cells exposed to estrogen or candidate agent than in said kidney cells without said exposure.
A biological sample of kidney cells are obtained according to methods well known to the skilled artisan. One group of kidney cells are exposed to estrogen. Such estrogen may be 17β estradiol. The kidney cells may be from one or more animals of the same species or from a culture of kidney cells or kidney tissue. Preferably, such cells are from a mammal, most preferably a mouse, rat or human. Such animal must produce little or no estrogen. For instance, an aromatase knockout animal cannot produce estrogen. Because the major source of circulating estrogen is the ovary, ovariectomy dramatically decreases circulating estrogen levels. Thus, in one embodiment, ovariectomized animals are used. By “exposure” is meant a type and quantity of either in vivo or in vitro administration that is applicable to the source of the kidney cells and known and acceptable to those of skill of the art. The total RNA from such cells is prepared by methods known to the skilled artisan, e.g., by Trizol (Invitrogen) followed by subsequent repurification, e.g. via Rneasy columns (Qiagen). The total RNA is used to generate a labeled target according to methods and using detectable labels well know in the art, as described above in detail. For instance, the RNA may be labeled with biotin to form a cRNA target for use in an assay. See a complete description of preferred methods in the Affymetrix GeneChip® technical manual (Pages 700217 through 700223), which is herein incorporated by reference.
The assay, according to the invention, may be any assay suitable to detect gene expression. For instance, mRNA, cDNA or protein expression may be detected. Many different types of assays are known, examples of which are set forth above, including analyses by nucleotide arrays and nucleotide filters. The hybridization conditions (temperature, time, and concentrations) are adjusted according to procedures also well known in the art, as described above. In a preferred embodiment, the assay of the invention involves the use of a high density oligonucleotide array. For instance, in a preferred embodiment, cRNA labeled with biotin is hybridized to a murine MG_U74Av2 probe array (Affymetrix, Santa Clara, Calif.) for 16 hours at 45 degrees. Eleven biotin-labeled cRNAs at defined concentration are spiked into each hybridization and used to convert average difference values to frequencies expressed as parts per million.
Other solid supports and microarrays are known and commercially available to the skilled artisan, as described above.
B. Measurements and Statistical Analysis
The assay of the invention is used to identify genes modulated by estrogen. Such modulation may be induction of expression (a plurality of genes belonging to a “first group”) or repression of such expression (a plurality of genes belonging to a “second group”). Gene expression induction is indicated by a higher level of expression, whereas repression is indicated by a lower level of expression, as assessed using a predetermined statistical significance standard based on measurements of expression levels.
Thus, the genes expressed or repressed in kidney cells with estrogen exposure are compared to the genes expressed or repressed in kidney cells that were not exposed to estrogen. Pairwise comparisons are made between each of the treatments. A pairwise comparison is the expression data for a given gene under a given treatment condition compared to the expression data for this gene under a second treatment condition. The fold change ratio is then calculated, the p-value based on Student's t-test, the number of present calls, and the expression level for each comparison. A confidence score “CS” is defined as CS(x)=FC(x)+PV(x)+PC(x)+EL(x) where FC, PV, PC and EL are scores assigned to the fold change, p-value, number of present calls, and the expression level, respectively. FC(x) is assigned 5 if the fold change ratio was greater then 1.95 and is assigned 0 if the ratio is between 1.95 and 1.5. PV(x) is assigned 3 if the p-value is less then 0.05 and is assigned 2 if the p-value was between 0.05 and 0.1. PC(x) is assigned 3 if at least 50% of the samples are called P by the Affymetrix algorithm and assigned 1 if only 25% of the samples are called P. EL(x) is assigned 3 if at least two samples have frequency value of 20 or greater and assigned 1 if two samples only have a frequency greater then 15. Penalty points are assigned if the fold change is less then 1.5, the p-value is greater then 0.2 or the frequency values were below 15 ppm. CS(x) ranged for −14 to 14 with qualifiers having a score of 14 considered the most significant changes. Genes with 11 or more points in any one pairwise comparison is considered to be significant. Real-time PCR and histology analyses are then performed to confirm the identity of the genes, essentially as described previously (9,10), which are herein incorporated by reference. The above described analysis can be used to identify candidate agents that are “estrogen-like” in that they have a differential expression profile which is in the most preferred embodiment substantially the same as estrogen's. For instance, in one embodiment, the expression levels for the genes upon exposure to the respective compounds is at least within 50% of each other.
C. Biological Samples from Other Organs
The above described methods, assay and analysis can be applied to biological samples from any tissue, including the uterus, pituitary gland, liver, brain, colon, breast, adipose tissue, etc. In preferred embodiments, the biological samples are from the kidney, pituitary gland and the uterus.
D. The Plurality of Genes
Pursuant to the above described methods, the genes listed in Table II were identified as being differentially expressed upon exposure to estrogen. Genes in which expression is induced by estrogen are considered to be genes of the “first group,” whereas genes that are repressed by estrogen are considered to be in the “second group”.
Specifically, the estrogen modulated genes in the kidney are Tissue Factor, CYP7B1, BCAT1, STAT5A, GADD45G, BHMT, SAHH, NTT73, ABCC3. Of these genes, estrogen induced expression in all but BHMT and SAHH, where it repressed expression.
Thus, one disclosed embodiment is a plurality of genes, wherein in the first group, where gene expression in kidney cells is induced by estrogen exposure, the plurality of genes comprise NTT73 and ABCC3. Another disclosed embodiment is a plurality of genes wherein the “first group” comprises CYP7B1 in kidney cells. In another embodiment, the plurality of genes of the “second group,” where gene expression in kidney cells is repressed by estrogen exposure, comprises at least BHMT and SAHH.
Another disclosed embodiment is directed to a plurality of genes in kidney cells, wherein the first group comprises Tissue Factor, CYP7B1, BCAT1, STAT5A, and GADD45G, and wherein said second group comprises BHMT.
Another disclosed embodiment is directed to a plurality of the genes wherein the first group comprises CYP7B1, TF, SCYA28, Iga, Vk28, PHD 2, ELF 3, TIM1, STAT5A, COR1, BCAT1, ABCC3, TIM2, NAT6, RGS3, GNBP3, BCL7A, 17βDHH, FYVE ZFP, NTT73, AGPS, TRIM2, HBACH, CIS2, CYP27B1, and STAT5B, wherein said second group comprises SAHH, ADH1A7, RARRES2, and BHMT. Another disclosed embodiment relates to a plurality of genes, wherein the first group comprises COR1 and GNBP3.
The estrogen modulated genes in the pituitary gland are STAT5B, GADD45G, Kallikrein-9, and FSHb, the expression of which is induced by estrogen for all but FSHb, which is repressed.
Thus one embodiment relates to a plurality of genes in the pituitary gland, wherein the first group comprises STAT5B and GADD45G.
Another embodiment relates to a plurality of genes, wherein the first group comprises STAT5B, GADD45G1 and Kallikreins genes in the pituitary.
Yet another embodiment relates to a plurality of genes, wherein the second group of genes in the pituitary gland comprise FSHb.
Pursuant to the above methods, the inventors discovered that the estrogen modulated genes in the uterus comprise SFRP4, Deiodinase (type II), Procollagen (type I, Alpha I), vimentin and IGFBP4, Scavenger receptor, AI121305, ALOX15, BCAT1, SiAMOX, C3, FOS, MAP2k1, CEBPb, EGR1 and CYP1A1. All of these genes are induced by estrogen in the uterus except for Scavenger receptor and CYP1A1, which are repressed.
Thus, one embodiment is directed to a plurality of genes in the uterus, wherein the first group comprises SFRP4, Deiodinase (type II), Procollagen (type I, Alpha I), vimentin and IGFBP4.
Another embodiment of the invention is directed to the plurality of genes, wherein the first group in the uterus comprises AI121305, ALOX15, BCAT1, SiAMOX, C3, FOX, MAP2k1, CEBPb and EGR1.
Another embodiment is directed to a plurality of genes wherein the first group in the uterus comprises SFRP4, Deiodinase (type II), Procollagen (type I, Alpha I), vimentin and IGFBP4, Scavenger receptor, AI121305, ALOX15, BCAT1, SiAMOX, C3, FOS, MAP2k1, CEBPb and EGR1.
In another embodiment, the plurality of genes in the second group in the uterus comprises CYP1A1.
In yet another embodiment, the plurality of genes in the second group in the uterus comprises Scavenger receptor.
Methods of Identifying Agents
Based upon the above described methods for determining differential expression of genes in various organs, another aspect of the invention relates to the identification of candidate agents that have the same or substantially the same biological effect of a known agent, such as estrogen or another hormonal combination of known biological effect. An “agent” could be any compound of unknown biological effect on genes in a given body site. Specifically, the invention relates to a method for identifying an agent having a desired effect on gene expression in an organ, wherein said desired effect represents a first plurality of genes differentially expressed at various levels, which method comprises:
Another embodiment relates to an agent identified by the above method.
Yet another embodiment relates to a gene chip comprising any one or more of the above plurality of genes.
Pharmaceuticals and Methods of Treating
The identification of agents that induce or repress the expression of a gene associated with a given disorder or condition can lead to the development of pharmaceuticals that can be administered to a patient at therapeutically effective doses to prevent, treat, or control such disorder or condition.
Some conditions associated with estrogen regulation of gene expression in the kidney are known. For instance, in women, high estrogen levels preceding ovulation, during pregnancy, and resulting from estrogen administration commonly results in body water retention (23,24). Increased renal sodium reabsorption is a major mechanistic component for the elevated fluid retention (25). In rats, estrogen has been shown to increase thiazide-sensitive NaCl cotransporter expression levels(26), providing one possible molecular basis for estrogen effects on sodium retention.
Pursuant to the methods of the invention as disclosed above and as exemplified in greater detail in the Examples below, two additional estrogen regulated genes that influence sodium retention were identified. First, estrogen (E2) treatment increased mRNA levels for NTT73 (27), which is a sodium and chloride dependent transporter, known to regulate sodium retention. Second, E2 treatment also induced mRNA levels for ABCC3, a member of a family of genes which are known to modulate epithelial sodium channel activity (28). The physiological role of E2 regulation of these genes may lie in the large volume expansion required during pregnancy. The ED50 value for E2 activation of gene expression in the kidney was about 10-fold higher than that required for uterine weight increases (
Premenopausal women survive septic shock better than comparably aged males while postmenopausal women have a diminished survival advantage. Since volume loss is a major cause of morbidity in shock, it is expected that enhanced sodium and water retention due to elevated expression of NTT73(27) by E2 plays a role in this protective process. Thus, one disclosed embodiment relates to a method for identifying an agent capable of maintaining vascular volume in septic shock comprising exposing, in vivo or in vitro, kidney cells to an agent, measuring expression levels of NTT73 and ABCC3 in kidney cells exposed the agent and in kidney cells not exposed to the agent; comparing the expression levels of the NTT73 and ABCC3 with the expression levels of the genes kidney cells exposed to estrogen. The candidate agent identified by this process can be used in pharmaceuticals for purposes of maintaining vascular volume in the treatment of septic shock.
Another estrogen modulated gene in the kidney with biological significance is CYP27B1, the enzyme responsible for the rate limiting conversion of inactive 25-hydroxy vitamin D3 into active 1,25-dihidroxy vitamin D (29). This process is known to occur in the proximal tubules of the kidney and has been shown to be stimulated by estrogen treatment of birds(30). Urinary calcium excretion is increased in postmenopausal women, while estrogen treatment reduces urine calcium levels (31, 32). The presence of vitamin D receptors within the proximal convoluted tubule and collecting duct tubules of the kidney suggests that E2 induction of CYP27B1 is the basis of this beneficial effect.
Thus, the invention relates to a method of identifying agents that are capable of enhancing calcium uptake in postmenopausal women comprising exposing, in vivo or in vitro, kidney cells to an agent, measuring expression levels of CYP7B1 in kidney cells exposed the agent and in kidney cells not exposed to the agent; comparing the expression levels of the CYP7B1 with the expression levels of the genes in kidney cells exposed to estrogen. The agent identified by this process can be used in pharmaceuticals for purposes of enhancing calcium uptake in postmenopausal women.
It is known that estrogen treatment reduces expression of betaine:homocysteine methyltransferase (BHMT) and S-adenosylhomocysteine hydrolase (SAHH), two enzymes involved in the methionine/homocysteine cycle (34). Elevated plasma homocysteine levels are now recognized as an important risk factor for the development of cardiovascular disease (35), and estrogen treatments reduced plasma homocysteine levels in postmenopausal women. Thus, the regulation of BHMT and SAHH provides a mechanistic link for this effect.
Thus, one embodiment disclosed herein relates to a method of identifying candidate agents for treating cardiovascular disorders comprising measuring expression of BHMT and SAHH in kidney cells exposed to an agent and in kidney cells with such exposure, comparing the expression levels of BHMT and SAHH with the expression levels of the genes in kidney cells exposed to estrogen. The agent identified by this process can be used in pharmaceuticals for purposes of treating cardiovascular disorders.
Finally, E2 treatment induced expression of COR1 (chemokine orphan receptor 1, RDC1) an orphan G-protein coupled receptor (37), along with the guanylate nucleotide binding protein 3 (GNBP3) and the regulator of G-protein signaling 3 (RGS3), suggesting these proteins may form a functional unit. RDC1 is a receptor for the potent vasodilatory peptide adrenomedullin and calcitonin gene-related peptide, CGRP (38). Administration of CGRP to ovariectomized rats does not produce a decrease in kidney vascular resistance; however, in ovariectomized rats treated with E2 or in pregnant rats, injection of CGRP significantly decreases kidney vascular resistance (39). The observed increased expression of RDC1 in kidney provides a mechanism for the E2 induction of sensitivity to CGRP in the kidney, resulting in the large increase in renal flow seen during pregnancy (40).
Thus, one embodiment disclosed herein relates to a method of identifying candidate agents for treating conditions associated with reduced renal flow, such as caused by diuretics or congestive heart failure. Toxicity and therapeutic efficacy of such agents identified by the above methods can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, for example, for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and can be expressed by the ratio, LD50/ED50. compounds that exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side effects can be used, care should be taken to design a delivery system that targets such compounds to the site of affected tissue to minimize potential damage to normal cells and thereby reduce side effects.
The data obtained from the cell culture assays and animal studies can be used to formulate a dosage range for use in humans. The dosage of such compounds likes preferably within a range of circulating concentrations that include ED50 with little or no toxicity. The dosage can vary within this range depending upon the dosage form employed and the route of administration.
The pharmaceuticals of the present invention can be formulated by standard techniques using one or more physiologically acceptable carriers or excipients and the biologically active agent. The agents and its physiologically acceptable salts and solvates can be formulated and administered orally, introraly, rectally, parenteraly, epicutaneously, topically, transdermally, subcutaneously, intramuscularly, intranasally, sublingually, intradurally, introcularly, intravenously, intraperioneally, or by inhalation.
With regard to oral administration, the pharmaceutical compositions can take the form of tablets or capsules prepared by conventional means with pharmaceutically acceptable excipient, such as binding agents etc. Tablets may be coated according to methods well known in the art. Liquid preparations can be in the form of solutions, suspensions and syrups or can be initially in dry form for constitution with water or other suitable vehicle. Other additives may include suspending agents, such as sorbitol syrup, cellulose derivatives or hydrogenated edible fats, emulsifying agents or non-aqueous solutions. Preparations for oral administration may also be formulated for a time or controlled release of the active ingredient using techniques well know in the art of the invention.
Other formulations of the pharmaceuticals of the invention may be depot preparations for administration via implantation.
The pharmaceutical compositions of the present invention may be presented in a pack or dispenser device that contains one or more unit dosage forms containing the active ingredient. The pack can for example comprise metal or plastic foil, for example a blister pack. The pack or dispenser would contain instructions for administration.
Methods of Monitoring
The identification of the plurality of genes described above provides a powerful tool for assessing the progression of a state, condition or treatment. Specifically, a plurality of genes can be identified in a patient prior to an event, such as menopause, surgery, the onset of a therapeutic regime, or the completion of a therapeutic regime, to provide a base line result. This base-line can then be compared with the result obtained using identical methods either during or after such event. This information can be used for both diagnostic and prognostic purposes.
Kits
Another embodiment is directed to a kit containing a plurality of genes, preferably on a substrate. The kit also may comprise one or more containers or packages, along with reagents, solutions and possibly instructions for use.
All of the cited references are herein incorporated by reference. The invention is further described by the following Examples, which do not limit the invention in any manner.
Introduction
Estrogen receptors are expressed in numerous organs, although only a few organs are considered classical targets for estrogens. A systematic survey of estrogen regulation of approximately 10,000 genes in 13 tissues from wild type and ERβKO mice treated subcutaneously with vehicle or 17β-estradiol (E2) for six weeks was conducted. As expected, the uterus and pituitary had the greatest number of genes regulated by E2, while, surprisingly, the kidney had the third largest number of regulated genes. Some of these kidney regulations may provide mechanisms for known physiological effects of estrogens. For example, E2 induction of CYP27B1, the rate limiting enzyme in the synthesis of 1,25-dihydroxy vitamin D, may explain the ability of estrogens to decrease urinary calcium excretion in women. In situ hybridizations localized E2 regulation in the kidney to the juxtamedullary proximal and distal collecting tubule epithelial cells in both the mouse and rat. E2 regulations in the kidney were intact in the ERβKO mice, and the ERα selective agonist propyl pyrazole triol acted similarly as E2, together suggesting an ERα mediated mechanism. Finally, the combination of the AF1-selective agonist tamoxifen plus mice expressing an AF1-deleted version of ERα (previously designated as ERα knockouts) allowed clear identification of genes dependent upon ERα AF1 activity and genes dependent upon ERα AF2 activity. Both AF1 and AF2 dependent genes were stimulated by E2 with the same ED50, indicating that sensitivity of gene regulation in the kidney depends upon ER ligand binding and not on the subsequent ER activation mechanisms.
Animal-Related Procedure
Animals—Wldtype 129 strain female mice or Sprague-Dawley rats (bred at Wyeth or obtained from Taconic Farms) were placed on a casein-based diet at approximately 6 weeks of age. One week later, the animals were ovariectomized. Commencing the day after ovariectomy, each animal received a daily subcutaneous treatment with vehicle (50% DMSO, 50% phosphate buffered saline) or vehicle containing treatments for six weeks. Each group consisted of six or seven animals. Approximately 2 hours following the final treatment, the animals were euthanized with selected tissues frozen in liquid nitrogen for RNA analysis or on dry ice for histology.
GeneChip—Total RNA was prepared separately from each individual organ by using Trizol (Invitrogen) followed by subsequent repurification on Rneasy columns (Qiagen). In general, two pools of RNA were created using equal amounts of RNA from three mice. For small organs such as pituitary, an equal amount of RNA from six animals was combined.
Target Preparation and Array Hybridization—Total RNA was used to generate biotin labeled cRNA target as described (8) which was hybridized to the murine MG_U74Av2 probe arrays (Affymetrix, Santa Clara, Calif.) for 16 h at 45° C. Eleven biotin-labeled cRNAs at defined concentration were spiked into each hybridization and used to convert average difference values to frequencies expressed as parts per million.
Pairwise comparisons were made between each of the treatments. We calculated the fold change ratio, the p-value based on Student's t-test, the number of present calls, and the expression level for each comparison. A confidence score (CS) was defined as CS(x)=FC(x)+PV(x)+PC(x)+EL(x) where FC, PV, PC and EL are scores assigned to the fold change, p-value, number of present calls, and the expression level, respectively. FC(x) was assigned 5 if the fold change ratio was greater then 1.95 and was assigned 0 if the ratio was between 1.95 and 1.5. PV(x) was assigned 3 if the p-value was less then 0.05 and was assigned 2 if the p-value was between 0.05 and 0.1. PC(x) was assigned 3 if at least 50% of the samples are called P by the Affymetrix algorithm and assigned 1 if only 25% of the samples are called P. EL(x) was assigned 3 if at least two samples had a frequency value of 20 or greater and assigned 1 if two samples only had a frequency greater then 15. Penalty points were assigned if the fold change was less then 1.5, the p-value was greater then 0.2 or the frequency values were below 15 ppm. CS(x) ranged for −14 to 14 with qualifiers having a score of 14 considered the most significant changes. Genes with 11 or more points in any one pairwise comparison were considered to be significant and were included for further analysis. Real-time PCR on individual RNA samples and histology analyses were performed essentially as described previously (9, 10).
To begin a systematic survey of estrogen receptor regulation of gene expression in the mouse, ovariectomized wild-type (WT) and ERβKO mice were treated by daily subcutaneous administration of either vehicle or 20 μg/kg/day 17β-estradiol (E2) for six weeks. RNA prepared from 13 organs was analyzed by microarray for estrogen regulation of gene expression. The resulting data set was queried for genes whose regulation was dependent on ERα or ERβ. For ERα regulation, the basal expression level was predicted to be the same in WT and ERβKO mice, with E2 induction or suppression occurring in both WT and ERβKO mice. (
Surprisingly, the kidney had a very large number of genes regulated at least 2-fold by E2 (26 inductions, 4 repressions;
The ED50s for E2 stimulation of CYP7B1, TF, STAT5A, and BCAT1 in the kidney were all very similar at about 3 μg/kg/day (
Confirmation of the role of ERα in the induction of kidney gene expression was obtained with 4-propyl-1,3,5-Tris(4-hydroxy-phenyl)pyrazole (PPT) a compound which exclusively activates ERα but not ERβ (11). Treatment with PPT induced expression of CYP7B1, TF, STAT5A and BCAT1 to a similar extent as did treatment with E2 (
It has been recognized that a commonly utilized strain of ERαKO mice (12) in fact expresses an ERα protein lacking only AF1, due to alternative splicings of the exon containing the targeted knockout mutation(13, 14). The resulting truncated ERα proteins, referred to here as ΔAF1-ERα, have the ability to stimulate expression of a synthetic estrogen response element driven promoter (14). As found previously for ERαKO mice, the level of this misspliced transcript in the uterus of ERαERβKO mice was lower than the level of full length message in WT mice (
The presence of ΔAF1-ERα at significant levels in the kidney allows determination of the relative contribution of AF1 and AF2 to E2 regulation of individual genes. To determine whether AF1 or AF2 regions of ERα were required for induction of CYP7B1, TF or BCAT1 in the kidney, WT, ERαERβKO or ERαKO mice were treated with E2 or the AF1 selective agonist tamoxifen (15). Expression of CYP7B1 was induced by E2 but not by tamoxifen in WT mice (
Estrogen receptors α or β are found in almost all organs of the body, yet relatively few tissues are considered targets for estrogen action. To begin to develop a more complete understanding of estrogen biology, estrogen responsive genes in 13 tissues from WT and ERβKO mice were characterized. In general, many tissues showed patterns of E2 regulation consistent with an ERα mechanism, including such known target organs as uterus, pituitary, and bone. Surprisingly, no E2 regulations were found that fit the expected pattern for ERβ regulations. This was true even in organs expressing moderately high levels of ERβ such as the bladder and lung (7). At least three mechanisms could explain the lack of detection of expected ERβ responses. First, it has been proposed that a major function of ERβ is to modulate the activity of ERα(16). For example, expression of the Ki-67 protein was constitutively elevated in uterus of ERβKO mice, i.e. in the ERβKO mice its expression was always equivalent to the E2 stimulated levels in WT animals (17). The survey criteria used here would not detect this pattern. Further analysis of these data has revealed many genes in multiple tissues which also have this “nonclassical” pattern of regulation whereby expression is constitutively elevated in both vehicle and E2 treated ERβKO mice (data not shown). Second, analysis of whole organs may easily miss regulations occurring in only selected cell subtypes within an organ. For example, initial analysis of kidney did not identify GADD45G as being regulated by E2, because GADD45G expression is regulated only in tubule epithelial cells. The unregulated expression of GADD45G throughout most of the kidney sufficiently diminished the fold induction so as to be less than 2-fold in whole organ samples. The combination of laser capture microdissection with microarray technology (18) should allow detection of ERβ regulated genes with a classical pattern of regulation.
This global survey demonstrates that, unexpectedly, the kidney had a very large number of regulated genes. Both genetic approaches (
Although the observed regulations in the kidney were mediated by ERα, the mechanism of activation of gene expression by ERα was gene specific. Thus studies using tamoxifen, which activates ERα through AF1, along with studies using ΔAF1-ERαKO mice (previously designated as ERKO mice) together indicate that E2 induction of CYP7B1 expression occurred predominantly through an AF1-dependent mechanism, E2 induction of TF expression occurred predominantly through an AF2-dependent mechanism, and E2 induction of BCAT1 expression occurred through both AF1 and AF2 mechanisms (
Analysis of 10 kb of upstream putative promoter sequences of E2 induced genes identified good matches to the consensus estrogen response element (ERE) in only a few genes, although ERE half-sites could be identified in most promoters. Many of these genes may be activated through nonclassical ERα mechanisms such as the combination of an ERE half-site with Sp1 binding sites (41). It is unlikely that a nonclassical ERα/AP1 stimulatory mechanism is responsible for these regulations, since ICI182780 functions as a partial agonist in this mechanism (42) while ICI182780 was a complete antagonist for E2 regulation of gene expression in the kidney (
musculus cDNA, 3′ end/clone = UI-M-AN1-afi-h-09-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH2.2-aow-f-03-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH1-alj-d-10-0-UI/
musculus cDNA, 3′ end/clone = UI-M-AQ0-aag-a-02-0-UI/
musculus cDNA, 3′ end/clone = UI-M-AO0-acc-d-08-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH2.1-apm-e-08-0-UI/
thaliana]
musculus cDNA, 3′ end/clone = UI-M-AK0-adc-e-09-0-UI/
musculus cDNA, 3′ end/clone = UI-M-AO1-ael-f-02-0-UI/
Mus musculus epithelial protein lost in neoplasm-a (Eplin)
musculus cDNA, 3′ end/clone = UI-M-AK1-aex-d-02-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH1-ann-d-07-0-UI/
Mus musculus calmodulin III (Calm3) mRNA, 3′ untranslated
musculus cDNA, 3′ end/clone = UI-M-BH2.1-apd-g-08-0-UI/
pombe]
musculus cDNA, 3′ end/clone = UI-M-BH2.3-aqh-h-05-0-UI/
Mus musculus (strain C57BI/6) mRNA sequence
norvegicus]
M. musculus TOP gene for topoisomerase I, exons 19-21
Mus musculus HSCO mRNA, complete cds
musculus cDNA, 3′ end/clone = UI-M-AO0-abz-c-02-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH0-aji-a-10-0-UI/
MELANOGASTER]
musculus cDNA, 3′ end/clone = UI-M-AO0-acd-e-10-0-UI/
cuniculus]
musculus cDNA, 3′ end/clone = UI-M-AK1-aeu-g-04-0-UI/
cerevisiae MAF1 protein [C. elegans]
musculus cDNA, 3′ end/clone = UI-M-BH2.1-apy-h-03-0-UI/
musculus cDNA, 3′ end/clone = UI-M-BH1-akt-a-10-0-UI/
Number | Date | Country | Kind |
---|---|---|---|
60371682 | Apr 2002 | US | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US03/11240 | 4/10/2003 | WO | 4/20/2005 |