The invention relates to the identification of expression profiles and the nucleic acids indicative of gastrointestinal-related disorders, such as ulcerative colitis, and to the use of such expression profiles and nucleic acids in diagnosis of ulcerative colitis and related diseases. The invention further relates to methods for identifying, using, and testing candidate agents and/or targets which modulate ulcerative colitis.
Ulcerative colitis (UC) is a multifactorial autoimmune disease with a complex pathogenesis involving unidentified genetic, microbial, and environmental factors. Recent studies using microarray analysis of inflamed colonoscopic tissue biopsy vs. non-inflamed biopsy samples from UC patients revealed dysregulation of a few inflammatory cytokines (1), however, the etiology, pathogenesis, and role of tumor necrosis factor-alpha (TNFα) in UC is still poorly understood. TNFα is a critical proinflammatory cytokine in Crohn's disease as demonstrated by the therapeutic effect of infliximab on the induction and maintenance of clinical remission, closure of enterocutaneous, perianal, and rectovaginal fistulas, maintenance of fistula closure, and steroid tapering in Crohn's disease patients (2-5). However, the evidence to support a role of TNFα in the pathogenesis of UC has been controversial (6-10) despite the fact that it is also found at increased levels in the blood, colonic tissue, and stools of UC patients (11-13). A recent clinical study (ACT-1) by Rutgeerts et al. showed that infliximab is effective when administered at weeks 0, 2, 6 and every 8 weeks thereafter in achieving clinical response and remission in patients with moderate-to-severe active UC despite the use of conventional therapy supporting a critical pathogenic role of TNFα in UC.
Microarray technology is a powerful tool since it enables analysis of the expression of thousands of genes simultaneously and can also be automated allowing for a high-throughput format. In diseases associated with complex host functions, such as those known as immune mediated inflammatory diseases, such as UC, microarray results can provide a gene expression profile that can be of utility in designing new approaches to disease diagnosis and management. These approaches also serve to identify novel genes and annotating genes of unknown function heretofore unassociated with the disease or condition. Accordingly, there is a need to identify and characterize new gene markers useful in developing methods for diagnosing and treating autoimmune disorders, such as UC and Crohn's disease, as well as other diseases and conditions and how a patient would respond to a therapeutic intervention.
Gene expression can be modulated in several different ways, including by the use of siRNAs, shRNAs, antisense molecules and DNAzymes. SiRNAs and shRNAs both work via the RNAi pathway and have been successfully used to suppress the expression of genes. RNAi was first discovered in worms and the phenomenon of gene silencing related to dsRNA was first reported in plants by Fire and Mello and is thought to be a way for plant cells to combat infection with RNA viruses. In this pathway, the long dsRNA viral product is processed into smaller fragments of 21-25 bp in length by a DICER-like enzyme and then the double-stranded molecule is unwound and loaded into the RNA induced silencing complex (RISC). A similar pathway has been identified in mammalian cells with the notable difference that the dsRNA molecules must be smaller than 30 bp in length in order to avoid the induction of the so-called interferon response, which is not gene specific and leads to the global shut down of protein synthesis in the cell.
Synthetic siRNAs have been successfully designed to selectively target a single gene and can be delivered to cells in vitro or in vivo. ShRNAs are the DNA equivalents of siRNA molecules and have the advantage of being incorporated into a cells' genome where they are replicated during every mitotic cycle.
DNAzymes have also been used to modulate gene expression. DNAzymes are catalytic DNA molecules that cleave single-stranded RNA. They are highly selective for the target RNA sequence and as such can be used to down-regulate specific genes through targeting of the messenger RNA.
Accordingly, there is a need to identify and characterize new gene markers useful in developing methods for diagnosing and treating autoimmune disorders, such as UC and Crohn's disease, as well as other diseases and conditions.
The present invention relates to a method of diagnosing and/or treating UC and/or related diseases or disorders and predicting the suitability of candidate agents for treatment. The present invention includes the discovery of panels of genes, one of 19 genes and one of five genes, that have modified expression levels in patients responsive to treatment for UC (effective in reducing the symptoms of UC) versus patients nonresponsive to treatment. The modified expression levels constitute a profile that can serve as a biomarker profile predictive of a patient's responsiveness to treatment.
In a particular embodiment, the present invention comprises a method of predicting the suitability of a treatment for UC based on the pattern of gene expression of one or more of the 19 genes which constitute the profile prior to treatment. One or more of these genes may be from a category of genes, such as those involved in nucleotide acid binding, ATP binding, transferase activity, proteolysis, oxidoreductase activity, ubiquitin thiolesterase activity, replication of proteins, signal transduction, and regulation of transcription, and the like. In a typical embodiment, the cell specimen expresses at least two expression profile genes. The profile genes may show an increase or decrease.
In addition, the present invention comprises a method of identifying subjects with UC and/or related diseases or disorders that are candidates for treatment with a particular therapeutic agent by evaluating their expression profile of one or more genes of the 19- or 5-gene panel.
In one embodiment, the UC-related gene profile is used to create an array-based method for prognostic or diagnostic purposes, the method comprising:
Optionally, statistical analysis is performed on the changes in levels of members of the gene panel to evaluate the significance of these changes and to identify which members are meaningful members of the panel.
In an alternative embodiment, the present invention comprises a kit for predicting the suitability of candidate agents for treating UC and/or related diseases or disorders based on the pattern of gene expression.
The present invention further provides any invention described herein.
The following definitions are set forth to illustrate and define the meaning and scope of various terms used to describe the invention herein.
An “activity,” a biological activity, and a functional activity of a polypeptide refers to an activity exerted by a gene of the UC-related gene panel in response to its specific interaction with another protein or molecule as determined in vivo, in situ, or in vitro, according to standard techniques. Such activities can be a direct activity, such as an association with or an enzymatic activity on a second protein, or an indirect activity, such as a cellular process mediated by interaction of the protein with a second protein or a series of interactions as in intracellular signaling or the coagulation cascade.
An “antibody” includes any polypeptide or peptide containing molecule that comprises at least a portion of an immunoglobulin molecule, such as but not limited to, at least one complementarity determining region (CDR) of a heavy or light chain or a ligand binding portion thereof, a heavy chain or light chain variable region, a heavy chain or light chain constant region, a framework region, or any portion, fragment or variant thereof. The term “antibody” is further intended to encompass antibodies, digestion fragments, specified portions and variants thereof, including antibody mimetics or comprising portions of antibodies that mimic the structure and/or function of an antibody or specified fragment or portion thereof, including single chain antibodies and fragments thereof. For example, antibody fragments include, but are not limited to, Fab (e.g., by papain digestion), Fab′ (e.g., by pepsin digestion and partial reduction) and F(ab′)2 (e.g., by pepsin digestion), facb (e.g., by plasmin digestion), pFc′ (e.g., by pepsin or plasmin digestion), Fd (e.g., by pepsin digestion, partial reduction and reaggregation), Fv or scFv (e.g., by molecular biology techniques) fragments, and single domain antibodies (e.g., VH or VL), are encompassed by the invention (see, e.g., Colligan, et al., eds., Current Protocols in Immunology, John Wiley & Sons, Inc., NY (1994-2001); Colligan et al., Current Protocols in Polypeptide Science, John Wiley & Sons, NY (1997-2001)).
The terms “array” or “microarray” or “biochip” or “chip” as used herein refer to articles of manufacture or devices comprising a plurality of immobilized target elements, each target element comprising a “clone,” “feature,” “spot” or defined area comprising a particular composition, such as a biological molecule, e.g., a nucleic acid molecule or polypeptide, immobilized to a solid surface, as discussed in further detail, below.
“Complement of” or “complementary to” a nucleic acid sequence of the invention refers to a polynucleotide molecule having a complementary base sequence and reverse orientation as compared to a first polynucleotide.
“Identity,” as known in the art, is a relationship between two or more polypeptide sequences or two or more polynucleotide sequences, as determined by comparing the sequences. In the art, “identity” also means the degree of sequence relatedness between polypeptide or polynucleotide sequences, as determined by the match between strings of such sequences. “Identity” and “similarity” can be readily calculated by known methods, including, but not limited to, those described in Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., Siam J. Applied Math., 48:1073 (1988). In addition, values for percentage identity can be obtained from amino acid and nucleotide sequence alignments generated using the default settings for the AlignX component of Vector NTI Suite 8.0 (Informax, Frederick, Md.).
The terms “specifically hybridize to,” “hybridizing specifically to,” “specific hybridization” and “selectively hybridize to,” as used herein refer to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions. The term “stringent conditions” refers to conditions under which a probe will hybridize preferentially to its target subsequence; and to a lesser extent to, or not at all to, other sequences. A “stringent hybridization” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization (e.g., as in array, Southern or Northern hybridizations) are sequence dependent, and are different under different environmental parameters. Alternative hybridization conditions that can be used to practice the invention are described in detail, below. In alternative aspects, the hybridization and/or wash conditions are carried out under moderate conditions, stringent conditions and very stringent conditions, as described in further detail, below. Alternative wash conditions are also used in different aspects, as described in further detail, herein.
The phrases “labeled biological molecule” or “labeled with a detectable composition” or “labeled with a detectable moiety” as used herein refer to a biological molecule, e.g., a nucleic acid, comprising a detectable composition, i.e., a label, as described in detail, below. The label can also be another biological molecule, as a nucleic acid, e.g., a nucleic acid in the form of a stem-loop structure as a “molecular beacon,” as described below. This includes incorporation of labeled bases (or, bases which can bind to a detectable label) into the nucleic acid by, e.g., nick translation, random primer extension, amplification with degenerate primers, and the like. Any label can be used, e.g., chemiluminescent labels, radiolabels, enzymatic labels and the like. The label can be detectable by any means, e.g., visual, spectroscopic, photochemical, biochemical, immunochemical, physical, chemical and/or chemiluminescent detection. The invention can use arrays comprising immobilized nucleic acids comprising detectable labels.
The term “nucleic acid” as used herein refers to a deoxyribonucleotide (DNA) or ribonucleotide (RNA) in either single- or double-stranded form. The term encompasses nucleic acids containing known analogues of natural nucleotides. The term nucleic acid is used interchangeably with gene, DNA, RNA, cDNA, mRNA, oligonucleotide primer, probe and amplification product. The term also encompasses DNA backbone analogues, such as phosphodiester, phosphorothioate, phosphorod ithioate, methyl phosphonate, phosphoramidate, alkyl phosphotriester, sulfamate, 3′-thioacetal, methylene (methylimino), 3′-N-carbamate, morpholino carbamate, and peptide nucleic acids (PNAs).
The terms “sample” or “sample of nucleic acids” as used herein refer to a sample comprising a DNA or RNA, or nucleic acid representative of DNA or RNA isolated from a natural source. A “sample of nucleic acids” is in a form suitable for hybridization (e.g., as a soluble aqueous solution) to another nucleic acid (e.g., immobilized probes). The sample nucleic acid may be isolated, cloned, or extracted from particular cells or tissues. The cell or tissue sample from which the nucleic acid sample is prepared is typically taken from a patient having or suspected of having UC or a related disease or condition. Methods of isolating cell and tissue samples are well known to those of skill in the art and include, but are not limited to, aspirations, tissue sections, needle biopsies, and the like. Frequently the sample will be a “clinical sample” which is a sample derived from a patient, including sections of tissues such as frozen sections or paraffin sections taken for histological purposes. The sample can also be derived from supernatants (of cells) or the cells themselves taken from patients or from cell cultures, cells from tissue culture and other media in which it may be desirable to detect the response to drug candidates. In some cases, the nucleic acids may be amplified using standard techniques such as PCR, prior to the hybridization. The probe an be produced from and collectively can be representative of a source of nucleic acids from one or more particular (pre-selected) portions of, e.g., a collection of polymerase chain reaction (PCR) amplification products, substantially an entire chromosome or a chromosome fragment, or substantially an entire genome, e.g., as a collection of clones, e.g., BACs, PACs, YACs, and the like (see below).
“Nucleic acids” are polymers of nucleotides, wherein a nucleotide comprises a base linked to a sugar which sugars are in turn linked one to another by an interceding at least bivalent molecule, such as phosphoric acid. In naturally occurring nucleic acids, the sugar is either 2′-deoxyribose (DNA) or ribose (RNA). Unnatural poly- or oliogonucleotides contain modified bases, sugars, or linking molecules, but are generally understood to mimic the complementary nature of the naturally occurring nucleic acids after which they are designed. An example of an unnatural oligonucleotide is an antisense molecule composition that has a phosphorothiorate backbone. An “oligonucleotide” generally refers to a nucleic acid molecule having less than 30 nucleotides.
The term “profile” means a pattern and relates to the magnitude and direction of change of a number of features. The profile may be interpreted stringently, i.e., where the variation in the magnitude and/or number of features within the profile displaying the characteristic is substantially similar to a reference profile or it may be interpreted less stringently, for example, by requiring a trend rather than an absolute match of all or a subset of feature characteristics.
The terms “protein,” “polypeptide,” and “peptide” include “analogs,” or “conservative variants” and “mimetics” or “peptidomimetics” with structures and activity that substantially correspond to the polypeptide from which the variant was derived, as discussed in detail above.
A “polypeptide” is a polymer of amino acid residues joined by peptide bonds, and a peptide generally refers to amino acid polymers of 12 or less residues. Peptide bonds can be produced naturally as directed by the nucleic acid template or synthetically by methods well known in the art.
A “protein” is a macromolecule comprising one or more polypeptide chains. A protein may further comprise substituent groups attached to the side groups of the amino acids not involved in formation of the peptide bonds. Typically, proteins formed by eukaryotic cell expression also contain carbohydrates. Proteins are defined herein in terms of their amino acid sequence or backbone and substituents are not specified, whether known or not.
The term “receptor” denotes a molecule having the ability to affect biological activity, in e.g., a cell, as a result of interaction with a specific ligand or binding partner. Cell membrane bound receptors are characterized by an extracellular ligand-binding domain, one or more membrane spanning or transmembrane domains, and an intracellular effector domain that is typically involved in signal transduction. Ligand binding to cell membrane receptors causes changes in the extracellular domain that are communicated across the cell membrane, direct or indirect interaction with one or more intracellular proteins, and alters cellular properties, such as enzyme activity, cell shape, or gene expression profile. Receptors may also be untethered to the cell surface and may be cytosolic, nuclear, or released from the cell altogether. Non-cell associated receptors are termed soluble receptors or ligands.
All publications or patents cited herein are entirely incorporated herein by reference, whether or not specifically designated accordingly, as they show the state of the art at the time of the present invention and/or provide description and enablement of the present invention. Publications refer to any scientific or patent publications, or any other information available in any media format, including all recorded, electronic or printed formats. The following references are entirely incorporated herein by reference: Ausubel, et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., NY (1987-2001); Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor, NY (1989); Harlow and Lane, antibodies, a Laboratory Manual, Cold Spring Harbor, N.Y. (1989); Colligan, et al., eds., Current Protocols in Immunology, John Wiley & Sons, Inc., NY (1994-2001); Colligan et al., Current Protocols in Protein Science, John Wiley & Sons, NY (1997-2001).
The present invention provides novel methods for screening for compositions which modulate the symptoms of UC, particularly the mucosal layer of the rectum and all or part of the colon. By “UC” or grammatical equivalents as used herein, is meant a disease state or condition which is marked by diarrhea, rectal bleeding, tenesmus, passage of mucus, and crampy abdominal pain.
In one aspect, the expression levels of genes are determined in different patient samples for which diagnosis information is desired, to provide expression profiles. An expression profile of a particular sample is essentially a “fingerprint” of the state of the sample; while two states may have any particular gene similarly expressed, the evaluation of a number of genes simultaneously allows the generation of a gene expression profile that is unique to the state of the patient sample. That is, normal tissue may be distinguished from lesion tissue and tissue from a treated patient may be distinguished from an untreated patient. By comparing expression profiles of tissue in different disease states that are known, information regarding which genes are important (including both up- and down-regulation of genes) in each of these states is obtained.
The identification of sequences (genes) that are differentially expressed in disease tissue allows the use of this information in a number of ways. For example, the evaluation of a particular treatment regime may be evaluated.
This may be done by making biochips comprising sets of the important disease genes, which can then be used in these screens. These methods can also be performed on the protein basis; that is, protein expression levels of the UC-related gene product proteins can be evaluated for diagnostic purposes or to screen candidate agents. In addition, the nucleic acid sequences comprising the UC-related gene profile can be used to measure whether a patient is likely to respond to a therapeutic prior to treatment.
UC-related gene sequences can include both nucleic acid and amino acid sequences. In a preferred embodiment, the UC-related gene sequences are recombinant nucleic acids. By the term “recombinant nucleic acid” herein is meant nucleic acid, originally formed in vitro, in general, by the manipulation of nucleic acid by polymerases and endonucleases, in a form not normally found in nature. Thus, an isolated nucleic acid, in a linear form, or an expression vector formed in vitro by ligating DNA molecules that are not normally joined, are both considered recombinant for the purposes of this invention. It is understood that once a recombinant nucleic acid is made and reintroduced into a host cell or organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the host cell rather than in vitro manipulations; however, such nucleic acids, once produced recombinantly, although subsequently replicated non-recombinantly, are still considered recombinant for the purposes of the invention.
The invention provides in silico, array-based methods relying on the relative amount of a binding molecule (e.g., nucleic acid sequence) in two or more samples. Also provided are computer- implemented methods for determining the relative amount of a binding molecule (e.g., nucleic acid sequence) in two or more samples and using the determined relative binding amount to predict responsiveness to a particular therapy, and monitor and enhance therapeutic treatment.
In practicing the methods of the invention, two or more samples of labeled biological molecules (e.g., nucleic acid) are applied to two or more arrays, where the arrays have substantially the same complement of immobilized binding molecule (e.g., immobilized nucleic acid capable of hybridizing to labeled sample nucleic acid). The two or more arrays are typically multiple copies of the same array. However, because each “spot,” “clone” or “feature” on the array has similar biological molecules (e.g., nucleic acids of the same sequence) and the biological molecules (e.g., nucleic acid) in each spot is known, as is typical of nucleic acid and other arrays, it is not necessary that the multiple arrays used in the invention be identical in configuration it is only necessary that the position of each feature on the substrate be known, that is, have an address. Thus, in one aspect, multiple biological molecules (e.g., nucleic acid) in samples are comparatively bound to the array (e.g., hybridized simultaneously) and the information gathered is coded so that the results are based on the inherent properties of the feature (e.g., the nucleic acid sequence) and not it's position on the substrate.
Amplification of Nucleic Acids
Amplification using oligonucleotide primers can be used to generate nucleic acids used in the compositions and methods of the invention, to detect or measure levels of test or control samples hybridized to an array, and the like. The skilled artisan can select and design suitable oligonucleotide amplification primers. Amplification methods are also well known in the art, and include, e.g., polymerase chain reaction, PCR (PCR PROTOCOLS, A GUIDE TO METHODS AND APPLICATIONS, ed. Innis, Academic Press, N.Y. (1990) and PCR STRATEGIES (1995), ed. Innis, Academic Press, Inc., N.Y., ligase chain reaction (LCR) (see, e.g., Wu (1989) Genomics 4:560; Landegren (1988) Science 241:1077; Barringer (1990) Gene 89:117); transcription amplification (see, e.g., Kwoh (1989) Proc. Natl. Acad. Sci. USA 86:1173); and, self-sustained sequence replication (see, e.g., Guatelli (1990) Proc. Natl. Acad. Sci. USA 87:1874); Q Beta replicase amplification (see, e.g., Smith (1997) J. Clin. Microbiol. 35:1477-1491), automated Q-beta replicase amplification assay (see, e.g., Burg (1996) Mol. Cell. Probes 10:257-271) and other RNA polymerase mediated techniques (e.g., NASBA, Cangene, Mississauga, Ontario); see also Berger (1987) Methods Enzymol. 152:307-316; Sambrook; Ausubel; U.S. Pat. Nos. 4,683,195 and 4,683,202; Sooknanan (1995) Biotechnology 13:563-564.
Hybridizing Nucleic Acids
In practicing the methods of the invention, test and control samples of nucleic acid are hybridized to immobilized probe nucleic acid, e.g., on arrays. In alternative aspects, the hybridization and/or wash conditions are carried out under moderate conditions, stringent conditions and very stringent conditions. An extensive guide to the hybridization of nucleic acids is found in, e.g., Sambrook Ausubel, Tijssen. Generally, highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on an array or a filter in a Southern or northern blot is 42° C. using standard hybridization solutions (see, e.g., Sambrook), with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.15 M NaCl at 72° C. for about 15 minutes. An example of stringent wash conditions is a 0.2×SSC wash at 65° C. for 15 minutes (see, e.g., Sambrook). Often, a high stringency wash is preceded by a medium or low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 1×SSC at 45° C. for 15 minutes. An example of a low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4× to 6×SSC at 40° C. for 15 minutes.
In alternative aspects of the compositions and methods of the invention, e.g., in practicing comparative nucleic acid hybridization, such as comparative genomic hybridization (CGH) with arrays, the fluorescent dyes Cy3® and Cy5® are used to differentially label nucleic acid fragments from two samples, e.g., the array-immobilized nucleic acid versus the sample nucleic acid, or, nucleic acid generated from a control versus a test cell or tissue. Many commercial instruments are designed to accommodate the detection of these two dyes. To increase the stability of Cy5®, or fluors or other oxidation-sensitive compounds, antioxidants and free radical scavengers can be used in hybridization mixes, the hybridization and/or the wash solutions. Thus, Cy5® signals are dramatically increased and longer hybridization times are possible. See WO 0194630 A2 and U.S. Patent Application No. 20020006622.
To further increase the hybridization sensitivity, hybridization can be carried out in a controlled, unsaturated humidity environment; thus, hybridization efficiency is significantly improved if the humidity is not saturated. See WO 0194630 A2 and U.S. Patent Application No. 20020006622. The hybridization efficiency can be improved if the humidity is dynamically controlled, i.e., if the humidity changes during hybridization. Mass transfer will be facilitated in a dynamically balanced humidity environment. The humidity in the hybridization environment can be adjusted stepwise or continuously. Array devices comprising housings and controls that allow the operator to control the humidity during pre-hybridization, hybridization, wash and/or detection stages can be used. The device can have detection, control and memory components to allow pre-programming of the humidity and temperature controls (which are constant and precise or which flucturate), and other parameters during the entire procedural cycle, including pre-hybridization, hybridization, wash and detection steps. See WO 0194630 A2 and U.S. Patent Application No. 20020006622.
The methods of the invention can comprise hybridization conditions comprising osmotic fluctuation. Hybridization efficiency (i.e., time to equilibrium) can also be enhanced by a hybridization environment that comprises changing hyper-/hypo-tonicity, e.g., a solute gradient. A solute gradient is created in the device. For example, a low salt hybridization solution is placed on one side of the array hybridization chamber and a higher salt buffer is placed on the other side to generate a solute gradient in the chamber. See WO 0194630 A2 and U.S. Patent Application No. 20020006622.
Blocking the Ability of Repetitive Nucleic Acid Sequences to Hybridize
The methods of the invention can comprise a step of blocking the ability of repetitive nucleic acid sequences to hybridize (i.e., blocking “hybridization capacity”) in the immobilized nucleic acid segments. The hybridization capacity of repetitive nucleic acid sequences in the sample nucleic acid sequences can be blocked by mixing sample nucleic acid sequences with unlabeled or alternatively labeled repetitive nucleic acid sequences. Sample nucleic acid sequences can be mixed with repetitive nucleic acid sequences before the step of contacting with the array-immobilized nucleic acid segments. Blocking sequences are for example, Cot-1 DNA, salmon sperm DNA, or specifc repetitive genomic sequences. The repetitive nucleic acid sequences can be unlabeled. A number of methods for removing and/or disabling the hybridization capacity of repetitive sequences using, e.g., Cot-1 are known; see, e.g., Craig (1997) Hum. Genet. 100:472-476; WO 93/18186. Repetitive DNA sequences can be removed from library probes by means of magnetic purification and affinity PCR, see, e.g., Rauch (2000) J. Biochem. Biophys. Methods 44:59-72.
Arrays are generically a plurality of target elements immobilized onto the surface of the plate as defined “spots” or “clusters,” or “features,” with each target element comprising one or more biological molecules (e.g., nucleic acids or polypeptides) immobilized to a solid surface for specific binding (e.g., hybridization) to a molecule in a sample. The immobilized nucleic acids can contain sequences from specific messages (e.g., as cDNA libraries) or genes (e.g., genomic libraries), including a human genome. Other target elements can contain reference sequences and the like. The biological molecules of the arrays may be arranged on the solid surface at different sizes and different densities. The densities of the biological molecules in a cluster and the number of clusters on the array will depend upon a number of factors, such as the nature of the label, the solid support, the degree of hydrophobicity of the substrate surface, and the like. Each feature may comprise substantially the same biological molecule (e.g., nucleic acid), or, a mixture of biological molecules (e.g., nucleic acids of different lengths and/or sequences). Thus, for example, a feature may contain more than one copy of a cloned piece of DNA, and each copy may be broken into fragments of different lengths.
Array substrate surfaces onto which biological molecules (e.g., nucleic acids) are immobilized can include nitrocellulose, glass, quartz, fused silica, plastics and the like, as discussed further, below. The compositions and methods of the invention can incorporate in whole or in part designs of arrays, and associated components and methods, as described, e.g., in U.S. Pat. Nos. 6,344,316; 6,197,503; 6,174,684; 6,159,685; 6,156,501; 6,093,370; 6,087,112; 6,087,103; 6,087,102; 6,083,697; 6,080,585; 6,054,270; 6,048,695; 6,045,996; 6,022,963; 6,013,440; 5,959,098; 5,856,174; 5,843,655; 5,837,832; 5,770,456; 5,723,320; 5,700,637; 5,695, 940; 5,556,752; 5,143,854; see also, e.g., WO 99/51773; WO 99/09217; WO 97/46313; WO 96/17958; WO 89/10977; see also, e.g., Johnston (1998) Curr. Biol. 8:R171-174; Schummer (1997) Biotechniques 23:1087-1092; Kern (1997) Biotechniques 23:120-124; Solinas- Toldo (1997) Genes, Chromosomes & Cancer 20:399-407; Bowtell (1999) Nature Genetics Supp. 21:25-32; Epstein (2000) Current Opinion in Biotech. 11:36-41; Mendoza (1999 Biotechniques 27: 778-788; Lueking (1999) Anal. Biochem. 270:103-111; Davies (1999) Biotechniques 27:1258-1261.
Substrate Surfaces
Substrate surfaces that can be used in the compositions and methods of the invention include, for example, glass (see, e.g., U.S. Pat. No. 5,843,767), ceramics, and quartz. The arrays can have substrate surfaces of a rigid, semi-rigid or flexible material. The substrate surface can be flat or planar, be shaped as wells, raised regions, etched trenches, pores, beads, filaments, or the like. Substrate surfaces can also comprise various materials such as nitrocellulose, paper, crystalline substrates (e.g., gallium arsenide), metals, metalloids, polacryloylmorpholide, various plastics and plastic copolymers, Nylon®, Teflon®, polyethylene, polypropylene, latex, polymethacrylate, poly (ethylene terephthalate), rayon, nylon, poly(vinyl butyrate), and cellulose acetate. The substrates may be coated and the substate and the coating may be functionalized to, e.g., enable conjugation to an amine.
Arrays Comprising Calibration Sequences
The invention comtemplates the use of arrays comprising immobilized calibration sequences for normalizing the results of array-based hybridization reactions, and methods for using these calibration sequences, e.g., to determine the copy number of a calibration sequence to “normalize” or “calibrate” ratio profiles. The calibration sequences can be substantially the same as a unique sequence in an immobilized nucleic acid sequence on an array. For example, a “marker” sequence from each “spot” or “biosite” on an array (which is present only on that spot, making it a “marker” for that spot) is represented by a corresponding sequence on one or more “control” or “calibration” spot(s).
The “control spots” or “calibration spots” are used for “normalization” to provide information that is reliable and repeatable. Control spots can provide a consistent result independent of the labeled sample hybridized to the array (or a labeled binding molecule from a sample). The control spots can be used to generate a “normalization” or “calibration” curve to offset possible intensity errors between the two arrays (or more) used in the in silico, array-based methods of the invention.
One method of generating a control on the array would be to use an equimolar mixture of all the biological molecules (e.g., nucleic acid sequences) spotted on the array and generating a single spot. This single spot would have equal amounts of the biological molecules (e.g., nucleic acid sequences) from all the other spots on the array. Multiple control spots can be generated by varying the concentration of the equimolar mixture.
Samples and Specimens
The sample nucleic acid may be isolated, cloned, or extracted from particular cells, tissues, or other specimens. The cell or tissue sample from which the nucleic acid sample is prepared is typically taken from a patient having or suspected of having UC or a related condition. Methods of isolating cell and tissue samples are well known to those of skill in the art and include, but are not limited to, aspirations, tissue sections, needle biopsies, and the like. Frequently, the sample will be a “clinical sample” which is a sample derived from a patient, including whole blood, or sections of tissues, such as frozen sections or paraffin sections taken for histological purposes. The sample can also be derived from supernatants (of cells) or the cells themselves taken from patients or from cell cultures, cells from tissue culture and other media in which it may be desirable to detect the response to drug candidates. In some cases, the nucleic acids may be amplified using standard techniques such as PCR, prior to the hybridization.
In one embodiment, the present invention is a pre-treatment method of predicting disease regression or resolution. The method includes (1) taking a colon biopsy or other specimen from an individual diagnosed with UC or a related disease or disorder, (2) measuring the expression levels of the profile genes of the panel, (3) comparing the pre-treatment expression level of the genes with a pre-treatment reference profile from treatment responders, and (4) predicting treatment response by monitoring the expression levels of the gene panel.
Methods of Assessing Biomarker Utililty
The prognostic utility of the present biomarker gene panel for assessing a patient's response to treatment or prognosis of disease can be validated by using other means for assessing a patient's state of disease. For example, gross measurement of disease may be assessed and recorded by certain imaging methods, such as but not limited to: imaging by photographic, radiometric, or magnetic resonance technology. General indices of health or disease futher include serum or blood composition (protein, liver enzymes, pH, electrolytes, red cell volume, hematocrit, hemoglobin, or specific protein). However, in some diseases, the etiology is still poorly understood. UC is an example of one such disease.
Some of the genes in the panel belong to classes of genes that have been reported to be aberrantly expressed in UC patients previously, such as transcription factors, replication proteins, and oxidases, the expression patterns of the genes over the course of treatment have not been studied in the treatment of UC, and none has been identified as having predictive value. The panel of gene expression biomarkers disclosed herein permits the generation of methods for rapid and reliable prediction, diagnostic tools that predict the clinical outcome of a UC trial, or prognostic tools for tracking the efficacy of UC therapy. Prognostic methods based on detecting these genes in a sample are provided. These compositions may be used, for example, in connection with the diagnosis, prevention and treatment of a range of immune-mediated inflammatory diseases.
As used herein, the term “antagonists” refer to substances which inhibit or neutralize the biologic activity of the gene product of the UC-related gene panel of the invention. Such antagonists accomplish this effect in a variety of ways. One class of antagonists will bind to the gene product protein with sufficient affinity and specificity to neutralize the biologic effects of the protein. Included in this class of molecules are antibodies and antibody fragments (such as, for example, F(ab) or F(ab′)2 molecules). Another class of antagonists comprises fragments of the gene product protein, muteins or small organic molecules, i.e., peptidomimetics, that will bind to the cognate binding partners or ligands of the gene product, thereby inhibiting the biologic activity of the specific interaction of the gene product with its cognate ligand or receptor. The UC-related gene antagonist may be of any of these classes as long as it is a substance that inhibits at least one biological activity of the gene product.
Antagonists include antibodies directed to one or more regions of the gene product protein or fragments thereof, antibodies directed to the cognate ligand or receptor, and partial peptides of the gene product or its cognate ligand which inhibit at least one biological activity of the gene product. Another class of antagonists includes siRNAs, shRNAs, antisense molecules and DNAzymes targeting the gene sequence as known in the art are disclosed herein.
Suitable antibodies include those that compete for binding to UC-related gene products with monoclonal antibodies that block UC-related gene product activation or prevent UC-related gene product binding to its cognate ligand, or prevent UC-related gene product signalling.
A therapeutic targeting the inducer of the UC-related gene product may provide better chances of success. Gene expression can be modulated in several different ways including by the use of siRNAs, shRNAs, antisense molecules and DNAzymes. Synthetic siRNAs, shRNAs, and DNAzymes can be designed to specifically target one or more genes and they can easily be delivered to cells in vitro or in vivo.
The present invention encompasses antisense nucleic acid molecules, i.e., molecules that are complementary to a sense nucleic acid encoding a UC-related gene product polypeptide, e.g., complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary to an entire coding strand, or to only a portion thereof, e.g., all or part of the protein coding region (or open reading frame). An antisense nucleic acid molecule can be antisense to all or part of a non-coding region of the coding strand of a nucleotide sequence encoding a UC-related gene product polypeptide. The non-coding regions (“5′ and 3′ untranslated regions”) are the 5′ and 3′ sequences that flank the coding region and are not translated into amino acids.
The invention also provides chimeric or fusion proteins. As used herein, a “chimeric protein” or “fusion protein” comprises all or part (preferably biologically active) of a UC-related gene product polypeptide operably linked to a heterologous polypeptide (i.e., a polypeptide other than the same UC-related gene product polypeptide). Within the fusion protein, the term “operably linked” is intended to indicate that the UC-related gene product polypeptide and the heterologous polypeptide are fused in-frame to each other. The heterologous polypeptide can be fused to the amino-terminus or the carboxyl-terminus of the UC-related gene product polypeptide. In another embodiment, a UC-related gene product polypeptide or a domain or active fragment thereof can be fused with a heterologous protein sequence or fragment thereof to form a chimeric protein, where the polypeptides, domains or fragments are not fused end to end but are interposed within the heterologous protein framework.
In yet another embodiment, the fusion protein is an immunoglobulin fusion protein in which all or part of a UC-related gene product polypeptide is fused to sequences derived from a member of the immunoglobulin protein family. The immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an interaction between a ligand (soluble or membrane-bound) and a protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate ligand of a UC-related gene product polypeptide. Inhibition of ligand/receptor interaction can be useful therapeutically, both for treating proliferative and differentiative disorders and for modulating (e.g., promoting or inhibiting) cell survival. A preferred embodiment of an immunoglobulin chimeric protein is a CH1 domain-deleted immunoglobulin or MIMETIBODY™ construct having an active polypeptide fragment interposed within a modified framework region as taught in co-pending application PCT WO/04002417. Moreover, the immunoglobulin fusion proteins of the invention can be used as immunogens to produce antibodies directed against a UC-related gene product polypeptide in a subject, to purify ligands and in screening assays to identify molecules that inhibit the interaction of receptors with ligands.
Compositions and Their Uses
In accordance with the invention, the neutralizing anti-UC-related gene product antagonists, such as monoclonal antibodies, described herein can be used to inhibit UC-related gene product activity. Additionally, such antagonists can be used to inhibit the pathogenesis of UC and -related inflammatory diseases amenable to such treatment, which may include, but are not limited to, rheumatic diseases. The individual to be treated may be any mammal and is preferably a primate, a companion animal which is a mammal and most preferably a human patient. The amount of antagonist administered will vary according to the purpose it is being used for and the method of administration.
The UC-related gene antagonists may be administered by any number of methods that result in an effect in tissue in which pathological activity is desired to be prevented or halted. Further, the anti-UC-related gene product antagonists need not be present locally to impart an effect on the UC-related gene product activity, therefore, they may be administered wherever access to body compartments or fluids containing UC-related gene product is achieved. In the case of inflamed, malignant, or otherwise compromised tissues, these methods may include direct application of a formulation containing the antagonists. Such methods include intravenous administration of a liquid composition, transdermal administration of a liquid or solid formulation, oral, topical administration, or interstitial or inter-operative administration. Adminstration may be affected by the implantation of a device whose primary function may not be as a drug delivery vehicle.
For antibodies, the preferred dosage is about 0.1 mg/kg to 100 mg/kg of body weight (generally about 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of about 50 mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and fully human antibodies have a longer half-life within the human body than other antibodies. Accordingly, the use of lower dosages and less frequent administration is often possible. Modifications, such as lipidation, can be used to stabilize antibodies and to enhance uptake and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is described by Cruikshank et al. ((1997) J. Acquired Immune Deficiency Syndromes and Human Retrovirology 14:193).
The UC-related gene product antagonist nucleic acid molecules can be inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous injection, local administration (U.S. Pat. No. 5,328,470), or by stereotactic injection (see, e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells which produce the gene delivery system.
The pharmaceutical compositions can be included in a container, pack, or dispenser together with instructions for administration.
Agents, or modulators that have a stimulatory or inhibitory effect on activity or expression of a UC-related gene product polypeptide as identified by a screening assay described herein, can be administered to individuals to treat (prophylactically or therapeutically) disorders associated with aberrant activity of the polypeptide. In conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's genotype and that individual's response to a foreign compound or drug) of the individual may be considered. Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation between dose and blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's genotype. Such pharmacogenomics can further be used to determine appropriate dosages and therapeutic regimens. Accordingly, the activity of a UC-related gene product polypeptide, expression of a UC-related gene product nucleic acid, or mutation content of a UC-related gene product gene in an individual can be determined to thereby select an appropriate agent(s) for therapeutic or prophylactic treatment of the individual.
Pharmacogenomics deals with clinically significant hereditary variations in the response to drugs due to altered drug disposition and abnormal action in affected persons. See, e.g., Linder (1997) Clin. Chem. 43(2): 254-266. In general, two types of pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on the body are referred to as “altered drug action.” Genetic conditions transmitted as single factors altering the way the body acts on drugs are referred to as “altered drug metabolism.” These pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans.
As an illustrative embodiment, the activity of drug metabolizing enzymes is a major determinant of both the intensity and duration of drug action. The discovery of genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why some patients do not obtain the expected drug effects or show exaggerated drug response and serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among different populations. For example, the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side effects when they receive standard doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who do not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification.
Thus, the activity of a UC-related gene product polypeptide, expression of a nucleic acid encoding the polypeptide, or mutation content of a gene encoding the polypeptide in an individual can be determined to thereby select appropriate agent(s) for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the identification of an individual's drug responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when treating a subject with a modulator of activity or expression of the polypeptide, such as a modulator identified by one of the exemplary screening assays described herein.
The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant expression or activity of a UC-related gene product polypeptide and/or in which the UC-related gene product polypeptide is involved.
The present invention provides a method for modulating or treating at least one UC-related gene product related disease or condition, in a cell, tissue, organ, animal, or patient, as known in the art or as described herein, using at least one UC-related gene product antagonist.
Compositions of UC-related gene product antagonist may find therapeutic use in the treatment of UC or related conditions, such as Crohn's disease or other gastrointestinal disorders.
The present invention also provides a method for modulating or treating at least one gastrointestinal, immune related disease, in a cell, tissue, organ, animal, or patient including, but not limited to, at least one of gastric ulcer, inflammatory bowel disease, ulcerative colitis, Crohn's pathology, and the like. See, e.g., the Merck Manual, 12th-17th Editions, Merck & Company, Rahway, N.J. (1972, 1977, 1982, 1987, 1992, 1999), Pharmacotherapy Handbook, Wells et al., eds., Second Edition, Appleton and Lange, Stamford, Conn. (1998, 2000), each entirely incorporated by reference.
Disorders characterized by aberrant expression or activity of the UC-related gene product polypeptides are further described elsewhere in this disclosure.
In one aspect, the invention provides a method for at least substantially preventing in a subject, a disease or condition associated with an aberrant expression or activity of a UC-related gene product polypeptide, by administering to the subject an agent that modulates expression or at least one activity of the polypeptide. Subjects at risk for a disease that is caused or contributed to by aberrant expression or activity of a UC-related gene product can be identified by, for example, any or a combination of diagnostic or prognostic assays as described herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of aberrancy, for example, an agonist or antagonist agent can be used for treating the subject. The appropriate agent can be determined based on screening assays described herein.
Another aspect of the invention pertains to methods of modulating expression or activity of UC-related gene or gene product for therapeutic purposes. The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of the polypeptide. An agent that modulates activity can be an agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of the polypeptide, a peptide, a peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or more of the biological activities of the polypeptide. In another embodiment, the agent inhibits one or more of the biological activities of the UC-related gene or gene product polypeptide. Examples of such inhibitory agents include antisense nucleic acid molecules and antibodies and other methods described herein. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of a UC-related gene product polypeptide. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or combination of agents that modulate (e.g., up-regulates or down-regulates) expression or activity. Inhibition of activity is desirable in situations in which activity or expression is abnormally high or up-regulated and/or in which decreased activity is likely to have a beneficial effect.
While having described the invention in general terms, the embodiments of the invention will be further disclosed in the following examples which should not be construed as limiting the scope of the claims.
Sample Collection and RNA Isolation
Patients with moderate to severe active UC were randomly assigned 1:1:1 to intravenous placebo or infliximab (anti-TNF antibody) at a dose of 5 or 10 mg/kg at 0, 2, 6 and every 8 weeks thereafter. Colonoscopic punch biopsies were obtained from disease tissues at weeks 0 (prior to therapy), 8, and 30 and kept frozen until RNA preparation. RNA isolated from the biopsy samples was subsequently used for Affymetrix (oligonucleotide) microarray analysis. One hundred and twenty-three colon biopsy samples were collected from 49 subjects in this study. Gene expression profiles from 17 samples collected at week 0 (prior to treatment) from 16 subjects (there were 2 samples collected from one of the subjects—as a result of duplicative biopsy procedures) who responded to infliximab treatment in both 5 and 10 mg/kg dose groups at both weeks 8 and 30 were compared to that of 6 samples from week 0 of non-responders across both dose groups at both time points as described herein. Treatment responders showed a marked clinical improvement following therapy defined by a decrease from baseline Mayo score by at least 3 points and at least 30% with an accompanying decrease in rectal bleeding sub-score of at least 1 point or an absolute rectal bleeding sub-score of 0 or 1. Clinical response was assessed at weeks 8 and 30 post-treatment.
Total RNA was isolated with an RNeasy mini kit according to the manufacturer's instructions (Qiagen Inc., Valencia, Calif.). The colon biopsy samples were lysed and homogenized in the presence of 600 μL of GITC (guanidine isothiocyanate)-containing buffer, which immediately inactivates RNase to ensure isolation of intact RNA. 600 μL of 70% ethanol was added to provide appropriate binding conditions and the sample was then applied to an RNeasy mini spin column where the total RNA binds to the membrane and contaminants were efficiently washed away. High-quality RNA was then eluted in 30 μL of water. RNA quality and quantity was analyzed with 2100 Bioanalyzer (Agilent Technologies Inc., Palo Alto, Calif.).
Microarray Data Analysis
Microarray analysis was performed on GeneChip Human Genome U133 Plus 2.0 arrays that allow the analysis of the expression level of more than 47,000 transcripts and variants, including 38,500 well-characterized human genes. RNA amplification, target synthesis and labeling, chip hybridization, washing and staining were performed in accordance with the manufacturer's protocol (Affymetrix, Santa Clara, Calif.). The GeneChips were scanned using the GeneChip Scanner 3000. The data were analyzed with GCOS 1.4 (GeneChip Operating System) using Affymetrix default analysis settings and global scaling as normalization method. The trimmed mean target intensity of each array was arbitrarily set to 500.
Data quality was assessed by hybridization intensity distribution and Pearson's correlation in Partek Pro software version 6.1 (Partek Inc., St. Charles, Mo.), and was deemed good except 2 samples, E36507_P43—5 mg/kg_W30 & E36498_P39_placebo_W8. These samples were regarded as outliers and removed from data analysis.
Using GeneSpring™ software version 7.2 (Agilent Technologies, Palo Alto, Calif.), the intensity for probe set was normalized across all samples. Each measurement was divided by the median of all measurements in that sample. The intensity of a probe set was then normalized to the median intensity of that probe set in the control group. The control groups in this study were all 45 week 0 samples. Normalized intensity of probe set A in sample X was calculated as following:
Using Partek Pro 6.2, statistical analysis identified significant differences between responders and nonresponders using log-2 transformed normalized intensities. Samples from subjects of the same clinical response in both 5 and 10 mg/kg dose groups at both weeks 8 and 30 were combined to increase the statistical power. ANOVA was conducted between responders and non-responders, using samples collected at week 0 (prior to treatment). Statistically significant differences were determined after applying a false discovery rate (FDR) of 10% for multiple testing correction.
Classification of infliximab responsiveness for each patient sample was generated with the ‘K-Nearest Neighbors’ algorithm, using GeneSpring™ 7.2. A classifier containing transcripts showing significant differential expression between responders and non-responders prior to IFX treatment was evaluated by leave-one-out cross-validation for its efficiency of classification. A p-value was calculated to measure the probability that a test sample was predicted as belonging to one class by chance. And a p-value ratio was defined as the p-value of the first best class relative to that of the next best class.
Microarray Results
Biopsies taken from infliximab treatment responders and non-responders at week 0 allowed an understanding of the potential mechanism underlying treatment response and non-response in UC, and thus predicting clinical response prior to treatment. The baseline responder samples analyzed were taken from patients who showed a marked clinical improvement following infliximab therapy based on assessment done at weeks 8 and 30 post-treatment, as defined by a decrease from baseline in the total Mayo score by at least 3 points and at least 30% with an accompanying decrease in rectal bleeding sub-score of at least 1 point or an absolute rectal bleeding sub-score of 0 or 1. The non-responder samples were taken from patients who didn't achieve the treatment response as defined above.
Gene expression results from responders in both 5- and 10-mg/kg dose groups were compared to that of non-responders at week 0. A set of 19 genes that demonstrated significant changes between an infliximab responder vs. non-responder is listed in Tables 1A and 1B. Each differentially expressed gene is presented by the ratio of normalized hybridization intensity of infliximab treatment responder samples to that of non-responder samples. Each gene is defined by Gene Ontology (GO) terms for its associated biological process, cellular component, and molecular function.
Table 1A and Table 1B. A set of 19 genes differentially expressed between infliximab responders and non-responders at the baseline.
Since these genes passed the ANOVA test with a false discovery rate of 10% or less when the infliximab treatment responder samples were compared to the corresponding non-responder samples, the entire expression profile as detailed in Tables 1A and 1B are therefore defined as the infliximab treatment response signature in UC prior to treatment. In this baseline response gene signature, all the genes were expressed at higher levels in infliximab treatment responder samples as compared with that in the non-responder samples.
These results are novel findings in that clinical response outcome to infliximab treatment in moderate to severe UC can be predicted prior to treatment via assessing gene expression levels of a panel of selective genes. The expression changes in a panel of genes as represented in Tables 1A and 1B can constitute a classifier that serves as a biomarker profile indicative of the response of a subject to treatment at baseline prior to any treatment.
Real Time PCR (TaqMan®) Confirmation:
In order to confirm the microarray finding by an independent means, Real Time PCR (TaqMan®) technology was employed. Two micrograms of total RNA in the volume of 100 μL was converted to cDNA in the presence of MultiScribe Reverse Transcriptase. The reaction was carried out by incubating for 10 minutes at 25° C. followed by 30 minutes at 48° C. Reverse Transcriptase was inactivated at 95° C. for 5 minutes. Twenty-five nanograms of cDNA per reaction were used in real time PCR with ABI 7900 system (Foster City, Calif.). In the presence of AmpliTaq Gold DNA polymerase (ABI biosystem, Foster City, Calif.), the reaction was incubated for 2 minutes at 50° C. followed by 10 minutes at 95° C. Then, the reaction was run for 40 cycles at 15 seconds, at 95° C. and 1 minute, 60° C. per cycle. The housekeeping gene GAPDH (glyceraldehyde-3-phosphate dehydrogenase) was used to normalize gene expression. The TaqMan® results on a selected number of genes are consistent with the observation from the microarray analysis.
The TaqMan® results on a selected number of genes are summarized in Table 2. Overall, it is consistent with the observation from the microarray analysis. The data is reported as the relative fold change in expression (responder relative to non-responder). Microarray data is shown for comparison purposes.
&Mean fold change of five responders vs. four non-responders at week 0 by Taqman, FC = mean fold change, L = lower fold change value, U = upper fold change value
Cross-Validation of the Treatment Response Classifier
Among the 19 genes (Tables 1A and 1B) that differentiate the infliximab responders and non-responders at the baseline, the differential expression of five of them, C20orf23 (SEQ ID NO:5), LOC90693 (SEQ ID NO:8), CPOX (SEQ ID NO:10), USP28 (SEQ ID NO:13), and UBP1 (SEQ ID NO:14) were tested and confirmed by TaqMan (Table 2). The efficiency of the two panels of genes for patient classification was further evaluated by leave-one-out cross-validation. Classification was generated by the ‘K-Nearest Neighbors’ algorithm to cross-validate 22 baseline subjects (16 responders and six non-responders based on the assessment at both weeks 8 and 30 post-treatment) for the infliximab responsiveness. The panel of 19 or 5 genes achieved the same classification efficiency, by correctly classifying 21 patients (16 responders and five non-responders) and misclassifying one non-responder, with 100% of sensitivity and 83.3% specificity for prediction of treatment responsiveness at baseline (Table 3).
Utility of the response signature. The response signature for infliximab treatment in UC described herein can be assessed and used as described below.
Total RNA is to be analyzed on a gene chip array for the expression intensities of the 19-gene panel listed in Tables 1A and 1B or the 5-gene panel listed in Table 2. The following procedures are exemplary of a method of evaluating members of a gene panel of the invention against a reference standard in order to compare values of the 19- or 5-gene panel members:
1. Total RNA is extracted from a biopsy sample from a prospective UC (or related disorder) patient before treatment and the total RNA quantity and quality is assessed as specified above in Example 1.
2. Total RNA is run in duplicate on three separate identical gene chip arrays, e.g., GeneChip Human Genome U133 Plus 2.0 arrays as follows:
a. RNA amplification, target synthesis and labeling, chip hybridization, washing and staining are performed according to the manufacturer's protocol, e.g., Affymetrix, Santa Clara, Calif.
b. The GeneChips are scanned using, e.g., the GeneChip Scanner 3000.
c. The data is analyzed with, e.g., GCOS 1.4 (GeneChip Operating System) using Affymetrix default analysis settings and global scaling as normalization method, with the trimmed mean target intensity of each array arbitrarily set to 500.
d. The data quality is determined by correlating the data of each gene among the duplicates and across the three arrays.
e. An average intensity value is calculated with a standard error representing the variability.
f. The patient should respond to treatment with an anti-TNFα antibody (e.g., infliximab) if:
g. The patient should not respond to anti-TNFα antibody (e.g., infliximab) treatment if:
Although illustrated and described above with reference to certain specific embodiments, the present invention is nevertheless not intended to be limited to the details shown. Rather, the present invention is directed to the UC-related genes and gene products. Polynucleotides, antibodies, apparatus, and kits disclosed herein and uses thereof, and methods for predicting responsiveness to treatment and controlling the levels of the UC-related biomarker genes, and various modifications may be made in the details within the scope and range of equivalents of the claims and without departing from the spirit of the invention.
This application claims the benefit of U.S. Provisional Application No. 60/914,908, filed 30 Apr. 2007, the entire contents of which are incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
60914908 | Apr 2007 | US |