The present invention relates to methods and biomarkers for detection and characterization of lymphoma (e.g., splenic marginal zone lymphoma) in biological samples (e.g., tissue samples, blood samples, plasma samples, cell samples, serum samples).
Splenic marginal zone lymphoma (SMZL) is an indolent malignancy of splenic B lymphocytes characterized by splenomegaly, peripheral leukocytosis and cytopenias with a median age of onset of greater than 50 years. SMZL is the most common primary malignancy of the spleen and represents approximately 10% of all lymphomas that involve the spleen (Franco et al., 2003 Blood 101:2464-2472).
Although the disease course is usually indolent, with many patients surviving beyond 10 years, some patients present with more aggressive disease and survival between 1 and 2 years (Chacon et al., 2002 Blood 100:1648-1654). A “watch and wait” approach to instituting therapy may be considered for patients with favorable clinical prognostic factors (Arcaini et al., 2006) however, as it is difficult to predict subsequent risk of disease aggressiveness or refractoriness, a common first-line therapeutic approach is splenectomy and anti-Blymphocyte biological agents such as the anti-CD20 antibody (rituximab). Refractory cases may then be treated with more toxic chemotherapies including alkylating agents or purine analogs. In contrast to many other B-cell malignancies, SMZL is not associated with recurrent balanced translocations or genetic mutations. Moreover, little is known about the genetic events underpinning the development of aggressive or refractory disease or the transformation to higher-grade disease.
Better, more effective non-invasive tests for early detection of lymphomas are needed to lower the morbidity and mortality associated with such cancers.
The present invention relates to methods and biomarkers for detection and characterization of lymphoma (e.g., splenic marginal zone lymphoma) in biological samples (e.g., tissue samples, blood samples, plasma samples, cell samples, serum samples).
For example, in some embodiments, the present invention provides a method for detecting NOTCH2 variants associated with splenic marginal zone lymphoma (SMZL) in a subject, comprising: a) contacting a sample from a subject with a NOTCH2 variant detection assay under conditions that the presence of a NOTCH variant associated with SMZL is determined; and b) diagnosing SMZL in the subject when the NOTCH2 variants are present in the sample. In some embodiments, the NOTCH2 variant encodes a loss of function mutation. In some embodiments, the loss of function mutation is a truncation mutation (e.g., the truncation results in a non-functional PEST domain of the NOTCH2 polypeptide). The present invention is not limited to a particular NOTCH2 mutation. Examples include, but are not limited to, one or more of c.6909dupC (p.I2304fsX9), c.7198C>T (p.R2400X), c.4999G>A (p.V1667I), c.6304A>T (p.K2102X), c.6824C>A (p.A2275D), c.6834delinsGCACG (p.T2280fsX12), c.6853C>T (p.Q2285X), c.6868G>A (p.E2290X), c.6873delG (p.K2292fsX3), c.6909delC (p.I2304fsX2), c.6909delC (p.I2304fsX2) plus c.7072A>G (p.M2358V), c.6909dupC (p.I2304fsX9), c.6910delinsCCC (p.I2304fsX3), c.6973C>T (p.Q2325X), or c.7231G>T (p.E2411X). In some embodiments, variants in additional genes are detected in combination with the described NOTCH2 variants (e.g., those described in Tables 5 and 6). In some embodiments, the detection assay is a variant NOTCH2 nucleic acid or polypeptide detection assay. In some embodiments, detecting variant NOTCH2 nucleic acids comprises one or more nucleic acid detection methods selected from, for example, sequencing, amplification or hybridization. In some embodiments, the biological sample is a tissue sample, a cell sample, or a blood sample. In some embodiments, the determining comprises a computer implemented method (e.g., analyzing NOTCH2 variant information and displaying the information to a user). In some embodiments, the method further comprises the step of treating the subject for SMZL and monitoring the subject for the presence of NOTCH2 variants associated with SMZL. In some embodiments, the method further comprises the step of treating the subject for SMZL under conditions such that one or more symptoms of SMZL are decreased or eliminated. Additional embodiments provide the use of a variant NOTCH2 nucleic acid or polypeptide for detecting SMZL in a subject.
In still further embodiments, the present invention provides a method of determining a decreased time to adverse outcome in a subject diagnosed with SMZL, comprising: a) contacting a sample from a subject with a NOTCH2 variant detection assay under conditions that the presence of a NOTCH2 variant associated with SMZL is determined; and b) diagnosing a decreased time to adverse outcome in the subject when the NOTCH2 variants are present in the sample. In some embodiments, the adverse outcome is relapse of SMZL, metastasis, or death.
Additional embodiments will be apparent to persons skilled in the relevant art based on the teachings contained herein.
SMZL including expansion of pale staining marginal zones surrounding splenic follicles in a biphasic pattern. Panels B and C display reverse complement sequence reads (Read Alignment) mapped to the reference genome (Reference Sequence) from two of three index samples with mutations in NOTCH2 (boxed) with deviations from reference genome highlighted in blue. Bottom panel shows Sanger sequencing electropherograms confirming mutations in the index cases (SMZL) and the absence of the mutations in matched normal constitutional tissue (Germline).
To facilitate an understanding of the present invention, a number of terms and phrases are defined below:
As used herein, the term “sensitivity” is defined as a statistical measure of performance of an assay (e.g., method, test), calculated by dividing the number of true positives by the sum of the true positives and the false negatives.
As used herein, the term “specificity” is defined as a statistical measure of performance of an assay (e.g., method, test), calculated by dividing the number of true negatives by the sum of true negatives and false positives.
As used herein, the term “informative” or “informativeness” refers to a quality of a marker or panel of markers, and specifically to the likelihood of finding a marker (or panel of markers) in a positive sample.
As used herein, the term “metastasis” is meant to refer to the process in which cancer cells originating in one organ or part of the body relocate to another part of the body and continue to replicate. Metastasized cells subsequently form tumors which may further metastasize. Metastasis thus refers to the spread of cancer from the part of the body where it originally occurs to other parts of the body.
The term “neoplasm” as used herein refers to any new and abnormal growth of tissue. Thus, a neoplasm can be a premalignant neoplasm or a malignant neoplasm. The term “neoplasm-specific marker” refers to any biological material that can be used to indicate the presence of a neoplasm. Examples of biological materials include, without limitation, nucleic acids, polypeptides, carbohydrates, fatty acids, cellular components (e.g., cell membranes and mitochondria), and whole cells. The term “SMZL-specific marker” refers to any biological material that can be used to indicate the presence of SMZL. Examples of SMZL specific markers include, but are not limited to, the NOTCH2 variants described herein.
As used herein, the term “adverse outcome” refers to an undesirable outcome in a patient diagnosed with SMZL. In some embodiments, the patient is undergoing or has undergone treatment for SMZL. Examples of adverse outcome include but are not limited to, recurrence of SMZL, metastasis, transformation, or death.
As used herein, the term “amplicon” refers to a nucleic acid generated using primer pairs. The amplicon is typically single-stranded DNA (e.g., the result of asymmetric amplification), however, it may be RNA or dsDNA.
The term “amplifying” or “amplification” in the context of nucleic acids refers to the production of multiple copies of a polynucleotide, or a portion of the polynucleotide, typically starting from a small amount of the polynucleotide (e.g., a single polynucleotide molecule), where the amplification products or amplicons are generally detectable.
As used herein, the term “primer” refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, that is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product that is complementary to a nucleic acid strand is induced (e.g., in the presence of nucleotides and an inducing agent such as a biocatalyst (e.g., a DNA polymerase or the like) and at a suitable temperature and pH). The primer is typically single stranded for maximum efficiency in amplification, but may alternatively be double stranded.
If double stranded, the primer is generally first treated to separate its strands before being used to prepare extension products. In some embodiments, the primer is an oligodeoxyribonucleotide. The primer is sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer and the use of the method. In certain embodiments, the primer is a capture primer.
A “sequence” of a biopolymer refers to the order and identity of monomer units (e.g., nucleotides, etc.) in the biopolymer. The sequence (e.g., base sequence) of a nucleic acid is typically read in the 5′ to 3′ direction.
As used herein, the term “subject” refers to any animal (e.g., a mammal), including, but not limited to, humans, non-human primates, rodents, and the like, which is to be the recipient of a particular treatment. Typically, the terms “subject” and “patient” are used interchangeably herein in reference to a human subject.
As used herein, the term “non-human animals” refers to all non-human animals including, but are not limited to, vertebrates such as rodents, non-human primates, ovines, bovines, ruminants, lagomorphs, porcines, caprines, equines, canines, felines, ayes, etc.
The term “locus” as used herein refers to a nucleic acid sequence on a chromosome or on a linkage map and includes the coding sequence as well as 5′ and 3′ sequences involved in regulation of the gene.
The present invention relates to methods and biomarkers for detection and characterization of lymphoma (e.g., splenic marginal zone lymphoma) in biological samples (e.g., tissue samples, blood samples, plasma samples, cell samples, serum samples).
The NOTCH family of transmembrane receptor proteins is important for mediating cell fate determination and differentiation in a variety of embryonic and adult tissues. During hematopoietic differentiation, NOTCH1 signaling is known to influence cell-fate decisions as lymphocytes differentiate into B- or T-cells (Pui et al., 1999 Immunity 11:299-308; Radtke et al., 1999 Immunity 10:547-558; Robey and Bluestone, 2004 Curr Opin Immunol 16:360-366). Moreover, NOTCH2 is known to control B-lymphocyte specification into cells of marginal zone lineage (Pillai and Cariappa, 2009 Nat Rev Immunol 9:767-777). Whereas defects in NOTCH1 signaling have been implicated in oncogenesis in acute T-lymphoblastic leukemia (Aster et al., 2011 J Pathol 223:262-273; Weng et al., 2004 Science 306:269-271), chronic lymphocytic leukemia/small lymphocytic lymphoma (Del Giudice et al., 2012 Haematologica 97:437-441.; Puente et al., 2011) and mantle cell lymphoma (Kridel et al., 2012 Blood 119:1963-1971), comparatively little is known about the potential role of NOTCH2 signaling defects in the development of malignancies affecting cells of B-lymphocyte lineage (Aster et al., 2011 J Pathol 223:262-273).
Experiments conducted during the course of development of embodiments of the present invention utilized whole genome and targeted Sanger gene sequencing to identify recurrent mutations predominantly clustered in the C-terminal portion of the NOTCH2 gene in SMZL. NOTCH2 mutations were identified in half of these cases. Sanger sequencing of 93 additional SMZLs and 103 other types of B-cell lymphoma or leukemia or reactive lymphoid hyperplasia showed NOTCH2 mutations in 22 additional SMZL patients, yielding an overall frequency of 25.3%. No mutations were identified in other non-MZL B-cell lymphomas and leukemias analyzed. Moreover, in 19 patients with NOTCH2-mutated SMZL constitutional DNA was available for assessment and was confirmed to be wild-type indicating somatic acquisition of NOTCH2 mutation in SMZL.
In total, 26 NOTCH2 mutations were identified in 25 SMZL patients. These mutations represented six unique types of non-sense mutations, five unique types of frameshift mutations and three unique types of missense mutations. Twenty-five of these mutations affected the TAD or PEST domains with 23 predicted to yield protein truncation at or upstream of the PEST domain. The remaining case harbored a somatic p.V1667I mutation in the HD. All of these mutations were identified in the same protein domains as have been reported for NOTCH1 in T-ALL, CLL/SLL and MCL. However, NOTCH1 mutations in T-ALL are more prevalent in the HD than the TAD and PEST domain (
Pathogenic germline mutations in the TAD/PEST domain of NOTCH2 have been reported in Hajdu-Cheney syndrome (HCS), a rare autosomal dominant skeletal disorder characterized by facial anomalies, acro-osteolysis and osteoporosis (Isidor et al., 2011 Nat Genet 43:306-308; Simpson et al., 2011 Nat Genet 43:303-305). The NOTCH2 mutations in HCS include one report of a transmitted p.R2400X mutation (Simpson et al., 2011 supra) (
Examination of NOTCH2 mutational status in non-splenic MZLs revealed mutation in approximately 5% of cases analyzed. The NOTCH2 mutation identified in a single case of extranodal MZL of the breast was a p.R2400X nonsense mutation. This mutation was also identified in nine of 99 (9.1%) SMZL cases. The selectivity of NOTCH2 mutations for malignancies of marginal zone B-cells is in keeping with the known role of NOTCH2 in marginal zone cell fate determination (Saito et al., 2003 Immunity 18:675-685; Witt et al., 2003 J Immunol 171:2783-2788). It is noteworthy that NOTCH1 dictates T-cell fate and supra-physiological NOTCH1 signaling induces T-ALL (Weng et al., 2004 Science 306:269-271). The present invention is not limited to a particular mechanism. Indeed, an understanding of the mechanism is not necessary to practice the present invention. Nonetheless, it is contemplated that since NOTCH2 specifies marginal zone-B cell fate, supra-physiological NOTCH2 signaling plays a role in pathogenesis of MZL. Somatic mutations affecting specific genes that impact SMZL prognosis are largely unknown. While previous studies have implicated a role for mutations targeting genes in the NFKB pathway in a subset of SMZL (Rossi et al., 2011 Blood 118:4930-4934), only TP53 alterations present in a small minority of cases has been demonstrated to impact SMZL prognosis (Rinaldi et al., 2011 Blood 117:1595-1604; Salido et al., 2010 Blood 116:1479-1488). Experiments described herein found that the presence of NOTCH2 mutation in SMZLs at time of diagnosis predicted an adverse disease course characterized either by refractoriness to therapy, histological transformation to higher grade disease, or an otherwise aggressive clinical course. Assessment of NOTCH2 mutation status in cases of SMZL is thus useful to predict risk of aggressive disease and inform clinical decision-making at diagnosis, with the presence of NOTCH2 mutation being an indication for more aggressive therapy.
Embodiments of the present invention provide diagnostic, prognostic, and screening methods. In some embodiments, methods characterize and diagnose lymphoma (e.g., splenic marginal zone lymphoma (SMZL) or non-splenic MZLs). Exemplary, non-limiting methods of identifying NOTCH2 mutations are described below.
Embodiments of the present invention provide compositions and methods for detecting mutations in NOTCH2 (e.g., to identify or diagnose splenic lymphomas). The present invention is not limited to particular NOTCH2 mutations. In some embodiments, mutations are loss of function mutations (e.g., truncation, nonsense, missense, or frameshift mutations).
Exemplary mutations include, but are not limited to, c.6909dupC (p.I2304fsX9), c.7198C>T (p.R2400X), c.4999G>A (p.V1667I), c.6304A>T (p.K2102X), c.6824C>A (p.A2275D), c.6834delinsGCACG (p.T2280fsX12), c.6853C>T (p.Q2285X), c.6868G>A (p.E2290X), c.6873delG (p.K2292fsX3), c.6909delC (p.I2304fsX2), c.6909delC (p.I2304fsX2) plus c.7072A>G (p.M2358V), c.6909dupC (p.I2304fsX9), c.6910delinsCCC (p.I2304fsX3), c.6973C>T (p.Q2325X), or c.7231G>T (p.E2411X).
While the present invention exemplifies several markers specific for detecting splenic lymphoma, any marker that is correlated with the presence or absence or prognosis of splenic lymphomas may be used. A marker, as used herein, includes, for example, nucleic acid(s) whose production or mutation or lack of production is characteristic of a splenic lymphoma and mutations that cause the same effect (e.g., deletions, truncations, etc).
In some embodiments, one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, or more (e.g., all)) of the mutations are identified in order to diagnose or characterize splenic lymphoma. In some embodiments, mutations are identified in combination with one or more additional markers of splenic lymphomas or other cancers (e.g., those described in Tables 5 and 6). In some embodiments, multiple markers are detected in a panel or multiplex format.
Particular combinations of markers may be used that show optimal function with different ethnic groups or sex, different geographic distributions, different stages of disease, different degrees of specificity or different degrees of sensitivity. Particular combinations may also be developed which are particularly sensitive to the effect of therapeutic regimens on disease progression. Subjects may be monitored after a therapy and/or course of action to determine the effectiveness of that specific therapy and/or course of action.
In some embodiments, the present invention provides methods of detecting the presence of wild type or variant (e.g., mutant or polymorphic) NOTCH2 nucleic acids or polypeptides. The detection of mutant NOTCH2 finds use in the diagnosis of disease (e.g., splenic lymphomas), research, and selection of appropriate treatment and/or monitoring regimens.
Accordingly, the present invention provides methods for determining whether a patient has a NOTCH2 mutation profile associated with a splenic lymphoma.
A number of methods are available for analysis of variant (e.g., mutant or polymorphic) nucleic acid sequences. Assays for detecting variants (e.g., polymorphisms or mutations) fall into several categories, including, but not limited to direct sequencing assays, fragment polymorphism assays, hybridization assays, and computer based data analysis. Protocols and commercially available kits or services for performing multiple variations of these assays are available. In some embodiments, assays are performed in combination or in hybrid (e.g., different reagents or technologies from several assays are combined to yield one assay). The following assays are useful in the present invention.
Any patient sample containing NOTCH2 nucleic acids or polypeptides may be tested according to the methods of the present invention. By way of non-limiting examples, the sample may be tissue, blood, urine, semen, or a fraction thereof (e.g., plasma, serum, whole blood, spleen cells, etc.).
The patient sample may undergo preliminary processing designed to isolate or enrich the sample for the NOTCH2 nucleic acids or polypeptides or cells that contain NOTCH2. A variety of techniques known to those of ordinary skill in the art may be used for this purpose, including but not limited: centrifugation; immunocapture; cell lysis; and, nucleic acid target capture (See, e.g., EP Pat. No. 1 409 727, herein incorporated by reference in its entirety).
i. DNA and RNA Detection
The NOTCH2 variants of the present invention may be detected as genomic DNA or mRNA using a variety of nucleic acid techniques known to those of ordinary skill in the art, including but not limited to: nucleic acid sequencing; nucleic acid hybridization; and, nucleic acid amplification.
1. Sequencing
Illustrative non-limiting examples of nucleic acid sequencing techniques include, but are not limited to, chain terminator (Sanger) sequencing and dye terminator sequencing.
Those of ordinary skill in the art will recognize that because RNA is less stable in the cell and more prone to nuclease attack experimentally RNA is usually reverse transcribed to DNA before sequencing.
Chain terminator sequencing uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. Extension is initiated at a specific site on the template DNA by using a short radioactive, fluorescent or other labeled, oligonucleotide primer complementary to the template at that region. The oligonucleotide primer is extended using a DNA polymerase, standard four deoxynucleotide bases, and a low concentration of one chain terminating nucleotide, most commonly a di-deoxynucleotide. This reaction is repeated in four separate tubes with each of the bases taking turns as the di-deoxynucleotide.
Limited incorporation of the chain terminating nucleotide by the DNA polymerase results in a series of related DNA fragments that are terminated only at positions where that particular di-deoxynucleotide is used. For each reaction tube, the fragments are size-separated by electrophoresis in a slab polyacrylamide gel or a capillary tube filled with a viscous polymer. The sequence is determined by reading which lane produces a visualized mark from the labeled primer as you scan from the top of the gel to the bottom.
Dye terminator sequencing alternatively labels the terminators. Complete sequencing can be performed in a single reaction by labeling each of the di-deoxynucleotide chain-terminators with a separate fluorescent dye, which fluoresces at a different wavelength.
Some embodiments of the present invention utilize next generation or high-throughput sequencing. A variety of nucleic acid sequencing methods are contemplated for use in the methods of the present disclosure including, for example, chain terminator (Sanger) sequencing, dye terminator sequencing, and high-throughput sequencing methods. Many of these sequencing methods are well known in the art. See, e.g., Sanger et al., Proc. Natl. Acad. Sci. USA 74:5463-5467 (1997); Maxam et al., Proc. Natl. Acad. Sci. USA 74:560-564 (1977); Drmanac, et al., Nat. Biotechnol. 16:54-58 (1998); Kato, Int. J. Clin. Exp. Med. 2:193-202 (2009); Ronaghi et al., Anal. Biochem. 242:84-89 (1996); Margulies et al., Nature 437:376-380 (2005); Ruparel et al., Proc. Natl. Acad. Sci. USA 102:5932-5937 (2005), and Harris et al., Science 320:106-109 (2008); Levene et al., Science 299:682-686 (2003); Korlach et al., Proc. Natl. Acad. Sci. USA 105:1176-1181 (2008); Branton et al., Nat. Biotechnol. 26(10):1146-53 (2008); Eid et al., Science 323:133-138 (2009); each of which is herein incorporated by reference in its entirety.
In some embodiments, sequencing technology including, but not limited to, pyrosequencing, sequencing-by-ligation, single molecule sequencing, sequence-by-synthesis
(SBS), massive parallel clonal, massive parallel single molecule SBS, massive parallel single molecule real-time, massive parallel single molecule real-time nanopore technology, etc. Morozova and Marra provide a review of some such technologies in Genomics, 92: 255 (2008), herein incorporated by reference in its entirety. Those of ordinary skill in the art will recognize that because RNA is less stable in the cell and more prone to nuclease attack experimentally RNA is usually reverse transcribed to DNA before sequencing.
A number of DNA sequencing techniques are known in the art, including fluorescence-based sequencing methodologies (See, e.g., Birren et al., Genome Analysis: Analyzing DNA, 1, Cold Spring Harbor, N.Y.; herein incorporated by reference in its entirety). In some embodiments, the technology finds use in automated sequencing techniques understood in that art. In some embodiments, the present technology finds use in parallel sequencing of partitioned amplicons (PCT Publication No: WO2006084132 to Kevin McKernan et al., herein incorporated by reference in its entirety). In some embodiments, the technology finds use in DNA sequencing by parallel oligonucleotide extension (See, e.g., U.S. Pat. No. 5,750,341 to Macevicz et al., and U.S. Pat. No. 6,306,597 to Macevicz et al., both of which are herein incorporated by reference in their entireties). Additional examples of sequencing techniques in which the technology finds use include the Church polony technology (Mitra et al., 2003, Analytical Biochemistry 320, 55-65; Shendure et al., 2005 Science 309, 1728-1732; U.S. Pat. No. 6,432,360, U.S. Pat. No. 6,485,944, U.S. Pat. No. 6,511,803; herein incorporated by reference in their entireties), the 454 picotiter pyrosequencing technology (Margulies et al., 2005 Nature 437, 376-380; US 20050130173; herein incorporated by reference in their entireties), the Solexa single base addition technology (Bennett et al., 2005, Pharmacogenomics, 6, 373-382; U.S. Pat. No. 6,787,308; U.S. Pat. No. 6,833,246; herein incorporated by reference in their entireties), the Lynx massively parallel signature sequencing technology (Brenner et al. (2000). Nat. Biotechnol. 18:630-634; U.S. Pat. No. 5,695,934; U.S. Pat. No. 5,714,330; herein incorporated by reference in their entireties), and the Adessi PCR colony technology (Adessi et al. (2000). Nucleic Acid Res. 28, E87; WO 00018957; herein incorporated by reference in its entirety).
Next-generation sequencing (NGS) methods share the common feature of massively parallel, high-throughput strategies, with the goal of lower costs in comparison to older sequencing methods (see, e.g., Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbial., 7: 287-296; each herein incorporated by reference in their entirety). NGS methods can be broadly divided into those that typically use template amplification and those that do not. Amplification-requiring methods include pyrosequencing commercialized by Roche as the 454 technology platforms (e.g., GS 20 and GS FLX), the Solexa platform commercialized by Illumina, and the Supported Oligonucleotide Ligation and Detection (SOLiD) platform commercialized by Applied Biosystems. Non-amplification approaches, also known as single-molecule sequencing, are exemplified by the HeliScope platform commercialized by Helicos BioSciences, and emerging platforms commercialized by VisiGen, Oxford Nanopore Technologies Ltd., Life Technologies/Ion Torrent, and Pacific Biosciences, respectively.
In pyrosequencing (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbial., 7: 287-296; U.S. Pat. No. 6,210,891; U.S. Pat. No. 6,258,568; each herein incorporated by reference in its entirety), template DNA is fragmented, end-repaired, ligated to adaptors, and clonally amplified in-situ by capturing single template molecules with beads bearing oligonucleotides complementary to the adaptors. Each bead bearing a single template type is compartmentalized into a water-in-oil microvesicle, and the template is clonally amplified using a technique referred to as emulsion PCR. The emulsion is disrupted after amplification and beads are deposited into individual wells of a picotitre plate functioning as a flow cell during the sequencing reactions. Ordered, iterative introduction of each of the four dNTP reagents occurs in the flow cell in the presence of sequencing enzymes and luminescent reporter such as luciferase. In the event that an appropriate dNTP is added to the 3′ end of the sequencing primer, the resulting production of ATP causes a burst of luminescence within the well, which is recorded using a CCD camera. It is possible to achieve read lengths greater than or equal to 400 bases, and 106 sequence reads can be achieved, resulting in up to 500 million base pairs (Mb) of sequence.
In the Solexa/Illumina platform (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbial., 7: 287-296; U.S. Pat. No. 6,833,246; U.S. Pat. No. 7,115,400; U.S. Pat. No. 6,969,488; each herein incorporated by reference in its entirety), sequencing data are produced in the form of shorter-length reads. In this method, single-stranded fragmented DNA is end-repaired to generate 5′-phosphorylated blunt ends, followed by Klenow-mediated addition of a single A base to the 3′ end of the fragments. A-addition facilitates addition of T-overhang adaptor oligonucleotides, which are subsequently used to capture the template-adaptor molecules on the surface of a flow cell that is studded with oligonucleotide anchors. The anchor is used as a PCR primer, but because of the length of the template and its proximity to other nearby anchor oligonucleotides, extension by PCR results in the “arching over” of the molecule to hybridize with an adjacent anchor oligonucleotide to form a bridge structure on the surface of the flow cell. These loops of DNA are denatured and cleaved. Forward strands are then sequenced with reversible dye terminators. The sequence of incorporated nucleotides is determined by detection of post-incorporation fluorescence, with each fluor and block removed prior to the next cycle of dNTP addition. Sequence read length ranges from 36 nucleotides to over 50 nucleotides, with overall output exceeding 1 billion nucleotide pairs per analytical run.
Sequencing nucleic acid molecules using SOLiD technology (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. No. 5,912,148; U.S. Pat. No. 6,130,073; each herein incorporated by reference in their entirety) also involves fragmentation of the template, ligation to oligonucleotide adaptors, attachment to beads, and clonal amplification by emulsion PCR. Following this, beads bearing template are immobilized on a derivatized surface of a glass flow-cell, and a primer complementary to the adaptor oligonucleotide is annealed. However, rather than utilizing this primer for 3′ extension, it is instead used to provide a 5′ phosphate group for ligation to interrogation probes containing two probe-specific bases followed by 6 degenerate bases and one of four fluorescent labels. In the SOLiD system, interrogation probes have 16 possible combinations of the two bases at the 3′ end of each probe, and one of four fluors at the 5′ end. Fluor color, and thus identity of each probe, corresponds to specified color-space coding schemes. Multiple rounds (usually 7) of probe annealing, ligation, and fluor detection are followed by denaturation, and then a second round of sequencing using a primer that is offset by one base relative to the initial primer. In this manner, the template sequence can be computationally re-constructed, and template bases are interrogated twice, resulting in increased accuracy. Sequence read length averages 35 nucleotides, and overall output exceeds 4 billion bases per sequencing run.
In certain embodiments, the technology finds use in nanopore sequencing (see, e.g., Astier et al., J. Am. Chem. Soc. 2006 Feb 8; 128(5):1705-10, herein incorporated by reference). The theory behind nanopore sequencing has to do with what occurs when a nanopore is immersed in a conducting fluid and a potential (voltage) is applied across it. Under these conditions a slight electric current due to conduction of ions through the nanopore can be observed, and the amount of current is exceedingly sensitive to the size of the nanopore. As each base of a nucleic acid passes through the nanopore, this causes a change in the magnitude of the current through the nanopore that is distinct for each of the four bases, thereby allowing the sequence of the DNA molecule to be determined
In certain embodiments, the technology finds use in HeliScope by Helicos BioSciences (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. No. 7,169,560; U.S. Pat. No. 7,282,337; U.S. Pat. No. 7,482,120; U.S. Pat. No. 7,501,245; U.S. Pat. No. 6,818,395; U.S. Pat. No. 6,911,345; U.S. Pat. No. 7,501,245; each herein incorporated by reference in their entirety). Template DNA is fragmented and polyadenylated at the 3′ end, with the final adenosine bearing a fluorescent label. Denatured polyadenylated template fragments are ligated to poly(dT) oligonucleotides on the surface of a flow cell. Initial physical locations of captured template molecules are recorded by a CCD camera, and then label is cleaved and washed away. Sequencing is achieved by addition of polymerase and serial addition of fluorescently-labeled dNTP reagents. Incorporation events result in fluor signal corresponding to the dNTP, and signal is captured by a CCD camera before each round of dNTP addition. Sequence read length ranges from 25-50 nucleotides, with overall output exceeding 1 billion nucleotide pairs per analytical run.
The Ion Torrent technology is a method of DNA sequencing based on the detection of hydrogen ions that are released during the polymerization of DNA (see, e.g., Science 327(5970): 1190 (2010); U.S. Pat. Appl. Pub. Nos. 20090026082, 20090127589, 20100301398, 20100197507, 20100188073, and 20100137143, incorporated by reference in their entireties for all purposes). A microwell contains a template DNA strand to be sequenced. Beneath the layer of microwells is a hypersensitive ISFET ion sensor. All layers are contained within a CMOS semiconductor chip, similar to that used in the electronics industry. When a dNTP is incorporated into the growing complementary strand a hydrogen ion is released, which triggers a hypersensitive ion sensor. If homopolymer repeats are present in the template sequence, multiple dNTP molecules will be incorporated in a single cycle. This leads to a corresponding number of released hydrogens and a proportionally higher electronic signal. This technology differs from other sequencing technologies in that no modified nucleotides or optics are used. The per-base accuracy of the Ion Torrent sequencer is ˜99.6% for 50 base reads, with ˜400 Mb generated per run. The read-length is 100 base pairs. The accuracy for homopolymer repeats of 5 repeats in length is ˜98%. The benefits of ion semiconductor sequencing are rapid sequencing speed and low upfront and operating costs.
The technology finds use in another nucleic acid sequencing approach developed by Stratos Genomics, Inc. and involves the use of Xpandomers. This sequencing process typically includes providing a daughter strand produced by a template-directed synthesis. The daughter strand generally includes a plurality of subunits coupled in a sequence corresponding to a contiguous nucleotide sequence of all or a portion of a target nucleic acid in which the individual subunits comprise a tether, at least one probe or nucleobase residue, and at least one selectively cleavable bond. The selectively cleavable bond(s) is/are cleaved to yield an Xpandomer of a length longer than the plurality of the subunits of the daughter strand. The Xpandomer typically includes the tethers and reporter elements for parsing genetic information in a sequence corresponding to the contiguous nucleotide sequence of all or a portion of the target nucleic acid. Reporter elements of the Xpandomer are then detected. Additional details relating to Xpandomer-based approaches are described in, for example, U.S. Pat. Pub No. 20090035777, entitled “High Throughput Nucleic Acid Sequencing by Expansion,” filed Jun. 19, 2008, which is incorporated herein in its entirety.
Other emerging single molecule sequencing methods include real-time sequencing by synthesis using a VisiGen platform (Voelkerding et al., Clinical Chem., 55: 641-58, 2009; U.S. Pat. No. 7,329,492; U.S. Pat. App. Ser. No. 11/671956; U.S. Pat. App. Ser. No. 11/781166; each herein incorporated by reference in their entirety) in which immobilized, primed DNA template is subjected to strand extension using a fluorescently-modified polymerase and florescent acceptor molecules, resulting in detectible fluorescence resonance energy transfer (FRET) upon nucleotide addition.
In some embodiments, capillary electrophoresis (CE) is utilized to analyze amplification fragments. During capillary electrophoresis, nucleic acids (e.g., the products of a PCR reaction) are injected electrokinetically into capillaries filled with polymer. High voltage is applied so that the fluorescent DNA fragments are separated by size and are detected by a laser/camera system. In some embodiments, CE systems from Life Technogies (Grand Island, N.Y.) are utilized for fragment sizing (See e.g., U.S. Pat. No. 6,706,162, U.S. Pat. No. 8,043,493, each of which is herein incorporated by reference in its entirety).
2. Hybridization
Illustrative non-limiting examples of nucleic acid hybridization techniques include, but are not limited to, in situ hybridization (ISH), microarray, and Southern or Northern blot. In situ hybridization (ISH) is a type of hybridization that uses a labeled complementary DNA or RNA strand as a probe to localize a specific DNA or RNA sequence in a portion or section of tissue (in situ), or, if the tissue is small enough, the entire tissue (whole mount ISH). DNA ISH can be used to determine the structure of chromosomes. RNA ISH is used to measure and localize mRNAs and other transcripts within tissue sections or whole mounts. Sample cells and tissues are usually treated to fix the target transcripts in place and to increase access of the probe. The probe hybridizes to the target sequence at elevated temperature, and then the excess probe is washed away. The probe that was labeled with either radio-, fluorescent- or antigen-labeled bases is localized and quantitated in the tissue using either autoradiography, fluorescence microscopy or immunohistochemistry, respectively. ISH can also use two or more probes, labeled with radioactivity or the other non-radioactive labels, to simultaneously detect two or more transcripts.
3. Microarrays
In some embodiments, microarrays are utilized for detection of NOTCH2 nucleic acid sequences. Examples of microarrays include, but not limited to: DNA microarrays (e.g., cDNA microarrays and oligonucleotide microarrays); protein microarrays; tissue microarrays; transfection or cell microarrays; chemical compound microarrays; and, antibody microarrays. A DNA microarray, commonly known as gene chip, DNA chip, or biochip, is a collection of microscopic DNA spots attached to a solid surface (e.g., glass, plastic or silicon chip) forming an array for the purpose of expression profiling or monitoring expression levels for thousands of genes simultaneously. The affixed DNA segments are known as probes, thousands of which can be used in a single DNA microarray. Microarrays can be used to identify disease genes by comparing gene expression in disease and normal cells. Microarrays can be fabricated using a variety of technologies, including but not limiting: printing with fine-pointed pins onto glass slides; photolithography using pre-made masks; photolithography using dynamic micromirror devices; ink-jet printing; or, electrochemistry on microelectrode arrays.
Arrays can also be used to detect copy number variations at al specific locus. These genomic micorarrys detect microscopic deletions or other variants that lead to disease causing alleles.
Southern and Northern blotting is used to detect specific DNA or RNA sequences, respectively. DNA or RNA extracted from a sample is fragmented, electrophoretically separated on a matrix gel, and transferred to a membrane filter. The filter bound DNA or RNA is subject to hybridization with a labeled probe complementary to the sequence of interest. Hybridized probe bound to the filter is detected. A variant of the procedure is the reverse Northern blot, in which the substrate nucleic acid that is affixed to the membrane is a collection of isolated DNA fragments and the probe is RNA extracted from a tissue and labeled.
4. Amplification
NOTCH2 nucleic acid may be amplified prior to or simultaneous with detection. Illustrative non-limiting examples of nucleic acid amplification techniques include, but are not limited to, polymerase chain reaction (PCR), reverse transcription polymerase chain reaction (RT-PCR), transcription-mediated amplification (TMA), ligase chain reaction (LCR), strand displacement amplification (SDA), and nucleic acid sequence based amplification (NASBA). Those of ordinary skill in the art will recognize that certain amplification techniques (e.g., PCR) require that RNA be reversed transcribed to DNA prior to amplification (e.g., RT-PCR), whereas other amplification techniques directly amplify RNA (e.g., TMA and NASBA).
The polymerase chain reaction (U.S. Pat. Nos. 4,683,195, 4,683,202, 4,800,159 and 4,965,188, each of which is herein incorporated by reference in its entirety), commonly referred to as PCR, uses multiple cycles of denaturation, annealing of primer pairs to opposite strands, and primer extension to exponentially increase copy numbers of a target nucleic acid sequence. In a variation called RT-PCR, reverse transcriptase (RT) is used to make a complementary DNA (cDNA) from mRNA, and the cDNA is then amplified by PCR to produce multiple copies of DNA. For other various permutations of PCR see, e.g., U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159; Mullis et al., Meth. Enzymol. 155: 335 (1987); and, Murakawa et al., DNA 7: 287 (1988), each of which is herein incorporated by reference in its entirety.
Transcription mediated amplification (U.S. Pat. Nos. 5,480,784 and 5,399,491, each of which is herein incorporated by reference in its entirety), commonly referred to as TMA, synthesizes multiple copies of a target nucleic acid sequence autocatalytically under conditions of substantially constant temperature, ionic strength, and pH in which multiple RNA copies of the target sequence autocatalytically generate additional copies. See, e.g., U.S. Pat. Nos. 5,399,491 and 5,824,518, each of which is herein incorporated by reference in its entirety. In a variation described in U.S. Publ. No. 20060046265 (herein incorporated by reference in its entirety), TMA optionally incorporates the use of blocking moieties, terminating moieties, and other modifying moieties to improve TMA process sensitivity and accuracy.
The ligase chain reaction (Weiss, R., Science 254: 1292 (1991), herein incorporated by reference in its entirety), commonly referred to as LCR, uses two sets of complementary
DNA oligonucleotides that hybridize to adjacent regions of the target nucleic acid. The DNA oligonucleotides are covalently linked by a DNA ligase in repeated cycles of thermal denaturation, hybridization and ligation to produce a detectable double-stranded ligated oligonucleotide product.
Strand displacement amplification (Walker, G. et al., Proc. Natl. Acad. Sci. USA 89: 392-396 (1992); U.S. Pat. Nos. 5,270,184 and 5,455,166, each of which is herein incorporated by reference in its entirety), commonly referred to as SDA, uses cycles of annealing pairs of primer sequences to opposite strands of a target sequence, primer extension in the presence of a dNTPaS to produce a duplex hemiphosphorothioated primer extension product, endonuclease-mediated nicking of a hemimodified restriction endonuclease recognition site, and polymerase-mediated primer extension from the 3′ end of the nick to displace an existing strand and produce a strand for the next round of primer annealing, nicking and strand displacement, resulting in geometric amplification of product. Thermophilic SDA (tSDA) uses thermophilic endonucleases and polymerases at higher temperatures in essentially the same method (EP Pat. No. 0 684 315).
Other amplification methods include, for example: nucleic acid sequence based amplification (U.S. Pat. No. 5,130,238, herein incorporated by reference in its entirety), commonly referred to as NASBA; one that uses an RNA replicase to amplify the probe molecule itself (Lizardi et al., BioTechnol. 6: 1197 (1988), herein incorporated by reference in its entirety), commonly referred to as Qβ replicase; a transcription based amplification method (Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 (1989)); and, self-sustained sequence replication (Guatelli et al., Proc. Natl. Acad. Sci. USA 87: 1874 (1990), each of which is herein incorporated by reference in its entirety). For further discussion of known amplification methods see Persing, David H., “In Vitro Nucleic Acid Amplification Techniques” in Diagnostic Medical Microbiology: Principles and Applications (Persing et al., Eds.), pp. 51-87 (American Society for Microbiology, Washington, DC (1993)).
5. Detection Methods
Non-amplified or amplified NOTCH2 nucleic acids can be detected by any conventional means. For example, nucleic acid can be detected by hybridization with a detectably labeled probe and measurement of the resulting hybrids. Illustrative non-limiting examples of detection methods are described below.
One illustrative detection method, the Hybridization Protection Assay (HPA) involves hybridizing a chemiluminescent oligonucleotide probe (e.g., an acridinium ester-labeled (AE) probe) to the target sequence, selectively hydrolyzing the chemiluminescent label present on unhybridized probe, and measuring the chemiluminescence produced from the remaining probe in a luminometer. See, e.g., U.S. Pat. No. 5,283,174 and Norman C. Nelson et al., Nonisotopic Probing, Blotting, and Sequencing, ch. 17 (Larry J. Kricka ed., 2d ed. 1995, each of which is herein incorporated by reference in its entirety).
Another illustrative detection method provides for quantitative evaluation of the amplification process in real-time. Evaluation of an amplification process in “real-time” involves determining the amount of amplicon in the reaction mixture either continuously or periodically during the amplification reaction, and using the determined values to calculate the amount of target sequence initially present in the sample. A variety of methods for determining the amount of initial target sequence present in a sample based on real-time amplification are well known in the art. These include methods disclosed in U.S. Pat. Nos. 6,303,305 and 6,541,205, each of which is herein incorporated by reference in its entirety. Another method for determining the quantity of target sequence initially present in a sample, but which is not based on a real-time amplification, is disclosed in U.S. Pat. No. 5,710,029, herein incorporated by reference in its entirety.
Amplification products may be detected in real-time through the use of various self-hybridizing probes, most of which have a stem-loop structure. Such self-hybridizing probes are labeled so that they emit differently detectable signals, depending on whether the probes are in a self-hybridized state or an altered state through hybridization to a target sequence.
By way of non-limiting example, “molecular torches” are a type of self-hybridizing probe that includes distinct regions of self-complementarity (referred to as “the target binding domain” and “the target closing domain”) which are connected by a joining region (e.g., non-nucleotide linker) and which hybridize to each other under predetermined hybridization assay conditions. In a preferred embodiment, molecular torches contain single-stranded base regions in the target binding domain that are from 1 to about 20 bases in length and are accessible for hybridization to a target sequence present in an amplification reaction under strand displacement conditions. Under strand displacement conditions, hybridization of the two complementary regions, which may be fully or partially complementary, of the molecular torch is favored, except in the presence of the target sequence, which will bind to the single-stranded region present in the target binding domain and displace all or a portion of the target closing domain. The target binding domain and the target closing domain of a molecular torch include a detectable label or a pair of interacting labels (e.g., luminescent/quencher) positioned so that a different signal is produced when the molecular torch is self-hybridized than when the molecular torch is hybridized to the target sequence, thereby permitting detection of probe:target duplexes in a test sample in the presence of unhybridized molecular torches. Molecular torches and a variety of types of interacting label pairs are disclosed in U.S. Pat. No. 6,534,274, herein incorporated by reference in its entirety.
Another example of a detection probe having self-complementarity is a “molecular beacon.” Molecular beacons include nucleic acid molecules having a target complementary sequence, an affinity pair (or nucleic acid arms) holding the probe in a closed conformation in the absence of a target sequence present in an amplification reaction, and a label pair that interacts when the probe is in a closed conformation. Hybridization of the target sequence and the target complementary sequence separates the members of the affinity pair, thereby shifting the probe to an open conformation. The shift to the open conformation is detectable due to reduced interaction of the label pair, which may be, for example, a fluorophore and a quencher (e.g., DABCYL and EDANS). Molecular beacons are disclosed in U.S. Pat. Nos. 5,925,517 and 6,150,097, herein incorporated by reference in its entirety.
Other self-hybridizing probes are well known to those of ordinary skill in the art. By way of non-limiting example, probe binding pairs having interacting labels, such as those disclosed in U.S. Pat. No. 5,928,862 (herein incorporated by reference in its entirety) might be adapted for use in the present invention. Probe systems used to detect single nucleotide polymorphisms (SNPs) might also be utilized in the present invention. Additional detection systems include “molecular switches,” as disclosed in U.S. Publ. No. 20050042638, herein incorporated by reference in its entirety. Other probes, such as those comprising intercalating dyes and/or fluorochromes, are also useful for detection of amplification products in the present invention. See, e.g., U.S. Pat. No. 5,814,447 (herein incorporated by reference in its entirety).
ii. Detection of Variant NOTCH2 Proteins
In other embodiments, variant NOTCH2 polypeptides are. Any suitable method may be used to detect truncated or mutant NOTCH2 polypeptides including, but not limited to, those described below.
1. Antibody Binding
In some embodiments, antibodies (See below for antibody production) are used to determine if an individual contains an allele encoding a variant NOTCH2 polypeptide. In preferred embodiments, antibodies are utilized that discriminate between variant (i.e., truncated proteins); and wild-type proteins. In some embodiments, the antibodies are directed to the C-terminus of NOTCH2 proteins. Proteins that are recognized by the N-terminal, but not the C-terminal antibody are truncated. In some embodiments, quantitative immunoassays are used to determine the ratios of C-terminal to N-terminal antibody binding. In other embodiments, identification of variants of NOTCH2 is accomplished through the use of antibodies that differentially bind to wild type or variant forms of NOTCH2 proteins.
Antibody binding is detected by techniques known in the art (e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich” immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., using colloidal gold, enzyme or radioisotope labels, for example), Western blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc.
In one embodiment, antibody binding is detected by detecting a label on the primary antibody. In another embodiment, the primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary antibody is labeled. Many methods are known in the art for detecting binding in an immunoassay and are within the scope of the present invention.
In some embodiments, an automated detection assay is utilized. Methods for the automation of immunoassays include those described in U.S. Patents 5,885,530, 4,981,785, 6,159,750, and 5,358,691, each of which is herein incorporated by reference. In some embodiments, the analysis and presentation of results is also automated. For example, in some embodiments, software that generates a prognosis based on the result of the immunoassay is utilized. In other embodiments, the immunoassay described in U.S. Pat. Nos. 5,599,677 and 5,672,480; each of which is herein incorporated by reference.
C. Kits for Detecting NOTCH2 Mutant or Variant Alleles
The present invention also provides kits for determining whether an individual contains a wild-type or variant (e.g., mutant or polymorphic) allele of NOTCH2. In some embodiments, the kits are useful for determining whether the subject has a splenic lymphoma (e.g., SMZL) or to provide a prognosis to an individual diagnosed with a splenic lymphoma (e.g., SMZL). The diagnostic kits are produced in a variety of ways. In some embodiments, the kits contain at least one reagent useful, necessary, or sufficient for specifically detecting a mutant or variant NOTCH2 allele or protein. In some embodiments, the kits contain reagents for detecting a truncation in the NOTCH2 polypeptide. In preferred embodiments, the reagent is a nucleic acid that hybridizes to nucleic acids containing the mutation and that does not bind to nucleic acids that do not contain the mutation. In other embodiments, the reagents are primers for amplifying the region of DNA containing the mutation. In still other embodiments, the reagents are antibodies that preferentially bind either the wild-type or truncated or variant NOTCH2 proteins.
In some embodiments, the kits include ancillary reagents such as buffering agents, nucleic acid stabilizing reagents, protein stabilizing reagents, and signal producing systems (e.g., florescence generating systems as Fret systems), and software (e.g., data analysis software). The test kit may be packages in any suitable manner, typically with the elements in a single container or various containers as necessary along with a sheet of instructions for carrying out the test. In some embodiments, the kits also preferably include a positive control sample.
In some embodiments, markers (e.g., those described herein) are detected alone or in combination with other markers in a panel or multiplex format. For example, in some embodiments, a plurality of markers are simultaneously detected in an array or multiplex format (e.g., using the detection methods described herein).
D. Bioinformatics
For example, in some embodiments, a computer-based analysis program is used to translate the raw data generated by the detection assay (e.g., the presence, absence, or amount of a given NOTCH2 allele or polypeptide) into data of predictive value for a clinician. The clinician can access the predictive data using any suitable means. Thus, in some preferred embodiments, the present invention provides the further benefit that the clinician, who may not be trained in genetics or molecular biology, need not understand the raw data. The data is presented directly to the clinician in its most useful form. The clinician is then able to immediately utilize the information in order to optimize the care of the subject.
The present invention contemplates any method capable of receiving, processing, and transmitting the information to and from laboratories conducting the assays, information providers, medical personal, and subjects. For example, in some embodiments of the present invention, a sample (e.g., a biopsy or a blood or serum sample) is obtained from a subject and submitted to a profiling service (e.g., clinical lab at a medical facility, genomic profiling business, etc.), located in any part of the world (e.g., in a country different than the country where the subject resides or where the information is ultimately used) to generate raw data. Where the sample comprises a tissue or other biological sample, the subject may visit a medical center to have the sample obtained and sent to the profiling center, or subjects may collect the sample themselves (e.g., a urine sample) and directly send it to a profiling center. Where the sample comprises previously determined biological information, the information may be directly sent to the profiling service by the subject (e.g., an information card containing the information may be scanned by a computer and the data transmitted to a computer of the profiling center using an electronic communication systems). Once received by the profiling service, the sample is processed and a profile is produced (i.e., presence of wild type or mutant NOTCH2), specific for the screening, diagnostic or prognostic information desired for the subject.
The profile data is then prepared in a format suitable for interpretation by a treating clinician. For example, rather than providing raw data, the prepared format may represent a diagnosis or risk assessment (e.g., diagnosis or prognosis of SMZL) for the subject, along with recommendations for particular treatment options. The data may be displayed to the clinician by any suitable method. For example, in some embodiments, the profiling service generates a report that can be printed for the clinician (e.g., at the point of care) or displayed to the clinician on a computer monitor.
In some embodiments, the information is first analyzed at the point of care or at a regional facility. The raw data is then sent to a central processing facility for further analysis and/or to convert the raw data to information useful for a clinician or patient. The central processing facility provides the advantage of privacy (all data is stored in a central facility with uniform security protocols), speed, and uniformity of data analysis. The central processing facility can then control the fate of the data following treatment of the subject. For example, using an electronic communication system, the central facility can provide data to the clinician, the subject, or researchers.
In some embodiments, the subject is able to directly access the data using the electronic communication system. The subject may chose further intervention or counseling based on the results. In some embodiments, the data is used for research use. For example, the data may be used to further optimize the inclusion or elimination of markers as useful indicators of a particular condition or stage of disease.
In some embodiments, the methods disclosed herein are useful in monitoring the treatment of lymphoma (e.g., SMZL). For example, in some embodiments, the methods may be performed immediately before, during and/or after a treatment to monitor treatment success. In some embodiments, the methods are performed at intervals on disease free patients to ensure treatment success.
The present invention also provides a variety of computer-related embodiments. Specifically, in some embodiments the invention provides computer programming for analyzing and comparing a pattern of SMZL-specific marker detection results in a sample obtained from a subject to, for example, a library of such marker patterns known to be indicative of the presence or absence of SMZL, or a particular stage or prognosis of SMZL.
In some embodiments, the present invention provides computer programming for analyzing and comparing a first and a second pattern of SMZL-specific marker detection results from a sample taken at least two different time points. In some embodiments, the first pattern may be indicative of a pre-cancerous condition and/or low risk condition for SMZL cancer and/or progression from a pre-cancerous condition to a cancerous condition. In such embodiments, the comparing provides for monitoring of the progression of the condition from the first time point to the second time point. In yet another embodiment, the invention provides computer programming for analyzing and comparing a pattern of SMZL-specific marker detection results from a sample to a library of SMZL-specific marker patterns known to be indicative of the presence or absence of a SMZL, wherein the comparing provides, for example, a differential diagnosis between an aggressively malignant SMZL cancer and a less aggressive SMZL cancer (e.g., the marker pattern provides for staging and/or grading of the cancerous condition).
The methods and systems described herein can be implemented in numerous ways. In one embodiment, the methods involve use of a communications infrastructure, for example the internet. Several embodiments of the invention are discussed below. It is also to be understood that the present invention may be implemented in various forms of hardware, software, firmware, processors, distributed servers (e.g., as used in cloud computing) or a combination thereof. The methods and systems described herein can be implemented as a combination of hardware and software. The software can be implemented as an application program tangibly embodied on a program storage device, or different portions of the software implemented in the user's computing environment (e.g., as an applet) and on the reviewer's computing environment, where the reviewer may be located at a remote site (e.g., at a service provider's facility).
For example, during or after data input by the user, portions of the data processing can be performed in the user-side computing environment. For example, the user-side computing environment can be programmed to provide for defined test codes to denote platform, carrier/diagnostic test, or both; processing of data using defined flags, and/or generation of flag configurations, where the responses are transmitted as processed or partially processed responses to the reviewer's computing environment in the form of test code and flag configurations for subsequent execution of one or more algorithms to provide a results and/or generate a report in the reviewer's computing environment.
The application program for executing the algorithms described herein may be uploaded to, and executed by, a machine comprising any suitable architecture. In general, the machine involves a computer platform having hardware such as one or more central processing units (CPU), a random access memory (RAM), and input/output (I/O) interface(s). The computer platform also includes an operating system and microinstruction code. The various processes and functions described herein may either be part of the microinstruction code or part of the application program (or a combination thereof) which is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
As a computer system, the system generally includes a processor unit. The processor unit operates to receive information, which generally includes test data (e.g., specific gene products assayed), and test result data (e.g., the pattern of gastrointestinal neoplasm-specific marker detection results from a sample). This information received can be stored at least temporarily in a database, and data analyzed in comparison to a library of marker patterns known to be indicative of the presence or absence of a pre-cancerous condition, or known to be indicative of a stage and/or grade of gastrointestinal cancer.
Part or all of the input and output data can also be sent electronically; certain output data (e.g., reports) can be sent electronically or telephonically (e.g., by facsimile, e.g., using devices such as fax back). Exemplary output receiving devices can include a display element, a printer, a facsimile device and the like. Electronic forms of transmission and/or display can include email, interactive television, and the like. In some embodiments, all or a portion of the input data and/or all or a portion of the output data (e.g., usually at least the library of the pattern of gastrointestinal neoplasm-specific marker detection results known to be indicative of the presence or absence of a pre-cancerous condition) are maintained on a server for access, e.g., confidential access. The results may be accessed or sent to professionals as desired.
A system for use in the methods described herein generally includes at least one computer processor (e.g., where the method is carried out in its entirety at a single site) or at least two networked computer processors (e.g., where detected marker data for a sample obtained from a subject is to be input by a user (e.g., a technician or someone performing the assays)) and transmitted to a remote site to a second computer processor for analysis (e.g., where the pattern of SMZL-specific marker) detection results is compared to a library of patterns known to be indicative of the presence or absence of a pre-cancerous condition), where the first and second computer processors are connected by a network, e.g., via an intranet or internet). The system can also include a user component(s) for input; and a reviewer component(s) for review of data, and generation of reports, including detection of a pre-cancerous condition, staging and/or grading of SMZL, or monitoring the progression of a pre-cancerous condition or SMZL. Additional components of the system can include a server component(s); and a database(s) for storing data (e.g., as in a database of report elements, e.g., a library of marker patterns known to be indicative of the presence or absence of a pre-cancerous condition and/or known to be indicative of a grade and/or a stage of SMZL, or a relational database (RDB) which can include data input by the user and data output. The computer processors can be processors that are typically found in personal desktop computers (e.g., IBM, Dell, Macintosh), portable computers, mainframes, minicomputers, tablet computer, smart phone, or other computing devices.
The input components can be complete, stand-alone personal computers offering a full range of power and features to run applications. The user component usually operates under any desired operating system and includes a communication element (e.g., a modem or other hardware for connecting to a network using a cellular phone network, Wi-Fi, Bluetooth,
Ethernet, etc.), one or more input devices (e.g., a keyboard, mouse, keypad, or other device used to transfer information or commands), a storage element (e.g., a hard drive or other computer-readable, computer-writable storage medium), and a display element (e.g., a monitor, television, LCD, LED, or other display device that conveys information to the user). The user enters input commands into the computer processor through an input device. Generally, the user interface is a graphical user interface (GUI) written for web browser applications.
The server component(s) can be a personal computer, a minicomputer, or a mainframe, or distributed across multiple servers (e.g., as in cloud computing applications) and offers data management, information sharing between clients, network administration and security. The application and any databases used can be on the same or different servers. Other computing arrangements for the user and server(s), including processing on a single machine such as a mainframe, a collection of machines, or other suitable configuration are contemplated. In general, the user and server machines work together to accomplish the processing of the present invention.
Where used, the database(s) is usually connected to the database server component and can be any device which will hold data. For example, the database can be any magnetic or optical storing device for a computer (e.g., CDROM, internal hard drive, tape drive). The database can be located remote to the server component (with access via a network, modem, etc.) or locally to the server component.
Where used in the system and methods, the database can be a relational database that is organized and accessed according to relationships between data items. The relational database is generally composed of a plurality of tables (entities). The rows of a table represent records (collections of information about separate items) and the columns represent fields (particular attributes of a record). In its simplest conception, the relational database is a collection of data entries that “relate” to each other through at least one common field.
Additional workstations equipped with computers and printers may be used at point of service to enter data and, in some embodiments, generate appropriate reports, if desired. The computer(s) can have a shortcut (e.g., on the desktop) to launch the application to facilitate initiation of data entry, transmission, analysis, report receipt, etc. as desired.
In certain embodiments, the present invention provides methods for obtaining a subject's risk profile for developing SMZL or having aggressive SMZL. In some embodiments, such methods involve obtaining a blood or blood product sample from a subject (e.g., a human at risk for developing SMZL; a human undergoing a routine physical examination, or a human diagnosed with SMZL), detecting the presence or absence of the NOTCH2 variants described herein associated SMZL in the sample, and generating a risk profile for developing SMZL or progressing to metastatic or aggressive SMZL. For example, in some embodiments, a generated profile will change depending upon specific markers and detected as present or absent or at defined threshold levels. The present invention is not limited to a particular manner of generating the risk profile. In some embodiments, a processor (e.g., computer) is used to generate such a risk profile. In some embodiments, the processor uses an algorithm (e.g., software) specific for interpreting the presence and absence of specific exfoliated epithelial markers as determined with the methods of the present invention. In some embodiments, the presence and absence of specific NOTCH2 variants as determined with the methods of the present invention are imputed into such an algorithm, and the risk profile is reported based upon a comparison of such input with established norms (e.g., established norm for pre-cancerous condition, established norm for various risk levels for developing SMZL, established norm for subjects diagnosed with various stages of SMZL cancer). In some embodiments, the risk profile indicates a subject's risk for developing SMZL or a subject's risk for re-developing SMZL. In some embodiments, the risk profile indicates a subject to be, for example, a very low, a low, a moderate, a high, and a very high chance of developing or re-developing SMZL cancer or having a poor prognosis (e.g., likelihood of long term survival) from SMZL. In some embodiments, a health care provider (e.g., an oncologist) will use such a risk profile in determining a course of treatment or intervention (e.g., biopsy, wait and see, referral to an oncologist, referral to a surgeon, etc.).
The following examples are provided in order to demonstrate and further illustrate certain preferred embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof
Patients and samples. Six SMZL samples from the University of Michigan were selected as index cases for whole genome sequencing. To assess the prevalence of NOTCH2 mutations in SMZL, an additional 93 SMZL cases were obtained from The University of Texas MD Anderson Cancer Center (31 cases), the University of Utah Health Sciences Center (25 cases), the Southern California Permanente Medical Group (20 cases), the
University of Michigan (15 cases), and the University of Wisconsin (2 cases). Approval from the University of Michigan Hospital institutional review board (HUM0002325b) was obtained for these studies. In order to assess the specificity of NOTCH2 mutations in SMZL, genomic DNA was extracted from additional tissues representing non-SMZL diseases including 15 cases of chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL), 15 cases of mantle cell lymphoma (MCL), 44 cases of grade 1-2 follicular lymphoma (FL), 15 cases of hairy cell leukemia (HCL) and 14 cases of reactive lymphoid hyperplasia (RLH). In addition, 19 cases of non-SMZL (e.g., nodal and extranodal/mucosa-associated lymphoid tissue lymphoma) were analyzed.
Pathological review. All specimens were reviewed independently and confirmed by consensus among three hematopathologists (MSL, NGB and KEJ) according to World Health Organization classification criteria without knowledge of NOTCH2 mutational status. Only cases containing adequate neoplastic tissue were included in subsequent analyses.
Whole genome and targeted NOTCH2 DNA sequencing. From each of six index SMZL cases, 10 μg of high-molecular-weight genomic DNA was extracted from fresh frozen tumor tissue using the QIAamp DNA extraction kit (QIAGEN) and subjected to whole genome sequencing by Complete Genomics, Incorporated (CGI; Mountain View, Calif.). CGI performs massively parallel short-read sequencing using a combinatorial probe-anchor ligation (cPAL) chemistry coupled with a patterned nanoarray-based platform of self-assembling DNA nanoballs (Drmanac et al., 2010 Science 327:78-81). Library generation, read-mapping to the NCBI reference genome (Build 37, RefSeq Accession numbers CM000663-CM00686), local de novo assembly and variant-calling protocols were performed as previously described (Drmanac et al., 2010 supra; Roach et al., 2010 Science 328:636-639). Initial read mapping and variant calling were performed using CGAtools v1.3.0. Additional downstream bioinformatic analyses were performed using custom designed PERL processing routines. Targeted sequencing of the NOTCH2 C-terminal coding exons 25 to 34 was performed using Sanger sequencing for the SMZL samples in the validation cohort. For all other samples, sequencing was confined to exons 26, 27 and 34 where all confirmed mutations in SMZL samples occurred. Somatic acquisition of each mutation was also assessed when matched constitutional tissue was available for analysis. Genomic DNA from index cases and genomic DNA corresponding to matched constitutional tissue were subjected to Sanger sequencing of regions of the NOTCH2 where mutations were observed through whole genome sequencing. For targeted sequencing of exons 25-34 in the NOTCH2 C-terminal region in the validation and specificity cohort samples, genomic DNA was extracted using both the QIAGEN BioRobot EZ1 and QIAamp FFPE DNA extraction kits (QIAGEN). For all Sanger sequencing reactions, PCR amplification was performed using Phusion DNA polymerase (New England Biolabs) followed by conventional Sanger sequencing technology using BigDye version 3.1 chemistry run on an Applied Biosystems 3730x1 DNA Sequencer at the University of Michigan DNA sequencing Core. All sequencing reactions were performed using nested sequencing primers. Sequencing trace analysis was performed using Mutation Surveyor software. All mutations were verified in at least two independent PCR amplification and sequencing reactions. cDNA nucleotide numbering of coding sequence is based on Genbank accession NG—008163.1. Protein amino acid numbering is based on Genbank accession NP—077719.2. Detailed primer sequences for targeted exon sequencing can be found in Table 1.
Cloning of NOTCH2 mutants and transactivation analysis. DNA constructs representing NOTCH2 mRNA lacking the EGF-repeat region (residues p.M1 to p.E1412, exons 1-24; ΔEGF) were engineered from full length NOTCH2 gene (OriGene; Rockville MD) to contain nucleotide sequence identical to that of wild-type and selected NOTCH2 mutations identified in index and validation SMZL samples. These constructs were transiently expressed in 293T cells and assessed for their ability to activate a NOTCH sensitive luciferase reporter gene system (SA Biosciences; Valencia Calif.). NOTCH2 mutations p.V1667I, p.Q2285X, p.I2304fsX9, p.R2400X and p.E2411X were introduced into ΔEGF NOTCH2 using QuickChange kit (Stratagene; La Jolla, CA) and appropriate truncation primers. Wild-type and NOTCH2 mutated constructs were cloned between EcoR1 and Xho1 restriction sites in the pCAGGS3.2 FLAG vector, which introduces FLAG tag at the N-terminal part of the protein. The sequence verified constructs were tested for expression of either wild-type or mutant NOTCH2 protein. NOTCH2 expression plasmids were introduced into 293T cells by transient transfection using Polyjet transfection reagent (SignaGen Laboratories; Rockville Md.) and assessed for their ability to activate a NOTCH-sensitive luciferase reporter gene, using Cignal RBP-Jk Reporter kit per protocol (SA Biosciences; Valencia Calif.). Briefly, cells in 24-well dishes were co-transfected in triplicate with 400 ng of various ΔEGF NOTCH2 expression constructs, a NOTCH-sensitive firefly luciferase reporter gene, and an internal control Renilla luciferase plasmid (Promega; Madison Wis.). Firefly luciferase activities were measured in whole-cell extracts prepared 48 h after transfection using the Dual Luciferase kit (Promega) and a specially configured luminometer (Berthold Technologies; Germany). Western blotting was performed using the extracts to ensure equal expression of different constructs. Briefly, 50 μl of total protein extracts from the reporter assay were separated on a high resolution SDS PAGE using SDS PAGE running buffer, followed by Western blotting using FLAG M2 mouse monoclonal antibody (Sigma-Aldrich; St. Louis Mo.).
Statistical analysis of clinical outcomes. Clinical outcomes data (time to transformation, relapse or death) were analyzed using standard survival analysis. Survival plots were generated using Kaplan-Meier method and Log-rank tests were used to compare survival times between patients with NOTCH2 mutations and patients with wild-type NOTCH2. Cox-proportional hazards regression analysis was conducted to compare the two groups of patients after adjusting for age, gender, performance status and stage at diagnosis. Statistical analyses were performed with SAS version 9.3.
Genome Sequencing and NOTCH2 Mutation Confirmation
To gain insight into the pathogenesis of SMZL, WGS was performed on six index cases of SMZL. Whole genome sequencing (WGS) yielded an average of 350±10 million mapped reads per sample with an average of 97.6±0.08% genome coverage and 96.4±0.3% fully-called exome coverage. The median genomic sequencing depth exceeded 80× in all samples normalized across the entire genome. In order to enhance the ability to identify somatic alterations that are important in SMZL pathogenesis, variations that were present in any of the 6 SMZL genomes and not in the Database of SNPs (dbSNP) were investigated.
After normalization to publicly available constitutional normal genome sequencing data (Complete Genomics, Inc.), relative depth of coverage for distinct chromosomal regions were examined for evidence of recurrent chromosomal gains or losses. Corresponding plots of ploidy for each genome are shown in
In total, 2,995 candidate genes were identified with at least one previously undocumented single nucleotide polymorphism (SNP) or small insertion/deletion event (indel) in at least one of the six SMZL genomes (comparison to dbSNP; not shown). Of these, 232 genes showed novel alterations in at least two of the six SMZL index genomes. These included mutations in epigenetic modifiers including MLL2 and MLL3 which have been previously reported to occur in follicular diffuse large B-cell lymphomas but not in marginal zone lymphomas (Morin et al., 2011 Nature. 476:298-303; Pasqualucci et al., 2011 Nat Genet. 43:830-837). In three of six index SMZL cases, variant call analysis identified NOTCH2 mutations predicted to lead to protein truncation in the distal C-terminal region in the transactivation (TAD) and proline/glutamate/serine/threonine-rich (PEST) domains. Two of these cases harbored the same p.R2400X nonsense amino acid substitution mutation and one case harbored a length-affecting mutation leading to a frameshift at residue p.I2304 (
Prevalence of NOTCH2 Mutations in SMZL
In order to establish the prevalence of NOTCH2 mutations among a larger SMZL cohort, targeted Sanger sequencing of exons 25 through 34 (
These mutations were largely length-affecting mutations (either frameshift or non-sense mutations) confined to the distal TAD and PEST domains and are predicted to cause truncation of the NOTCH2 protein, eliminating degradation signals in the PEST domain, thereby increasing the stability of the NICD2. A single missense mutation (p.V1667I) located in the HD is predicted to be equivalent to the p.V1722INOTCH1 mutation in T-ALL associated with ligand-independent NOTCH1 activation (Gordon et al., 2007 Nat Struct Mol Biol 14:295-300; Malecki et al., 2006 Mol Cell Biol 26:4642-4651). Overall, 25 of 99 SMZL cases (25.3%) harbored NOTCH2 mutations. Whereas most of these mutations were single heterozygous mutations, one of 25 SMZL patients had two distinct NOTCH2 mutations including both a length affecting mutation (p.I2304fsX2) and a missense variant (p.M2358V;
although constitutional tissue was not available to assess somatic acquisition). Of the 25 cases with NOTCH2 mutations, 19 patients had corresponding matched normal tissue. None of the constitutional tissues harbored sequence variants indicating somatic acquisition of NOTCH2 mutations detected in tumor tissue.
Having established a high frequency of NOTCH2 mutations in the validation cohort, the initial genomic sequencing screening data was queried for the existence of structural alterations affecting other genes in the NOTCH signaling pathway. This investigation identified predicted protein coding alterations affecting MAML2, a cofactor of the NOTCH2 transcriptional complex, in the three genomes that did not have NOTCH2 mutations. These alterations included previously reported p.Q237R and p.V836I variants as well as a novel p.G25W mutation. Sanger sequencing confirmed the variants in the corresponding tumor samples. However, the previously reported variants were present in corresponding germline tissue and thus were not somatically acquired. The novel p.G25W mutation was confirmed to be somatically acquired by direct Sanger sequencing. The mutation affects an amino acid with the N-terminal region of the MAML2 protein known to mediate protein-protein interactions with NOTCH family members. The prevalence of additional MAML2 mutations in the validation cohort were investigated. This identified a single additional somatic mutations in MAML2 (p.A11S) in a genome without an identified NOTCH2 mutation. Overall, the prevalence of putative impactful somatic mutations in MAML2 was therefore two out of 99 case (2.0%). No mutations were found in Fbw7 or other NOTCH-pathway related genes in the discovery cohort.
Of the mutations identified in SMZL including the most frequently recurrent mutations at p.R2400 and p.I2304, most are predicted to prematurely truncate the protein prior to complete translation of the C-terminal PEST domain. These mutations are therefore predicted to abrogate the negative regulatory function of the PEST domain and lead to increased NICD2 stability with activation of downstream NOTCH2 signaling by gain-of-function. Additionally, the HD is known to protect from ligand-independent activation (Kojika and Griffin, 2001 Exp Hematol 29:1041-1052). The single missense mutation (p.V1667I) identified in the HD region would be predicted to disable this protection and thus trigger NOTCH2 intracellular signaling and promote downstream transcriptional activation.
To test the functional effect of NOTCH2 mutations on NOTCH2 signaling, selected NOTCH2 mutant proteins (p.V1667I, p.Q2285X, p.I2304fsX9, p.R2400X and p.E2411X mutations) were transiently expressed into 293T cell lines and the effects on down-stream NOTCH2 signaling were assessed using a luciferase reporter system containing iterated CSL-binding sites derived from the HES 1 promoter (
Having established the frequency of NOTCH2 mutations in SMZL, the specificity of these mutations for SMZL was assessed. Sanger sequencing was performed on CLL/SLL, FL, HCL, and MCL as well as RLH samples. No evidence of NOTCH2 mutations was identified in any of 103 cases of CLL/SLL, FL, HCL, MCL or RLH (
Having demonstrated the presence of NOTCH2 mutations in a subset of SMZL cases, it was determined whether the presence of these mutations influenced clinical outcomes. Time to adverse outcome, defined from tissue diagnosis to relapse, transformation or death was compared between patients harboring NOTCH2 mutations and those with wild-type
NOTCH2. Survival data was available for 46 patients from this study including 11 patients with NOTCH2 mutations and 35 patients with wild-type NOTCH2 with a median follow-up of 40 months (range: 0.7 to 177 months). Patients with NOTCH2 mutations had significantly shorter time to adverse outcome compared to patients with wild-type NOTCH2 (the median time to adverse outcome was 32.6 months in NOTCH2-mutated patients versus 107.2 months in patients without NOTCH2 mutations (P=0.002;
tion
L
C
X9
L
T
X
L
T
X
L
A
L
T
X
L
A
2275D
L
GCACG
X12
L
T
2285X
L
T
2285X
L
A
L
G
X3
L
C
L
C
X2
L
C
X2
L
C
X9
L
CCC
X3
L
T
L
T
X
L
T
X
L
T
X
L
T
X
L
T
X
L
T
X
L
T
X
L
T
T
X
indicates data missing or illegible when filed
tomy
1
mg/L
indicates data missing or illegible when filed
,887
MYD3
,776
54
A
RTM4
139
P15
PF1
778
4
T
951
79,998
21
,709
RL
5
7
1
1
F2
6
2
,993,7
,75
41
0
TN2
,616,99
,217,728
,531
AM
52
0,395
1
63
5
04
,390
60,973
32
,703,
66
,178
,024,956
,026,278
6
5
8
0
94
,975
LFN11
2
LFN11
,759
47
LFN11
66,242
62
,074,72
A5
X
31,945
,083
2,258
8,
63,473
OPL
,051
,125
42,066
54,133
7,7
3,729
7
4,563,723
2
1
4,566,277
5
0,974
,682,
55
4,7
,691
5
81
46
45
K3C3
16
,018,371
29
2,
0
90
7,5
7
77
6
6
CP2
F
,351,
93
1,
52
2
5,637
7,019,774
,885,142
P
6,3
6
Y3
,220
DK1
63
1
AN12
0,4
6,0
71,978
99
C1D13
7,
2
6,4
,05
,480
,882
,537,6
1
,77
1
3
,174,1
2,273
HG
,595,998
,5
,600
4,3
9
1,092
LFN
89,757
LFN
,690,846
LFN
,364,
,498
15
T
2,
1,753
2,495
4,472
,152
19
26,
00
,539
,667
5,128,324
,985
5,381,398
97
,924,695
H2D1
2
,24
,836
6
1,376,811
1,380,334
96,590
1
2,137
2,986
,720,813
D5
,224
5,
1
01
9,013,485
9,015,126
,624
6
1
,143
,513,997
4,401
1
DK1
,122
55
13A
7,456
9,157
,180
,642,127
ERP2
,400,405
,402,558
2
nterchromosome
MYD3
A
RTM4
3
PF1
3
T
571
5
1
1
99
96
06
4
4
5
AM
62
2
24
LFN11
LFN11
LFN11
X
5
KL1A
47
XW4
2
LC16
79
3C3
KR
7
CP2
F
6
UD31
NCAIP
NCAIP
7
P
Y3
DK1
62,
AN12
C1D13
5
20,74
5,
15
T
2
1
LFN
5
LFN
LFN
G
T
19
A
2
2
H2D1
,204
1
s
7,4
0
13A
45
2
2
indicates data missing or illegible when filed
All publications and patents mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the medical sciences are intended to be within the scope of the following claims.
This application claims priority to U.S. Provisional Patent Application No. 61/666,445, filed Jun. 29, 2012, which is herein incorporated by reference in its entirety.
This invention was made with government support under DE019249, CA136905 and CA140806 awarded by the National Institutes of Health. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
61666445 | Jun 2012 | US |