This invention is related to the area of nucleic acid analysis. In particular, it relates to analysis of methylated nucleic acids.
In humans, DNA methylation is largely restricted to cytosines within 5′-CpG dinucleotides. This covalent modification of DNA functions as an important mediator of gene regulation and, together with covalent modifications of histone proteins, forms the cornerstone for the burgeoning field of epigenetics. In addition to its role in development, imprinting, and cellular differentiation, DNA methylation is altered in cancer cells. Though cancers are globally hypomethylated1, 2, specific regions of genes have been shown to be hypermethylated in associated with transcription silencing3,4. Reversal of the hypermethylation with pharmacological agents such as 5-azacytidine can reactivate such genes4. Though many regions of the cancer genome are abnormally methylated, the methylation and silencing of tumor suppressor genes are particularly important. In such cases, methylation serves as a heritable mechanism to inactivate a tumor suppressor gene, with the end result similar to that resulting from mutational inactivation of the gene.
In addition to its implications for gene regulation, DNA methylation is providing a new generation of cancer biomarkers5, 6. Though mutant sequences provide exquisitely specific biomarkers of this class, their utility is compromised by their heterogeneity: the same gene can be mutationally inactivated through many different mechanisms or mutated at many different positions. In contrast, DNA hypermethylation in cancers often affects identical residues in the regulatory regions of particular genes, providing significant advantages in biomarker test design. Accordingly, many studies have employed DNA methylation of specific genes for diagnostics development7-13. Such diagnostic tests can in principle be used for early detection of cancers, prognosis, assessing the effects of therapy or detecting residual diseases.
Many diagnostic tests based on DNA methylation have employed bisulfite to convert cytosine residues to uracils. This conversion alters the sequence of DNA, providing an opportunity to assess DNA methylation with allele-specific PCR, restriction digestion or specific hybridization probes14-18. Such techniques have been enormously useful for establishing the position of hypermethylated sequences in the cancer genome as well as for documenting the potential utility of such methylation for diagnostic purposes. However, in many important diagnostic scenarios, DNA from the cancer represents only a small fraction of the total DNA in the clinical sample. Such scenarios include the use of DNA from plasma/serum, urine, feces, or sputum for early diagnosis or therapeutic monitoring and the use of DNA from surgical margins or lymph nodes to monitor the extent of disease19. Currently used techniques for assessing methylation generally do not provide direct quantitative information about the fraction of DNA fragments that are methylated or are incapable of detecting small fractions of methylated fragments. The actual fraction of methylated fragments can be determined through cloning and sequencing of PCR products or next-generation sequencing technologies20, 21, but these approaches cannot easily be used in a clinical diagnostics setting.
There is a continuing need in the art for sensitive and accurate analytic methods.
According to one embodiment of the invention a method is provided for determining fraction of molecules comprising a methylated sequence in a sample of analyte DNA molecules comprising the sequence. A sample of analyte DNA molecules is treated with a reagent which selectively modifies methylated cytosine residues or which selectively modifies unmethylated cytosine residues. Microemulsions are formed comprising the treated analyte DNA molecules. A portion of a treated analyte DNA molecule is amplified in the microemulsions in the presence of beads. The portion comprises one or more 5′-CpG methylation sites. The beads are bound to a plurality of molecules of a primer for amplifying the analyte DNA molecules. A plurality of copies of analyte DNA molecule are formed covalently attached to the plurality of molecules of the primer which are bound to beads. Nucleotide sequences at the one or more 5′-CpG methylation sites of analyte DNA molecules which are bound to beads are determined and beads that have modified 5′-CpG methylation sites and beads that have unmodified 5′-CpG methylation sites are quantified.
According to another aspect of the invention a bead is provided which is bound to a plurality of molecules of a primer for amplifying DNA molecules. The primer comprises at least 15 contiguous nucleotides selected from the group consisting of 5′-GTTGTTTAGG TTGTAGGTGN GGG-3 (SEQ ID NO: 2), 5′-CTCNTCCTCC TACCNCAAAA TATTC-3′ (SEQ ID NO: 3), and the complements thereof.
These and other embodiments which will be apparent to those of skill in the art upon reading the specification provide the art with
The inventors have developed a technology for direct quantification of DNA methylation at specific sites that is readily applicable to clinical samples, even when the fraction of methylated fragments in such samples is minute. This approach can be applied to determine the extent of methylation of marker genes in plasma and fecal DNA from cancer patients and healthy controls.
Beads are also known as microspheres or microparticles. Particle sizes can vary between about 0.1 and 10 microns in diameter. Typically beads are made of a polymeric material, such as polystyrene, although nonpolymeric materials such as silica can also be used. Other materials which can be used include styrene copolymers, methyl methacrylate, functionalized polystyrene, glass, silicon, and carboxylate. Optionally the particles are superparamagnetic, which facilitates their purification after being used in reactions.
Beads can be modified by covalent or non-covalent interactions with other materials, either to alter gross surface properties, such as hydrophobicity or hydrophilicity, or to attach molecules that impart binding specificity. Such molecules include without limitation, antibodies, ligands, members of a specific-binding protein pair, receptors, nucleic acids. Specific-binding protein pairs include avidin-biotin, streptavidin-biotin, and Factor VII-Tissue Factor. Primers can be attached via spacer moieties. The spacer moieties may be oligonucleotides.
Beads, after being prepared according to the present invention as product beads, have more than one copy of the same nucleic acid molecule bound to them. Preferably each bead is bound to at least 10, 50, 100, 500, 1000, or 10,000 molecules of the same nucleic acid sequence. In some circumstances some of the product beads are bound to more than one type of nucleic acid molecule. These product beads are generally less useful in the analysis of ratios of methylated sequences in a population of sequences. Such product beads can be readily discriminated and so do not distort the analysis.
A population of product beads will often comprise two or more types of nucleic acids. Such a population is heterogeneous with respect to the nucleic acids. Desirably, a substantial proportion of the product beads comprise only one type of nucleic acid per bead. A substantial proportion can be for example, at least 1%, at least 5%, at least 10%, or at least 50%. A product bead with only one type of nucleic acid per bead is termed homogeneous. A product bead with two types of nucleic acid per bead is termed heterogeneous. Heterogeneous product beads may result from aqueous compartments which have more than two molecules of template of non-identical sequence. This could result, for example from incomplete modification of methylated cytosine residues. A population of product beads can be heterogeneous as a population but contain individual product beads that are homogeneous.
Individual product beads preferably comprise more than one copy of template analyte molecule. Each bead may comprise at least 10, at least 50, at least 100, at least 500, at least 1000, or at least 10,000 copies of template analyte. If the bead is homogeneous, each of those copies will be identical.
Populations of product beads can be maintained in a liquid suspension. Alternatively they can be sedimented and dried or frozen. The latter alternatives may be beneficial for storage stability.
Analysis of populations of product beads can be used for distinguishing between methylated and non-methylated or hypomethylated templates. Polynucleotides can be distinguished which differ by as little as a single methylated residue, although differences in multiple residues may by better for detection sensitivity. Increased sensitivity obtained using multiple residues may decrease specificity, however.
One very convenient way for distinguishing between the products of methylated and non-methylated CpG islands is by differentially labeling probes to the expected products with fluorescent dyes. Such labeling can be accomplished by hybridization of a fluorescently labeled oligonucleotide probe to one species of polynucleotide. Alternatively, a fluorescently labeled antibody can be used to specifically attach to one oligonucleotide probe that hybridizes to a particular product. Such antibody binding can be, for example, mediated by a protein or polypeptide which is attached to an oligonucleotide hybridization probe. Of course, other means of labeling polynucleotides as are known in the art can be used without limitation. Another means of labeling different polynucleotide species is by primer extension. Primers can be extended using labeled deoxyribonucleotides, such as fluorescently labeled deoxyribonucleotides.
Populations of product beads can be used as templates. Template analyte molecules on the product beads can be analyzed to assess DNA sequence variations by hybridization, primer-extension methods, mass spectroscopy, and other methods commonly used in the art. Template analyte molecules on product beads can be employed for solid phase sequencing. In one solid phase sequencing technique, product beads are arrayed by placing them on slides spotted with complementary oligonucleotides. In another solid phase sequencing technique, product beads are placed into individual wells. In still another solid phase sequencing technique product beads are incorporated into acrylamide matrices (with or without subsequent polony formation). Sequencing reactions can be performed with any solid phase sequencing method, such as those using unlabeled nucleotide precursors (e.g., pyrosequencing, as described in Ronaghi et al., Anal. Biochem. 267: 65-71, 1999) or labeled nucleotides (e.g., photocleavable reagents described by Mitra et al., Anal. Biochem. 320:55-65, 2003). Product beads can thus be used for and facilitate massively parallel sequencing. Product beads can also be used in sequencing employing Type IIS restriction endonucleases. Product beads can also be used to provide templates for conventional dideoxynucleotide sequencing. To obtain useful data upon sequence analysis, a homogeneous template population is desirable. To provide a homogenous template population, product beads can be diluted, separated, or otherwise isolated so that each sequencing reaction contains a single product bead. Alternatively, product beads can be sorted to provide populations of beads with a single species of template.
Oligonucleotide primers can be bound to beads by any means known in the art. They can be bound covalently or non-covalently. They can be bound via an intermediary, such as via a protein-protein interaction, such as an antibody-antigen interaction or a biotin-avidin interaction. Other specific binding pairs as are known in the art can be used as well. To achieve optimum amplification, primers bound to the bead may be longer than necessary in a homogeneous, liquid phase reaction. Oligonucleotide primers may be at least 12, at least 15, at least 18, at least 25, at least 35, or at least 45 nucleotides in length. The length of the oligonucleotide primers which are bound to the beads need not be identical to that of the primers that are in the liquid phase. Primers can be used in any type of amplification reaction known in the art, including without limitation, polymerase chain reaction, isothermal amplification, rolling circle amplification, self-sustaining sequence replication (3SR), nucleic acid sequence-based amplification (NASBA), transcription-mediated amplification (TMA), strand-displacement amplification (SDA), and ligase chain reaction (LCR). Primers can be designed to hybridize only to modified, only to non-modified, or to both modified and non-modified sequences. Mixtures of primer sequences can be used to capture variants.
Microemulsions are made by stirring or agitation of oil, aqueous phase, and detergent. The microemulsions form small aqueous compartments (nanocompartments) which have an average diameter of 0.5 to 50 microns. The compartments may be from 1 to 10 microns, inclusive, from 11 to 100 microns, inclusive, or about 5 microns, on average. All such compartments need not comprise a bead. Desirably, at least one in 10,000 of said aqueous compartments comprise a bead. Typically from 1/100 to 1/1 or from 1/50 to 1/1 of said aqueous compartments comprise a bead. In order to maximize the proportion of beads which are homogeneous with respect to oligonucleotide, it is desirable that on average, each aqueous compartment contains less than 1 template molecule. Aqueous compartments will also desirably contain whatever reagents and enzymes are necessary to carry out amplification. For example, for polymerase chain reaction (PCR) the compartments will desirably contain a DNA polymerase and deoxyribonucleotides. For rolling circle amplification a DNA polymerase and a generic DNA circle may be present.
Emulsions can be “broken” or disrupted by any means known in the art. One particularly simple way to break the emulsions is to add more detergent. Detergents which can be used include, but are not limited to Triton X100, Laureth 4, Nonidet.
Sample DNA for amplification and analysis according to the present invention can be genomic DNA. Samples can be derived from a single individual, for example, from a body sample such as urine, blood, serum, plasma, sputum, stool, tissue or saliva. Samples can also be derived from a population of individuals. The individuals can be humans, but can be any organism, plant or animal, eukaryotic or prokaryotic, viral or non-viral.
Any type of probe can be used for specific hybridization to the amplified polynucleotides which are bound to the beads. Fluorescently labeled probes are useful because their analysis can be automated and can achieve high throughput. Fluorescence activated cell sorting (FACS) permits both the analysis and the isolation of different populations of beads. One type of fluorescently labeled probe that can be used is a modified molecular beacon probe. These probes have stem-loop structures and an attached fluorescent moiety on the probe, typically on one end of the probe, sometimes attached through a linker. Unlike standard molecular beacon probes, modified molecular beacon probes do not have a quenching moiety. The modified molecular beacon probe can have the fluorescent moiety attached on either end of the probe, 5′ or 3′. One such probe will hybridize better to a sequence from a methylated gene than to a non-methylated gene. Another such probe will hybridize better to a sequence from a non-methylated gene than to a sequence from the methylated gene.
Microemulsions or nanocompartments are formed with beads and primers. Because BEAMing requires thermal cycling, an emulsifier which is thermostable can optionally be used. One such emulsifier is Abil® EM90 (Degussa-Goldschmidt Chemical, Hopewell, Va.). Other such emulsifiers can be used as are known in the art.
Chemical reagents can be used which selectively modify either the methylated or non-methylated form of CpG dinucleotide motifs. Alternatively, methylation-sensitive restriction endonucleases can be used to detect methylated CpG dinucleotide motifs. Such endonucleases may either impartially cleave within methylated CpG recognition sites relative to non-methylated CpG recognition sites or preferentially cleave within non-methylated relative to methylated CpG recognition sites. Examples of the former are Acc III, BstN I, and Msp I. Examples of the latter are Acc II, Ava I, BssH II, BstU I, Hpa II, and Not I.
Modified products can be detected directly, or after a further reaction which creates products which are easily distinguishable. Means which detect altered size and/or charge can be used to detect modified products, including but not limited to electrophoresis, chromatography, and mass spectrometry. Examples of such chemical reagents for selective modification include hydrazine and bisulfate ions. Hydrazine-modified DNA can be treated with piperidine to cleave it. Bisulfate ion-treated DNA can be treated with alkali.
One way to distinguish between modified and nonmodified DNA is to use oligonucleotide probes which are specific for certain products. Such probes can be hybridized directly to modified DNA or to amplification products of modified DNA. Oligonucleotide probes can be labeled using any detection system known in the art. These include but are not limited to fluorescent moieties, radioisotope labeled moieties, bioluminescent moieties, luminescent moieties, chemiluminescent moieties, enzymes, substrates, receptors, or ligands.
Test cells can be obtained from surgical samples, such as biopsies or fine needle aspirates, from paraffin embedded tissues, from a body fluid such as bone marrow, blood, serum, plasma, lymph, cerebrospinal fluid, saliva, sputum, stool, urine, or semen. Such sources are not meant to be exhaustive, but rather exemplary.
In one aspect of this embodiment, a gene can be contacted with hydrazine, which modifies cytosine residues, but not methylated cytosine residues, then the hydrazine treated gene sequence is contacted with a reagent such as piperidine, which cleaves the nucleic acid molecule at hydrazine modified cytosine residues, thereby generating a product comprising fragments. The fragments can then be used in amplification reactions and the products analyzed.
Bisulfite ions, for example, sodium bisulfite, convert non-methylated cytosine residues to bisulfite modified cytosine residues. The bisulfite ion treated gene sequence can be exposed to alkaline conditions, which convert bisulfite modified cytosine residues to uracil residues. Sodium bisulfite reacts readily with the 5,6-double bond of cytosine (but poorly with methylated cytosine) to form a sulfonated cytosine reaction intermediate that is susceptible to deamination, giving rise to a sulfonated uracil. The sulfonate group can be removed by exposure to alkaline conditions, resulting in the formation of uracil. The DNA can be amplified, for example, by PCR, and sequenced to determine whether CpG sites are methylated in the DNA of the sample. Uracil is recognized as a thymine by Taq polymerase and, upon PCR, the resultant product contains cytosine only at the position where 5-methylcytosine was present in the starting template DNA. One can compare the amount or distribution of uracil residues in the bisulfite ion treated gene sequence of the test cell with a similarly treated corresponding non-methylated gene sequence. A decrease in the amount or distribution of uracil residues in the gene from the test cell indicates methylation of cytosine residues in CpG dinucleotides in the gene of the test cell. The amount or distribution of uracil residues also can be detected by contacting the bisulfite ion treated target gene sequence, following exposure to alkaline conditions, with an oligonucleotide that selectively hybridizes to a nucleotide sequence of the target gene that either contains uracil residues or that lacks uracil residues, but not both, and detecting selective hybridization (or the absence thereof) of the oligonucleotide.
Methyl-BEAMing can quantitatively assess DNA methylation in clinical samples. In addition to its potential value as a platform for clinical diagnosis, these studies have illuminated several features with important implications for the use of DNA methylation in cancer diagnostics, regardless of the platform used. First, the correspondence between the number of methylated vimentin DNA fragments and mutant APC and PIK3CA DNA fragments in the plasma samples from cancer patients provides strong evidence that circulating methylated DNA fragments are derived from the cancer cells themselves. As DNA methylation is plastic within mammalian cells, it was possible that such circulating DNA was derived from non-neoplastic stromal or inflammatory cells adjacent to the tumor or from elsewhere in the host. As far as we know, this is the first study to provide evidence of this kind, as it requires quantitative measures of both methylation and mutation to make the argument.
Second, our results demonstrate that several methylation sites within a region of interest should be concurrently assessed to achieve the required sensitivity and specificity in a diagnostic test based on methylation. This conclusion is based on the fact that even with carefully optimized bisulfite treatments, ˜0.5% of individual cytosines are unconverted. PCR errors can further contribute to this background level32. As less than 1% of the DNA in some clinical samples, such as those from plasma, are derived from tumors, it is critical to be able to differentiate biologically methylated molecules from such technical artifacts.
Third, our results demonstrate that the fraction of methylated fragments derived from cancers is higher in fecal DNA than in plasma DNA. Moreover, the fraction of methylated DNA fragments derived from adenomas is sufficiently high in fecal DNA to be detected (Table 2). We did not attempt to detect circulating DNA in adenoma patients, as previous studies have shown that there is very little, if any, circulating DNA derived from pre-malignant lesions of the colon22. This result suggests that fecal DNA is the preferred analyte for colorectal cancer screening in methylation assays. However, the fraction of methylated vimentin molecules from normal individuals is substantially higher in feces than in plasma (mean fraction of 0.96% and 0.025% in feces and plasma, respectively). It is known that DNA methylation is tissue specific, and normal cells within the gastrointestinal tract presumably contribute more methylated DNA to the feces than to the circulation. This background limits the sensitivity and specificity of Methyl-BEAMing of fecal DNA by raising the threshold for positivity.
The ability to quantitatively measure the fraction of methylated molecules via Methyl-BEAMing provides several diagnostic opportunities. There is nearly a 2000-fold difference between the concentrations of methylated vimentin molecules in the plasma of advanced cancer patients compared to normal individuals (Table 1). This difference could easily be translated into simple tests for assessing the extent of disease following therapeutic procedures30. One could also imagine a prenatal test for Down's Syndrome based on the assessment of methylation of DNA fragments of chromosome 2133. The development of such a test would require discovery of sequences that are methylated in the fetus but not in adults. Both these applications require a quantitative, rather than qualitative, assessment of DNA methylation such as afforded by Methyl-BEAMing. Though sequencing-by-synthesis instruments, such as the one employed in this study, also provide such quantification, they are not well-suited for high-throughput clinical analyses because of relatively long process times and high cost.
One of the most important applications of Methyl-BEAMing is in early diagnosis. In this study, 59% of patients with colorectal cancer could be detected by a plasma-based Methyl-BEAMing assay. Note that this sensitivity is close to the maximal detectable with the vimentin biomarker, which has been shown to be methylated in 53-83% of colorectal cancers25. The fact that 50% of presumably curable Duke's A and B stage cancers could be detected by this method was particularly encouraging.
Equally encouraging was the observation that 45% of patients with clinically significant pre-malignancies could be detected by Methyl-BEAMing of fecal DNA. Patients with these lesions would generally be treated by endoscopic excision without abdominal surgery. In screening an asymptomatic cohort of adults over age 50, the prevalence of colon cancers and of adenomas≧1 centimeter is expected to be 0.5-1.0% and 6.0%, respectively34, 35. Detection of 41% of cancers and 45% of adenomas by Methyl-BEAMing of stool DNA, at 95% specificity, would translate into a positive predictive value of 40%. This value exceeds that achieved in many studies of mammography, a cancer screening test of established clinical value in reducing mortality36.
How could the Methyl-BEAMing approach be improved? As it could in principle be applied to any gene that is hypermethylated in cancers, the sensitivity for early-stage tumors could easily be raised by including one or a few additional genes in Methyl-BEAMing assays. The specificities of plasma-based and fecal-based Methyl-BEAMing assays were 95% and 92%, respectively. Though specificities in this range are typical of diagnostic screening tests that are widely used36, the inclusion of additional genes in a methylation panel would undoubtedly lower the aggregate specificity. One challenge is therefore the identification of sequences in addition to vimentin that are methylated in tumor DNA but very rarely methylated in any normal adult cells. We have tested several commonly used methylated biomarkers in the context of selecting a biomarker that would be useful for testing in a body fluid and found most of them to be unsuitable. Hopefully, currently ongoing genome-wide epigenetic projects will identify new, more specific methylated sequences in the future37, 38. Another potential enhancement would involve the combined analysis of vimentin methylation with mutations of commonly mutated genes, such as KRAS, PIK3CA and TP53, which could in theory increase sensitivity without compromising specificity.
The above disclosure generally describes the present invention. All references disclosed herein are expressly incorporated by reference. A more complete understanding can be obtained by reference to the following specific examples which are provided herein for purposes of illustration only, and are not intended to limit the scope of the invention.
Sample collection and DNA purification. Plasma samples were collected by study nurses of Indivumed GmbH, Hamburg, Germany, from surgery patients treated in hospitals of Indivumeds' collaborative network, in particular from patients at the Israelitic Hospital and Clinic Alten Eichen (both in Hamburg, Germany) following strictly controlled SOP criteria. IRB approval was provided by the Ethical Board of the Physicians Association of Hamburg and patient samples and data were collected only after informed, written consent was obtained. The samples used in the current study were randomly chosen from those contributing through this protocol. Shortly before surgery, 18 ml of blood was taken from a central catheter and loaded into a standard blood collection tube containing EDTA. The tube was immediately chilled to 8° C. and transported to the laboratories within 30 min for plasma preparation. The blood cells were pelleted for 15 min at 200 g in a Leucosep tube (Greiner, Frickenhausen, Germany) filled with 15 ml of Ficoll-Paque solution. After centrifugation, the supernatant (i.e., plasma) was transferred into 1.5 ml tubes, immediately frozen, and stored at −80° C. The plasma samples were thawed at room temperature for 5 min, and any remaining debris was pelleted at 16,000 g for 5 min. The supernatant was transferred to a new tube. DNA was purified from the plasma supernatant using the QIAamp MinElute Virus Vacuum Kit (Qiagen) as recommended by the manufacturer. The DNA was eluted in EB buffer (Qiagen), and stored at −20° C.
Stool samples and clinical data were collected from patients undergoing colonoscopy at Aarhus University Hospital. The study was approved by the Regional Scientific Ethics Committee, Aarhus, Denmark and by the Danish Data Protection Agency and informed, written consent was obtained in every case. Stool samples were collected on a weekday before bowel preparation for colonoscopy or operation for colorectal cancer. The patients placed the container in an insulated box containing frozen bags which had been stored in a freezer at least 24 hours. The box was collected at the patient's home within eight hours. Immediately after the stool sample arrived at Aarhus University Hospital, the container was frozen at −80° C. until it was thawed for DNA purification. Stool samples were homogenized with a stool shaker (Exact Sciences Corp, Marlborough, Mass.). After homogenization, a 4 g stool equivalent of each sample was subjected to 2 centrifugations (5 min at 2536 g and 10 min at 16,500 g). The supernatant was incubated with 20 μL RNase (0.5 mg/mL) for one hour at 37° C., followed by precipitation with 1/10 volume of sodium acetate (3 M) and an equal volume of isopropanol. Stool DNA was dissolved in 4 mL of TE buffer and prepared for electrophoretic capture by mixing with 4 mL of water, 1 mL of 10×TBE (890 mM Tris base, 890 mM boric acid, 20 mM EDTA), and 1 mL of 10% SDS. DNA samples were denatured at 95° C. for 15 mM and then cooled on ice for 15 min. Vimentin fragments were then purified using a Reversible Electrophoretic Capture Affinity Protocol (RECAP)31. This was accomplished by repeatedly driving the DNA back and forth through the capture device loaded with oligonucleotide probes for the Vimentin gene target (capture probe sequences were 5′-Biotin-CGCCACCCTCCGCAGCCATGTCCACCAGGTCCG-3′ (SEQ ID NO: 4) and 5′-Biotin-GGCCATCGCCACCCTCCGCAGCCATGTCCACCAGG-3′(SEQ ID NO: 5)). After completion of electrophoretic cycling, the capture device was moved to a wash plate and washed four times with 500 μL of ST buffer (150 mM NaCl, 15 mM Tris-HCl, pH 7.4). The captured human DNA was eluted by adding 100 pit of 0.1 M NaOH to the capture device. The eluate was then neutralized with 10 μL of neutralization buffer (60 mM Tris pH 9.0, 6 mM EDTA, 0.5 M HCl).
Real-Time PCR. The amount of DNA per 2 ml plasma samples was quantified using a modified version of a human LINE-1 quantitative real-time PCR assay 39. Two primer sets were designed to amplify differently sized regions within the most abundant consensus region of the human LINE-1 family (for the 91-bp product, the primers were 5′-TGGCACATATACACCATGGAA-3′(SEQ ID NO: 6) and 5′-TGATGGTTTCCAATTTCATCC-3′(SEQ ID NO: 7); for the 69-bp product, the primers were: 5′-TGGCACATATACACCATGGAA-3′ (SEQ ID NO: 8) and 5′-TGTCCCTACAAAGGATATGAACTC-3′ (SEQ ID NO: 9)). PCR was performed in 25 reaction volumes consisting of template DNA equivalent to 254 of plasma, 0.5 U of Platinum Taq DNA Polymerase, 1×PCR buffer (see above), 6% (v/v) DMSO, 1 mM of each dNTP, 1:100,000 dilution of SYBR Green I (Invitrogen), and 0.2 μM of each primer. Amplification was carried out in an iCycler (Bio-Rad) using the following cycling conditions: 94° C. for 2 min; 3 cycles of 94° C. for 10 s, 67° C. for 15 s, 70° C. for 15 s; 3 cycles of 94° C. for 10 s, 64° C. for 15 s, 70° C. for 15 s; 3 cycles of 94° C. for 10 s, 61° C. for 15 s, 70° C. for 15 s; 35 cycles of 94° C. for 10 s, 58° C. for 15 s, 70° C. for 15 s. Various dilutions of normal human lymphocyte DNA were incorporated into each plate to serve as standards (0.006 to 1.56 ng). The threshold cycle number was determined using Bio-Rad analysis software. Each quantification for each LINE-1 product was performed in duplicate and the average value was used to determine the number of genome equivalents in the sample by comparison to the standards.
The amount of DNA per 4 g stool samples was quantified by real-time PCR assay using three sets of Vimentin specific primers (for the 81-bp product, the primers were 5′-GGGTGGACGTAGTCACGTAG-3 (SEQ ID NO: 10) and 5′-CCTCCTACCGCAGGATGTT-3′ (SEQ ID NO: 11); for the 91-bp product, the primers were 5′-CTGTAGGTGCGGGTGGAC-3′ (SEQ ID NO: 12) and 5′-CCTCCTACCGCAGGATGTT-3′ (SEQ ID NO: 13); for the 100-bp product, the primers were 5′-CCTCCTACCGCAGGATGTT-3′ (SEQ ID NO: 14) and 5′-CTGCCCAGGCTGTAGGTG-3′ (SEQ ID NO: 15)). PCR was performed in 25 μL reaction volumes consisting of template DNA equivalent to 0.016 g stool, 0.5 U of Platinum Taq DNA Polymerase, 1×PCR buffer, 6% (v/v) DMSO, 1 mM of each dNTP, 1:100,000 dilution of SYBR Green I (Invitrogen), and 0.2 μM of each primer. Amplification was carried out in an iCycler (Bio-Rad) using the following cycling conditions: 94° C. for 2 min; 3 cycles of 94° C. for 10 s, 67° C. for 15 s, 70° C. for 15 s; 3 cycles of 94° C. for 10 s, 64° C. for 15 s, 70° C. for 15 s; 3 cycles of 94° C. for 10 s, 61° C. for 15 s, 70° C. for 15 s; 35 cycles of 94° C. for 10 s, 59° C. for 15 s, 70° C. for 15 s. Various dilutions of normal human lymphocyte DNA were incorporated in each plate to serve as standards (0.39 to 25 ng). The threshold cycle number was determined using Bio-Rad analysis software. Each quantification for each Vimentin fragment was performed in duplicate and the average value was used to determine the number of genome equivalents in the sample by comparison to the standards.
Bisulfite conversion. DNA isolated from plasma and stool was treated with bisulfite using an Epitect Bisulfite Conversion kit (Qiagen), modified for isolating degraded DNA such as that found in plasma or stool. In brief, each bisulfite conversion reaction contained 18 μl purified DNA solution, 2 μl 0.5 g/ml single-stranded sonicated salmon sperm DNA (5S DNA), 85 μl bisulfite mix and 35 μl DNA protect buffer. Bisulfite conversion was carried out in a thermocycler at 99° C. for 5 min, 60° C. for 25 min, 99° C. for 5 min, 60° C. for 85 min, 99° C. for 5 min, and 60° C. for 175 min. The cleanup of bisulfited converted DNA followed the manufacturer's recommendations for formalin-fixed, paraffin-embedded (FFPE) DNA samples with the modification that MinElute Spin Columns (Qiagen) instead of Epitect spin columns were used for DNA disulfonation and purification. Bisulfite treated DNA was eluted into EB buffer (Qiagen) and stored at −20° C.
PCR amplification of bisulfite converted DNA. PCR was carried out by using a mixture of four primers that amplified both the methylated and the unmethylated Vimentin sequences (primers for methylated sequences were 5′-tcccgcgaaattaatacgacCTCGTCCTCCTACCGCAAAATATTC-3′ (SEQ ID NO: 16) and 5′-GTTGTTTAGGTTGTAGGTGCGGG-3′ (SEQ ID NO: 17); primers for unmethylated sequences were 5′-tcccgcgaaattaatacgacCTCATCCTCCTACCACAAAATATTC-3′ (SEQ ID NO: 18) and 5′-GTTGTTTAGGTTGTAGGTGTGGG-3′ (SEQ ID NO: 19)). PCR was performed in 50 μL reaction volumes consisting of 8 μl bisulfite-converted DNA, 2.5 U of AmpliTaq Gold DNA Polymerase (Applied Biosystems), 1×PCR Gold Buffer, 1 mM MgCl2, 0.2 mM dNTP mix, 0.1 uM of each primer. Amplification was carried out in a Thermo Hybrid thermocycler using the following conditions: 95° C. for 5 min; 3 cycles of 95° C. for 45 s, 67° C. for 45 s, 72° C. for 45 s; 3 cycles of 95° C. for 45 s, 64° C. for 45 s, 72° C. for 45 s; 3 cycles of 95° C. for 45 s, 61° C. for 45 s, 72° C. for 45 s; 45 cycles of 95° C. for 45 s, 58° C. for 45 s, 72° C. for 45 s; 1 cycle of 72° C. for 10 min. Note that these primers and PCR conditions result in a preferential amplification of methylated sequences40. This preferential amplification was confirmed using mixtures of fully methylated and fully unmethylated sequences (
BEAMing. Emulsion PCR was performed using modifications of the procedures described by Diehl et al.29. We first prepared a 150 μL PCR mixture containing 15 pg template DNA, 45 U of Platinum Taq DNA polymerase (Invitrogen), 1×PCR buffer (67 mM Tris-HCl (pH 8.8), 16.6 mM (NH4)2SO4, 10 mM β-mercaptoethanol, 1.17 mM MgCl2), 0.2 mM dNTPs, 5 mM MgCl2, 0.05 μM universal forward primer (5′-tcccgcgaaattaatacgac-3′ (SEQ ID NO: 20)), 8 μM Vimentin specific reverse primers (5′-GTTGTTTAGGTTGTAGGTGCGGG-3′ (SEQ ID NO: 21) and 5′-GTTGTTTAGGTTGTAGGTGTGGG-3′ (SEQ ID NO: 22)) and 6×107 magnetic streptavidin beads (MyOne, Invitrogen) coated with universal forward primer (5′-dual biotin-T-Spacer18-tcccgcgaaattaatacgac-3′ (SEQ ID NO: 23)). This PCR reaction mixture was combined with 600 μL of an oil/emulsifier mix (7% ABIL WE09, 20% mineral oil, 73% Tegosoft DEC; Degussa Goldschmidt Chemical, Hopewell, Va.) and one 5 mm steel bead (Qiagen) in one well of a 96 deep-well plate (Abgene). Emulsions were prepared by shaking the plate in a TissueLyser (Qiagen) for 10 s at 15 Hz and 7 s at 17 Hz and dispensed into eight wells of a 96 well PCR plate. Temperature cycling was performed at 94° C. for 2 min; 3 cycles of 94° C. for 10 s, 68° C. for 45 s, 70° C. for 75 s; 3 cycles of 94° C. for 10 s, 65° C. for 45 s, 70° C. for 75 s; 3 cycles of 94° C. for 10 s, 62° C. for 45 s, 70° C. for 75 s; 50 cycles of 94° C. for 10 s, 59° C. for 45 s, 70° C. for 75 s.
To break the emulsions, 150 μL breaking buffer (10 mM Tris-HCl, pH 7.5, 1% Triton-X 100, 1% SDS, 100 mM NaCl, 1 mM EDTA) was added to each well and mixed in a TissueLyser at 20 Hz for 20 s. Beads were recovered by centrifuging the suspension at 3200 g for 2 min and removing the oil phase. The breaking step was repeated twice. All beads from eight PCR wells were combined and resuspended in 100 μL wash buffer (20 mM Tris-HCl, pH 8.4, 50 mM KCl). The DNA on the beads was denatured for 5 min with 0.1 M NaOH. Finally, beads were washed with 100 μL wash buffer and resuspended in 100 μL of wash buffer.
Probe hybridization. To determine the fraction of beads containing PCR products from methylated DNA templates, 0.5 uM Cy5-labeled oligonucleotide (5′-Cy5-TCGGTCGGTTCGCGGTGTTCGA-3′ (SEQ ID NO: 24), complimentary to the bisulfite converted methylated sequence), and 0.5 uM FITC labeled oligonucleotide (5′-FAM-TTGGTTGGTTTGTGGTGTTTGA-3′ (SEQ ID NO: 25), complimentary to the bisulfite converted unmethylated sequence) were hybridized to the DNA on ˜107 beads in 100 μl hybridization buffer (30 mM Tris-HCl, pH 9.5, 13.4 mM MgCl2, 5% formamide) at 50° C. for 15 minutes. Beads were then collected with a magnet and resuspended in 200 μl 10 mM Tris, pH 7.5, 1 mM EDTA, pH 7.5.
Flow Cytometry. Beads were analyzed with a LSR II flow cytometer and data were analyzed with FACSDiva software (BD Biosciences). The flow rate was typically set at 5,000-10,000 events per second. Events were gated to exclude doublets and aggregates.
Massively parallel sequencing-by-synthesis. DNA isolated from normal lymphocytes (control) and stool was treated with bisulfite as described above. The converted DNA was PCR amplified using the same protocol as used for Methyl-BEAMing. The PCR product was purified with a QIAquick PCR purification column (Qiagen) and used to construct libraries for Solexa sequencing following Illumina's standard DNA sample preparation instructions. Briefly, (1) the PCR products were blunt-ended with T4 DNA polymerase, Klenow polymerase, and T4 polynucleotide kinase; (2) a dA was added to the 3′ end of each strand with Klenow (exo-) polymerase; (3) adapters designed for library construction were ligated to the PCR fragments; (4) ligation products were gel-purified to select for ˜180 bp fragments; and (5) PCR amplification was performed to enrich ligated fragments. The amplified library was quantified by real-time PCR and was denatured with 0.1 M NaOH to generate single-stranded DNA molecules, captured on Illumina flow cells, and amplified in situ. A Vimentin-specific oligonucleotide (5-GTGC/TGGGTGGAC/TGTAGTTAC/TGTAGT TTC/TGGTTGGA-3 (SEQ ID NO: 26), C/T indicates that the position was an equimolar mix of cytosine and thymidine) was used to prime sequencing for 36 cycles on an Illumina Genome Analyzer. Data analysis: The sequences generated were aligned to the bisulfite converted unmethylated Vimentin sequence. The following criterions were used to filter the tags for further analysis. Each tag was required (i) to pass the Illumina chastity filter; (ii) to have an average Phred sequence quality score above 20; and (iii) to have perfect match to the reference sequence at A, G, and T positions (allowing mismatched at C positions). At least 80,000 tags that met these criteria were evaluated in each case.
The first hurdle to overcome in developing Methyl-BEAMing was bisulfite conversion. DNA in clinical samples such as plasma is already degraded to small size by circulating nucleases and is present at only a few nanograms per mililiter22. Conventional methods for bisulfite conversion further degrade DNA and are difficult to implement with samples containing only small amounts of DNA because of cumulative losses during the procedure23, 24. After much optimization, we identified conditions that resulted in nearly complete (Mean+SEM: 99.4%+0.4%) conversion of dC to dU residues while preserving the majority of the initial DNA in a fashion permitting subsequent PCR amplification (see Experimental Methods for details).
The exon 1 region of the vimentin gene has been shown to be hypermethylated in colorectal cancers when compared to normal colorectal mucosae and other normal tissues25, 26. Moreover, this difference is the basis for the only commercially available diagnostic test based on DNA methylation (ColoSure, LabCorp). We therefore chose this region of vimentin to assess the dynamic range and accuracy of Methyl-BEAMing. The region of vimentin queried contains 5′-CpG sites that have been shown to be methylated in cancer cells. Primers surrounding this region were designed to amplify it, whether it was methylated or unmethylated, following bisulfite conversion. The PCR products were only ˜100 bp in length so as to accommodate the small size of circulating DNA molecules22, 27. These amplicons were then used for BEAMing (Beads, Emulsion, Amplification and Magnetics)28, 29.
BEAMing employs aqueous nanocompartments suspended in a continuous oil phase (
As a prelude to the experiments reported below, we tested Methyl-BEAMing on mixtures of templates representing DNA from peripheral blood lymphocytes (unmethylated vimentin) and a colorectal cancer cell line (fully methylated vimentin). We found that the fraction of beads containing methylated vimentin sequences was directly proportional to the fraction of methylated input DNA when the latter represented 0.1% to 100% of the input DNA (R2=1.00,
Some of the most important uses of cancer biomarkers involve the assessment of circulating molecules, either for early detection or disease-monitoring following therapy. To use methyl-BEAMing for such purposes, or indeed to use any test based on DNA methylation, it is assumed that the methylated molecules in the circulation emanate from cancer cells. To explicitly test this assumption, we compared the fraction of methylated vimentin molecules with the fraction of mutant molecules (APC G4189T and PIK3CA G1624A) in the plasma of two colorectal cancer patients. In these patients, the mutations were somatic and therefore exclusively present in the patient's neoplastic cells. Methyl-BEAMing was used to assess methylation and standard BEAMing was used to assess mutations30. The fraction of methylated vimentin templates in the first sample was 13.6%, quite similar to the fraction of mutated APC (17.5%). In the second patient's plasma, methylated vimentin represented 3.5% of the total vimentin templates while mutated PIK3CA represented 3.0% of the PIK3CA templates.
To determine the sensitivity and specificity of Methyl-BEAMing in plasma samples, we evaluated 191 samples, 81 from patients with colorectal cancer and 110 from age- and sex-matched controls. The total amount of DNA in the plasma was somewhat higher in patients with cancer than in the cancer-free controls, as noted previously 12 (
Next, we used Vimentin Methyl-BEAMing to screen Rn stool samples, including 38 from normal individuals, 20 from patients with>1 centimeter adenomas, and 22 from colorectal cancer patients of various stages. In each case, we homogenized 4 g of stool and captured vimentin fragments through electrophoresis-driven hybridization to oligonucleotides on a solid phase matrix 31. The concentration of vimentin DNA fragments in stool varied widely, as expected for samples from individuals with various diets and bowel habits (
To further validate the accuracy of Methyl-BEAMing for the quantification of methylated vimetin fragments in clinical samples, we evaluated samples with low and high fractions of methylated fragments on the Illumina Genome Analyzer sequencing instrument. Following bisulfite conversion and PCR amplification (
The disclosure of each reference cited is expressly incorporated herein.
This invention was made using funding from the National Institutes of Health. The U.S. government retains certain rights in the invention according to the terms of CA 43460, CA 57345, CA 62924, and CA 121113.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2010/030084 | 4/6/2010 | WO | 00 | 12/7/2011 |
Number | Date | Country | |
---|---|---|---|
61166970 | Apr 2009 | US |