It is important to develop new methods to determine methylation status and to monitor changes in methylation status.
The systems and methods as described herein may provide a substantially unbiased approach in detecting an epigenetic modification. This method may be an improvement in the field of detecting or monitoring methylation status particularly when compared to systems and methods that amplify sequences having a label or a moiety associated with an epigenetic modification.
All publications, patents, and patent applications herein are incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede or take precedence over any such contradictory material.
The novel features herein are set forth with particularity in the appended claims. A better understanding of the features and advantages herein will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles herein are utilized, and the accompanying drawings (also “figure” and “FIG.” herein), of which:
While various embodiments have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It should be understood that various alternatives to the embodiments herein may be employed.
A method as described herein may comprise associating a label with an epigenetically modified base of a nucleic acid sequence to form a labeled nucleic acid sequence; hybridizing a substantially complementary strand to the labeled nucleic acid sequence; and amplifying the substantially complementary strand in a reaction in which the labeled nucleic acid sequence is substantially not present. One or more individual elements of the method need not be performed in a particular order. For example, associating a label may occur after the hybridizing. One or more individual elements of a given method may be performed in a different order than described herein.
In this example of
In
In
In
In
In
In this example of
In
In
In
In
In
In this example of
In
In
In some cases, hybridizing may comprise (i) priming (such as random priming), (ii) ligation (such as adapter ligation), or (iii) a combination thereof. For example, in
In
In
In
The methods and systems as described herein may provide a result that may be far more representative of an extent to which a nucleic acid may be marked epigenetically. In some cases, the methods and systems may be superior to other methods of identification of epigenetic modifications. Other methods of identification may include the HMCP method or a method that comprises associating a sugar, a protein, an antibody, or a fragment of any of these with an epigenetic modification and detecting a presence of the sugar, the protein, the antibody, or fragment thereof. In some cases, nucleic acid sequences, such as fragments containing a high density of epigenetic modifications may not be detected using other methods of identification of epigenetic modifications. The unbiased approach of the present methods and systems provides for detection of high density epigenetic modifications of nucleic acid sequences, such as short fragments yielding an unbias detection.
In
As shown in
As shown in
In this example, both strands of double stranded DNA (dsDNA) fragments containing 5-hmC may be labeled using beta-glucosyltransferase (βGT) and UDP-6-azide-glucose (UDP-N3-glc). This step may be dsDNA selective (βGT may not work on single stranded DNA (ssDNA)). Position of label may be determined by the presence/absence of 5-hmC in the dsDNA parent fragment. A label may be azido-glucose, transferred to the 5-hmC from UDP-N3-glc by βGT. The labeling may be performed directly on the purified circulating tumor DNA (ctDNA) extract. Advantage of this may be that the ctDNA may not have been through a series of library prep steps ahead of labeling. So there may be likely more material at the labeling (improved efficiency) and may present a more representative sample to a labeling than may be the case post NGS prep.
In some cases, hybridizing may comprise (i) priming (such as loci specific priming), (ii) ligation (such as adapter ligation), or (iii) a combination thereof. For example, in
In
In
As shown in
The HMCP method may be referred to herein as the ‘standard’ method. The HMCP method may be referred to herein as HMCP, HMCP-v1, HMCPv1, HMCP, v1HMCP, v1 HMCP, or V1. The CLE method may be referred to herein as HMCP_CLE, HMCP-v2, HMCPv2, CLE-HMCP, v2HMCP, v2 HMCP, or V2.
For any of the methods described herein, including CLE, HMCP_LCE, HMCP_CLE, HMCP_LRE, HMCP_RLE, HMCP_LLSE, HMCP_LSLE, one or more individual elements of a given method may be performed in the order as described herein. In some cases, one or more individual elements of a given method need not be performed in a particular order described herein. In some cases, one or more individual elements of a given method may be performed in a different order than described herein.
In some cases, the complementary strand may be a substantially complementary strand or may comprise a portion that may be substantially complementary to a portion of a nucleic acid sequence.
Hybridizing may comprise hybridizing at least two complementary strands to at least two portions of a nucleic acid sequence. Hybridizing may comprise hybridizing at least a portion of a complementary strand to an adapter sequence of the nucleic acid sequence. Hybridizing may comprise extension, such as cDNA extension. Hybridizing may comprise priming, such as loci specific priming or random priming. Hybridizing may comprise ligation, such as adapter ligation. Hybridizing may comprise hybridizing a primer to a nucleic acid sequence and elongating from the primer to form a complementary strand. Hybridizing may comprise obtaining a complementary strand and hybridizing the complementary strand to the nucleic acid sequence.
A label may be associated with an epigenetically modified base of a nucleic acid sequence. A label may be associated with an epigenetically modified base before hybridizing. A label may be associated with an epigenetically modified base after hybridizing.
The method may comprise amplifying the complementary strand in a reaction in which the nucleic acid sequence may be substantially not present. The amplifying may comprise associating the nucleic acid sequence and complementary strand with a substrate, such as by a label. The amplifying may comprise washing a substrate that may be associated with the nucleic acid sequence and complementary strand, such as stringent washing. The amplifying may comprise eluting a complementary strand from the substrate on which the nucleic acid sequence remains. The amplifying may comprise amplifying the complementary strand.
An epigenetic modification may comprise a DNA methylation. A DNA methylation may comprise a hyper-methylation or a hypo-methylation. A DNA methylation may comprise a modification of a DNA base, such as a 5-methylcytosine (5-mC), a 4-methylcytosine, a 6-methyladenine, or a combination thereof.
As used herein, the singular forms “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise. Any reference to “or” herein may be intended to encompass “and/or” unless otherwise stated.
As used herein, the term “about” may mean the referenced numeric indication plus or minus 15% of that referenced numeric indication.
The term “fragment,” as used herein, may be a portion of a sequence, a subset that may be shorter than a full length sequence. A fragment may be a portion of a gene. A fragment may be a portion of a peptide or protein. A fragment may be a portion of an amino acid sequence. A fragment may be a portion of an oligonucleotide sequence. A fragment may be less than about: 20, 30, 40, 50 amino acids in length. A fragment may be less than about: 20, 30, 40, 50 oligonucleotides in length.
The term “homology,” as used herein, may be to calculations of “homology” or “percent homology” between two or more nucleotide or amino acid sequences that can be determined by aligning the sequences for optimal comparison purposes (e.g., gaps can be introduced in the sequence of a first sequence). The nucleotides at corresponding positions may then be compared, and the percent identity between the two sequences may be a function of the number of identical positions shared by the sequences (i.e., % homology=(# of identical positions/total # of positions)×100). For example, a position in the first sequence may be occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent homology between the two sequences may be a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. In some embodiments, the length of a sequence aligned for comparison purposes may be at least about: 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 95%, of the length of the reference sequence. A BLAST® search may determine homology between two sequences. The two sequences can be genes, nucleotides sequences, protein sequences, peptide sequences, amino acid sequences, or fragments thereof. The actual comparison of the two sequences can be accomplished by well-known methods, for example, using a mathematical algorithm. A non-limiting example of such a mathematical algorithm may be described in Karlin, S. and Altschul, S., Proc. Natl. Acad. Sci. USA, 90- 5873-5877 (1993). Such an algorithm may be incorporated into the NBLAST and XBLAST programs (version 2.0), as described in Altschul, S. et al., Nucleic Acids Res., 25:3389-3402 (1997). When utilizing BLAST and Gapped BLAST programs, any relevant parameters of the respective programs (e.g., NBLAST) can be used. For example, parameters for sequence comparison can be set at score=100, word length=12, or can be varied (e.g. , W=5 or W=20). Other examples include the algorithm of Myers and Miller, CABIOS (1989), ADVANCE, ADAM, BLAT, and FASTA. In another embodiment, the percent identity between two amino acid sequences can be accomplished using, for example, the GAP program in the GCG software package (Accelrys, Cambridge, UK).
The term “epigenetic modification” as used herein, may be any covalent modification of a nucleic acid base. In some cases, a covalent modification may comprise (i) adding a methyl group, a hydroxymethyl group, a carbon atom, an oxygen atom, or any combination thereof to one or more bases of a nucleic acid sequence, (ii) changing an oxidation state of a molecule associated with a nucleic acid sequence, such as an oxygen atom, or (iii) a combination thereof. A covalent modification may occur at any base, such as a cytosine, a thymine, a uracil, an adenine, a guanine, or any combination thereof. In some cases, an epigenetic modification may comprise an oxidation or a reduction. A nucleic acid sequence may comprise one or more epigenetically modified bases. An epigenetically modified base may comprise any base, such as a cytosine, a uracil, a thymine, adenine, or a guanine. An epigenetically modified base may comprise a methylated base, a hydroxymethylated base, a formylated base, or a carboxylic acid containing base or a salt thereof. An epigenetically modified base may comprise a 5-methylated base, such as a 5-methylated cytosine (5-mC). An epigenetically modified base may comprise a 5-hydroxymethylated base, such as a 5-hydroxymethylated cytosine (5-hmC). An epigenetically modified base may comprise a 5-formylated base, such as a 5-formylated cytosine (5-fC). An epigenetically modified base may comprise a 5-carboxylated base or a salt thereof, such as a 5-carboxylated cytosine (5-caC). In some cases, an epigenetically modified base may comprise a methyltransferase-directed transfer of an antivated group (mTAG).
An epigenetically modified base may comprise one or more bases or a purine (such as Structure 1) or one or more bases of a pyrimidine (such as Structure 2). An epigenetic modification may occur one or more of any positions. For example, an epigenetic modification may occur at one or more positions of a purine, including positions 1, 2, 3, 4, 5, 6, 7, 8, 9, as shown in Structure 1. In some cases, an epigenetic modification may occur at one or more positions of a pyrimidine, including positions 1, 2, 3, 4, 5, 6, as shown in Structure 2.
A nucleic acid sequence may comprise an epigenetically modified base. A nucleic acid sequence may comprise a plurality of epigenetically modified bases. A nucleic acid sequence may comprise an epigenetically modified base positioned within a CG site, a CpG island, or a combination thereof. A nucleic acid sequence may comprise different epigenetically modified bases, such as a methylated base, a hydroxymethylated base, a formylated base, a carboxylic acid containing base or a salt thereof, a plurality of any of these, or any combination thereof.
The term “sugar” as used herein, may be a sugar. A sugar may comprise a glucose, a fructose, a galactose, or a combination thereof. A sugar may comprise a disaccharide such as a sucrose, a maltose, a lactose, or any combination thereof. A sugar may comprise a monosaccharide, an oligosaccharide, or a polysaccharide. A sugar may comprise a modified sugar. A sugar may be modified such that the modified sugar may be configured to associate with an epigenetically modified base, such as a 5-methylated cytosine or a 5-hydroxymethylated cytosine. A sugar may comprise a modified glucose. A sugar may comprise a glucose, a glucose derivative, a gentibiose molecule, or any combination thereof. A sugar may comprise a uridine diphosphate glucose. A sugar may be modified with a detectable moiety, such as a radioactive moiety, a fluorescent moiety, a phosphorescent moiety, a chemiluminescent moiety, or any combination thereof. A sugar may be associated with a group for click chemistry. A sugar may be associated with an azido group, such as an N3 group. A sugar may be associated with an epigenetically modified base by employing a click chemistry reaction.
The term “barcode” as used herein may relate to a natural or synthetic nucleic acid sequence comprised by a polynucleotide allowing for unambiguous identification of the polynucleotide and other sequences comprised by the polynucleotide having said barcode sequence. The number of different barcode sequences theoretically possible can be directly dependent on the length of the barcode sequence; e.g., if a DNA barcode with randomly assembled adenine, thymidine, guanosine and cytidine nucleotides can be used, the theoretical maximal number of barcode sequences possible can be 1,048,576 for a length of ten nucleotides, and can be 1,073,741,824 for a length of fifteen nucleotides. Unique sample identifiers or barcodes can be completely scrambled (e.g., randomers of A, C, G, and T for DNA or A, C, G, and U for RNA) or they can have some regions of shared sequence. For example, a shared region on each end may reduce sequence biases in ligation events. In some cases, a shared region can be about or at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 common base pairs. In some cases, a shared region can be up to about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 common base pairs. Combinations of barcodes can be added to increase diversity.
A barcode may uniquely identify a subject, a sample (such as a cell-free sample), a nucleic acid sequence (such as a sequence having one or more epigenetically modified bases), or any combination thereof. A barcode may be associated with a nucleic acid sequence or a complementary strand. A nucleic acid sequence may comprise a single barcode. A nucleic acid sequence may comprise one or more barcodes, such as a first barcode and a second barcode. In some cases, the first barcode is different from the second barcode. In some cases, each barcode of a plurality of barcodes may be a unique barcode. In some cases, a barcode may comprise a sample identification barcode. For example, a first barcode may comprise a unique barcode and a second barcode may comprise a sample identification barcode.
The term “adapter” as used herein may be a nucleic acid with known or unknown sequence. An adapter may be attached to the 3′ end, 5′ end, or both ends of a nucleic acid (e.g. target nucleic acid). An adapter may comprise known sequences and/or unknown sequences. An adapter may be double-stranded or single-stranded. In some cases, an adapter can comprise a barcode (e.g. unique identifier sequence). In some cases, an adapter can be an amplification adapter. An amplification adapter may attach to a target nucleic acid and help the amplification of the target nucleic acid. For example, an amplification adapter may comprise one or more of: a primer binding site, a unique identifier sequence, a non-unique identifier sequence, and a sequence for immobilizing the target nucleic acid on a substrate. A target nucleic acid attached with an amplification adapter may be immobilized on a substrate. An amplification primer may hybridize to the adapter and be extended using the target nucleic acid as a template in an amplification reaction. In some cases, the unique identifiers in an adapter can be used to label the amplicons. In some cases, an adapter can be a sequencing adapter. A sequencing adapter may attach to a target nucleic acid and help the sequencing of the target nucleic acid. For example, a sequencing adapter may comprise one or more of: a sequencing primer binding site, a unique identifier sequence, a non-unique identifier sequence, and a sequence for immobilizing target nucleic acid on a substrate. A target nucleic acid attached with a sequencing adapter may be immobilized on a substrate on a sequencer. A sequencing primer may hybridize to the adapter and be extended using the target nucleic acid as a template in a sequencing reaction. In some cases, the unique identifiers in an adapter can be used to label the sequence reads of different target sequences, thus allowing high-throughput sequencing of a plurality of target nucleic acids. In some examples, an adapter sequence (such as a double-stranded or single-stranded oligonucleotide) may be ligated to one or both ends of a nucleic acids sequence. A nucleic acid sequence may comprise one or more epigenetically modified bases. A nucleic acid sequence may be from a sample, such as a cell free DNA sample. A nucleic acid sequence may be from a sample obtained from a subject. A nucleic acid sequence may comprise a double-stranded portion, a single-stranded portion, or a combination thereof. In some cases, an adapter may recognize or may be complementary to a primer, such as a universal primer. In some cases, an adapter may be specific to a sequencing method. In some cases, an adapter may be associated with a nucleic acid sequence or a complementary strand.
The term “nucleic acid sequence” as used herein may comprise DNA or RNA. In some cases, a nucleic acid sequence may comprise a plurality of nucleotides. In some cases, a nucleic acid sequence may comprise an artificial nucleic acid analogue. In some cases, a nucleic acid sequence comprising DNA, may comprise cell-free DNA, cDNA, fetal DNA, or maternal DNA. In some cases, a nucleic acid sequence may comprise miRNA, shRNA, or siRNA.
The term “substantially complementary strand” as used herein, may comprise from about 70%-100% bases that base pair with bases of a nucleic acid sequence. This percentage of base pairing may be measured by UV absorption of the nucleic acid sequence. In some cases, a substantially complementary strand may be hybridized to at least a portion of a nucleic acid sequence under stringent hybridization conditions.
The term “substantially free of an epigenetically modified base” as used herein, may comprise a complementary strand having no epigenetically modified base, or a complementary strand having from about 0.000001% to about 5% of a plurality of epigenetically modified bases of a nucleic acid sequence.
In some cases, a substantially complementary strand may be substantially free of a covalent modification. In some cases, a substantially complementary strand may be substantially free of (i) a methyl group, a hydroxymethyl group, a carbon atom, an oxygen atom, or any combination thereof, (ii) a change an oxidation state of a molecule associated with the substantially complementary strand or (iii) a combination thereof.
In some cases, a substantially complementary strand may be substantially free of an epigenetically modified base. In some cases, a substantially complementary strand may be free of an epigenetically modified base. In some cases, a substantially complementary strand may be amplified. An amplified product of the substantially complementary strand may comprise a plurality of epigenetically modified bases. In some cases, less than about: 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, 5%, 4%, 3%, 2%, 1%, 0.01%, 0.001%, 0.0001%, 0.00001%, or 0.000001% of the amplified product comprises an epigenetically modified base. In some cases, a percentage may be by weight. In some cases, a percentage may be by a number of bases. In some cases, less than about 5% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, less than about 4% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, less than about 3% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, less than about 2% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, less than about 1% of an amplified product of a substantially complementary strand comprises an epigenetically modified base.
In some cases, from about 0.000001% to about 10% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 5% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 4% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 1% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 0.01% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 0.001% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 0.000001% to about 0.0001% of an amplified product of a substantially complementary strand comprises an epigenetically modified base. In some cases, from about 1% to about 10% of an amplified product of a substantially complementary strand comprises an epigenetically modified base.
A nucleic acid sequence may comprise a plurality of epigenetically modified bases. In some cases, a strand that may be substantially complementary to at least a portion of the nucleic acid may comprise less than about: 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, 5%, 4%, 3%, 2%, 1%, 0.01%, 0.001%, 0.0001%, 0.00001%, or 0.000001% of the plurality of epigenetically modified bases of the nucleic acid sequence. In some cases, a percentage may be by weight. In some cases, a percentage may be by a number of bases. In some cases, less than about 5% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, less than about 4% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, less than about 3% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, less than about 2% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, less than about 1% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand.
In some cases, about 0% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 10% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 5% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 4% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 1% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 0.1% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 0.01% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 0.000001% to about 0.001% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand. In some cases, from about 1% to about 10% of the plurality of epigenetically modified bases of a nucleic acid sequence may be present in a substantially complementary strand.
In some cases, a substantially complementary strand may comprise an epigenetically modified base that may be different from an epigenetically modified base of a nucleic acid sequence.
The term “label” as used herein, may be a component that may be (a) associated with a substrate, (b) associated with an epigenetically modified base, or (c) a combination thereof. A label may be associated with an epigenetically modified base by a single bond, a double bond, a triple bond, a metal-associated bond, or an ion pairing. A label may comprise a magnetic metal, such as iron, nickel, cobalt, aluminum, or any combination thereof. A label may be associated with an epigenetically modified base by the assistance of an enzyme. A label may be associated with a substrate via (a) a biotin-streptavidin association, (b) a magnetic association, (c) an antibody-antigen association, or (d) any combination thereof. A label may be selectively for a portion of a nucleic acid sequence. A label may selectively associate with a double-stranded portion of a nucleic acid sequence as compared to single-stranded portion. A label may selectively associate with portions of a nucleic acid sequence having an epigenetically modified base as compared to portions having a non-modified base. A label may selectively associate with a type of epigenetically modified base, such as selectively associating with a 5-hydroxymethylated cytosine (5-hmC) as compared to a 5-methylated cytosine (5-mC). A label may comprise a sugar, such as a glucose. A glucose may comprise a modified glucose. A label may comprise more than one sugar, such as two sugars or more. A label may comprise a modified sugar, such as a modified glucose. A label may comprise a uridine diphosphate glucose (UDPG). A label may comprise a detectable label such as a radioactive label, a fluorescent label, a chemiluminescent label, a phosphorescent label, an infrared label, a visible label, a chemically reactive label (such as an azide-based label), or any combination thereof. In some cases, a label may be a label which results from incorporating a chromophore via a reaction with a radioactive label. A label may comprise a protein, peptide, or polypeptide. In some cases, a label may comprise an antibody or portion thereof. A label may comprise a tag, such as a FLAG-tag. A label may comprise a biotin or an avidin, such as streptavidin. A label may comprise a nucleic acid sequence. A label may comprise a substrate. In some cases, a different label may be employed to uniquely label different epigenetic modifications. For example, a first label may bind a methylated base and a second label may bind a hydroxymethylated base.
In some cases, a tag may comprise a giutathione-S-transferase (GST), a maltose binding protein (MBP), a green fluorescent protein (GFP). an AviTag, a Calmodulin tag, a polyglutamate tag, a FLAG tag, an human influenza hemagglutinin (HA) tag, a polyhistidine (His) tag, a Myc-tag, an S-tag, an streptavidin-binding peptide (SBP) tag, a Softag 1, a Strep tag, a TC tag, a V5 tag, an Xpress tag, an Isopeptag, a SpyTag, a biotin carboxyl carrier protein (BCCP) tag, a chitin binding protein (CBP) tag, a HaloTag, a thioredoxin tag, a T7 tag, a protein kinase A (PKA) tag, a c-Myc tag, a Trx tag, a Hsv tag, a CBD tag, a Dsb tag, a pelB/ompT, a KSI, a VSV-G tag, a 3-Gal tag, or any combination thereof. A tag may be a fusion tag, a covalent peptide tag, a protein tag, a peptide tag, an affinity tag, an epitope tag, a solubilization tag, or any combination thereof. A tag may comprise a recombinant protein. A tag may associate with a protein or protein fragment. A FLAG-tag may comprise a sequence or a portion thereof comprising DYKDDDDK, where D may be aspartic acid, Y may be tyrosine, and K may be lysine.
A label may be associated reversibly with a substrate. A label may be associated irreversibly with a substrate. A label may be reversibly associated with an epigenetically modified base. A label may be irreversibly associated with an epigenetically modified base. A label may be associated by binding to a substrate, an epigenetically modified base, or a combination thereof. A label may be bound by a single bond, a double bond, or a triple bond to a substrate. A label may be bound by a single bond, a double bond, or a triple bond to an epigenetically modified base.
The term “click-chemistry” as used herein may comprise a reaction having at least one of the following: (a) high yielding, (b) wide in scope, (c) create only byproducts that may be removed in the absence of chromatography, (d) stereospecific, (e) simple to perform, (f) conducted in easily removable or benign solvents. In some cases, click-chemistry comprises tagging, such as tagging a nucleic acid sequence or a complementary strand. In some cases, click-chemistry may associate a nucleic acid sequence with a label. Click-chemistry may comprise a reaction having a [3+2] cycloaddition; a thiol-ene reaction; a Diels-Alder reaction, an inverse electron demand Diels-Alder reaction; a [4+1] cycloaddition; a nucleophilic substitution; a carbonyl-chemistry-like formation of urea; an addition to a carbon-carbon double bond; or any combination thereof. In some cases, a [3+2] cycloaddition may comprise a Huisgen 1,3-dipolar cycloaddition. In some cases, a [4+1] cycloaddition may comprise a cycloaddition between an isonitrile and a tetrazine. Click-chemistry may comprise a copper(I)-catalyzed azide-alkyne cycloaddition (CuAAC); a strain-promoted azide-alkyne cycloaddition (SPAAC); a strain-promoted alkyne-nitrone cycloaddition (SPANC); or any combination thereof.
The term “moiety” as used herein, may be a component that may aid in or catalyze a reaction. In some cases, a moiety may comprise an enzyme or a catalytically active fragment thereof.
In some cases, a moiety may comprise an antibody or fragment thereof. In some cases, a moiety may comprise a protein, a peptide, or polypeptide. In some cases, a moiety may comprise a cofactor such as a coenzyme. In some cases, a moiety may comprise an enzyme, a protein or portion thereof, an antibody or portion thereof, a cofactor or any combination thereof. In some cases, a moiety, such as an enzyme, may aid in an association of a label with an epigenetically modified base. A moiety, such as an enzyme, may selectively associate a label with an epigenetically modified base present on a double-stranded oligonucleotide fragment as compared with an epigenetically modified base present on a single-stranded oligonucleotide fragment. A moiety, such as an enzyme, may selectively associate a label with an epigenetically modified base present on a single-stranded oligonucleotide fragment as compared with an epigenetically modified base present on a double-stranded oligonucleotide fragment. An enzyme may comprise a transferase. An enzyme may comprise a glucosyltransferase. An enzyme may comprise (a) an alpha-glucosyltransferase, (b) a beta-glucosyltransferase, (c) a beta-glucosyl-alpha-glucosyl-transferase, (d) J-glucosyltransferase, or (e) any combination thereof. A moiety, such as an enzyme, may comprise a modified moiety such as a genetically mutated moiety. A modified moiety may be modified to enhance an association of a label with an epigenetically modified base. A modified moiety may be modified to selectively aid in a) an association of a specific label with an epigenetically modified base, b) an association of a label with a specific epigenetically modified base, or c) a combination thereof.
In some cases, a moiety may catalyze a transfer of a methyl group to one or more bases of a nucleic acid sequence, a complementary strand, or a combination thereof. In some cases, a moiety may comprise a methyltransferase. In some cases, an enzyme may comprise a DNA methyltransferase 1 (DNMT1), a DNA methyltransferase 3-like (DNMT3L), a DNMT3A, a DNMT3B, a tRNA aspartic acid methyltransferase (TRDMT1), a DNMT3, any catalytically active fragment thereof, or any combination thereof.
In some cases, a moiety may catalyze a change in an epigenetic modification, such as a conversion of a methylated base to a hydroxymethylated base. In some cases, an enzyme may comprise a dioxygenase. In some cases, an enzyme may comprise a ten-eleven translocation (TET) family enzyme. In some cases, an enzyme may comprise TET1, TET2, TET3, CXXC finger protein 4 (CXXC4), any catalytically active fragment thereof, or any combination thereof.
In some cases, a moiety may catalyze an oxidative reaction, such as an oxidative decarboxylation. In some cases, an enzyme may comprise an isocitrate dehydrogenase (IDH) family enzyme. In some cases, an enzyme may comprise isocitrate dehydrogenase [NAD] subunit alpha (IDH3A), isocitrate dehydrogenase [NAD] subunit beta (IDH3B), isocitrate dehydrogenase [NAD] subunit gamma (IDH3G), isocitrate dehydrogenase 1 (IDH1), isocitrate dehydrogenase 2 (IDH2), any catalytically active fragment thereof, or any combination thereof.
A base of a nucleic acid sequence or a complementary strand may be deaminated, spontaneously or by contacting a moiety to a portion of a nucleic acid sequence. For example, a base, may be deaminated. In some cases, a base, a methylated base, a hydroxymethylated base, a formylated base, a carboxylated base, or any combination thereof may be deaminated. In some cases, a methylated cytosine may be deaminated. Deamination may occur selectively to a single base or to any combination of bases. Deamination may occur spontaneously. Deamination may occur by contacting a moiety to a portion of a nucleic acid sequence. A moiety may include an enzyme such as a deaminase, such as an adenosine deaminase, a guanine deaminase, or a cytidine deaminase. A deaminase may comprise activation-induced cytidine deaminase (AID), a conserved cytidine deaminase (CDA), apolipoprotein B mRNA editing enzyme catalytic polypeptide I (APOBEC1), apolipoprotein B mRNA-editing enzyme catalytic polypetide-like 3H (APOBEC3A-H), apolipoprotein B mRNA editing enzyme catalytic polypeptide-like 3G (APOBEC3G), or others. Bisulfite sequencing may deaminate one or more bases of a nucleic acid sequence or a complementary strand.
The term “sequencing” as used herein, may comprise bisulfite-free sequencing, bisulfite sequencing, TET-assisted bisulfite (TAB) sequencing, ACE-sequencing, high-throughput sequencing, Maxam-Gilbert sequencing, massively parallel signature sequencing, Polony sequencing, 454 pyrosequencing, Sanger sequencing, Illumina sequencing, SOLiD sequencing, Ion Torrent semiconductor sequencing, DNA nanoball sequencing, Heliscope single molecule sequencing, single molecule real time (SMRT) sequencing, nanopore DNA sequencing, shot gun sequencing, RNA sequencing, Enigma sequencing, or any combination thereof.
In some cases, a method may comprise sequencing. The sequencing may include bisulfite sequencing or bisulfite-free sequencing. In some cases, a method may comprise oxidizing one or more bases of a nucleic acid sequence or complementary strand or combination thereof. In some cases, a method may comprise selectively enriching for a nucleic acid sequence that contains at least one epigenetic modification.
The term “substrate” as used herein, may be a surface with which an entity (such as a label, a functional group, an epigenetic modification, a label or functional moiety associated with an epigenetic modification, a label or functional moiety associated with a parent strand) can be associated. In some cases, an entity may be immobilized to the substrate (such as a support). In some cases, an entity may be reversibly or irreversibly bound to the substrate (such as a support). In some cases, an entity may comprise a label. In such cases, a label may also associate with a nucleic acid sequence. In some cases, an entity may comprise a label, a nucleic acid sequence, a sugar, an enzyme, or any combination thereof. A substrate may comprise a bead. A substrate may comprise a plurality of beads. A substrate may comprise an array of beads. A substrate may comprise an array, such as an array of wells or an array of beads. A substrate (such as a solid support) may comprise a column, such as a packed column, a size-exclusion column, a magnetic column, or any combination thereof. A substrate may comprise a membrane. A substrate may comprise a bead, a capillary, a plate, a membrane, a wafer, a well, a plurality of any of these, an array of any of these, or any combination thereof. A substrate (such as a support) may positively select a nucleic acid sequence of interest by associating the nucleic acid sequence of interest with the substrate. A substrate may negatively select for a nucleic acid sequence of interest by associating other nucleic acid sequences of a sample with the substrate.
A bead may comprise one or more beads. A bead may comprise an array of beads. A bead may be associated with a substrate. A bead may be associated with a label. A bead may associate a label with a substrate. A bead may be associated with a substrate, a label, a nucleic acid sequence or any combination thereof. A bead may comprise a polymer, a metal, or a combination thereof. A bead may comprise a hydrogel, a silica gel, a glass, a resin, a metal, a metal alloy, a plastic, a cellulose, an agarose, a magnetic material, or any combination thereof.
The present disclosure provides substrates and methods of making substrates. The nature and geometry of a support or substrate can depend upon a variety of factors, including the type of array (e.g., one-dimensional, two-dimensional or three-dimensional). Generally, a substrate can be composed of any material which will not melt or otherwise substantially degrade under the conditions used to hybridize and/or denature nucleic acids. A substrate can be composed of any material which will permit coupling of an entity (such as a label associated with an epigenetic modification on a parent oligonucleotide fragment) at one or more discrete regions and/or discrete locations within the discrete regions. A substrate can be composed of any material which permit washing or physical or chemical manipulation without dislodging an entity (such as a label associated with an epigenetic modification on a parent oligonucleotide fragment) from the substrate.
Substrates can be fabricated by the transfer of an entity onto the solid surface in an organized high-density format followed by coupling the entity thereto. The techniques for fabrication of a substrate of the invention include, but are not limited to, photolithography, ink jet and contact printing, liquid dispensing and piezoelectrics. The patterns and dimensions of arrays are to be determined by each specific application. The sizes of each entity spot may be easily controlled by the users. A method of making a solid substrate can comprise contacting or coupling an entity to a discrete location.
A substrate may take a variety of configurations ranging from simple to complex, depending on the intended use of the array. Thus, a substrate can have an overall slide or plate configuration, such as a rectangular or disc configuration. A standard microplate configuration can be used. In some embodiments, the surface may be smooth or substantially planar, or have irregularities, such as depressions or elevations. In some instances, a substrate may have a rectangular cross-sectional shape, having a length of from about: 10-200 millimeters (mm), 40-150 mm, or 75-125 mm; a width of from about: 10-200 mm, 20-120 mm, or 25-80 mm, and a thickness of from about: 0.01-5.0 mm, 0.1-2 mm, or 0.2 to 1 mm.
A support may be organic or inorganic; may be metal (e.g., copper or silver) or non-metal; may be a polymer or nonpolymer; may be conducting, semiconducting or nonconducting (insulating); may be reflecting or nonreflecting; may be porous or nonporous; etc. A substrate as described above can be formed of any suitable material, including metals, metal oxides, semiconductors, polymers (particularly organic polymers in any suitable form including woven, nonwoven, molded, extruded, cast, etc.), silicon, silicon oxide, and composites thereof.
A number of materials (e.g., polymers) suitable for use as substrates (e.g., solid substrates) in the instant invention have been described in the art. Suitable materials for use as substrates include, but are not limited to, polycarbonate, gold, silicon, silicon oxide, silicon oxynitride, indium, tantalum oxide, niobium oxide, titanium, titanium oxide, platinum, iridium, indium tin oxide, diamond or diamond-like film, acrylic, styrene-methyl methacrylate copolymers, ethylene/acrylic acid, acrylonitrile-butadiene-styrene (ABS), ABS/polycarbonate, ABS/polysulfone, ABS/polyvinyl chloride, ethylene propylene, ethylene vinyl acetate (EVA), nitrocellulose, nylons (including nylon 6, nylon 6/6, nylon 6/6-6, nylon 6/9, nylon 6/10, nylon 6/12, nylon 11 and nylon 12), polyacrylonitrile (PAN), polyacrylate, polycarbonate, polybutylene terephthalate (PBT), poly(ethylene) (PE) (including low density, linear low density, high density, cross-linked and ultra-high molecular weight grades), poly(propylene) (PP), cis and trans isomers of poly(butadiene) (PB), cis and trans isomers of poly(isoprene), polyethylene terephthalate) (PET), polypropylene homopolymer, polypropylene copolymers, polystyrene (PS) (including general purpose and high impact grades), polycarbonate (PC), poly(epsilon-caprolactone) (PECL or PCL), poly(methyl methacrylate) (PMMA) and its homologs, poly(methyl acrylate) and its homologs, poly(lactic acid) (PLA), poly(glycolic acid), polyorthoesters, poly(anhydrides), nylon, polyimides, polydimethylsiloxane (PDMS), polybutadiene (PB), polyvinylalcohol (PVA), polyacrylamide and its homologs such as poly(N-isopropyl acrylamide), fluorinated polyacrylate (PFOA), poly(ethylene-butylene) (PEB), poly(styrene-acrylonitrile) (SAN), polytetrafluoroethylene (PTFE) and its derivatives, polyolefin plastomers, fluorinated ethylene-propylene (FEP), ethylene-tetrafluoroethylene (ETFE), perfluoroalkoxyethylene (PFA), polyvinyl fluoride (PVF), polyvinylidene fluoride (PVDF), polychlorotrifluoroethylene (PCTFE), polyethylene-chlorotrifluoroethylene (ECTFE), styrene maleic anhydride (SMA), metal oxides, glass, silicon oxide or other inorganic or semiconductor material (e.g., silicon nitride), compound semiconductors (e.g., gallium arsenide, and indium gallium arsenide), and combinations thereof.
Examples of well-known substrates include polypropylene, polystyrene, polyethylene, dextran, nylon, amylases, glass, natural and modified celluloses (e.g., nitrocellulose), polyacrylamides, agaroses and magnetite. In some instances, the substrate can be silica or glass because of its great chemical resistance against solvents, its mechanical stability, its low intrinsic fluorescence properties, and its flexibility of being readily functionalized. In one embodiment, the substrate is glass, particularly glass coated with nitrocellulose, more particularly a nitrocellulose-coated slide (e.g., FAST slides).
A substrate may be modified with one or more different layers of compounds or coatings that serve to modify the properties of the surface in a desirable manner. For example, a substrate may further comprise a coating material on the whole or a portion of the surface of the substrate. In some embodiments, a coating material enhances the affinity of the entity (such as a functional group) for the substrate. For example, the coating material can be nitrocellulose, silane, thiol, disulfide, or a polymer. When the material is a thiol, the substrate may comprise a gold-coated surface and/or the thiol comprises hydrophobic and hydrophilic moieties. When the coating material is a silane, the substrate comprises glass and the silane may present terminal moieties including, for example, hydroxyl, carboxyl, phosphate, glycidoxy, sulfonate, isocyanato, thiol, or amino groups. In an alternative embodiment, the coating material may be a derivatized monolayer or multilayer having covalently bonded linker moieties. For example, the monolayer coating may have thiol (e.g., a thioalkyl selected from the group consisting of a thioalkyl acid (e.g., 16-mercaptohexadecanoic acid), thioalkyl alcohol, thioalkyl amine, and halogen containing thioalkyl compound), disulfide or silane groups that produce a chemical or physicochemical bonding to the substrate. The attachment of the monolayer to the substrate may also be achieved by non-covalent interactions or by covalent reactions.
After attachment to the substrate, a coating may comprise at least one functional group. Examples of functional groups on the monolayer coating include, but are not limited to, carboxyl, isocyanate, halogen, amine or hydroxyl groups. In one embodiment, these reactive functional groups on the coating may be activated by standard chemical techniques to corresponding activated functional groups on the monolayer coating (e.g., conversion of carboxyl groups to anhydrides or acid halides, etc.). Exemplary activated functional groups of the coating on the substrate for covalent coupling to terminal amino groups include anhydrides, N-hydroxysuccinimide esters or other common activated esters or acid halides, Exemplary activated functional groups of the coating on the substrate include anhydride derivatives for coupling with a terminal hydroxyl group; hydrazine derivatives for coupling onto oxidized sugar residues of the linker compound; or maleimide derivatives for covalent attachment to thiol groups of the linker compound. To produce a derivatized coating, at least one terminal carboxyl group on the coating can be activated to an anhydride group and then reacted, for example, with a linker compound. Alternatively, the functional groups on the coating may be reacted with a linker having activated functional groups (e.g., N-hydroxysuccinimide esters, acid halides, anhydrides, and isocyanates) for covalent coupling to reactive amino groups on the coating.
A substrate can contain a linker (e.g., to indirectly couple an entity to the substrate). In one embodiment, a linker has one terminal functional group, a spacer region and an entity adhering region. The terminal functional groups for reacting with functional groups on an activated coating include halogen, amino, hydroxyl, or thiol groups. In some instances, a terminal functional group is selected from the group consisting of a carboxylic acid, halogen, amine, thiol, alkene, acrylate, anhydride, ester, acid halide, isocyanate, hydrazine, maleimide and hydroxyl group. The spacer region may include, but is not limited to, polyethers, polypeptides, polyamides, polyamines, polyesters, polysaccharides, polyols, multiple charged species or any other combinations thereof. Exemplary spacer regions include polymers of ethylene glycols, peptides, glycerol, ethanolamine, serine, inositol, etc. The spacer region may be hydrophilic in nature. The spacer region may be hydrophobic in nature. In some instances, the spacer has n oxyethylene groups, where n is between 2 and 25. In some instances, a region of a linker that adheres to an entity may be hydrophobic or amphiphilic with straight or branched chain alkyl, alkynyl, alkenyl, aryl, arylalkyl, heteroalkyl, heteroalkynyl, heteroalkenyl, heteroaryl, or heteroarylalkyl. In some instances, a region of a linker that adheres to an entity may comprise a C10-C25 straight or branched chain alkyl or heteroalkyl hydrophobic tail. In some instances, a linker comprises a terminal functional group on one end, a spacer, an entity adhering region, and a hydrophilic group on another end. The hydrophilic group at one end of the linker may be a single group or a straight or branched chain of multiple hydrophilic groups (e.g., a single hydroxyl group or a chain of multiple ethylene glycol units).
In some embodiments, a support can be planar. In some instances, a support can be spherical. In some instances, a support can be a bead. In some instances, a support can be magnetic. In some instances, a magnetic substrate can comprises magnetite, maghemitite, FePt, SrFe, iron, cobalt, nickel, chromium dioxide, ferrites, or mixtures thereof. In some instances, a support can be nonmagnetic. In some embodiments, the nonmagnetic substrate can comprise a polymer, metal, glass, alloy, mineral, or mixture thereof. In some instances a nonmagnetic material can be a coating around a magnetic substrate. In some instances, a magnetic material may be distributed in the continuous phase of a magnetic material. In some embodiments, the substrate comprises magnetic and nonmagnetic materials. In some instances, a substrate can comprise a combination of a magnetic material and a nonmagnetic material. In some embodiments, the magnetic material is at least about: 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, or about 80% by weight of the total composition of the substrate. In some embodiments, the bead size can be quite large, on the order of from about 100 microns to about 900 microns or in some cases even up to a diameter of about 3 mm. In other embodiments, the bead size can be on the order of from about 1 microns to about 150 microns. The average particle diameters of beads of the invention can be in the range of from about 2 μm to about several millimeters, e.g., diameters in ranges having lower limits of about: 2 μm, 4 μm, 6 μm, 8 μm, 10 μm, 20 μm, 30 μm, 40 μm, 50 μm, 60 μm, 70 μm, 80 μm, 90 μm, 100 μm, 150 μm, 200 μm, 300 μm, or 500 μm, and upper limits of 20 μm, 30 μm, 40 μm, 50 μm, 60 μm, 70 μm, 80 μm, 90 μm, 100 μm, 150 μm, 200 μm, 300 μm, 500 μm, 750 μm, 1 mm, 2 mm, or 3 mm.
The term “tissue” as used herein, may be any tissue sample. A tissue may be a tissue suspected or confirmed of having a disease or condition. A tissue may be a sample that may be substantially healthy, substantially benign, or otherwise substantially free of a disease or a condition. A tissue may be a tissue removed from a subject, such as a tissue biopsy, a tissue resection, an aspirate (such as a fine needle aspirate), a tissue washing, a cytology specimen, a bodily fluid, or any combination thereof. A tissue may comprise cancerous cells, tumor cells, non-cancerous cells, or a combination thereof. A tissue may comprise brain tissue, cerebral spinal tissue, cerebral spinal fluid, breast tissue, bladder tissue, kidney tissue, liver tissue, colon tissue, thyroid tissue, cervical tissue, prostate tissue, lung tissue, heart tissue, muscle tissue, pancreas tissue, anal tissue, bile duct tissue, a bone tissue, uterine tissue, ovarian tissue, endometrial tissue, vaginal tissue, vulvar tissue, stomach tissue, ocular tissue, nasal tissue, sinus tissue, penile tissue, salivary gland tissue, gut tissue, gallbladder tissue, gastrointestinal tissue, bladder tissue, brain tissue, spinal tissue, a blood sample, or any combination thereof. A tissue may be a sample that may be genetically modified.
The term “subject,” as used herein, may be any animal or living organism. Animals can be mammals, such as humans, non-human primates, rodents such as mice and rats, dogs, cats, pigs, sheep, rabbits, and others. Animals can be fish, reptiles, or others. Animals can be neonatal, infant, adolescent, or adult animals. Humans can be more than about: 1, 2, 5, 10, 20, 30, 40, 50, 60, 65, 70, 75, or about 80 years of age. The subject may have or be suspected of having a condition or a disease, such as cancer. The subject may be a patient, such as a patient being treated for a condition or a disease, such as a cancer patient. The subject may be predisposed to a risk of developing a condition or a disease such as cancer. The subject may be in remission from a condition or a disease, such as a cancer patient. The subject may be healthy.
The term “reads per kilobase per million mapped reads (RPKM),” as used herein, may be a method of quantifying gene expression from sequencing data (such as RNA sequencing data) by normalizing for a total read length and/or a number of sequencing reads. In some cases, RPKM may correct for differences in sample sequencing depth. In some cases, RPKM may correct for differences in gene length. In some cases, RPKM may correct for differences in both sample sequencing depth and gene length. RPKM may be a method of normalizing data for comparison of gene coverage values. In some cases, RPKM may be defined as numReads/((geneLength/1000)*(totalNumReads/1,000,000)), wherein numReads may be a number of reads mapped to a gene sequence, geneLenth may be a length of the gene sequence, and totalNumReads may be a total number of mapped reads of a sample.
A substantially complementary strand may be hybridized to a portion of a nucleic acid sequence. A substantially complementary strand may be substantially a same length as a nucleic acid sequence. A substantially complementary strand may comprise a length that may be at least about: 99%, 95%, 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, 5%, 3%, 2%, 1% of a length of a nucleic acid sequence. A substantially complementary strand may be shorter, longer or the same in length compared to a nucleic acid sequence.
A substantially complementary strand may be hybridized to at least a portion of a nucleic acid sequence. In some cases, two substantially complementary strands may be hybridized to portions of a nucleic acid sequence. In some cases, the two substantially complementary strands may be ligated. In some cases, three or more substantially complementary strands may be hybridized to portions of a nucleic acid sequence. In some cases, the three or more substantially complementary strands may be ligated. In some cases, a substantially complementary strand may be hybridized to a portion of a nucleic acid sequence that may comprise an adaptor sequence. A substantially complementary strand may be elongated, such as elongated before amplifying.
A nucleic acid sequence may comprise a cytosine guanine (CG) site, a cytosine phosphate guanine (CpG) island, a portion of any of these, or a combination thereof. A CpG island may comprise one or more CG sites. A nucleic acid sequence may comprise one or more CG sites or portions thereof. A nucleic acid sequence may comprise dense CG sites, dense CpG islands or a combination thereof. A nucleic acid sequence may comprise a plurality of CG sites or portions thereof. A nucleic acid sequence may comprise one or more CpG islands or portions thereof. A nucleic acid sequence may comprise a plurality of CpG islands or portions thereof. One or more bases of a nucleic acid sequence comprising a CG site, a CpG island, a portion thereof, or any of these may comprise an epigenetically modified base, such as a methylated base or a hydroxymethylated base. One or more cytosines of a nucleic acid sequence comprising a CG site, a CpG island, a portion thereof, or any of these may comprise an epigenetically modified cytosine, such as a methylated cytosine or a hydroxymethylated cytosine. A CpG island (or a CG island) may be a region with a high frequency of CG sites. A CpG island may be a region of a nucleic acid sequence with at least about 200 basepairs (bp) and a GC percentage that may be greater than about 50% and with an observed-to-expected CpG ratio that may be greater than about 60%. An “observed-to-expected CpG ratio” may be derived where the observed may be calculated as:
(number of CpGs)
and the expected may be calculated as:
(number of C*number of G)/length of sequence
or the expected may be calculated as:
((number of C+number of G)/2)2/length of sequence
In some cases, a CpG island may be a region of a nucleic acid sequence with at least about: 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 225, 250, 300, 350, 400, 450, 500, 550, 600 bp. In some cases, a CpG island may be a region of a nucleic acid sequence with from about 20 to about 600 bp. In some cases, a CpG island may be a region of a nucleic acid sequence with from about 20 to about 500 bp. In some cases, a CpG island may be a region of a nucleic acid sequence with from about 10 to about 500 bp. In some cases, a CpG island may be a region of a nucleic acid sequence with from about 10 to about 300 bp. In some cases, a CpG island may be a region of a nucleic acid sequence with from about 20 to about 200 bp.
In some cases, a GC percentage in a CpG island may be greater than about: 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or greater. In some cases, a GC percentage in a CpG island may be from about 50% to about 95%. In some cases, a GC percentage in a CpG island may be from about 50% to about 99%. In some cases, a GC percentage in a CpG island may be from about 55% to about 85%. In some cases, a GC percentage in a CpG island may be from about 60% to about 99%. In some cases, a GC percentage in a CpG island may be from about 70% to about 99%.
The term “density of epigenetic modifications,” as used herein, may be a number or percentage of bases within a sequence that have an epigenetic modification. The epigenetic modification may be a methylated cytosine, a hydroxymethylated cytosine, a carboxylated cytosine, a formylated cytosine, or other epigenetic modification. In some cases, a sequence having 6 5-hmC may represent a sequence having a high density of epigenetic modifications. In some cases, a sequence having 2 5-hmC may represent a sequence having a low density of epigenetic modifications. A low density of epigenetic modifications may include a sequence having less than 5, 4, 3, or 2 CG sites per 10 base pairs (bp), 20 bp, 30 bp, 40 bp, 50 bp segment of the sequence. A high density of epigenetic modifications may include a sequence having more than 2, 3, 4, 5, 6, 7, 8, 9, 10 CG sites per 10 bp, 15 bp, 30 bp, 35 bp, 40 bp, 50 bp segment of the sequence. A high density of epigenetic modifications may include a sequence having a plurality of CpG islands or CG sites. A high density of epigenetic modifications may include a sequence having at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more CpG islands or CG sites per sequence.
As used herein, the term “cell-free” refers to the condition of the nucleic acid sequence as it appeared in the body before the sample is obtained from the body. For example, circulating cell-free nucleic acid sequences in a sample may have originated as cell-free nucleic acid sequences circulating in the bloodstream of the human body. In contrast, nucleic acid sequences that are extracted from a solid tissue, such as a biopsy, are generally not considered to be “cell-free.” In some cases, cell-free DNA may comprise fetal DNA, maternal DNA, or a combination thereof. In some cases, cell-free DNA may comprise DNA fragments released into a blood plasma. In some cases, the cell-free DNA may comprise circulating tumor DNA. In some cases, cell-free DNA may comprise circulating DNA indicative of a tissue origin, a disease or a condition. A cell-free nucleic acid sequence may be isolated from a blood sample. A cell-free nucleic acid sequence may be isolated from a plasma sample. A cell-free nucleic acid sequence may comprise a complementary DNA (cDNA). In some cases, one or more cDNAs may form a cDNA library.
In some cases, a nucleic acid sequence may be double-stranded, such as a cDNA library comprising the nucleic acid sequence. In some cases, a nucleic acid sequence may be double-stranded such as when a substantially complementary strand may be hybridized to at least a portion of the nucleic acid sequence. In some cases, a portion of a nucleic acid sequence may be double-stranded, such as when a primer may be hybridized to a portion of the nucleic acid sequence.
A nucleic acid sequence may be from a sample. A sample may be isolated from a subject. A subject may be a human subject. A sample may comprise a buccal sample, a saliva sample, a blood sample, a plasma sample, a reproductive sample (such as an egg or a sperm), a mucus sample, a cerebral spinal fluid sample, a tissue sample, a tissue biopsy, a surgical resection, a fine needle aspirate sample, or any combination thereof. In some cases, a sample may comprise a blood sample. In some cases, a sample may comprise a buccal sample.
In some cases, a subject may have previously received a diagnosis of a disease or condition prior to performing a method as described herein. A subject may have previously received a positive diagnosis of a disease, such as a cancer. A subject may have previously received an indeterminate or inclusive diagnosis of a disease, such as a cancer. A subject may be a subject in need thereof, such as a need for a definitive diagnosis or a need for a selection of a therapeutic treatment regime.
In some cases, a subject may not have previously received a diagnosis of a disease or condition prior to performing a method as described herein. In some cases, a subject may be suspected of having a disease or condition, such as having one or more symptoms of a disease or condition. In some cases, a subject may be at risk of developing a disease or condition, such as a subject having a biomarker or genetic indication that may be indicative of a risk of developing a disease or condition. In some cases, a disease or a condition may comprise a cancer.
In some cases, a method as described herein may comprise obtaining a result. A method may comprise obtaining a result and reporting the result. A result may be reported to a user, a medical professional, a subject, or any combination thereof. A result may be reported via a communication medium. A communication medium may include a written report or a printed report. A communication medium may include a visual display such as a graphical user interface. A communication medium may comprise a result provided by a computer, a tablet device, a cellphone, or other electronic device. A result may comprise a diagnosis of a disease or condition or a confirmation of an absence of a disease or condition. A result may comprise a diagnosis of a subject as having a disease or condition. A result may comprise a confirmation of an absence of the disease or condition. A result may comprise a likelihood or a risk of a subject to develop a disease or a condition. In some cases, a disease or a condition may comprise a cancer. A result may comprise predicting mortality of a subject, determining a biological age of a subject, or a combination thereof. A mortality prediction or biological age determination may be based on a presence of an epigenetic modification, sequencing information or any combination thereof. A result, such as a prediction of a likelihood of a disease or condition or a diagnosis of a disease or condition may be based on a presence of an epigenetic modification, sequencing information or a combination thereof. A presence of an epigenetic modification may include a pattern of epigenetic modification, a presence of a specific epigenetic modification, a level of an epigenetic modification, or any combination thereof.
A method as described herein may comprise comparing a result to a reference. A reference may comprise a plurality of references. A reference may comprise a database comprising a plurality of results. A reference may comprise a control sample. A reference may comprise a positive control sample, a negative control sample, or a combination thereof. A reference, such as a reference sample, may be obtained from a subject or from a different source, such as a different subject. A diagnosis may comprise comparing a result to a reference. In some cases, a result comprising a diagnosis may at least partially confirm a previous diagnosis.
One or more results obtained from a method described herein may provide a quantitative value or values indicative of one or more of the following: a likelihood of diagnostic accuracy, a likelihood of a presence of a condition in a subject, a likelihood of a subject developing a condition, a likelihood of success of a particular treatment, or any combination thereof. A method as described herein may predict a risk or likelihood of developing a condition. A method as described herein may be an early diagnostic indicator of developing a condition. A method as described herein may confirm a diagnosis or a presence of a condition. A method as described herein may monitor the progression of a condition. A method as described herein may monitor the efficacy of a treatment for a condition in a subject.
Samples obtained for analysis using the methods described herein may be obtained from a subject. The subject may not have any symptoms of a condition. The subject may have one or more symptoms of a condition. The subject may be a risk, such as a genetic risk, of developing a condition. The subject may have previously received a positive diagnosis. The subject may have previously received an indeterminate result from a diagnostic test. The subject may be currently receiving in a treatment.
Methods for diagnosing and/or suggesting, selecting, designating, recommending or otherwise determining a course of treatment for a subject having or suspected of having a condition can be employed in combination with the methods as described herein. These techniques may include cytological analysis or histological classification, molecular profiling, a blood test, a genetic analysis, ultrasound analysis, MRI results, CT scan results, other imaging scans, measurements of hormone cytokine or blood cell levels, or any combination thereof. The methods described herein may include at least one other type of diagnostic method. The methods described herein may include at least two other diagnostic methods.
In some embodiments, the methods of the present invention provide for storing the sample for a time such as seconds, minutes, hours, days, weeks, months, years or longer after the sample is obtained and before the sample is analyzed by one or more methods of the invention. In some cases, the sample obtained from a subject is subdivided prior to the step of storage or further analysis such that different portions of the sample are subject to different downstream methods or processes including but not limited to any combination of methods described herein, storage, bisulfite treatment, amplification, sequencing, labeling, cytological analysis, adequacy tests, nucleic acid extraction, molecular profiling or a combination thereof.
In some cases, a portion of the sample may be stored while another portion of said sample is further manipulated. Such manipulations may include but are not limited to any method as described herein; bisulfite treatment; sequencing; amplification; labeling; selective enrichment; molecular profiling; cytological staining; nucleic acid (RNA or DNA) extraction, detection, or quantification; gene expression product (RNA or Protein) extraction, detection, or quantification; fixation; and examination. The sample may be fixed prior to or during storage by any method known to the art such as using glutaraldehyde, formaldehyde, or methanol. In other cases, the sample is obtained and stored and subdivided after the step of storage for further analysis such that different portions of the sample are subject to different downstream methods.
A method as described herein may comprise treating a subject. In some cases, a treatment may comprise surgery, chemotherapy, radiation therapy, immunotherapy, targeted therapy, hormone therapy, stem cell transplantation, precision medicine, or any combination thereof. In some cases, a treatment may comprise further monitoring of a condition of a subject. In some cases, a subject diagnosed with a disease or condition may receive a treatment to treat a disease or a condition. In some cases, a subject receiving a confirmation of a likelihood or a risk of developing a disease or a condition, may receive a treatment, such as a preventive treatment. A treatment for a subject may be selected based on a result of a method, such as a confirmed positive diagnosis of a disease or a condition. A result may comprise one or more treatments, such as a recommended treatment, for a subject based on a result. A treatment may comprise a single treatment. A treatment may comprise a recurring treatment. A treatment may comprise a recurring treatment over a remaining lifespan of a subject. A treatment may comprise a daily treatment. A treatment may comprise a biweekly treatment. A treatment may be selected base on a result.
In some embodiments, a treatment for a subject can be a surgery (such as a tissue resection), a nutrition regime, a physical activity, a radiation treatment, a chemotherapy, an immunotherapy, a pharmaceutical composition, a cell transplantation, a blood fusion, or any combination thereof.
The methods described herein, such as assaying and comparing, may be conducted prior to an operation on a diseased tissue of the subject, such as a tumor resection. The methods described herein may be conducted prior to the subject having a positive disease diagnosis, such as a cancer or a tumor diagnosis. The methods described herein may be conducted on a subject suspected of having a condition or a disease, such as a cancer or a tumor. The methods described herein may be conducted on a subject that has received a positive disease diagnosis, such as a positive cancer or a positive tumor diagnosis. The methods described herein may be conducted on a subject having received a prior treatment regime, wherein the prior treatment regime was ineffective in eliminating the disease or condition, such as a cancer or tumor. A tissue sample may be obtained from a subject prior to performing the methods described herein. A tissue sample may be obtained during a biopsy, fine needle aspiration, blood sample, surgery resection, or any combination thereof.
Assaying a tissue sample of a subject may be performed at one or more time points. A separate tissue sample may be obtained from the subject for assaying at each of the one or more time points. Assaying at one or more time points may be performed on the same tissue sample. Assaying at one or more time points may provide an assessment of an effectiveness of a drug, a longitudinal course of a disease treatment regime, or a combination thereof. At each of the one or more time points, a tissue sample may be compared to a same reference. A tissue sample may be compared to a different reference at each of the one or more time points. The one or more time points may be the same. The one or more time points may be different. The one or more time points may comprise at least one time point prior to a drug administration, at least one time point after a drug administration, at least one time point prior to a positive disease diagnosis, at least one time point after a disease remission diagnosis, at least one time point during a disease treatment regime, or a combination thereof.
The methods as described herein may be used for diagnosis of a particular condition and also to monitor efficacy of a particular treatment after an initial diagnosis or monitor progression of a particular condition. The methods as described herein may be used to monitoring a subject as risk of developing a particular condition, as a preventive measure. The methods as described herein may be used alone for diagnosis and/or monitoring efficacy of a particular treatment. The methods as described herein may be used in combination with other assays for diagnosis or monitoring (such as a cytological analysis or molecular profiling).
A subject may be monitored using methods as disclosed herein. For example, a subject may be diagnosed with condition, such as a cancer or a genetic disorder. This initial diagnosis may or may not involve the use of the methods described herein. The subject may be prescribed a treatment such as surgical resection of a tumor or chemotherapy. The results of the treatment may be monitored on an ongoing basis by the methods described herein to detect the efficacy of the treatment. In another example, a subject may be diagnosed with a benign tumor or a precancerous lesion or nodule, and the tumor, nodule, or lesion may be monitored on an ongoing basis by the methods described herein to detect any changes in the state of the tumor or lesion.
The methods described herein may also be used to ascertain the potential efficacy of a specific treatment prior to administering to a subject. For example, a subject may be diagnosed with cancer. The methods described herein may indicate a presence of one or more epigenetic residues on a particular nucleic acid sequence known to be involved in cancer malignancy. A further sample may be obtained from the subject and cultured in vitro using methods known to the art. The application of various inhibitors or drugs may then be tested for growth inhibition. The methods described herein may also be used to monitor the effect of these inhibitors on for example down-stream targets of the implicated pathway.
In some embodiments, the methods described herein may be used as a research tool to identify new markers for diagnosis of conditions (such as suspected tumors); to monitor the effect of drugs or candidate drugs on samples such as tumor cells, cell lines, tissues, or organisms; or to uncover new pathways for disease prevention or inhibition (such as oncogenesis and/or tumor suppression).
In some cases, an oligonucleotide fragment may comprises one or more epigenetically modified bases, such as (a) one or more epigenetically modified cytosines, (b) one or more epigenetically modified uracils, (c) one or more epigenetically modified thymines, (d) one or more epigenetically modified guanine, (e) one or more epigenetically modified adenines, or (f) any combination thereof.
A nucleic acid sequence may comprise one or more epigenetically modified bases. For example, a nucleic acid sequence may comprise at least about: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more epigenetically modified bases per about 20 basepairs of the nucleic acid sequence. A nucleic acid sequence may comprise about: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 epigenetically modified bases per about 20 basepairs of the nucleic acid sequence.
A nucleic acid sequence may comprise one or more epigenetically modified bases. For example, about: 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, at least about: 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 4% to about 10% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 4% to about 6% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 4% to about 20% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 4% to about 30% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 3% to about 30% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 30% to about 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 40% to about 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 50% to about 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases. In some cases, from about 60% to about 90% of total bases of a nucleic acid sequence may comprise epigenetically modified bases.
A nucleic acid sequence (in some cases comprising a plurality of epigenetically modified residues) may be enriched. Enrichment of the nucleic acid sequence may comprise amplification such as amplification by polymerase chain reaction (PCR), loop mediated isothermal amplification, nucleic acid sequence based amplification, strand displacement amplification, multiple displacement amplification, rolling circle amplification, ligase chain reaction, helicase dependent amplification, ramification amplification method, or any combination thereof.
In some cases, amplification may comprise at least 2 cycles of amplification. Amplification may comprise at least 3 cycles of amplification. Amplification may comprise at least 4 cycles of amplification. Amplification may comprise at least 5 cycles of amplification. Amplification may comprise at least 6 cycles of amplification. Amplification may comprise at least 7 cycles of amplification. Amplification may comprise at least 8 cycles of amplification. Amplification may comprise at least 9 cycles of amplification. Amplification may comprise at least 10 cycles of amplification. Amplification may comprise at least 11 cycles of amplification. Amplification may comprise at least 12 cycles of amplification. Amplification may comprise at least 13 cycles of amplification. Amplification may comprise at least 14 cycles of amplification. Amplification may comprise at least 15 cycles of amplification. Amplification may comprise at least 20 cycles of amplification. Amplification may comprise at least 25 cycles of amplification. Amplification may comprise at least 30 cycles of amplification.
In some cases, amplification of a given number of cycles produces a plurality of sequence reads that retain a percentage of original sequence length. In some cases, about 90% of the plurality of sequence reads retain at least about 90% of the sequence length. In some cases, about 80% of the plurality of sequence reads retain at least about 90% of the sequence length. In some cases, about 75% of the plurality of sequence reads retain at least about 90% of the sequence length. In some cases, about 95% of the plurality of sequence reads retain at least about 90% of the sequence length. In some cases, about 85% of the plurality of sequence reads retain at least about 90% of the sequence length.
In some cases, about 90% of the plurality of sequence reads retain at least about 85% of the sequence length. In some cases, about 80% of the plurality of sequence reads retain at least about 85% of the sequence length. In some cases, about 75% of the plurality of sequence reads retain at least about 85% of the sequence length. In some cases, about 95% of the plurality of sequence reads retain at least about 85% of the sequence length. In some cases, about 85% of the plurality of sequence reads retain at least about 85% of the sequence length.
In some cases, about 90% of the plurality of sequence reads retain at least about 80% of the sequence length. In some cases, about 80% of the plurality of sequence reads retain at least about 80% of the sequence length. In some cases, about 75% of the plurality of sequence reads retain at least about 80% of the sequence length. In some cases, about 95% of the plurality of sequence reads retain at least about 80% of the sequence length. In some cases, about 85% of the plurality of sequence reads retain at least about 80% of the sequence length.
In some cases, a portion of bases of the substantially complementary strand may base pair with a nucleic acid sequence. In some cases, at least about: 70%, 75%, 80%, 85%, 90%, 95%, or 98% of bases of the substantially complementary strand may base pair with a nucleic acid sequence. In some cases, at least about 70% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, at least about 80% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, at least about 90% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, at least about 95% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, at least about 98% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, from about 70% to 100% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, from about 75% to 100% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, from about 80% to 100% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, from about 85% to 100% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some cases, from about 90% to 100% of bases of the substantially complementary strand may base pair with the nucleic acid sequence. In some case, a substantially complementary strand may hybridize to a nucleic acid sequence under substantially stringent hybridization conditions, such as a substantially high hybridization temperature, a substantially low salt content in a hybridization buffer, or a combination thereof.
In some cases, at least about: 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, at least about 1% of the bases of a nucleic acid sequence may comprise an epigenetically modified base.. In some cases, at least about 2% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In sonic cases, at least about 3% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, at least about 4% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. in some cases, at least about 5% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, at least about 10% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, from about 10% to about 100% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, from about 10% to about 90% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, from about 5% to about 100% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, from about 4% to about 100% of the bases of a nucleic acid sequence may comprise an epigenetically modified base. In some cases, from about 3% to about 100% of the bases of a nucleic acid sequence may comprise an epigenetically modified base.
In some cases, a nucleic acid sequence comprises at least about: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15 or 20 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 1 epigenetically modified base per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 2 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 3 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 4 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 5 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least about 10 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises from about 1 to about 10 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 3 to about 10 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 4 to about 10 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 5 to about 10 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence.
In some cases, a nucleic acid sequence comprises at least from about 1 to about 3 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 4 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 5 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 8 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 10 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 15 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence. In some cases, a nucleic acid sequence comprises at least from about 1 to about 20 epigenetically modified bases per at least about 20 bases of a nucleic acid sequence.
A sample obtained from a subject can comprise tissue, cells, cell fragments, cell organelles, nucleic acids, genes, gene fragments, expression products, gene expression products, gene expression product fragments or any combination thereof. A sample can be heterogeneous or homogenous. A sample can comprise blood, urine, cerebrospinal fluid, seminal fluid, saliva, sputum, stool, lymph fluid, tissue, mucus, or any combination thereof. A sample can be a tissue-specific sample such as a sample obtained from a reproductive tissue (such as a sperm or an egg), thyroid, skin, heart, lung, kidney, breast, pancreas, liver, muscle, smooth muscle, bladder, gall bladder, colon, intestine, brain, esophagus, prostate, or any combination thereof.
A sample of the present disclosure can be obtained by various methods, such as, for example, fine needle aspiration (FNA), core needle biopsy, vacuum assisted biopsy, incisional biopsy, excisional biopsy, core biopsy, punch biopsy, shave biopsy, skin biopsy, or any combination thereof.
A sample may be obtained from a subject by another individual or entity, such as a healthcare (or medical) professional or robot. A medical professional can include a physician, nurse, medical technician or other. In some cases, a physician may be a specialist, such as an oncologist, surgeon, or endocrinologist. A medical technician may be a specialist, such as a cytologist, phlebotomist, radiologist, pulmonologist or others. medical professional may obtain a sample from a subject for testing or refer the subject to a testing center or laboratory for the submission of the sample. The medical professional may indicate to the testing center or laboratory the appropriate test or assay to perform on the sample, such as methods of the present disclosure including determining gene sequence data, gene expression levels, sequence variant data, or any combination thereof.
In some cases, a medical professional need not be involved in the initial diagnosis of a condition or a disease or the initial sample acquisition. An individual, such as the subject, may alternatively obtain a sample through the use of an over the counter kit. The kit may contain collection unit or device for obtaining the sample as described herein, a storage unit for storing the sample ahead of sample analysis, and instructions for use of the kit.
Epigenetic modifications may be monitored over time. Monitoring epigenetic modification over time may include monitoring changes in a presence of an epigenetic modification, a level of an epigenetic modification, a pattern of an epigenetic modification. Monitoring may include monitoring an efficacy of a therapeutic, monitoring a progression of a disease, monitoring a regression of a disease, monitoring a risk or likelihood of developing a disease, monitoring a mortality prediction or biological age, or any combination thereof. A sample can be obtained a) pre-operatively, b) post-operatively, c) after a disease diagnosis, d) during routine screening following remission or cure of a disease, e) when a subject may be suspected of having a disease, f) during a routine office visit or clinical screen, g) following the request of a medical professional, or any combination thereof. Multiple samples at separate times can be obtained from the same subject, such as before treatment for a disease commences and after treatment ends, such as monitoring a subject over a time course. Multiple samples can be obtained from a subject at separate times to monitor the absence or presence of disease progression, regression, or remission in the subject.
A condition or a disease, as disclosed herein, can include a cancer, a neurological disorder, or an autoimmune disease.
In some cases, a disease or condition may comprise a neurological disorder, In some cases, a neurological disorder may comprise Acquired Epileptiform Aphasia, Acute Disseminated Encephalomyelitis, Adrenoleukodystrophy, Agenesis of the corpus callosum, Agnosia, Aicardi syndrome, Alexander disease, Alpers' disease, Alternating hemiplegia, Alzheimer's disease, Amyotrophic lateral sclerosis (see Motor Neuron Disease), Anencephaly, Angelman syndrome, Angiomatosis, Anoxia, Aphasia, Apraxia, Arachnoid cysts, Arachnoiditis, Arnold-Chiari malformation, Arteriovenous malformation, Asperger's syndrome, Ataxia Telangiectasia, Attention Deficit Hyperactivity Disorder, Autism, Auditory processing disorder, Autonomic Dysfunction, Back Pain, Batten disease, Behcet's disease, Bell's palsy, Benign Essential Blepharospasm, Benign Focal Amyotrophy, Benign Intracranial Hypertension, Bilateral frontoparietal polymicrogyria, Binswanger's disease, Blepharospasm, Bloch-Sulzberger syndrome, Brachial plexus injury, Brain abscess, Brain damage, Brain injury, Brain tumor, Brown-Sequard syndrome, Canavan disease, Carpal tunnel syndrome (CTS), Causalgia, Central pain syndrome, Central pontine myelinolysis, Centronuclear myopathy, Cephalic disorder, Cerebral aneurysm, Cerebral arteriosclerosis, Cerebral atrophy, Cerebral gigantism, Cerebral palsy, Charcot-Marie-Tooth disease, Chiari malformation, Chorea, Chronic inflammatory demyelinating polyneuropathy (CIDP), Chronic pain, Chronic regional pain syndrome, Coffin Lowry syndrome, Coma, including Persistent Vegetative State, Congenital facial diplegia, Corticobasal degeneration, Cranial arteritis, Craniosynostosis, Creutzfeldt-Jakob disease, Cumulative trauma disorders, Cushing's syndrome, Cytomegalic inclusion body disease (CIBD), Cytomegalovirus Infection, Dandy-Walker syndrome, Dawson disease, De Morsier's syndrome, Dejerine-Klumpke palsy, Dejerine-Sottas disease, Delayed sleep phase syndrome, Dementia, Dermatomyositis, Neurological Dyspraxia, Diabetic neuropathy, Diffuse sclerosis, Dysautonomia, Dyscalculia, Dysgraphia, Dyslexia, Dystonia, Early infantile epileptic encephalopathy, Empty sella syndrome, Encephalitis, Encephalocele, Encephalotrigeminal angiomatosis, Encopresis, Epilepsy, Erb's palsy, Erythromelalgia, Essential tremor, Fabry's disease, Fahr's syndrome, Fainting, Familial spastic paralysis, Febrile seizures, Fisher syndrome, Friedreich's ataxia, FART Syndrome, Gaucher's disease, Gerstmann's syndrome, Giant cell arteritis, Giant cell inclusion disease, Globoid cell Leukodystrophy, Gray matter heterotopia, Guillain-Barre syndrome, HTLV-1 associated myelopathy, Hallervorden-Spatz disease, Head injury, Headache, Hemifacial Spasm, Hereditary Spastic Paraplegia, Heredopathia atactica polyneuritiformis, Herpes zoster oticus, Herpes zoster, Hirayama syndrome, Holoprosencephaly, Huntington's disease, Hydranencephaly, Hydrocephalus, Hypercortisolism, Hypoxia, Immune-Mediated encephalomyelitis, Inclusion body myositis, Incontinentia pigmenti, Infantile phytanic acid storage disease, Infantile Refsum disease, Infantile spasms, Inflammatory myopathy, Intracranial cyst, Intracranial hypertension, Joubert syndrome, Kearns-Sayre syndrome, Kennedy disease, Kinsbourne syndrome, Klippel Feil syndrome,
Krabbe disease, Kugelberg-Welander disease, Kuru, Lafora disease, Lambert-Eaton myasthenic syndrome, Landau-Kleffner syndrome, Lateral medullary (Wallenberg) syndrome, Learning disabilities, Leigh's disease, Lennox-Gastaut syndrome, Lesch-Nyhan syndrome, Leukodystrophy, Lewy body dementia, Lissencephaly, Locked-In syndrome, Lou Gehrig's disease, Lumbar disc disease, Lyme disease-Neurological Sequelae, Machado-Joseph disease (Spinocerebellar ataxia type 3), Macrencephaly, Maple Syrup Urine Disease, Megalencephaly, Melkersson-Rosenthal syndrome, Menieres disease, Meningitis, Menkes disease, Metachromatic leukodystrophy, Microcephaly, Migraine, Miller Fisher syndrome, Mini-Strokes, Mitochondrial Myopathies, Mobius syndrome, Monomelic amyotrophy, Motor Neuron Disease, Motor skills disorder, Moyamoya disease, Mucopolysaccharidoses, Multi-Infarct Dementia, Multifocal motor neuropathy, Multiple sclerosis, Multiple system atrophy, Muscular dystrophy, Myalgic encephalomyelitis, Myasthenia gravis, Myelinoclastic diffuse sclerosis, Myoclonic Encephalopathy of infants, Myoclonus, Myopathy, Myotubular myopathy, Myotonia congenita,Narcolepsy, Neurofibromatosis, Neuroleptic malignant syndrome, Neurological manifestations of AIDS, Neurological sequelae of lupus, Neuromyotonia, Neuronal ceroid lipofuscinosis, Neuronal migration disorders, Niemann-Pick disease, Non 24-hour sleep-wake syndrome, Nonverbal learning disorder, O'Sullivan-McLeod syndrome, Occipital Neuralgia, Occult Spinal Dysraphism Sequence, Ohtahara syndrome, Olivopontocerebellar atrophy, Opsoclonus myoclonus syndrome, Optic neuritis, Orthostatic Hypotension, Overuse syndrome, Palinopsia, Paresthesia, Parkinson's disease, Paramyotonia Congenita, Paraneoplastic diseases, Paroxysmal attacks, Parry-Romberg syndrome, Rombergs Syndrome, Pelizaeus-Merzbacher disease, Periodic Paralyses, Peripheral neuropathy, Persistent Vegetative State, Pervasive neurological disorders, Photic sneeze reflex, Phytanic Acid Storage disease, Pick's disease, Pinched Nerve, Pituitary Tumors, PMG, Polio, Polymicrogyria, Polymyositis, Porencephaly, Post-Polio syndrome, Postherpetic Neuralgia (PHN), Postinfectious Encephalomyelitis, Postural Hypotension, Prader-Willi syndrome, Primary Lateral Sclerosis, Prion diseases, Progressive Hemifacial Atrophy also known as Rombergs Syndrome, Progressive multifocal leukoencephalopathy, Progressive Sclerosing Poliodystrophy, Progressive Supranuclear Palsy, Pseudotumor cerebri, Ramsay-Hunt syndrome (Type I and Type II), Rasmussen's encephalitis, Reflex sympathetic dystrophy syndrome, Refsum disease, Repetitive motion disorders, Repetitive stress injury, Restless legs syndrome, Retrovirus-associated myelopathy, Rett syndrome, Reye's syndrome, Rombergs Syndrome, Rabies, Saint Vitus dance, Sandhoff disease, Schytsophrenia, Schilder's disease, Schizencephaly, Sensory Integration Dysfunction, Septo-optic dysplasia, Shaken baby syndrome, Shingles, Shy-Drager syndrome, Sjogren's syndrome, Sleep apnea, Sleeping sickness, Snatiation, Sotos syndrome, Spasticity, Spina bifida, Spinal cord injury, Spinal cord tumors, Spinal muscular atrophy, Spinal stenosis, Steele-Richardson-Olszewski syndrome, see Progressive Supranuclear Palsy, Spinocerebellar ataxia, Stiff-person syndrome, Stroke, Sturge-Weber syndrome, Subacute sclerosing panencephalitis, Subcortical arteriosclerotic encephalopathy, Superficial siderosis, Sydenham's chorea, Syncope, Synesthesia, Syringomyelia, Tardive dyskinesia, Tay-Sachs disease, Temporal arteritis, Tethered spinal cord syndrome, Thomsen disease, Thoracic outlet syndrome, Tic Douloureux, Todd's paralysis, Tourette syndrome, Transient ischemic attack, Transmissible spongiform encephalopathies, Transverse myelitis, Traumatic brain injury, Tremor, Trigeminal neuralgia, Tropical spastic paraparesis, Trypanosomiasis, Tuberous sclerosis, Vasculitis including temporal arteritis, Von Hippel-Lindau disease (VEIL), Viliuisk Encephalomyelitis (VE), Wallenberg's syndrome, Werdnig-Hoffman disease, West syndrome, Whiplash, Williams syndrome, Wilson's disease, X-Linked Spinal and Bulbar Muscular Atrophy, and Zellweger syndrome. Neurological conditions can comprise movement disorders, for example multiple system atrophy (MSA).
In some cases, a disease or condition may comprise an autoimmune disease. In some cases, an autoimmune disease may comprise acute disseminated encephalomyelitis (ADEM), acute necrotizing hemorrhagic leukoencephalitis, Addison's disease, agammaglobulinemia, allergic asthma, allergic rhinitis, alopecia areata, amyloidosis, ankylosing spondylitis, anti-GBM/anti-TBM nephritis, antiphospholipid syndrome (APS), autoimmune aplastic anemia, autoimmune dysautonomia, autoimmune hepatitius, autoimmune hyperlipidemia, autoimmune immunodeficiency, autoimmune inner ear disease (MED), autoimmune myocarditis, autoimmune pancreatitis, autoimmune retinopathy, autoimmune thrombocytopenic purpura (ATP), autoimmune thyroid disease, axonal & neuronal neuropathies. Balo disease, Behcet's disease, bullous pemphigoid, cardiomyopathy, Castlemen disease, celiac sprue (non-tropical), Chagas disease, chronic fatigue syndrome, chronic inflammatory demyelinating polyneuropathy (CIDP), chronic recurrent multifocal ostomyelitis (CRMO), Churg-Strauss syndrome, cicatricial pemphigoid/benign mucosal pemphigoid, Crohn's disease, Cogan's syndrome, cold agglutinin disease, congenital heart block, coxsackie myocarditis. CREST disease, essential mixed cryoglobulinemia, demyelinating neuropathies, dermatomyositis, Devic's disease (neuromyelitis optica), discoid lupus, Dressler's syndrome, endometriosis, eosinophillic fasciitis, erythema nodosum, experimental allergic encephalomyelitis, Evan's syndrome, fibromyalgia., fibrosing alveolitis, giant cell arteritis (temporal arteritis), glomerulonephritis, Goodpasture's syndrome, Grave's disease, Guillain-Barre syndrome. Hashimoto's encephalitis, Hashimoto's thyroiditis, hemolytic anemia, Henock-Schoniein purpura, herpes gestationis, hypogammaglobulinemia, idiopathic thrombocytopenic purpura (ITP), IgA nephropathy, immmunoregulatory lipoproteins, inclusion body myositis, insulin-dependent diabetes (type 1), interstitial cystitis, juvenile arthritis, juvenile diabetes, Kawasaki syndrome, Lambert-Eaton syndrome, leukocytoclastic vasculitis, lichen planus, lichen sclerosus, ligneous conjunctivitis, linear IgA disease (LAD), Lupus (SLE), Lynie disease, Meniere's disease, microscopic polyangitis, mixed connective tissue disease (MCTD), Mooren's ulcer, Mucha-Habermann disease, multiple sclerosis, myasthenia gravis, myositis, narcolepsy, neuromyelitis optica (Devic's), neutropenia, ocular cicatricial pemphigoid, optic neuritis, palindromic rheumatism, PANDAS (Pediatric Autoimmune. Neuropsychiatric Disorders Associated with Streptococcus), paraneoplastic cerebellar degeneration, paroxysmal nocturnal hernoglobinuria (PNH), Parry Romberg syndrome, Parsonnage-Turner syndrome, pars plantis (peripheral uveitis), pemphigus, peripheral neuropathy, perivenous encephalomyelitis, pernicious anemia, POEMS syndrome, polyarteritis nodosa, type I, II & III autoimmune polyglandular syndromes, polymyalgia rheumatic, polymyositis, postmyocardial infarction syndrome, postpericardiotomy syndrome, progesterone dermatitis, primary biliary cirrhosis, primary sclerosing cholangitis, psoriasis, psoriatic arthritis, idiopathic pulmonary fibrosis, pyoderma gangrenosum, pure red cell aplasis, Raynaud's phenomena, reflex sympathetic dystrophy, Reiter's syndrome, relapsing polychondritis, restless legs syndrome, retroperitoneal fibrosis, rheumatic fever, rheumatoid arthritis, sarcoidosis, Schmidt syndrome, scleritis, scleroderma, Slogren's syndrome, sperm and testicular autoimmunity, stiff person syndrome, subacute bacterial endocarditis (SBE), sympathetic ophthalmia, Takayasu's arteritis, temporal arteritis/giant cell arteries, thrombocytopenic purpura (TPP), Tolosa-Hunt syndrome, transverse myelitis, ulcerative colitis, undifferentiated connective tissue disease (UCTD), uveitis, vasculitis, vesiculobullous dermatosis, vitiligo or Wegener's granulomatosis or , chronic active hepatitis, primary biliary cirrhosis, cadilated cardiomyopathy, myocarditis, autoimmune polyendocrine syndrome type I (APS-I), cystic fibrosis vasculitides, acquired hypoparathyroidism, coronary artery disease, pemphigus foliaceus, pemphigus vulgaris, Rasmussen encephalitis, autoimmune gastritis, insulin hypoglycemic syndrome (Hirata disease), Type B insulin resistance, acanthosis, systemic lupus erythematosus (SLE), pernicious anemia, treatment-resistant Lyme arthritis, polyneuropathy, demyelinating diseases, atopic dermatitis, autoimmune hypothyroidism, vitiligo, thyroid associated ophthalmopathy, autoimmune coeliac disease, ACTH deficiency, dermatomyositis, Sjogren syndrome, systemic sclerosis, progressive systemic sclerosis, morphea, primary antiphospholipid syndrome, chronic idiopathic urticaria, connective tissue syndromes, necrotizing and crescentic glomerulonephritis (NCGN), systemic vasculitis, Raynaud syndrome, chronic liver disease, visceral leishmaniasis, autoimmune C1 deficiency, membrane proliferative glomerulonephritis (MPGN), prolonged coagulation time, immunodeficiency, atherosclerosis, neuronopathy, paraneoplastic pemphigus, paraneoplastic stiff man syndrome, paraneoplastic encephalomyelitis, subacute autonomic neuropathy, cancer-associated retinopathy, paraneoplastic opsoclonus myoclonus ataxia, lower motor neuron syndrome and Lambert-Eaton myasthenic syndrome.
In some cases, a disease or a condition may comprise AIDS, anthrax, botulism, brucellosis, chancroid, chlamydial infection, cholera, coccidioidomycosis, cryptosporidiosis, cyclosporiasis, dipheheria, ehrlichiosis, arboviral encephalitis, enterohemorrhagic Escherichia coli, giardiasis, gonorrhea, dengue fever, haemophilus influenza, Hansen's disease (Leprosy), hantavirus pulmonary syndrome, hemolytic uremic syndrome, hepatitis A, hepatitis B, hepatitis C, human immunodeficiency virus, legionellosis, listeriosis, lyme disease, malaria, measles. Meningococcal disease, mumps, pertussis (whooping cough), plague, paralytic poliomyelitis, psittacosis, Q fever, rabies, rocky mountain spotted fever, rubella, congenital rubella syndrome (SARS), shigellosis, smallpox, streptococcal disease (invasive group A), streptococcal toxic shock syndrome, streptococcus pneumonia, syphilis, tetanus, toxic shock syndrome, trichinosis, tuberculosis, tularemia, typhoid fever, vancomycin intermediate resistant staphylocossus aureus, varicella, yellow fever, variant Creutzfeldt-Jakob disease (vCJD), Eblola hemorrhagic fever, Echinococcosis, Hendra virus infection, human monkeypox, influenza. A, H5N1, lassa fever, Margurg hemorrhagic fever, Nipah virus, O'nyong fever, Rift valley fever, Venezuelan equine encephalitis and West Nile virus.
In some cases, a disease or condition may comprise a cancer. In some cases, a cancer may comprise thyroid cancer, adrenal cortical cancer, anal cancer, aplastic anemia, bile duct cancer, bladder cancer, bone cancer, bone metastasis, central nervous system (CNS) cancers, peripheral nervous system (PNS) cancers, breast cancer, Castleman's disease, cervical cancer, childhood. Non-Hodgkin's lymphoma, lymphoma, colon and rectum cancer, endometrial cancer, esophagus cancer, Ewing's family of tumors (e.g. Ewing's sarcoma), eye cancer, gallbladder cancer, gastrointestinal carcinoid tumors, gastrointestinal stromal tumors, gestational trophoblastic disease, hairy cell leukemia, Hodgkin's disease, Kaposi's sarcoma, kidney cancer, laryngeal and hypopharyngeal cancer, acute lymphocytic leukemia, acute myeloid leukemia, children's leukemia, chronic lymphocytic leukemia, chronic myeloid leukemia, liver cancer, lung cancer, lung carcinoid tumors, Non-Hodgkin's lymphoma, male breast cancer, malignant mesothelioma, multiple myeloma, myelodysplastic syndrome, myeloproliferative disorders, nasal cavity and paranasal cancer, nasopharyngeal cancer, neuroblastoma, oral cavity and oropharyngeal cancer, osteosarcoma, ovarian cancer, pancreatic cancer, penile cancer, pituitary tumor, prostate cancer, retinoblastoma, rhabdomyosarcoma, salivary gland cancer, sarcoma (adult soft tissue cancer), melanoma skin cancer, non-melanoma skin cancer, stomach cancer, testicular cancer, thymus cancer, uterine cancer (e.g. uterine sarcoma), vaginal cancer, vulvar cancer, or Waldenstrom's macroglobulinaemia.
A condition or a disease, as disclosed herein, can include hyperproliferative disorders. Malignant hyperproliferative disorders can be stratified into risk groups, such as a low risk group and a medium-to-high risk group. Hyperproliferative disorders can include but may not be limited to cancers, hyperplasia, or neoplasia. In some cases, the hyperproliferative cancer can be breast cancer such as a ductal carcinoma in duct tissue of a mammary gland, medullary carcinomas, colloid carcinomas, tubular carcinomas, and inflammatory breast cancer; ovarian cancer, including epithelial ovarian tumors such as adenocarcinoma in the ovary and an adenocarcinoma. that has migrated from the ovary into the abdominal cavity; uterine cancer; cervical cancer such as adenocarcinoma in the cervix epithelial including squamous cell carcinoma and adenocarcinomas; prostate cancer, such as a prostate cancer selected from the following: an adenocarcinoma or an adenocarcinoma that has migrated to the bone; pancreatic cancer such as epithelioid carcinoma in the pancreatic duct tissue and an adenocarcinoma in a pancreatic duct; bladder cancer such as a transitional cell carcinoma in urinary bladder, urothelial carcinomas (transitional cell carcinomas), tumors in the urothelial cells that line the bladder, squamous cell carcinomas, adenocarcinomas, and small cell cancers; leukemia such as acute myeloid leukemia (AMU, acute lymphocytic leukemia, chronic lymphocytic leukemia, chronic myeloid leukemia, hairy cell leukemia, myelodysplasia, myeloproliferative disorders, acute myelogenous leukemia (AML), chronic myelogenous leukemia (CMI), mastocytosis, chronic lymphocytic leukemia (CLL), multiple myeloma (MM), and myelodysplastic syndrome (MDS); bone cancer; lung cancer such as non-small cell lung cancer (NSCLC), which may be divided into squamous cell carcinomas, adenocarcinomas, and large cell undifferentiated carcinomas, and small cell lung cancer; skin cancer such as basal cell carcinoma, melanoma, squamous cell carcinoma and actinic keratosis, which may be a skin condition that sometimes develops into squamous cell carcinoma; eye retinoblastoma; cutaneous or intraocular (eye) melanoma; primary liver cancer (cancer that begins in the liver); kidney cancer; autoimmune deficiency syndrome (AIDS)-related lymphoma such as diffuse large B-cell lymphoma, B-cell immunoblastic lymphoma and small non-cleaved cell lymphoma; Kaposi's Sarcoma; viral-induced cancers including hepatitis B virus (HM), hepatitis C virus (HCV), and hepatocellular carcinoma; human lymphotropic virus-type I (HBV) and adult T-cell leukemialymphoma; and human papilloma virus (HPV) and cervical cancer; central nervous system (CNS) cancers such as primary brain tumor, which includes gliomas (astrocytoma, anaplastic astrocytoma, or glioblastoma multiforme), oligodendrogliomas, ependymomas, meningiomas, lymphomas, schwannomas, and medulloblastomas; peripheral nervous system (PNS) cancers such as acoustic neuromas and malignant peripheral nerve sheath tumors (MPNST) including neurofibromas and schwannomas, malignant fibrous cytomas, malignant fibrous histiocytomas, malignant meningiomas, malignant mesotheliomas, and malignant mixed Müllerian tumors; oral cavity and oropharyngeal cancer such as hypopharyngeal cancer, laryngeal cancer, nasopharyngeal cancer, and oropharyngeal cancer; stomach cancer such as lymphomas, gastric stromal tumors, and carcinoid tumors; testicular cancer such as germ cell tumors (GCTs), which include seminomas and nonseminomas, and gonadal stromal tumors, which include Leydig cell tumors and Sertoli cell tumors; thymus cancer such as to thymomas, thymic carcinomas, Hodgkin disease, non-Hodgkin lymphomas carcinoids or carcinoid tumors; rectal cancer; and colon cancer. In some cases, the diseases stratified, classified, characterized, or diagnosed by the methods of the present disclosure include but may not be limited to thyroid disorders such as for example benign thyroid disorders including but not limited to follicular adenomas, Hurthle cell adenomas, lymphocytic thyroiditis, and thyroid hyperplasia. In some cases, the diseases stratified, classified, characterized, or diagnosed by the methods of the present disclosure include but may not be limited to malignant thyroid disorders such as for example follicular carcinomas, follicular variant of papillary thyroid carcinomas, medullary carcinomas, and papillary carcinomas.
Conditions or diseases of the present disclosure can include a genetic disorder. A genetic disorder may be an illness caused by abnormalities in genes or chromosomes. Genetic disorders can be grouped into two categories: single gene disorders and multifactorial and polygenic (complex) disorders. A single gene disorder can be the result of a single mutated gene. Inheriting a single gene disorder can include but not be limited to autosomal dominant, autosomal recessive, X-linked dominant, X-linked recessive, Y-linked and mitochondrial inheritance. Only one mutated copy of the gene can be necessary for a person to be affected by an autosomal dominant disorder. Examples of autosomal dominant type of disorder can include but may not be limited to Huntington's disease, Neurofibromatosis 1, Marfan Syndrome, Hereditary nonpolyposis colorectal cancer, or Hereditary multiple exostoses, In autosomal recessive disorders, two copies of the gene must be mutated for a subject to be affected by an autosomal recessive disorder. Examples of this type of disorder can include but may not be limited to cystic fibrosis, sickle-cell disease (also partial sickle-cell disease), Tay-Sachs disease, Niemann-Pick disease, or spinal muscular atrophy. X-linked dominant disorders are caused by mutations in genes on the X chromosome such as X-linked hypophosphatemic rickets, Some X-linked dominant conditions such as Rett syndrome, Incontinentia Pigmenti type 2 and Aicardi Syndrome can be fatal. X-linked recessive disorders are also caused by mutations in genes on the X chromosome, Examples of this type of disorder can include but are not limited to Hemophilia A, Duchenne muscular dystrophy, red-green color blindness, muscular dystrophy and Androgenetic alopecia. Y-linked disorders are caused by mutations on the Y chromosome, Examples can include but are riot limited to Male Infertility and hypertrichosis pinnae. The genetic disorder of mitochondrial inheritance, also known as maternal inheritance, can apply to genes in mitochondrial DNA such as in Leber's Hereditary Optic Neuropathy.
Genetic disorders may also be complex, multifactorial or polygenic. Polygenic genetic disorders can be associated with the effects of multiple genes in combination with lifestyle and environmental factors. Although complex genetic disorders can cluster in families, they do not have a clear-cut pattern of inheritance. Multifactorial or polygenic disorders can include heart disease, diabetes, asthma, autism, autoimmune diseases such as multiple sclerosis, cancers, ciliopathies, cleft palate, hypertension, inflammatory bowel disease, mental retardation or obesity.
Other genetic disorders can include but may not be limited to 1p36 deletion syndrome, 21-hydroxyla.se deficiency, 22q11.2 deletion syndrome, aceruloplasminemia, achondrogenesis, type achondroplasia, acute intermittent porphyria, adenylosuccinate lyase deficiency, Adrenoleukodystrophy, Alexander disease, alkaptonuria, alpha-I antitrypsin deficiency, Alstrom syndrome, Alzheimer's disease (type 1, 2, 3, and 4), Amelogenesis Imperfecta, amyotrophic lateral sclerosis, Amyotrophic lateral sclerosis type 2, Amyotrophic lateral sclerosis type 4, amyotrophic lateral sclerosis type 4, androgen insensitivity syndrome, Anemia, Angelman syndrome, Apert syndrome, ataxia-telangiectasia, Beare-Stevenson cutis gyrata syndrome, Benjamin syndrome, beta thalassemia, biotimidase deficiency, Birt-Hogg-Dube syndrome, bladder cancer, Bloom syndrome, Bone diseases, breast cancer, Camptomelic dysplasia, Canavan disease, Cancer, Celiac Disease, Chronic Granulomatous Disorder (CGD), Charcot-Marie-Tooth disease, Charcot-Marie-Tooth disease Type 1, Charcot-Marie-Tooth disease Type 4, Charcot-Marie-Tooth disease Type 2, Charcot-Marie-Tooth disease Type 4, Cockayne syndrome, Coffin-Lowry syndrome, collagenopathy types II and XI, Colorectal Cancer, Congenital absence of the vas deferens, congenital bilateral absence of vas deferens, congenital diabetes, congenital erythropoietic porphyria, Congenital heart disease, congenital hypothyroidism, Connective tissue disease, Cowden syndrome, Cri du chat syndrome, Crohn's disease, fibrostenosing, Crouzon syndrome, Crouzonodermoskeletal syndrome, cystic fibrosis, De Grouchy Syndrome, Degenerative nerve diseases, Dent's disease, developmental disabilities, DiGeorge syndrome, Distal spinal muscular atrophy type V, Down syndrome, Dwarfism, Ehlers-Danlos syndrome, Ehlers-Danlos syndrome arthrochalasia type, Ehlers-Danlos syndrome classical type, Ehlers-Danlos syndrome dermatosparaxis type, Ehlers-Danlos syndrome kyphoscoliosis type, vascular type, erythropoietic protoporphyria, Fabry's disease, Facial injuries and disorders, factor V Leiden thrombophilia, familial adenomatous polyposis, familial dysautonomia, fanconi anemia, FG syndrome, fragile X syndrome, Friedreich ataxia, Friedreich's ataxia, G6PD deficiency, galactosemia, Gaucher's disease (type 1, 2, and 3), Genetic brain disorders, Glycine encephalopathy, Haemochromatosis type 2, Haemochromatosis type 4, Harlequin Ichthyosis, Head and brain malformations, Hearing disorders and deafness, Hearing problems in children, hemochromatosis (neonatal, type 2 and type 3), hemophilia, hepatoerythropoietic porphyria, hereditary coproporphyria, Hereditary Multiple Exostoses, hereditary neuropathy with liability to pressure palsies, hereditary nonpolyposis colorectal cancer, homocystinuria, Huntington's disease, Hutchinson Gilford Progeria Syndrome, hyperoxaluria, primary, hyperphenylalaninemia, hypochondrogenesis, hypochondroplasia, idic15, incontinentia pigmenti, infantile Gaucher disease, infantile-onset ascending hereditary spastic paralysis, infertility, Jackson-Weiss syndrome, Joubert syndrome, Juvenile Primary Lateral Sclerosis, Kennedy disease, Klinefelter syndrome, Kniest dysplasia, Krabbe disease, Learning disability, Lesch-Nyhan syndrome, Leukodystrophies, Li-Fraumeni syndrome, lipoprotein lipase deficiency, familial, Male genital disorders, Marfan syndrome, McCune-Albright syndrome, McLeod syndrome, Mediterranean fever, familial, Menkes disease, Menkes syndrome, Metabolic disorders, methemoglobinemia beta-globin type, Methemoglobinemia congenital methaemoglobinaemia, methylmalonic acidemia, Micro syndrome, Microcephaly, Movement disorders, Mowat-Wilson syndrome, Mucopolysaccharidosis (MPS I), Muenke syndrome. Muscular dystrophy. Muscular dystrophy, Duchenne and Becker type, muscular dystrophy, Duchenne and Becker types, myotonic dystrophy, Myotonic dystrophy type 1 and type 2, Neonatal hemochromatosis, neurofibromatosis, neurofibromatosis 1, neurofibromatosis 2, Neurofibromatosis type I, neurofibromatosis type II, Neurologic diseases, Neuromuscular disorders, Niemann-Pick disease, Nonketotic hyperglycinemia, nonsyndromic deafness, Nonsyndromic deafness autosomal recessive, Noonan syndrome, osteogenesis imperfecta (type I and type III), otospondylomegaepiphyseal dysplasia, pantothenate kinase-associated neurodegeneration, Patau Syndrome (Trisomy 13), Pendred syndrome, Peutz-Jeghers syndrome, Pfeiffer syndrome, phenylketonuria, porphyria, porphyria cutanea tarda, Prader-Willi syndrome, primary pulmonary hypertension, prion disease, Progeria, propionic acidemia, protein C deficiency, protein S deficiency, pseudo-Gaucher disease, pseudoxanthoma elasticum, Retinal disorders, retinoblastoma, retinoblastoma FA—Friedreich ataxia, Rett syndrome, Rubinstein-Taybi syndrome, Sandhoff disease, sensory and autonomic neuropathy type III, sickle cell anemia, skeletal muscle regeneration, Skin pigmentation disorders, Smith Lemli Opitz Syndrome, Speech and communication disorders, spinal muscular atrophy, spinal-bulbar muscular atrophy, spinocerebellar ataxia, spondyloepimetaphyseal dysplasia, Strudwick type, spondyloepiphyseal dysplasia congenita, Stickler syndrome, Stickler syndrome COL2A1, Tay-Sachs disease, tetrahydrobiopierin deficiency, thanatophoric dysplasia, thiamine-responsive megaloblastic anemia with diabetes mellitus and sensorineural deafness, Thyroid disease, Tourette's Syndrome, Treacher Collins syndrome, triple X syndrome, tuberous sclerosis, Turner syndrome, Usher syndrome, variegate porphyria, von Hippel-Lindau disease, Waardenburg syndrome, Weissenbacher-Zweymüller syndrome, Wilson disease, Wolf-Hirschhorn syndrome, Xeroderma Pigmentosum, X-linked severe combined immunodeficiency, X-linked sideroblastic anemia, or X-linked spinal-bulbar muscle atrophy.
A kit may include a label, a substrate (such as a solid support), a control nucleic acid sequence, a container, an enzyme or fragment thereof, instructions for use, or any combination thereof. A control nucleic acid sequence may be associated with the substrate. A control nucleic acid sequence may not be associated with a substrate and the kit may include instructions for associating the control nucleic acid sequence with the substrate.
A kit may be a general kit for all tissue samples or disease types. A kit may be a specific kit for a specific tissue sample, such as a plasma sample, a blood sample, a serum sample, a buccal sample, or a urine sample. A kit may be a specific kit for a specific disease such as cancer.
A kit may provide periodic updates of a database of references or analysis software that compute a result of the method. A kit may provide software to automate one or more aspects of a method, such as a comparison to a reference to provide a result or to provide a summary of a result that may be be reported or displayed or downloaded by a medical professional and/or entered into a database. A result or a summary of results may include any of the results disclosed herein, including recommendations of treatment options for subject and a risk occurrence of a disease or condition.
A kit may provide a unit or device for obtaining a sample from a subject (e.g., a device with a needle coupled to an aspirator).
A kit may provide instructions for performing methods as disclosed herein, and include all necessary buffers and reagents for hybridizing, sequencing, amplifying, associating, extending, or combination thereof. A kit may include instructions for analyzing a result.
An informational material of a kit may comprise printed matter, e.g., a printed text, drawing, and/or photograph, e.g., a. label or printed sheet. An information material may comprise Braille, computer readable material, video recording, or audio recording. In some cases, the informational material of the kit may include contact information, e.g., a. physical address, email address, website, or telephone number, where a user of the kit can obtain substantive information about a compound described herein and/or its use in the methods described herein. Informational material may be provided in any combination of formats.
A kit may include a package, such as a fiber-based package, a cardboard package, or a polymeric package, such as a styrofoam box. A package may be configured so as to substantially maintain a temperature differential between an interior and an exterior. In some cases, it may provide insulating properties to keep one or more components of a kit at a preselected temperature for a preselected time. A kit may include one or more containers for a composition containing a compound(s) described herein. In some embodiments, a kit may contain separate containers (such as two separate containers for two components of a kit), dividers or compartments for one or more components, and informational material. For example, a kit component may be contained in a bottle, a vial, or a syringe, and informational material may be contained in a plastic sleeve or a packet. In other embodiments, separate components of a kit may be contained within a single, undivided container. For example, a kit component may be contained in a bottle, a vial or a syringe that has attached thereto the informational material in the form of a label. In some embodiments, a kit may include a plurality (e.g., a pack) of individual containers, each containing one or more unit dosage forms (e.g., a dosage form described herein) of a component described herein. For example, the kit may include a plurality of syringes, ampules, foil packets, or blister packs, each containing a single unit dose of a kit component described herein. Containers of a kit may be air tight, waterproof (e.g., impermeable to changes in moisture or evaporation), and/or light-tight. A kit may include a device suitable for administration of the component, e.g., a syringe, inhalant, pipette, forceps, measured spoon, dropper (e.g,, eye dropper), swab (e.g., a cotton swab or wooden swab), or any such delivery device. In a preferred embodiment, the device may be a medical implant device, e.g., packaged for surgical insertion.
A basic research business, a disease diagnostic business, a molecular profiling business, a pharmaceutical business, or any other business associated with patient healthcare may provide a kit for performing the methods described herein.
The present disclosure provides computer control systems that are programmed to implement methods of the disclosure.
The computer system 101 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 105, which can be a single core or multi core processor, or a plurality of processors for parallel processing. The computer system 101 can also include memory or memory location 110 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 115 (e.g., hard disk), communication interface 120 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 125, such as cache, other memory, data storage and/or electronic display adapters. The memory 110, storage unit 115, interface 120 and peripheral devices 125 can be in communication with the CPU 105 through a communication bus (solid lines), such as a motherboard. The storage unit 115 can be a data storage unit (or data repository) for storing data. The computer system 101 can be operatively coupled to a computer network (“network”) 130 with the aid of the communication interface 120. The network 130 can be the Internet, an internet and/or extranet, or an intranet and/or extranet that is in communication with the Internet. The network 130 in some cases is a telecommunication and/or data network. The network 130 can include one or more computer servers, which can enable distributed computing, such as cloud computing. The network 130, in some cases with the aid of the computer system 101, can implement a peer-to-peer network, which may enable devices coupled to the computer system 101 to behave as a client or a server.
The CPU 105 can execute a sequence of machine-readable instructions, which can be embodied in a program or software. The instructions may be stored in a memory location, such as the memory 110. The instructions can be directed to the CPU 105, which can subsequently program or otherwise configure the CPU 105 to implement methods of the present disclosure. Examples of operations performed by the CPU 105 can include fetch, decode, execute, and writeback.
The CPU 105 can be part of a circuit, such as an integrated circuit. One or more other components of the system 101 can be included in the circuit. In some cases, the circuit is an application specific integrated circuit (ASIC).
The storage unit 115 can store files, such as drivers, libraries and saved programs. The storage unit 115 can store user data, e.g., user preferences and user programs. The computer system 101 in some cases can include one or more additional data storage units that are external to the computer system 101, such as located on a remote server that is in communication with the computer system 101 through an intranet or the Internet.
The computer system 101 can communicate with one or more remote computer systems through the network 130. For instance, the computer system 101 can communicate with a remote computer system of a user. Examples of remote computer systems include personal computers (e.g., portable PC), slate or tablet PC's (e.g., Apple® iPad, Samsung® Galaxy Tab), telephones, Smart phones (e.g., Apple® iPhone, Android-enabled device, Blackberry®), or personal digital assistants. The user can access the computer system 1101 via the network 130.
Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 101, such as, for example, on the memory 110 or electronic storage unit 115. The machine executable or machine readable code can be provided in the form of software. During use, the code can be executed by the processor 105. In some cases, the code can be retrieved from the storage unit 115 and stored on the memory 110 for ready access by the processor 105. In some situations, the electronic storage unit 115 can be precluded, and machine-executable instructions are stored on memory 110.
The code can be pre-compiled and configured for use with a machine having a processer adapted to execute the code, or can be compiled during runtime. The code can be supplied in a programming language that can be selected to enable the code to execute in a pre-compiled or as-compiled fashion.
Aspects of the systems and methods provided herein, such as the computer system 101, can be embodied in programming. Various aspects of the technology may be thought of as “products” or “articles of manufacture” typically in the form of machine (or processor) executable code and/or associated data that is carried on or embodied in a type of machine readable medium. Machine-executable code can be stored on an electronic storage unit, such as memory (e.g., read-only memory, random-access memory, flash memory) or a hard disk. “Storage” type media can include any or all of the tangible memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide non-transitory storage at any time for the software programming. All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer into the computer platform of an application server. Thus, another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links. The physical elements that carry such waves, such as wired or wireless links, optical links or the like, also may be considered as media bearing the software. As used herein, unless restricted to non-transitory, tangible “storage” media, terms such as computer or machine “readable medium” refer to any medium that participates in providing instructions to a processor for execution.
Hence, a machine readable medium, such as computer-executable code, may take many forms, including but not limited to, a tangible storage medium, a carrier wave medium or physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings. Volatile storage media include dynamic memory, such as main memory of such a computer platform. Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
The computer system 101 can include or be in communication with an electronic display 135 that comprises a user interface (UI) 140 for providing, for example, one or more results (immediate results or archived results from a previous experiment), one or more user inputs, reference values from a library or database, or a combination thereof. Examples of UT's include, without limitation, a graphical user interface (GUI) and web-based user interface.
Methods and systems of the present disclosure can be implemented by way of one or more algorithms. An algorithm can be implemented by way of software upon execution by the central processing unit 105. The algorithm can, for example, determine optimized conditions via supervised learning to optimize conditions such as a buffer type, a buffer concentration, a temperature, an incubation period. Conditions may be optimized for an oligonucleotide fragment, such as an oligonucleotide fragment having a particular number of epigenetic modifications or a particular length of sequence.
An aspect of the present disclosure provides a method. In some cases, the method may comprise (a) associating a label with an epigenetically modified base of a nucleic acid sequence to form a labeled nucleic acid sequence; (b) hybridizing a substantially complementary strand to the labeled nucleic acid sequence; and (c) amplifying the substantially complementary strand in a reaction in which the labeled nucleic acid sequence may be substantially not present.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise (a) hybridizing a substantially complementary strand to a nucleic acid sequence comprising an epigenetically modified base; (b) associating a label with the epigenetically modified base of a nucleic acid sequence to form a labeled nucleic acid sequence; and (c) amplifying the substantially complementary strand in a reaction in which the labeled nucleic acid sequence may be substantially not present.
In some cases, the label may be associated with a substrate. In some cases, the substrate may comprise a bead. In some cases, the bead may be a magnetic bead. In some cases, the substrate may comprise an array.
In some cases, the substantially complimentary strand may be shorter in length than the labeled nucleic acid sequence. In some cases, the substantially complimentary strand may be elongated before the amplifying. In some cases, hybridizing may comprise hybridizing at least two substantially complementary strands to the labeled nucleic acid sequence. In some cases, the method may comprise ligating the at least two substantially complementary strands.
In some cases, the labeled nucleic acid sequence may comprise an adapter sequence. In some cases, hybridizing may comprise hybridizing at least a portion of the substantially complimentary strand to the adapter sequence. In some cases, the nucleic acid sequence may comprise a first barcode. In some cases, the nucleic acid sequence may comprise a second barcode. In some cases, the first barcode may be a unique barcode and the second barcode may be a sample barcode.
In some cases, the epigenetically modified base of the nucleic acid sequence may be a hydroxymethylated base (hmB). In some cases, the hmB may be 5-hydroxymethylated base (5-hmB). In some cases, the 5-hmB may be a 5-hydroxymethylated cytosine (5-hmC). In some cases, the epigenetically modified base of the nucleic acid sequence may comprise a methylated base, a hydroxymethylated base, a formylated base, or a carboxylic acid containing base or a salt thereof. In some cases, at least a portion of the nucleic acid sequence or the labeled nucleic acid sequence may be double-stranded. In some cases, the label may be associated with the epigenetically modified base by a single bond, a double bond, or a triple bond.
In some cases, the method may comprise separating the substantially complementary strand from the labeled nucleic acid sequence. In some cases, the nucleic acid sequence may comprise at least: from about 1 to about 3; from about 1 to about 5; from about 1 to about 10; from about 1 to about 15; or from about 1 to about 20 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, the nucleic acid sequence may comprise at least about: 1, 5, 10, 15 or 20 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, at least about: 70%, 75%, 80%, 85%, 90%, or 95% of bases of the substantially complementary strand may base pair with the labeled nucleic acid sequence. In some cases, the substantially complementary strand may hybridize to the nucleic acid sequence under stringent hybridization conditions.
In some cases, the nucleic acid sequence may comprise a cytosine guanine (CG) site, a cytosine phosphate guanine (CpG) island, or a combination thereof. In some cases, the nucleic acid sequence may comprise cell-free DNA. In some cases, the nucleic acid sequence may comprise a cDNA sequence. In some cases, the method may comprise sequencing an amplified product.
In some cases, the nucleic acid sequence may be from a sample. In some cases, the sample may be from a subject. In some cases, the subject may be a human. In some cases, the sample may comprise a buccal sample, a saliva sample, a blood sample, a plasma sample, a reproductive sample, a mucus sample, cerebral spinal fluid sample, a tissue sample, or any combination thereof.
In some cases, the method may comprise obtaining a result. In some cases, the method may comprise comparing the result to a reference. In some cases, the method may comprise communicating the result via a communication medium.
In some cases, the subject may be diagnosed with a condition. In some cases, the method may comprise diagnosing the subject as having a condition. In some cases, the method may comprise diagnosing the subject as having a likelihood of developing a condition. In some cases, the diagnosing may be based on the comparing the result to the reference. In some cases, the diagnosing may at least partially confirm a previous diagnosis. In some cases, the condition may be a cancer.
In some cases, the method may comprise selecting a treatment for the subject. In some cases, the method may comprise treating the subject. In some cases, the treating may comprise: surgery, chemotherapy, radiation therapy, immunotherapy, targeted therapy, hormone therapy, stem cell transplant, and precision medicine. In some cases, the method may comprise repeating the associating, the hybridizing and the amplifying at different time points.
In some cases, the subject may be a human. In some cases, the label may comprise a sugar. In some cases, the sugar may comprise a glucose. In some cases, the glucose may be modified.
In some cases, the label may be associated with the epigenetically modified base with the assistance of an enzyme. In some cases, the enzyme may be selective for a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selectively associated with the epigenetically modified base at a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selective for a portion of the nucleic acid sequence. In some cases, the portion may be double-stranded.
In some cases, the substantially complementary strand may be substantially free of an epigenetically modified base. In some cases, a substantially complementary strand may be free of an epigenetically modified base. In some cases, the amplifying may result in a plurality of nucleic acid strands. In some cases, less than about 2% of the plurality of nucleic acid strands may comprise an epigenetically modified base. In some cases, the nucleic acid sequence may comprise a plurality of epigenetically modified bases. In some cases, the substantially complementary strand may comprise less than about 2% of the plurality of epigenetically modified bases. In some case, the substantially complementary strand may comprise an epigenetically modified base.
Another aspect of the present disclosure provides a kit. In some cases, the kit may comprise: instructions for use; a container; a label configured to (i) associate with an epigenetically modified nucleic acid sequence and to (ii) associate with a substrate; a control nucleic acid sequence associated with a substrate and a substrate configured to associate with the label.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise (a) associating a label with an epigenetically modified base of a nucleic acid sequence to form a labeled nucleic acid sequence; (b) hybridizing a substantially complementary strand to the labeled nucleic acid sequence; and (c) amplifying the substantially complementary strand in a reaction in which the labeled nucleic acid sequence is substantially not present.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise (a) hybridizing a substantially complementary strand to a nucleic acid sequence comprising an epigenetically modified base; (b) associating a label with the epigenetically modified base of a nucleic acid sequence to form a labeled nucleic acid sequence; and (c) amplifying the substantially complementary strand in a reaction in which the labeled nucleic acid sequence is substantially not present.
In some cases, the label may be associated with a substrate. In some cases, the substrate may comprise a bead. In some cases, the bead may be a magnetic bead. In some cases, the substrate may comprise an array.
In some cases, the substantially complimentary strand may be shorter in length than the labeled nucleic acid sequence. In some cases, the substantially complimentary strand may be elongated before the amplifying. In some cases, the hybridizing may comprise hybridizing at least two substantially complementary strands to the labeled nucleic acid sequence. In some cases, the method may comprise ligating the at least two substantially complementary strands. In some cases, the labeled nucleic acid sequence may comprise an adapter sequence. In some cases, the hybridizing may comprise hybridizing at least a portion of the substantially complimentary strand to the adapter sequence.
In some cases, the nucleic acid sequence may comprise a first barcode. In some cases, the nucleic acid sequence may comprise a second barcode. In some cases, the first barcode may be a unique barcode and the second barcode is a sample barcode.
In some cases, the epigenetically modified base of the nucleic acid sequence may be a hydroxymethylated base (hmB). In some cases, the hmB may be 5-hydroxymethylated base (5-hmB). In some cases, the 5-hmB may be a 5-hydroxymethylated cytosine (5-hmC). In some cases, the epigenetically modified base of the nucleic acid sequence may comprise a methylated base, a hydroxymethylated base, a formylated base, or a carboxylic acid containing base or a salt thereof. In some cases, at least a portion of the nucleic acid sequence or the labeled nucleic acid sequence may be double-stranded. In some cases, the label may be associated with the epigenetically modified base by a single bond, a double bond, or a triple bond. In some cases, the method may comprise separating the substantially complementary strand from the labeled nucleic acid sequence.
In some cases, the nucleic acid sequence may comprise at least: from about 1 to about 3; from about 1 to about 5; from about 1 to about 10; from about 1 to about 15; or from about 1 to about 20 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, the nucleic acid sequence may comprise at least about: 1, 5, 10, 15 or 20 epigenetically modified bases per at least about 20 bases of the nucleic acid sequence. In some cases, at least about: 70%, 75%, 80%, 85%, 90%, or 95% of bases of the substantially complementary strand may base pair with the labeled nucleic acid sequence. In some cases, the substantially complementary strand may hybridize to the nucleic acid sequence under stringent hybridization conditions.
In some cases, the nucleic acid sequence may comprise a cytosine guanine (CG) site, a cytosine phosphate guanine (CpG) island, or a combination thereof. In some cases, the nucleic acid sequence may comprise cell-free DNA. In some cases, the nucleic acid sequence may comprise a cDNA sequence. In some cases, the method may comprise sequencing an amplified product.
In some cases, the nucleic acid sequence may be from a sample. In some cases, the sample may be from a subject. In some cases, the subject may be a human. In some cases, the sample may comprise a buccal sample, a saliva sample, a blood sample, a plasma sample, a reproductive sample, a mucus sample, cerebral spinal fluid sample, a tissue sample, or any combination thereof.
In some cases, the method may comprise obtaining a result. In some cases, the method may comprise comparing the result to a reference. In some cases, the method may comprise communicating the result via a communication medium.
In some cases, the subject may be diagnosed with a condition. In some cases, the method may comprise diagnosing the subject as having a condition. In some cases, the method may comprise comprising diagnosing the subject as having a likelihood of developing a condition. In some cases, the diagnosing may be based on the comparing the result to the reference. In some cases, the diagnosing at least partially may confirm a previous diagnosis. In some cases, the condition may be a cancer.
In some cases, the method may comprise selecting a treatment for the subject. In some cases, the method may comprise treating the subject. In some cases, the treating may comprise: surgery, chemotherapy, radiation therapy, immunotherapy, targeted therapy, hormone therapy, stem cell transplant, and precision medicine. In some cases, the method may comprise repeating the associating, the hybridizing and the amplifying at different time points. In some cases, the subject may be a human.
In some cases, the label may comprise a sugar. In some cases, the sugar may comprise a glucose. In some cases, the glucose may be modified. In some cases, the label may be associated with the epigenetically modified base with the assistance of an enzyme. In some cases, the enzyme can be selective for a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selectively associated with the epigenetically modified base at a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selective for a portion of the nucleic acid sequence. In some cases, the portion may be double-stranded.
Another aspect of the present disclosure provides a kit. In some cases, the kit may comprise: instructions for use; a container; a label configured to (i) associate with an epigenetically modified nucleic acid sequence and to (ii) associate with a substrate; a control nucleic acid sequence associated with a substrate and a substrate configured to associate with the label.
In some cases, the substantially complementary strand may be substantially free of an epigenetically modified base. In some cases, the substantially complementary strand may be free of an epigenetically modified base. In some cases, the amplifying may result in a plurality of nucleic acid strands, wherein less than about 2% of the plurality of nucleic acid strands may comprise an epigenetically modified base. In some cases, the nucleic acid sequence may comprise a plurality of epigenetically modified bases, and wherein the substantially complementary strand may comprise less than about 2% of the plurality of epigenetically modified bases. In some cases, the substantially complementary strand may comprise an epigenetically modified base.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise: detecting a presence of a plurality of epigenetically modified residues in a nucleic acid sequence, wherein the plurality of epigenetically modified residues comprises at least 2 epigenetically modified residues, and wherein a sensitivity of detection remains substantially constant with an increasing number of epigenetically modified residues in the plurality of epigenetically modified residues.
In some cases, the at least 2 epigenetically modified residues may be at least 4 epigenetically modified residues. In some cases, the sensitivity of detection may comprise detecting a presence of at least about 90% of the plurality of epigenetically modified residues. In some cases, the sensitivity of detection may comprise detecting a presence of each epigenetically modified residue of the plurality of epigenetically modified residues.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise: enriching a nucleic acid sequence, wherein the nucleic acid sequence comprises (i) a plurality of epigenetically modified residues and (ii) a sequence length, wherein the plurality of epigenetically modified residues comprises at least 2 epigenetically modified residues, wherein the enriching comprises at least 4 cycles of amplification and produces a plurality of sequence reads, and wherein about 90% of the plurality of sequence reads retain at least about 90% of the sequence length.
In some cases, the at least 2 epigenetically modified residues may be at least 4 epigenetically modified residues. In some cases, the at least 4 cycles of amplification may be at least 8 cycles of amplification.
In some cases, the nucleic acid sequence may comprise cell-free DNA. In some cases, the nucleic acid sequence may comprise a cDNA sequence.
In some cases, an epigenetically modified residue of the plurality of epigenetically modified residues may be a hydroxymethylated base (hmB). In some cases, the hmB may be 5-hydromethylated base (5-hmB). In some cases, the 5-hmB may be a 5-hydroxymethylated cytosine (5-hmC). In some cases, an epigenetically modified residue of the plurality of epigenetically modified residues may comprise a methylated base, a hydroxymethylated base, a formylated base, or a carboxylic acid containing base or a salt thereof.
In some cases, at least a portion of the nucleic acid sequence may be double-stranded. In some cases, the nucleic acid sequence may comprise a cytosine guanine (CG) site, a cytosine phosphate guanine (CpG) island, or a combination thereof.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise: enriching a nucleic acid sequence comprising a plurality of epigenetically modified residues to produce a plurality of sequence reads, wherein at least about 90% of the plurality of sequencing reads produced from the enriching are from about 1% to about 50% of a genome.
In some cases, the at least about 90% of the plurality of sequencing reads produced may be from about 1% to about 20% of the genome. In some cases, a length of the plurality of sequencing reads may be at least about 10 basepairs. In some cases, the plurality of epigenetically modified residues may be at least about 2 epigenetically modified residues. In some cases, the plurality of epigenetically modified residues may be at least about 6 epigenetically modified residues.
In some cases, a label may be associated with an epigenetically modified residue of the plurality of epigenetically modified residues. In some cases, the label may be associated with the epigenetically modified residue by a single bond, a double bond, or a triple bond.
In some cases, the nucleic acid sequence may comprise at least: from about 1 to about 3; from about 1 to about 5; from about 1 to about 10; from about 1 to about 15; or from about 1 to about 20 epigenetically modified residues per at least about 20 bases of the nucleic acid sequence. In some cases, the nucleic acid sequence may comprise at least about: 1, 5, 10, 15 or 20 epigenetically modified residues per at least about 20 bases of the nucleic acid sequence.
In some cases, the nucleic acid sequence may comprise cell-free DNA. In some cases, the nucleic acid sequence may comprise a cDNA sequence. In some cases, the nucleic acid sequence may be from a sample. In some cases, the sample may be obtained from a subject. In some cases, the subject may be a human. In some cases, the sample may comprise a buccal sample, a saliva sample, a blood sample, a plasma sample, a reproductive sample, a mucus sample, cerebral spinal fluid sample, a tissue sample, or any combination thereof.
In some cases, the method may further comprise obtaining a result. In some cases, the method may further comprise comparing the result to a reference. In some cases, the method may further comprise communicating the result via a communication medium.
In some cases, the subject may be diagnosed with a condition. In some cases, the method may further comprise diagnosing the subject as having a condition. In some cases, the method may further comprise diagnosing the subject as having a likelihood of developing a condition. In some cases, the diagnosing may be based on the comparing the result to the reference. In some cases, the diagnosing at least partially may confirm a previous diagnosis. In some cases, the condition may be a cancer.
In some cases, the method may further comprise selecting a treatment for the subject. In some cases, the method may further comprise treating the subject. In some cases, the treating may comprise: surgery, chemotherapy, radiation therapy, immunotherapy, targeted therapy, hormone therapy, stem cell transplant, and precision medicine.
In some cases, the label may comprise a sugar. In some cases, the sugar may comprise a glucose. In some cases, the glucose may be modified. In some cases, the label may be associated with the epigenetically modified residue with assistance of an enzyme. In some cases, the enzyme may be selective for a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selectively associated with the epigenetically modified residue at a portion of the nucleic acid sequence that is double-stranded. In some cases, the label may be selective for a portion of the nucleic acid sequence. In some cases, the portion may be double-stranded.
Another aspect of the present disclosure provides a method. In some cases, the method may comprise: assaying the cell-free sample by next generation sequencing to identify a nucleic acid sequence, wherein a presence of a 5-hydroxymethylcytosine (5-hmC) in the nucleic acid sequence identifies the cell-free sample as malignant for the cancer. In some cases, the cell-free sample may be obtained from a subject having or suspected of having said cancer. In some cases, the method may further comprise selecting a treatment for the subject based on the presence of the 5-hmC. In some cases, the presence of the 5-hmC may comprise a level of 5-hmC in the cell-free sample. In some cases, the nucleic acid sequence may comprise a cytosine guanine (CG) site, a cytosine phosphate guanine (CpG) island, or a combination thereof. In some cases, the method may further comprise obtaining a result based on the presence of the 5-hmC. In some cases, the method may further comprise communicating the result via a communication medium. In some cases, a label may be associated with an epigenetically modified base of the nucleic acid sequence.
A subject may be suspected of having a cancer. A sample comprising a nucleic acid sequence may be obtained from the subject by at least one of: a plasma sample, a serum sample, a blood sample, a urine sample, a buccal sample. The nucleic acid sequence may be isolated from the sample. Epigenetic modifications present on the nucleic acid sequence may be labeled with UDP-6-N3-Glu employing T4 Phage beta-glucosyltransferase (T4-BGT) and/or with click chemistry. A substantially complementary strand may be hybridized to a. portion of the nucleic acid sequence comprising the epigenetic modifications. The nucleic acid sequence may be contacted with a substrate such that the labelled nucleic acid sequence comprising the epigenetic modifications may be bound to the substrate. The substrate may be washed with a washing buffer. The substantially complementary strand may be separated from the nucleic acid sequence that may be associated with the substrate. The substantially complementary strand may he amplified in the absence of the nucleic acid sequence. The subject may he diagnosed as having the cancer when a presence of an epigenetic modification associated with the cancer may be confirmed present in the sample obtained from the subject.
Cell free DNA (cfDNA) may be extracted from plasma following Bioo Scientific's NextPrep-Mag cfDNA Isolation Kit instructions. When using formalin-fixed, paraffin-embedded (FFPE) genomic DNA (gDNA) extraction an appropriate kit may be chosen and a kit protocol followed. Genomic DNA (gDNA) from tissue may be fragmented. An appropriate amount of gDNA (100 ng ˜2000 ng) may be diluted in low-TE buffer and may be sheared to 150 basepair (bp) with Covaris in micro TUBE-50. DNA may be quantified by Qubit and QC by Bioanalyser to check size distribution. Controls may be spiked-in at 0.1% w/w (10 pg of each control for 10 nanogram (ng) fragmented gDNA or cfDNA)
Follow KAPA Hyper Prep Kit.
Each end repair and A-tailing reaction may be assembled in a tube.
The tube may be vortexed, may be spun down, and may be returned to ice. The tube may be incubated in a thermocycler programmed as outlined below:
DNA barcoded adapters from Bioo Scientific (NEXTflex DNA Barcodes-24) may be used for ligation. Adaptors may be diluted according to the amount of DNA used: 0.3 μM for 1 ng; 3 μM for 10 ng; 15 μM for 100 ng. In a same tube(s) in which end repair and A-tailing may be performed, each adapter ligation reaction may be assembled as follows:
The tube may be mixed thoroughly and may be centrifuged briefly. The tube may be incubated at 20° C. for 45 minutes (min).
In the same tube(s), a 0.8X bead-based cleanup may be performed by combining the following:
The tube(s) may be mixed thoroughly by vortexing and may be incubated at room temperature for 10 minutes so that DNA may bind to the beads. The tube(s) may be placed on a magnet (such that the beads may be captured) until the liquid may be clear. The supernatant may be removed and discarded. The residual fluid at the bottom of the tube may be collected by popping the spin tube and then returning the tube to magnet for a few seconds, removing and discarding the residual fluid. The tube(s) may be kept on the magnet and 200 μL of 80% ethanol may be added. The tube(s) may be incubated on the magnet at room temperature for ˜30 seconds. The ethanol may be removed and discarded. The tube(s) may be kept on the magnet and 200 μL of 80% ethanol may be added. The tube(s) may be incubated on the magnet at room temperature for approximately 30 seconds. The ethanol may be removed and discarded. The beads may be dried at room temperature for 3-5 minutes (or until all the ethanol has evaporated). The AMPure beads may be thoroughly resuspended in 20 μL of H2O. The tube(s) may be incubated at room temperature for 5 minutes to elute DNA off the beads. The tube(s) may be placed on a magnet to capture the beads and may be incubated until the liquid may be clear. The clear supernatant may be transferred to a new tube(s). One μL may be kept for Input Library Amplification. 18 μL may be transferred to fresh tube(s) and may continue to the 5-hmC labelling reaction and click chemistry.
One μl of pre pull-down library disclosed herein may be mixed with 9 μl of 10 mM Tris-HCl (pH8.0), 2 μl of this 10 times diluted library may be used for amplification in 50 μL using KAPA HiFi Hotstart DNA Polymerase.
Assemble each library amplification reaction may be as follows:
A tube may be mixed thoroughly and may be centrifuged briefly. The following cycling protocol may be utilized for amplification:
50 μL AMPure beads may be added to the 50 μL of PCR reaction from paragraph [00259], and may be vortexed to mix, and may be incubate at room temperature for 10 minutes (min). The tube(s) may be placed on a magnet to capture the beads until the liquid may be clear. The supernatant may be removed and discarded.
The tube(s) may be kept on the magnet and 200 μL of 80% ethanol may be added. The tube(s) may be incubated on the magnet at room temperature for approximately 30 seconds. The ethanol may be removed and discarded. The tube(s) may be kept on the magnet and 200 μL of 80% ethanol may be added. The tube(s) may be incubated on the magnet at room temperature for approximately 30 seconds. The ethanol may be removed and discarded. The beads may be dried at room temperature for 3-5 minutes (or until all the ethanol has evaporated). The AMPure beads may be thoroughly resuspended in 10 μL of 10 mM Tris-HCl (pH 8.0). The tube(s) may be incubated at room temperature for 5 minutes to elute DNA off the beads. The tube(s) may be placed on a magnet to capture the beads and incubated until the liquid may be clear. The clear supernatant may be transferred to a new tube(s). The samples may be stored at −20° C. Qubit and Bioanalyser may be utilized to quantify the library.
5-hmC Labelling Reaction and Click Chemistry.
Each labelling reaction may be assembled in a tube as follows.
A tube may be mixed thoroughly and centrifuged briefly and incubated at 37° C. for 30 minutes in a Thermocycler. One μL of 20 mM DBCO-PEG4-Biotin may be added to each tube at the end of reaction. The tube may be mixed thoroughly and centrifuged briefly, and incubated for 2 hours (hrs) at 37° C. in a Thermocycler. One μL of 10 mg/ml Salmon Sperm DNA may be added to the reaction mixture.
The Micro Bio-Spin P30 column may be inverted several times to resuspend the settled gel and remove any bubbles. The tip may be snapped off and placed the column in a 2.0 ml collection tube. The top cap may be removed. The excess packing buffer may be allowed to drain by gravity to the top of the gel bed (about two minutes). The drained buffer may be discarded and then the column may be placed back into the 2.0 ml tube. Centrifugation may occur for 2 minutes at 1,000 g to remove the remaining packing buffer. The buffer may be discarded. 500 μL of Bead Blocking Buffer 1 (BBB1) buffer may be applied to the column and may be centrifuged 2 minutes at 1000 g, and buffer from collection tube may be discarded. Another 500 μL of BBB1 buffer may be applied to the column, and the buffer may be allowed to drain by gravity for about 2 minutes, and the buffer may be discarded in collection tube, and then may centrifuged 2 minutes at 1000 g, and the buffer may be discarded from collection tube. The column may be placed in a new 1.5 ml DNA LowBind eppendorf tube. The reaction mixture from paragraph [00263] may be loaded to the centre of the column, and may be centrifuged for 4 minutes at 1000 g to collect DNA (˜40 μL).
M-270 Streptavidin beads may be blocked with Salmon Sperm DNA: The beads in the vial may be vortexed to be resuspended and the required volume (x=20% extra of number of samples processed) of beads may be transferred to a 1.5 ml eppendorf tube. The beads may be washed with 500 μL BBB1 buffer and resuspended. The tube may be placed on a magnet for 1 minute and the supernatant may be discarded. Washing of beads may be repeated twice, for a total of 3 washes. The beads may be resuspended in 500 μl of 100 μg/m1 Salmon Sperm DNA in BBB1 buffer (add 5 μl of 10 mg/ml Salmon Sperm DNA to 495 μl BBB1 buffer) and may be incubated for 30 minutes at room temperature with mixing. Washing of beads may be repeated twice. Residual fluid at bottom of tube may be collected, then the tube may be returned to magnet for a few seconds. The residual buffer may be removed and discarded. Beads in the same volume of BBB1 buffer may be resuspended as the initial volume of magnetic beads taken from the vial. One μL blocked M-270 Streptavidin beads may be added to reaction mixture from paragraph [00264], and may be incubated at 1300 rpm at room temperature for 30 minutes in a Thermomixer. A tube may be placed on magnetic stand for approximately 1 minute, and the supernatant may be removed and discarded.
200 μL BBB1 may be added to tube(s) from paragraph [00265] and may be incubated at room temperature for 5 minutes at 1500 rotations per minute (rpm) in a Thermomixer. The tube may be placed on a magnetic stand for approximately 2 minutes to remove and discard supernatant. The wash may be repeated twice. 200 μL BBB2 may be added and the tube may be incubated at room temperature for 5 minutes at 1500 rpm in a Thermomixer. The tube may be placed on a magnetic stand for approximately 2 minutes to remove and discard supernatant. The wash may be repeated twice. 200 μL BBB3 may be added and the tube may be incubated at room temperature for 5 minutes at 1500 rpm in a Thermomixer. The tube may be placed on magnetic stand for approximately 2 minutes to remove and discard supernatant. The wash may be repeated twice. 200 μL BBB4 may be added and the tube may be incubated at 55° C. for 5 minutes at 1500 rpm in a Thermomixer. The tube may be placed on a magnetic stand for approximately 2 minutes to remove and discard supernatant. The wash may be repeated twice. A final wash using 50 μL of H2O may be performed without disturbing the beads. Pop spin may be performed to remove residual buffer. The beads may be resuspended in 20 μL of H2O, and the bead suspension may be transferred to 0.2 ml PCR strip tubes.
Each library amplification reaction may be assembled as follows:
A tube may be mixed thoroughly and centrifuged briefly and amplified using the following cycling protocol:
The tube(s) may be placed on a magnet to capture the beads until the liquid may be clear. The supernatant may be transferred to fresh tube (s). To the 50 μL of PCR reaction, 50 μL of AMPure beads may be added, and may be vortexed to mix and may be incubated for 10 minutes at room temperature. The tube may be centrifuged briefly and beads may be precipitated using a magnetic rack for 5 minutes at room temperature. Supernatant may be carefully removed and may be washed twice with 200 μL of 80% ethanol without disturbing the beads. The beads may be left on the magnetic rack, with the lids open, until dry (5-10 minutes). 10 μL of 10 mM Tris-HCl (pH 8.0) may be added to the beads and may be incubated at room temperature for 5 minutes. The supernatant may be collected in a fresh tube. Qubit and Bioanalyser may be employed to quantify the library.
10× Glucosylation Buffer (10× β-GT Buffer) (1 mL):
BBB1: Bead Blocking Buffer 1 (100 mL)
BBB2: Bead Blocking Buffer 2 (100 ml):
BBB3: Bead Blocking Buffer 3 (100 ml):
BBB4: Bead Blocking Buffer 4 (100 ml):
Shearing of Genomic DNA (gDNA) to 150 bp
Genomic DNA (gDNA) may be diluted with low-TE buffer and may be sheared to 150 basepair (bp) with a Covaris in a micro TUBE-50. In some cases, purified cell-free DNA (cfDNA) can be used without the shearing.
Library preparation may follow KAPA Hyper Prep kit protocol using DNA barcoded adapters (24-plex) from Bioo Scientific. The adaptors may be diluted based on the input DNA used: 0.3 μM for 1 ng; 3 μM for 10 ng; 15 μM for 100 ng. For the final purification of the DNA, elution may occur in 12 ul of water. In some cases, a spike-in control may be employed. In such cases, a spike in the controls may occur at 0.1% weight by weight (w/w) before library preparation (10 picogram (pg) of each control 5-C, 5-mC, and 5-hmC; 5-hmC control can contain either 2 or 6 5-hmC).
a. Anneal NEXTflex primer to adapted DNA.
b. Klenow extension
a. Labelling of 5-hmC
b. Click reaction
Buffer exchange may occur in a Micro spin P30 column with 500 μl of bead blocking buffer 1 (BBB1) twice (spin for 2 minutes each time at 1000 g). The reaction mixture from b. above may be loaded to the centre of column, and may be centrifuged for 4 minutes (mins) at 1000 g to collect the DNA in a DNA low-bind eppendorf tube.
The M270 streptavidin beads may be prepared. A) 10 μl of M-270 may be washed in 500 μl BBB1 twice. B) The beads may be blocked in 500 μlof BBB1 containing 100 μg/ml Salmon Sperm DNA at room temperature for 30 minutes on a rotator. C) The beads may be washed twice in 500 μl of BBB1. D) The beads may be resuspended in 10 μl of BBB1. One μl of the blocked M-270 Streptavidin beads may be added to the purified DNA from paragraph [00277] and may be incubated in a Thermomixer at 22° C. for 30 minutes at 1300 rotations per minute (rpm).
The beads may be washed in 200 μl bead blocking buffer 1 (BBB1) in a Thermomixer at 1500 rpm for 5 minutes at 22° C. and the washing may be repeated for a total of 3 washes. Then, the beads may be washed in 200 μl BBB2 in a Thermomixer at 1500 rpm for 5 minutes at 22° C. and the washing may be repeated for a total of 3 washes. Next, the beads may be washed in 200 μl BBB3 in a Thermomixer at 1500 rpm for 5 minutes at 22° C. and the washing may be repeated for a total of 3 washes. Then, the beads may be washed in 200 μl BBB4 in a Thermomixer at 1500 rpm for 5 minutes at 55° C. and the washing may be repeated for a total of 3 washes. A final wash with 50 μl of H2O may occur without mixing. The water wash may be removed. 20 μl 0.1N NaOH may be added to the beads, and the beads may be resuspended and incubated in a Thermomixer at 1300 rpm for 10 minutes at 22° C. Using a magnet, the NaOH supernatant may be removed from the beads and pipetted into a new tube. Immediately the NaOH supernatant may be neutralized with 10 μl 0.2 M Tris pH 7.0. Continue to library enrichment by PCR.
a. Set up PCR
PCR Programme
b. The amplified products may be purified using 1X AMPure XP beads and the DNA may be eluted in 10 μl of 10 mM Tris-HCl pH 8.0.
BBB1: Bead Blocking Buffer 1 (100 ml):
BBB2: Bead Blocking Buffer 2 (100ml):
BBB3: Bead Blocking Buffer 3 (100 ml):
BBB4: Bead Blocking Buffer 4 (100 ml):
In some cases, as shown in
In contrast,
In some cases, less than about: 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 25%, or 30% of the genome may be covered when the CLE method is used. In some cases, from about 1% to about 30% of the genome may be covered when the CLE method is used. In some cases, from about 1% to about 20% of the genome may be covered when the CLE method is used. In some cases, from about 1% to about 15% of the genome may be covered when the CLE method is used. In some cases, from about 1% to about 10% of the genome may be covered when the CLE method is used. In some cases, from about 5% to about 25% of the genome may be covered when the CLE method is used.
In some cases, more than about: 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, or 90% of the genome may be covered when the HMCP method is used. In some cases, from about 50% to about 85% of the genome may be covered when the HMCP method is used. In some cases, from about 50% to about 80% of the genome may be covered when the HMCP method is used. In some cases, from about 50% to about 70% of the genome may be covered when the HMCP method is used. In some cases, from about 60% to about 80% of the genome may be covered when the HMCP method is used. In some cases, from about 60% to about 90% of the genome may be covered when the HMCP method is used.
In
An MDS plot may be a way of visualizing similarity between samples in a dataset, in a two dimensional space. Each point may represent one sample, which may be labelled by its clinical identifier and the method used (for example, HMCP or CLE method). The plot may be based on the RPKM ratio of pulldown:input of the top varying features. An euclidean distance may be calculated between samples, based on the variation in the data, represented as a distance matrix. This approach may be used to create coordinates of the points on the MDS plot. In some cases, two points that may be close to each other on an MDS plot may be more closely related in their RPKM enrichment profile than two points that may be distant each other.
While preferred embodiments have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the disclosure be limited by the specific examples provided within the specification. While the disclosure has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Furthermore, it shall be understood that all aspects described herein are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments described herein may be employed. It is therefore contemplated that the disclosure shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the disclosure and that methods and structures within the scope of these claims and their equivalents be covered thereby.
This application claims the benefit of U.S. provisional application 62/507,035 filed on May 16, 2017, and. U.S. Provisional Patent Application No. 62/638, 528 filed on Mar. 5, 2018, which are herein incorporated by reference in their entireties.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2018/000680 | 5/15/2018 | WO |
Number | Date | Country | |
---|---|---|---|
62638528 | Mar 2018 | US | |
62507035 | May 2017 | US |