Throughout this application various publications, patents, and/or patent applications are referenced. The disclosures of the publications, patents and/or patent applications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this invention pertains.
The present teachings pertain to chemically modified oligonucleotide sequence primer compositions and methods for sequencing DNA and fragment analysis. The teachings also relate to compositions for preparing, fragment analysis and sequencing of nucleic acids such as cDNA and DNA. In particular, methods, compositions, systems, apparatuses and kits for amplifying one or more target sequences within a sample containing a plurality of target sequences are described. Optionally, a plurality of target sequences, for example at least 10, 50, 100, 500, or 1000, are amplified within a single amplification reaction. In some embodiments, the disclosure relates generally to methods, compositions, systems, apparatuses and kits for amplifying one or more target sequences from a single source, such as genomic DNA or formalin-fixed paraffin-embedded (FFPE) DNA. Methods of amplification and sequencing and compositions and kits thereof are described in this disclosure which can be used to verify the nucleotide sequence of the one or more target sequences from a single source by Sanger sequencing, in combination with/consecutively or after, sequencing by NGS methods.
In one aspect of the invention, a method for sequencing at least one amplicon is provided which includes the steps of: providing at least one amplicon, wherein the at least one amplicon comprises a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying the at least one amplicon in a first reaction mixture which includes a plurality of nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture comprising a nuclease and at least one chemically-enhanced primer causing the plurality of nuclease sensitive amplification primers to be degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the at least one chemically-enhanced primer in a sequencing reaction; and producing extension products of the at least one chemically enhanced primer. In some embodiments, the extension products may be fluorescently labeled. In various embodiments, the first priming sequence may have been used to produce the amplicon. In various embodiments, the first priming sequence may include at least one cleavable moiety. In some embodiments, the preceding sequence may be a portion of the first priming sequence. In various embodiments of the method, the steps of contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed in the same reaction vessel. In various embodiments, the steps of amplifying the at least one amplicon, contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed without intermediate purification steps.
In various embodiments of the method, the at least one amplicon further includes a succeeding sequence 3′ to the sequence of interest wherein the succeeding sequence is complementary to a second priming sequence used to produce the at least one amplicon. The at least one amplicon may have a length of about 100 nucleotides to about 400 nucleotides. The sequence of interest of the at least one amplicon may have a length of about 100 nucleotides to about 300 nucleotides. In other embodiments, the sequence of interest of the at least one amplicon may have a length of about 125 nucleotides to about 275 or about 250 nucleotides. The at least one amplicon may be a plurality of amplicons. In some embodiments, the plurality of amplicons may include at least two different amplicons, a first having a sequence of interest that is a major variant sequence and a second amplicon having a minor variant sequence from the same region of a sample nucleic acid.
In some embodiments, the method further includes the steps of obtaining sequencing results based on the sequencing reaction; and determining a nucleotide base sequence of at least the sequence of interest based on the results. The sequencing results may be obtained via a mobility based separation method. In some embodiments, the mobility based separation method may be capillary electrophoresis. In some embodiments, the determined nucleotide base sequence of at least the sequence of interest may be compared to a second nucleotide base sequence of at least the sequence of interest obtained from a NGS method of sequencing. The NGS method of sequencing may include massively parallel sequencing techniques like sequencing by synthesis using fluorophore or semiconductor detection and pyrosequencing, to name a few. In some embodiments, the NGS method of sequencing may be semiconductor sequencing.
In various embodiments, amplifying DNA may include polymerase chain reaction amplification. In selected embodiments, the sequencing reaction may include cycle sequencing.
In various embodiments, the first reaction mixture may also include a polymerase. The polymerase may be a thermostable polymerase. In some embodiments, the polymerase may be Taq polymerase. The first reaction mixture may further include deoxynucleotide triphosphates.
In various embodiments, the second reaction mixture further comprises a polymerase, deoxynucleotide triphosphates, and dye-labelled dideoxynucleotide triphosphates. The polymerase of the second reaction mixture may be a thermostable polymerase. In some embodiments, the polymerase is Taq polymerase.
In various embodiments of the method for sequencing at least one amplicon, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I.
In various embodiments of the method for sequencing at least one amplicon,the chemically-enhanced primer may include an oligonucleotide sequence, a NCM and none or at least one nuclease-resistant linkage. In some embodiments, the chemically-enhanced primer may include one nuclease-resistant linkage at a terminal 3′ end. The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within a oligonucleotide sequence of the chemically-enhanced primer. In some embodiments the plurality of NCMs may be at a terminal 5′ end. In various embodiments, the NCM may be a (Cn) spacer wherein n is any integer from 1 to 9. The NCM may include a plurality of (Cn) spacers. In various embodiments, the chemically-enhanced primer may have a structure of the formula: (Cn)x-OLIGO , where (Cn)x has a structure of the following formula:
where each instance of n may independently be an integer of 1 to 9; and x may be an integer of 1 to about 30;
OLIGO has a structure of the following formula:
In some embodiments, the chemically enhanced primer may have any structure as described in this disclosure.
In various embodiments of the method for sequencing at least one amplicon, each of the plurality of nuclease- sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. In some embodiments, the plurality of nuclease-sensitive amplification primers may prime a set of sequences connected to a specific disease state.
In another aspect of the invention, a method is provided for confirming a DNA sequence, which includes the steps of: amplifying a sample comprising nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying a first aliquot of the plurality of amplicons in a first reaction mixture including a plurality of nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture that includes a nuclease and at least one chemically-enhanced primer, where by contacting the nuclease with the first reaction mixture, the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the at least one chemically-enhanced primer in a sequencing reaction; and producing extension products of the chemically enhanced primer. In some embodiments, the extension products may be fluorescently labeled. In various embodiments, the first priming sequence may have been used to produce the amplicon. The first priming sequence may include at least one cleavable moiety. In some embodiments, the preceding sequence may be a portion of the first priming sequence.
In various embodiments of the method, the steps of contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed in the same reaction vessel. In various embodiments, the steps of amplifying the plurality of amplicons, contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed without intermediate purification steps.
In various embodiments of the method, each of the plurality of amplicons further includes a succeeding sequence 3′ to the sequence of interest wherein the succeeding sequence is complementary to a second priming sequence used to produce the amplicon. Each of the plurality of amplicons may have a length of about 100 nucleotides to about 400 nucleotides. The sequence of interest of each of the plurality of amplicons may have a length of about 100 nucleotides to about 300 nucleotides. In other embodiments, the sequence of interest of each of the plurality of amplicons may have a length of about 125 nucleotides to about 275 or about 250 nucleotides. In some embodiments, the plurality of amplicons may include at least two different amplicons, a first having a sequence of interest that is a major variant sequence and a second amplicon having a minor variant sequence from the same region of a sample nucleic acid.
In some embodiments, the method further includes the steps of obtaining sequencing results based on the sequencing reaction; and determining a nucleotide base sequence of at least the sequence of interest based on the results. The sequencing results may be obtained via a mobility based separation method. In some embodiments, the mobility based separation method may be capillary electrophoresis. In some embodiments, the determined nucleotide base sequence of at least the sequence of interest may be compared to a second nucleotide base sequence of at least the sequence of interest obtained from a NGS method of sequencing performed on a second aliquot of the plurality of amplicons. The NGS method of sequencing may include massively parallel sequencing techniques like sequencing by synthesis using fluorophore or semiconductor detection and pyrosequencing, to name a few. In some embodiments, the NGS method of sequencing may be semiconductor sequencing.
In various embodiments, amplifying DNA may include polymerase chain reaction amplification. In selected embodiments, the sequencing reaction may include cycle sequencing.
In various embodiments, the first reaction mixture may also include a polymerase. The polymerase may be a thermostable polymerase. In some embodiments, the polymerase may be Taq polymerase. The first reaction mixture may further include deoxynucleotide triphosphates.
In various embodiments, the second reaction mixture further comprises a polymerase, deoxynucleotide triphosphates, and dye-labelled dideoxynucleotide triphosphates. The polymerase of the second reaction mixture may be a thermostable polymerase. In some embodiments, the polymerase is Taq polymerase.
In various embodiments of the method for confirming a DNA sequence, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I.
In various embodiments of the method for confirming a DNA sequence, the chemically-enhanced primer may include an oligonucleotide sequence, a NCM and none or at least one nuclease-resistant linkage. In some embodiments, the chemically-enhanced primer may include one nuclease-resistant linkage at a terminal 3′ end. The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within a oligonucleotide sequence of the chemically-enhanced primer. In some embodiments the plurality of NCMs may be at a terminal 5′ end. In various embodiments, the NCM may be a (Cn) spacer wherein n is any integer from 1 to 9. The NCM may include a plurality of (Cn) spacers. In various embodiments, the chemically-enhanced primer may have a structure of the formula: (Cn)x-OLIGO, where (Cn)x has a structure of the following formula:
In some embodiments, the chemically enhanced primer may have any structure described in this disclosure.
In various embodiments of the method for confirming a DNA sequence, each of the plurality of nuclease-sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. In some embodiments, the plurality of nuclease-sensitive amplification primers may prime a set of sequences connected to a specific disease state.
In yet another aspect of the invention, a method is provided for preparing DNA for sequencing, including the steps of: amplifying a sample comprising nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying an aliquot of the plurality of amplicons in a first reaction mixture which includes nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture comprising a nuclease and a chemically-enhanced primer whereby the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the chemically-enhanced primer in a sequencing reaction; and producing extension products of the chemically enhanced primer.
In a further aspect of the invention, a method is provided for sequencing and verifying a variant nucleic acid sequence of interest, including the steps: amplifying a sample which includes nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; splitting the plurality of amplicons into a first aliquot and a second aliquot; amplifying the first aliquot of the plurality of amplicons in a first reaction mixture including nuclease-sensitive amplification primers to form a first amplified DNA product; contacting the first reaction mixture containing the first amplified DNA product with a second reaction mixture which includes a nuclease and a chemically-enhanced primer where by contacting the nuclease with the first reaction mixture, the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the first amplified DNA product with the chemically-enhanced primer in a sequencing reaction; producing extension products of the chemically enhanced primer; obtaining sequencing results of at least the sequence of interest of the extended chemically enhanced primer using a mobility dependent separation; and determining a nucleotide base sequence of at least the sequence of interest of the extended chemically enhanced primer; amplifying the second aliquot of the amplicons to form a second DNA product; obtaining sequencing results of at least the sequence of interest of the second DNA product using a NGS sequencing method; and verifying a nucleotide sequence of the second DNA product by comparing it to the nucleotide base sequence of at least the sequence of interest of the extended chemically enhanced primer. In various embodiments of the method, the step of amplifying the second aliquot of the plurality of amplicons to form a second DNA product further comprises at least one of ligating adaptors, binding to beads, and ligating barcodes.
In another aspect of the invention, a composition for sequencing nucleic acid is provided that includes: a PCR amplification reaction product that comprises: a DNA product amplified from at least one amplicon, wherein the amplicon comprises a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; non-nuclease-resistant amplification primer(s); and a chemically enhanced primer wherein the chemically enhanced primer comprises an oligonucleotide sequence, a NCM and none or at least one nuclease-resistant linkage.
The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within a oligonucleotide sequence of the chemically-enhanced primer. The NCM may be a (Cn) spacer wherein n can be any integer from 1 to 9. In various embodiments, the NCM comprises a plurality of (Cn) spacers. In various embodiments, the chemically-enhanced primer may have a structure of Formula I:
wherein B is a nucleobase; K is S or O; each n is independently an integer of 1 to 9; m is 0 or 1; × is an integer of 1 to about 30; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
In some embodiments of the compositions of the invention, the chemically enhanced primer may be any chemically enhanced primer described in this disclosure.
In various embodiments, the oligonucleotide portion of the chemically-enhanced primer may include a universal primer. The universal primer may be selected from M13, US1, T7, SP6, and T3. The universal primer may be M13. In some embodiments, the chemically-enhanced primer may include one nuclease-resistant linkage.
In some embodiments of the compositions of the invention, the composition may further include a nuclease. In other embodiments, the composition may further include a polymerase, deoxynucleotide triphosphates, dideoxynucleotide triphosphates and a dye-label. In some embodiments, the dideoxynucleotide triphosphates may include dideoxynucleotide triphosphates labeled with the dye-label. The dye-labeled dideoxynucleotide triphosphates may include fluorescent dye-labeled dideoxynucleotide triphosphates. In some embodiments, the dye-label may be attached to the NCM or the oligonucleotide sequence.
In some embodiments of the compositions of the invention, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I. In some embodiments, the polymerase may be Taq polymerase.
In some embodiments, the PCR amplification reaction product further includes an amplified DNA product where the DNA product is the amplification product of a plurality of amplicons.
In yet another aspect of the invention, a chemically enhanced primer is provided that includes an oligonucleotide sequence, at least one NCM and none or at least one nuclease-resistant linkage, and where at least 10 of the nucleotides at a 3′ terminus of the chemically enhanced primer are complementary to at least 10 of the nucleotides at the 5′ terminus of an amplicon, wherein the amplicon includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence.
In some embodiments, the chemically-enhanced primer comprises one nuclease-resistant linkage at the terminal 3′ end. The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within an oligonucleotide sequence of the chemically-enhanced primer. The NCM may be a (Cn) spacer wherein n may be any integer from 1 to 9. The NCM may include a plurality of (Cn) spacers.
In various embodiments of the chemically enhanced primer of the invention, the chemically-enhanced primer may have a structure of the formula: (Cn)x-OLIGO , wherein (Cn)x has a structure of the following formula:
wherein each instance of n is independently an integer of 1 to 9; and x is an integer of 1 to about 30; OLIGO has a structure of the following formula:
In another aspect of the invention, a kit is described which includes: a polymerase, a nuclease, at least one deoxynucleotide triphosphate, and dideoxynucleotide triphosphates. The dideoxynucleotide triphosphates may be dideoxynucleotide triphosphates labeled with a dye-label. The dye-labeled dideoxynucleotide triphosphates may be fluorescent dye-labeled dideoxynucleotide triphosphates.
In some embodiments, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I. In various embodiments, the kit may include a chemically enhanced primer as described in this disclosure.
In other embodiments, the kit may further include a plurality of nuclease sensitive amplification primers. The plurality of nuclease- sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. The plurality of nuclease-sensitive amplification primers of the kit may be configured to prime a set of sequences connected to a specific disease state.
Various patents, patent applications, and other publications are referred to herein, all of which are incorporated herein in their entireties by reference. In addition, the following standard reference works are incorporated herein by reference: Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., edition as of October 2007; Sambrook, Russell, and Sambrook, Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2001. In the event of a conflict between the instant specification and any document incorporated by reference, the specification shall control, it being understood that the determination of whether a conflict or inconsistency exists is within the discretion of the inventors and can be made at any time.
Additional features and advantages of the present teachings will be evident from the description that follows, and in part will be apparent from the description, or can be learned by practice of the present teachings. It is to be understood that both the foregoing summary and the following detailed description are exemplary and explanatory only and are intended to provide further explanation of the present teachings without limiting the present teachings.
The accompanying drawings, which are incorporated in and constitute a part of this specification exemplify embodiments disclosed and, together with the description, serve to explain and illustrate principles of disclosed embodiments. Specifically:
To facilitate understanding of the present teachings, the following definitions are provided. It is to be understood that, in general, terms not otherwise defined are to be given their ordinary meanings or meanings as generally accepted in the art.
As used herein, “amplify”, “amplifying” or “amplification reaction” and their derivatives, refer generally to any action or process whereby at least a portion of a nucleic add molecule (referred to as a template nucleic acid molecule) is replicated or copied into at least one additional nucleic acid molecule. The additional nucleic acid molecule optionally includes sequence that is substantially identical or substantially complementary to at least some portion of the template nucleic: acid molecule. The template nucleic acid molecule can be single-stranded or double-stranded and the additional nucleic add molecule can independently be single-stranded or double-stranded. In some embodiments, amplification includes a template-dependent in vitro enzyme-catalyzed reaction for the production of at least one copy of at least some portion of the nucleic add molecule or the production of at least one copy of a nucleic acid sequence that is complementary to at least some portion of the nucleic add molecule. Amplification optionally includes linear or exponential replication of a nucleic add molecule. In some embodiments, such amplification is performed using isothermal conditions; in other embodiments, such amplification can include thermocycling. In some embodiments, the amplification k a multiplex amplification that includes the simultaneous amplification of a plurality of target sequences in a single amplification reaction. At least some of the target sequences can be situated on the same nucleic add molecule or on different target nucleic acid molecules included in the single amplification reaction. In some embodiments, “amplification” includes amplification of at least some portion of DNA- and RNA-based nucleic acids alone, or in combination. The amplification reaction can include single or double-stranded nucleic acid substrates and can further including any of the amplification processes known to one of ordinary skill in the art. In some embodiments, the amplification reaction includes polymerase chain reaction (PCR).
As used herein, “amplification conditions” and its derivatives, generally refers to conditions suitable for amplifying one or more nucleic acid sequences. Such amplification can be linear or exponential. In some embodiments, the amplification conditions can include isothermal conditions or alternatively can include thermocycling conditions, or a combination of isothermal and thermocycling conditions. In some embodiments, the conditions suitable for amplifying one or more nucleic acid sequences includes polymerase chain reaction (PCR) conditions. Typically, the amplification conditions refer to a reaction mixture that is sufficient to amplify nucleic acids such as one or more target sequences, or to amplify an amplified target sequence ligated to one or more adapters, e.g., an adapter-ligated amplified target sequence. Generally, the amplification conditions include a catalyst for amplification or for nucleic acid synthesis, for example a polymerase; a primer that possesses some degree of complementarity to the nucleic acid to be amplified; and nucleotides, such as deoxyribonucleotide triphosphates (dNTPs) to promote extension of the primer once hybridized to the nucleic acid. The amplification conditions can require hybridization or annealing of a primer to a nucleic acid, extension of the primer and a denaturing step in which the extended primer is separated from the nucleic acid sequence undergoing amplification. Typically, but not necessarily, amplification conditions can include thermocycling; in some embodiments, amplification conditions include a plurality of cycles where the steps of annealing, extending and separating are repeated. Typically, the amplification conditions include cations such as Mg++ or Mn++ (e.g., MgCl2, etc) and can also include various modifiers of ionic strength.
As used herein, “target sequence” or “sequence of interest” and its derivatives, refers generally and interchangeably to any single or double-stranded nucleic acid sequence that can be amplified or synthesized according to the disclosure, including any nucleic acid sequence suspected or expected to be present in a sample. In some embodiments, the sequence of interest is present in double-stranded form and includes at least a portion of the particular nucleotide sequence to be amplified or synthesized, or its complement, prior to the addition of target-specific primers or appended adapters. Target sequences can include the nucleic acids to which primers useful in the amplification or synthesis reaction can hybridize prior to extension by a polymerase. In some embodiments, the term refers to a nucleic acid sequence whose sequence identity, ordering or location of nucleotides is determined by one or more of the methods of the disclosure.
As defined herein, a “cleavable group” generally refers to any moiety that once incorporated into a nucleic acid can be cleaved under appropriate conditions. For example, a cleavable group can be incorporated into a target-specific primer, an amplified sequence, an adapter or a nucleic acid molecule of the sample. In an exemplary embodiment, a target-specific primer can include a cleavable group that becomes incorporated into the amplified product and is subsequently cleaved after amplification, thereby removing a portion, or all, of the target-specific primer from the amplified product. The cleavable group can be cleaved or otherwise removed from a target-specific primer, an amplified sequence, an adapter or a nucleic acid molecule of the sample by any acceptable means. For example, a cleavable group can be removed from a target-specific primer, an amplified sequence, an adapter or a nucleic acid molecule of the sample by enzymatic, thermal, photo-oxidative or chemical treatment. In one aspect, a cleavable group can include a nucleobase that is not naturally occurring. For example, an oligodeoxyribonucleotide can include one or more RNA nucleobases, such as uracil that can be removed by a uracil glycosylase. In some embodiments, a cleavable group can include one or more modified nucleobases (such as 7-methylguanine, 8-oxo-guanine, xanthine, hypoxanthine, 5,6-dihydrouracil or 5-methylcytosine) or one or more modified nucleosides (i.e., 7-methylguanosine, 8-oxo-deoxyguanosine, xanthosine, inosine, dihydrouridine or 5-methylcytidine). The modified nucleobases or nucleotides can be removed from the nucleic acid by enzymatic, chemical or thermal means. In one embodiment, a cleavable group can include a moiety that can be removed from a primer after amplification (or synthesis) upon exposure to ultraviolet light (i.e., bromodeoxyuridine). In another embodiment, a cleavable group can include methylated cytosine. Typically, methylated cytosine can be cleaved from a primer for example, after induction of amplification (or synthesis), upon sodium bisulfite treatment. In some embodiments, a cleavable moiety can include a restriction site. For example, a primer or target sequence can include a nucleic acid sequence that is specific to one or more restriction enzymes, and following amplification (or synthesis), the primer or target sequence can be treated with the one or more restriction enzymes such that the cleavable group is removed. Typically, one or more cleavable groups can be included at one or more locations with a target-specific primer, an amplified sequence, an adapter or a nucleic acid molecule of the sample.
As used herein, “cleavage step” and its derivatives, generally refers to any process by which a cleavable group is cleaved or otherwise removed from a target-specific primer, an amplified sequence, an adapter or a nucleic acid molecule of the sample. In some embodiments, the cleavage steps involves a chemical, thermal, photo-oxidative or digestive process.
The terms “complementary” and “complement” and their variants, as used herein, refer to any two or more nucleic acid sequences (e.g., portions or entireties of template nucleic acid molecules, target sequences and/or primers) that can undergo cumulative base pairing at two or more individual corresponding positions in antiparallel orientation, as in a hybridized duplex. Such base pairing can proceed according to any set of established rules, for example according to Watson-Crick base pairing rules or according to some other base pairing paradigm. Optionally there can be “complete” or “total” complementarity between a first and second nucleic acid sequence where each nucleotide in the first nucleic acid sequence can undergo a stabilizing base pairing interaction with a nucleotide in the corresponding antiparallel position on the second nucleic acid sequence. “Partial” complementarity describes nucleic acid sequences in which at least 20%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, at least 50%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, at least 70%, 80%, 90%, 95% or 98%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. Sequences are said to be “substantially complementary” when at least 85% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, two complementary or substantially complementary sequences are capable of hybridizing to each other under standard or stringent hybridization conditions. “Non-complementary” describes nucleic acid sequences in which less than 20% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. Sequences are said to be “substantially non-complementary” when less than 15% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, two non-complementary or substantially non-complementary sequences cannot hybridize to each other under standard or stringent hybridization conditions. A “mismatch” is present at any position in the two opposed nucleotides are not complementary. Complementary nucleotides include nucleotides that are efficiently incorporated by DNA polymerases opposite each other during DNA replication under physiological conditions. In a typical embodiment, complementary nucleotides can form base pairs with each other, such as the A-T/U and G-C base pairs formed through specific Watson-Crick type hydrogen bonding, or base pairs formed through some other type of base pairing paradigm, between the nucleobases of nucleotides and/or polynucleotides in positions antiparallel to each other. The complementarity of other artificial base pairs can be based on other types of hydrogen bonding and/or hydrophobicity of bases and/or shape complementarity between bases.
As used herein, “DNA barcode” or “DNA tagging sequence” and its derivatives, refers generally to a unique short (6-14 nucleotide) nucleic acid sequence within an adapter that can act as a ‘key’ to distinguish or separate a plurality of amplified target sequences in a sample. For the purposes of this disclosure, a DNA barcode or DNA tagging sequence can be incorporated into the nucleotide sequence of an adapter.
As used herein, “contacting” and its derivatives, when used in reference to two or more components, refers generally to any process whereby the approach, proximity, mixture or commingling of the referenced components is promoted or achieved without necessarily requiring physical contact of such components, and includes mixing of solutions containing any one or more of the referenced components with each other. The referenced components may be contacted in any particular order or combination and the particular order of recitation of components is not limiting.
As used herein, the term “determining a nucleotide base sequence” or the term “determining information about a sequence” encompasses “sequence determination” and also encompasses other levels of information such as eliminating one or more possibilities for a sequence. It is noted that performing sequence determination of a polynucleotide typically yields equivalent information regarding the sequence of a perfectly complementary (100% complementary) polynucleotide and thus is equivalent to sequence determination performed directly on a perfectly complementary polynucleotide.
As used herein, the term “end” and its variants, when used in reference to a nucleic acid molecule, for example a target sequence or amplified target sequence, can include the terminal 30 nucleotides, the terminal 20 and even more typically the terminal 15 nucleotides of the nucleic acid molecule. A linear nucleic acid molecule comprised of linked series of contiguous nucleotides typically includes at least two ends. In some embodiments, one end of the nucleic acid molecule can include a 3′ hydroxyl group or its equivalent, and can be referred to as the “3′ end” and its derivatives. Optionally, the 3′ end includes a 3′ hydroxyl group that is not linked to a 5′ phosphate group of a mononucleotide pentose ring. Typically, the 3′ end includes one or more 5′ linked nucleotides located adjacent to the nucleotide including the unlinked 3′ hydroxyl group, typically the 30 nucleotides located adjacent to the 3′ hydroxyl, typically the terminal 20 and even more typically the terminal 15 nucleotides. Generally, the one or more linked nucleotides can be represented as a percentage of the nucleotides present in the oligonucleotide or can be provided as a number of linked nucleotides adjacent to the unlinked 3′ hydroxyl. For example, the 3′ end can include less than 50% of the nucleotide length of the oligonucleotide. In some embodiments, the 3′ end does not include any unlinked 3′ hydroxyl group but can include any moiety capable of serving as a site for attachment of nucleotides via primer extension and/or nucleotide polymerization. In some embodiments, the term “3′ end” for example when referring to a target-specific primer, can include the terminal 10 nucleotides, the terminal 5 nucleotides, the terminal 4, 3, 2 or fewer nucleotides at the 3′end. In some embodiments, the term “3′ end” when referring to a target-specific primer can include nucleotides located at nucleotide positions 10 or fewer from the 3′ terminus.
As used herein, “5′ end”, and its derivatives, generally refers to an end of a nucleic acid molecule, for example a target sequence or amplified target sequence, which includes a free 5′ phosphate group or its equivalent. In some embodiments, the 5′ end includes a 5′ phosphate group that is not linked to a 3′ hydroxyl of a neighboring mononucleotide pentose ring. Typically, the 5′ end includes to one or more linked nucleotides located adjacent to the 5′ phosphate, typically the 30 nucleotides located adjacent to the nucleotide including the 5′ phosphate group, typically the terminal 20 and even more typically the terminal 15 nucleotides. Generally, the one or more linked nucleotides can be represented as a percentage of the nucleotides present in the oligonucleotide or can be provided as a number of linked nucleotides adjacent to the 5′ phosphate. For example, the 5′ end can be less than 50% of the nucleotide length of an oligonucleotide. In another exemplary embodiment, the 5′ end can include about 15 nucleotides adjacent to the nucleotide including the terminal 5′ phosphate. In some embodiments, the 5′ end does not include any unlinked 5′ phosphate group but can include any moiety capable of serving as a site of attachment to a a 3′ hydroxyl group, or to the 3′end of another nucleic acid molecule. In some embodiments, the term “5′ end” for example when referring to a target-specific primer, can include the terminal 10 nucleotides, the terminal 5 nucleotides, the terminal 4, 3, 2 or fewer nucleotides at the 5′end. In some embodiments, the term “5′ end” when referring to a target-specific primer can include nucleotides located at positions 10 or fewer from the 5′ terminus. In some embodiments, the 5′ end of a target-specific primer can include only non-cleavable nucleotides, for example nucleotides that do not contain one or more cleavable groups as disclosed herein, or a cleavable nucleotide as would be readily determined by one of ordinary skill in the art.
As used herein, the term “hybridization” is consistent with its use in the art, and generally refers to the process whereby two nucleic acid molecules undergo base pairing interactions. Two nucleic acid molecule molecules are said to be hybridized when any portion of one nucleic acid molecule is base paired with any portion of the other nucleic acid molecule; it is not necessarily required that the two nucleic acid molecules be hybridized across their entire respective lengths and in some embodiments, at least one of the nucleic acid molecules can include portions that are not hybridized to the other nucleic acid molecule. The phrase “hybridizing under stringent conditions” and its variants refers generally to conditions under which hybridization of a target-specific primer to a target sequence occurs in the presence of high hybridization temperature and low ionic strength. In one exemplary embodiment, stringent hybridization conditions include an aqueous environment containing about 30 mM magnesium sulfate, about 300 mM Tris-sulfate at pH 8.9, and about 90 mM ammonium sulfate at about 60-68° C., or equivalents thereof. As used herein, the phrase “standard hybridization conditions” and its variants refers generally to conditions under which hybridization of a primer to an oligonucleotide (i.e., a target sequence), occurs in the presence of low hybridization temperature and high ionic strength. In one exemplary embodiment, standard hybridization conditions include an aqueous environment containing about 100 mM magnesium sulfate, about 500 mM Tris-sulfate at pH 8.9, and about 200 mM ammonium sulfate at about 50-55° C., or equivalents thereof.
As used herein, the terms “ligating”, “ligation” and their derivatives refer generally to the act or process for covalently linking two or more molecules together, for example, covalently linking two or more nucleic acid molecules to each other. In some embodiments, ligation includes joining nicks between adjacent nucleotides of nucleic acids. In some embodiments, ligation includes forming a covalent bond between an end of a first and an end of a second nucleic acid molecule. In some embodiments, for example embodiments wherein the nucleic acid molecules to be ligated include conventional nucleotide residues, the litigation can include forming a covalent bond between a 5′ phosphate group of one nucleic acid and a 3′ hydroxyl group of a second nucleic acid thereby forming a ligated nucleic acid molecule. In some embodiments, any means for joining nicks or bonding a 5′phosphate to a 3′ hydroxyl between adjacent nucleotides can be employed. In an exemplary embodiment, an enzyme such as a ligase can be used. Generally for the purposes of this disclosure, an amplified target sequence can be ligated to an adapter to generate an adapter-ligated amplified target sequence.
As used herein, “ligase” and its derivatives, refers generally to any agent capable of catalyzing the ligation of two substrate molecules. In some embodiments, the ligase includes an enzyme capable of catalyzing the joining of racks between adjacent nucleotides of a nucleic add. In some embodiments, the ligase includes an enzyme capable of catalyzing the formation of a covalent bond between a 5′ phosphate of one nucleic acid molecule to a 3′ hydroxyl of another nucleic acid molecule thereby forming a ligated nucleic acid molecule. Suitable ligases may include, but not limited to, T4 DNA ligase, T4 RNA ligase, and E. coli DNA ligase.
As used herein, “blunt-end ligation” and its derivatives, refers generally to ligation of two blunt-end double-stranded nucleic acid molecules to each other. A “blunt end” refers to an end of a double-stranded nucleic acid molecule wherein substantially all of the nucleotides in the end of one strand of the nucleic acid molecule are base paired with opposing nucleotides in the other strand of the same nucleic acid molecule. A nucleic acid molecule is not blunt ended if it has an end that includes a single-stranded portion greater than two nucleotides in length, referred to herein as an “overhang”. In some embodiments, the end of nucleic acid molecule does not include any single stranded portion, such that every nucleotide in one strand of the end is based paired with opposing nucleotides in the other strand of the same nucleic acid molecule. In some embodiments, the ends of the two blunt ended nucleic acid molecules that become ligated to each other do not include any overlapping, shared or complementary sequence. Typically, blunted-end ligation excludes the use of additional oligonucleotide adapters to assist in the ligation of the double-stranded amplified target sequence to the double-stranded adapter, such as patch oligonucleotides as described in Mitra and Varley, US2010/0129874, published May 27, 2010. In some embodiments, blunt-ended ligation includes a nick translation reaction to seal a nick created during the ligation process.
As used herein, the terms “adapter” or “adapter and its complements” and their derivatives, refers generally to any linear oligonucleotide which can be ligated to a nucleic acid molecule of the disclosure. Optionally, the adapter includes a nucleic acid sequence that is not substantially complementary to the 3′ end or the 5′ end of at least one target sequences within the sample. In some embodiments, the adapter is substantially non-complementary to the 3′ end or the 5′ end of any target sequence present in the sample. In some embodiments, the adapter includes any single stranded or double-stranded linear oligonucleotide that is not substantially complementary to an amplified target sequence. In some embodiments, the adapter is substantially non-complementary to at least one, some or all of the nucleic acid molecules of the sample. In some embodiments, suitable adapter lengths are in the range of about 10-100 nucleotides, about 12-60 nucleotides and about 15-50 nucleotides in length. Generally, the adapter can include any combination of nucleotides and/or nucleic acids. In some aspects, the adapter can include one or more cleavable groups at one or more locations. In another aspect, the adapter can include a sequence that is substantially identical, or substantially complementary, to at least a portion of a primer, for example a universal primer. In some embodiments, the adapter can include a barcode or tag to assist with downstream cataloguing, identification or sequencing. In some embodiments, a single-stranded adapter can act as a substrate for amplification when ligated to an amplified target sequence, particularly in the presence of a polymerase and dNTPs under suitable temperature and pH.
As used herein, the term “polymerase chain reaction” (“PCR”) refers to the method of K. B. Mullis U.S. Pat. Nos. 4,683,195 and 4,683,202, hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a polynucleotide of interest in a mixture of genomic DNA without cloning or purification. This process for amplifying the polynucleotide of interest consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired polynucleotide of interest, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded polynucleotide of interest. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the polynucleotide of interest molecule. Following annealing, the primers are extended with a polymerase to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired polynucleotide of interest. The length of the amplified segment of the desired polynucleotide of interest (amplicon) is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of repeating the process, the method is referred to as the “polymerase chain reaction” (hereinafter “PCR”). Because the desired amplified segments of the polynucleotide of interest become the predominant nucleic acid sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified”. As defined herein, target nucleic acid molecules within a sample including a plurality of target nucleic acid molecules are amplified via PCR. In a modification to the method discussed above, the target nucleic acid molecules can be PCR amplified using a plurality of different primer pairs, in some cases, one or more primer pairs per target nucleic acid molecule of interest, thereby forming a multiplex PCR reaction. Using multiplex PCR, it is possible to simultaneously amplify multiple nucleic acid molecules of interest from a sample to form amplified target sequences. It is also possible to detect the amplified target sequences by several different methodologies (e.g., quantitation with a bioanalyzer or qPCR, hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; incorporation of 32P-labeled deoxynucleotide triphosphates, such as dCTP or dATP, into the amplified target sequence). Any oligonucleotide sequence can be amplified with the appropriate set of primers, thereby allowing for the amplification of target nucleic acid molecules from genomic DNA, cDNA, formalin-fixed paraffin-embedded DNA, fine-needle biopsies and various other sources. In particular, the amplified target sequences created by the multiplex PCR process as disclosed herein, are themselves efficient substrates for subsequent PCR amplification or various downstream assays or manipulations.
As defined herein “multiplex amplification” refers to selective and non-random amplification of two or more target sequences within a sample using at least one target-specific primer. In some embodiments, multiplex amplification is performed such that some or all of the target sequences are amplified within a single reaction vessel. The “plexy” or “plex” of a given multiplex amplification refers generally to the number of different target-specific sequences that are amplified during that single multiplex amplification. In some embodiments, the plexy can be about 12-plex, 24-plex, 48-plex, 96-plex, 192-plex, 384-plex, 768-plex, 1536-plex, 3072-plex, 6144-plex or higher.
“Cycle sequencing” as used herein, refers to a process that includes adding to a target nucleic acid or an amplification product thereof, sequencing primer, deoxynucleotide triphosphates (dNTPs), dye-labeled chain terminating nucleotides (e.g.,dideoxynucleotide triphosphates (ddNTPs-dyes)), and DNA polymerase, followed by thermal cycle sequencing. Standard cycle sequencing procedures are well established. Cycle sequencing procedures are described in more detail, for example, in U.S. Pat. No. 5,741,676, and U.S. Pat. No. 5,756,285, each hereby incorporated by reference in its entirety. In certain embodiments, “cycle sequencing” comprises dNTPS, a sequencing primer (labeled or not), ddNTPs (labeled or not) and DNA polymerase as known to one of skill in the art. It is noted that a labeled sequencing primer can provide fragment analysis information and/or determination of the sequence of a target nucleic acid or amplification product thereof.
As used herein, the term “PCR/cycle sequencing” refers to a method for determining a nucleotide sequence of DNA by PCR amplifying the DNA, followed by sequencing reactions repeated (or cycled) several times. This cycling is similar to PCR because the sequencing reaction is allowed to proceed at a preselected temperature where polymerase extension may occur, i.e. 42° C.-55° C., then extension is stopped by heating to 95° C., and finally the cycle is started again at 42° C.-55° C. Cycle sequencing uses a thermostable DNA polymerase.
As used herein, the term “phosphorothioate linkage” refers to an inter-nucleotide linkage comprising a sulfur atom in place of a non-bridging oxygen atom within the phosphate linkages of a sugar phosphate backbone. The term phosphorothioate linkage refers to both phosphorothioate inter-nucleotide linkages and phosphorodithioate inter-nucleotide linkages. A “phosphorothioate linkage at a terminal 3′ end” refers to a phosphorothioate linkage at the 3′ terminus, that is, the last phosphate linkage of the sugar phosphate backbone at the 3′ terminus. A phosphorothioate linkage at a terminal 3′ end is illustrated in
As used herein, the term “phosphodiester linkage” may refer to the linkage—PO4—which is used to link nucleotide monomers, such as the inter-nucleotide linkages found in naturally-occurring DNA. Additionally, “phosphodiester linkage” may refer to portions of the NCMs or NCM linkers of the chemically-enhanced primers of the present disclosure.
As used herein, the term “nuclease-resistant linkage” refers to an oligonucleotide sequence, such as a primer, that is resistant to digestion in the 3′ to 5′ direction by nuclease. Phosphorothioate and boronophosphate linkages are two examples of nuclease-resistant linkages. The examples are not to be construed as limiting to just these examples.
As used herein, the term “primer” and its derivatives refer generally to any polynucleotide that can hybridize to a target sequence of interest. In some embodiments, the primer can also serve to prime nucleic acid synthesis. Typically, the primer functions as a substrate onto which nucleotides can be polymerized by a polymerase; in some embodiments, however, the primer can become incorporated into the synthesized nucleic acid strand and provide a site to which another primer can hybridize to prime synthesis of a new strand that is complementary to the synthesized nucleic acid molecule. The primer may be comprised of any combination of nucleotides or analogs thereof, which may be optionally linked to form a linear polymer of any suitable length. In some embodiments, the primer is a single-stranded oligonucleotide or polynucleotide. (For purposes of this disclosure, the terms ‘polynucleotide” and “oligonucleotide” are used interchangeably herein and do not necessarily indicate any difference in length between the two). In some embodiments, the primer is single-stranded but it can also be double-stranded. The primer optionally occurs naturally, as in a purified restriction digest, or can be produced synthetically. In some embodiments, the primer acts as a point of initiation for amplification or synthesis when exposed to amplification or synthesis conditions; such amplification or synthesis can occur in a template-dependent fashion and optionally results in formation of a primer extension product that is complementary to at least a portion of the target sequence. Exemplary amplification or synthesis conditions can include contacting the primer with a polynucleotide template (e.g., a template including a target sequence), nucleotides and an inducing agent such as a polymerase at a suitable temperature and pH to induce polymerization of nucleotides onto an end of the target-specific primer. If double-stranded, the primer can optionally be treated to separate its strands before being used to prepare primer extension products. In some embodiments, the primer is an oligodeoxyribonucleotide or an oligoribonucleotide. In some embodiments, the primer can include one or more nucleotide analogs. The exact length and/or composition, including sequence, of the target-specific primer can influence many properties, including melting temperature (Tm), GC content, formation of secondary structures, repeat nucleotide motifs, length of predicted primer extension products, extent of coverage across a nucleic acid molecule of interest, number of primers present in a single amplification or synthesis reaction, presence of nucleotide analogs or modified nucleotides within the primers, and the like. In some embodiments, a primer can be paired with a compatible primer within an amplification or synthesis reaction to form a primer pair consisting or a forward primer and a reverse primer. In some embodiments, the forward primer of the primer pair includes a sequence that is substantially complementary to at least a portion of a strand of a nucleic acid molecule, and the reverse primer of the primer of the primer pair includes a sequence that is substantially identical to at least of portion of the strand. In some embodiments, the forward primer and the reverse primer are capable of hybridizing to opposite strands of a nucleic acid duplex. Optionally, the forward primer primes synthesis of a first nucleic acid strand, and the reverse primer primes synthesis of a second nucleic acid strand, wherein the first and second strands are substantially complementary to each other, or can hybridize to form a double-stranded nucleic acid molecule. In some embodiments, one end of an amplification or synthesis product is defined by the forward primer and the other end of the amplification or synthesis product is defined by the reverse primer. In some embodiments, where the amplification or synthesis of lengthy primer extension products is required, such as amplifying an exon, coding region, or gene, several primer pairs can be created than span the desired length to enable sufficient amplification of the region. In some embodiments, a primer can include one or more cleavable groups. In some embodiments, primer lengths are in the range of about 10 to about 60 nucleotides, about 12 to about 50 nucleotides and about 15 to about 40 nucleotides in length. Typically, a primer is capable of hybridizing to a corresponding target sequence and undergoing primer extension when exposed to amplification conditions in the presence of dNTPS and a polymerase. In some instances, the particular nucleotide sequence or a portion of the primer is known at the outset of the amplification reaction or can be determined by one or more of the methods disclosed herein. In some embodiments, the primer includes one or more cleavable groups at one or more locations within the primer.
As used herein, “target-specific primer” and its derivatives, refers generally to a single stranded or double-stranded polynucleotide, typically an oligonucleotide, that includes at least one sequence that is at least 50% complementary, typically at least 75% complementary or at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% or at least 99% complementary, or identical, to at least a portion of a nucleic acid molecule that includes a target sequence. In such instances, the target-specific primer and target sequence are described as “corresponding” to each other. In some embodiments, the target-specific primer is capable of hybridizing to at least a portion of its corresponding target sequence (or to a complement of the target sequence); such hybridization can optionally be performed under standard hybridization conditions or under stringent hybridization conditions. In some embodiments, the target-specific primer is not capable of hybridizing to the target sequence, or to its complement, but is capable of hybridizing to a portion of a nucleic acid strand including the target sequence, or to its complement. In some embodiments, the target-specific primer includes at least one sequence that is at least 75% complementary, typically at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% complementary, or more typically at least 99% complementary, to at least a portion of the target sequence itself; in other embodiments, the target-specific primer includes at least one sequence that is at least 75% complementary, typically at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% complementary, or more typically at least 99% complementary, to at least a portion of the nucleic acid molecule other than the target sequence. In some embodiments, the target-specific primer is substantially non-complementary to other target sequences present in the sample; optionally, the target-specific primer is substantially non-complementary to other nucleic acid molecules present in the sample. In some embodiments, nucleic acid molecules present in the sample that do not include or correspond to a target sequence (or to a complement of the target sequence) are referred to as “non-specific” sequences or “non-specific nucleic acids”. In some embodiments, the target-specific primer is designed to include a nucleotide sequence that is substantially complementary to at least a portion of its corresponding target sequence. In some embodiments, a target-specific primer is at least 95% complementary, or at least 99% complementary, or identical, across its entire length to at least a portion of a nucleic acid molecule that includes its corresponding target sequence. In some embodiments, a target-specific primer can be at least 90%, at least 95% complementary, at least 98% complementary or at least 99% complementary, or identical, across its entire length to at least a portion of its corresponding target sequence. In some embodiments, a forward target-specific primer and a reverse target-specific primer define a target-specific primer pair that can be used to amplify the target sequence via template-dependent primer extension. Typically, each primer of a target-specific primer pair includes at least one sequence that is substantially complementary to at least a portion of a nucleic acid molecule including a corresponding target sequence but that is less than 50% complementary to at least one other target sequence in the sample. In some embodiments, amplification can be performed using multiple target-specific primer pairs in a single amplification reaction, wherein each primer pair includes a forward target-specific primer and a reverse target-specific primer, each including at least one sequence that substantially complementary or substantially identical to a corresponding target sequence in the sample, and each primer pair having a different corresponding target sequence. In some embodiments, the target-specific primer can be substantially non-complementary at its 3’ end or its 5′ end to any other target-specific primer present in an amplification reaction. In some embodiments, the target-specific primer can include minimal cross hybridization to other target-specific primers in the amplification reaction. In some embodiments, target-specific primers include minimal cross-hybridization to non-specific sequences in the amplification reaction mixture. In some embodiments, the target-specific primers include minimal self-complementarity. In some embodiments, the target-specific primers can include one or more cleavable groups located at the 3′ end. In some embodiments, the target-specific primers can include one or more cleavable groups located near or about a central nucleotide of the target-specific primer. In some embodiments, one of more targets-specific primers includes only non-cleavable nucleotides at the 5′ end of the target-specific primer. In some embodiments, a target specific primer includes minimal nucleotide sequence overlap at the 3′end or the 5′ end of the primer as compared to one or more different target-specific primers, optionally in the same amplification reaction. In some embodiments 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more, target-specific primers in a single reaction mixture include one or more of the above embodiments. In some embodiments, substantially all of the plurality of target-specific primers in a single reaction mixture includes one or more of the above embodiments.
As used herein, the term “chemically-enhanced primer” refers to a primer that can have a negatively charged moiety at a terminal 5′ end of the primer or within the primer. The primer can also include a nuclease-resistant linkage at the last phosphate linkage of the sugar phosphate backbone at the 3′ terminus.
As used herein, the term “sequencing primer” refers to an oligonucleotide primer that is used to initiate a sequencing reaction performed on a nucleic acid. The term “sequencing primer” refers to both a forward sequencing primer and to a reverse sequencing primer.
As used herein, the term “extension primer” refers to an oligonucleotide, capable of annealing to a nucleic acid region adjacent a target sequence, and serving as an initiation primer for elongation of the oligonucleotide by using the target sequence as the complementary template for nucleotide extension under suitable conditions well known in the art. Typically, a sequencing reaction employs at least one extension primer or a pair of extension primers. The pair would include an “upstream” or “forward” primer and a “downstream” or “reverse” primer, which delimit a region of the nucleic acid target sequence to be sequenced.
As used herein, the term “amplification primer” refers to an oligonucleotide, capable of annealing to an RNA or DNA region adjacent a target sequence, and serving as an initiation primer for nucleic acid synthesis under suitable conditions well known in the art. Typically, a PCR reaction employs a pair of amplification primers including an “upstream” or “forward” primer and a “downstream” or “reverse” primer, which delimit a region of the RNA or DNA to be amplified.
As used herein, the term “tailed primer” or “tailed amplification primer” or “tailed sequencing primer” refers to a primer that includes at its 3′end a sequence capable of annealing to an RNA or DNA region adjacent a target sequence, and serving as an initiation primer for DNA synthesis under suitable conditions well known in the art. The primer includes at its 5′end a sequence that is not complementary to the target sequence.
The term “extension” and its variants, as used herein, when used in reference to a given primer, comprises any in vivo or in vitro enzymatic activity characteristic of a given polymerase that relates to polymerization of one or more nucleotides onto an end of an existing nucleic acid molecule. Typically but not necessarily such primer extension occurs in a template-dependent fashion; during template-dependent extension, the order and selection of bases is driven by established base pairing rules, which can include Watson-Crick type base pairing rules or alternatively (and especially in the case of extension reactions involving nucleotide analogs) by some other type of base pairing paradigm. In one non-limiting example, extension occurs via polymerization of nucleotides on the 3′OH end of the nucleic acid molecule by the polymerase.
The term “nucleic acid sequence” as used herein can refer to the nucleic acid material itself and is not restricted to the sequence information (i.e. the succession of letters chosen among the five base letters A, C, G, T, or U) that biochemically characterizes a specific nucleic acid, for example, a DNA or RNA molecule. Nucleic acids shown herein are presented in a 5′→3′ orientation unless otherwise indicated.
The term “mobility-dependent separation” as used herein can refer to the separation of nucleic acid fragments due to the charge and size associated with the fragment.
The term “fluorescent dye” as used herein refers to moieties that absorb light energy at a defined excitation wavelength and emit light energy at a different wavelength. Preferably the fluorescent dyes selected for use are spectrally resolvable. As used herein, “spectrally resolvable” means that the dyes can be distinguished on the basis of their spectral characteristics, particularly fluorescence emission wavelength, under conditions of operation. For example, the identity of the one or more terminal nucleotides can be correlated to a distinct wavelength of maximum light emission intensity, or perhaps a ratio of intensities at different wavelengths.
The term “nucleobase” or “base” as used herein refers to a nitrogen-containing heterocyclic moiety capable of forming Watson-Crick type hydrogen bonds with a complementary nucleobase or nucleobase analog, e.g. a purine, a 7-deazapurine, or a pyrimidine. Typical nucleobases are the naturally occurring nucleobases adenine, guanine, cytosine, 5mC, uracil, thymine, and analogs of naturally occurring nucleobases, e.g. 7-deazaadenine, 7-deaza-8-azaadenine, 7-deazaguanine, 7-deaza-8-azaguanine, N6-Δ2 isopentenyl-adenine(6iA), N6-Δ2-isopentenyl-2-methylthioadenine (2ms6iA), N2-dimethyl-guanine(dmG), 7-methylguanine (7mG), inosine, nebularine, nitropyrrole, nitroindole, 2-amino-purine, 2,6-diamino-purine, hypoxanthine, pseudouridine, pseudocytidine, pseudoisocytidine, 5-propynyl-cytidine, isocytidine, isoguanine, 2-thiopyrimidine, 6-thioguanine, 4-thiothymine, 4-thiouracil, O6-methylguanine, N6-methyl-adenine, O4-methylthymine, 5,6-dihydrothymine, 5,6-dihydrouracil, 4-methylindole, pyrazolo[3,4-D]pyrimidines (see, e.g., U.S. Pat. Nos. 6,143,877 and 6,127,121 and PCT Published Application WO 01/38584)and ethenoadenine. Nonlimiting examples of nucleotide bases can be found, e.g., in Fasman, Practical Handbook of Biochemistry and Molecular Biology, pp. 385-394, CRC Press, Boca Raton, Fla. (1989).
As used herein, the term “nucleotide” and its variants comprises any compound, including without limitation any naturally occurring nucleotide or analog thereof, which can bind selectively to, or can be polymerized by, a polymerase. Typically, but not necessarily, selective binding of the nucleotide to the polymerase is followed by polymerization of the nucleotide into a nucleic acid strand by the polymerase; occasionally however the nucleotide may dissociate from the polymerase without becoming incorporated into the nucleic acid strand, an event referred to herein as a “non-productive” event. Such nucleotides include not only naturally occurring nucleotides but also any analogs, regardless of their structure, that can bind selectively to, or can be polymerized by, a polymerase. While naturally occurring nucleotides typically comprise base, sugar and phosphate moieties, the nucleotides of the present disclosure can include compounds lacking any one, some or all of such moieties. In some embodiments, the nucleotide can optionally include a chain of phosphorus atoms comprising three, four, five, six, seven, eight, nine, ten or more phosphorus atoms. In some embodiments, the phosphorus chain can be attached to any carbon of a sugar ring, such as the 5′ carbon. The phosphorus chain can be linked to the sugar with an intervening O or S. In one embodiment, one or more phosphorus atoms in the chain can be part of a phosphate group having P and O. In another embodiment, the phosphorus atoms in the chain can be linked together with intervening O, NH, S, methylene, substituted methylene, ethylene, substituted ethylene, CNH2, C(O), C(CH2), CH2CH2, or C(OH)CH2R (where R can be a 4-pyridine or 1-imidazole). In one embodiment, the phosphorus atoms in the chain can have side groups having O, BH3, or S. In the phosphorus chain, a phosphorus atom with a side group other than O can be a substituted phosphate group. In the phosphorus chain, phosphorus atoms with an intervening atom other than O can be a substituted phosphate group. Some examples of nucleotide analogs are described in Xu, U.S. Pat. No. 7,405,281. In some embodiments, the nucleotide comprises a label and referred to herein as a “labeled nucleotide”; the label of the labeled nucleotide is referred to herein as a “nucleotide label”. In some embodiments, the label can be in the form of a fluorescent dye attached to the terminal phosphate group, i.e., the phosphate group most distal from the sugar. Some examples of nucleotides that can be used in the disclosed methods and compositions include, but are not limited to, ribonucleotides, deoxyribonucleotides, modified ribonucleotides, modified deoxyribonucleotides, ribonucleotide polyphosphates, deoxyribonucleotide polyphosphates, modified ribonucleotide polyphosphates, modified deoxyribonucleotide polyphosphates, peptide nucleotides, modified peptide nucleotides, metallonucleosides, phosphonate nucleosides, and modified phosphate-sugar backbone nucleotides, analogs, derivatives, or variants of the foregoing compounds, and the like. In some embodiments, the nucleotide can comprise non-oxygen moieties such as, for example, thio- or borano-moieties, in place of the oxygen moiety bridging the alpha phosphate and the sugar of the nucleotide, or the alpha and beta phosphates of the nucleotide, or the beta and gamma phosphates of the nucleotide, or between any other two phosphates of the nucleotide, or any combination thereof. “Nucleotide 5′-triphosphate” refers to a nucleotide with a triphosphate ester group at the 5′ position, and are sometimes denoted as “NTP”, or “dNTP” and “ddNTP” to particularly point out the structural features of the ribose sugar. The triphosphate ester group can include sulfur substitutions for the various oxygens, e.g. .alpha.-thio-nucleotide 5′-triphosphates. For a review of nucleic acid chemistry, see: Shabarova, Z. and Bogdanov, A. Advanced Organic Chemistry of Nucleic Acids, VCH, New York, 1994.
As used herein, the terms “polynucleotide”, “nucleic acid”, or “oligonucleotide” refers to a linear polymer of nucleosides (including deoxyribonucleosides, ribonucleosides, or analogs thereof) joined by inter-nucleosidic linkages. Whenever a polynucleotide such as an oligonucleotide is represented by a sequence of letters, such as “ATGCCTG,” it will be understood that the nucleotides are in 5′→3′ order from left to right and that “A” denotes deoxyadenosine, “C” denotes deoxycytidine, “G” denotes deoxyguanosine, and “T” denotes deoxythymidine, unless otherwise noted. The letters A, C, G, and T can be used to refer to the bases themselves, to nucleosides, or to nucleotides comprising the bases, as is standard in the art. In naturally occurring polynucleotides, the inter-nucleoside linkage is typically a phosphodiester bond, and the subunits are referred to as “nucleotides.” Oligonucleotide primers comprising other inter-nucleoside linkages, such as phosphorothioate linkages, are used in certain embodiments of the teachings. It will be appreciated that one or more of the subunits that make up such an oligonucleotide primer with a non-phosphodiester linkage may not comprise a phosphate group. Such analogs of nucleotides are considered to fall within the scope of the term “nucleotide” as used herein, and nucleic acids comprising one or more inter-nucleoside linkages that are not phosphodiester linkages are still referred to as “polynucleotides”, “oligonucleotides”, etc.
As used herein, “polymerase” and its derivatives, generally refers to any enzyme that can catalyze the polymerization of nucleotides (including analogs thereof) into a nucleic acid strand. Typically but not necessarily, such nucleotide polymerization can occur in a template-dependent fashion. Such polymerases can include without limitation naturally occurring polymerases and any subunits and truncations thereof, mutant polymerases, variant polymerases, recombinant, fusion or otherwise engineered polymerases, chemically modified polymerases, synthetic molecules or assemblies, and any analogs, derivatives or fragments thereof that retain the ability to catalyze such polymerization. Optionally, the polymerase can be a mutant polymerase comprising one or more mutations involving the replacement of one or more amino acids with other amino acids, the insertion or deletion of one or more amino acids from the polymerase, or the linkage of parts of two or more polymerases. Typically, the polymerase comprises one or more active sites at which nucleotide binding and/or catalysis of nucleotide polymerization can occur. Some exemplary polymerases include without limitation DNA polymerases and RNA polymerases. The term “polymerase” and its variants, as used herein, also refers to fusion proteins comprising at least two portions linked to each other, where the first portion comprises a peptide that can catalyze the polymerization of nucleotides into a nucleic acid strand and is linked to a second portion that comprises a second polypeptide. In some embodiments, the second polypeptide can include a reporter enzyme or a processivity-enhancing domain. Optionally, the polymerase can possess 5′ exonuclease activity or terminal transferase activity. In some embodiments, the polymerase can be optionally reactivated, for example through the use of heat, chemicals or re-addition of new amounts of polymerase into a reaction mixture. In some embodiments, the polymerase can include a hot-start polymerase or an aptamer based polymerase that optionally can be reactivated.
As defined herein, “sample” and its derivatives, is used in its broadest sense and includes any specimen, culture and the like that is suspected of including a target. In some embodiments, the sample comprises DNA, RNA, PNA, LNA, chimeric, hybrid, or multiplex-forms of nucleic acids. The sample can include any biological, clinical, surgical, agricultural, atmospheric or aquatic-based specimen containing one or more nucleic acids. The term also includes any isolated nucleic acid sample such a genomic DNA, fresh-frozen or formalin-fixed paraffin-embedded nucleic acid specimen.
As used herein “sequence determination”, “determining a nucleotide base sequence”, “sequencing”, and like terms includes determination of partial as well as full sequence information. That is, the term includes sequence comparisons, fingerprinting, and like levels of information about a target polynucleotide, as well as the express identification and ordering of each nucleoside of the target polynucleotide within a region of interest. In certain embodiments, “sequence determination” comprises identifying a single nucleotide, while in other embodiments more than one nucleotide is identified. Identification of nucleosides, nucleotides, and/or bases are considered equivalent herein. It is noted that performing sequence determination on a polynucleotide typically yields equivalent information regarding the sequence of a perfectly complementary polynucleotide and thus is equivalent to sequence determination performed directly on a perfectly complementary polynucleotide.
As used herein, the term “kit” refers to any delivery system for delivering materials. In the context of reaction assays, such delivery systems include systems that allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, enzymes, primer set(s), etc. in the appropriate containers) and/or supporting materials (e.g., buffers, written instructions for performing the assay etc.) from one location to another. For example, kits can include one or more enclosures (e.g., boxes) containing the relevant reaction reagents and/or supporting materials. As used herein, the term “fragmented kit” refers to a delivery system comprising two or more separate containers that each contain a subportion of the total kit components. The containers may be purchased and/or delivered to the intended recipient together or separately. For example, a first container may contain an enzyme for use in an assay, while a second container contains oligonucleotides. Indeed, any delivery system comprising two or more separate containers that each contains a subportion of the total kit components are included in the term “fragmented kit.” In contrast, a “combined kit” refers to a delivery system containing all of the components of a reaction assay in a single container (e.g., in a single box housing each of the desired components). The term “kit” includes both fragmented and combined kits.
As will be appreciated by one of ordinary skill in the art, references to templates, oligonucleotides, primers, etc., generally mean populations or pools of nucleic acid molecules that are substantially identical within a relevant region rather than single molecules. For example, a “template” generally means a plurality of substantially identical template molecules; a “primer” generally means a plurality of substantially identical primer molecules, and the like.
As used herein, the terms “comprises,” “comprising,” “includes,” “including,” “has,” “having” or any other variation thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, article, or apparatus that comprises a list of features is not necessarily limited only to those features but may include other features not expressly listed or inherent to such process, method, article, or apparatus. Further, unless expressly stated to the contrary, “or” refers to an inclusive-or and not to an exclusive-or. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present).
Methods for verification of NGS sequencing results. The introduction of defined Ion AmpliSeq™ panels, amongst other commercially available assays using Next Generation Sequencing (NGS) techniques for detection and characterization of actionable mutations occurring in tumor tissue has the potential to revolutionize translational oncology research. This technology is further described in U.S. Application Ser. Nos. 61/479,952, filed on Apr. 28, 2011; 61/531,583, filed on Sep. 6, 2011; 61/531,574, filed on Sep. 6, 2011; 61/538,079, filed on Sep. 22, 2011; 61/564,763, filed on Nov. 29, 2011; 61/578,192, filed on Dec. 20, 2011; 61/594,160, filed on Feb. 2, 2012; 61/598,881, filed on Feb. 14, 2012; 61/598,892, filed on Feb. 14, 2012; 61/625,596, filed on Apr. 17, 2012; 61/639,017, filed on Apr. 26, 2012; Ser. No. 13/458,739, filed on Apr. 27, 2012; Ser. No. 13/663,334, filed on Oct. 29, 2012; Ser. No. 13/679,706, filed on Nov. 16, 2012; and an application entitled “Detection, Identification, Validation, and Enrichment of Target Nucleic Acids”, Attorney Docket No. LT00974 PRO, filed on even date; and each disclosure is hereby incorporated by reference in its entirety.
The Ion Ampliseq™ cancer hot spot panel version 2 (CHP v2) by Ion Torrent includes 207 actionable mutation targets present in 50 genes and the more comprehensive Ion Oncomine™ cancer panel (OCP) developed by Life Technologies Compendia Bioscience™ contains over 2000 mutations. A hallmark of these Ion Torrent Ampliseq cancer panels is the low amount of input DNA needed which is critical when the clinical specimen material is limited such as with fine needle biopsy, aspirates, LCM or FFPE samples. Typically, 10 ng of DNA obtained from these sources is sufficient to produce informative sequencing data. Often, cancer-causing or promoting mutations are detected at relatively low allele frequencies like 10-20% compared to the major normal allele.
New methods are needed to verify these findings of low frequency mutations by an orthologous method such as traditional dye-fluorescent Sanger sequencing on a capillary electrophoresis (CE) instrument such as the Applied Biosystems 3500 genetic analyzer.
A workflow that enables the amplification and Sanger sequencing of individual Ion AmpliSeq targets directly from the Ampliseq™ library starting material is described here. This workflow can also be used with library starting materials arising out of other Next Generation Sequencing (NGS) methods of massively parallel sequencing.
The method requires a retainer of 1 μl (˜5%) of the original Ampliseq™ preamplification material. A dilution of this aliquot is used as template source for individualized PCR/sequencing reactions. A random selection of 48 targets from the CHPv2 panel may be successfully amplified and Sanger-sequenced from an Ion Torrent Ampliseq™ library originally prepared from 10 ng of FFPE DNA. Furthermore, the successful Sanger-re-sequencing of all individual 24 targets covering the TP53 exons from the same sample processed and pre-amplified with the OncoMine AmpliSeq panel.
Taken together, this method permits reflex-test of potential mutations of interest from very material-limited specimen using Sanger CE sequencing. It provides a reflex solution for verifying and following up NGS results by Sanger sequencing particularly for samples with very limited amounts of available DNA, such as samples obtained from any of fine needle biopsies, aspirates, formalin-fixed, paraffin-embedded (FFPE), and Laser Capture Microdissection (LCM).
Additionally, this workflow offers other advantages over typical Sanger sequencing protocols, removing extra manipulations and purifications. This streamlining also is advantageous when working with quantity limited samples. For example, a typical PCR reaction uses an excess of amplification primers, some primers remain unincorporated upon completion of the PCR reaction. This necessitates removal of the excess primers before proceeding to a sequencing reaction, because the excess amplification primers will interfere with the subsequent sequencing reaction, and may produce aberrant sequence ladders. The PCR reaction furthermore contains an excess of dNTPs that can interfere with the subsequent sequencing reaction. In the current workflow, addition of a nuclease to the sequencing reaction mixture before the start of the cycle sequencing reaction, which nuclease may be but is not limited to exonuclease I, utilizes its hydrolytic properties to degrade single-stranded DNA present in the PCR mixture, thus allowing the amplification product (amplicon) to be used more efficiently in the subsequent sequencing reaction.
Resolution of nucleic acid sequence near the sequencing primer had been difficult in the past to obtain without sacrificing throughput residence time during electrophoresis. Adjustments in the type of mobility system, adjusting denaturing conditions and temperature can improve resolution but always at the expense of increased electrophoresis time. Difficulties in removal of unincorporated reactants and long residence time when performing size-dependent mobility separation contributed to inefficiencies in nucleic acid sequencing. There are several advantages of the improved BigDye® Direct amplification/sequencing workflow that addresses these problems. Only one post synthesis cleanup is needed; it can be performed very easily in the same reactor vessel. Additionally, the nature of the chemically enhanced sequencing primer produces extension products that can be more easily detected without contamination and complicating signal from excess reagent, without slowing the electrophoresis experiment.
What has been surprising discovered by Applicant is that Ampliseq™ primer design is transferable to Sanger CE sequencing. Using the advanced chemistry of BigDye Direct sequencing which streamlines the workflow as described here and in the cross-referenced applications, allows simpler, less time intensive sequencing analysis which also has very high 5′ resolution. Use of M13 tags for target specific nuclease sensitive amplification primers permits the use of M13 chemically enhanced sequencing primers, which survive in situ nuclease degradation of excess PCR amplification primers before the start of sequence fragment production. Additionally, the other modifications of the M13 chemically enhanced sequencing primers allows basecalling to begin at base number 1 of the sequence of interest. Various aspects of the use of chemically-enhanced sequencing primers and the combined steps of the Sanger sequencing workflow are further described in U.S. Application Ser. Nos. 61/026,085, filed Feb. 4, 2008; Ser. No. 12/365,140, filed Feb. 3, 2009; 61/407,899, filed Oct. 28, 2010; 61/408,553, filed Oct. 29, 2010; Ser. No. 13/284,839, filed Oct. 28, 2011; and Ser. No. 13/397,626, filed Feb. 15, 2012, and each disclosure of which is hereby incorporated by reference in its entirety.
Applicant has also surprisingly found that 1 ng of genomic DNA is sufficient for Fwd/Rev pair of sequences from a single target. Further, pre-amplification (PA) material from low complexity Ampliseq™ panels (i.e. CHP v2 and OCP) can be diluted and used as template source for re-PCR and Sanger sequencing.
High complexity Ampliseq™ panels (CCP and whole exome) has not been attempted yet for re-PCR and sequencing, but may afford access.
A method for sequencing at least one amplicon is provided which includes the steps of: providing at least one amplicon, wherein the at least one amplicon comprises a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying the at least one amplicon in a first reaction mixture which includes a plurality of nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture comprising a nuclease and at least one chemically-enhanced primer causing the plurality of nuclease sensitive amplification primers to be degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the at least one chemically-enhanced primer in a sequencing reaction; and producing extension products of the at least one chemically enhanced primer. In some embodiments, the extension products may be fluorescently labeled. In various embodiments, the first priming sequence may have been used to produce the amplicon. In various embodiments, the first priming sequence may include at least one cleavable moiety. In some embodiments, the preceding sequence may be a portion of the first priming sequence. In various embodiments of the method, the steps of contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed in the same reaction vessel. In various embodiments, the steps of amplifying the at least one amplicon, contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed without intermediate purification steps.
In various embodiments of the method, the at least one amplicon further includes a succeeding sequence 3′ to the sequence of interest wherein the succeeding sequence is complementary to a second priming sequence used to produce the at least one amplicon. The at least one amplicon may have a length of about 100 nucleotides to about 400 nucleotides. The sequence of interest of the at least one amplicon may have a length of about 100 nucleotides to about 300 nucleotides. In other embodiments, the sequence of interest of the at least one amplicon may have a length of about 125 nucleotides to about 275 or about 250 nucleotides. The at least one amplicon may be a plurality of amplicons. In some embodiments, the plurality of amplicons may include at least two different amplicons, a first having a sequence of interest that is a major variant sequence and a second amplicon having a minor variant sequence from the same region of a sample nucleic acid.
In some embodiments, the method further includes the steps of obtaining sequencing results based on the sequencing reaction; and determining a nucleotide base sequence of at least the sequence of interest based on the results. The sequencing results may be obtained via a mobility based separation method. In some embodiments, the mobility based separation method may be capillary electrophoresis. In some embodiments, the determined nucleotide base sequence of at least the sequence of interest may be compared to a second nucleotide base sequence of at least the sequence of interest obtained from a NGS method of sequencing. The NGS method of sequencing may include massively parallel sequencing techniques like sequencing by synthesis using fluorophore or semiconductor detection and pyrosequencing, to name a few. In some embodiments, the NGS method of sequencing may be semiconductor sequencing.
In various embodiments, amplifying DNA may include polymerase chain reaction amplification. In selected embodiments, the sequencing reaction may include cycle sequencing.
In various embodiments, the first reaction mixture may also include a polymerase. The polymerase may be a thermostable polymerase. In some embodiments, the polymerase may be Taq polymerase. The first reaction mixture may further include deoxynucleotide triphosphates.
In various embodiments, the second reaction mixture further comprises a polymerase, deoxynucleotide triphosphates, and dye-labelled dideoxynucleotide triphosphates. The polymerase of the second reaction mixture may be a thermostable polymerase. In some embodiments, the polymerase is Taq polymerase.
In various embodiments of the method for sequencing at least one amplicon, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I.
In various embodiments of the method for sequencing at least one amplicon, each of the plurality of nuclease-sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. In some embodiments, the plurality of nuclease-sensitive amplification primers may prime a set of sequences connected to a specific disease state.
Another method is described for confirming a DNA sequence, which includes the steps of: amplifying a sample comprising nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying a first aliquot of the plurality of amplicons in a first reaction mixture including a plurality of nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture that includes a nuclease and at least one chemically-enhanced primer, where by contacting the nuclease with the first reaction mixture, the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the at least one chemically-enhanced primer in a sequencing reaction; and producing extension products of the chemically enhanced primer. In some embodiments, the extension products may be fluorescently labeled. In various embodiments, the first priming sequence may have been used to produce the amplicon. The first priming sequence may include at least one cleavable moiety. In some embodiments, the preceding sequence may be a portion of the first priming sequence.
In various embodiments of the method, the steps of contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed in the same reaction vessel. In various embodiments, the steps of amplifying the plurality of amplicons, contacting the first reaction mixture with the second reaction mixture, inactivating the nuclease, and producing the extension products of the chemically enhanced primer may be performed without intermediate purification steps.
In various embodiments of the method, each of the plurality of amplicons further includes a succeeding sequence 3′ to the sequence of interest wherein the succeeding sequence is complementary to a second priming sequence used to produce the amplicon. Each of the plurality of amplicons may have a length of about 100 nucleotides to about 400 nucleotides. The sequence of interest of each of the plurality of amplicons may have a length of about 100 nucleotides to about 300 nucleotides. In other embodiments, the sequence of interest of each of the plurality of amplicons may have a length of about 125 nucleotides to about 275 or about 250 nucleotides. In some embodiments, the plurality of amplicons may include at least two different amplicons, a first having a sequence of interest that is a major variant sequence and a second amplicon having a minor variant sequence from the same region of a sample nucleic acid.
In some embodiments, the method further includes the steps of obtaining sequencing results based on the sequencing reaction; and determining a nucleotide base sequence of at least the sequence of interest based on the results. The sequencing results may be obtained via a mobility based separation method. In some embodiments, the mobility based separation method may be capillary electrophoresis. In some embodiments, the determined nucleotide base sequence of at least the sequence of interest may be compared to a second nucleotide base sequence of at least the sequence of interest obtained from a NGS method of sequencing performed on a second aliquot of the plurality of amplicons. The NGS method of sequencing may include massively parallel sequencing techniques like sequencing by synthesis using fluorophore or semiconductor detection and pyrosequencing, to name a few. In some embodiments, the NGS method of sequencing may be semiconductor sequencing.
In various embodiments, amplifying DNA may include polymerase chain reaction amplification. In selected embodiments, the sequencing reaction may include cycle sequencing.
In various embodiments, the first reaction mixture may also include a polymerase. The polymerase may be a thermostable polymerase. In some embodiments, the polymerase may be Taq polymerase. The first reaction mixture may further include deoxynucleotide triphosphates.
In various embodiments, the second reaction mixture further comprises a polymerase, deoxynucleotide triphosphates, and dye-labelled dideoxynucleotide triphosphates. The polymerase of the second reaction mixture may be a thermostable polymerase. In some embodiments, the polymerase is Taq polymerase.
In various embodiments of the method for confirming a DNA sequence, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I.
In various embodiments of the method for confirming a DNA sequence, each of the plurality of nuclease-sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. In some embodiments, the plurality of nuclease-sensitive amplification primers may prime a set of sequences connected to a specific disease state.
Another method is provided for preparing DNA for sequencing, including the steps of: amplifying a sample comprising nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; amplifying an aliquot of the plurality of amplicons in a first reaction mixture which includes nuclease-sensitive amplification primers to form an amplified DNA product; contacting the first reaction mixture containing the amplified DNA product with a second reaction mixture comprising a nuclease and a chemically-enhanced primer where by contacting the nuclease with the first reaction mixture, the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the amplified DNA product with the chemically-enhanced primer in a sequencing reaction; and producing extension products of the chemically enhanced primer.
Yet another method is described for sequencing and verifying a variant nucleic acid sequence of interest, including the steps: amplifying a sample which includes nucleic acid using at least a first priming sequence to provide a plurality of amplicons, where each of the plurality of amplicons includes a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; splitting the plurality of amplicons into a first aliquot and a second aliquot; amplifying the first aliquot of the plurality of amplicons in a first reaction mixture including nuclease-sensitive amplification primers to form a first amplified DNA product; contacting the first reaction mixture containing the first amplified DNA product with a second reaction mixture which includes a nuclease and a chemically-enhanced primer where by contacting the nuclease with the first reaction mixture, the nuclease sensitive amplification primers are degraded by the nuclease; inactivating the nuclease; priming the first amplified DNA product with the chemically-enhanced primer in a sequencing reaction; producing extension products of the chemically enhanced primer; obtaining sequencing results of at least the sequence of interest of the extended chemically enhanced primer using a mobility dependent separation; and determining a nucleotide base sequence of at least the sequence of interest of the extended chemically enhanced primer; amplifying the second aliquot of the amplicons to form a second DNA product; obtaining sequencing results of at least the sequence of interest of the second DNA product using a NGS sequencing method; and verifying a nucleotide sequence of the second DNA product by comparing it to the nucleotide base sequence of at least the sequence of interest of the extended chemically enhanced primer. In various embodiments of the method, the step of amplifying the second aliquot of the plurality of amplicons to form a second DNA product further comprises at least one of ligating adaptors, binding to beads, and ligating barcodes.
For any of the methods described above and throughout this disclosure, the nucleic acid can also be amplified using other methods such as, for example, multiple strand displacement amplification, helicase displacement amplification, nick translation, Q beta replicase amplification, rolling circle amplification, and other isothermal amplification methods. The nucleic acid to be amplified can comprise, for example, RNA, DNA, cDNA, genomic DNA, viral DNA, plasmid DNA, recombinant DNA, amplicon DNA, synthetic DNA or the like.
For any of the methods described above and throughout this disclosure, templates to be sequenced can be synthesized by PCR in individual aqueous compartments (also called “reactors”) of an emulsion. In some embodiments, the compartments can each contain a particulate support such as a bead having a suitable first amplification primer attached thereto, a first copy of the template, a second amplification primer, and components needed for a PCR reaction (for example nucleotides, polymerase, cofactors, and the like). Methods for preparing emulsions are described, for example, in U.S. Pat. No. 6,489,103 B1, U.S. Pat. No. 5,830,663, and in U.S. Patent Application Publication No. US 2004/0253731. Methods for performing PCR within individual compartments of an emulsion to produce clonal populations of templates attached to microparticles are described, for example, in Dressman, D., et al, Proc. Natl. Acad. Sci., 100(15):8817-8822, 2003, and in PCT publication WO2005010145. All of the patents, applications, publications, and articles described herein are incorporated in their entireties by reference.
According to various embodiments, the amplification primers can comprise tailed primers. The tailed primers can be used, for example, to generate a target specific amplicon that incorporates nucleic acid sequence capable of annealing to a universal primer or a gene specific primer.
For any of the methods described above and throughout this disclosure, nucleases suitable for use in the subject methods preferentially degrade single-stranded polynucleotides over double-stranded polynucleotides, thus destroying excess primers while leaving intact double-stranded amplicons available for sequencing in subsequent steps. In various embodiments, the nuclease enzyme can comprise, for example, exonuclease I. Exonuclease I can be obtained from various commercial suppliers, for example from USB Corp., Cleveland, Ohio. Appropriate reaction conditions can include, for example, optimal time, temperature, and buffer parameters to provide for nuclease enzyme activity. In some embodiments, for example, excess amplification primer can be degraded by adding exonuclease Ito the amplification reaction product and incubating at about 37° C. for about 10 to about 30 min. Exonuclease I can hydrolyze single-stranded DNA in a 3′→5′ direction. The exonuclease I can be sensitive to heat inactivation and can be essentially 100 percent deactivated by heating, for example, heating at about 80° C. for about 15 minutes. Other heat inactivated nucleases may be used in the subject methods and compositions including but not limited to Exo III, Pfu or DNA pol I. In various embodiments, the inactivation of the nuclease can occur within the vesicle and in the same reaction step as the sequencing reaction
The chemically-enhanced sequencing primer can be essentially non-degraded by a reaction mixture comprising a nuclease, for example, exonuclease I, under reaction conditions at which excess amplification primer can be degraded by the nuclease. By “essentially non-degraded” it is intended that any degradation that takes place of the chemically-enhanced sequencing primer is not of a level that significantly interferes with the process employed to generate sequencing and/or fragment analysis data in the subsequent sequencing reactions or fragment analysis reactions. In some embodiments, the chemically-enhanced sequencing primer can comprise one of more nuclease-resistant internucleotide linkage(s). For example, the internucleotide linkage may be a phosphorothioate linkage. In some embodiments, the chemically-enhanced sequencing primer can comprise a nuclease-resistant internucleotide linkage at a terminal 3′ end, at a terminal 5′ end, and/or at one or more internal linkage sites. In some embodiments, the nuclease resistant internucleotide linkage is at least one phosphorothioate linkage. Chemically-enhanced sequencing primers were synthesized having one or two phosphorothioate linkages on the terminal 3′ end to protect the chemically-enhanced sequencing primers from exonuclease I digestion. The Sp stereoisomer can protect the primer from exonuclease I digestion but the Rp steroisomer was found to provide no protection from exonuclease I digestion (data not shown).
For any of the methods described above and throughout this disclosure, the mobility-dependent separation is selected from separation by charge and separation by size, wherein the separation by size plus charge is selected from gel electrophoresis and capillary electrophoresis and separation by size is by a liquid gradient, and a denaturing gradient medium. I he sequencing reaction products can be analyzed on a sieving or non-sieving medium. In some embodiments of these teachings, for example, the PCR products can be analyzed by electrophoresis; e.g., capillary electrophoresis, as described in H. Wenz et al. (1998), G
For any of the methods described above and throughout this disclosure, each of the ddNTPs can be labeled with a different fluorescent dye (ddNTP-dye). For example, the ddNTPs can comprise BigDye® ddNTPs, available from Applied Biosystems, Foster City, Calif. In some embodiments, the chemically-enhanced primer can be labeled with a fluorescent dye. The label can be attached to the oligonucleotide sequence and/or the NCM region of the chemically-enhanced primer.
For any of the methods described above and throughout this disclosure, the chemically-enhanced primer may include an oligonucleotide sequence, a NCM and none or at least one nuclease-resistant linkage. In some embodiments, the chemically-enhanced primer may include one nuclease-resistant linkage at a terminal 3′ end. The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within a oligonucleotide sequence of the chemically-enhanced primer. In some embodiments the plurality of NCMs may be at a terminal 5′ end. In various embodiments, the NCM may be a (Cn) spacer wherein n is any integer from 1 to 9. The NCM may include a plurality of (Cn) spacers. In various embodiments, the chemically-enhanced primer may have a structure of the formula: (Cn)x-OLIGO , where (Cn)x has a structure of the following formula:
where each instance of n may independently be an integer of 1 to 9; and x may be an integer of 1 to about 30;
OLIGO has a structure of the following formula:
In some embodiments, the chemically enhanced primer may have any structure as described in this disclosure.
Other types of chemically-enhanced primers can be utilized within the scope of the present teachings. For example, a nuclease resistant sequencing primer can comprise an alkyl phosphonate monomer, RO—P(═O)(—Me)(—OR), such as dA-Me-phosphonamidite, and/or a triester monomer, RO—P(═O)(—OR′)(—OR), such as dA-Me-phophoramidite (available from Glen Research, Sterling, Va.), and/or a locked nucleic acid monomer (available from Exiqon, Woburn, Mass.), and/or a boranophosphate monomer, RO—P(—BH3)(═O)(—OR), as described by Shaw, Barbara Ramsey, et al., in “Synthesis of Boron-Containing ADP and GDP Analogues: Nucleoside 5′-(P-Boranodisphosphates)”, Perspectives in Nucleoside and Nucleic Acid Chemistry, pg. 125-130, (2000), or the like.
In another method, one or more chemically-enhanced primers may be used for ligation extension reactions. In some embodiments, the chemically-enhanced primer for use in a ligation extension reaction is labeled fluorescently. In some embodiments, the ligation extension chemically-enhanced primer is labeled fluorescently at a 3′ terminus.
Polymerases useful in the methods. A variety of nucleic acid polymerases may be used in the methods described herein. For example, the nucleic acid polymerizing enzyme can be a thermostable polymerase or a thermally degradable polymerase. Suitable thermostable polymerases include, but are not limited to, polymerases isolated from Thermus aquaticus, Thermus thermophilus, Pyrococcus woesei, Pyrococcus furiosus, Thermococcus litoralis, and Thermotogo maritima. Suitable thermodegradable polymerases include, but are not limited to, E. coli DNA polymerase I, the Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, and others. Examples of other polymerizing enzymes that can be used in the methods described herein include T7, T3, SP6 RNA polymerases and AMV, M-MLV and HIV reverse transcriptases.
Non-limiting examples of commercially available polymerases that can be used in the methods described herein include, but are not limited to, TaqFS®, AmpliTaq® CS (Applied Biosystems), AmpliTaq FS (Applied Biosystems), AmpliTaq Gold ® (Applied Biosystems), Kentaq1 (AB Peptide, St. Louis, Mo.), Taquenase (ScienTech Corp., St. Louis, Mo.), ThermoSequenase (Amersham), Bst polymerase, VentR(exo−) DNA polymerase, Reader™ Taq DNA polymerase, VENT™ DNA polymerase (New England Biolabs), DEEPVENT™ DNA polymerase (New England Biolabs), PFUTurbo™ DNA polymerase (Stratagene), Tth DNA polymerase, KlenTaq-1 polymerase, SEQUENASE™ 1.0 DNA polymerase (Amersham Biosciences), and SEQUENASE 2.0 DNA polymerase (United States Biochemicals).
Uses for the methods. Optionally, the method further includes detecting and/or identifying mutations present in the sample identified through nucleic acid sequencing of the amplified target sequence.
In some embodiments, target sequences or amplified target sequences are directed to mutations associated with cancer. In some embodiments, the target sequences or amplified target sequences are directed to mutations associated with one or more cancers selected from the group consisting of head and neck cancers, brain cancer, breast cancer, ovarian cancer, cervical cancer, colorectal cancer, endometrial cancer, gallbladder cancer, gastric cancer, bladder cancer, prostate cancer, testicular cancer, liver cancer, lung cancer, kidney (renal cell) cancer, esophageal cancer, pancreatic cancer, thyroid cancer, bile duct cancer, pituitary tumor, wilms tumor, kaposi sarcoma, osteosarcoma, thymus cancer, skin cancer, heart cancer, oral and larynx cancer, leukemia, neuroblastoma and non-hodgkin lymphoma. In one embodiment, the mutations can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In one embodiment, the mutations can include variation in copy number. In one embodiment, the mutations can include germline or somatic mutations. In one embodiment, the mutations associated with cancer are located in at least one of the genes provided in Tables 1 or 4 of U.S. Patent Publication 20120295819, or provided in Table 7 of U.S. Application No. 61/598,881, each hereby incorporated by reference in its entirety. In some embodiments, the mutations can be any of the genomic coordinates provided in Table 5 of U.S. Patent Publication 20120295819, or provided in Table 7 of U.S. Application 61/598,881, each hereby incorporated by reference in its entirety. In some embodiments, the target sequences directed to mutations associated with cancer can include any one or more of the mutations provided in Table 10 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, the mutations can be found within any one or more of the genomic coordinates provided in Table 16 or Table 18 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety.
In some embodiments, the mutations associated with cancer are located in at least one of the genes selected from ABI1; ABL1; ABL2; ACSL3; ACSL6; AFF1; AFF3; AFF4;AKAP9; AKT1; AKT2; ALK; APC; ARHGAP26; ARHGEF12; ARID1A; ARNT; ASPSCR1; ASXL1; ATF1; ATIC; ATM; AXIN2; BAP1; BARD1; BCAR3; BCL10; BCL11A; BCL11B; BCL2; BCL3; BCL6; BCL7A;BCL9; BCR; BIRC3; BLM; BMPR1A; BRAF; BRCA1; BRCA2; BRD3; BRD4; BRIP1; BUB1B; CARD11; CARS; CASC5; CBFA2T3; CBFB; CBL; CBLB; CBLC; CCDC6; CCNB1IP1; CCND1; CCND2; CD74; CD79A; CDC73; CDH1; CDH11; CDK4; CDK6; CDKN2A; CDKN2B; CDKN2C; CDX2; CEBPA; CEP110; CHEK1; CHEK2; CHIC2; CHN1; CIC; CIITA; CLP1; CLTC; CLTCL1; COL1A1; CREB1; CREB3L2; CREBBP; CRTC1; CRTC3; CSF1R; CTNNB1; CXCR7; CYLD; CYTSB; DCLK3; DDB2; DDIT3; DDR2; DDX10; DDX5; DDX6; DEK; DGKG; DICER1; DNMT3A; EGFR; EIF4A2; ELF4; ELL; ELN; EML4; EP300; EPS15; ERBB2; ERBB4; ERC1; ERCC2; ERCC3; ERCC4; ERCC5; ERG; ETV1; ETV4; ETV5; ETV6; EWSR1; EXT1; EXT2; EZH2; FAM123B; FANCA; FANCC; FANCD2; FANCE; FANCF; FANCG; FAS; FBXW7; FCRL4; FGFR1; FGFR1OP; FGFR2; FGFR3; FH; FIP1L1; FLCN; FLI1; FLT1; FLT3; FNBP1; FOXL2; FOXO1; FOXO3; FOXO4; FOXP1; FUS; GAS7; GATA1; GATA2; GATA3; GMPS; GNAQ; GNAS; GOLGA5; GOPC; GPC3; GPHNGPR124; HIP1; HIST1H41; HLF; HNF1A; HNRNPA2B1; HOOK3; HOXA11; HOXA13; HOXA9; HOXC11; HOXC13; HOXD13; HRAS; HSP90AA1; HSP90AB1; IDH1; IDH2; IKZF1; IL2; IL21R; IL6ST; IRF4; ITGA10; ITGA9; ITK; JAK1; JAK2; JAK3; KDM5A; KDM5C; KDM6A; KDR; KDSR; KIAA1549; KIT; KLF6; KLK2; KRAS; KTN1; LASP1; LCK; LCP1; LHFP; LIFR; LMO2; LPP; MAF; MALT1; MAML2; MAP2K1; MAP2K4; MDM2; MDM4; MECOM; MEN1; MET; MITF; MKL1; MLH1; MLL; MLLT1; MLLT10; MLLT3; MLLT4; MLLT6; MN1; MPL; MRE11A; MSH2; MSH6; MSI2; MSN; MTCP1; MTOR; MUC1; MYB; MYC; MYCL1; MYCN; MYH11; MYH9; MYST3; MYST4; NACA; NBN; NCOA1; NCOA2; NCOA4; NEK9; NF1; NF2; NFE2L2; NFKB2; NIN; NKX2-1; NLRP1; NONO; NOTCH1; NOTCH2; NPM1; NR4A3; NRAS; NSD1; NTRK1; NTRK3; NUMA1; NUP214; NUP98; OLIG2; OMD; PAFAH1B2; PALB2; PATZ1; PAX3; PAX5; PAX7; PAX8; PBRM1; PBX1; PCM1; PDE4DIP; PDGFB; PDGFRA; PDGFRB; PER1; PHOX2B; PICALM; PIK3CA; PIK3R1; PIM1; PLAG1; PML; PMS1; PMS2; POU2AF1; POU5F1; PPARG; PPP2R1A; PRCC; PRDM16; PRF1; PRKAR1A; PRRX1; PSIP1; PTCH1; PTEN; PTPN11; RABEP1; RAD50; RAD51L1; RAF1; RANBP17; RAP1GDS1; RARA; RB1; RBM15; RECQL4; REL; RET; RHOH; RNF213; ROS1; RPN1; RPS6KA2; RUNX1; RUNX1T1; SBDS; SDHAF2; SDHB; SETD2; SFPQ; SFRS3; SH3GL1; SLC45A3; SMAD4; SMARCA4; SMARCB1; SMO; SOCS1; SRC; SRGAP3; SS18; SS18L1; STIL; STK11; STK36; SUFU; SYK; TAF15; TAF1L; TAL1; TAL2; TCF12; TCF3; TCL1A; TET1; TET2; TEX14; TFE3; TFEB; TFG; TFRC; THRAP3; TLX1; TLX3; TMPRSS2; TNFAIP3; TOP1; TP53; TPM3; TPM4; TPR; TRIM27; TRIM33; TRIP11; TSC1; TSC2; TSHR; USP6; VHL; WAS; WHSC1L1; WRN; WT1; XPA; XPC; ZBTB16; ZMYM2; ZNF331; ZNF384; and ZNF521.
In some embodiments, the mutations associated with cancer are located in at least one of the genes selected from ABL1; AKT1; ALK; APC; ATM; BRAF; CDH1; CDKN2A; CSF1R; CTNNB1; EGFR; ERBB2; ERBB4; FBXW7; FGFR1; FGFR2; FGFR3; FLT3; GNAS; HNF1A; HRAS; IDH1; JAK2; JAK3; KDR; KIT; KRAS; MET; MLH1; MPL; NOTCH1; NPM1; NRAS; PDGFRA; PIK3CA; PTEN; PTPN11; RB1; RET; SMAD4; SMARCB1; SMO; SRC; STK11; TP53; and VHL.
In some embodiments, the amplified target sequences are directed to any one of more of the genomic coordinates provided in Tables 5, 7 or 18 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, any one or more of the cancer target-specific primers provided in Tables 2, 3, 6 or 17 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety, can be used to amplify a target sequence present in a sample as disclosed by the methods described herein.
In some embodiments, the cancer target-specific primers from Tables 2, 3, 6, or 17 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety, can include 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 40, 60, 80, 100, 150, 200, 400, 500, 800, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000, 10,000, 11,000, 12,000, 13,000 or more, target-specific primers. In some embodiments, the amplified target sequences can include any one or more of the amplified target sequences generated at the genomic coordinates (using amplicon ID target-specific primers) provided in Tables 5, 7, 10 or 18 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, at least one of the target-specific primers associated with cancer is at least 90% identical to at least one nucleic acid sequence selected from SEQ ID NOs: 1-103,143 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, at least one of the target-specific primers associated with cancer is complementary across its entire length to at least one target sequence in a sample. In some embodiments, at least one of the target-specific primers associated with cancer includes a non-cleavable nucleotide at the 3′ end. In some embodiments, the non-cleavable nucleotide at the 3′ end includes the terminal 3′ nucleotide. In one embodiment, the amplified target sequences are directed to individual exons having a mutation associated with cancer. In some embodiments, the disclosure relates generally to the selective amplification of more than one target sequences in a sample and the detection and/or identification of mutations associated with cancer. In some embodiments, the amplified target sequences include two or more nucleotide sequences provided in Table 2 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, the amplified target sequences can include any one or more the amplified target sequences generated at the genomic coordinates using the amplicon ID target-specific primers provided in Table 5 of U.S. Patent Publication 20120295819, or provided in Table 7 of US Application 61/598,881, each of which is hereby incorporated by reference in its entirety. In one embodiment, the amplified target sequences include 100, 200, 500, 1000, 2000, 3000, 6000, 8000, 10,000, 12,000, or more amplicons from Tables 1-5 of U.S. Patent Publication 20120295819, or Tables 6 and 7 of US Application 61/598,881 hereby incorporated by reference in their entireties. In some embodiments, the disclosure relates generally to the detection and optionally, the identification of clinically actionable mutations. As defined herein, the term “clinically actionable mutations” includes mutations that are known or can be associated by one of ordinary skill in the art with, but not limited to, prognosis for the treatment of cancer. In one embodiment, prognosis for the treatment of cancer includes the identification of mutations associated with responsiveness or non-responsiveness of a cancer to a drug, drug combination, or treatment regime. In one embodiment, the disclosure relates generally to the amplification of a plurality of target sequences from a population of nucleic acid molecules linked to, or correlated with, the onset, progression or remission of cancer.
In some embodiments, target-specific primers are designed using the primer criteria disclosed herein. In some embodiments, target-specific primers are designed using the primer criteria disclosed herein and directed to one or more genes associated with breast cancer. In some embodiments, target-specific primers associated with breast cancer include at least one target-specific primer selected from one or more genes selected from the group consisting of AIM1, AR, ATM, BARD1, BCAS1, BRIP1, CCND1, CCND2, CCNE1, CDH1, CDK3,CDK4,CDKN2A, CDKN2B, CAMK1D, CHEK2, DIRAS3, EGFR, ERBB2, EPHA3, ERBB4, ETV6, GNRH1, KCTD9, CDCA2, EBF2, EMSY, BNIP3L, PNMA2, DPYSL2, ADRA1A, STMN4, TRIM35, PAK1, AQP11, CLSN1A, RSF1, KCTD14, THRSP, NDUFC2, ALG8, KCTD21, USP35, GAB2, DNAH9, ZNF18, MYOCD, STK11, TP53, JAK1, JAK2, MET, PDGFRA, PML, PTEN, RET, TMPRSS2, WNK1, FGFR1, IGF1R, PPP1R12B, PTPRT, GSTM1, IPO8, MYC, ZNF703, MDM1, MDM2, MDM4,MKK4, P14KB, NCOR1, NBN, PALB2, RAD50, RAD51, PAK1,RSF1, INTS4, ZMIZ1, SEPHS1, FOXM1, SDCCAG1, IGF1R, TSHZ2, RPSK6K1, PPP2R2A, MTAP, MAP2K4, AURKB, BCL2, BUB1, CDCA3, CDCA4, CDC20, CDC45, CHEK1, FOXM1, HDAC2, IGF1R, KIF2C, KIFC1, KRAS, RB1, SMAD4, NCOR1, UTX, MTHDFD1L, RAD51AP1, TTK and UBE2C.
In some embodiments, the disclosure relates generally to the amplification of target sequences directed to mutations associated with a congenital or inherited disease. In some embodiments, the disclosure can include the amplification of target sequences directed to somatic or germline mutations. In some embodiments, the mutations can be autosomal dominant or autosomal recessive. In one embodiment, the mutations associated with a congenital or inherited disease are located in at least one of the genes or diseases provided in Table 4 of U.S. Patent Publication 20120295819, hereby incorporated by reference in its entirety. In some embodiments, the disclosure relates to the amplification of target sequences in a sample associated with one or more inherited diseases selected from the group consisting of Adenosine Aminohydrolase Deficiency (ADA); Agammaglobulinemia, X-linked, Type 1; Alagille Syndrome; All Hypertrophic and Dilated Cardiomyopathy; Alopecia Universalis Congenita (ALUNC); Alpers Syndrome; Alpha-1-Antitrypsin Deficiency; Alpha-Thalassemia-Southeast Asia; Amyotrophic Lateral Sclerosis-Lou Gehrig's Disease; Androgen Insensitivity Syndrome; Aniridia; Ankylosing spondylitis; APC-Associated Polyposis Conditions; Argininosuccinate Lyase Deficiency; Arrhythmogenic Right Ventricular Dysplasia/Cardiomyopathy; Ataxia with Oculomotor Apraxia Type 2; Ataxia with Vitamin E Deficiency; Ataxia-Telangiectasia; Autoimmune Polyendocrine Syndrome; Beta-Hydroxyisobutyryl CoA Deacylase deficiency (HIBCH deficiency); Biotinidase Deficiency; Blepharophimosis-ptosis-epicanthus inversus; Bloom Syndrome; Brachydactyly; Brachydactyly-Hypertension Syndrome; Brachydactyly Type B1; Branchiootorenal Spectrum Disorders; BRCA1; Campomelic Dysplasia; Canavan; Cerebrotendinous Xanthomatosis; Ceroid-lipofuscinoses-Batton; Charcot-Marie-Tooth Disease Type 2B; Charcot-Marie-Tooth Neuropathy Type 1B; Charcot-Marie-Tooth Neuropathy Type 2A2; Charge Syndrome; Cherubism; Choroideremia; Citrin Deficiency; Citrullinemia Type I; Coffin-Lowry Syndrome; Cohen Syndrome; Collagen 4A5; Common Variable Immune Deficiency; Congenital Adrenal Hyperplasia; Congenital Cataracts, Facial Dysmorphism, and Neuropathy; Congenital Disorder of Glycosylation Type 1a; Congenital Myasthenic Syndromes; Cornelia de Lange Syndrome; Cystic fibrosis; Cystinosis; Darier Disease; Desmin Storage Myopathy; DFNA2 Nonsyndromic Hearing Loss; Diamond-Blackfan Anemia; Double Cortex Syndrome; Duane Syndrome; Duchenne/Becker muscular dystrophy; Dysferlinopathy; Dyskeratosis Congenita; Early-Onset Familial Alzheimer Disease; Early-Onset Primary Dystonia (DYT1); Ehlers Danlos; Ehlers-Danlos Syndrome, Classic Type; Ehlers-Danlos Syndrome, Hypermobility Type; Ehlers-Danlos Syndrome, Kyphoscoliotic Form; Emery-Dreifuss Muscular Dystrophy X linked; Epidermolysis Bullosa Simplex; Fabry Disease; Facioscapulohumeral Muscular Dystrophy; Familial Dysautonomia (HSAN III); Familial Hyperinsulinism (FHI); Familial Hypertrophic Cardiomyopathy; Familial Transthyretin Amyloidosis; Fanconi Anemia; Fragile X; Friedreich Ataxia; FRMD7-Related Infantile Nystagmus; Fryns Syndrome; Galactosemia; Gaucher Disease; Glycine Encephalopathy; Glycogen Storage Disease Type VI; Hemophagocytic Lymphohistiocytosis; Hemophilia A; Hemophilia B; Hepatic Veno-Occlusive Disease with Immunodeficiency; Hereditary Hemorrhagic Telangiectasia; Hereditary Neuropathy with Liability to Pressure Palsies; Hereditary Nonpolyposis Colon Cancer; Hexosaminidase A Deficiency; HFE-Associated Hereditary Hemochromatosis; Holt-Oram Syndrome; Huntington Disease; Hydroxymethylbilane Synthase (HMBS) Deficiency; Hypophosphatasia; Inclusion Body Myopathy 2; Incontinentia Pigmenti; Juvenile Polyposis Syndrome; Kallmann Syndrome; Leber Congenital Amaurosis; Leber congenital amaurosis 10; Li-Fraumeni Syndrome; Limb-Girdle Muscular Dystrophy Type 2A Calpainopathy; LIS1-Associated Lissencephaly; Long QT Syndrome; Lowe Syndrome; Malignant Hyperthermia Susceptibility; Maple Syrup Urine Disease; MAPT-Related Disorders; McKusick-Kaufman Syndrome; MECP2-Rett Syndrome; Menkes; Metachromatic Leukodystrophy; Methylmalonic Acidemia; Mucolipidosis II; Multiple Endocrine Neoplasia Type 1; Multiple Endocrine Neoplasia Type 2; Myotonia Congenita; Myotonic Dystrophy Type 1; Myotonic Dystrophy Type 2; Nail-Patella Syndrome; Nemaline Myopathy; Neurofibromatosis 1; Neurofibromatosis 2; Noonan Syndrome; Ocular Albinism, X-Linked; Oculocutaneous Albinism Type 1; Oculocutaneous Albinism Type 2; Oculopharyngeal Muscular Dystrophy; Optic Atrophy Type 1; Ornithine Transcarbamylase Deficiency; Osteogenesis Imperfecta; Parkinson Disease; Pendred Syndrome; Peroxisome Biogenesis, Zellweger; Phenylketonuria; Polycystic Kidney Disease; Pompe Disease-GSD II; Primary Ciliary Dyskinesia; Retinitis Pigmentosa; Retinoblastoma; Saethre-Chotzen Syndrome; SCN9A-Related Inherited Erythromelalgia; SHOX-Related Haploinsufficiency; Sickle Cell Disease; Smith-Lemli-Opitz Syndrome; Smith-Magenis Syndrome; Sotos Syndrome; Spastic Paraplegia 3A; Spastic Paraplegia 7; Spastic Paraplegia 8; Spastic Paraplegia Type 1; Spastic Paraplegia Type 4; Spinal Muscular Atrophy; Spinocerebellar Ataxia 2; Spinocerebellar Ataxia 3; Spinocerebellar Ataxia 7; Spinocerebellar Ataxia Type 1; Stickler Syndrome; Thanatophoric Dysplasia; Thoracic Aortic Aneurysms and Aortic Dissections; Treacher Collins Syndrome; Trimethylaminuria; Tuberous Sclerosis Complex; Udd Distal Myopathy; Usher Syndrome type 1; Very Long Chain Acyl-Coenzyme A Dehydrogenase Deficiency; von Hippel-Lindau; Waardenburg Syndrome, Type 1; Werner Syndrome; Wilms Tumor; Wilson Disease; Wiskott-Aldrich; X-Linked Adrenal Hypoplasia Congenita; X-Linked Adrenoleukodystrophy; X-Linked Dystonia-Parkinsonism; X-linked Juvenile Retinoschisis; X-linked myotubular Myopathy; X-Linked SCIDS; and Zellweger Syndrome.
In one embodiment, the mutations associated with a congenital or inherited disease can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In some embodiments, the mutations associated with an inherited or congenital disease includes copy number variation. In some embodiments, the disclosure relates generally to the selective amplification of at least one target sequence and the detection and/or identification of mutations associated with an inherited disease. In some embodiments, the mutations associated with a congenital or inherited disease can be located in one or more of the genes selected from the group consisting of ABCA4; ABCC8; ABCD1; ACADVL; ACTA2; ACTC; ACTC1; ACVRL1; ADA; AIPL1; AIRE; ALK1; ALPL; AMT; APC; APP; APTX; AR; ARL6; ARSA; ASL; ASPA; ASS; ASS1; ATL; ATM; ATP2A2; ATP7A; ATP7B; ATXN1; ATXN2; ATXN3; ATXN7; BBS6; BCKDHA; BCKDHB; BEST1; BMPR1A; BRCA1; BRCA2; BRIP1; BTD; BTK; C2orf25; CA4; CALR3; CAPN3; CAV3; CCDC39; CCDC40; CDH23; CEP290; CERKL; CFTR; CHAT; CHD7; CHEK2; CHM; CHRNA1; CHRNB1; CHRND; CHRNE; CLCN1; CNBP; CNGB1; COH1; COL11A1; COL11A2; COL1A1; COL1A2; COL2A1; COL3A1; COL4A5; COL5A1; COL5A2; COL7A1; COL9A1; CRB1; CRX; CTDP1; CTNS; CYP21A2; CYP27A1; DAX1; DBT; DCX; DES; DHCR7; DJ1; DKC1; DLD; DMD; DMPK; DNAAF1; DNAAF2; DNAH11; DNAH5; DNAI1;DNA12; DNAL1;DNM2; DOK7; DSC2; DSG2; DSP; DYSF; DYT1; EMD; ENG; EYA1; EYS; F8; F9; FANCA; FANCC; FANCF; FANCG;FANCJ; FANDC2; FBN1; FBXO7; FGFR1; FGFR3; FMO3; FMR1; FOXL2; FRG1; FRMD7; FSCN2; FXN; GAA; GALT; GBA; GBE1; GCSH; GDFS; GJB2; GJB3; GJB6; GLA; GLDC; GNE; GNPTAB; GPC3; GPR143; GUCY2D; HBA1; HBA2; HBB; HD; HERG; HEXA; HFE; HHF; HIBCH; HLA-B27; HMBS; HPLH1; HPRP3; HR; HTNB; HTT; IKBKAP; IKBKG; IL2RG; IMPDH1; ITGB4; JAG1; JPH3; KCNE1; KCNE2; KCNH2; KCNQ1; KCNQ4; KIAA0196; KLHL7; KRAS; KRT14; KRT5; L1CAM; LAMB3; LAMP2; LDB3; LMNA; LMX18; LRAT; LRRK2; MAPT; MC1R; MECP2; MED12; MEN1; MERTK; MFN2; MKKS; MLH1; MMAA; MMAB; MMACHC; MMADHC; MPZ; MSH2; MTM1; MTND5; MTTG; MTTI; MTTK; MTTL1; MTTQ; MUT; MYBPC3; MYH11; MYH6; MYH7; MYL2; MYL3; MYLK2; MYO7A; NDS; ND6; NEMO; NF1; NF2; NIPBL; NROB1; NR2E3; NRAS; NSD1; OCA2; OCRL; OPA1; OTC; PABPN1; PAFAH1B1; PAH; PARK2; PARK7; PARKIN; PAX3; PAX6; PCDH15; PEX1; PEX2; PEX10; PEX13; PEX14; PEX19; PEX26; PEX3; PEX5; PINK1; PKD1; PKD2; PKD3; PKHD1; PKP2; PLEC1; PLOD1; PMM2; PMP22; POLG; PPT1; PRCD; PRKAG2; PRNP; PROM1; PRPF3; PRPF8; PRPH2; PRPN; PSEN1; PSEN2; PTCH1; PTPN11; RAB7A; RAF1; RAI1; RAPSN; RB1; RDH12; RDS; RECQL3; RET; RHO; ROR2; RP1; RP2; RP9; RPE65; RPGR; RPGRIP1; RPL11; RPL35A; RPS10; RPS17; RPS19; RPS24; RPS26; RPS6KA3; RPS7; RPSL5; RS1; RSPH4A; RSPH9; RYR1; RYR2; SALL4; SCA3; SCNSA; SCN9A; SEMA4A; SERPINA1; SERPING1; SGCD; SH3BP2; SHOX; SIX1; SIX5; SLC25A13; SLC25A4; SLC26A4; SMAD4; SMN1; SNCA; SNRNP200; SOD1; SOS1; SOX9;SP110; SPAST; SPATA7; SPG3A; SPG4; SPG7; TAF1; TBXS; TCOF1; TGFBR1; TGFBR2; TNFRSC13C; TNNC1; TNNI3; TNNT1; TNNT2; TNXB; TOPORS; TOR1A; TP53; TPM1; TRNG; TRNI; TRNK; TRNL1; TRNQ; TSC1; TSC2; TTN; TTPA; TTR; TULP1; TWIST1; TXNDC3; TYR; USH1C; USH1H; USH2A; VCL; VHL; VPS1313; WAS; WRN; WT1; and ZNF9. [00156]Amplicon used as input to the verification methods. The pre-amplified nucleic acid used in the verification and sequencing methods of the invention may be obtained from many sources. The amplicon may be produced by PCR amplification of a size limited sample, including but not limited to the preamplification methods referred to here as the Ampliseq panels or assays. The amplicon may also be produced by bridge amplification such as may be used in Sequencing by Synthesis methods of sequencing. The amplicon may be produced via emulsion PCR while attached to a bead or surface. The amplicon may be produced by any form of amplification that can increase the amount of size limited sample to afford both sequencing via massively parallel processes as well as permitting the reserve of an aliquot of the preamplified sample to be used in the resequencing and verification methods of this invention.
Since the amplicon may be produced by many methods, the nature of its structure may be varied. The amplicon has at least a sequence of interest and a preceding sequence 5′ to the sequence of interest. This preceding sequence is introduced during the process used to preamplify the size limited sample. As such, the 5′preceding sequence itself may include two distinct regions; a process derived sequence portion including all or part of a 5′ portion of the 5′preceding sequence and a sequence specific region including all or a part of the 3′ portion of the 5′preceding sequence.
The 5′ process derived sequence region of the 5′ preceding sequence may have a wide variety of sequence types. The particular sequence depends on the process used for the preamplication. This 5′ process derived sequence region may be a “universal” primer sequence, a bar code sequence, a pull out sequence, and adaptor, a sequence used to immobilize the precursor sequence used to pre-amplify the limited sample, or some combination. The 5′ process derived sequence may be incorporated thru polymerase extension of a precursor species or by another type of incorporation, including but not limited to ligation. Each of these process derived sequences can be used to more selectively re-sequence, confirm or verify an initial sequencing analysis.
The 3′ sequence specific region of the 5′preceding sequence may be the portion of the primer that actually primes the specific extension of the primer species and thus provides for the expansion of the size limited sample. The 3′ portion can be used to focus the output of the preamp towards a preselected set of loci to be interrogated in the sequencing methods, for example, as in the AmpliSeq™ Cancer Hotspot Panel v.2.
Alternatively, the 5′ preceding sequence may have only a sequence specific region which includes the entire 5′ preceding sequence. For example, an amplicon produced by extension of a primer having a target-sequence specific oligonucleotide sequence would not have a process derived sequence portion, only a target-sequence specific oligonucleotide sequence for its entire length.
Chemically enhanced Primer. According to various embodiments of the present teachings, provided is a chemically-enhanced primer comprising an oligonucleotide sequence, a negatively charged moiety (NCM) and at least one nuclease-resistant linkage.
In some embodiments the at least one nuclease-resistant linkage includes but is not limited to at least one phosphorothioate linkage (PS) or at least one boronophosphate linkage. In other embodiments the nuclease-resistant linkage is not present in the chemically-enhanced primer. In yet other embodiments, a chemically-enhanced primer may comprise an oligonucleotide sequence, a negatively charged moiety (NCM), where the oligonucleotide inter-nucleotide linkages consist of phosphodiester inter-nucleotide linkages.
The primer can be used to prime a target nucleic acid in a sequencing reaction, herein referred to as a chemically-enhanced sequencing primer or for fragment analysis, herein referred to as a chemically-enhanced extension primer. The oligonucleotide sequence can be a universal primer or a gene specific nucleotide sequence. Examples of universal primers include but are not limited to M13 (P/N 402071 and 402072, Applied Biosystems), US1 (UNISEQ, PLoS Medicine 3(10)e431 (2006)), T7 (P/N 402126, but without dye, Applied Biosystems), SP6 (P/N 402128, but without dye, Applied Biosystems), and T3 (P/N 402127, but without dye, Applied Biosystems). The sequences for M13, T7, SP6 and T3 are shown in Table 1.
The oligonucleotide sequence can also contain a dye-label such as a fluorescent label. In various embodiments of the present teachings the NCM can be located at the terminal 5′ end of the oligonucleotide sequence or within the oligonucleotide sequence. Examples of NCM include but are not limited to a phosphodiester moiety having a structure of the formula
(which is introduced to the chemically-enhanced primer by reacting a phosphoramidite7(available from Glen Research) with an appropriate reaction partner containing an oligonucleotide) referred to here as a (C)n spacer, wherein n can be from 1-12, the amino acids aspartic acid and glutamic acid as well as nucleotides and nucleotide analogs (dATP, dCTP, dGTP and dTTP). The NCM can contain only one negatively charged monomer or a plurality of negatively charged moieties, for example at least five, ten, 12, 15, 18, 20, 24 or more repeat units of the spacer, for example, (Cn)x. where x is any integer between 1 and at least 11, at least 12, at least 15, at least 18, at least 20, at least 24 or 30 Cn spacers where “n” is 3 or 6, e.g., C3 spacers, C6 spacers or a combination of C3 and C6 spacers in a linear arrangement or a branched arrangement. The C3 and C6 spacers individually or in combination can also form a branched NCM by forming a doubler or a trebler such as, for example, (C3)3-treb-M13 or [(C3)2-treb]-treb-M13, where the NCM is represented by (C3)3-treb or [(C3)2-treb]-treb and M13 represents the oligonucleotide sequence, as would be known to one of skill in the art. The NCM can also contain a dye-label such as a fluorescent label. In various embodiments at least none, at least one, at least two or more phosphorothioate linkages can be at a terminal 3′end of the oligonucleotide sequence. The presence of at least one nuclease-resistant linkage provides resistance to digestion by 3′-5′ nucleases such as Exonuclease I (P/N M02935 New England Biolabs, Ipswich, Mass.), Exo III (P/N M02065, New England Biolabs, Ipswich, Mass.), Pfu (Promega, P/N M7741, Madison, Wis.), and DNA pol I (P/N M02095, New England Biolabs, Ipswich, Mass.). The resistance of the chemically-enhanced primer to nuclease digestion offers the advantage of eliminating a PCR clean-up step in the PCR to sequencing protocol. Removal of the extra non-nuclease resistant amplification primers left over from the PCR step can be accomplished in the sequencing reaction mixture. A brief exposure of the PCR amplification reaction to the nuclease within the sequencing reaction mixture degrades the non-nuclease resistant amplification primers followed by an inactivation of the nuclease. The chemically-enhanced primer remains available for the sequencing reaction while the non-nuclease resistant amplification primers and the nuclease have been removed and inactivated, respectively.
In some embodiments the chemically-enhanced primer has a structure of Formula I:
wherein B is a nucleobase; K is S or O; each n is independently an integer of 1 to 12; m is 0 or 1; × is an integer of 1 to about 50; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
For chemically-enhanced primers having a structure of Formula I, OLIGO represents the portion of the chemically-enhanced primer of Formula I that comprises an oligonucleotide. Each nucleotide of the oligonucleotide comprises a nucleobase B portion and a ribose portion:
The chemically-enhanced primer of Formula I may comprise one or more B, wherein B is a naturally occurring nucleobase. In other embodiments, the chemically-enhanced primer of Formula I may comprise one or more B, wherein B is a nucleobase analog.
The chemically-enhanced primer of Formula I may have only one phosphothiorate linkage, wherein m is 0, having a structure of Formula I-A:
The chemically-enhanced primer of Formula I may be labeled with a dye, including dyes that are fluorescent. The chemically-enhanced primer of Formula I may include one or more B labeled with a dye, and is represented as Bf. In some embodiments, the 3′ terminal nucleotide of the chemically-enhanced primer has a fluorescently labeled B. The chemically-enhanced primer may contain a 3′ fluorescently labeled terminal nucleotide wherein the B of the 3′ terminal nucleotide is a nucleobase analog. Alternatively, the chemically-enhanced primer may contain a 5′ terminal nucleotide having a fluorescently labeled B, which can be represented as Bf. In some embodiments, wherein the chemically-enhanced primer contains a 5′ terminal nucleotide containing the fluorescently labeled nucleobase, Bf, the labeled nucleobase is a nucleobase analog. In other embodiments, the chemically-enhanced primer may contain a fluorescently labeled NCM attached directly or indirectly to one of a plurality of NCMs and/or a linker moiety to the 5′ terminal nucleotide of the primer. Additionally, the chemically-enhanced primer of Formula I may be fluorescently labeled on the nucleobase of a nucleotide located at an internal position of the oligonucleotide, and the internal fluorescently labeled nucleotide may be selected to be at any position of the non-terminal portion of the oligonucleotide.
When the chemically-enhanced primer of Formula I contains a fluorescent label, the chemically-enhanced primer may have a structure of one of the following formulae:
wherein FL is a dye label and Bf is a dye labeled nucleobase . Fl and Bf may each represent a fluorescent dye label.
For the chemically-enhanced primer of Formula I, each n can independently be an integer of 1 to 12. In some embodiments, n is 1, 2, 3, 4, 5, 6, 7, 8, or 9. In some embodiments, n is 3. In other embodiments, n is 4. Alternatively, n may be 6. In some embodiments of the chemically enhanced primer of Formula I, when x is greater than 2, a first instance of n is selected to be 3 and a second instance of n is selected to be 6. In further embodiments of the chemically-enhanced primers of Formula I, when x is greater than 2, more than one instance of n is selected to be 3, and more than one instance of n is selected to be 6. In yet other embodiments, when x is greater than 5, a plurality of n is selected to be 3, and a second plurality of n is selected to be 6.
The chemically-enhanced primer of Formula I may have m=1 or m=0. In some embodiments the chemically-enhanced primer of Formula I has m=0.
The chemically-enhanced primer of Formula I may have x, wherein x is an integer of 1 to about 50. In some of the embodiments of the chemically-enhanced primer of Formula I, x is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In other embodiments, x is 10, 15, 18, 20 or 24. In some embodiments, x is 5, 8, 9, 10 or 15. In other embodiments, x is 11, 12, 13, 14, 17 or 20. In other embodiments, x is 30. In further embodiments, x is at least 5, at least 6, at least 8, at least 9, at least 10, at least 15 at least 18, at least 20, or at least 24. In some embodiments, x is 15. In yet other embodiments, x is 8 or 9.
In some embodiments, the chemically-enhanced primers comprise a second plurality y of
moieties, wherein y is an integer of 1-20. In some embodiments, when a first plurality x of n has a value of a first integer, then a second plurality y of n is an integer of 1 to 20. In some embodiments, the chemically-enhanced primer may have a first plurality of n wherein n is 3 and x is 15, and a second plurality of n wherein n is 6 and x is 5. All combinations of n, x and y are contemplated for use in the chemically-enhanced primers of Formula I.
In the chemically-enhanced primer of Formula I, z is an integer of 3 to about 100. In some embodiments, z is an integer of 5 to 50, 5 to 40, or 5 to about 30. In other embodiments, z is an integer of 5 to 25, or 5 to 20.
In some of the embodiments of the chemically-enhanced primer of Formula I, K is S. In other embodiments, K is O.
In some embodiments of the chemically-enhanced primer of Formula I, W is H or OH.
The chemically-enhanced primer of Formula I, I-B, I-C, I-E, I-F, or I-G, may have any combination of B, Bf, FL, K, m, n, W, x, and z of the ranges and selections disclosed above.
The chemically-enhanced primer of Formula I-D may have any combination of B, FL, K, m, n, W, x, and z of the ranges and selections disclosed above.
The chemically-enhanced primer of Formula I-A, I-H, I-J or I-K, may have any combination of B, Bf, FL, K, m, n, W, x, and z of the ranges and selections disclosed above.
In other embodiments, the chemically-enhanced primer is a compound having a structure of Formula II:
wherein B is a nucleobase; K is S or O; each n is independently an integer of 1 to 12; m is 0 or 1; × is an integer of 1 to about 50; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
The chemically-enhanced primer of Formula II may be referred to as a doubler, and represents a branched arrangement of NCM moieties.
The chemically-enhanced primer of Formula II may comprise one or more B, wherein B is a naturally occurring nucleobase. In other embodiments, the chemically-enhanced primer of Formula II may comprise one or more B, wherein B is a nucleobase analog.
The chemically-enhanced primer of Formula II may have only one phosphothiorate linkage, wherein m is 0.
In some embodiments, the chemically-enhanced primer of Formula II may be labeled with a dye, including dyes that are fluorescent. The chemically-enhanced primer of Formula II may include one or more B labeled with a dye, and is represented as Bf. In some embodiments, the 3′ terminal nucleotide of the chemically-enhanced primer has a fluorescently labeled B. The chemically-enhanced primer may contain a 3′ fluorescently labeled terminal nucleotide wherein the B of the 3′ terminal nucleotide is a nucleobase analog. Alternatively, the chemically-enhanced primer may contain a 5′ terminal nucleotide having a fluorescently labeled B, which can be represented as Bf. In some embodiments, wherein the chemically-enhanced primer contains a 5′ terminal nucleotide containing the fluorescently labeled nucleobase, Bf, the labeled nucleobase is a nucleobase analog. In other embodiments, the chemically-enhanced primer may contain a fluorescently labeled NCM attached directly or indirectly to one of a plurality of NCMs and/or a linker moiety to the 5′ terminal nucleotide of the primer. Additionally, the chemically-enhanced primer of Formula II may be fluorescently labeled on the nucleobase of a nucleotide located at an internal position of the oligonucleotide, and the internal fluorescently labeled nucleotide may be selected to be at any position of the non-terminal portion of the oligonucleotide.
For the chemically-enhanced primer of Formula II, n can be an integer of 1 to 9. In some embodiments, n is 1, 2, 3, 4, 5, 6, 7, 8, or 9. In some embodiments, n is 3. In other embodiments, n is 4. Alternatively, n may be 6. . In some embodiments of the chemically-enhanced primer of Formula II, when x is greater than 2, a first instance of n is selected to be 3 and a second instance of n is selected to be 6. In further embodiments of the chemically-enhanced primers of Formula II, when x is greater than 2, more than one instance of n is selected to be 3, and more than one instance of n is selected to be 6. In yet other embodiments, when x is greater than 5, a plurality of n is selected to be 3, and a second plurality of n is selected to be 6.
The chemically-enhanced primer of Formula II may have m=1 or m=0. In some embodiments the chemically-enhanced primer of Formula II has m=0.
The chemically-enhanced primer of Formula II may have x wherein x is an integer of 1 to about 50. In some of the embodiments of the chemically-enhanced primer of Formula II, x is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In other embodiments, x is 10, 15, 18, 20 or 24. In some embodiments, x is 5, 8, 9, 10 or 15. In other embodiments, x is 11, 12, 13, 14, 17 or 20. In other embodiments, x is 30. In further embodiments, x is at least 5, at least 6, at least 8, at least 9, at least 10, at least 15 at least 18, at least 20, or at least 24. In some embodiments, x is 15. In yet other embodiments, x is 8 or 9.
In some embodiments, the chemically-enhanced primers comprise a second plurality y of
moieties, wherein y is an integer of 1-20. In some embodiments, when a first plurality x of n has a value of a first integer, then a second plurality y of n is an integer of 1 to 20. In some embodiments, the chemically-enhanced primer may have a first plurality of n wherein n is 3 and x is 15, and a second plurality of n wherein n is 6 and x is 5. All combinations of n, x and y are contemplated for use in the chemically-enhanced primers of Formula II.
In the chemically-enhanced primer of Formula II, z is an integer of 3 to about 100. In some embodiments, z is an integer of 5 to 50, 5 to 40, or 5 to about 30. In other embodiments, z is an integer of 5 to 25, or 5 to 20. In some of the embodiments of the chemically-enhanced primer of Formula II, K is S. In other embodiments, K is O. In some embodiments of the chemically-enhanced primer of Formula II, W is H or OH. The chemically-enhanced primer of Formula II may have any combination of B, K, m, n, W, x, and z of the ranges and selections disclosed above.
In yet other embodiments, the chemically-enhanced primer is a compound having a structure of the Formula III:
wherein B is a nucleobase; K is S or O; each n is independently an integer of 1 to 12; m is 0 or 1; x is an integer of 1 to about 50; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
The chemically-enhanced primer of Formula III may be referred to as a trebler and represents a branched arrangement of NCM moieties.
The chemically-enhanced primer of Formula III may comprise one or more B, wherein B is a naturally occurring nucleobase. In other embodiments, the chemically-enhanced primer of Formula III may comprise one or more B, wherein B is a nucleobase analog.
The chemically-enhanced primer of Formula III may have only one phosphothiorate linkage, wherein m is 0.
The chemically-enhanced primer of Formula III may be labeled with a dye, including dyes that are fluorescent. The chemically-enhanced primer of Formula III may include one or more B labeled with a dye, and is represented as Bf. In some embodiments, the 3′ terminal nucleotide of the chemically-enhanced primer has a fluorescently labeled B. The chemically-enhanced primer may contain a 3′ fluorescently labeled terminal nucleotide wherein the B of the 3′ terminal nucleotide is a nucleobase analog. Alternatively, the chemically-enhanced primer may contain a 5′ terminal nucleotide having a fluorescently labeled B, which can be represented as Bf. In some embodiments, wherein the chemically-enhanced primer contains a 5′ terminal nucleotide containing the fluorescently labeled nucleobase, Bf, the labeled nucleobase is a nucleobase analog. In other embodiments, the chemically-enhanced primer may contain a fluorescently labeled NCM attached directly or indirectly to one of a plurality of NCMs and/or a linker moiety to the 5′ terminal nucleotide of the primer. Additionally, the chemically-enhanced primer of Formula II may be fluorescently labeled on the nucleobase of a nucleotide located at an internal position of the oligonucleotide, and the internal fluorescently labeled nucleotide may be selected to be at any position of the non-terminal portion of the oligonucleotide.
For the chemically-enhanced primer of Formula III, n can be an integer of 1 to 9. In some embodiments, n is 1, 2, 3, 4, 5, 6, 7, 8, or 9. In some embodiments, n is 3. In other embodiments, n is 4. Alternatively, n may be 6. In some embodiments of the chemically-enhanced primer of Formula III, when x is greater than 2, a first instance of n is selected to be 3 and a second instance of n is selected to be 6. In further embodiments of the chemically-enhanced primers of Formula III, when x is greater than 2, more than one instance of n is selected to be 3, and more than one instance of n is selected to be 6. In yet other embodiments, when x is greater than 5, a plurality of n is selected to be 3, and a second plurality of n is selected to be 6.
The chemically-enhanced primer of Formula III may have m=1 or m=0. In some embodiments the chemically-enhanced primer of Formula III has m=0.
The chemically-enhanced primer of Formula III may have x wherein x is an integer of 1 to about 30. In some of the embodiments of the chemically-enhanced primer of Formula II, x is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In other embodiments, x is 10, 15, 18, 20 or 24. In some embodiments, x is 5, 8, 9, 10 or 15. In other embodiments, x is 11, 12, 13, 14, 17 or 20. In other embodiments, x is 30. In further embodiments, x is at least 5, at least 6, at least 8, at least 9, at least 10, at least 15 at least 18, at least 20, or at least 24. In some embodiments, x is 15. In yet other embodiments, x is 8 or 9.
In the chemically-enhanced primer of Formula III, z is an integer of 3 to about 100. In some embodiments, z is an integer of 5 to 50, 5 to 40, or 5 to about 30. In other embodiments, z is an integer of 5 to 25, or 5 to 20.
In some embodiments, the chemically-enhanced primers comprise a second plurality y of
moieties, wherein y is an integer of 1-20. In some embodiments, when a first plurality x of n has a value of a first integer, then a second plurality y of n is an integer of 1 to 20. In some embodiments, the chemically-enhanced primer may have a first plurality of n wherein n is 3 and x is 15, and a second plurality of n wherein n is 6 and x is 5. All combinations of n, x and y are contemplated for use in the chemically-enhanced primers of Formula III.
In some of the embodiments of the chemically-enhanced primer of Formula III, K is S. In other embodiments, K is O. In some embodiments of the chemically-enhanced primer of Formula III, W is H or OH. The chemically-enhanced primer of Formula III may have any combination of B, K, m, n, W, x, and z of the ranges and selections disclosed above.Other embodiments of the chemically-enhanced primer are represented by Formula IV:
wherein each instance of n is independently an integer of 1 to 12; × is an integer of 1 to 50;
v is an integer of 1 to 9; t is 0 or 1; LINKER comprises 3-100 atoms;
OLIGO has a structure of the following formula:
wherein B is a nucleobase; K is S or O; m is 0 or 1; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
The chemically-enhanced primer of Formula IV, may comprise one or more B wherein B is a naturally occurring nucleobase. In other embodiments, the chemically-enhanced primer of Formula IV may comprise one or more B, wherein B is a nucleobase analog.
The chemically-enhanced primer of Formula IV may be labeled with a dye, including dyes that are fluorescent. The chemically-enhanced primer having a formula of (Cn)x-OLIGO may include one or more B labeled with a dye, and is represented as Bf. In some embodiments, when the chemically-enhanced primer has at least one B labeled with a dye, the B may be a nucleobase analog. In some embodiments, the 3′ terminal nucleotide of the chemically-enhanced primer has a fluorescently labeled B, which can be represented as Bf. Alternatively, the chemically-enhanced primer may contain a 5′ terminal nucleotide having a fluorescently labeled B, which can be represented as Bf. Additionally, the chemically-enhanced primer having a formula of (Cn)x-OLIGO, may be fluorescently labeled on the nucleobase of a nucleotide located at an internal position of the oligonucleotide, and the internal fluorescently labeled nucleotide may be selected to be at any position of the non-terminal portion of the oligonucleotide. In other embodiments, the chemically-enhanced primer may contain a fluorescently labeled NCM attached directly or indirectly to one of a plurality of NCMs and/or to a NCM linker moiety forming a covalent attachment to the 5′ terminal nucleotide of the primer.
LINKER is an NCM linker and may comprise 3-100 atoms and include ether, amide, phosphodiester, and ester moieties to form a covalent linkage between the NCM and the oligonucleotide. LINKER may be attached to the 5′ carbon of the ribose of the nucleotide at the 5′ terminus of the oligonucleotide. In some embodiments, LINKER is present. In other embodiments the NCM phosphodiester moiety or moieties are directly attached to OLIGO.
For the chemically-enhanced primer of Formula IV, v can be an integer of 1 to 9. In some embodiments, v is 1. In other embodiments, v is 2. In yet other embodiments, v is 3.
For the chemically-enhanced primer of Formula IV, n can be an integer of 1 to 12. In some embodiments, n is 1, 2, 3, 4, 5, 6, 7, 8, or 9. In other embodiments, n is an integer of 1 to 9. In some embodiments, n is 3. In other embodiments, n is 4. Alternatively, n may be 6The chemically-enhanced primer of Formula IV has x , wherein x is an integer of 1 to about 30. In some of the embodiments of the chemically-enhanced primer of Formula IV, x is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In other embodiments, x is 10, 15, 18, 20 or 24. In some embodiments, x is 5, 8, 9, 10 or 15. In other embodiments, x is 11, 12, 13, 14, 17 or 20. In further embodiments, x is at least 5, at least 6, at least 8, at least 9, at least 10, at least 15 at least 18, at least 20, or at least 24. In some embodiments, x is 15. In yet other embodiments, x is 8 or 9.
In some embodiments of the chemically-enhanced primer of Formula IV, when x is greater than 2, a first instance of n is selected to be 3 and a second instance of n is selected to be 6. In further embodiments of the chemically-enhanced primers of Formula IV, when x is greater than 2, more than one instance of n is selected to be 3, and more than one instance of n is selected to be 6. In yet other embodiments, when x is greater than 5, a plurality of n is selected to be 3, and a second plurality of n is selected to be 6.
In some embodiments, the chemically-enhanced primers comprise a second plurality y of
moieties, wherein y is an integer of 1- 20. In some embodiments, when a first plurality x of n has a value of a first integer, then a second plurality y of n is an integer of 1 to 20. In some embodiments, the chemically-enhanced primer may have a first plurality of n wherein n is 3 and x is 15, and a second plurality of n wherein n is 6 and x is 5. All combinations of n, x and y are contemplated for use in the chemically-enhanced primers of Formula IV.
The chemically-enhanced primer having a formula of Formula IV has m=1 or m=0. In some embodiments the chemically-enhanced primer of Formula IV has m=0. The chemically-enhanced primer having a formula of Formula IV has z , wherein z is an integer of 3 to about 100. In some embodiments, z is an integer of 5 to 50, 5 to about 40, or 5 to about 30. In other embodiments, z is an integer of 5 to 25, or 5 to 20. In some of the embodiments of the chemically-enhanced primer of Formula IV, K is S. In other embodiments, K is O. In some embodiments of the chemically-enhanced primer of Formula IV, W is H or OH. The chemically-enhanced primer of Formula IV, may have any combination of B, n, t, v, x, m, y, z, K or W of the ranges and selections disclosed above.
In some embodiments, the chemically-enhanced primer of Formula IV is a chemically-enhanced primer (Cn)x-OLIGO , wherein (Cn)x has a structure of the following formula:
wherein each instance of n is independently an integer of 1 to 12; and x is an integer of 1 to about 30; and OLIGO has a structure of the following formula:
wherein B, K, m, z, y, Nt, and W are as defined above for Formula IV.
For the chemically-enhanced primer having a formula of (Cn)x-OLIGO, v is 1, t is 0, no LINKER is present, and the chain of NCM moieties are attached to OLIGO directly. [00212]The chemically-enhanced primer having a formula of (Cn)x-OLIGO, may have any combination of B, n, x, m, z, K or W of the ranges and selections disclosed above for Formula IV.
In yet other embodiments, the chemically-enhanced primer is represented by the following formulae:
(Cn)x-OLIGO* , wherein (Cn)x has a structure of the following formula:
wherein each instance of n is independently an integer of 1 to 12; and x is an integer of 1 to about 30; OLIGO* has a structure of the following formula:
wherein B, K, m, z, y, Nt, and W are as defined above for Formula IV.
For the chemically-enhanced primer having a formula of (Cn)x-OLIGO*, v is 1, t is 0, no LINKER is present, and the chain of NCM moieties are attached to OLIGO* directly. [00215]The chemically-enhanced primer having a formula of (Cn)x-OLIGO*, may have any combination of B, n, x, m, z, or W of the ranges and selections disclosed above for Formula IV.
Chemically-enhanced primers having a formula of (Cn)x Formula VI-Al include , but are not limited to:
Chemically-enhanced primers having a formula of (Cn)x-OLIGO* include, but are not limited to:
The chemically enhanced primer includes a pre-determined number of nucleotides at its 3′terminus which are at least partially complementary to an equivalent number of nucleotides of the preceding sequence 5′ to the sequence of interest of the amplicon, which can then hybridize during the sequencing reaction to produce extension products of the chemically enhanced primer. In some embodiments, the 3′ pre-determined number of nucleotides of the chemically enhanced primer are at least partially complementary to a gene specific sequence at the 5′ terminus of the preceding sequence of the amplicon. In some embodiments, the 5′nucleotides of the preceding sequence of the amplicon, to which the 3′ pre-determined number of nucleotides of the chemically enhanced primer hybridizes, are not gene specific. When the 5′nucleotides of the preceding sequence of the amplicon are not a gene specific sequence, the 5′nucleotides of the preceding sequence may be any suitable tag, tail, universal sequence, bar code or ligation product.
Using the chemically-enhanced sequencing primer, it has been found that high quality and highly accurate nucleic acid sequence results can be obtained in about 50% less time overall for a PCR to sequencing results workflow using POP7TM polymer on any of the 3130, 3730 or 3500 capillary electrophoresis platforms (Applied Biosystems, Foster City, Calif.). Specifically, improvement of 5′ sequence resolution is provided by use of the chemically enhanced primer which permits elucidation of base 1 from the chemically-enhanced sequencing primer. The throughput is increased as well as reducing hands-on-time by eliminating a separate PCR clean-up step prior to initiation of the sequencing reaction. (
The chemically-enhanced sequencing primer and improved workflow improves polymorphism detection and more efficient use of allele specific sequencing primers for heterozygous ambiguity resolution. Various aspects of the use and synthesis of chemically-enhanced sequencing primers are further described in U.S. Application Ser. Nos. 61/026,085, filed Feb. 4, 2008; Ser. No. 12/365,140, filed Feb. 3., 2009; 61/407,899, filed Oct. 28, 2010; 61/408,553, filed Oct. 29, 2010; Ser. No. 13/284,839, filed Oct. 28, 2011; and Ser. No. 13/397,626, filed Feb. 15, 2012, and each disclosure of which is hereby incorporated by reference in its entirety.
Compositions. A composition for sequencing nucleic acid is described that includes: a PCR amplification reaction product that comprises: a DNA product amplified from at least one amplicon, wherein the amplicon comprises a sequence of interest and a preceding sequence 5′ to the sequence of interest incorporated from a first priming sequence; non-nuclease-resistant amplification primer(s); and a chemically enhanced primer wherein the chemically enhanced primer comprises an oligonucleotide sequence, a NCM and none or at least one nuclease-resistant linkage.
The chemically-enhanced primer may include a plurality of NCMs either at a terminal 5′ end or within a oligonucleotide sequence of the chemically-enhanced primer. The NCM may be a (Cn) spacer wherein n can be any integer from 1 to 9. In various embodiments, the NCM comprises a plurality of (Cn) spacers. In various embodiments, the chemically-enhanced primer may have a structure of Formula I:
wherein B is a nucleobase; K is S or O; each n is independently an integer of 1 to 9; m is 0 or 1; × is an integer of 1 to about 30; z is an integer of 3 to about 100; W is OH, F, OMe, or H; and Nt is a moiety having a formula:
In some embodiments of the compositions of the invention, the chemically enhanced primer may be any chemically enhanced primer described in this disclosure.
In various embodiments, the oligonucleotide portion of the chemically-enhanced primer may include a universal primer. The universal primer may be selected from M13, US1, T7, SP6, and T3. The universal primer may be M13. In some embodiments, the chemically-enhanced primer may include one nuclease-resistant linkage.
In some embodiments of the compositions of the invention, the composition may further include a nuclease. In other embodiments, the composition may further include a polymerase, deoxynucleotide triphosphates, dideoxynucleotide triphosphates and a dye-label. In some embodiments, the dideoxynucleotide triphosphates may include dideoxynucleotide triphosphates labeled with the dye-label. The dye-labeled dideoxynucleotide triphosphates may include fluorescent dye-labeled dideoxynucleotide triphosphates. In some embodiments, the dye-label may be attached to the NCM or the oligonucleotide sequence.
In some embodiments of the compositions of the invention, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I. In some embodiments, the polymerase may be Taq polymerase.
In some embodiments, the PCR amplification reaction product further includes an amplified DNA product where the DNA product is the amplification product of a plurality of amplicons.
In some embodiments, the composition for sequencing nucleic acids can further comprise more than one chemically-enhanced sequencing primer. In some embodiments, the polymerase can comprise Taq polymerase, for example AmpliTaq Gold polymerase. In some embodiments, the nuclease can comprise exonuclease I. In some embodiments, the chemically-enhanced sequencing primer can comprise at least one phosphorothioate linkage. In other embodiments, the chemically-enhanced sequencing primer can comprise a terminal 3′ end phosphorothioate linkage. In some embodiments the chemically-enhanced sequencing primer can comprise a dye, for example a fluorescent dye-labeled oliogonucleotide and/or at least one fluorescently dye-labeled NCM moiety within the NCM compound.
According to various embodiments, the composition for sequencing nucleic acid can comprise a polymerase, for example a DNA polymerase, in an amount of from about 0.01 Unit to about 20 Units, for example, from about 0.1 Unit to about 1.0 Unit, or about 0.8 Unit. The composition can comprise polymerase in an amount within a range having an upper limit of from about 10 Units to about 20 Units and a lower limit of from about 0.01 Unit to about 0.05 Unit. According to various embodiments, the composition can comprise a nuclease, for example exonuclease I, in an amount of from about 1 Unit to about 40 Units, for example, from about 2 Units to about 15 Units, or about 10 Units. The composition can comprise nuclease in an amount within a range having an upper limit of from about 10 Units to about 40 Units, and a lower limit of from about 1 Unit to about 2 Units.
According to various embodiments, the composition for sequencing nucleic acid can comprise a chemically-enhanced sequencing primer, in an amount of from about 0.1 μM to about 20 μM, for example about 1.0 μM. The composition can comprise a chemically-enhanced sequencing primer in an amount within a range having an upper limit of from about 10 μM to about 20 μM and a lower limit of from about 0.05 μM to about 0.1 μM. According to various embodiments, the composition can comprise dNTPs in an amount of from about 20 μM to about 5000 μM, for example, about 500 μM. The composition can comprise dNTPs in an amount within a range having an upper limit of from about 2000 μM to about 5000 μM and a lower limit of from about 20 μM to about 50 μM. According to various embodiments, the composition can comprise ddNTPs in an amount of from about 0.03 μM to about 10 μM, for example about 3 μM. The composition can comprise ddNTPs in an amount within a range having an upper limit of from about 5 μM to about 10 μM and a lower limit of from about 0.01 μM to about 0.05 μM. All molar amounts are based on final concentrations of the final volume.
According to various embodiments, the composition can comprise one or more non-nuclease-resistant amplification primers in an amount of from about 0.1 μM to about 20 μM each, for example about 0.01 μM or 1.0 μM. The composition can comprise one or more non-nuclease-resistant amplification primers in an amount within a range having an upper limit of from about 10 μM to about 20 μM each and a lower limit of from about 0.05 μM to about 0.1 μM each. All molar amounts are based on final concentrations of the final volume.
According to various embodiments, the composition for sequencing nucleic acid can further comprise a PCR amplification product. In some embodiments, the PCR amplification product can comprise an amplified DNA target sequence. In some embodiments, the PCR amplification product can comprise non-nuclease-resistant amplification primer(s). The non-nuclease-resistant amplification primer can comprise, for example, phosphodiester linkages that are sensitive to degradation by exonuclease. In some embodiments, the PCR amplification product can comprise a target specific amplicon that incorporates nucleic acid sequence capable of annealing to a universal primer.
Kits. The present teachings are also directed to kits that utilize the chemically-enhanced primer composition and methods described above. In some embodiments, a basic kit can comprise a container having one or more chemically-enhanced primers, as described in this disclosure. A kit can also optionally comprise instructions for use.
A kit is described which includes: a polymerase, a nuclease, at least one deoxynucleotide triphosphate, and dideoxynucleotide triphosphates. The dideoxynucleotide triphosphates may be dideoxynucleotide triphosphates labeled with a dye-label. The dye-labeled dideoxynucleotide triphosphates may be fluorescent dye-labeled dideoxynucleotide triphosphates.
In some embodiments, the nuclease may be selected from exonuclease I, Exo III, Pfu and DNA pol I. In various embodiments, the kit may include a chemically enhanced primer as described in this disclosure.
In other embodiments, the kit may further include a plurality of nuclease sensitive amplification primers. The plurality of nuclease- sensitive amplification primers may be configured to prime a sequence of interest of a specific disease state. The plurality of nuclease-sensitive amplification primers of the kit may be configured to prime a set of sequences connected to a specific disease state.
A kit can also comprise other optional kit components, such as, for example, one or more of a nuclease, a sufficient quantity of enzyme for sequencing or fragment analysis, buffer to facilitate the sequencing reaction or fragment analysis reaction , dNTPs, modified dNTPs, dNTP analogs and 7-Deaza-dGTP for strand extension during sequencing reaction or fragment analysis reaction, ddNTPs, a dye-label, loading solution for preparation of the sequenced or fragment analyzed material for electrophoresis, genomic DNA as a template control, a size marker to insure that materials migrate as anticipated in the separation medium, and a protocol and manual to educate the user and limit error in use. The amounts of the various reagents in the kits also can be varied depending upon a number of factors, such as the optimum sensitivity of the process. It is within the scope of these teachings to provide test kits for use in manual applications or test kits for use with automated detectors or analyzers. Kits may have more than one chemically enhanced primer, and the number of NCM moieties may be different in each of the chemically enhanced primers. Kits for plasmid sequencing may have any of the components listed above, but do not include a nuclease.
Examples. Examples of the compositions and methods of the present teachings are shown below. These examples are not limiting of the present teachings, and those of ordinary skill in the art will recognize that the components used in the reactions may be readily substituted with equivalent reagents known in the art.
C6 spacer+Oligo seq. synthesis, no phospohorothioate group: An 18 base oligonucleotide labeled with one or more C6 spacers at the 5′ position was made on an ABI model 394 DNA synthesizer using standard phosphoramidite chemistry. The C6 spacer phosphoramidite was obtained from Chem Genes Corp. (P/N CLP-1120, Wilmington, Mass.). The labeled 18mer was made with the trityl group intact from a one micromole column. On completion of the synthesis the oligonucleotide was cleaved off the support with NH4OH and purified by HPLC using an ABI RP-300 (C-8) column (4.6×220 mm) using a flow rate of 1.5 ml/min. and a solvent gradient of 0.1M triethylarnmoniurn acetate-water pH 7.0 and acetonitrile, the trityl group was removed and the product was isolated by ethanol precipitation.
C3 spacer +Oligo seq. synthesis, no phosphorothioate group: An 18 base oligonucleotide labeled with one or more C3 spacers (P/N 104913-90, Glen Research), at the 5′ position was made on an ABI model 394 DNA synthesizer using standard phosphoramidite chemistry. The labeled 18mer was made with the trityl group intact from a one micromole column. On completion of the synthesis the oligonucleotide was cleaved off the support with NH4OH and purified by HPLC using an ABI RP-300 (C-8) column (4.6×220 mm) using a flow rate of LS ml/rein. and a solvent gradient of 0.1M triethylammonium acetate-water pH 7.0 and acetonitrile, the trityl group was removed and the product was isolated by ethanol precipitation.
Protocol for oligo labeled with one or more C-3 spacer containing a 3′ phosphorothioate linkage: An 18 base oligonucleotide labeled with one or more C-3 spacers at the 5′ position was made on an ABI model 394 DNA synthesizer using standard phosphoramidite chemistry. The 3′ phosphorothioate linkage was made using standard methods with sulfurizing reagent (TEM P/N 401267 (Applied Biosystems, Foster City, Calif.). The C3 spacer phosphorainidite was obtained from Glen Research (P/N 10-1913-90). The labeled 18mer was made with the trityl group intact from a one micromole synthesis column. On completion of the synthesis the oligonucleotide was cleaved off the support with NH4OH and purified by HPLC using an ABI RP-300 (C-18) column (4.6×220 mm) using a flow rate of 1.5 inl/rnin. and a solvent gradient of 0.1M triethyiammonium acetate-water pH TO and acetonitrile, the trityl group was removed and the product was isolated by ethanol precipitation. Note: To synthesize more than one phosphorothioate linkage or to place this linkage anywhere in the 18-mer oligonucleotide chain, oxidize using the sulfurizing reagent at these position(s).
A minor fraction (e.g. 5%) of original pre-amplication material from the Ion AmpliSeq comprehensive cancer panel v2 (CHP v2) and the Ion Oncomine cancer panel (OCP) is used for follow up (confirmatory) analysis using traditional fluorescent dye terminator sequencing a.k.a. Sanger sequencing and detection by automated capillary electrophoresis (CE) such as the Applied Biosystems 3500 XL Genetic Analyzer.
Molecular analysis of genetic mutations (variants) in tumor samples is becoming increasingly used for tumor characterization and the diagnostic and therapeutic management of cancer patients. Often, only very limited amount of tumor specimen is initially (i.e. pre-surgery) available for example by fine needle biopsy of suspected tumor tissue or aberrant cell clusters present in a formalin fixed paraffin embedded (FFPE) preparations.
The Ion AmpliSeq™ cancer panels are designed to amplify a multitude of oncologically relevant target genes from low amount of input DNA (10 ng) for subsequent sequencing on a chip-based platform (e.g. the Ion Torrent PGM™ instrument). The Ion Ampliseg™ CHPv2 panel covers 207 loci and the OCP panel over 2000 targets. These panels and disclosure related to loci selection, amplicon size, and primer development are further described in U.S. Application Ser. Nos. 61/479,952, filed on Apr. 28, 2011; 61/531,583, filed on Sep. 6, 2011; 61/531,574, filed on Sep. 6, 2011; 61/538,079, filed on Sep. 22, 2011; 61/564,763, filed on Nov. 29, 2011; 61/578,192, filed on Dec. 20, 2011; 61/594,160, filed on Feb. 2, 2012; 61/598,881, filed on Feb. 14, 2012; 61/598,892, filed on Feb. 14, 2012; 61/625,596, filed on Apr. 17, 2012; 61/639,017, filed on Apr. 26, 2012; Ser. No. 13/458,739, filed on Apr. 27, 2012; Ser. No. 13/663,334, filed on Oct. 29, 2012; Ser. No. 13/679,706, filed on Nov. 16, 2012; and an application entitled “Detection, Identification, Validation, and Enrichment of Target Nucleic Acids”, Attorney Docket No. LT00974 PRO, filed on even date; and each disclosure is hereby incorporated by reference in its entirety.
Occasionally, sequencing results obtained from the next generation sequencing (NGS) platform need to be confirmed with an orthogonal methodology such as traditional Sanger sequencing. Such a reflex test is indicated when a minor variant i.e. a coding or functionally relevant nucleotide variant occurs not in a 50% frequency as expected if inherited in a Mendelian fashion, but rather at a lower frequency (i.e. between 5-25%) which are typical for somatic mutations i.e. a mutational event that happens spontaneously or is causative for or driving carcinogenesis.
For a pair of forward and reverse Sanger sequencing reactions of a single target 5 ng of genomic DNA is typically needed which may quickly exhaust the amount available from the clinical specimen.
Therefore, it is desirable to exploit the enriched, pre-amplified, pool of target sequences from the Ampliseq reaction for re-sequencing by Sanger.
Typically an Ampliseg™ pre-amplification reaction is set up to be performed in a reaction volume of 20 uL. Removing an aliquot of 1 ul (i.e. 5%) of this material prior to the primer trimming step (see
The typical input amount of human DNA that goes into an AmpliSeq™ reaction is 10 ng which is the equivalent of 3000 genome copies or 1500 cells. The preamplication conditions are:
A 1 ul aliquot of the preamplification material (PA) is diluted 1:1000 in 1 ml TE buffer.
Assuming 100% efficiency in the pre-amplification PCR reaction the number of targets can be calculated that go into the subsequent PCR reaction for Sanger Sequencing (BDD PCR)
Due to the pre-amplification reaction there is sufficient target materials available for reflex testing. The previously target-limited sample is not limited anymore. Besides Sanger sequencing also other detection methods can be potentially employed, including but not limited to digital PCR, cast-PCR or other allele-specific PCR methods, single base extension chemistry and detection (by CE or Mass spectroscopy).
Further, panels of informative or actionable targets (e.g. TP53, KRAS, BRAF, EGFR, etc.) can be designed that can be analyzed by Sanger Sequencing as a first step prior to next gen sequencing or used exclusively by Sanger sequencing (or another method). PCR primers used:
The amplicon and primer sequences of the CHPv2 panel were evaluated for general suitability for singular PCR. From this list, a total of 48 targets (including 18 “difficult” targets, having GC rich regions, short amplicon regions, hompolymer A or homopolymer T stretches) and their corresponding PCR primer pairs and added M13 Forward sequence (tgtaaaacgacggccagt) to the 5′ end of the AmpliSeq forward primer and M13 reverse sequence (caggaaacagctatgacc) to the 5′ end of the AmpliSeq reverse primer. These are nuclease sensitive amplification primers for use in the methods described in this disclosure. Primer data are listed in Table 8, following this section.
Primers are delivered resuspended in water at a concentration of 100 μM. A Primer Pair (PP) master plate is generated by combining 10 ul of each corresponding (Forward and reverse) primer pair and then diluted with 80 uL low TE to 100 uL so that the primers in the pair were at a concentration of 10 μM.
The orientation of the primer pairs on the master plate is as follows:
Primer Layout on Plate
From the PP Master plate test plates are generated by diluting the 10 uM stock primer pairs further to 1 uM in low TE and plated 10 uL of the diluent to two sets (CHPv2 A and B) of 96 well plates in the following composition:
Each plate contains thus 4 sections of 24 identical primer pairs allowing the processing of 4 samples which were typically:
Primers were allowed to dry in situ on the plate and then used within 2 weeks for PCR Sequencing experiments using the BigDye® Direct Sequencing Kit, which includes reagents for PCR amplification and subsequent cycle sequencing chemistries.
The BigDye® Direct Sequencing Kit uses M13-tagged PCR primers. This is advantageous because in the subsequent sequencing reaction chemically enhanced primers, as described in the sections above, and having a sequence of M13 forward or reverse, are used as sequencing primers. This allows sequence reading almost immediately at the 5′end of the PCR amplicon. This maximizes the sequence information which is important since the AmpliSeq primers are designed to be fairly short (125-175 nt) owing to the short nature of heavily fragmented FFPE DNA.
After PCR, BigDye® Direct Sequencing reagent is added to the amplification reaction product in situ in the amplification mixture along with the chemically enhanced sequencing primer(s). This reagent contains not only the typical reagents needed for cycle sequencing (a polymerase, dNTPs, and dye-labelled ddNTPs). , but also contains nuclease to remove the need for additional purification manipulations by removing excess amplification primers in situ.
Sequencing reactions were purified by addition of 55 ul BigDye® Xterminator beads solution mix followed by 30 min vigorous vortexing, to remove smaller molecule contaminants. After spinning the beads to the bottom of the well the plates were put into a Applied Biosystems 3500 XL Genetic Analyzer for capillary electrophoresis and sequence base calling. The resulting .ab1 sequencing files were quality assessed with Applied Biosystems Sequence Scanner software and then further analyzed using Applied Biosystems Variant Reporter software for detection of variants.
The amplified samples are analyzed by methods that resolve nucleobase sequences as would be known to one of skill in the art. For example, capillary electrophoresis can be used following the instrument manufactures directions. BigDye XTerminator Purification Kit (Applied Biosystems, P/N 4376486) can be used in cycle sequencing clean up to prevent the co-injection of un-incorporated dye-labeled terminators, dNTPs and salts with dye-labeled extension products into a capillary electrophoresis DNA analyzer. Briefly, 13 μL sequencing reaction mixture was combined with 45 μL SAM Solution and 10 μL XTerminator Solution. After vortexing the sample plate at 1800 rpm for 20 minutes, spin the plate at 1000×g for 2 minutes. To each well was added 30 μL of 70% ethanol and the plate was centrifuged at 1650×g for 15 minutes. The solution was removed by inverting the plate onto a paper towel and centrifuging at 180×g for 1 minute. The precipitated sequencing reaction was then dissolved in 10 μL of 50 μM EDTA and loaded onto an AB 3500xL Genetic Analyzer equipped with a 50 cm capillary array (Applied Biosystems, Foster City, Calif.).
Capillary electrophoresis (CE) was performed on the current Applied Biosystems instruments, for example the Applied Biosystems 3500xl Genetic Analyzer, using the dye set Z as described the instrument's User Guide. There are ShortReadSeq_BDX_POP7, RapidSeq_BDX_POP7, FastSeq_BDX_POP7, StdSeq_BDX_POP7 run modules. For example, BDxFastSeq50_POP7xl_1 parameters were: oven temperature: 60C, sample injection for 5 sec at 1.6 kV and electrophoresis at 13.4 kV for 2520 sec in Performance Optimized Polymer (POP-7™ polymer) with a run temperature of 60° C. Variations in instrument parameters, e.g. injection conditions, were different on other CE instruments such as the 3500 or 3730x1 Genetic Analyzers. The data were collected using versions the Applied Biosystems Data Collection Software specific to the different instruments, such as 3500 Data Collection Software v1.0. The sequence traces were analyzed by Applied Biosystems KB™ Basecaller Software v1.4.1 with KB_3500_POP7_BDTv3direct.bcc and KB_3500_POP7_BDTv3direct.mob to determine the correct base calls.
Those skilled in the art understand that the detection techniques employed are generally not limiting. Rather, a wide variety of detection means are within the scope of the disclosed methods and kits, provided that they allow the presence or absence of an amplicon to be determined. While the principles of this invention have been described in connection with specific embodiments, it should be understood clearly that these descriptions are made only by way of example and are not intended to limit the scope of the invention. What has been disclosed herein has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit what is disclosed to the precise forms described. Many modifications and variations will be apparent to the practitioner skilled in the art. What is disclosed was chosen and described in order to best explain the principles and practical application of the disclosed embodiments of the art described, thereby enabling others skilled in the art to understand the various embodiments and various modifications that are suited to the particular use contemplated. It is intended that the scope of what is disclosed be defined by the following claims and their equivalence.
This application claims priority under 35 U.S.C. §119(e) to U.S. Provisional Application Nos. 62/059,821, filed Oct. 3, 2014, and 62/059,824, filed Oct. 3, 2014, the disclosures of which are hereby incorporated by reference in its entirety for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2015/052675 | 9/28/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62059821 | Oct 2014 | US | |
62059824 | Oct 2014 | US |