STRAND DISPLACING SEQUENCING ENZYMES

REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY

The Sequence Listing titled 051385-563001US_SL_ST26.XML, was created on Dec. 15, 2022 in machine format IBM-PC, MS-Windows operating system, is 23,573 bytes in size, and is hereby incorporated by reference in its entirety for all purposes.

BACKGROUND

Genetic analysis is taking on increasing importance in modern society as a diagnostic, prognostic, and as a forensic tool, and typically requires amplification of genomic fragments. A majority of nucleic acid amplification techniques (e.g., DNA amplification) used in university, medical, and clinical laboratory research is performed using the polymerase chain reaction (PCR), though in the past decade alternative amplification methods have emerged that eliminate thermal cycling (i.e., isothermal amplification). Typical isothermal amplification methods require the use of a DNA polymerase with a strong strand displacement activity to displace downstream DNA, thereby enabling continuous replication without thermal cycling. Efficient amplification typically requires elevated temperatures to enable the annealing of primers at specific locations on the dsDNA. However, few thermostable strand displacing enzymes exist. For example, SD DNA polymerase (a mutant Taq DNA polymerase) and the large fragment of Bst DNA polymerase possess favorable characteristics for isothermal amplification, but both are inactivated and elevated temperatures (e.g., greater than 70° C.). In addition to nucleic acid amplification, polymerase mediated nucleic acid sequencing methods (e.g., sequencing-by-synthesis) benefit by using a thermostable, strand-displacing polymerase. Thus, there is a need for thermostable, strand-displacing polymerases. Disclosed herein, inter alia, are solutions to these and other problems in the art.

BRIEF SUMMARY

In an aspect is provided a polymerase including an amino acid sequence that is at least 80% identical to a continuous 500 amino acid sequence within SEQ ID NO: 1; including a first mutation at amino acid position 409 or an amino acid position corresponding to position 409; and at least one mutation at amino acid position 7 or an amino acid position corresponding to position 7; at amino acid position 579 or an amino acid position corresponding to position 579; at amino acid position 588 or an amino acid position corresponding to position 588; or at amino acid position 742 or an amino acid position corresponding to position 742.

In an aspect is provided a method of incorporating a modified nucleotide into a nucleic acid sequence including combining in a reaction vessel: (i) a nucleic acid template, (ii) a nucleotide solution, and (iii) a polymerase, wherein the polymerase is a polymerase as described herein. In embodiments, the modified nucleotide includes a label (e.g., a label linked to the nucleobase via an optionally cleavable linker). In embodiments, the modified nucleotide includes a reversible terminator moiety (e.g., a polymerase-compatible cleavable moiety bonded to the 3′ oxygen of a nucleotide).

In another aspect is provided a method of sequencing a nucleic acid sequence including: a. hybridizing a nucleic acid template with a primer to form a primer-template hybridization complex; b. contacting the primer-template hybridization complex with a DNA polymerase and modified nucleotides, wherein the DNA polymerase is the polymerase as described herein, wherein the modified nucleotide includes a detectable label; c. subjecting the primer-template hybridization complex to conditions which enable the polymerase to incorporate a modified nucleotide into the primer-template hybridization complex to form a modified primer-template hybridization complex; and d. detecting the detectable label; thereby sequencing a nucleic acid sequence.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the hairpin-challenge strand-displacing assay, which utilizes streptavidin bead, depicted as a large circle (bead) and diamond (streptavidin), conjugated to a dual hairpin challenge template.

FIG. 2 depicts an alignment of two sequences described herein. SEQ ID NO:3 is aligned to SEQ ID NO:1 and amino acid positions 541-560 are depicted in FIG. 2. The alignment highlights a deletion in SEQ ID NO:3 relative to SEQ ID NO:1, such that any amino acid positions beyond amino acid position 554 are shifted −1 in SEQ ID NO:3 relative to SEQ ID NO:1 (i.e., amino acid E554 in SEQ ID NO:3 corresponds to E555 in SEQ ID NO:1).

DETAILED DESCRIPTION

The aspects and embodiments described herein relate to strand displacing polymerases and uses thereof.

Definitions

All patents, patent applications, articles and publications mentioned herein, both supra and infra, are hereby expressly incorporated herein by reference in their entireties.

Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Various scientific dictionaries that include the terms included herein are well known and available to those in the art. Although any methods and materials similar or equivalent to those described herein find use in the practice or testing of the disclosure, some preferred methods and materials are described. Accordingly, the terms defined immediately below are more fully described by reference to the specification as a whole. It is to be understood that this disclosure is not limited to the particular methodology, protocols, and reagents described, as these may vary, depending upon the context in which they are used by those of skill in the art. The following definitions are provided to facilitate understanding of certain terms used frequently herein and are not meant to limit the scope of the present disclosure.

As used herein, the singular terms “a”, “an”, and “the” include the plural reference unless the context clearly indicates otherwise. Reference throughout this specification to, for example, “one embodiment”, “an embodiment”, “another embodiment”, “a particular embodiment”, “a related embodiment”, “a certain embodiment”, “an additional embodiment”, or “a further embodiment” or combinations thereof means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Thus, the appearances of the foregoing phrases in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.

As used herein, the term “about” means a range of values including the specified value, which a person of ordinary skill in the art would consider reasonably similar to the specified value. In embodiments, the term “about” means within a standard deviation using measurements generally acceptable in the art. In embodiments, about means a range extending to +/−10% of the specified value. In embodiments, about means the specified value.

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art. See, e.g., Singleton et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY 2nd ed., J. Wiley & Sons (New York, N.Y. 1994); Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, Cold Springs Harbor Press (Cold Springs Harbor, N Y 1989). Any methods, devices and materials similar or equivalent to those described herein can be used in the practice of this invention. The following definitions are provided to facilitate understanding of certain terms used frequently herein and are not meant to limit the scope of the present disclosure.

“Nucleic acid” refers to nucleotides (e.g., deoxyribonucleotides or ribonucleotides) and polymers thereof in either single-, double- or multiple-stranded form, or complements thereof. The terms “polynucleotide,” “oligonucleotide,” “oligo” or the like refer, in the usual and customary sense, to a sequence of nucleotides. The term “nucleotide” refers, in the usual and customary sense, to a single unit of a polynucleotide, i.e., a monomer. Nucleotides can be ribonucleotides, deoxyribonucleotides, or modified versions thereof. Examples of polynucleotides contemplated herein include single and double stranded DNA, single and double stranded RNA, and hybrid molecules having mixtures of single and double stranded DNA and RNA with linear or circular framework. Non-limiting examples of polynucleotides include a gene, a gene fragment, an exon, an intron, intergenic DNA (including, without limitation, heterochromatic DNA), messenger RNA (mRNA), transfer RNA, ribosomal RNA, a ribozyme, cDNA, a recombinant polynucleotide, a branched polynucleotide, a plasmid, a vector, isolated DNA of a sequence, isolated RNA of a sequence, a nucleic acid probe, and a primer. Polynucleotides useful in the methods of the disclosure may comprise natural nucleic acid sequences and variants thereof, artificial nucleic acid sequences, or a combination of such sequences.

The term “duplex” in the context of polynucleotides refers, in the usual and customary sense, to double strandedness. Nucleic acids can be linear or branched. For example, nucleic acids can be a linear chain of nucleotides or the nucleic acids can be branched, e.g., such that the nucleic acids comprise one or more arms or branches of nucleotides. Optionally, the branched nucleic acids are repetitively branched to form higher ordered structures such as dendrimers and the like. Different polynucleotides may have different three-dimensional structures, and may perform various functions, known or unknown.

Nucleic acids, including e.g., nucleic acids with a phosphothioate backbone, can include one or more reactive moieties. As used herein, the term reactive moiety includes any group capable of reacting with another molecule, e.g., a nucleic acid or polypeptide through covalent, non-covalent or other interactions. By way of example, the nucleic acid can include an amino acid reactive moiety that reacts with an amino acid on a protein or polypeptide through a covalent, non-covalent or other interaction.

The term “base” and “nucleobase” as used herein refers to a purine or pyrimidine compound, or a derivative thereof, that may be a constituent of nucleic acid (i.e. DNA or RNA, or a derivative thereof). In embodiments, the base is a derivative of a naturally occurring DNA or RNA base (e.g., a base analogue). In embodiments, the base is a base-pairing base. In embodiments, the base pairs to a complementary base. In embodiments, the base is capable of forming at least one hydrogen bond with a complementary base (e.g., adenine hydrogen bonds with thymine, adenine hydrogen bonds with uracil, guanine pairs with cytosine). Non-limiting examples of a base includes cytosine or a derivative thereof (e.g., cytosine analogue), guanine or a derivative thereof (e.g., guanine analogue), adenine or a derivative thereof (e.g., adenine analogue), thymine or a derivative thereof (e.g., thymine analogue), uracil or a derivative thereof (e.g., uracil analogue), hypoxanthine or a derivative thereof (e.g., hypoxanthine analogue), xanthine or a derivative thereof (e.g., xanthine analogue), guanosine or a derivative thereof (e.g., 7-methylguanosine analogue), deaza-adenine or a derivative thereof (e.g., deaza-adenine analogue), deaza-guanine or a derivative thereof (e.g., deaza-guanine), deaza-hypoxanthine or a derivative thereof, 5,6-dihydrouracil or a derivative thereof (e.g., 5,6-dihydrouracil analogue), 5-methylcytosine or a derivative thereof (e.g., 5-methylcytosine analogue), or 5-hydroxymethylcytosine or a derivative thereof (e.g., 5-hydroxymethylcytosine analogue) moieties. In embodiments, the base is thymine, cytosine, uracil, adenine, guanine, hypoxanthine, xanthine, theobromine, caffeine, uric acid, or isoguanine. In embodiments, the base is

embedded image

A polynucleotide is typically composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T) (uracil (U) for thymine (T) when the polynucleotide is RNA). Thus, the term “polynucleotide sequence” is the alphabetical representation of a polynucleotide molecule; alternatively, the term may be applied to the polynucleotide molecule itself. This alphabetical representation can be input into databases in a computer having a central processing unit and used for bioinformatics applications such as functional genomics and homology searching. Polynucleotides may optionally include one or more non-standard nucleotide(s), nucleotide analog(s) and/or modified nucleotides.

The term “isolated”, when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It can be, for example, in a homogeneous state and may be in either a dry or an aqueous solution. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified.

The terms “analog” and “analogue” and “derivative” in reference to a chemical compound, refers to compounds having a structure similar to that of another one, but differing from it in respect of one or more different atoms, functional groups, or substructures that are replaced with one or more other atoms, functional groups, or substructures. In the context of a nucleotide useful in practicing the invention, a nucleotide analog refers to a compound that, like the nucleotide of which it is an analog, can be incorporated into a nucleic acid molecule (e.g., an extension product) by a suitable polymerase, for example, a DNA polymerase in the context of a dNTP analogue. The terms also encompass nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, include, without limitation, phosphodiester derivatives including, e.g., phosphoramidate, phosphorodiamidate, phosphorothioate (also known as phosphothioate having double bonded sulfur replacing oxygen in the phosphate), phosphorodithioate, phosphonocarboxylic acids, phosphonocarboxylates, phosphonoacetic acid, phosphonoformic acid, methyl phosphonate, boron phosphonate, or O-methylphosphoroamidite linkages (see Eckstein, OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL APPROACH, Oxford University Press) as well as modifications to the nucleotide bases such as in 5-methyl cytidine or pseudouridine; and peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with positive backbones; non-ionic backbones, modified sugars, and non-ribose backbones (e.g. phosphorodiamidate morpholino oligos or locked nucleic acids (LNA) as known in the art), including those described in U.S. Pat. Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, CARBOHYDRATE MODIFICATIONS IN ANTISENSE RESEARCH, Sanghui & Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. In embodiments, the internucleotide linkages in DNA are phosphodiester, phosphodiester derivatives, or a combination of both.

The term “complement,” as used herein, refers to a nucleotide (e.g., RNA or DNA) or a sequence of nucleotides capable of base pairing with a complementary nucleotide or sequence of nucleotides. As described herein and commonly known in the art, the complementary (matching) nucleoside of adenosine is thymidine and the complementary (matching) nucleoside of guanosine is cytidine. Thus, a complement may include a sequence of nucleotides that base pair with corresponding complementary nucleotides of a second nucleic acid sequence. The nucleotides of a complement may match, partially or completely, the nucleotides of the second nucleic acid sequence. Where the nucleotides of the complement completely match each nucleotide of the second nucleic acid sequence, the complement forms base pairs with each nucleotide of the second nucleic acid sequence. Where the nucleotides of the complement partially match the nucleotides of the second nucleic acid sequence, only some of the nucleotides of the complement form base pairs with nucleotides of the second nucleic acid sequence. Examples of complementary sequences include coding and non-coding sequences, wherein the non-coding sequence contains complementary nucleotides to the coding sequence and thus forms the complement of the coding sequence. A further example of complementary sequences are sense and antisense sequences, wherein the sense sequence contains complementary nucleotides to the antisense sequence and thus forms the complement of the antisense sequence.

“DNA” refers to deoxyribonucleic acid, a polymer of deoxyribonucleotides (e.g., dATP, dCTP, dGTP, dTTP, dUTP, etc.) linked by phosphodiester bonds. DNA can be single-stranded (ssDNA) or double-stranded (dsDNA), and can include both single and double-stranded (or “duplex”) regions. “RNA” refers to ribonucleic acid, a polymer of ribonucleotides linked by phosphodiester bonds. RNA can be single-stranded (ssRNA) or double-stranded (dsRNA), and can include both single and double-stranded (or “duplex”) regions. Single-stranded DNA (or regions thereof) and ssRNA can, if sufficiently complementary, hybridize to form double-stranded DNA/RNA complexes (or regions).

The term “DNA primer” refers to any DNA molecule that may hybridize to a DNA template and be bound by a DNA polymerase and extended in a template-directed process for nucleic acid synthesis. The term “DNA template” refers to any DNA molecule that may be bound by a DNA polymerase and utilized as a template for nucleic acid synthesis.

The term “dATP analogue” refers to an analogue of deoxyadenosine triphosphate (dATP) that is a substrate for a DNA polymerase. The term “dCTP analogue” refers to an analogue of deoxycytidine triphosphate (dCTP) that is a substrate for a DNA polymerase. The term “dGTP analogue” refers to an analogue of deoxyguanosine triphosphate (dGTP) that is a substrate for a DNA polymerase. The term “dNTP analogue” refers to an analogue of deoxynucleoside triphosphate (dNTP) that is a substrate for a DNA polymerase. The term “dTTP analogue” refers to an analogue of deoxythymidine triphosphate (dUTP) that is a substrate for a DNA polymerase. The term “dUTP analogue” refers to an analogue of deoxyuridine triphosphate (dUTP) that is a substrate for a DNA polymerase.

The term “extendible” means, in the context of a nucleotide, primer, or extension product, that the 3′-OH group of the particular molecule is available and accessible to a DNA polymerase for extension or addition of nucleotides derived from dNTPs or dNTP analogues. “Incorporation” means joining of the modified nucleotide to the free 3′ hydroxyl group of a second nucleotide via formation of a phosphodiester linkage with the 5′ phosphate group of the modified nucleotide. The second nucleotide to which the modified nucleotide is joined will typically occur at the 3′ end of a polynucleotide chain. As used herein, the term “incorporating” or “chemically incorporating,” when used in reference to a primer and a nucleotide, refers to the process of joining the nucleotide to the primer or extension product thereof by formation of a phosphodiester bond. As used herein, the term “extension” or “elongation” is used in accordance with their plain and ordinary meanings and refer to synthesis by a polymerase of a new polynucleotide strand (i.e., an “extension strand”) complementary to a template strand by adding free nucleotides (e.g., dNTPs) from a reaction mixture that are complementary to the template in a 5′-to-3′ direction, including condensing a 5′-phosphate group of a dNTPs with a 3′-hydroxy group at the end of the nascent (elongating) DNA strand.

The term “modified nucleotide” refers to nucleotide or nucleotide analogue modified in some manner. Typically, a nucleotide contains a single 5-carbon sugar moiety, a single nitrogenous base moiety and 1 to three phosphate moieties. In particular, embodiments, a nucleotide can include a blocking moiety or a label moiety. A blocking moiety (e.g., a reversible terminator moiety) on a nucleotide prevents formation of a covalent bond between the 3′ hydroxyl moiety of the nucleotide and the 5′ phosphate of another nucleotide. A blocking moiety on a nucleotide can be reversible (i.e., a reversible terminator), whereby the blocking moiety can be removed or modified to allow the 3′ hydroxyl to form a covalent bond with the 5′ phosphate of another nucleotide. A blocking moiety can be effectively irreversible under particular conditions used in a method set forth herein. A label moiety of a nucleotide can be any moiety that allows the nucleotide to be detected, for example, using a spectroscopic method. Exemplary label moieties are fluorescent labels, mass labels, chemiluminescent labels, electrochemical labels, detectable labels and the like. One or more of the above moieties can be absent from a nucleotide used in the methods and compositions set forth herein. For example, a nucleotide can lack a label moiety or a blocking moiety or both.

A “removable” group, e.g., a label or a blocking group or protecting group, refers to a chemical group that can be removed from a dNTP analogue such that a DNA polymerase can extend the nucleic acid (e.g., a primer or extension product) by the incorporation of at least one additional nucleotide. Removal may be by any suitable method, including enzymatic, chemical, or photolytic cleavage. Removal of a removable group, e.g., a blocking group, does not require that the entire removable group be removed, only that a sufficient portion of it be removed such that a DNA polymerase can extend a nucleic acid by incorporation of at least one additional nucleotide using a dNTP of dNTP analogue.

“Reversible blocking groups” or “reversible terminators” include a blocking moiety located, for example, at the 3′ position of the nucleotide and may be a chemically cleavable moiety such as an allyl group, an azidomethyl group or a methoxymethyl group, or may be an enzymatically cleavable group such as a phosphate ester. Suitable nucleotide blocking moieties are described in applications WO 2004/018497, U.S. Pat. Nos. 7,057,026, 7,541,444, WO 96/07669, U.S. Pat. Nos. 5,763,594, 5,808,045, 5,872,244 and 6,232,465 the contents of which are incorporated herein by reference in their entirety. The nucleotides may be labelled or unlabeled. They may be modified with reversible terminators useful in methods provided herein and may be 3′-O-blocked reversible or 3′-unblocked reversible terminators. In nucleotides with 3′-O-blocked reversible terminators, the blocking group —OR [reversible terminating (capping) group] is linked to the oxygen atom of the 3′-OH of the pentose, while the label is linked to the base, which acts as a reporter and can be cleaved. The 3′-O-blocked reversible terminators are known in the art, and may be, for instance, a 3′-ONH₂reversible terminator, a 3′-O-allyl reversible terminator, or a 3′-O-azidomethyl reversible terminator.

3′-O-Blocked Reversible Terminator

embedded image

In embodiments, provided herein are polymerases capable of incorporating three differently sized reversible terminator probes linked to the 3′ oxygen: an A-Term, S-Term, and i-term. A-Term refers to azide-containing terminators (Guo J, et al. PNAS 2008); for example having the formula:

embedded image

S-Term refers to sulfide-containing terminators (WO 2017/058953); for example having the formula

embedded image

wherein R″ is unsubstituted C₁-C₄alkyl. The i-Term probe refers to an isomeric reversible terminator For example, an i-term probe has the formula:

embedded image

wherein R^Aand R^Bare hydrogen or alkyl, wherein at least one of R^Aor R^Bare hydrogen to yield a stereoisomeric probe, and R^Cis the remainder of the reversible terminator.

In embodiments, the nucleotide is

embedded image

wherein Base is a Base as described herein, R³is —OH, monophosphate, or polyphosphate or a nucleic acid, and R′ is a reversible terminator having the formula:

embedded image

wherein R^Aand R^Bare hydrogen or alkyl and R^Cis the remainder of the reversible terminator. In embodiments, the reversible terminator is

embedded image