COMPOSITIONS COMPRISING A VARIANT POLYPEPTIDE AND USES THEREOF

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. Said XML copy, created on Jul. 21, 2023, is named A2186-704810_SL and is 162,069 bytes in size.

BACKGROUND

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated (Cas) genes, collectively known as CRISPR-Cas or CRISPR/Cas systems, are adaptive immune systems in archaea and bacteria that defend particular species against foreign genetic elements.

SUMMARY OF THE INVENTION

It is against the above background that the present invention provides certain advantages and advancements over the prior art.

Although this invention disclosed herein is not limited to specific advantages or functionalities, the invention provides a variant polypeptide comprising an alteration relative to a parent polypeptide of SEQ ID NO: 3, and wherein the alteration is a combination of amino acid substitutions listed in Table 4.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising one or more of the following substitutions: P14R, D32R, I61R, E311R.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising a substitution at one or more of positions P14, D32, I61, E311, T338, and E736 relative to SEQ ID NO: 3.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 90% identity to SEQ ID NO: 3 and comprising a substitution at each of positions P14, D32, I61, E311, T338, and E736 relative to SEQ ID NO: 3. In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising a substitution at each of positions P14, D32, I61, E311, T338, and E736 relative to SEQ ID NO: 3. In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 98% identity to SEQ ID NO: 3 and comprising a substitution at each of positions P14, D32, I61, E311, T338, and E736 relative to SEQ ID NO: 3.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 90% identity to SEQ ID NO: 3 and comprising each of the following substitutions: P14R, D32R, I61R, E311R, T338G, and E736G. In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising each of the following substitutions: P14R, D32R, I61R, E311R, T338G, and E736G. In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 98% identity to SEQ ID NO: 3 and comprising each of the following substitutions: P14R, D32R, I61R, E311R, T338G, and E736G.
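The substitution labels used above (e.g., P14R) name the parent residue, its 1-based position in the parent sequence, and the replacement residue. As an illustration only, the following minimal Python sketch parses such labels and applies them to a sequence. The function names and the 15-residue toy parent are hypothetical and are not part of the disclosure; the toy parent is not SEQ ID NO: 3.

```python
import re

# Label format assumed here: parent residue, 1-based position, new residue (e.g. "P14R").
SUB_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label):
    """Split a label such as 'P14R' into (parent_residue, position, new_residue)."""
    match = SUB_PATTERN.match(label)
    if match is None:
        raise ValueError(f"unrecognized substitution label: {label!r}")
    parent, position, new = match.groups()
    return parent, int(position), new


def apply_substitutions(parent_seq, labels):
    """Return a copy of parent_seq with every labeled substitution applied.

    Raises ValueError when a position is out of range or when the residue
    named in the label is not the residue found at that position.
    """
    residues = list(parent_seq)
    for label in labels:
        parent, position, new = parse_substitution(label)
        if not 1 <= position <= len(residues):
            raise ValueError(f"{label}: position {position} is outside the sequence")
        found = residues[position - 1]
        if found != parent:
            raise ValueError(f"{label}: expected {parent} at position {position}, found {found}")
        residues[position - 1] = new
    return "".join(residues)


# Hypothetical toy parent with P at position 14 (NOT SEQ ID NO: 3).
toy_parent = "M" + "A" * 12 + "P" + "A"
toy_variant = apply_substitutions(toy_parent, ["P14R"])
```

Checking the parent residue before substituting guards against applying a label to a sequence that is numbered differently from the parent.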

In some embodiments, the variant polypeptide comprises a substitution at P14 (e.g., a P14R substitution).

In some embodiments, the variant polypeptide comprises a substitution at E311 (e.g., an E311R substitution).

In some embodiments, the variant polypeptide comprises a substitution at D32 (e.g., a D32R substitution).

In some embodiments, the variant polypeptide comprises a substitution at I61 (e.g., an I61R substitution).

In some embodiments, the variant polypeptide comprises a substitution at T338 (e.g., a T338G substitution).

In some embodiments, the variant polypeptide comprises a substitution at E736 (e.g., an E736G substitution).

In some embodiments, the variant polypeptide comprises a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R substitution), D32 (e.g., a D32R substitution), I61 (e.g., an I61R substitution), T338 (e.g., a T338G substitution), E736 (e.g., an E736G substitution), D55 (e.g., D55G), D590 (e.g., D590G), D145 (e.g., D145G), K35 (e.g., K35G), K221 (e.g., K221G), E154 (e.g., E154G), or any combination thereof. In some embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, D590G, E154G, and E736G. In certain embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, D55G, and E736G. In some embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, D145G, and E736G. In some embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, D590G, D145G, and E736G. In some embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, K35G, and E736G. In certain embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, K221G, and E736G. In certain embodiments, the variant polypeptide comprises each of the following substitutions: P14R, D32R, I61R, E311R, D590G, T338G, and E736G. In some embodiments, the variant polypeptide comprises each of the substitutions listed on any row of Table 8.

In some embodiments, the variant polypeptide comprises a sequence according to SEQ ID NO: 53. In some embodiments, the variant polypeptide comprises a sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 53.

In some embodiments, the variant polypeptide comprises a K at position 208. In some embodiments, the variant polypeptide comprises a D at position 302. In some embodiments, the variant polypeptide comprises a D at position 590. In some embodiments, the variant polypeptide comprises an E at position 154. In some embodiments, the variant polypeptide comprises a D at position 567. In some embodiments, the variant polypeptide comprises an L at position 38. In some embodiments, the variant polypeptide comprises a D at position 145. In some embodiments, the variant polypeptide comprises a C at position 13. In some embodiments, the variant polypeptide comprises a D at position 55. In some embodiments, the variant polypeptide comprises a K at position 221. In some embodiments, the variant polypeptide comprises a K at position 35. In some embodiments, the variant polypeptide comprises a G at position 223. In some embodiments, the variant polypeptide comprises an N at position 109. In some embodiments, the variant polypeptide comprises a D at position 719.

In some embodiments, the variant polypeptide comprises a residue other than M at position M1.

In some embodiments, the variant polypeptide exhibits increased binary complex formation with an RNA guide, relative to a parent polypeptide.

In some embodiments, a binary complex comprising the variant polypeptide exhibits increased stability, relative to a parent binary complex.

In some embodiments, the variant polypeptide exhibits increased nuclease activity, relative to a parent polypeptide.

In one aspect, the disclosure provides a gene editing system comprising the variant polypeptide disclosed herein or a first nucleic acid encoding the variant polypeptide, wherein the gene editing system further comprises an RNA guide or a second nucleic acid encoding the RNA guide, wherein the RNA guide comprises a direct repeat sequence and a spacer sequence.

In one aspect, the disclosure provides a gene editing system comprising a polypeptide disclosed herein or a first nucleic acid encoding the polypeptide, wherein the gene editing system further comprises an RNA guide or a second nucleic acid encoding the RNA guide, wherein the RNA guide comprises a direct repeat sequence having a sequence according to CCUGUUGUGAAUACUCUUUUAUAGGUAUCAAACAAC (SEQ ID NO: 112) or a sequence with at least 80%, 90%, or 95% identity thereto, and a spacer sequence. In some embodiments, the polypeptide is a polypeptide having a sequence according to SEQ ID NO: 3 or having a sequence with at least 80%, 90%, or 95% identity to SEQ ID NO: 3. In some embodiments, the polypeptide is a variant polypeptide as described herein.

In some embodiments, the direct repeat sequence is at least 90% identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 90% identity to SEQ ID NO: 14 or SEQ ID NO: 15.

In some embodiments, the direct repeat sequence is at least 95% identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 95% identity to SEQ ID NO: 14 or SEQ ID NO: 15.

In some embodiments, the direct repeat sequence is any one of SEQ ID NOs: 4-13 or comprises a sequence of SEQ ID NO: 14 or SEQ ID NO: 15.

In some embodiments, the direct repeat sequence comprises a sequence according to CCUGUUGUGAAUACUCUUUUAUAGGUAUCAAACAAC (SEQ ID NO: 112) or a sequence with at least 80%, 90%, or 95% identity thereto.
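As an illustration only, a percent-identity threshold of the kind recited above can be checked with the following minimal Python sketch. It assumes a simplified, ungapped, position-by-position comparison of two sequences of equal length; identity determinations in practice generally rely on a sequence alignment, which this sketch does not perform. The function names and the one-mismatch candidate are hypothetical.

```python
# Direct repeat sequence as recited in the text (SEQ ID NO: 112).
SEQ_ID_NO_112 = "CCUGUUGUGAAUACUCUUUUAUAGGUAUCAAACAAC"


def percent_identity(reference, candidate):
    """Ungapped percent identity between two equal-length sequences.

    Simplifying assumption: no alignment is performed, so insertions and
    deletions are not handled.
    """
    if len(reference) != len(candidate):
        raise ValueError("this sketch only compares sequences of equal length")
    matches = sum(1 for a, b in zip(reference, candidate) if a == b)
    return 100.0 * matches / len(reference)


def meets_identity_threshold(reference, candidate, threshold):
    """True when the candidate has at least `threshold` percent identity."""
    return percent_identity(reference, candidate) >= threshold


# Hypothetical candidate differing from SEQ ID NO: 112 at the first position only.
candidate = "G" + SEQ_ID_NO_112[1:]
passes_95 = meets_identity_threshold(SEQ_ID_NO_112, candidate, 95.0)
```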

In some embodiments, the spacer sequence is about 15 nucleotides to about 35 nucleotides in length.

In some embodiments, the spacer sequence is specific to a target sequence within a target nucleic acid, wherein the target sequence is adjacent to a protospacer adjacent motif (PAM) sequence.

In some embodiments, the PAM sequence is 5′-TTR-3′, 5′-NTTR-3′, 5′-NTTN-3′, 5′-RTTR-3′, 5′-ATTR-3′, or 5′-RTTG-3′, wherein N is any nucleotide, Y is C or T, and R is A or G.

In some embodiments, the PAM sequence is 5′-TTG-3′, 5′-TTA-3′, 5′-ATTG-3′, 5′-TTTA-3′, or 5′-TTTG-3′.
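As an illustration only, the degenerate PAM motifs above can be tested against a candidate sequence by expanding each ambiguity code (N, R, Y) into the nucleotides it stands for. The following minimal Python sketch does so with regular expressions; the function names are hypothetical and sequences are assumed to be written 5′ to 3′.

```python
import re

# Ambiguity codes as defined in the text: N is any nucleotide, Y is C or T, R is A or G.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]", "R": "[AG]", "Y": "[CT]",
}

# Degenerate PAM motifs listed in the text (5' to 3').
PAM_MOTIFS = ["TTR", "NTTR", "NTTN", "RTTR", "ATTR", "RTTG"]


def pam_regex(motif):
    """Compile a degenerate motif into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif))


def pam_matches(motif, candidate):
    """True when the whole candidate sequence is an instance of the motif."""
    return pam_regex(motif).fullmatch(candidate.upper()) is not None


def matching_motifs(candidate):
    """Return every listed motif that the candidate sequence satisfies."""
    return [motif for motif in PAM_MOTIFS if pam_matches(motif, candidate)]
```

For example, under these definitions 5′-TTTG-3′ is an instance of 5′-NTTR-3′ and 5′-NTTN-3′, but not of 5′-RTTG-3′, because the first nucleotide is T rather than A or G.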

In some embodiments, the variant polypeptide further comprises a nuclear localization signal (NLS).

In some embodiments, the NLS is N-terminal or C-terminal of the sequence having at least 98% identity to SEQ ID NO: 3 or SEQ ID NO: 53.

In certain embodiments, the variant polypeptide or gene editing system further comprises a second NLS.

In some embodiments, the NLS is N-terminal of the sequence having at least 98% identity to SEQ ID NO: 3 and the second NLS is C-terminal of the sequence having at least 98% identity to SEQ ID NO: 3.

In certain embodiments, the NLS or the second NLS each independently has an amino acid sequence of an NLS of Table 10, or a sequence having at least 70%, 75%, 80%, 85%, 90%, or 95% identity thereto.

In some embodiments, the variant polypeptide or gene editing system comprises a linker between the NLS and the sequence having at least 98% identity to SEQ ID NO: 3.

In certain embodiments, the variant polypeptide or gene editing system comprises a linker (e.g., a second linker) between the second NLS and the sequence having at least 98% identity to SEQ ID NO: 3.

In some embodiments, the linker or second linker each independently has an amino acid sequence of a linker of Table 10, or a sequence having at least 70%, 75%, 80%, 85%, 90%, or 95% identity thereto.

In certain embodiments, the variant polypeptide has an amino acid sequence of Table 11, or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity thereto.

In some embodiments, the variant polypeptide further comprises a peptide tag, a fluorescent protein, a base-editing domain, a DNA methylation domain, a histone residue modification domain, a localization factor, a transcription modification factor, a light-gated control factor, a chemically inducible factor, or a chromatin visualization factor.

In some embodiments, the gene editing system disclosed herein comprises the first nucleic acid encoding the variant polypeptide.

In some embodiments, the first nucleic acid comprises a nucleic acid sequence of Table 9, or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity thereto.

In certain embodiments, the first nucleic acid comprises a nucleic acid sequence of Table 10, or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity thereto.

In some embodiments, the first nucleic acid comprises a nucleic acid sequence of Table 11, or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity thereto.

In some embodiments, the first nucleic acid is codon-optimized for expression in a cell.

In some embodiments, the first nucleic acid is a messenger RNA (mRNA).

In some embodiments, the first nucleic acid is included in a vector.

In some embodiments, the gene editing system disclosed herein comprises the second nucleic acid encoding the RNA guide.

In some embodiments, the nucleic acid encoding the RNA guide is located in a vector.

In some embodiments, the vector comprises both the first nucleic acid encoding the variant polypeptide and the second nucleic acid encoding the RNA guide.

In some embodiments, the system comprises the first nucleic acid encoding the variant polypeptide, which is located on a first vector, and the second nucleic acid encoding the RNA guide, which is located on a second vector.

In some embodiments, the first and second vector are the same vector.

In some embodiments, the vector comprises a retroviral vector, a lentiviral vector, a phage vector, an adenoviral vector, an adeno-associated vector, or a herpes simplex vector.

In some embodiments, the variant polypeptide or gene editing system is present in a delivery system comprising a nanoparticle (e.g., a lipid nanoparticle), a liposome, an exosome, a microvesicle, or a gene-gun.

The disclosure further provides a cell comprising the variant polypeptide or the gene editing system disclosed herein.

In some embodiments, the cell is a eukaryotic cell.

In some embodiments, the cell is a mammalian cell or a plant cell.

In some embodiments, the cell is a human cell.

The disclosure further provides a method for editing a gene in a cell, the method comprising contacting the cell with the variant polypeptide or gene editing system disclosed herein.

Although this invention disclosed herein is not limited to specific advantages or functionalities, the invention also provides a variant polypeptide comprising an alteration relative to a parent polypeptide of SEQ ID NO: 3, and wherein the alteration is a substitution of Table 2. In some embodiments, the substitution is a P14R substitution, an E311R substitution, a D32R substitution, an I61R substitution, a G223R substitution, an N109R substitution, and/or a D719R substitution.

In certain embodiments, the variant polypeptide comprises a) a P14R substitution, an E311R substitution, and a D32R substitution; b) a P14R substitution, an E311R substitution, and a G223R substitution; c) a P14R substitution, an E311R substitution, a D32R substitution, and an I61R substitution; or d) a D32R substitution, an N109R substitution, an E311R substitution, and a D719R substitution. In some embodiments, the variant polypeptide comprises c) a P14R substitution, an E311R substitution, a D32R substitution, and an I61R substitution.

In some embodiments, the variant polypeptide further comprises a K208G substitution, a D302G substitution, a D590G substitution, an E154G substitution, a D567G substitution, an L38G substitution, a D145G substitution, a C13G substitution, a T338G substitution, a P14G substitution, a D55G substitution, a K221G substitution, a K35G substitution, and an E736G substitution.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising a substitution at one or more of positions P14, E311, D32, I61, G223, N109, and D719 relative to SEQ ID NO: 3.

In one aspect, the disclosure provides a variant polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO: 3 and comprising a substitution at one or more of positions K208, D302, D590, E154, D567, L38, D145, C13, T338, P14, D55, K221, K35, and E736 relative to SEQ ID NO: 3.

In some embodiments, the variant polypeptide comprises a substitution at P14 (e.g., a P14R substitution). In certain embodiments, the variant polypeptide comprises a substitution at E311 (e.g., an E311R substitution). In some embodiments, the variant polypeptide comprises a substitution at D32 (e.g., a D32R substitution). In certain embodiments, the variant polypeptide comprises a substitution at I61 (e.g., an I61R substitution). In some embodiments, the variant polypeptide comprises a substitution at G223 (e.g., a G223R substitution). In certain embodiments, the variant polypeptide comprises a substitution at N109 (e.g., an N109R substitution). In some embodiments, the variant polypeptide comprises a substitution at D719 (e.g., a D719R substitution).

In certain embodiments, the variant polypeptide comprises a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R substitution), D32 (e.g., a D32R substitution), I61 (e.g., an I61R substitution), G223 (e.g., a G223R substitution), N109 (e.g., an N109R substitution), D719 (e.g., a D719R substitution), or any combination thereof.

In some embodiments, the variant polypeptide comprises a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R), and D32 (e.g., a D32R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, D32R variant polypeptide).

In some embodiments, the variant polypeptide comprises a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R), and G223 (e.g., a G223R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, G223R variant polypeptide).

In certain embodiments, the variant polypeptide comprises a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R), D32 (e.g., a D32R substitution), and I61 (e.g., an I61R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, D32R and I61R variant polypeptide).

In some embodiments, the variant polypeptide comprises a substitution at position D32 (e.g., a D32R substitution), N109 (e.g., an N109R substitution), E311 (e.g., an E311R substitution), and D719 (e.g., a D719R substitution) relative to SEQ ID NO: 3 (e.g., a D32R, N109R, E311R and D719R variant polypeptide).

In certain embodiments, the variant polypeptide comprises a substitution at K208 (e.g., a K208G substitution), D302 (e.g., a D302G substitution), D590 (e.g., a D590G substitution), E154 (e.g., an E154G substitution), D567 (e.g., a D567G substitution), L38 (e.g., an L38G substitution), D145 (e.g., a D145G substitution), C13 (e.g., a C13G substitution), T338 (e.g., a T338G substitution), P14 (e.g., a P14G substitution), D55 (e.g., a D55G substitution), K221 (e.g., a K221G substitution), K35 (e.g., a K35G substitution), and E736 (e.g., an E736G substitution), or any combination thereof.

In particular embodiments, the variant polypeptide exhibits increased binary complex formation with an RNA guide, relative to a parent polypeptide. In certain embodiments, a binary complex comprising the variant polypeptide exhibits increased stability, relative to a parent binary complex.

In some embodiments, the variant polypeptide exhibits increased nuclease activity, relative to a parent polypeptide.

In one aspect, the disclosure provides a composition comprising the variant polypeptide described herein, wherein the composition further comprises an RNA guide or a nucleic acid encoding the RNA guide, wherein the RNA guide comprises a direct repeat sequence and a spacer sequence. In certain embodiments, the direct repeat sequence is at least 90% (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 90% (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) identity to SEQ ID NO: 14 or SEQ ID NO: 15. In some embodiments, the direct repeat sequence is at least 95% (e.g., 95%, 96%, 97%, 98%, 99% or 100%) identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 95% (e.g., 95%, 96%, 97%, 98%, 99% or 100%) identity to SEQ ID NO: 14 or SEQ ID NO: 15. In some embodiments, the direct repeat sequence is any one of SEQ ID NOs: 4-13 or comprises a sequence of SEQ ID NO: 14 or SEQ ID NO: 15.

In some embodiments, the spacer sequence is about 15 nucleotides to about 35 nucleotides in length.

In certain embodiments, the spacer sequence binds to a target strand sequence of a target nucleic acid, and wherein a non-target strand sequence of the target nucleic acid sequence is adjacent to a protospacer adjacent motif (PAM) sequence. In some embodiments, the PAM sequence is 5′-TTR-3′, 5′-NTTR-3′, 5′-NTTN-3′, 5′-RTTR-3′, 5′-ATTR-3′, or 5′-RTTG-3′, wherein N is any nucleotide, Y is C or T, and R is A or G. In certain embodiments, the PAM sequence is 5′-TTG-3′, 5′-TTA-3′, 5′-ATTG-3′, 5′-TTTA-3′, or 5′-TTTG-3′.

In certain embodiments, the variant polypeptide further comprises a nuclear localization signal (NLS).

In one aspect, the disclosure provides a composition comprising a nucleic acid that encodes the variant polypeptide and/or the RNA guide described anywhere herein. In some embodiments, the nucleic acid is codon-optimized for expression in a cell. In certain embodiments, the nucleic acid is operably linked to a promoter. In some embodiments, the nucleic acid is in a vector. In certain embodiments, the vector comprises a retroviral vector, a lentiviral vector, a phage vector, an adenoviral vector, an adeno-associated vector, or a herpes simplex vector.

In some embodiments, the variant polypeptide is present in a delivery system comprising a nanoparticle (e.g., a lipid nanoparticle), a liposome, an exosome, a microvesicle, or a gene-gun.

In one aspect, the disclosure provides a cell comprising the variant polypeptide or the composition of any previous aspect or embodiment.

In some embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell or a plant cell. In certain embodiments, the cell is a human cell.

In one aspect, the disclosure provides a composition comprising a variant polypeptide or a complex comprising the variant polypeptide, wherein the variant polypeptide comprises an alteration relative to a parent polypeptide of SEQ ID NO: 3, and wherein the variant polypeptide or the complex exhibits enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability, relative to a parent polypeptide or a complex comprising the parent polypeptide.

In some embodiments, the alteration is a substitution of Table 2.

In certain embodiments, the substitution is a P14R substitution, an E311R substitution, a D32R substitution, an I61R substitution, a G223R substitution, an N109R substitution, and/or a D719R substitution.

In some embodiments, the variant polypeptide comprises a) a P14R substitution, an E311R substitution, and a D32R substitution; b) a P14R substitution, an E311R substitution, and a G223R substitution; c) a P14R substitution, an E311R substitution, a D32R substitution, and an I61R substitution; or d) a D32R substitution, an N109R substitution, an E311R substitution, and a D719R substitution.

In certain embodiments, the variant polypeptide further comprises a K208G substitution, a D302G substitution, a D590G substitution, an E154G substitution, a D567G substitution, an L38G substitution, a D145G substitution, a C13G substitution, a T338G substitution, a P14G substitution, a D55G substitution, a K221G substitution, a K35G substitution, and an E736G substitution.

In some embodiments, the enhanced enzymatic activity is enhanced nuclease activity.

In certain embodiments, the variant polypeptide exhibits enhanced binding activity to an RNA guide, relative to the parent polypeptide.

In some embodiments, the variant polypeptide exhibits enhanced binding specificity to an RNA guide, relative to the parent polypeptide.

In some embodiments, the complex comprising the variant polypeptide is a variant binary complex that further comprises an RNA guide, and the variant binary complex exhibits enhanced binding activity to a target nucleic acid (e.g., on-target binding activity), relative to a parent binary complex.

In still another embodiment, the complex comprising the variant polypeptide is a variant binary complex that further comprises an RNA guide, and the variant binary complex exhibits enhanced binding specificity to a target nucleic acid (e.g., on-target binding specificity), relative to a parent binary complex.

In certain embodiments, the variant binary complex and a target nucleic acid form a variant ternary complex, and the variant ternary complex exhibits increased stability, relative to a parent ternary complex.

In some embodiments, the variant polypeptide further exhibits enhanced binary complex formation, enhanced protein-RNA interactions, and/or decreased dissociation from an RNA guide, relative to the parent polypeptide.

In certain embodiments, the variant binary complex further exhibits decreased dissociation from a target nucleic acid, and/or decreased off-target binding to a non-target nucleic acid, relative to the parent binary complex.

In some embodiments, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur over a range of temperatures, e.g., 20° C. to 65° C. (e.g., 20° C. to 30° C., 30° C. to 40° C., 40° C. to 50° C., 50° C. to 60° C., or 60° C. to 65° C.).

In certain embodiments, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur over a range of incubation times.

In some embodiments, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur in a buffer having a pH in a range of about 7.3 to about 8.6 (e.g., about 7.5 to about 8.0, about 7.8 to 8.3, or about 8.0 to 8.6).

In certain embodiments, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occurs when a T_m value of the variant polypeptide, variant binary complex, or variant ternary complex is at least 8° C. greater than the T_m value of the parent polypeptide, parent binary complex, or parent ternary complex.
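As an illustration only, the melting temperature comparison above reduces to a difference between two measured values. The following minimal Python sketch expresses that comparison; the function names and the example temperatures are hypothetical and are not measurements from the disclosure.

```python
def tm_shift(variant_tm_c, parent_tm_c):
    """Difference in melting temperature (variant minus parent), in degrees Celsius."""
    return variant_tm_c - parent_tm_c


def meets_tm_criterion(variant_tm_c, parent_tm_c, min_shift_c=8.0):
    """True when the variant T_m is at least min_shift_c above the parent T_m."""
    return tm_shift(variant_tm_c, parent_tm_c) >= min_shift_c


# Made-up example values, for illustration only.
example_shift = tm_shift(54.5, 45.0)
example_passes = meets_tm_criterion(54.5, 45.0)
```

The same comparison applies whether the two values are measured for the polypeptides alone, for the binary complexes, or for the ternary complexes, provided that like is compared with like.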

In some embodiments, the variant polypeptide comprises a RuvC domain or a split RuvC domain.

In certain embodiments, the parent polypeptide comprises the sequence of SEQ ID NO: 3.

In some embodiments, the RNA guide comprises a direct repeat sequence and a spacer sequence.

In some embodiments, the direct repeat sequence is at least 90% (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 90% (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) identity to SEQ ID NO: 14 or SEQ ID NO: 15.

In some embodiments, the direct repeat sequence is at least 95% (e.g., 95%, 96%, 97%, 98%, 99% or 100%) identical to any one of SEQ ID NOs: 4-13 or comprises a sequence having at least 95% (e.g., 95%, 96%, 97%, 98%, 99% or 100%) identity to SEQ ID NO: 14 or SEQ ID NO: 15. In some embodiments, the direct repeat sequence is any one of SEQ ID NOs: 4-13 or comprises a sequence of SEQ ID NO: 14 or SEQ ID NO: 15.

In certain embodiments, the spacer sequence is between 15 and 35 nucleotides in length.

In certain embodiments, the spacer sequence comprises complementarity to a target strand sequence of a target nucleic acid.

In some embodiments, the target nucleic acid comprises a non-target strand sequence adjacent to a protospacer adjacent motif (PAM) sequence. In certain embodiments, the PAM sequence is 5′-TTR-3′, 5′-NTTR-3′, 5′-NTTN-3′, 5′-RTTR-3′, 5′-ATTR-3′, or 5′-RTTG-3′, wherein N is any nucleotide, Y is C or T, and R is A or G. In some embodiments, the PAM sequence is 5′-TTG-3′, 5′-TTA-3′, 5′-ATTG-3′, 5′-TTTA-3′, or 5′-TTTG-3′.

In certain embodiments, the variant polypeptide further comprises a peptide tag, a fluorescent protein, a base-editing domain, a DNA methylation domain, a histone residue modification domain, a localization factor, a transcription modification factor, a light-gated control factor, a chemically inducible factor, or a chromatin visualization factor.

In one aspect, the disclosure provides a composition comprising a nucleic acid that encodes the variant polypeptide of the previous aspect or embodiments thereof, wherein optionally the nucleic acid is codon-optimized for expression in a cell.

In some embodiments, the cell is a eukaryotic cell.

In some embodiments, the cell is a mammalian cell or a plant cell. In certain embodiments, the cell is a human cell.

In some embodiments, the nucleic acid encoding the variant polypeptide is operably linked to a promoter.

In certain embodiments, the nucleic acid encoding the variant polypeptide is in a vector.

In some embodiments, the vector comprises a retroviral vector, a lentiviral vector, a phage vector, an adenoviral vector, an adeno-associated vector, or a herpes simplex vector.

In certain embodiments, the composition is present in a delivery composition comprising a nanoparticle (e.g., a lipid nanoparticle), a liposome, an exosome, a microvesicle, or a gene-gun.

In one aspect, the disclosure provides a method for editing a gene in a cell, the method comprising contacting the cell with the variant polypeptide or composition of any one of the previous aspects or embodiments.

In one aspect, the disclosure provides a nucleic acid molecule encoding a variant polypeptide of any of the previous aspects or embodiments.

In certain embodiments, the sequence of the nucleic acid molecule is 95% identical to a sequence selected from the group consisting of SEQ ID NOs: 1, 2, or 21-27. In some embodiments, the sequence of the nucleic acid molecule comprises a sequence selected from the group consisting of SEQ ID NOs: 1, 2, or 21-27. In certain embodiments, the sequence of the nucleic acid molecule is 95% identical to a sequence selected from the group consisting of SEQ ID NOs: 22, 23, or 25.

Although this invention disclosed herein is not limited to specific advantages or functionalities, the invention provides a variant polypeptide, and/or a composition comprising a variant polypeptide, wherein the variant polypeptide comprises an alteration relative to the parent polypeptide of SEQ ID NO: 3, and wherein the variant polypeptide or a complex comprising the variant polypeptide exhibits enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability relative to the parent polypeptide or a complex comprising the parent polypeptide.

In some aspects, the enhanced enzymatic activity is enhanced nuclease activity.

In some aspects, the variant polypeptide exhibits enhanced binding activity to an RNA guide relative to the parent polypeptide.

In some aspects, the variant polypeptide exhibits enhanced binding specificity to an RNA guide relative to the parent polypeptide.

In some aspects, the variant polypeptide and an RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced binding activity to a target nucleic acid (e.g., on-target binding activity) relative to a parent binary complex.

In some aspects, the variant polypeptide and an RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced binding specificity to a target nucleic acid (e.g., on-target binding specificity) relative to a parent binary complex.

In some aspects, the variant polypeptide and an RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced stability relative to a parent binary complex.

In some aspects, the variant binary complex and a target nucleic acid form a variant ternary complex, and the variant ternary complex exhibits increased stability relative to a parent ternary complex.

In some aspects, the variant polypeptide further exhibits enhanced binary complex formation, enhanced protein-RNA interactions, and/or decreased dissociation from an RNA guide relative to the parent polypeptide.

In some aspects, the variant binary complex further exhibits decreased dissociation from the target nucleic acid, and/or decreased off-target binding to a non-target nucleic acid relative to the parent binary complex.

In some aspects, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur over a range of temperatures, e.g., 20° C. to 65° C. (e.g., 20° C. to 30° C., 30° C. to 40° C., 40° C. to 50° C., 50° C. to 60° C., or 60° C. to 65° C.).

In some aspects, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur over a range of incubation times.

In some aspects, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur in a buffer having a pH in a range of about 7.3 to about 8.6 (e.g., about 7.5 to about 8.0, about 7.8 to about 8.3, or about 8.0 to about 8.6).

In some aspects, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur when a T_m value of the variant polypeptide, variant binary complex, or variant ternary complex is at least 8° C. greater than the T_m value of the parent polypeptide, parent binary complex, or parent ternary complex.

In other aspects, the alteration comprises an amino acid sequence alteration relative to the parent polypeptide having the sequence set forth in SEQ ID NO: 3, wherein the alteration comprises one or more (e.g., one, two, three, four, five, or more) substitutions, insertions, deletions, and/or additions as compared to the parent polypeptide having the sequence set forth in SEQ ID NO: 3.

In some aspects, the alteration comprises an amino acid sequence alteration relative to the parent polypeptide sequence set forth in SEQ ID NO: 3, wherein the alteration comprises one or more of the amino acid substitutions listed in Table 2.

In some aspects, the alteration comprises an arginine, lysine, glutamine, asparagine, histidine, alanine, or glycine substitution.

In some aspects, the alteration is an amino acid substitution selected from P14R, E311R, D32R, I61R, G223R, N109R, and/or D719R.
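As an illustration of the substitution notation used above (original residue, 1-based position, replacement residue, e.g., P14R), the following minimal Python sketch parses such a label and applies it to a sequence string. The function names are hypothetical, and the toy sequence is a placeholder that is not SEQ ID NO: 3.

```python
import re

# Hypothetical helpers for illustration only; the toy sequence below is a
# placeholder and is NOT SEQ ID NO: 3.
SUB_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label):
    """Split a label such as 'P14R' into (original, position, replacement)."""
    match = SUB_PATTERN.match(label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    original, position, replacement = match.groups()
    return original, int(position), replacement


def apply_substitution(sequence, label):
    """Return a copy of `sequence` with one substitution applied.

    Positions are 1-based, as in the labels. The residue found at the
    position must equal the original residue named in the label.
    """
    original, position, replacement = parse_substitution(label)
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[position - 1] != original:
        raise ValueError(
            f"expected {original} at position {position}, "
            f"found {sequence[position - 1]}"
        )
    return sequence[: position - 1] + replacement + sequence[position:]


toy_parent = "MKTAYIAKQRQISFVKSHFSRQ"  # placeholder sequence
toy_variant = apply_substitution(toy_parent, "K2R")
```

Checking the residue named in the label against the sequence guards against applying a label to the wrong parent or with an off-by-one position.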

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position P14 (e.g., a P14R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position P14 (e.g., a P14R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position E311 (e.g., an E311R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position E311 (e.g., an E311R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D32 (e.g., a D32R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D32 (e.g., a D32R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position I61 (e.g., an I61R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position I61 (e.g., an I61R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position G223 (e.g., a G223R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position G223 (e.g., a G223R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position N109 (e.g., an N109R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position N109 (e.g., an N109R substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D719 (e.g., a D719R substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D719 (e.g., a D719R substitution) relative to SEQ ID NO: 3.
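The two criteria recited in the preceding paragraphs (a minimum percent identity to a parent sequence, and a substitution at a named position) can be sketched as follows. This is a simplified illustration that assumes the variant and parent have equal length and differ only by substitutions; percent identity is ordinarily computed from a sequence alignment, which this sketch does not perform. The function names and the 20-residue toy sequences are hypothetical and are not SEQ ID NO: 3.

```python
def percent_identity(parent, variant):
    """Percent of positions with identical residues (equal-length inputs only)."""
    if len(parent) != len(variant):
        raise ValueError("this sketch handles substitution-only variants")
    identical = sum(1 for a, b in zip(parent, variant) if a == b)
    return 100.0 * identical / len(parent)


def differing_positions(parent, variant):
    """Return the 1-based positions at which the two sequences differ."""
    if len(parent) != len(variant):
        raise ValueError("this sketch handles substitution-only variants")
    return [i for i, (a, b) in enumerate(zip(parent, variant), start=1) if a != b]


def meets_criteria(parent, variant, position, min_identity=95.0):
    """True if identity is at least `min_identity` and `position` is substituted."""
    return (
        percent_identity(parent, variant) >= min_identity
        and position in differing_positions(parent, variant)
    )


# Toy 20-residue sequences with a proline at position 14 (placeholders).
toy_parent = "MKTAYIAKQRQISPVKSHFS"
toy_variant = "MKTAYIAKQRQISRVKSHFS"  # P14R applied to the toy parent
```

The length of the list returned by `differing_positions` also gives the number of altered positions, which corresponds to the "up to 1, 2, 3, ... or 35 amino acid positions" formulation under the same substitution-only assumption.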

In some aspects, the alteration is a combination of amino acid substitutions listed in Table 3.

In some aspects, the combination of amino acid substitutions comprises the substitutions set forth in a) P14R, E311R, D32R; b) P14R, E311R, G223R; c) P14R, E311R, D32R, I61R; or d) D32R, N109R, E311R, D719R.
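The combinations a) through d) above can be represented as sets of substitution labels, as in the following minimal Python sketch. The names `COMBINATIONS` and `matching_combinations` are hypothetical. The sketch treats a combination as present whenever all of its substitutions are found, so a variant carrying additional substitutions still matches; that reading is an assumption made for illustration.

```python
# Combinations a) through d) as listed in the text above.
COMBINATIONS = {
    "a": frozenset({"P14R", "E311R", "D32R"}),
    "b": frozenset({"P14R", "E311R", "G223R"}),
    "c": frozenset({"P14R", "E311R", "D32R", "I61R"}),
    "d": frozenset({"D32R", "N109R", "E311R", "D719R"}),
}


def matching_combinations(substitutions):
    """Return the keys of every combination fully contained in `substitutions`."""
    found = set(substitutions)
    return sorted(key for key, combo in COMBINATIONS.items() if combo <= found)


# A variant carrying all four substitutions of c) also contains those of a).
example = matching_combinations(["P14R", "E311R", "D32R", "I61R"])
```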

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R substitution), and D32 (e.g., a D32R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, D32R variant polypeptide).

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R substitution), and G223 (e.g., a G223R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, G223R variant polypeptide).

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position P14 (e.g., a P14R substitution), E311 (e.g., an E311R substitution), D32 (e.g., a D32R substitution), and I61 (e.g., an I61R substitution) relative to SEQ ID NO: 3 (e.g., a P14R, E311R, D32R and I61R variant polypeptide).

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D32 (e.g., a D32R substitution), N109 (e.g., an N109R substitution), E311 (e.g., an E311R substitution), and D719 (e.g., a D719R substitution) relative to SEQ ID NO: 3 (e.g., a D32R, N109R, E311R and D719R variant polypeptide).

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position K208 (e.g., a K208G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position K208 (e.g., a K208G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D302 (e.g., a D302G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D302 (e.g., a D302G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D590 (e.g., a D590G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D590 (e.g., a D590G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position E154 (e.g., an E154G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position E154 (e.g., an E154G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D567 (e.g., a D567G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D567 (e.g., a D567G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position L38 (e.g., an L38G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position L38 (e.g., an L38G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D145 (e.g., a D145G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D145 (e.g., a D145G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position C13 (e.g., a C13G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position C13 (e.g., a C13G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position T338 (e.g., a T338G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position T338 (e.g., a T338G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position P14 (e.g., a P14G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position P14 (e.g., a P14G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position D55 (e.g., a D55G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position D55 (e.g., a D55G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position K221 (e.g., a K221G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position K221 (e.g., a K221G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position K35 (e.g., a K35G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position K35 (e.g., a K35G substitution) relative to SEQ ID NO: 3.

In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having at least 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 3 and comprising a substitution at position E736 (e.g., an E736G substitution) relative to SEQ ID NO: 3. In some aspects, the present disclosure provides a polypeptide comprising an amino acid sequence having one or more sequence alterations (e.g., substitutions, insertions, or deletions, or any combination thereof) at up to 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 amino acid positions of SEQ ID NO: 3, wherein one of the sequence alterations comprises a substitution at position E736 (e.g., an E736G substitution) relative to SEQ ID NO: 3.

In some aspects, the variant polypeptide comprises a RuvC domain or a split RuvC domain.

In some aspects, the variant polypeptide comprises one or more catalytic residues (e.g., aspartic acid or glutamic acid). In some aspects, the one or more catalytic residues comprise D328 and E530. In some aspects, the one or more catalytic residues further comprise D684, D646, or D621.

In some aspects, the composition or complex comprising the variant polypeptide further comprises an RNA guide, and the RNA guide comprises a direct repeat sequence and a spacer sequence.

In some aspects, the RNA guide comprises a direct repeat sequence and a spacer sequence.

In some aspects, the direct repeat sequence comprises a nucleotide sequence with at least 95% sequence identity to any one of SEQ ID NOs: 4-13.

In some aspects, the direct repeat sequence comprises the nucleotide sequence of any one of SEQ ID NOs: 4-13.

In some aspects, the spacer sequence is between 15 and 35 nucleotides in length.

In some aspects, the target nucleic acid comprises a sequence complementary to a nucleotide sequence in the spacer sequence.
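The spacer length range and the complementarity between spacer and target described above can be illustrated with the following minimal Python sketch. It assumes an RNA spacer and a DNA target strand, both written 5' to 3', perfect Watson-Crick pairing across the full spacer, and an inclusive reading of "between 15 and 35". The function names and the example sequences are hypothetical.

```python
# RNA base -> complementary DNA base (Watson-Crick pairing).
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def spacer_length_ok(spacer, minimum=15, maximum=35):
    """True if the spacer length falls in the assumed inclusive range."""
    return minimum <= len(spacer) <= maximum


def complementary_dna(spacer):
    """Return the DNA sequence (5' to 3') that pairs with an RNA spacer."""
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(spacer.upper()))


def target_contains_complement(spacer, target_strand):
    """True if the DNA strand contains a site fully complementary to the spacer."""
    return complementary_dna(spacer) in target_strand.upper()


toy_spacer = "GACUUACGGAUCCAUGGCAU"  # 20-nucleotide placeholder spacer
toy_target = "TTT" + complementary_dna(toy_spacer) + "GGG"  # placeholder target
```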

In some aspects, the target nucleic acid is adjacent to a protospacer adjacent motif (PAM) sequence, wherein the PAM sequence comprises a nucleotide sequence set forth as 5′-NTTR-3′, 5′-NTTN-3′, 5′-RTTR-3′, 5′-ATTR-3′, or 5′-RTTG-3′, wherein N is any nucleotide and R is A or G. In some aspects, the PAM sequence comprises a nucleotide sequence set forth as 5′-GTTA-3′, 5′-TTTG-3′, 5′-CTTG-3′, 5′-GTTG-3′, 5′-TTTA-3′, 5′-CTTA-3′, 5′-ATTG-3′, 5′-ATTA-3′, 5′-ACTG-3′, 5′-CATA-3′, 5′-TTGA-3′, or 5′-TATA-3′.
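The degenerate PAM motifs above use the codes N (any nucleotide) and R (A or G), as defined in the text. The following minimal Python sketch shows one way to test a specific four-nucleotide sequence against those motifs. The names `matches_motif` and `motifs_matched` are hypothetical.

```python
# Nucleotide codes as defined in the text: N is any nucleotide, R is A or G.
CODES = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "N": "ACGT"}

# Degenerate motifs recited in the text, written 5' to 3'.
DEGENERATE_MOTIFS = ["NTTR", "NTTN", "RTTR", "ATTR", "RTTG"]


def matches_motif(pam, motif):
    """True if every base of `pam` is allowed by the code at that motif position."""
    if len(pam) != len(motif):
        return False
    return all(
        base in CODES[code] for base, code in zip(pam.upper(), motif.upper())
    )


def motifs_matched(pam):
    """Return the degenerate motifs that a specific PAM sequence satisfies."""
    return [motif for motif in DEGENERATE_MOTIFS if matches_motif(pam, motif)]


example = motifs_matched("GTTA")
```

For instance, 5'-GTTA-3' satisfies NTTR, NTTN, and RTTR, but not ATTR (the first base is not A) or RTTG (the last base is not G).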

In some aspects, the target nucleic acid is single-stranded DNA or double-stranded DNA.

In some aspects, the variant polypeptide further comprises a peptide tag, a fluorescent protein, a base-editing domain, a DNA methylation domain, a histone residue modification domain, a localization factor, a transcription modification factor, a light-gated control factor, a chemically inducible factor, or a chromatin visualization factor.

In some aspects, a nucleic acid encoding the variant polypeptide is codon-optimized for expression in a cell.

In some aspects, the nucleic acid encoding the variant polypeptide is operably linked to a promoter.

In some aspects, the nucleic acid encoding the variant polypeptide is in a vector.

In some aspects, the vector comprises a retroviral vector, a lentiviral vector, a phage vector, an adenoviral vector, an adeno-associated vector, or a herpes simplex vector.

In some aspects, the composition is present in a delivery composition comprising a nanoparticle, a liposome, an exosome, a microvesicle, or a gene-gun.

The invention further provides a cell comprising the variant polypeptide and/or the composition disclosed herein. In some aspects, the cell is a eukaryotic cell or a prokaryotic cell. In some aspects, the cell is a mammalian cell or a plant cell. In some aspects, the cell is a human cell.

The invention further provides a method of preparing the variant polypeptide and/or the composition disclosed herein.

The invention further provides a method of complexing the variant polypeptide with the RNA guide disclosed herein.

The invention further provides a method of complexing the variant binary complex with the target nucleic acid disclosed herein.

The invention further provides a method of delivering the variant polypeptide and/or the composition disclosed herein.

The invention yet further provides a composition comprising a variant polypeptide, or a complex comprising the variant polypeptide and an RNA guide, wherein the variant polypeptide comprises an alteration relative to the parent polypeptide of SEQ ID NO: 3, and wherein the variant polypeptide or the complex exhibits enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability relative to a parent polypeptide or a complex comprising the parent polypeptide and the RNA guide.

In some aspects, the enhanced enzymatic activity is enhanced nuclease activity.

In some aspects, the variant polypeptide exhibits enhanced binding activity to the RNA guide relative to the parent polypeptide.

In some aspects, the variant polypeptide exhibits enhanced binding specificity to the RNA guide relative to the parent polypeptide.

In some aspects, the variant polypeptide and the RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced binding activity to a target nucleic acid (e.g., on-target binding activity) relative to a parent binary complex.

In some aspects, the variant polypeptide and the RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced binding specificity to a target nucleic acid (e.g., on-target binding specificity) relative to a parent binary complex.

In some aspects, the variant polypeptide and the RNA guide form a variant binary complex, and the variant binary complex exhibits enhanced stability relative to a parent binary complex.

In some aspects, the variant polypeptide further exhibits enhanced binary complex formation, enhanced protein-RNA interactions, and/or decreased dissociation from the RNA guide relative to the parent polypeptide.

In some aspects, the enhanced enzymatic activity, enhanced binding activity, enhanced binding specificity, and/or enhanced stability occur over a range of temperatures, e.g., 20° C. to 65° C.