The field is molecular biology, and more specifically, methods for editing the genome of a plant cell to produce northern leaf blight resistant corn.
The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII formatted sequence listing with a file named 7156-US-PCT_ST25.txt created on May 3, 2022 and having a size of 241,817 bytes and is filed concurrently with the specification. The sequence listing contained in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety.
Northern leaf blight (NLB), induced by the fungal pathogen Exserohilum turcicum (previously called Helminthosporium turcicum), is a serious foliar wilt disease of maize in many tropical and temperate environments. Symptoms can range from cigar-shaped lesions on the lower leaves to complete destruction of the foliage, thereby reducing the amount of leaf surface area available for photosynthesis. A reduction in photosynthetic capability leads to a lack of carbohydrates needed for grain fill, which impacts grain yield. Mid-altitude regions of the tropics, about 900-1600 m above sea level, have a particularly favorable climate for northern leaf blight, as dew periods are long and temperatures moderate. However, northern leaf blight can also cause yield losses of 30-50% in temperate environments, such as in the United States, during wet seasons, particularly if the infection is established on the upper leaves of the plant by the silking stage.
The most effective and most preferred method of control for northern leaf blight is the planting of resistant hybrids. Resistance to specific races of the pathogen can be controlled by certain native disease resistance maize genes, such as Ht1, Ht2, Ht3, Htm1, Htn1, HtN, HtP, ht4 and rt (Welz and Geiger 2000. Plant Breeding. 119(1):1-14; Ogliari et al. 2005. Genet Mol Biol 28:435-439; Hurni et al. 2015 PNAS 112(28):8780-5). However, introgressing the resistance genes into other inbreds is an arduous task, which may or may not come with a yield penalty due to linkage drag. There is a need to produce northern leaf blight resistant maize plants more efficiently and in a way that will reduce linkage drag associated with the introgression of multiple resistance loci into elite maize lines via conventional means.
The limitations of conventional breeding for introgressing northern leaf blight resistance into maize lines can be overcome through the editing of genes that confer enhanced resistance to northern leaf blight, such as, for example, Ht1 and NLB18, or by the movement of resistant alleles of Ht1 and NLB18 to another site in the genome such that enhanced resistance to northern leaf blight can be obtained by introgressing a single genomic locus comprising multiple nucleotide sequences, each conferring enhanced resistance to northern leaf blight, into maize plants.
Methods for obtaining a maize plant cell with a modified Ht1 nucleotide sequence are provided herein. The methods include: introducing a double-strand break or site-specific modification at one or more target sites in an endogenous HT1 encoding sequence in a maize plant cell and obtaining a maize plant cell having a modified Ht1 nucleotide sequence. The method may further comprise introducing an Ht1 substitution template in the maize plant cell, wherein said Ht1 substitution template comprises at least one nucleic acid alteration compared to the endogenous HT1 encoding sequence and wherein said Ht1 substitution template is incorporated into the endogenous HT1 encoding sequence. The double-strand break may be induced by a nuclease such as, but not limited to, a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the modified Ht1 nucleotide sequence, and the maize plant may exhibit enhanced resistance to northern leaf blight.
In some aspects, the modified Ht1 nucleotide sequence comprises a deletion in the promoter of the endogenous HT1 encoding sequence. The methods may involve the use of Cas9 endonuclease and one or more guide RNAs. In one embodiment, at least two guide RNAs are used, wherein a first guide RNA comprises a variable targeting domain that is complementary to SEQ ID NO:1 [Ht1-TS2], and a second guide RNA comprises a variable targeting domain complementary to SEQ ID NO:2 [Ht1-TS4]. In another embodiment, a first guide RNA comprises a variable targeting domain that is complementary to SEQ ID NO:1 [Ht1-TS2], and a second guide RNA comprises a variable targeting domain complementary to SEQ ID NO:3 [Ht1-ST1-TS1].
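As a rough illustration of how a pair of guide RNAs can yield a promoter deletion, the following sketch removes the segment between two cut positions in a toy sequence. The sequence, the target-site strings, and the function names `cut_index` and `delete_between_targets` are hypothetical and are not taken from the sequence listing. The sketch assumes that each cut falls 3 nucleotides from the 3′ end of its target site (as is commonly reported for Streptococcus pyogenes Cas9), that both target sites lie on the same strand, and that repair joins the two ends precisely.

```python
def cut_index(genome: str, target: str, offset_from_3prime: int = 3) -> int:
    """Return the index in `genome` of the assumed blunt cut within `target`."""
    start = genome.find(target)
    if start < 0:
        raise ValueError(f"target site not found: {target}")
    # Assumption: the cut lies `offset_from_3prime` nucleotides before the
    # 3' end of the target site.
    return start + len(target) - offset_from_3prime


def delete_between_targets(genome: str, target_a: str, target_b: str) -> str:
    """Join the sequence upstream of the first cut to that downstream of the second."""
    first, second = sorted((cut_index(genome, target_a), cut_index(genome, target_b)))
    return genome[:first] + genome[second:]


# Hypothetical toy sequence: flank + target A + intervening region + target B + flank.
toy = "AAAA" + "GATTACAGATTACA" + "CCCCCCCC" + "TTGACCTTGACCTT" + "GGGG"
edited = delete_between_targets(toy, "GATTACAGATTACA", "TTGACCTTGACCTT")
```

In practice, repair at the cut sites is often imprecise, so the exact junction would be confirmed by sequencing rather than predicted in this way.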
In other aspects, an Ht1 substitution template is used, which comprises an Ht1 nucleotide sequence from PH4GP (Ht1-PH4GP). The Ht1-PH4GP nucleotide sequence may comprise SEQ ID NO:59. In another embodiment, the Ht1-PH4GP nucleotide sequence may comprise a nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:52, wherein the polypeptide confers resistance to northern leaf blight. The Ht1-PH4GP nucleotide sequence may also comprise SEQ ID NO:65. The methods may involve the use of Cas9 endonuclease and one or more guide RNAs. In an embodiment, at least two guide RNAs are used, wherein a first guide RNA comprises a variable targeting domain that is complementary to SEQ ID NO:14 [Ht1-TS6], and a second guide RNA comprises a variable targeting domain complementary to SEQ ID NO:16 [Ht1-TS9]. In another embodiment, a first guide RNA comprises a variable targeting domain complementary to SEQ ID NO:15 [Ht1-TS7], and a second guide RNA comprises a variable targeting domain that is complementary to SEQ ID NO:17 [Ht1-TS10].
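The percentage thresholds recited above can be illustrated with a minimal sketch. It assumes that the two amino acid sequences have already been aligned to equal length, with "-" marking gaps, and that identity is counted over all aligned columns; alignment programs may define the denominator differently. The sequences and the function name `percent_identity` are hypothetical and do not correspond to any SEQ ID NO.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue (gap columns never match)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


# Hypothetical 20-residue sequences differing at the final position.
reference = "MKTLLVAGSAQWERTYIPLK"
variant = "MKTLLVAGSAQWERTYIPLR"

identity = percent_identity(reference, variant)
meets_90_percent = identity >= 90.0
```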
Methods for obtaining a maize plant cell with a modified NLB18 nucleotide sequence are provided herein. The methods include: introducing a double-strand break or site-specific modification at one or more target sites in an endogenous NLB18 encoding sequence in a maize plant cell and obtaining a maize plant cell having a modified NLB18 nucleotide sequence. The methods may further comprise introducing an NLB18 substitution template in the maize plant cell, wherein said NLB18 substitution template comprises at least one nucleic acid alteration compared to the endogenous NLB18 encoding sequence and wherein said NLB18 substitution template is incorporated into the endogenous NLB18 encoding sequence. The double-strand break may be induced by a nuclease such as but not limited to a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the modified NLB18 nucleotide sequence, and the maize plant may exhibit enhanced resistance to northern leaf blight.
In some aspects, the modified NLB18 nucleotide sequence comprises a modification in the promoter of the endogenous NLB18 encoding sequence. In some embodiments, the modification in the promoter of an endogenous Ht1 encoding sequence comprises a deletion of a region of repetitive sequences in the Ht1 promoter. In one embodiment, the modification in the promoter of an endogenous Ht1 encoding sequence comprises a deletion of SEQ ID NO: 71 from the Ht1 promoter.
In other aspects, an NLB18 substitution template is used, which comprises an NLB18 nucleotide sequence from PH26N or PH99N (NLB18-PH26N or NLB18-PH99N). In one embodiment, an NLB18-PH26N nucleotide sequence comprises any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:64, wherein the polypeptide confers resistance to northern leaf blight. In some aspects, the NLB18-PH26N nucleotide sequence comprises SEQ ID NO:70. The NLB18-PH99N nucleotide sequence may comprise any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:62, wherein the polypeptide confers resistance to northern leaf blight. The methods may involve the use of Cas9 endonuclease and one or more guide RNAs. In one embodiment, at least two guide RNAs are used, wherein a first guide RNA comprises a variable targeting domain that is complementary to SEQ ID NO:30 [NLB18-TS1], and a second guide RNA comprises a variable targeting domain complementary to SEQ ID NO:32 [NLB18-TS4]. In another embodiment, a first guide RNA comprises a variable targeting domain complementary to SEQ ID NO:31 [NLB18-TS8], and a second guide RNA comprises a variable targeting domain complementary to SEQ ID NO:32 [NLB18-TS4].
Methods for obtaining a maize plant cell with an edited genomic locus comprising at least one nucleotide sequence that confers enhanced resistance to northern leaf blight are provided herein. The methods include: 1) introducing a double-strand break or site-specific modification at one or more target sites in a genomic locus in a maize plant cell; 2) introducing one or more nucleotide sequences encoding a polypeptide that confers enhanced resistance to northern leaf blight, wherein each nucleotide sequence is flanked by 300-500 contiguous nucleotides of nucleotide sequences 5′ or 3′ of the corresponding target sites; and 3) obtaining a maize plant cell having a genomic locus comprising one or more nucleotide sequences that confer enhanced resistance to northern leaf blight. The double-strand break or site-specific modification may be induced by a nuclease such as but not limited to a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the edited genomic locus comprising the at least one nucleotide sequence that confers enhanced resistance to northern leaf blight, and the maize plant may exhibit enhanced resistance to northern leaf blight.
In some aspects, the one or more nucleotide sequences in the edited plant cell include any of the following: Ht1-PH4GP (SEQ ID NO: 51), NLB18-PH26N (SEQ ID NO: 63), and NLB18-PH99N (SEQ ID NO: 61). In other aspects, the genomic locus is CTL1. The Ht1-PH4GP nucleotide sequence may comprise SEQ ID NO:59 or any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:52, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant. In some aspects, the Ht1-PH4GP nucleotide sequence comprises SEQ ID NO:65. The NLB18-PH26N nucleotide sequence may comprise any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:64, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant. In some aspects, the NLB18-PH26N nucleotide sequence comprises SEQ ID NO:70. The NLB18-PH99N nucleotide sequence may comprise any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:62, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant.
In still other aspects, a nucleotide sequence encoding NLB18-PH26N is targeted to TS8 of CTL1; a nucleotide sequence encoding Ht1-PH4GP is targeted to TS10 of CTL1; and/or a nucleotide sequence encoding NLB18-PH26N is targeted to TS45 of CTL1.
In one aspect, a method to edit a plant cell comprises using a Cas9 endonuclease as the DSB-inducing agent, and one or more guide RNAs to target the Cas9 to sites in the CTL1 locus. One guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:36 [CTL1-TS8]; one guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:37 [CTL1-TS45]; and one guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:38 [CTL1-TS10].
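The arrangement described above, in which each inserted nucleotide sequence is flanked by 300-500 contiguous nucleotides taken from either side of its target site, can be sketched as follows. This is a toy illustration only: the genome string is randomly generated, the payload is a placeholder, and the names `build_repair_template`, `cut_pos`, and `arm_len` are hypothetical.

```python
import random


def build_repair_template(genome: str, cut_pos: int, insert: str, arm_len: int = 400) -> str:
    """Flank `insert` with `arm_len` nucleotides from each side of `cut_pos`."""
    if not 300 <= arm_len <= 500:
        raise ValueError("arm length outside the 300-500 nucleotide range")
    if cut_pos - arm_len < 0 or cut_pos + arm_len > len(genome):
        raise ValueError("not enough flanking sequence for the requested arm length")
    left_arm = genome[cut_pos - arm_len:cut_pos]    # sequence 5' of the target site
    right_arm = genome[cut_pos:cut_pos + arm_len]   # sequence 3' of the target site
    return left_arm + insert + right_arm


# Hypothetical 2,000-nucleotide locus and a placeholder payload.
rng = random.Random(0)
toy_genome = "".join(rng.choice("ACGT") for _ in range(2000))
payload = "ATGGCCTAA"

template = build_repair_template(toy_genome, cut_pos=1000, insert=payload, arm_len=400)
```

Here the template is 809 nucleotides long: two 400-nucleotide arms around the 9-nucleotide placeholder.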
Maize plant cells produced by the methods presented herein are also provided, as are maize plants produced from the maize plant cells and seeds produced by the maize plants.
The guide polynucleotides comprising variable targeting domains complementary to target sites in the endogenous Ht1 encoding sequence, the endogenous NLB18 encoding sequence, or the CTL1 genomic locus are also provided herein. The guide polynucleotides may be RNA sequences, DNA sequences, or RNA-DNA combination sequences. For Ht1, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:1, SEQ ID NO:2, or SEQ ID NO:3. For NLB18, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:30, SEQ ID NO:31, or SEQ ID NO:32. For CTL1, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:36, SEQ ID NO:37, or SEQ ID NO:38.
SEQ ID NO:1 is the nucleotide sequence of the Ht1-TS2 target site.
SEQ ID NO:2 is the nucleotide sequence of the Ht1-TS4 target site.
SEQ ID NO:3 is the nucleotide sequence of the Ht1-ST1-TS1 target site.
SEQ ID NO:4 is the nucleotide sequence of the Cas9 gene.
SEQ ID NO:5 is the amino acid sequence of the SV40 monopartite amino terminal nuclear localization signal.
SEQ ID NO:6 is the nucleotide sequence of the U6 polymerase III promoter.
SEQ ID NO:7 is the nucleotide sequence of the DNA capable of expressing the Ht1-CR2 guide RNA.
SEQ ID NO:8 is the nucleotide sequence of the DNA capable of expressing the Ht1-CR4 guide RNA.
SEQ ID NO:9 is the nucleotide sequence of the DNA capable of expressing the Ht1-ST1-CR1 guide RNA.
SEQ ID NO:10 is the nucleotide sequence of the Ht1f3 forward primer.
SEQ ID NO:11 is the nucleotide sequence of the Ht1r4v2 reverse primer.
SEQ ID NO:12 is the nucleotide sequence of the secondary PCR reaction forward primer.
SEQ ID NO:13 is the nucleotide sequence of the secondary PCR reaction reverse primer.
SEQ ID NO:14 is the nucleotide sequence of the Ht1-TS6 target site.
SEQ ID NO:15 is the nucleotide sequence of the HT1-TS7 target site.
SEQ ID NO:16 is the nucleotide sequence of the HT1-TS9 target site.
SEQ ID NO:17 is the nucleotide sequence of the HT1-TS10 target site.
SEQ ID NO:18 is the nucleotide sequence of the DNA capable of expressing the HT1-CR6 guide RNA.
SEQ ID NO:19 is the nucleotide sequence of the DNA capable of expressing the HT1-CR9 guide RNA.
SEQ ID NO:20 is the nucleotide sequence of the DNA capable of expressing the HT1-CR7 guide RNA.
SEQ ID NO:21 is the nucleotide sequence of the DNA capable of expressing the HT1-CR10 guide RNA.
SEQ ID NO:22 is the nucleotide sequence of the Ht1HR1f1 forward primer.
SEQ ID NO:23 is the nucleotide sequence of the Ht1HR1r1 reverse primer.
SEQ ID NO:24 is the nucleotide sequence of the Ht1HR2f1 forward primer.
SEQ ID NO:25 is the nucleotide sequence of the Ht1HR2r1 reverse primer.
SEQ ID NO:26 is the nucleotide sequence of the hdr2b_f forward primer.
SEQ ID NO:27 is the nucleotide sequence of the hdr2b_r reverse primer.
SEQ ID NO:28 is the nucleotide sequence of the hdr2b_PV probe.
SEQ ID NO:29 is the nucleotide sequence of the hdr2b_PG probe.
SEQ ID NO:30 is the nucleotide sequence of the NLB18-TS1 target site.
SEQ ID NO:31 is the nucleotide sequence of the NLB18-TS8 target site.
SEQ ID NO:32 is the nucleotide sequence of the NLB18-TS4 target site.
SEQ ID NO:33 is the nucleotide sequence of the DNA capable of expressing the NLB18-CR1 guide RNA.
SEQ ID NO:34 is the nucleotide sequence of the DNA capable of expressing the NLB18-CR8 guide RNA.
SEQ ID NO:35 is the nucleotide sequence of the DNA capable of expressing the NLB18-CR4 guide RNA.
SEQ ID NO:36 is the nucleotide sequence of the CTL1-TS8 target site.
SEQ ID NO:37 is the nucleotide sequence of the CTL1-TS45 target site.
SEQ ID NO:38 is the nucleotide sequence of the CTL1-TS10 target site.
SEQ ID NO:39 is the nucleotide sequence of the 8HR1f1 forward primer.
SEQ ID NO:40 is the nucleotide sequence of the PH26NPr reverse primer.
SEQ ID NO:41 is the nucleotide sequence of the PH26NTf forward primer.
SEQ ID NO:42 is the nucleotide sequence of the 8HR2r1 reverse primer.
SEQ ID NO:43 is the nucleotide sequence of the 10HR1f forward primer.
SEQ ID NO:44 is the nucleotide sequence of the Ht1Pr reverse primer.
SEQ ID NO:45 is the nucleotide sequence of the Ht1Tf forward primer.
SEQ ID NO:46 is the nucleotide sequence of the 10HR2r reverse primer.
SEQ ID NO:47 is the nucleotide sequence of the 45hr1f1 forward primer.
SEQ ID NO:48 is the nucleotide sequence of the PH26NPr reverse primer.
SEQ ID NO:49 is the nucleotide sequence of the PH26NTf forward primer.
SEQ ID NO:50 is the nucleotide sequence of the 45hr2r1 reverse primer.
SEQ ID NO:51 is the nucleotide sequence of the Ht1 cDNA found in inbred line PH4GP.
SEQ ID NO:52 is the amino acid sequence of the polypeptide encoded by SEQ ID NO:51.
SEQ ID NO:53 is the nucleotide sequence of the Ht1 cDNA found in inbred line PH1W2.
SEQ ID NO:54 is the amino acid sequence of the polypeptide encoded by SEQ ID NO:53.
SEQ ID NO:55 is the nucleotide sequence of the Ht1 cDNA found in inbred line B73 and herein referred to as the “B73-high allele”.
SEQ ID NO:56 is the amino acid sequence of the polypeptide encoded by SEQ ID NO:55.
SEQ ID NO:57 is the nucleotide sequence of the Ht1 cDNA found in inbred line B73 and herein referred to as the “B73-low allele”.
SEQ ID NO:58 is the amino acid sequence of the polypeptide encoded by SEQ ID NO:57.
SEQ ID NO:59 is the nucleotide sequence of the Ht1 genomic DNA found in inbred line PH4GP.
SEQ ID NO:60 is the amino acid sequence of a region found in the Ht1 polypeptides of resistant alleles.
SEQ ID NO:61 is the NLB18 cDNA sequence from PH99N.
SEQ ID NO:62 is the amino acid sequence of the protein encoded by SEQ ID NO:61.
SEQ ID NO:63 is the NLB18 cDNA sequence from PH26N.
SEQ ID NO:64 is the amino acid sequence of the protein encoded by SEQ ID NO:63.
SEQ ID NO:65 is the nucleotide sequence of the ZM-HT1-PH4GP including the ZM-HT1-PH4GP promoter, exon 1, intron 1, and terminator.
SEQ ID NO:66 is the NLB18 nucleotide sequence from PH184C, including the 5′ of NLB18-CR8 through the 3′ of NLB18-CR4.
SEQ ID NO:67 is the homology arm sequence flanking the 5′ of NLB18-TS1 in PH184C.
SEQ ID NO:68 is the homology arm sequence flanking the 3′ of NLB18-TS4 in PH184C.
SEQ ID NO:69 is the homology arm sequence flanking the 5′ of NLB18-TS8 in PH184C.
SEQ ID NO:70 is the NLB18 nucleotide sequence from PH26N, including the PH26N NLB18 promoter, exon 1, intron 1, exon 2, intron 2, exon 3, and terminator.
SEQ ID NO:71 is the nucleotide sequence of a region of repetitive sequences in the Ht1 promoter of PH184C.
SEQ ID NO:72 is the nucleotide sequence of an expression cassette including the Zea mays ubiquitin promoter, the 5′ UTR of the ZM-ubiquitin gene, intron 1 of the ZM-ubiquitin gene, the SV40 nuclear localization signal, Cas9 exon 1 (ST1), the potato-LS1 intron, Cas9 exon 2 (ST1), the VirD2 endonuclease nuclear localization signal, and the pinII terminator.
SEQ ID NO:73 is the nucleotide sequence containing the Cas9 used in Example 4; SEQ ID NO:73 contains the Cas9 exon 1 (SP), the ST-LS1 intron 2, the Cas9 exon 2 (SP), and the VirD2 nuclear localization signal.
SEQ ID NO:74 is the nucleotide sequence of the DNA capable of expressing the ZM-U6:08CR1 guide RNA.
SEQ ID NO:75 is the nucleotide sequence of the DNA capable of expressing the ZM-U6:45CR1 guide RNA.
SEQ ID NO:76 is the nucleotide sequence of the DNA capable of expressing the ZM-U6:10CR3 guide RNA.
SEQ ID NO:77 is the nucleotide sequence of the 08CR1HR1-NLB18(PH26N) genomic sequence-8CR1HR2 repair template targeted to TS8 of CTL1.
SEQ ID NO:78 is the nucleotide sequence of the 45CR1HR1-NLB18(PH26N) genomic sequence-45CR1HR2 repair template targeted to TS45 of CTL1.
SEQ ID NO:79 is the nucleotide sequence of the 10CR3HR1-HT1 (PH4GP) genomic sequence-10CR3HR2 repair template targeted to TS10 of CTL1.
SEQ ID NO:80 is the amino acid sequence of the Agrobacterium tumefaciens bipartite VirD2 T-DNA border endonuclease carboxyl terminal nuclear localization signal.
SEQ ID NO:81 is the nucleotide sequence of the homology arm flanking the 5′ of HT1-TS6 in PH184C (Example 2).
SEQ ID NO:82 is the nucleotide sequence of the homology arm flanking the 3′ of HT1-TS9 in PH184C (Example 2).
SEQ ID NO:83 is the nucleotide sequence of the homology arm flanking the 5′ of HT1-TS7 in PH184C (Example 2).
SEQ ID NO:84 is the nucleotide sequence of the homology arm flanking the 3′ of HT1-TS10 in PH184C (Example 2).
SEQ ID NO:85 is the nucleotide sequence of the homology arm flanking the 5′ of CTL1-TS8 in PH184C (Example 4).
SEQ ID NO:86 is the nucleotide sequence of the homology arm flanking the 3′ of CTL1-TS8 in PH184C (Example 4).
SEQ ID NO:87 is the nucleotide sequence of the homology arm flanking the 5′ of CTL1-TS45 in PH184C (Example 4).
SEQ ID NO:88 is the nucleotide sequence of the homology arm flanking the 3′ of CTL1-TS45 in PH184C (Example 4).
SEQ ID NO:89 is the nucleotide sequence of the homology arm flanking the 5′ of CTL1-TS10 in PH184C (Example 4).
SEQ ID NO:90 is the nucleotide sequence of the homology arm flanking the 3′ of CTL1-TS10 in PH184C (Example 4).
It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. As used in this specification and the appended claims, terms in the singular and the singular forms “a”, “an” and “the”, for example, include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “plant”, “the plant” or “a plant” also includes a plurality of plants; also, depending on the context, use of the term “plant” can also include genetically similar or identical progeny of that plant; use of the term “a nucleic acid” optionally includes, as a practical matter, many copies of that nucleic acid molecule; similarly, the term “probe” optionally (and typically) encompasses many similar or identical probe molecules. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs unless clearly indicated otherwise.
Compositions and methods are presented herein to edit the maize genome to produce maize plants that have enhanced resistance to northern leaf blight.
The term “allele” refers to one of two or more different nucleotide sequences that occur at a specific locus.
“Exserohilum turcicum”, previously referred to as Helminthosporium turcicum, is the fungal pathogen that induces northern leaf blight infection. The fungal pathogen is also referred to herein as Exserohilum or Et.
“Disease resistance” (such as, for example, northern leaf blight resistance) is a characteristic of a plant, wherein the plant avoids, minimizes, or reduces the disease symptoms that are the outcome of plant-pathogen interactions, such as maize-Exserohilum turcicum interactions. That is, pathogens are prevented from causing plant diseases and the associated disease symptoms, or alternatively, the disease symptoms caused by the pathogen are minimized or lessened.
A “locus” is a position on a chromosome where a gene or marker is located.
“Resistance” is a relative term, indicating that the infected plant produces better plant health or yield of maize than another, similarly treated, more susceptible plant. That is, the conditions cause a reduced decrease in maize survival, growth, and/or yield in a tolerant maize plant, as compared to a susceptible maize plant. One of skill will appreciate that maize plant resistance to northern leaf blight, or the pathogen causing such, can represent a spectrum of more resistant or less resistant phenotypes, and can vary depending on the severity of the infection. However, by simple observation, one of skill can determine the relative resistance or susceptibility of different plants, plant lines or plant families to northern leaf blight, and furthermore, will also recognize the phenotypic gradations of “resistant”. For example, a 1 to 9 visual rating indicating the level of resistance to northern leaf blight can be used. A higher score indicates a higher resistance. Data should be collected only when sufficient selection pressure exists in the experiment measured. The terms “tolerance” and “resistance” are used interchangeably herein.
The resistance may be “newly conferred” or “enhanced”. “Newly conferred” or “enhanced” resistance refers to an increased level of resistance against a particular pathogen, a wide spectrum of pathogens, or an infection caused by the pathogen(s). An increased level of resistance against a particular fungal pathogen, such as Et, for example, constitutes “enhanced” or improved fungal resistance. The embodiments may enhance or improve fungal plant pathogen resistance.
In some embodiments, gene editing may be facilitated through the induction of a double-stranded break (a “DSB”) in a defined position in the genome near the desired alteration. DSBs can be induced using any DSB-inducing agent available, including, but not limited to, TALENs, meganucleases, zinc finger nucleases, Cas9-gRNA systems (based on bacterial CRISPR-Cas systems), and the like. In some embodiments, the introduction of a DSB can be combined with the introduction of a polynucleotide modification template.
A polynucleotide modification template may be introduced into a cell by any method known in the art, such as, but not limited to, transient introduction methods, transfection, electroporation, microinjection, particle mediated delivery, topical application, whiskers mediated delivery, delivery via cell-penetrating peptides, or mesoporous silica nanoparticle (MSN)-mediated direct delivery.
The polynucleotide modification template may be introduced into a cell as a single stranded polynucleotide molecule, a double stranded polynucleotide molecule, or as part of a circular DNA (vector DNA). The polynucleotide modification template may also be tethered to the guide RNA and/or the Cas endonuclease. Tethered DNAs can allow for co-localizing target and template DNA, useful in genome editing and targeted genome regulation, and can also be useful in targeting post-mitotic cells where function of the endogenous homologous recombination (HR) machinery is expected to be highly diminished (Mali et al. 2013 Nature Methods Vol. 10: 957-963). The polynucleotide modification template may be present transiently in the cell or it can be introduced via a viral replicon.
A “modified nucleotide” or “edited nucleotide” refers to a nucleotide sequence of interest that comprises at least one alteration when compared to its non-modified nucleotide sequence. Such “alterations” include, for example: (i) replacement of at least one nucleotide, (ii) a deletion of at least one nucleotide, (iii) an insertion of at least one nucleotide, or (iv) any combination of (i)-(iii). An “edited cell” or an “edited plant cell” refers to a cell containing at least one alteration in the genomic sequence when compared to a control cell or plant cell that does not include such alteration in the genomic sequence.
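The three kinds of alteration listed above can be made concrete with a small sketch operating on a toy sequence. The control sequence and the helper names `replace_at`, `delete_at`, `insert_at`, and `is_edited` are hypothetical and serve only to illustrate the definitions.

```python
def replace_at(seq: str, pos: int, new: str) -> str:
    """(i) Replace len(new) nucleotides starting at `pos` with `new`."""
    return seq[:pos] + new + seq[pos + len(new):]


def delete_at(seq: str, pos: int, length: int) -> str:
    """(ii) Delete `length` nucleotides starting at `pos`."""
    return seq[:pos] + seq[pos + length:]


def insert_at(seq: str, pos: int, new: str) -> str:
    """(iii) Insert `new` immediately before position `pos`."""
    return seq[:pos] + new + seq[pos:]


def is_edited(seq: str, control: str) -> bool:
    """A sequence counts as edited if it differs from the control at all."""
    return seq != control


control = "ACGTACGT"                     # hypothetical non-modified sequence
replaced = replace_at(control, 2, "T")   # G at index 2 becomes T
deleted = delete_at(control, 2, 2)       # removes "GT" at indices 2-3
inserted = insert_at(control, 4, "GG")   # adds "GG" between the two ACGT repeats
combined = insert_at(delete_at(control, 0, 1), 3, "A")  # (iv) a combination
```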
The term “polynucleotide modification template” or “modification template” as used herein refers to a polynucleotide that comprises at least one nucleotide modification when compared to the target nucleotide sequence to be edited. A nucleotide modification can be at least one nucleotide substitution, addition or deletion. Optionally, the polynucleotide modification template can further comprise homologous nucleotide sequences flanking the at least one nucleotide modification, wherein the flanking homologous nucleotide sequences provide sufficient homology to the desired nucleotide sequence to be edited.
The process for editing a genomic sequence combining DSBs and modification templates generally comprises: providing to a host cell a DSB-inducing agent, or a nucleic acid encoding a DSB-inducing agent, that recognizes a target sequence in the chromosomal sequence, and wherein the DSB-inducing agent is able to induce a DSB in the genomic sequence; and providing at least one polynucleotide modification template comprising at least one nucleotide alteration when compared to the nucleotide sequence to be edited. The endonuclease may be provided to a cell by any method known in the art, for example, but not limited to transient introduction methods, transfection, microinjection, and/or topical application or indirectly via recombination constructs. The endonuclease may be provided as a protein or as a guided polynucleotide complex directly to a cell or indirectly via recombination constructs. The endonuclease may be introduced into a cell transiently or can be incorporated into the genome of the host cell using any method known in the art. In the case of a CRISPR-Cas system, uptake of the endonuclease and/or the guided polynucleotide into the cell can be facilitated with a Cell Penetrating Peptide (CPP) as described in WO2016073433.
As used herein, a “genomic region” refers to a segment of a chromosome in the genome of a cell. In one embodiment, a genomic region includes a segment of a chromosome in the genome of a cell that is present on either side of the target site or, alternatively, also comprises a portion of the target site. The genomic region may comprise at least 5-10, 5-15, 5-20, 5-25, 5-30, 5-35, 5-40, 5-45, 5-50, 5-55, 5-60, 5-65, 5-70, 5-75, 5-80, 5-85, 5-90, 5-95, 5-100, 5-200, 5-300, 5-400, 5-500, 5-600, 5-700, 5-800, 5-900, 5-1000, 5-1100, 5-1200, 5-1300, 5-1400, 5-1500, 5-1600, 5-1700, 5-1800, 5-1900, 5-2000, 5-2100, 5-2200, 5-2300, 5-2400, 5-2500, 5-2600, 5-2700, 5-2800, 5-2900, 5-3000, 5-3100 or more bases such that the genomic region has sufficient homology to undergo homologous recombination with the corresponding region of homology.
TAL effector nucleases (TALEN) are a class of sequence-specific nucleases that can be used to make double-strand breaks at specific target sequences in the genome of a plant or other organism. (See Miller et al. (2011) Nature Biotechnology 29:143-148).
Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Endonucleases include restriction endonucleases, which cleave DNA at specific sites without damaging the bases, and meganucleases, also known as homing endonucleases (HEases), which, like restriction endonucleases, bind and cut at a specific recognition site; however, the recognition sites for meganucleases are typically longer, about 18 bp or more (patent application PCT/US12/30061, filed on Mar. 22, 2012). Meganucleases have been classified into four families based on conserved sequence motifs: the LAGLIDADG, GIY-YIG, H-N-H, and His-Cys box families. These motifs participate in the coordination of metal ions and hydrolysis of phosphodiester bonds. HEases are notable for their long recognition sites, and for tolerating some sequence polymorphisms in their DNA substrates. The naming convention for meganucleases is similar to the convention for other restriction endonucleases. Meganucleases are also characterized by the prefix F-, I-, or PI- for enzymes encoded by free-standing ORFs, introns, and inteins, respectively. One step in the recombination process involves polynucleotide cleavage at or near the recognition site. The cleaving activity can be used to produce a double-strand break. For reviews of site-specific recombinases and their recognition sites, see Sauer (1994) Curr Op Biotechnol 5:521-7; and Sadowski (1993) FASEB 7:760-7. In some examples the recombinase is from the Integrase or Resolvase families.
Zinc finger nucleases (ZFNs) are engineered double-strand break inducing agents comprised of a zinc finger DNA binding domain and a double-strand-break-inducing agent domain. Recognition site specificity is conferred by the zinc finger domain, which typically comprises two, three, or four zinc fingers, for example having a C2H2 structure; however, other zinc finger structures are known and have been engineered. Zinc finger domains are amenable for designing polypeptides which specifically bind a selected polynucleotide recognition sequence. ZFNs include an engineered DNA-binding zinc finger domain linked to a non-specific endonuclease domain, for example a nuclease domain from a Type IIs endonuclease such as FokI. Additional functionalities can be fused to the zinc-finger binding domain, including transcriptional activator domains, transcription repressor domains, and methylases. In some examples, dimerization of the nuclease domain is required for cleavage activity. Each zinc finger recognizes three consecutive base pairs in the target DNA. For example, a 3 finger domain recognizes a sequence of 9 contiguous nucleotides; with a dimerization requirement of the nuclease, two sets of zinc finger triplets are used to bind an 18 nucleotide recognition sequence.
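The arithmetic behind the recognition lengths given above can be written out as a short sketch. The function name `recognition_length` and its parameters are hypothetical; the sketch counts only the nucleotides contacted by the fingers and ignores any spacer between the two half-sites of a dimer.

```python
BP_PER_FINGER = 3  # each zinc finger recognizes three consecutive base pairs


def recognition_length(fingers_per_monomer: int, dimer: bool = True) -> int:
    """Nucleotides recognized by one zinc finger array, or by a pair when dimerized."""
    if fingers_per_monomer < 1:
        raise ValueError("at least one zinc finger is required")
    monomers = 2 if dimer else 1
    return BP_PER_FINGER * fingers_per_monomer * monomers


single_array = recognition_length(3, dimer=False)  # one 3 finger domain
paired_arrays = recognition_length(3, dimer=True)  # two sets of zinc finger triplets
```

With three fingers per array, a single array covers 9 nucleotides and a dimerized pair covers 18, matching the example in the text.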
Genome editing using DSB-inducing agents, such as Cas9-gRNA complexes, has been described, for example in U.S. Patent Application US 2015-0082478 A1, WO2015/026886 A1, WO2016007347, and WO2016025131, all of which are incorporated by reference herein.
The term “Cas gene” herein refers to a gene that is generally coupled, associated or close to, or in the vicinity of flanking CRISPR loci in bacterial systems. The terms “Cas gene” and “CRISPR-associated (Cas) gene” are used interchangeably herein. The term “Cas endonuclease” herein refers to a protein, or complex of proteins, encoded by a Cas gene. A Cas endonuclease as disclosed herein, when in complex with a suitable polynucleotide component, is capable of recognizing, binding to, and optionally nicking or cleaving all or part of a specific DNA target sequence. A Cas endonuclease as described herein comprises one or more nuclease domains. Cas endonucleases of the disclosure include those having an HNH or HNH-like nuclease domain and/or a RuvC or RuvC-like nuclease domain. A Cas endonuclease of the disclosure may include a Cas9 protein, a Cpf1 protein, a C2c1 protein, a C2c2 protein, a C2c3 protein, Cas3, Cas5, Cas7, Cas8, Cas10, or complexes of these.
The terms “guide polynucleotide/Cas endonuclease complex”, “guide polynucleotide/Cas endonuclease system”, “guide polynucleotide/Cas complex”, “guide polynucleotide/Cas system”, and “guided Cas system” are used interchangeably herein and refer to at least one guide polynucleotide and at least one Cas endonuclease that are capable of forming a complex, wherein said guide polynucleotide/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site. A guide polynucleotide/Cas endonuclease complex herein can comprise Cas protein(s) and suitable polynucleotide component(s) of any of the four known CRISPR systems (Horvath and Barrangou, 2010, Science 327:167-170) such as a type I, II, or III CRISPR system. A Cas endonuclease unwinds the DNA duplex at the target sequence and optionally cleaves at least one DNA strand, as mediated by recognition of the target sequence by a polynucleotide (such as, but not limited to, a crRNA or guide RNA) that is in complex with the Cas protein. Such recognition and cutting of a target sequence by a Cas endonuclease typically occurs if the correct protospacer-adjacent motif (PAM) is located at or adjacent to the 3′ end of the DNA target sequence. Alternatively, a Cas protein herein may lack DNA cleavage or nicking activity, but can still specifically bind to a DNA target sequence when complexed with a suitable RNA component. (See also U.S. Patent Application US 2015-0082478 A1, and US 2015-0059010 A1, both hereby incorporated in their entirety by reference).
A guide polynucleotide/Cas endonuclease complex can cleave one or both strands of a DNA target sequence. A guide polynucleotide/Cas endonuclease complex that can cleave both strands of a DNA target sequence typically comprises a Cas protein that has all of its endonuclease domains in a functional state (e.g., wild type endonuclease domains or variants thereof retaining some or all activity in each endonuclease domain). Thus, a wild type Cas protein (e.g., a Cas9 protein disclosed herein), or a variant thereof retaining some or all activity in each endonuclease domain of the Cas protein, is a suitable example of a Cas endonuclease that can cleave both strands of a DNA target sequence. A Cas9 protein comprising functional RuvC and HNH nuclease domains is an example of a Cas protein that can cleave both strands of a DNA target sequence. A guide polynucleotide/Cas endonuclease complex that can cleave one strand of a DNA target sequence can be characterized herein as having nickase activity (e.g., partial cleaving capability). A Cas nickase typically comprises one functional endonuclease domain that allows the Cas to cleave only one strand (i.e., make a nick) of a DNA target sequence. For example, a Cas9 nickase may comprise (i) a mutant, dysfunctional RuvC domain and (ii) a functional HNH domain (e.g., wild type HNH domain). As another example, a Cas9 nickase may comprise (i) a functional RuvC domain (e.g., wild type RuvC domain) and (ii) a mutant, dysfunctional HNH domain. Non-limiting examples of Cas9 nickases suitable for use herein are disclosed in U.S. Patent Appl. Publ. No. 2014/0189896, which is incorporated herein by reference.
A pair of Cas9 nickases may be used to increase the specificity of DNA targeting. In general, this can be done by providing two Cas9 nickases that, by virtue of being associated with RNA components with different guide sequences, target and nick nearby DNA sequences on opposite strands in the region for desired targeting. Such nearby cleavage of each DNA strand creates a double strand break (i.e., a DSB with single-stranded overhangs), which is then recognized as a substrate for non-homologous-end-joining, NHEJ (prone to imperfect repair leading to mutations) or homologous recombination, HR. Each nick in these embodiments can be at least about 5, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 (or any integer between 5 and 100) bases apart from each other, for example. One or two Cas9 nickase proteins herein can be used in a Cas9 nickase pair. For example, a Cas9 nickase with a mutant RuvC domain, but functioning HNH domain (i.e., Cas9 HNH+/RuvC−), could be used (e.g., Streptococcus pyogenes Cas9 HNH+/RuvC−). Each Cas9 nickase (e.g., Cas9 HNH+/RuvC−) would be directed to specific DNA sites nearby each other (up to 100 base pairs apart) by using suitable RNA components herein with guide RNA sequences targeting each nickase to each specific DNA site.
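The geometric condition for a nickase pair described above (nicks on opposite strands, separated by a bounded number of bases) can be sketched as a simple check. The function name, the (position, strand) representation, and the default bounds of 5 and 100 bases are illustrative assumptions drawn from the example values in the text, not a prescribed design rule.

```python
def is_offset_nick_pair(nick_a, nick_b, min_offset=5, max_offset=100):
    """Return True if two nicks could act together as a paired-nickase break.

    Each nick is a (position, strand) tuple, where strand is '+' or '-'.
    The nicks must lie on opposite strands and be separated by a distance
    between min_offset and max_offset bases (inclusive).
    """
    (pos_a, strand_a), (pos_b, strand_b) = nick_a, nick_b
    valid_strands = {"+", "-"}
    if strand_a not in valid_strands or strand_b not in valid_strands:
        raise ValueError("strand must be '+' or '-'")
    on_opposite_strands = strand_a != strand_b
    offset = abs(pos_a - pos_b)
    return on_opposite_strands and min_offset <= offset <= max_offset


print(is_offset_nick_pair((100, "+"), (140, "-")))  # True
print(is_offset_nick_pair((100, "+"), (140, "+")))  # False: same strand
```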
A Cas protein may be part of a fusion protein comprising one or more heterologous protein domains (e.g., 1, 2, 3, or more domains in addition to the Cas protein). Such a fusion protein may comprise any additional protein sequence, and optionally a linker sequence between any two domains, such as between Cas and a first heterologous domain. Examples of protein domains that may be fused to a Cas protein herein include, without limitation, epitope tags (e.g., histidine [His], V5, FLAG, influenza hemagglutinin [HA], myc, VSV-G, thioredoxin [Trx]), reporters (e.g., glutathione-S-transferase [GST], horseradish peroxidase [HRP], chloramphenicol acetyltransferase [CAT], beta-galactosidase, beta-glucuronidase [GUS], luciferase, green fluorescent protein [GFP], HcRed, DsRed, cyan fluorescent protein [CFP], yellow fluorescent protein [YFP], blue fluorescent protein [BFP]), and domains having one or more of the following activities: methylase activity, demethylase activity, transcription activation activity (e.g., VP16 or VP64), transcription repression activity, transcription release factor activity, histone modification activity, RNA cleavage activity and nucleic acid binding activity. A Cas protein can also be in fusion with a protein that binds DNA molecules or other molecules, such as maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD), GAL4A DNA binding domain, and herpes simplex virus (HSV) VP16. See PCT patent applications PCT/US16/32073, filed May 12, 2016 and PCT/US16/32028 filed May 12, 2016 (both applications incorporated herein by reference) for more examples of Cas proteins.
A guide polynucleotide/Cas endonuclease complex in certain embodiments may bind to a DNA target site sequence, but does not cleave any strand at the target site sequence. Such a complex may comprise a Cas protein in which all of its nuclease domains are mutant, dysfunctional. For example, a Cas9 protein herein that can bind to a DNA target site sequence, but does not cleave any strand at the target site sequence, may comprise both a mutant, dysfunctional RuvC domain and a mutant, dysfunctional HNH domain. A Cas protein herein that binds, but does not cleave, a target DNA sequence can be used to modulate gene expression, for example, in which case the Cas protein could be fused with a transcription factor (or portion thereof) (e.g., a repressor or activator, such as any of those disclosed herein). In other aspects, an inactivated Cas protein may be fused with another protein having endonuclease activity, such as a Fok I endonuclease.
The Cas endonuclease gene herein may encode a Type II Cas9 endonuclease, such as but not limited to, Cas9 genes listed in SEQ ID NOs: 462, 474, 489, 494, 499, 505, and 518 of WO2007/025097, and incorporated herein by reference. In another embodiment, the Cas endonuclease gene is a microbe or optimized Cas9 endonuclease gene. The Cas endonuclease gene can be operably linked to a SV40 nuclear targeting signal upstream of the Cas codon region and a bipartite VirD2 nuclear localization signal (Tinland et al. (1992) Proc. Natl. Acad. Sci. USA 89:7442-6) downstream of the Cas codon region.
Other Cas endonuclease systems have been described in PCT patent applications PCT/US16/32073, and PCT/US16/32028, both applications incorporated herein by reference.
“Cas9” (formerly referred to as Cas5, Csn1, or Csx12) herein refers to a Cas endonuclease of a type II CRISPR system that forms a complex with a crNucleotide and a tracrNucleotide, or with a single guide polynucleotide, for specifically recognizing and cleaving all or part of a DNA target sequence. Cas9 protein comprises a RuvC nuclease domain and an HNH (H-N-H) nuclease domain, each of which can cleave a single DNA strand at a target sequence (the concerted action of both domains leads to DNA double-strand cleavage, whereas activity of one domain leads to a nick). In general, the RuvC domain comprises subdomains I, II and III, where subdomain I is located near the N-terminus of Cas9 and subdomains II and III are located in the middle of the protein, flanking the HNH domain (Hsu et al, Cell 157:1262-1278). A type II CRISPR system includes a DNA cleavage system utilizing a Cas9 endonuclease in complex with at least one polynucleotide component. For example, a Cas9 can be in complex with a CRISPR RNA (crRNA) and a trans-activating CRISPR RNA (tracrRNA). In another example, a Cas9 can be in complex with a single guide RNA.
A Cas protein herein such as a Cas9 can comprise a heterologous nuclear localization sequence (NLS). A heterologous NLS amino acid sequence herein may be of sufficient strength to drive accumulation of a Cas protein in a detectable amount in the nucleus of a yeast cell herein, for example. An NLS may comprise one (monopartite) or more (e.g., bipartite) short sequences (e.g., 2 to 20 residues) of basic, positively charged residues (e.g., lysine and/or arginine), and can be located anywhere in a Cas amino acid sequence but such that it is exposed on the protein surface. An NLS may be operably linked to the N-terminus or C-terminus of a Cas protein herein, for example. Two or more NLS sequences can be linked to a Cas protein, for example, such as on both the N- and C-termini of a Cas protein. Non-limiting examples of suitable NLS sequences herein include those disclosed in U.S. Pat. No. 7,309,576, which is incorporated herein by reference.
The Cas endonuclease can comprise a modified form of the Cas9 polypeptide. The modified form of the Cas9 polypeptide can include an amino acid change (e.g., deletion, insertion, or substitution) that reduces the naturally-occurring nuclease activity of the Cas9 protein. For example, in some instances, the modified form of the Cas9 protein has less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, or less than 1% of the nuclease activity of the corresponding wild-type Cas9 polypeptide (US patent application US20140068797 A1). In some cases, the modified form of the Cas9 polypeptide has no substantial nuclease activity and is referred to as catalytically “inactivated Cas9” or “deactivated Cas9 (dCas9).” Catalytically inactivated Cas9 variants include Cas9 variants that contain mutations in the HNH and RuvC nuclease domains. These catalytically inactivated Cas9 variants are capable of interacting with sgRNA and binding to the target site in vivo but cannot cleave either strand of the target DNA.
A catalytically inactive Cas9 can be fused to a heterologous sequence (US patent application US20140068797 A1). Suitable fusion partners include, but are not limited to, a polypeptide that provides an activity that indirectly increases transcription by acting directly on the target DNA or on a polypeptide (e.g., a histone or other DNA-binding protein) associated with the target DNA. Additional suitable fusion partners include, but are not limited to, a polypeptide that provides for methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, SUMOylating activity, deSUMOylating activity, ribosylation activity, deribosylation activity, myristoylation activity, or demyristoylation activity. Further suitable fusion partners include, but are not limited to, a polypeptide that directly provides for increased transcription of the target nucleic acid (e.g., a transcription activator or a fragment thereof, a protein or fragment thereof that recruits a transcription activator, a small molecule/drug-responsive transcription regulator, etc.). A catalytically inactive Cas9 can also be fused to a FokI nuclease to generate double strand breaks (Guilinger et al. Nature Biotechnology, volume 32, number 6, June 2014).
The terms “functional fragment”, “fragment that is functionally equivalent” and “functionally equivalent fragment” of a Cas endonuclease are used interchangeably herein, and refer to a portion or subsequence of the Cas endonuclease sequence of the present disclosure in which the ability to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break in) the target site is retained. The terms “functional variant”, “variant that is functionally equivalent” and “functionally equivalent variant” of a Cas endonuclease are used interchangeably herein, and refer to a variant of the Cas endonuclease of the present disclosure in which the ability to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break in) the target site is retained. Fragments and variants can be obtained via methods such as site-directed mutagenesis and synthetic construction.
Any guided endonuclease can be used in the methods disclosed herein. Such endonucleases include, but are not limited to, Cas9 and Cpf1 endonucleases. Many endonucleases have been described to date that can recognize specific PAM sequences (see, for example, Jinek et al. (2012) Science 337 p 816-821, PCT patent applications PCT/US16/32073 and PCT/US16/32028, and Zetsche B et al. 2015. Cell 163, 1013) and cleave the target DNA at specific positions. It is understood that, based on the methods and embodiments described herein utilizing a guided Cas system, one can now tailor these methods such that they can utilize any guided endonuclease system.
As used herein, the term “guide polynucleotide” relates to a polynucleotide sequence that can form a complex with a Cas endonuclease and enables the Cas endonuclease to recognize, bind to, and optionally cleave a DNA target site. The guide polynucleotide can be a single molecule or a double molecule. The guide polynucleotide sequence can be an RNA sequence, a DNA sequence, or a combination thereof (an RNA-DNA combination sequence). Optionally, the guide polynucleotide can comprise at least one nucleotide, phosphodiester bond or linkage modification such as, but not limited to, Locked Nucleic Acid (LNA), 5-methyl dC, 2,6-Diaminopurine, 2′-Fluoro A, 2′-Fluoro U, 2′-O-Methyl RNA, phosphorothioate bond, linkage to a cholesterol molecule, linkage to a polyethylene glycol molecule, linkage to a spacer 18 (hexaethylene glycol chain) molecule, or 5′ to 3′ covalent linkage resulting in circularization. A guide polynucleotide that solely comprises ribonucleic acids is also referred to as a “guide RNA” or “gRNA” (See also U.S. Patent Application US 2015-0082478 A1, and US 2015-0059010 A1, both hereby incorporated in their entirety by reference).
The guide polynucleotide can be a double molecule (also referred to as duplex guide polynucleotide) comprising a crNucleotide sequence and a tracrNucleotide sequence. The crNucleotide includes a first nucleotide sequence domain (referred to as Variable Targeting domain or VT domain) that can hybridize to a nucleotide sequence in a target DNA and a second nucleotide sequence (also referred to as a tracr mate sequence) that is part of a Cas endonuclease recognition (CER) domain. The tracr mate sequence can hybridize to a tracrNucleotide along a region of complementarity and together form the Cas endonuclease recognition domain or CER domain. The CER domain is capable of interacting with a Cas endonuclease polypeptide. The crNucleotide and the tracrNucleotide of the duplex guide polynucleotide can be RNA, DNA, and/or RNA-DNA combination sequences. In some embodiments, the crNucleotide molecule of the duplex guide polynucleotide is referred to as “crDNA” (when composed of a contiguous stretch of DNA nucleotides) or “crRNA” (when composed of a contiguous stretch of RNA nucleotides), or “crDNA-RNA” (when composed of a combination of DNA and RNA nucleotides). The crNucleotide can comprise a fragment of the crRNA naturally occurring in Bacteria and Archaea. The size of the fragment of the crRNA naturally occurring in Bacteria and Archaea that can be present in a crNucleotide disclosed herein can range from, but is not limited to, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more nucleotides. In some embodiments the tracrNucleotide is referred to as “tracrRNA” (when composed of a contiguous stretch of RNA nucleotides) or “tracrDNA” (when composed of a contiguous stretch of DNA nucleotides) or “tracrDNA-RNA” (when composed of a combination of DNA and RNA nucleotides). In one embodiment, the RNA that guides the RNA/Cas9 endonuclease complex is a duplexed RNA comprising a duplex crRNA-tracrRNA.
The tracrRNA (trans-activating CRISPR RNA) contains, in the 5′-to-3′ direction, (i) a sequence that anneals with the repeat region of CRISPR type II crRNA and (ii) a stem loop-containing portion (Deltcheva et al., Nature 471:602-607). The duplex guide polynucleotide can form a complex with a Cas endonuclease, wherein said guide polynucleotide/Cas endonuclease complex (also referred to as a guide polynucleotide/Cas endonuclease system) can direct the Cas endonuclease to a genomic target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the target site. (See also U.S. Patent Application US 2015-0082478 A1, published on Mar. 19, 2015 and US 2015-0059010 A1, both hereby incorporated in their entirety by reference.)
The single guide polynucleotide can form a complex with a Cas endonuclease, wherein said guide polynucleotide/Cas endonuclease complex (also referred to as a guide polynucleotide/Cas endonuclease system) can direct the Cas endonuclease to a genomic target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the target site. (See also U.S. Patent Application US 2015-0082478 A1, and US 2015-0059010 A1, both hereby incorporated in their entirety by reference.)
The terms “variable targeting domain” and “VT domain” are used interchangeably herein and include a nucleotide sequence that can hybridize (is complementary) to one strand (nucleotide sequence) of a double strand DNA target site. The percent complementarity between the first nucleotide sequence domain (VT domain) and the target sequence can be at least 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%. The variable targeting domain can be at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides in length. In some embodiments, the variable targeting domain comprises a contiguous stretch of 12 to 30 nucleotides. The variable targeting domain can be composed of a DNA sequence, an RNA sequence, a modified DNA sequence, a modified RNA sequence, or any combination thereof.
The terms “Cas endonuclease recognition domain” and “CER domain” (of a guide polynucleotide) are used interchangeably herein and include a nucleotide sequence that interacts with a Cas endonuclease polypeptide. A CER domain comprises a tracrNucleotide mate sequence followed by a tracrNucleotide sequence. The CER domain can be composed of a DNA sequence, an RNA sequence, a modified DNA sequence, a modified RNA sequence (see for example US 2015-0059010 A1, incorporated in its entirety by reference herein), or any combination thereof.
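The percent complementarity between a variable targeting domain and one strand of a target site can be illustrated with a minimal sketch. The function name is hypothetical. The sketch assumes that both sequences are written 5′ to 3′, have equal length, and pair only by Watson-Crick rules (G-U wobble pairs are not counted).

```python
# Watson-Crick partner in a DNA target for each guide base (DNA or RNA guide).
PARTNER_IN_DNA_TARGET = {"A": "T", "T": "A", "U": "A", "G": "C", "C": "G"}


def percent_complementarity(vt_domain: str, target_strand: str) -> float:
    """Percent of VT-domain positions that base-pair with the target strand.

    Both sequences are given 5'->3'. Hybridization is antiparallel, so the
    target strand is reversed before the position-by-position comparison.
    """
    vt = vt_domain.upper()
    target = target_strand.upper()
    if not vt or len(vt) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    paired = sum(
        1
        for guide_base, target_base in zip(vt, reversed(target))
        if PARTNER_IN_DNA_TARGET.get(guide_base) == target_base
    )
    return 100.0 * paired / len(vt)


print(percent_complementarity("GACU", "AGTC"))  # 100.0
print(percent_complementarity("GACU", "AGTA"))  # 75.0
```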
The terms “functional fragment”, “fragment that is functionally equivalent” and “functionally equivalent fragment” of a guide RNA, crRNA or tracrRNA are used interchangeably herein, and refer to a portion or subsequence of the guide RNA, crRNA or tracrRNA, respectively, of the present disclosure in which the ability to function as a guide RNA, crRNA or tracrRNA, respectively, is retained.
The terms “functional variant”, “variant that is functionally equivalent” and “functionally equivalent variant” of a guide RNA, crRNA or tracrRNA (respectively) are used interchangeably herein, and refer to a variant of the guide RNA, crRNA or tracrRNA, respectively, of the present disclosure in which the ability to function as a guide RNA, crRNA or tracrRNA, respectively, is retained.
The terms “single guide RNA” and “sgRNA” are used interchangeably herein and relate to a synthetic fusion of two RNA molecules, a crRNA (CRISPR RNA) comprising a variable targeting domain (linked to a tracr mate sequence that hybridizes to a tracrRNA), fused to a tracrRNA (trans-activating CRISPR RNA). The single guide RNA can comprise a crRNA or crRNA fragment and a tracrRNA or tracrRNA fragment of the type II CRISPR/Cas system that can form a complex with a type II Cas endonuclease, wherein said guide RNA/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site.
The terms “guide RNA/Cas endonuclease complex”, “guide RNA/Cas endonuclease system”, “guide RNA/Cas complex”, “guide RNA/Cas system”, “gRNA/Cas complex”, “gRNA/Cas system”, “RNA-guided endonuclease”, and “RGEN” are used interchangeably herein and refer to at least one RNA component and at least one Cas endonuclease that are capable of forming a complex, wherein said guide RNA/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site. A guide RNA/Cas endonuclease complex herein can comprise Cas protein(s) and suitable RNA component(s) of any of the four known CRISPR systems (Horvath and Barrangou, 2010, Science 327:167-170) such as a type I, II, or III CRISPR system. A guide RNA/Cas endonuclease complex can comprise a Type II Cas9 endonuclease and at least one RNA component (e.g., a crRNA and tracrRNA, or a gRNA). (See also U.S. Patent Application US 2015-0082478 A1, and US 2015-0059010 A1, both hereby incorporated in their entirety by reference).
The guide polynucleotide can be introduced into a cell transiently, as a single stranded polynucleotide or a double stranded polynucleotide, using any method known in the art such as, but not limited to, particle bombardment, Agrobacterium transformation or topical applications. The guide polynucleotide can also be introduced indirectly into a cell by introducing a recombinant DNA molecule (via methods such as, but not limited to, particle bombardment or Agrobacterium transformation) comprising a heterologous nucleic acid fragment encoding a guide polynucleotide, operably linked to a specific promoter that is capable of transcribing the guide RNA in said cell. The specific promoter can be, but is not limited to, an RNA polymerase III promoter, which allows for transcription of RNA with precisely defined, unmodified, 5′- and 3′-ends (DiCarlo et al., Nucleic Acids Res. 41: 4336-4343; Ma et al., Mol. Ther. Nucleic Acids 3: e161) as described in WO2016025131, incorporated herein in its entirety by reference.
The terms “target site”, “target sequence”, “target site sequence”, “target DNA”, “target locus”, “genomic target site”, “genomic target sequence”, “genomic target locus” and “protospacer” are used interchangeably herein and refer to a polynucleotide sequence including, but not limited to, a nucleotide sequence within a chromosome, an episome, or any other DNA molecule in the genome (including chromosomal, chloroplastic, mitochondrial DNA, plasmid DNA) of a cell, at which a guide polynucleotide/Cas endonuclease complex can recognize, bind to, and optionally nick or cleave. The target site can be an endogenous site in the genome of a cell, or alternatively, the target site can be heterologous to the cell and thereby not be naturally occurring in the genome of the cell, or the target site can be found in a heterologous genomic location compared to where it occurs in nature. The terms “endogenous target sequence” and “native target sequence” are used interchangeably herein to refer to a target sequence that is endogenous or native to the genome of a cell. Cells include, but are not limited to, human, non-human, animal, bacterial, fungal, insect, yeast, non-conventional yeast, and plant cells as well as plants and seeds produced by the methods described herein. The terms “artificial target site” and “artificial target sequence” are used interchangeably herein and refer to a target sequence that has been introduced into the genome of a cell. Such an artificial target sequence can be identical in sequence to an endogenous or native target sequence in the genome of a cell but be located in a different position (i.e., a non-endogenous or non-native position) in the genome of a cell.
An “altered target site”, “altered target sequence”, “modified target site”, “modified target sequence” are used interchangeably herein and refer to a target sequence as disclosed herein that comprises at least one alteration when compared to non-altered target sequence. Such “alterations” include, for example: (i) replacement of at least one nucleotide, (ii) a deletion of at least one nucleotide, (iii) an insertion of at least one nucleotide, or (iv) any combination of (i)-(iii).
The length of the target DNA sequence (target site) can vary, and includes, for example, target sites that are at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more nucleotides in length. It is further possible that the target site can be palindromic, that is, the sequence on one strand reads the same in the opposite direction on the complementary strand. The nick/cleavage site can be within the target sequence or the nick/cleavage site could be outside of the target sequence. In another variation, the cleavage could occur at nucleotide positions immediately opposite each other to produce a blunt end cut or, in other cases, the incisions could be staggered to produce single-stranded overhangs, also called “sticky ends”, which can be either 5′ overhangs, or 3′ overhangs. Active variants of genomic target sites can also be used. Such active variants can comprise at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to the given target site, wherein the active variants retain biological activity and hence are capable of being recognized and cleaved by a Cas endonuclease. Assays to measure the single or double-strand break of a target site by an endonuclease are known in the art and generally measure the overall activity and specificity of the agent on DNA substrates containing recognition sites.
A “protospacer adjacent motif” (PAM) herein refers to a short nucleotide sequence adjacent to a target sequence (protospacer) that is recognized (targeted) by a guide polynucleotide/Cas endonuclease system described herein. The Cas endonuclease may not successfully recognize a target DNA sequence if the target DNA sequence is not followed by a PAM sequence. The sequence and length of a PAM herein can differ depending on the Cas protein or Cas protein complex used. The PAM sequence can be of any length but is typically 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 nucleotides long. The terms “targeting”, “gene targeting” and “DNA targeting” are used interchangeably herein. DNA targeting herein may be the specific introduction of a knock-out, edit, or knock-in at a particular DNA sequence, such as in a chromosome or plasmid of a cell. In general, DNA targeting may be performed herein by cleaving one or both strands at a specific DNA sequence in a cell with an endonuclease associated with a suitable polynucleotide component. Such DNA cleavage, if a double-strand break (DSB), can prompt NHEJ or HDR processes which can lead to modifications at the target site.
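The requirement that a target sequence be followed by a PAM can be illustrated with a minimal sketch that scans one DNA strand for candidate protospacers. The function name is hypothetical. The 20-nucleotide protospacer length and the NGG PAM pattern are assumptions chosen for illustration (they are commonly associated with Streptococcus pyogenes Cas9) and, as noted above, the PAM sequence and length differ depending on the Cas protein used. Only the strand supplied is scanned.

```python
import re


def find_target_sites(seq, protospacer_len=20, pam_pattern="[ACGT]GG"):
    """Return (start, protospacer, pam) for each site followed by a PAM.

    seq: DNA sequence written 5'->3'.
    protospacer_len: assumed target length (illustrative default of 20).
    pam_pattern: regular expression for the PAM (illustrative default NGG).
    """
    seq = seq.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(
        rf"(?=([ACGT]{{{protospacer_len}}})({pam_pattern}))"
    )
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Short protospacer length used here only to keep the example readable.
print(find_target_sites("ACGTACGTAGGTT", protospacer_len=8))
# [(0, 'ACGTACGT', 'AGG')]
```

A fuller search would also scan the reverse complement, since a target site can lie on either strand.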
A targeting method herein may be performed in such a way that two or more DNA target sites are targeted in the method, for example. Such a method can optionally be characterized as a multiplex method. Two, three, four, five, six, seven, eight, nine, ten, or more target sites may be targeted at the same time in certain embodiments. A multiplex method is typically performed by a targeting method herein in which multiple different RNA components are provided, each designed to guide a guide polynucleotide/Cas endonuclease complex to a unique DNA target site.
The terms “knock-out”, “gene knock-out” and “genetic knock-out” are used interchangeably herein. A knock-out as used herein represents a DNA sequence of a cell that has been rendered partially or completely inoperative by targeting with a Cas protein; such a DNA sequence prior to knock-out could have encoded an amino acid sequence, or could have had a regulatory function (e.g., promoter), for example. A knock-out may be produced by an indel (insertion or deletion of nucleotide bases in a target DNA sequence through NHEJ), or by specific removal of sequence that reduces or completely destroys the function of sequence at or near the targeting site.
The guide polynucleotide/Cas endonuclease system can be used in combination with a co-delivered polynucleotide modification template to allow for editing (modification) of a genomic nucleotide sequence of interest. (See also U.S. Patent Application US 2015-0082478 A1, and WO2015/026886 A1, both hereby incorporated in their entirety by reference.)
The terms “knock-in”, “gene knock-in”, “gene insertion” and “genetic knock-in” are used interchangeably herein. A knock-in represents the replacement or insertion of a DNA sequence at a specific DNA sequence in a cell by targeting with a Cas protein (by HR, wherein a suitable donor DNA polynucleotide is also used). Examples of knock-ins include, but are not limited to, a specific insertion of a heterologous amino acid coding sequence in a coding region of a gene, or a specific insertion of a transcriptional regulatory element in a genetic locus.
Various methods and compositions can be employed to obtain a cell or organism having a polynucleotide of interest inserted in a target site for a Cas endonuclease. Such methods can employ homologous recombination to provide integration of the polynucleotide of interest at the target site. In one method provided, a polynucleotide of interest is provided to the organism cell in a donor DNA construct. As used herein, “donor DNA” is a DNA construct that comprises a polynucleotide of interest to be inserted into the target site of a Cas endonuclease. The donor DNA construct may further comprise a first and a second region of homology that flank the polynucleotide of interest. The first and second regions of homology of the donor DNA share homology to a first and a second genomic region, respectively, present in or flanking the target site of the cell or organism genome. By “homology” is meant DNA sequences that are similar. For example, a “region of homology to a genomic region” that is found on the donor DNA is a region of DNA that has a similar sequence to a given “genomic region” in the cell or organism genome. A region of homology can be of any length that is sufficient to promote homologous recombination at the cleaved target site. For example, the region of homology can comprise at least 5-10, 5-15, 5-20, 5-25, 5-30, 5-35, 5-40, 5-45, 5-50, 5-55, 5-60, 5-65, 5-70, 5-75, 5-80, 5-85, 5-90, 5-95, 5-100, 5-200, 5-300, 5-400, 5-500, 5-600, 5-700, 5-800, 5-900, 5-1000, 5-1100, 5-1200, 5-1300, 5-1400, 5-1500, 5-1600, 5-1700, 5-1800, 5-1900, 5-2000, 5-2100, 5-2200, 5-2300, 5-2400, 5-2500, 5-2600, 5-2700, 5-2800, 5-2900, 5-3000, 5-3100 or more bases in length such that the region of homology has sufficient homology to undergo homologous recombination with the corresponding genomic region. “Sufficient homology” indicates that two polynucleotide sequences have sufficient structural similarity to act as substrates for a homologous recombination reaction.
The structural similarity includes overall length of each polynucleotide fragment, as well as the sequence similarity of the polynucleotides. Sequence similarity can be described by the percent sequence identity over the whole length of the sequences, and/or by conserved regions comprising localized similarities such as contiguous nucleotides having 100% sequence identity, and percent sequence identity over a portion of the length of the sequences.
“Percent (%) sequence identity” with respect to a reference sequence (subject) is determined as the percentage of amino acid residues or nucleotides in a candidate sequence (query) that are identical with the respective amino acid residues or nucleotides in the reference sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any amino acid conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2. Those skilled in the art can determine appropriate parameters for aligning sequences, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. To determine the percent identity of two amino acid sequences or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (e.g., percent identity of query sequence=number of identical positions between query and subject sequences/total number of positions of query sequence (e.g., overlapping positions)×100).
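The percent identity formula above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (for example by BLAST) and are supplied as equal-length strings with gaps written as `-`; the function name `percent_identity` and the choice of the non-gap query length as the denominator are assumptions made for illustration, not part of the disclosure.

```python
def percent_identity(query_aln: str, subject_aln: str) -> float:
    """Percent identity of a query against a subject, given a finished alignment.

    Both arguments are aligned strings of equal length, with '-' marking gaps.
    Identical positions are counted column by column; the denominator is the
    number of query positions (non-gap characters in the query), which is one
    reading of "total number of positions of query sequence" in the text.
    """
    if len(query_aln) != len(subject_aln):
        raise ValueError("aligned sequences must have equal length")
    # Count columns where both sequences carry the same residue (gaps never match).
    identical = sum(
        1
        for q, s in zip(query_aln.upper(), subject_aln.upper())
        if q == s and q != "-"
    )
    query_positions = sum(1 for q in query_aln if q != "-")
    if query_positions == 0:
        raise ValueError("query contains no residues")
    return 100.0 * identical / query_positions


if __name__ == "__main__":
    # Hypothetical toy alignment, not taken from the sequence listing.
    print(percent_identity("ACGT-ACGT", "ACGTTACGA"))
```

A different denominator (for example, the full alignment length including gap columns) would give a different value, so the convention should be stated whenever a percent identity threshold is applied.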
The amount of homology or sequence identity shared by a target and a donor polynucleotide can vary and includes total lengths and/or regions having unit integral values in the ranges of about 1-20 bp, 20-50 bp, 50-100 bp, 75-150 bp, 100-250 bp, 150-300 bp, 200-400 bp, 250-500 bp, 300-600 bp, 350-750 bp, 400-800 bp, 450-900 bp, 500-1000 bp, 600-1250 bp, 700-1500 bp, 800-1750 bp, 900-2000 bp, 1-2.5 kb, 1.5-3 kb, 2-4 kb, 2.5-5 kb, 3-6 kb, 3.5-7 kb, 4-8 kb, 5-10 kb, or up to and including the total length of the target site. These ranges include every integer within the range, for example, the range of 1-20 bp includes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 and 20 bps. The amount of homology can also be described by percent sequence identity over the full aligned length of the two polynucleotides which includes percent sequence identity of about at least 50%, 55%, 60%, 65%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%. Sufficient homology includes any combination of polynucleotide length, global percent sequence identity, and optionally conserved regions of contiguous nucleotides or local percent sequence identity, for example sufficient homology can be described as a region of 75-150 bp having at least 80% sequence identity to a region of the target locus. Sufficient homology can also be described by the predicted ability of two polynucleotides to specifically hybridize under high stringency conditions, see, for example, Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual, (Cold Spring Harbor Laboratory Press, NY); Current Protocols in Molecular Biology, Ausubel et al., Eds (1994) Current Protocols, (Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.); and, Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes, (Elsevier, New York).
The structural similarity between a given genomic region and the corresponding region of homology found on the donor DNA can be any degree of sequence identity that allows for homologous recombination to occur. For example, the amount of homology or sequence identity shared by the “region of homology” of the donor DNA and the “genomic region” of the organism genome can be at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity, such that the sequences undergo homologous recombination.
The region of homology on the donor DNA can have homology to any sequence flanking the target site. While in some embodiments the regions of homology share significant sequence homology to the genomic sequence immediately flanking the target site, it is recognized that the regions of homology can be designed to have sufficient homology to regions that may be further 5′ or 3′ to the target site. In still other embodiments, the regions of homology can also have homology with a fragment of the target site along with downstream genomic regions. In one embodiment, the first region of homology further comprises a first fragment of the target site and the second region of homology comprises a second fragment of the target site, wherein the first and second fragments are dissimilar.
As used herein, “homologous recombination” includes the exchange of DNA fragments between two DNA molecules at the sites of homology. The frequency of homologous recombination is influenced by a number of factors. Different organisms vary with respect to the amount of homologous recombination and the relative proportion of homologous to non-homologous recombination. Generally, the length of the region of homology affects the frequency of homologous recombination events: the longer the region of homology, the greater the frequency. The length of the homology region needed to observe homologous recombination is also species-variable. In many cases, at least 5 kb of homology has been utilized, but homologous recombination has been observed with as little as 25-50 bp of homology. See, for example, Singer et al., (1982) Cell 31:25-33; Shen and Huang, (1986) Genetics 112:441-57; Watt et al., (1985) Proc. Natl. Acad. Sci. USA 82:4768-72, Sugawara and Haber, (1992) Mol Cell Biol 12:563-75, Rubnitz and Subramani, (1984) Mol Cell Biol 4:2253-8; Ayares et al., (1986) Proc. Natl. Acad. Sci. USA 83:5199-203; Liskay et al., (1987) Genetics 115:161-7.
Homology-directed repair (HDR) is a mechanism in cells to repair double-stranded and single stranded DNA breaks. Homology-directed repair includes homologous recombination (HR) and single-strand annealing (SSA) (Lieber. 2010 Annu. Rev. Biochem. 79:181-211). The most common form of HDR is called homologous recombination (HR), which has the longest sequence homology requirements between the donor and acceptor DNA. Other forms of HDR include single-stranded annealing (SSA) and breakage-induced replication, and these require shorter sequence homology relative to HR. Homology-directed repair at nicks (single-stranded breaks) can occur via a mechanism distinct from HDR at double-strand breaks (Davis and Maizels. (2014) PNAS (0027-8424), 111 (10), p. E924-E932).
Alteration of the genome of a plant cell, for example, through homologous recombination (HR), is a powerful tool for genetic engineering. Homologous recombination has been demonstrated in plants (Halfter et al., (1992) Mol Gen Genet 231:186-93) and insects (Dray and Gloor, 1997, Genetics 147:689-99). Homologous recombination has also been accomplished in other organisms. For example, at least 150-200 bp of homology was required for homologous recombination in the parasitic protozoan Leishmania (Papadopoulou and Dumas, (1997) Nucleic Acids Res 25:4278-86). In the filamentous fungus Aspergillus nidulans, gene replacement has been accomplished with as little as 50 bp flanking homology (Chaveroche et al., (2000) Nucleic Acids Res 28: e97). Targeted gene replacement has also been demonstrated in the ciliate Tetrahymena thermophila (Gaertig et al., (1994) Nucleic Acids Res 22:5391-8). In mammals, homologous recombination has been most successful in the mouse using pluripotent embryonic stem cell lines (ES) that can be grown in culture, transformed, selected and introduced into a mouse embryo (Watson et al., 1992, Recombinant DNA, 2nd Ed., (Scientific American Books distributed by WH Freeman & Co.)). Error-prone DNA repair mechanisms can produce mutations at double-strand break sites. The Non-Homologous-End-Joining (NHEJ) pathways are the most common repair mechanism to bring the broken ends together (Bleuyard et al., (2006) DNA Repair 5:1-12). The structural integrity of chromosomes is typically preserved by the repair, but deletions, insertions, or other rearrangements are possible.
The two ends of one double-strand break are the most prevalent substrates of NHEJ (Kirik et al., (2000) EMBO J 19:5562-6), however if two different double-strand breaks occur, the free ends from different breaks can be ligated and result in chromosomal deletions (Siebert and Puchta, (2002) Plant Cell 14:1121-31), or chromosomal translocations between different chromosomes (Pacher et al., (2007) Genetics 175:21-9).
The donor DNA may be introduced by any means known in the art. The donor DNA may be provided by any transformation method known in the art including, for example, Agrobacterium-mediated transformation or biolistic particle bombardment. The donor DNA may be present transiently in the cell or it could be introduced via a viral replicon. In the presence of the Cas endonuclease and the target site, the donor DNA is inserted into the transformed plant's genome.
Further uses for guide RNA/Cas endonuclease systems have been described (See U.S. Patent Application US 2015-0082478 A1, WO2015/026886 A1, US 2015-0059010 A1, U.S. application 62/023,246, and U.S. application 62/036,652, all of which are incorporated by reference herein) and include but are not limited to modifying or replacing nucleotide sequences of interest (such as regulatory elements), insertion of polynucleotides of interest, gene knock-out, gene knock-in, modification of splicing sites and/or introducing alternate splicing sites, modifications of nucleotide sequences encoding a protein of interest, amino acid and/or protein fusions, and gene silencing by expressing an inverted repeat into a gene of interest.
Mapping of a QTL associated with northern leaf blight resistance on chromosome 2 was described in U.S. Patent Application US2010095395. The Ht1 gene was cloned and identified as a putative CC-NB-LRR (coiled-coil, nucleotide-binding, leucine-rich repeat) gene (U.S. 62/242,691). Ht1 cDNA sequences from PH4GP and from PH1W2 (another source of a resistant allele of Ht1; U.S. Patent Application US2010095395) are represented by SEQ ID NOs:51 and 53, respectively, while the amino acid sequences of the encoded polypeptides are represented by SEQ ID NO:52 and 54 and are 99.6% identical. B73 (which has the susceptible allele) has two splicing variants, and the novel variant expresses at a much higher level (referred to herein as B73-high) than the known variant (referred to herein as B73-low). SEQ ID NO:55 is the cDNA sequence of the B73-high allele, while the amino acid sequence of the encoded polypeptide is represented by SEQ ID NO:56. SEQ ID NO:57 is the cDNA sequence of the B73-low allele, while the amino acid sequence of the encoded polypeptide is represented by SEQ ID NO:58. The genomic sequence of the PH4GP (resistant) allele is provided herein as SEQ ID NO:59. The CC and NB domains are highly similar between the susceptible allele (B73) and resistant alleles (from PH4GP and PH1W2), as shown in U.S. 62/242,691. However, B73 has a deletion in the LRR. The amino acid sequence of this region in the Ht1 resistant alleles is represented by SEQ ID NO:60.
The methods for obtaining a maize plant cell with a modified Ht1 nucleotide sequence include: introducing a double-strand break at one or more target sites in an endogenous HT1 encoding sequence in a maize plant cell and obtaining a maize plant cell having a modified Ht1 nucleotide sequence. The method may further comprise introducing an Ht1 substitution template in the maize plant cell, wherein said Ht1 substitution template comprises at least one nucleic acid alteration compared to the endogenous HT1 encoding sequence and wherein said Ht1 substitution template is incorporated into the endogenous HT1 encoding sequence. The double-strand break may be induced by a nuclease, including, but not limited to, a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the modified Ht1 nucleotide sequence, and the maize plant may exhibit enhanced resistance to northern leaf blight.
An “Ht1 nucleotide sequence” as presented herein can refer to the Ht1 promoter, exons, introns, and/or terminator sequences as a whole or in fragments.
The “endogenous HT1 encoding sequence” refers to the nucleotide sequence that is present in the unmodified maize plant cell and encodes the HT1 polypeptide.
An “Ht1 substitution template” is a polynucleotide modification template containing a favorable version of the Ht1 nucleotide sequence (i.e. one that confers enhanced resistance to northern leaf blight).
The maize plants exhibit enhanced resistance to northern leaf blight when compared to equivalent maize plants lacking the modified Ht1 nucleotide sequence. “Equivalent” means that the maize plants are genetically similar with the exception of the Ht1 sequence.
In some aspects, the modified Ht1 nucleotide sequence comprises a deletion in the promoter of the endogenous HT1 encoding sequence. This may involve the use of Cas9 endonuclease and one or more guide RNAs. If two guide RNAs are used, a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:1 [Ht1-TS2] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:2 [Ht1-TS4]; or a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:1 [Ht1-TS2] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:3 [Ht1-ST1-TS1].
In other aspects, an Ht1 substitution template is used, which comprises an Ht1 nucleotide sequence from PH4GP or fragment thereof, or an Ht1 nucleotide sequence that when introduced into the maize plant cell encodes a polypeptide with an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:52. This may involve the use of Cas9 endonuclease and one or more guide RNAs. If two guide RNAs are used, a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:14 [Ht1-TS6] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:16 [Ht1-TS9]; or a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:15 [Ht1-TS7] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:17 [Ht1-TS10].
Mapping of a QTL associated with northern leaf blight resistance on chromosome 8 was described in international patent application WO2011163590. Two protein kinase (PK)-like genes with highly conserved kinase catalytic domains were identified in close proximity and were referred to in international patent application WO2011163590 as NLB17 and NLB18. NLB18 was validated as the gene conferring enhanced resistance to northern leaf blight (unpublished). NLB18 cDNA sequences from PH26N and PH99N, the two resistant sources described in WO2011163590, are represented by SEQ ID NOs:61 and 63, respectively, while the amino acid sequences of the encoded polypeptides are represented by SEQ ID NO:62 and 64. SEQ ID NO:62 and SEQ ID NO:64 are 92.4% identical.
Methods for obtaining a maize plant cell with a modified NLB18 nucleotide sequence are provided herein. The methods include: introducing a double-strand break at one or more target sites in an endogenous NLB18 encoding sequence in a maize plant cell and obtaining a maize plant cell having a modified NLB18 nucleotide sequence. The method may further comprise introducing an NLB18 substitution template in the maize plant cell, wherein said NLB18 substitution template comprises at least one nucleic acid alteration compared to the endogenous NLB18 encoding sequence and wherein said NLB18 substitution template is incorporated into the endogenous NLB18 encoding sequence. The double-strand break may be induced by a nuclease such as but not limited to a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the modified NLB18 nucleotide sequence, and the maize plant may exhibit enhanced resistance to northern leaf blight.
An “NLB18 nucleotide sequence” as presented herein can refer to the NLB18 promoter, exons, introns, terminator sequences, and/or any other genomic nucleotide sequence located within the NLB18 genomic locus as a whole or in fragments.
An “endogenous NLB18 encoding sequence” refers to a nucleotide sequence that is present in the unmodified maize plant cell and encodes a NLB18 polypeptide.
An “NLB18 substitution template” is a polynucleotide modification template containing a favorable version of the NLB18 nucleotide sequence (i.e. one that confers enhanced resistance to northern leaf blight).
The maize plants exhibit enhanced resistance to northern leaf blight when compared to equivalent maize plants lacking the modified NLB18 nucleotide sequence. “Equivalent” means that the maize plants are genetically similar with the exception of the NLB18 sequence.
In some aspects, a modified NLB18 nucleotide sequence comprises a modification in the promoter of the endogenous NLB18 encoding sequence.
In other aspects, an NLB18 substitution template is used, which comprises an NLB18 nucleotide sequence from PH26N or PH99N, or an NLB18 nucleotide sequence that when introduced into the maize plant cell encodes a polypeptide with an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:62 or SEQ ID NO:64. In some aspects, the NLB18 substitution template comprises SEQ ID NO:70. In some embodiments, the use of an NLB18 substitution template may involve the use of Cas9 endonuclease and one or more guide RNAs. If two guide RNAs are used, a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:30 [NLB18-TS1] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:32 [NLB18-TS4]; or a first guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:31 [NLB18-TS8] and a second guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:32 [NLB18-TS4].
Polynucleotides of interest and/or traits can be stacked together in a complex trait locus as described in US 2013/0263324-A1 and in PCT/US13/22891, both applications hereby incorporated by reference.
Methods for obtaining a maize plant cell with a genomic locus comprising at least one nucleotide sequence that confers enhanced resistance to northern leaf blight are provided herein. The disclosed methods include introducing a double-strand break at one or more target sites in a genomic locus in a maize plant cell; introducing one or more nucleotide sequences that confer enhanced resistance to northern leaf blight, wherein each is flanked by 300-500 bp of nucleotide sequences 5′ or 3′ of the corresponding target sites; and obtaining a maize plant cell having a genomic locus comprising one or more nucleotide sequences that confer enhanced resistance to northern leaf blight. The double-strand break may be induced by a nuclease such as but not limited to a TALEN, a meganuclease, a zinc finger nuclease, or a CRISPR-associated nuclease. The method may further comprise growing a maize plant from the maize plant cell having the genomic locus comprising the at least one nucleotide sequence that confers enhanced resistance to northern leaf blight, and the maize plant may exhibit enhanced resistance to northern leaf blight.
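The flanking arrangement described above, in which each inserted sequence is flanked by 300-500 bp taken from either side of its target site, can be sketched as follows. This is an illustrative sketch only: the function name `build_donor`, the 0-based `cut_index` convention, and the toy sequences are assumptions, and real donor design would also account for the exact cleavage position, strand, and any target-site sequence that needs to be altered in the donor.

```python
def build_donor(genome: str, cut_index: int, insert: str, arm_len: int = 400) -> str:
    """Assemble a donor as 5' homology arm + insert + 3' homology arm.

    genome     -- genomic sequence around the target site, written 5'->3'
    cut_index  -- 0-based index of the first base 3' of the double-strand break
    insert     -- polynucleotide of interest to be placed at the break
    arm_len    -- length of each homology arm; limited here to the 300-500 bp
                  range given in the text
    """
    if not 300 <= arm_len <= 500:
        raise ValueError("arm_len outside the 300-500 bp range given in the text")
    if cut_index - arm_len < 0 or cut_index + arm_len > len(genome):
        raise ValueError("not enough flanking sequence for the requested arm length")
    left_arm = genome[cut_index - arm_len:cut_index]    # sequence 5' of the break
    right_arm = genome[cut_index:cut_index + arm_len]   # sequence 3' of the break
    return left_arm + insert + right_arm


if __name__ == "__main__":
    # Hypothetical toy locus: 350 A's followed by 350 C's, break between them.
    toy_genome = "A" * 350 + "C" * 350
    donor = build_donor(toy_genome, cut_index=350, insert="GGG", arm_len=300)
    print(len(donor))
```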
The maize plants exhibit enhanced resistance to northern leaf blight when compared to equivalent maize plants lacking the nucleotide sequences conferring enhanced resistance to northern leaf blight at the genomic locus of interest. “Equivalent” means that the maize plants are genetically similar with the exception of the genomic locus of interest.
In some aspects, the one or more nucleotide sequences that confer enhanced resistance to northern leaf blight include any of the following: Ht1-PH4GP, NLB18-PH26N, and NLB18-PH99N. The Ht1-PH4GP nucleotide sequence may comprise SEQ ID NO:59 or any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:52, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant. In some aspects, the Ht1-PH4GP nucleotide sequence is SEQ ID NO:65. The NLB18-PH26N nucleotide sequence may comprise any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:64, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant. In some aspects, the NLB18-PH26N nucleotide sequence is SEQ ID NO:70. The NLB18-PH99N nucleotide sequence may comprise any nucleotide sequence that encodes a polypeptide having an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:62, wherein said polypeptide confers enhanced resistance to northern leaf blight in a maize plant.
In other aspects, the genomic locus that confers enhanced resistance to northern leaf blight comprises CTL1. In still other aspects, a nucleotide sequence encoding NLB18-PH26N is targeted to TS8 of CTL1; a nucleotide sequence encoding Ht1-PH4GP is targeted to TS10 of CTL1; and/or a nucleotide sequence encoding NLB18-PH26N is targeted to TS45 of CTL1.
The guide polynucleotide/Cas9 endonuclease system as described herein provides for an efficient system to generate double strand breaks and allows for traits to be stacked in a complex trait locus. Thus, in one aspect, Cas9 endonuclease is used as the DSB-inducing agent, and one or more guide RNAs are used to target the Cas9 to sites in the CTL1 locus. One guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:36 [CTL1-TS8]; one guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:37 [CTL1-TS10], and one guide RNA may comprise a variable targeting domain that is complementary to SEQ ID NO:38 [CTL1-TS45].
The maize plants generated by the methods described herein may provide durable and broad spectrum resistance to northern leaf blight and may assist in breeding of northern leaf blight resistant maize plants. For instance, because the nucleotide sequences that confer enhanced resistance to northern leaf blight are in tight linkage with one another (at one locus), this reduces the number of specific loci that require trait introgression through backcrossing and minimizes linkage drag from non-elite resistant donors.
As used herein, “heterologous” in reference to a sequence is a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous polynucleotide is from a species different from the species from which the polynucleotide was derived, or, if from the same/analogous species, one or both are substantially modified from their original form and/or genomic locus, or the promoter is not the native promoter for the operably linked polynucleotide.
“Maize” refers to a plant of the Zea mays L. ssp. mays and is also known as “corn”. The use of “ZM” preceding an object described herein refers to the fact that the object is from Zea mays.
Maize plants, maize plant cells, maize plant parts and seeds, and maize grain having the modified Ht1 or NLB18 sequences disclosed herein are also provided.
As used herein, the term plant includes plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells that are intact in plants or parts of plants such as embryos, pollen, ovules, seeds, leaves, flowers, kernels, ears, cobs, husks, stalks, roots, root tips, anthers, and the like. Grain is intended to mean the mature seed produced by commercial growers for purposes other than growing or reproducing the species.
The guide polynucleotides comprising variable targeting domains complementary to target sites in the endogenous Ht1 encoding sequence, the endogenous NLB18 encoding sequence, or the CTL1 genomic locus are also provided herein. These guide polynucleotides may be RNA sequences, DNA sequences, or RNA-DNA combination sequences. For Ht1, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:1, SEQ ID NO:2, or SEQ ID NO:3. For NLB18, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:30, SEQ ID NO:31, or SEQ ID NO:32. For CTL1, the guide polynucleotides may have a variable targeting domain complementary to SEQ ID NO:36, SEQ ID NO:37, or SEQ ID NO:38.
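As a purely illustrative aid, the notion of an RNA variable targeting domain that is complementary to a DNA target strand can be sketched in code. The sequences of SEQ ID NOs:1-3, 30-32 and 36-38 are not reproduced here, and this passage does not state which strand of each target site appears in the sequence listing; the sketch therefore simply returns the RNA that base-pairs with whichever DNA strand is supplied. The function name `complementary_rna` and the example sequence are hypothetical.

```python
# Watson-Crick pairing from a DNA base to the RNA base that pairs with it.
_RNA_COMPLEMENT = {"A": "U", "C": "G", "G": "C", "T": "A"}


def complementary_rna(dna_5to3: str) -> str:
    """Return the RNA (written 5'->3') that base-pairs with a DNA strand (5'->3').

    The two strands pair antiparallel, so the DNA is read in reverse and each
    base is replaced by its RNA partner.
    """
    try:
        return "".join(_RNA_COMPLEMENT[base] for base in reversed(dna_5to3.upper()))
    except KeyError as exc:
        raise ValueError(f"unexpected base: {exc.args[0]}") from None


if __name__ == "__main__":
    # Hypothetical 20-nt strand, not a sequence from the listing.
    print(complementary_rna("GATTACAGATTACAGATTAC"))
```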
The following examples are offered to illustrate, but not to limit, the appended claims. It is understood that the examples and embodiments described herein are for illustrative purposes only and that persons skilled in the art will recognize various reagents or parameters that can be altered without departing from the spirit of the invention or the scope of the appended claims.
The gRNA/Cas9 site directed nuclease system, described in WO2015026885, WO2015026887, WO2015026883, and WO2015026886, was used to edit the Ht1 gene in maize (WO2017066597, which is incorporated by reference herein). The following pairs of target sites were used for deletion of the repetitive sequence in the Ht1 promoter region of PH184C (represented by SEQ ID NO:71): HT1-TS2 with HT1-TS4 and HT1-TS2 with HT1-ST1-TS1. The location of each target site in the Ht1 genomic sequence and the deletion result schematic drawing are shown in
A Cas9 gene from Streptococcus pyogenes M1 GAS (SF370) (SEQ ID NO:4) was maize codon optimized per standard techniques known in the art, and the potato ST-LS1 intron was introduced in order to eliminate its expression in E. coli and Agrobacterium. To facilitate nuclear localization of the Cas9 protein in maize cells, the Simian virus 40 (SV40) monopartite amino terminal nuclear localization signal (SEQ ID NO:5) was incorporated at the amino terminus of the Cas9 open reading frame. The maize optimized Cas9 gene was operably linked to a maize Ubiquitin promoter using standard molecular biological techniques. In addition to the amino terminal nuclear localization signal SV40, a C-terminal bipartite nuclear localization signal from Agrobacterium tumefaciens VirD2 endonuclease was fused at the end of exon 2. The resulting sequence is SEQ ID NO:72, which includes the Zea mays ubiquitin promoter, the 5′ UTR of the ZM-ubiquitin gene, intron 1 of the ZM-ubiquitin gene, the SV40 nuclear localization signal, Cas9 exon 1 (ST1), the potato-LS1 intron, Cas9 exon 2 (ST1), the VirD2 endonuclease nuclear localization signal, and the pinII terminator.
To direct the Cas9 nuclease to the designated genomic target sites (in Table 1), a maize U6 polymerase III promoter (SEQ ID NO:6; see WO2015026885, WO2015026887, WO2015026883, and WO2015026886) and its cognate U6 polymerase III termination sequences (TTTTTTTT) were used to direct initiation and termination of gRNA expression. Guide RNA variable targeting domains for HT1 gene editing are identified as HT1-CR2, HT1-CR4, and HT1-ST1-CR1, which correspond to the genomic target sites HT1-TS2, HT1-TS4, and HT1-ST1-TS1, respectively. DNA encoding each of the variable nucleotide targeting domains was cloned into a gRNA expression cassette through BsbI sites using double strand oligos. Each guide RNA expression cassette consists of the U6 polymerase III maize promoter operably linked to one of the DNA versions of the guide RNA (Table 2), and then the cognate U6 polymerase III termination sequence. The DNA version of guide RNA consists of the respective nucleotide variable targeting domain followed by a polynucleotide sequence capable of interacting with the double strand break inducing endonuclease. The guide RNA expression cassette for HT1-ST1-CR1 was constructed into the ST1 Cas9 expression cassette via standard procedures.
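The order of elements in each guide RNA expression cassette, as described above, can be summarized with a short sketch: promoter, then the DNA version of the guide RNA (variable targeting domain followed by the Cas-interacting sequence), then the termination sequence. Only the `TTTTTTTT` terminator is taken from the text; the function name `build_guide_cassette` and the placeholder promoter, targeting domain, and scaffold strings are hypothetical stand-ins for SEQ ID NO:6 and the Table 2 sequences, and the restriction-site cloning step is not modeled.

```python
U6_TERMINATOR = "TTTTTTTT"  # termination sequence quoted in the text


def build_guide_cassette(
    u6_promoter: str,
    targeting_domain: str,
    cas_binding_scaffold: str,
    terminator: str = U6_TERMINATOR,
) -> str:
    """Concatenate cassette elements in the order described in the text.

    promoter -> variable targeting domain -> Cas-interacting sequence -> terminator
    All inputs are DNA strings written 5'->3'.
    """
    parts = {
        "u6_promoter": u6_promoter,
        "targeting_domain": targeting_domain,
        "cas_binding_scaffold": cas_binding_scaffold,
        "terminator": terminator,
    }
    for name, seq in parts.items():
        # Reject empty elements and anything that is not plain A/C/G/T.
        if not seq or set(seq.upper()) - set("ACGT"):
            raise ValueError(f"{name} must be a non-empty A/C/G/T string")
    # Dict insertion order is preserved, so the join follows the cassette order.
    return "".join(seq.upper() for seq in parts.values())


if __name__ == "__main__":
    # Placeholder sequences for illustration only.
    print(build_guide_cassette("aacc", "GATTACA", "GTTTT"))
```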
Plasmids containing the Cas9 and guide RNA expression cassettes described above were co-bombarded with plasmids containing the transformation selectable marker NPTII and the transformation enhancing developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2)) and Wuschel (20151030-6752 USPSP) into elite maize lines' genomes. Transformation of maize immature embryos can be performed using any method known in the art or the method described below.
In one transformation method, ears are husked and surface sterilized in 30-50% Clorox bleach plus 0.5% Micro detergent for 10 minutes and then rinsed two times with sterile water. The immature embryos are isolated and placed embryo axis side down (scutellum side up), with 25 embryos per plate, on 13224E medium for 2-4 hours and then aligned within the 2.5-cm target zone in preparation for bombardment.
Plasmid DNA is adhered to 0.6 μm (average diameter) gold particles using a proprietary lipid-polymer mixture of TransIT®-2020 (Cat # MIR 5404, Mirus Bio LLC, Madison, WI 5371). A DNA solution is prepared using 1 μg of plasmid DNA and, optionally, other constructs are prepared for co-bombardment using 10 ng (0.5 μl) of each plasmid. To the pre-mixed DNA, 50 μl of prepared gold particles (30 mg/ml) and 1 μl TransIT®-2020 are added and mixed carefully. The final mixture is allowed to incubate under constant vortexing at low speed for 10 minutes. After the precipitation period, the tubes are centrifuged briefly, and liquid is removed. Gold particles are pelleted in a microfuge at 10,000 rpm for 1 min, and aqueous supernatant is removed. 120 μl of 100% EtOH is added, and the particles are resuspended by brief sonication. Then, 10 μl is spotted onto the center of each macrocarrier and allowed to dry about 2 minutes before bombardment, with a total of ten aliquots taken from each tube of prepared particles/DNA.
The sample plates are bombarded with a Biolistic PDS-1000/He (Bio-Rad). Embryos are 6 cm from the macrocarrier, with a gap of ⅛th of an inch between the 200 psi rupture disc and the macrocarrier. All samples receive a single shot.
Following bombardment, the embryos are incubated on the bombardment plate for ~20 hours then transferred to 13266L (rest/induction medium) for 7-9 days at temperatures ranging from 26-30° C. Embryos are then transferred to the maturation media 289H for ~21 days. Mature somatic embryos are then transferred to germination media 272G and moved to the light. In about 1 to 2 weeks plantlets containing viable shoots and roots are sampled for analysis and sent to the greenhouse where they are transferred to flats (equivalent to a 2.5″ pot) containing potting soil. After 1-2 weeks, the plants are transferred to Classic 600 pots (1.6 gallon) and grown to maturity.
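The quantities in the particle preparation above imply a per-shot loading that can be worked out directly. The sketch below is a back-of-the-envelope calculation under stated assumptions: the suspension is uniform, no gold is lost during pelleting, ethanol evaporation is ignored, and all of the 1 μg of plasmid DNA is bound to the gold (so the DNA figure is an upper bound). The function name `per_shot_amounts` is hypothetical.

```python
def per_shot_amounts(
    gold_ul: float = 50.0,          # volume of prepared gold particles added
    gold_mg_per_ml: float = 30.0,   # concentration of the gold suspension
    plasmid_ug: float = 1.0,        # plasmid DNA in the DNA solution
    resuspension_ul: float = 120.0, # 100% EtOH used to resuspend the pellet
    aliquot_ul: float = 10.0,       # volume spotted on each macrocarrier
    n_aliquots: int = 10,           # aliquots taken from each tube
) -> dict:
    """Estimate gold and DNA delivered per macrocarrier from the stated volumes."""
    used_ul = aliquot_ul * n_aliquots
    if used_ul > resuspension_ul:
        raise ValueError("aliquots exceed the resuspension volume")
    total_gold_mg = gold_ul / 1000.0 * gold_mg_per_ml
    fraction_per_shot = aliquot_ul / resuspension_ul
    return {
        "gold_mg_per_shot": total_gold_mg * fraction_per_shot,
        # Upper bound: assumes every microgram of DNA adheres to the gold.
        "dna_ug_per_shot_upper_bound": plasmid_ug * fraction_per_shot,
        "ethanol_left_over_ul": resuspension_ul - used_ul,
    }


if __name__ == "__main__":
    for key, value in per_shot_amounts().items():
        print(f"{key}: {value:.4f}")
```

With the default values from the text, each 10 μl aliquot carries one twelfth of the tube contents, and ten aliquots leave a nominal 20 μl of suspension unused.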
Bombardment medium (13224E) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000X SIGMA-1511), 0.5 mg/l thiamine HCl, 190.0 g/l sucrose, 1.0 mg/l 2,4-D, and 2.88 g/l L-proline (brought to volume with D-I H2O following adjustment to pH 5.8 with KOH); 6.3 g/l Sigma agar (added after bringing to volume with D-I H2O); and 8.5 mg/l silver nitrate (added after sterilizing the medium and cooling to room temperature).
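As a convenience, the per-litre amounts listed for 13224E can be scaled to an arbitrary batch volume. The following is a minimal sketch only; the dictionary layout and the function name `scale_recipe` are hypothetical, and the per-litre values are copied from the paragraph above.

```python
# Scale the per-litre 13224E recipe (values from the text) to a batch volume.
# The data layout and function name are illustrative assumptions.

RECIPE_13224E = {
    "N6 basal salts": (4.0, "g"),
    "Eriksson's Vitamin Mix (1000X)": (1.0, "ml"),
    "thiamine HCl": (0.5, "mg"),
    "sucrose": (190.0, "g"),
    "2,4-D": (1.0, "mg"),
    "L-proline": (2.88, "g"),
    "Sigma agar": (6.3, "g"),
    "silver nitrate": (8.5, "mg"),
}


def scale_recipe(recipe, litres):
    """Return {component: (amount, unit)} for a batch of the given volume."""
    if litres <= 0:
        raise ValueError("batch volume must be positive")
    return {
        name: (round(amount * litres, 4), unit)
        for name, (amount, unit) in recipe.items()
    }


if __name__ == "__main__":
    for name, (amount, unit) in scale_recipe(RECIPE_13224E, 0.5).items():
        print(f"{name}: {amount} {unit}")
```

The order of addition described above (agar after bringing to volume, silver nitrate after sterilizing and cooling) is not captured by this calculation and still has to be followed.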
Selection medium (13266L) comprises 1650 mg/l ammonium nitrate; 277.8 mg/l ammonium sulfate; 5278 mg/l potassium nitrate; 407.4 mg/l calcium chloride, anhydrous; 234.92 mg/l magnesium sulfate, anhydrous; 410 mg/l potassium phosphate, monobasic; 8 mg/l boric acid; 8.6 mg/l zinc sulfate·7H2O; 1.28 mg/l potassium iodide; 44.54 mg/l ferrous sulfate·7H2O; 59.46 mg/l Na2EDTA·2H2O; 0.025 mg/l cobalt chloride·6H2O; 0.4 mg/l molybdic acid (sodium salt)·2H2O; 0.025 mg/l cupric sulfate·5H2O; 6 mg/l manganese sulfate monohydrate; 2 mg/l thiamine; 0.6 ml/l B5H minor salts 1000X; 0.4 ml/l Eriksson's vitamins 1000X; 6 ml/l S&H vitamin stock 100X; 1.98 g/l L-proline; 3.4 mg/l silver nitrate; 0.3 g/l casein hydrolysate (acid); 20 g/l sucrose; 0.6 g/l glucose; 0.8 mg/l 2,4-D; 1.2 mg/l dicamba; 6 g/l TC agar; 100 mg/l Agribio carbenicillin; 25 mg/l cefotaxime; and 150 mg/l Geneticin (G418).
Plant regeneration medium (289H) comprises 4.3 g/l MS salts (GIBCO 11117-074), 5.0 ml/l MS vitamins stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine HCl, 0.10 g/l pyridoxine HCl, and 0.40 g/l glycine brought to volume with polished D-I H2O) (Murashige and Skoog (1962) Physiol. Plant. 15:473), 100 mg/l myo-inositol, 0.5 mg/l zeatin, 60 g/l sucrose, and 1.0 ml/l of 0.1 mM abscisic acid (brought to volume with polished D-I H2O after adjusting to pH 5.6); 8.0 g/l Sigma agar (added after bringing to volume with D-I H2O); and 1.0 mg/l indoleacetic acid and 150 mg/l Geneticin (G418) (added after sterilizing the medium and cooling to 60° C.).
Hormone-free medium (272G) comprises 4.3 g/l MS salts (GIBCO 11117-074), 5.0 ml/l MS vitamins stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine HCl, 0.10 g/l pyridoxine HCl, and 0.40 g/l glycine brought to volume with polished D-I H2O), 0.1 g/l myo-inositol, and 40.0 g/l sucrose (brought to volume with polished D-I H2O after adjusting pH to 5.6); and 0.5 mg/l IBA and 150 mg/l Geneticin (G418) and 6 g/l bacto-agar (added after bringing to volume with polished D-I H2O), sterilized and cooled to 60° C.
To identify repetitive sequence deletion positive events, genomic DNA was extracted from leaf tissue of T0 plants, and PCR was performed using Phusion master mix (Thermo Scientific F-581) and the primers listed in Table 3. Primer locations are shown in
Next Generation Sequencing (NGS) was used to evaluate the junction sequences in the deletion positive events. The junction was PCR amplified with PHUSION® Flash High Fidelity PCR Master Mix (Thermo Scientific, F-531). The same primers can be used for both the CR2/CR4 and CR2/ST1-CR1 deletions. The primers used in the primary PCR reaction are shown in Table 3 and the primers used in the secondary PCR reaction are provided in SEQ ID NO:12 and SEQ ID NO:13. “NNNNNNNN” in the reverse primer is the barcode sequence corresponding to a sample location on a plate.
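For illustration, reads carrying the 8-nucleotide barcode could be assigned back to plate locations along the following lines. This is a sketch under stated assumptions, not the pipeline actually used: it assumes each read begins with its barcode, uses exact matching only, and relies on a made-up barcode-to-well map (the actual barcodes are not given here).

```python
from collections import defaultdict

BARCODE_LENGTH = 8  # length of the "NNNNNNNN" placeholder in the reverse primer

# Hypothetical barcode-to-well map; the real barcodes are not disclosed above.
BARCODE_TO_WELL = {
    "ACGTACGT": "A1",
    "TGCATGCA": "A2",
}


def demultiplex(reads, barcode_to_well=BARCODE_TO_WELL):
    """Group reads by plate well, assuming each read starts with its barcode.

    Reads whose leading 8 nt match no known barcode go to "unassigned".
    The barcode itself is trimmed from the returned sequence.
    """
    by_well = defaultdict(list)
    for read in reads:
        barcode = read[:BARCODE_LENGTH].upper()
        insert = read[BARCODE_LENGTH:]
        well = barcode_to_well.get(barcode, "unassigned")
        by_well[well].append(insert)
    return dict(by_well)


if __name__ == "__main__":
    example_reads = ["ACGTACGTGGGG", "TGCATGCACCCC", "AAAAAAAATTTT"]
    print(demultiplex(example_reads))
```

A production pipeline would typically also tolerate sequencing errors in the barcode; exact matching is used here only to keep the sketch short.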
The Ht1 repetitive sequence deletion T0 plants were transferred to a controlled environment. Pollen from T0 plants was carried to recurrent parent plants to produce seed. T1 plants went through more comprehensive molecular characterization, not only to confirm that the mutations observed in the T0 plants were stably inherited but also to verify that the T1 or later generation plants were free from any foreign DNA elements used during the transformation process. First, qPCR was performed on all helper genes, including Cas9, the guide RNAs, the transformation selection marker (NPTII), and the transformation enhancing genes ODP2 and WUS2, to make sure the genes segregated away from the generated mutant alleles. The T1 plants will be sampled using Southern by Sequencing (SbS) analysis to further demonstrate that the plants are free of any foreign DNA.
The gRNA/Cas9 site-directed nuclease system, described in WO2015026885, WO2015026887, WO2015026883, and WO2015026886, was used to edit the Ht1 gene by replacing a native allele with a resistant allele of Ht1 from PH4GP (US2010095395; SEQ ID NO:65). The following pairs of target sites were used for removing the entire Ht1 allele from line PH184C (U.S. Pat. No. 8,445,763), including the predicted promoter, the coding sequence, and 1 kb of 3′ UTR: HT1-TS6 with HT1-TS9 and HT1-TS7 with HT1-TS10. The location of each target site and the schematic drawing of the allele swap are shown in
See Example 1.
To direct Cas9 nuclease to the designated genomic target sites (Table 4), a maize U6 polymerase III promoter (SEQ ID NO:6; see WO2015026885, WO2015026887, WO2015026883, and WO2015026886) and its cognate U6 polymerase III termination sequences (TTTTTTTT) were used to direct initiation and termination of gRNA expression. Guide RNA variable targeting domains for HT1 gene editing are identified as HT1-CR6, HT1-CR7, HT1-CR9, and HT1-CR10, which correspond to the genomic target sites HT1-TS6, HT1-TS7, HT1-TS9, and HT1-TS10, respectively. Oligos containing the DNA encoding each of the variable nucleotide targeting domains were synthesized and cloned into a gRNA expression cassette as described in Example 1. Each guide RNA expression cassette consists of the U6 polymerase III maize promoter operably linked to one of the DNA versions of the guide RNA (Table 6) followed by the cognate U6 polymerase III termination sequence. The DNA version of the guide RNA consists of the respective nucleotide variable targeting domain followed by a polynucleotide sequence capable of interacting with the double strand break inducing endonuclease.
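The order of elements in each guide RNA expression cassette, as described above, can be sketched as a simple concatenation. This is an illustrative sketch only: the function name and the short example sequences are hypothetical placeholders and are not the sequences of SEQ ID NO:6, Table 4, or Table 6; only the TTTTTTTT termination sequence is taken from the text.

```python
U6_TERMINATOR = "TTTTTTTT"  # termination sequence given in the text


def _check_dna(name, seq):
    """Upper-case a sequence and reject anything other than A, C, G, T."""
    seq = seq.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError(f"{name} must be a non-empty A/C/G/T sequence")
    return seq


def build_grna_cassette(u6_promoter, variable_targeting_domain,
                        cas_interaction_domain):
    """Concatenate cassette elements in 5'->3' order.

    promoter -> variable targeting domain -> Cas-interacting sequence
    -> U6 terminator.
    """
    parts = [
        _check_dna("u6_promoter", u6_promoter),
        _check_dna("variable_targeting_domain", variable_targeting_domain),
        _check_dna("cas_interaction_domain", cas_interaction_domain),
        U6_TERMINATOR,
    ]
    return "".join(parts)


if __name__ == "__main__":
    # Placeholder sequences for illustration only.
    cassette = build_grna_cassette("AAAC", "GATTACAGATTACAGATTAC", "GTTTTAGAGC")
    print(cassette, len(cassette))
```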
The substitution/replacement template for CR6/CR9 contains the resistant allele of ZM-HT1-(PH4GP) and the homology sequences flanking the 5′ of HT1-TS6 and 3′ of HT1-TS9; the substitution template for CR7/CR10 contains the same resistant allele of ZM-HT1-(PH4GP) and the homology sequences flanking the 5′ of HT1-TS7 and 3′ of HT1-TS10 in line PH184C. The homology arm sequences (SEQ ID NOs:81-84) were synthesized and then cloned with substitutive ZM-HT1-(PH4GP) genomic sequences via a standard seamless Gibson cloning method.
Plasmids containing the Cas9, guide RNA expression cassettes, and substitution template described above were co-bombarded with plasmids containing the transformation selectable marker NPTII and the transformation enhancing developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2)) and Wuschel (20151030-6752 USPSP) into elite maize lines' genomes. Transformation of maize immature embryos can be performed using any method known in the art or the method described in Example 1.
The T0 plant leaf tissue DNA extraction protocol is the same as described in Example 1. To identify swap positive events, PCR was performed using Sigma Extract-N-Amp PCR ready mix. PCR was performed to assay the HR1 junction using the primer pair Ht1HR1f1/Ht1HR1r1, while primary PCR with the primer pair Ht1HR2f1/Ht1HR2r1 was combined with secondary allele differentiation qPCR to screen the HR2 junction, due to the high homology between the intended edited variants and the unmodified genomic sequence. The primers for the primary PCR and the primers and probes for the secondary qPCR are listed in Table 7. The same assay described for the CR6/CR9 swap is also used for CR7/CR10 allele swap event screening.
The identified allele swap variants will be further molecularly characterized, and qPCR will be used to screen T1(BC0) plants for null segregants, which are expected to be free of the plasmid DNA used during transformation initiation. Southern by sequencing will also be performed to confirm null segregant plants. Table 8 shows a summary of the T0 results obtained from the allele swap experiments. Three T0 plants have been identified as potential allele swap variants among 300 screened T0 plants.
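The T0 counts reported above (three potential variants among 300 screened plants) correspond to an observed frequency of 1%. As a hedged illustration of how the uncertainty on such a small count might be expressed, the sketch below computes a Wilson score interval; the choice of this interval and the function name are assumptions made here for illustration and are not part of the reported analysis.

```python
import math


def wilson_interval(successes, trials, z=1.96):
    """Return (observed frequency, lower bound, upper bound).

    Wilson score interval for a binomial proportion; z = 1.96 corresponds
    to an approximately 95% interval.
    """
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need 0 <= successes <= trials and trials > 0")
    p = successes / trials
    denom = 1 + z ** 2 / trials
    centre = (p + z ** 2 / (2 * trials)) / denom
    half_width = z * math.sqrt(
        p * (1 - p) / trials + z ** 2 / (4 * trials ** 2)
    ) / denom
    return p, centre - half_width, centre + half_width


if __name__ == "__main__":
    freq, low, high = wilson_interval(3, 300)
    print(f"frequency={freq:.3f}, interval=({low:.4f}, {high:.4f})")
```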
A wall-associated kinase (WAK) gene, NLB18, was identified and validated as a northern leaf blight resistance gene (WO2011163590). The NLB18 gene is clustered with NLB17 on the long arm of chromosome 8. The NLB18 and NLB17 genes are 6.9 kb apart and share a high degree of homology; thus, identifying a unique target site for the NLB18 allele swap is challenging. Multiple sites were identified and guide RNAs were tested. Guides that cut only the NLB18 gene region, and not the NLB17 gene region, were selected for the NLB18 allele swap.
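The guides described above were selected experimentally. Purely as an illustration of the underlying idea, a first-pass in silico filter for candidate target sites present in one gene region but absent from a close homolog might look like the sketch below. It assumes an NGG protospacer adjacent motif (as for S. pyogenes Cas9), a 20-nucleotide protospacer, and exact matching only; real specificity analysis would also consider mismatched sites. All names and sequences are hypothetical.

```python
import re


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def unique_protospacers(target, homolog, length=20):
    """List (position, protospacer) pairs on the forward strand of `target`
    that are followed by an NGG PAM and do not occur, on either strand,
    in `homolog`. Exact matching only (an illustrative simplification).
    """
    target, homolog = target.upper(), homolog.upper()
    # "N" separator prevents matches spanning the two strands.
    homolog_both = homolog + "N" + reverse_complement(homolog)
    hits = []
    # Lookahead lets overlapping candidates be reported.
    pattern = r"(?=([ACGT]{%d})[ACGT]GG)" % length
    for match in re.finditer(pattern, target):
        protospacer = match.group(1)
        if protospacer not in homolog_both:
            hits.append((match.start(), protospacer))
    return hits


if __name__ == "__main__":
    # Made-up sequences for illustration.
    region_a = "GATTACAGATTACAGATTACTGG"
    region_b = "CCCCCCCCCCCCCCCCCCCCCCCC"
    print(unique_protospacers(region_a, region_b))
```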
The gRNA/Cas9 site-directed nuclease system, described in WO2015026885, WO2015026887, WO2015026883, and WO2015026886, was used to edit the NLB18 gene. The following pairs of target sites were used for removing the entire NLB18 allele from line PH184C, including the potential promoter, the coding sequence, and 3′ UTR: NLB18-TS1 with NLB18-TS4 and NLB18-TS8 with NLB18-TS4. The location of each target site at the NLB18 locus and the schematic drawing of the allele swap are shown in
See Example 1.
To direct Cas9 nuclease to the designated genomic target sites (Table 9), a maize U6 polymerase III promoter (SEQ ID NO:6; see WO2015026885, WO2015026887, WO2015026883, and WO2015026886) and its cognate U6 polymerase III termination sequences (TTTTTTTT) were used to direct initiation and termination of gRNA expression. Guide RNA variable targeting domains for the NLB18 gene are identified as NLB18-CR1, NLB18-CR8, and NLB18-CR4, which correspond to the genomic target sites NLB18-TS1, NLB18-TS8, and NLB18-TS4, respectively. Oligos containing the DNA encoding each of the variable nucleotide targeting domains were synthesized and cloned into a gRNA expression cassette as described in Example 1 above. Each guide RNA expression cassette consists of the U6 polymerase III maize promoter operably linked to one of the DNA versions of the guide RNA (Table 10), followed by the cognate U6 polymerase III termination sequence. The DNA version of the guide RNA consists of the respective nucleotide variable targeting domain followed by a polynucleotide sequence capable of interacting with the double strand break inducing endonuclease.
The substitution/replacement templates for NLB18-CR1/CR4 contain the resistant allele of ZM-NLB18 (from PH26N) and the homology sequences flanking the 5′ of NLB18-TS1 (SEQ ID NO:67) and the 3′ of NLB18-TS4 (SEQ ID NO:68) in PH184C; the substitution templates for NLB18-CR8/CR4 contain the resistant allele of ZM-NLB18 (from PH26N) and the homology sequences flanking the 5′ of NLB18-TS8 (SEQ ID NO:69) and 3′ of NLB18-TS4 (SEQ ID NO:68) in PH184C. SEQ ID NO:66 is the NLB18 nucleotide sequence from PH184C, including the 5′ of NLB18-CR8 through the 3′ of NLB18-CR4. The homology arm sequences were synthesized with additional sequence containing restriction sites; after restriction digestion, they were assembled together with the desired resistant allele of NLB18 (from PH26N) into a yeast backbone using standard yeast in vivo assembly protocols. The plasmids from pooled yeast transformants of the assembly reaction were recovered in E. coli, and the plasmids that passed quality control were used as templates for co-bombardment.
Plasmids containing the Cas9, guide RNA expression cassettes, and substitution templates described above were co-bombarded with plasmids containing the transformation selectable marker NPTII and the transformation enhancing developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2)) and Wuschel (20151030-6752 USPSP) into elite maize lines' genomes. Transformation of maize immature embryos can be performed using any method known in the art or using the method described in Example 1.
Screening will be performed similarly to the experiments described previously.
A maize genomic window spanning from ZM01:13.7MM to ZM01:16.4MM on chromosome 1 was identified and developed to become Complex Trait Locus (CTL) 1 (WO2016040030). Three sites on CTL1, TS8, TS10, and TS45, were selected for relocating the NLB resistant genes NLB18-PH26N (SEQ ID NO:70), Ht1-PH4GP (SEQ ID NO:65), and NLB18-PH26N (SEQ ID NO:70), respectively. Table 11 shows the genetic map positions for Cas endonuclease target sites (TS8, TS45, TS10), and
The Cas9 gene from Streptococcus pyogenes M1 GAS (SF370) was maize codon optimized per standard techniques known in the art and the potato ST-LS1 intron was introduced in order to eliminate its expression in E. coli and Agrobacterium. To facilitate nuclear localization of the Cas9 protein in maize cells, Simian virus 40 (SV40) monopartite amino terminal nuclear localization signal (SEQ ID NO:5) and Agrobacterium tumefaciens bipartite VirD2 T-DNA border endonuclease carboxyl terminal nuclear localization signal (SEQ ID NO:80) were incorporated at the amino and carboxyl termini of the Cas9 open reading frame, respectively. SEQ ID NO:73 is the nucleotide sequence containing the Cas9 used in Example 4; SEQ ID NO:73 contains the Cas9 exon 1 (SP), the ST-LS1 intron 2, the Cas9 exon 2 (SP), and the VirD2 nuclear localization signal. (The SP version of Cas9 differs from the ST version used in the previous examples with respect to codon usage; however, the SP version and the ST version encoded by SEQ ID NO:4 are identical.) The maize optimized Cas9 gene was operably linked to a maize ubiquitin promoter by standard molecular biology techniques.
The maize U6 polymerase III promoter (SEQ ID NO:6; see WO2015026885, WO2015026887, WO2015026883, and WO2015026886) was used to express guide RNAs which direct Cas9 nuclease to designated genomic sites. The guide RNA coding sequence was 77 bp long and comprised a 12-30 bp variable targeting domain from a chosen maize genomic target site on the 5′ end and a maize U6 polymerase III terminator.
In order for the Cas9 endonuclease and the guide RNA to form a protein/RNA complex to mediate site-specific DNA double strand cleavage, the Cas9 endonuclease and guide RNA have to be present simultaneously. To improve their co-expression and presence, the Cas9 endonuclease and guide RNA expression cassettes were linked into a single DNA construct. A 480-490 bp sequence containing the guide RNA coding sequence, the 12-30 bp variable targeting domain from the chosen maize genomic target site, and part of the U6 promoter was synthesized. The sequence was then cloned into the backbone, which already contained the Cas9 cassette and the rest of the gRNA expression cassette, through the BstBI/HindIII restriction sites.
The relocating template for CTL1-8CR1 contains the resistant allele of ZM-NLB18 (from PH26N) (SEQ ID NO:70) and the homology sequences flanking the 5′ of CTL1-TS8 (SEQ ID NO:85) and the 3′ of CTL1-TS8 (SEQ ID NO:86) in PH184C. The relocating template for CTL1-45CR1 contains the resistant allele of ZM-NLB18 (from PH26N) (SEQ ID NO:70) and the homology sequences flanking the 5′ of CTL1-TS45 (SEQ ID NO:87) and the 3′ of CTL1-TS45 (SEQ ID NO:88) in PH184C. The relocating template for CTL1-10CR3 contains the resistant allele of ZM-HT1 (from PH4GP) (SEQ ID NO:65) and the homology sequences flanking the 5′ of CTL1-TS10 (SEQ ID NO:89) and the 3′ of CTL1-TS10 (SEQ ID NO:90) in PH184C. The 300-500 bp homology arm sequences were synthesized and then cloned with the desired resistant allele sequence via a standard seamless Gibson cloning method.
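The layout of a relocating template (5′ homology arm, resistant allele, 3′ homology arm) can be illustrated with the following sketch. It is not the construction method used above, where the arms were synthesized and joined by Gibson cloning; it only shows how arm sequences of the stated 300-500 bp length could be taken from a reference sequence around a target site. It assumes that the arms abut the cut position directly, and the function name and inputs are hypothetical.

```python
MIN_ARM_BP, MAX_ARM_BP = 300, 500  # homology arm length range stated in the text


def build_relocation_template(reference, cut_index, insert, arm_length=400):
    """Return 5' arm + insert + 3' arm for a cut between
    reference[cut_index - 1] and reference[cut_index].

    Assumption for illustration: arms directly flank the cut position.
    """
    if not MIN_ARM_BP <= arm_length <= MAX_ARM_BP:
        raise ValueError("arm_length must be within 300-500 bp")
    if cut_index - arm_length < 0 or cut_index + arm_length > len(reference):
        raise ValueError("reference too short for the requested arms")
    five_prime_arm = reference[cut_index - arm_length:cut_index]
    three_prime_arm = reference[cut_index:cut_index + arm_length]
    return five_prime_arm + insert + three_prime_arm


if __name__ == "__main__":
    # Made-up reference and insert for illustration.
    reference = "A" * 500 + "C" * 500
    template = build_relocation_template(reference, 500, "GGGG", arm_length=300)
    print(len(template))
```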
A plasmid comprising the maize codon optimized Cas9 endonuclease expression cassette and guide RNA expression cassettes was co-delivered with a plasmid comprising a DNA template containing NLB18-PH26N (
The guide RNA-DNA constructs targeting various maize genomic sites and the template DNA constructs that were constructed for introduction of the resistant genes into Cas endonuclease target sites through homologous recombination (homology directed repair) are provided in Table 12. These guide RNA, Cas9 DNA constructs, and repair template DNAs were co-delivered into an elite maize genome (e.g. PH184C) by the stable transformation procedure described in Example 1.
See Example 1.
To identify relocation events (insertion positive events), genomic DNA was extracted from leaf tissue of T0 plants, and junction PCR was performed using Sigma Extract-N-Amp PCR ready mix. Primer locations are shown in
T0 plants containing relocations of NLB18 (BC26N) and HT1 (ED4GP) were backcrossed with wild-type recurrent parents to make BC0 (T1) seeds. BC0 seedlings were molecularly characterized by junction PCR to confirm that the insertion of NLB18 (BC26N) at CTL1-TS45 and the insertion of HT1 (ED4GP) at CTL1-TS10 were inherited by the next generation. qPCR on the helper genes used during the transformation process was also performed to make sure they had segregated away from the relocation plants; null segregants were also confirmed by SbS (Southern by Sequencing). Seeds from selfed plants (BC0F2) were planted in the field, and zygosity analysis was done on leaf samples; a 1:2:1 ratio of homozygous:hemizygous:null insertion was observed for both the NLB18 (BC26N) insertion at CTL1-TS45 and the HT1 (ED4GP) insertion at CTL1-TS10. The plants were also analyzed for RNA expression using qRT-PCR after inoculation with NLB. Compared to nulls, both hemizygous and homozygous plants showed resistance to infection, and both NLB18 and HT1 expression were elevated (
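Whether zygosity counts from a selfed population are consistent with the 1:2:1 ratio expected for a single insertion can be checked with a chi-square goodness-of-fit test. The sketch below is illustrative only: the statistical test is an assumption made here (the analysis used above is not described), the function name is hypothetical, and the example counts are made up because the actual counts are not given.

```python
# Chi-square critical value for 2 degrees of freedom at alpha = 0.05.
CHI2_CRITICAL_DF2_P05 = 5.991


def chi_square_1_2_1(homozygous, hemizygous, null):
    """Return (chi-square statistic, consistent_with_1_2_1).

    Compares observed counts to a 1:2:1 expectation; the second element is
    True when the statistic is below the 5% critical value (2 d.f.).
    """
    observed = (homozygous, hemizygous, null)
    total = sum(observed)
    if total <= 0 or min(observed) < 0:
        raise ValueError("counts must be non-negative with a positive total")
    expected = (total * 0.25, total * 0.5, total * 0.25)
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return statistic, statistic < CHI2_CRITICAL_DF2_P05


if __name__ == "__main__":
    # Made-up counts for illustration only.
    print(chi_square_1_2_1(26, 49, 25))
```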
Events were tested in field experiments for efficacy against the northern leaf blight pathogen (Exserohilum turcicum). First, events were challenged with the pathogen for which Ht1 and NLB18 genes are known to provide resistance. All positive events, as determined by qPCR, were resistant to the pathogen. The results are shown in Tables 14 and 15. The number of plants for each event is indicated in column “N”, representing the number of plants tested. The resistance score for NLB ranges from 1 to 9, with 9 being the most resistant.
This application claims priority to International Patent Application PCT/US2017/55835 filed on Oct. 10, 2017, which claims priority to U.S. Provisional Application No. 62/407,867, filed Oct. 13, 2016, the contents of which are herein incorporated by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2017/055835 | 10/10/2017 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2018/071362 | 4/19/2018 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5844116 | Piper | Dec 1998 | A |
6504084 | Crane, III et al. | Jan 2003 | B1 |
6720487 | Hoffbeck | Apr 2004 | B1 |
6765132 | Brenner et al. | Jul 2004 | B1 |
8062847 | Broglie et al. | Nov 2011 | B2 |
8921646 | Wilson et al. | Dec 2014 | B2 |
9040772 | Li et al. | May 2015 | B2 |
11447793 | Li et al. | Sep 2022 | B2 |
11560568 | Cigan et al. | Jan 2023 | B2 |
11653609 | Hou | May 2023 | B2 |
20080083042 | Butruille et al. | Apr 2008 | A1 |
20100095395 | Wilson et al. | Apr 2010 | A1 |
20110008793 | Butruille et al. | Jan 2011 | A1 |
20150218660 | Li et al. | Aug 2015 | A1 |
20150240253 | McGonigle et al. | Aug 2015 | A1 |
20150315605 | Li et al. | Nov 2015 | A1 |
20150376644 | Li et al. | Dec 2015 | A1 |
20190075749 | Hou et al. | Mar 2019 | A1 |
20190177744 | Li et al. | Jun 2019 | A1 |
20210137040 | Ouzunova et al. | May 2021 | A1 |
20210274739 | Hou et al. | Sep 2021 | A1 |
20220064662 | Abbitt et al. | Mar 2022 | A1 |
20220408678 | Hou et al. | Dec 2022 | A1 |
20230203525 | Li et al. | Jun 2023 | A1 |
20230210080 | Hou et al. | Jul 2023 | A1 |
Number | Date | Country |
---|---|---|
WO-2008021225 | Feb 2008 | WO |
WO-2009091518 | Jul 2009 | WO |
2010045211 | Apr 2010 | WO |
WO-2011163590 | Dec 2011 | WO |
WO-2014036048 | Mar 2014 | WO |
2015026883 | Feb 2015 | WO |
2015026885 | Feb 2015 | WO |
2015026886 | Feb 2015 | WO |
2015026887 | Feb 2015 | WO |
WO-2016040030 | Mar 2016 | WO |
2017066597 | Apr 2017 | WO |
WO-2018071362 | Apr 2018 | WO |
WO-2021257206 | Dec 2021 | WO |
Entry |
---|
Welz et al 2000 (Plant Breeding 119: p. 1-14) (Year: 2000). |
Thatcher et al 2022 Molecular Plant Pathology 00: p. 1-10 (Year: 2022). |
Hurni, Severine, et al.: “The maize disease resistance gene Htn1 against northern corn leaf blight encodes a wall-associated receptor-like kinase”, PNAS Proceedings of the National Academy of Sciences, Jul. 14, 2015 (Jul. 14, 2015), vol. 112, No. 28, pp. 8780-8785. |
Shi, Jinrui, et al.: “ARGOS8 variants generated by CRISPR-Cas9 improve maize grain yield under field drought stress conditions”, Plant Biotechnology Journal, Aug. 17, 2016 (Aug. 17, 2016), vol. 15, No. 2, pp. 207-216. |
International Search Report and Written Opinion, International Application No. PCT/US2017/055835 dated Mar. 13, 2018. |
International Search Report and Written Opinion, International Application No. PCT/US2016/057081 dated Feb. 8, 2017. |
Non-Final Office Action for U.S. Appl. No. 17/319,319, dated Sep. 22, 2022. |
Li, L. J.; et al.: “The physical location of the gene ht1 (Helminthosporium turcium resistance1) in maize (Zea mays L.)”, Hereditas, 1998, vol. 129, pp. 101-106. |
Schnable, P. S., et al.: “The B73 Maize Genome: Complexity, Diversity, and Dynamics”, Science Magazine (2009) vol. 326, No. 5956, pp. 1112-1115. |
UniProt Database Accession No. UPI000220E9DC dated Mar. 19, 2013. |
Yang, et al.: “Quantitative Disease Resistance: Dissection and Adoption in Maize,” Molecular Plant, Mar. 2017, vol. 10, pp. 402-413. |
Extended European Search Report for European Application No. 23158586.0, dated Oct. 23, 2023, 8 Pages. |
Asea G., et al., “Validation of Consensus Quantitative Trait Loci Associated with Resistance to Multiple Foliar Pathogens of Maize,” Phytopathology, 2009, vol. 99, No. 5, pp. 540-547. |
Balint-Kurti P.J., et al., “Use of a Maize Advanced Intercross Line for Mapping of QTL for Northern Leaf Blight Resistance and Multiple Disease Resistance,” Crop Science, Mar.-Apr. 2010, vol. 50, pp. 458-466. |
Bentolila S., et al., “Identification of an RFLP Marker Tightly Linked to the Ht1 Gene in Maize,” Theoretical and Applied Genetics, 1991, vol. 82, pp. 393-398. |
Chung C-L., et al., “Characterization and Fine-Mapping of a Resistance Locus for Northern Leaf Blight in Maize Bin 8.06,” Theoretical and Applied Genetics, International Journal of Plant Breeding Research, Springer, Berlin, DE, Mar. 9, 2010, vol. 121, No. 2, pp. 205-227, ISSN 1432-2242, XP019836046. |
David Z., et al., “Linkage of a Second Gene for NCLB Resistance to Molecular Markers in Maize,” Maize Genetics Cooperation Newsletter, vol. 66, pp. 69-70. |
Extended European Search Report for European Application No. 18167377.3, dated May 29, 2018, 8 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2011/041822, dated Jan. 10, 2013, 7 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2016/057081, dated Apr. 26, 2018, 7 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2017/055835, dated Apr. 25, 2019, 12 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2021/031741, dated Dec. 29, 2022, 8 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2021/044479, dated Feb. 23, 2023, 9 Pages. |
International Search Report and Written Opinion for International Application No. PCT/US2011/041822, dated Oct. 6, 2011, 10 Pages. |
International Search Report and Written Opinion for International Application No. PCT/US2021/031741, dated Aug. 5, 2021, 13 Pages. |
International Search Report and Written Opinion for International Application No. PCT/US2021/044479, dated Feb. 18, 2022, 13 Pages. |
Kevin S.D., et al., “Mapping the HtN Resistance Gene to the Long Arm of Chromosome 8,” Maize Genetics Cooperation Newsletter, 1993, vol. 67. |
Kevin S.D., et al., “The Use of Molecular Markers to Study Setosphaeria Turcica Resistance in Maize,” Phytopathology, 1993, vol. 82, No. 12, pp. 1326-1330. |
Lehti-Shiu M.D., et al., “Diversity, Classification and Function of the Plant Protein Kinase Superfamily,” Philosophical Transactions of the Royal Society B, 2012, vol. 367, pp. 2619-2639. |
Manju G., et al., “Identification of RFLP Markers for the Ht1 Gene by Comparison of Inbreds and their HT1-Inversions,” Maize Genetics Cooperation Newsletter, 1989. |
Paterson A.H., et al., “The Sorghum Bicolor Genome and the Diversification of Grasses,” Nature, Jan. 29, 2009, vol. 457, No. 29, DOI: 10.1038/nature07723, pp. 551-556, XP009145526. |
Romeis T., “Protein Kinases in the Plant Defence Response,” Current Opinion in Plant Biology, 2001, vol. 4, pp. 407-414. |
Schnable, “A0A096THR4,” Nov. 26, 2014, [Retrieved on Jan. 16, 2017] XP055335643, Retrieved from URL: http://www.uniprot.org/uniprot/A0A096THR4.txt?version=1. |
Soderlund C., et al., “Sequencing, Mapping, and Analysis of 27,455 Maize Full-Length cDNAs,” PLOS Genetics, Nov. 2009, vol. 5, No. 11, e1000740, 13 Pages. |
UNIPROT: “SubName: Full=Putative Disease Resistance RPP13-Like Protein 3 {ECO:0000313|EMBL:PWZ41471.1},” UniProt, Apr. 22, 2020, Database Accession No. A0A3L6G3T9, 2 Pages, Retrieved from URL: EBI, XP002803769. |
UNIPROT: “SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EES01295.1},” UniProt, Database Accession No. C5XGV9, Sep. 1, 2009, XP002781139, Retrieved from URL: EBI. |
UNIPROT: “Subname: Full=Uncharacterized protein,” Database UniProt [Online], Accession No. ROJM84_SETT2, Database Accession No. ROJM84, XP55887658, Retrieved from URL: https://www.uniprot.org/uniprot/ROJM84.txt. |
Webb C.A., et al., “Genetic and Molecular Characterization of the Maize rp3 Rust Resistance Locus,” Genetics, Sep. 2002, vol. 162, pp. 381-394. |
Wilson R.K., et al., “Zea mays Chromosome 8 Clone CH201-117L11, ZMMBBc0117L01, *** Sequencing in Progress *** , 14 Unordered Pieces,” Nucleotide, GenBank Accession No. AC197148.2, Jun. 27, 2008, pp. 1-44. |
Wisser R.J., et al., “Selection Mapping of Loci for Quantitative Disease Resistance in a Diverse Maize Population Genetics,” Sep. 2008, vol. 180, pp. 583-599. |
Wisser R.J., et al., “The Genetic Architecture of Disease Resistance in Maize: A Synthesis of Published Studies,” Phytopathology, 2006, vol. 96, No. 2, pp. 120-129, DOI:10.1094/PHYTO-96-0120, XP009100368. |
Yang E., et al., “Organisms with Candidate Sequences in the Localization Region of Maize Leaf Spot Resistance Gene Ht1,” Hereditas. |
Zheng P., et al., “A Phenylalanine in DGAT is a Key Determinant of Oil Content and Composition in Maize,” Nature Genetics, Mar. 2008, vol. 40, No. 3, pp. 367-372. |
Zuo W., et al., “A Maize Wall-Associated Kinase Confers Quantitative Resistance to Head Smut,” Nature Genetics, Feb. 2015, vol. 47, No. 2, pp. 151-158, 9 Pages. |
Number | Date | Country | |
---|---|---|---|
20220275392 A1 | Sep 2022 | US |
Number | Date | Country | |
---|---|---|---|
62407867 | Oct 2016 | US |