The contents of the electronic sequence listing (794542002300seglist.xml; Size: 207,448 bytes; and Date of Creation: Mar. 8, 2024) are herein incorporated by reference in its entirety.
The contents of the large table CERK6 (CERK6Z4_refine_60.txt; Size: 1,432,541 bytes; and Date of Creation: Mar. 1, 2023) and the large table LYK3 (LYK3Z4_refine_67.txt; Size: 727,916 bytes; and Date of Creation: Mar. 1, 2023) are herein incorporated by reference in their entireties.
The present disclosure relates to genetically modified plant LysM receptors and methods of producing the same. In particular, the present disclosure relates to modified plant LysM receptors including a modified juxtamembrane (JM) zone 4, and optionally further including a modified JM zone 2, a modified JM zone 3, a modified extracellular domain, and/or a modified kinase C-terminus region or a modified kinase N-terminus region. The modified LysM receptors of the present disclosure are either able to able to initiate NFR1-mediated root nodule symbiosis signaling or able to initiate ROS signaling. In addition, the present disclosure relates to genetically modified plants or parts thereof including the genetically modified plant LysM receptors and methods of producing the same. The present disclosure further relates to expression vectors, isolated DNA molecules, or recombinant nucleic acids encoding the genetically modified plant LysM receptors.
Receptor kinases (RKs) are located on the cell surface of eukaryotic cells, and act as signal sensors and transducers to regulate crucial processes. Plant RKs are known as receptor-like kinases (RLKs, which have intracellular domains) or receptor-like proteins (RLPs, which lack intracellular domains). Most RKs and RLKs are characterized by similar protein domains: an extracellular domain (EC), a single pass transmembrane domain (TM), a cytoplasmic flexible region known as juxtamembrane domain (JM), and a kinase domain (K) followed by a C-terminal tail.
All plants use lysin motif (LysM) RLKs called chitin elicitor kinases (CERKs) to recognize chitin oligomers (CO6-8; chitooligosaccharides) produced by pathogenic fungi and to activate an immune response. Legume plants have evolved highly similar LysM RLKs called nod factor receptors (NFRs) with increased sensitivity for decorated chitin oligomers (LCOs; lipochitooligosaccharides) produced by nitrogen-fixing soil bacteria. These receptors provide legumes with an ecological advantage as they are able to initiate symbiosis signaling leading to nodule organogenesis, bacterial infection, and symbiotic nitrogen fixation in nutrient-poor soils. Both CERKs and NFRs contain three tandem LysM domains in their extracellular region, a single-pass transmembrane (TM) domain, a juxtamembrane (JM) domain, and an intracellular kinase domain. The kinase domains of CERKs and NFRs phosphorylate specific substrates and initiate distinct signaling pathways: immunity for the CERKs or symbiosis for the NFRs.
The endosymbiotic relationship between plants and nitrogen-fixing bacteria is known as root nodule symbiosis (RNS), and there are different types of RNS. The most common type of RNS is rhizobia-legume symbiosis, which occurs between legume plants (of the Fabaceae or Leguminosae family), and nitrogen-fixing bacteria known as rhizobia (Beringer et al. (1979) Proceedings of the Royal Society of London, 204(1155):219-33). Plants that engage in RNS are found only within the orders Fabales, Fagales, Cucurbitales, and Rosales, which together constitute the FaFaCuRo or nitrogen-fixing clade (NFC) (Kistner and Parniske (2002) Trends in Plant Science, 7(11):511-18; Soltis et al. (1995) PNAS, 92(7):2647-51). However, only 10 out of 28 families of the NFC contain plant species able to establish RNS (nodulating species) (Doyle (2011) MPMI, 24(11):1289-95).
Rhizobia-legume RNS requires the plant LysM receptor kinase pair NFR1 and NFR5 for the perception of Nod factors (Radutoiu et al. (2003) Nature 425, 585-592) and the subsequent initiation of the symbiosis pathway. It has been shown that receptor signaling requires the catalytic activity of the NFR1 protein kinase, while the mechanistic role for the catalytically inactive NFR5 pseudokinase is less well understood. LjNFR1 and MtLYK3, both of which are NFRs with kinase domains, are thought to be key receptors for symbiotic signaling in the nodulating model legume species Lotus japonicus and Medicago truncatula, respectively (Arrighi et al. (2006) Plant Phys., 142(1):265-79; Madsen et al. (2011) Plant J., 65(3):404-17.; Mbengue et al. (2010) Plant Cell, 22(10):3474). In Lotus japonicus, NFR1 is indispensable for signaling, since its absence results in the lack of either physiological change (e.g., root hair deformation in response to NF) or molecular changes (e.g., no calcium signaling or expression of key nodulation genes, such as the transcription factor NIN) (Madsen et al. (2003) Nature, 425(6958):637-40.; Miwa et al. (2006) MPMI, 9(8):914-23; Radutoiu et al. (2003) Nature, 425(6958):585-92). Despite the importance of NFRs with kinase domains, the structural domains of the receptors that are responsible for determining the downstream symbiosis pathway have remained unidentified.
There exists a need to understand the specific domains of the NFR1 kinase involved in the initiation of the symbiosis pathway. More broadly, there is a need to identify the structural domains of LysM receptors that determine whether the downstream symbiosis pathway is initiated or the downstream immunity pathway. The identification of these domains will allow the engineering of existing LysM receptor kinases involved in immunity signaling (CERKs) into LysM receptor kinases involved in symbiosis signaling (NFRs) and vice versa. Further identifying these domains will allow targeted modification of existing LysM receptor kinases into kinases able to initiate specific downstream signaling pathways. Identification of these domains is a key in LysM receptor kinase engineering, and represents an important step for engineering NFR1-mediated root nodule symbiosis signaling in non-legume species.
The present disclosure identifies a zone in the NFR1 juxtamembrane (JM) domain as being critical for NFR1-mediated root nodule symbiosis signaling, namely JM zone 4. Additional JM domains identified in the present disclosure, JM zone 2 and JM zone 3, are thought to be essential for receptor dimerization. In addition, the present disclosure establishes that the C-terminus of the NFR1 kinase domain is required for symbiosis signaling (i.e., NFR1-mediated root nodule symbiosis signaling), and that the N-terminus of the CERK6 kinase domain is required for immunity signaling (i.e., ROS signaling). In combination with previously identified ectodomain regions of LysM receptors, which determine receptor specificity, the transmembrane/juxtamembrane and intracellular residues and regions of the present disclosure provide essential components needed for engineering NFR1-mediated root nodule symbiosis signaling in other plant LysM receptor proteins and in non-legume species. Further, the present disclosure provides transmembrane/juxtamembrane and intracellular residues and regions required for engineering immunity signaling in plant LysM receptors.
An aspect of the disclosure includes a modified plant LysM receptor polypeptide including a first JM zone 4, wherein the first JM zone 4 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor. In a further embodiment of this aspect, the first JM zone 4, the second JM zone 4, or both correspond to amino acids 305 to 327 when aligned to SEQ ID NO: 1 or correspond to amino acids 303 to 325 when aligned to SEQ ID NO: 8. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4, or the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In yet another embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the second JM zone 4 is able to initiate NFR1-mediated root nodule symbiosis signaling, ROS signaling, or different signaling than the first JM zone 4. In another embodiment of this aspect, the second JM zone 4 is able to initiate NFR1-mediated root nodule symbiosis signaling. In an additional embodiment of this aspect, the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60; and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In a further embodiment of this aspect, the second JM zone 4 is able to initiate ROS signaling. In still another embodiment of this aspect, the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8.
A further aspect of the disclosure includes a modified plant non-NFR1 LysM receptor polypeptide engineered for NFR1-mediated root nodule symbiosis signaling including a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling. In an additional embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In yet another embodiment of this aspect, the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide further includes a first kinase C-terminus region, wherein the first kinase C-terminus region has been modified as compared to the amino acid sequence of the corresponding unmodified plant LysM receptor polypeptide by substitution of one or more amino acids in the first kinase C-terminus region with the corresponding amino acids from an NFR1 LysM receptor polypeptide.
An additional aspect of the disclosure includes a modified plant LysM receptor polypeptide with enhanced ROS signaling including a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling. In a further embodiment of this aspect, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4, or the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In an additional embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein substitution optionally includes deletion of an amino acid not found in the second JM zone 2 and insertion of an amino acid found in the second JM zone 2 but not in the first JM zone 2. In an additional embodiment of this aspect, the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the first JM zone 2 and/or the second JM zone 2 includes SEQ ID NO: 11, SEQ ID NO: 58, or SEQ ID NO: 4.
In an additional embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein substitution optionally includes deletion of an amino acid not found in the second JM zone 3 and insertion of an amino acid found in the second JM zone 2 but not in the first JM zone 3. In an additional embodiment of this aspect, the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the first JM zone 3 and/or the second JM zone 3 includes SEQ ID NO: 12, SEQ ID NO: 59, or SEQ ID NO: 5.
Yet another aspect of the disclosure includes a modified plant non-NFR1 LysM receptor polypeptide engineered for NFR1-mediated root nodule symbiosis signaling including the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein the plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In a further embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, 5523, 5525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In a further embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Still another aspect of the disclosure includes a modified plant non-CERK6 LysM receptor polypeptide engineered for immune signaling including a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In an additional embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In yet another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant non-CERK6 LysM receptor polypeptide is able to initiate ROS signaling.
In a further embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling, the modified plant LysM receptor polypeptide further includes the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In an additional embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, 5523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In still another embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling, the modified plant LysM receptor polypeptide further includes a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In a further embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In still another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified LysM receptor polypeptide is able to initiate ROS signaling.
In still another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant non-NFR1 LysM receptor polypeptide, the modified plant non-NFR1 LysM receptor polypeptide further includes the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In a further embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, 5523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In yet another embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In still another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide with enhanced ROS signaling, the modified plant LysM receptor polypeptide further includes a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In an additional embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In yet another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In still another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified LysM receptor polypeptide is able to initiate ROS signaling.
In an additional embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first extracellular domain, wherein the first extracellular domain is modified as compared to the amino acid sequence of the corresponding unmodified plant LysM receptor polypeptide. In a further embodiment of this aspect, the first extracellular domain is modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, and wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate ROS signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant non-NFR1 LysM receptor polypeptide. In a further embodiment of this aspect, the modified plant non-NFR1 LysM receptor polypeptide includes a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide with enhanced ROS signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling, and wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, 5416, 5417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, 5336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, 5400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate ROS signaling.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has a genetically modified plant or part thereof, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof. In yet another embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the plant is selected from the group of cassava, yam, sweet potato, corn, cowpea, rice, barley, wheat, Trema spp., apple, pear, plum, apricot, peach, almond, walnut, strawberry, raspberry, blackberry, red currant, black currant, melon, cucumber, pumpkin, squash, grape, bean, soybean, pea, chickpea, pigeon pea, lentil, Bambara groundnut, lupin, pulses, Medicago spp., Lotus spp., forage legumes, indigo, legume trees, or hemp.
Further aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling or that has a modified plant non-NFR1 LysM receptor polypeptide, including introducing a genetic alteration to the plant including a first nucleic acid sequence encoding the modified plant LysM receptor polypeptide or the modified plant non-NFR1 LysM receptor polypeptide. In an additional embodiment of this aspect, the nucleic acid sequence is operably linked to a promoter, wherein the promoter is a root specific promoter, an inducible promoter, a constitutive promoter, or a combination thereof. In a further embodiment of this aspect, the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In yet another embodiment of this aspect, the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter. In yet another embodiment of this aspect, the nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence is operably linked to an endogenous promoter, and wherein the endogenous promoter is a root specific promoter.
Additional aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling or that has a modified plant non-NFR1 LysM receptor polypeptide, including genetically modifying the plant or part thereof by transforming the plant or part thereof with one or more gene editing components that target an endogenous nuclear genome sequence encoding an endogenous plant LysM receptor polypeptide or plant non-NFR1 LysM receptor polypeptide to genetically modify a first JM zone 4 by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and optionally further including: (i) genetically modifying a first JM zone 2 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide; (ii) genetically modifying a first JM zone 3 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide; (iii) genetically modifying a first kinase C-terminus region by substitution of one or more amino acids in the first kinase C-terminus region with the corresponding amino acids from a second kinase C-terminus region from a NFR1 LysM receptor polypeptide; and/or (iv) genetically modifying a first extracellular domain by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In a further embodiment of this aspect, the one or more gene editing components include a ribonucleoprotein complex that targets the nuclear genome sequence; a vector including a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector including a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (OND), wherein the OND targets the nuclear genome sequence; or a vector CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence.
Further aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling or a modified plant LysM receptor polypeptide with enhanced ROS signaling, including introducing a genetic alteration to the plant including a first nucleic acid sequence encoding the modified plant LysM receptor polypeptide or the modified plant LysM receptor polypeptide with enhanced ROS signaling. In an additional embodiment of this aspect, the nucleic acid sequence is operably linked to a promoter, wherein the promoter is a root specific promoter, an inducible promoter, a constitutive promoter, or a combination thereof. In a further embodiment of this aspect, the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In yet another embodiment of this aspect, the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter. In yet another embodiment of this aspect, the nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence is operably linked to an endogenous promoter, and wherein the endogenous promoter is a root specific promoter.
Additional aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling or a modified plant LysM receptor polypeptide with enhanced ROS signaling, including genetically modifying the plant or part thereof by transforming the plant or part thereof with one or more gene editing components that target an endogenous nuclear genome sequence encoding an endogenous plant LysM receptor polypeptide to genetically modify a first JM zone 4 by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and optionally to further including: (i) genetically modifying a first JM zone 2 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide; (ii) genetically modifying a first JM zone 3 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide; (iii) genetically modifying a first kinase N-terminus region by substitution of one or more amino acids in the first kinase N-terminus region with the corresponding amino acids from a second kinase N-terminus region from a NFR1 LysM receptor polypeptide; and/or (iv) genetically modifying a first extracellular domain by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In a further embodiment of this aspect, the one or more gene editing components include a ribonucleoprotein complex that targets the nuclear genome sequence; a vector including a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector including a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (OND), wherein the OND targets the nuclear genome sequence; or a vector CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence.
A further aspect of the disclosure includes an expression vector, isolated DNA molecule, or recombinant nucleic acid including a modified plant LysM receptor polypeptide including a modified JM zone 4 domain, a modified JM zone 3 domain, a modified JM zone 2 domain, a modified kinase C-terminus region, and/or a modified extracellular domain operably linked to at least one expression control sequence. In an additional embodiment of this aspect, (i) the modified JM zone 4 was modified by substitution of one or more amino acids in a first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (ii) the modified JM zone 2 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (iii) the modified JM zone 3 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iv) the modified kinase C-terminus region was modified by substitution of one or more amino acids in a first kinase C-terminus region with the corresponding amino acids from a second kinase C-terminus region from a NFR1 LysM receptor polypeptide, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (v) the modified extracellular domain was modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain.
An additional aspect of the disclosure includes an expression vector, isolated DNA molecule, or recombinant nucleic acid including a modified plant LysM receptor polypeptide including a modified JM zone 4 domain, a modified JM zone 3 domain, a modified JM zone 2 domain, a modified kinase N-terminus region, and/or a modified extracellular domain operably linked to at least one expression control sequence. In a further embodiment of this aspect, (i) the modified JM zone 4 was modified by substitution of one or more amino acids in a first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling, and wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (ii) the modified JM zone 2 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (iii) the modified JM zone 3 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iv) the modified kinase N-terminus region was modified by substitution of one or more amino acids in a first kinase N-terminus region with the corresponding amino acids from a second kinase N-terminus region from a CERK6 LysM receptor polypeptide, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (v) the modified extracellular domain was modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has an expression vector, isolated DNA molecular, or recombinant nucleic acid, the at least one expression control sequence includes a promoter selected from the group of a root specific promoter, a constitutive promoter, or a combination thereof. In an additional embodiment of this aspect, the promoter is a root specific promoter, and wherein the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In a further embodiment of this aspect, the promoter is constitutive promoter, and wherein the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter.
Some aspects of the disclosure include a bacterial cell or an Agrobacterium cell including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments.
Further aspects of the disclosure include a genetically modified plant, plant part, plant cell, or seed including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments.
Additional aspects of the disclosure include a composition or kit including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments, the bacterial cell or Agrobacterium cell of any one of the preceding embodiments, or the genetically modified plant, plant part, plant cell, or seed of any one of the preceding embodiments.
Yet further aspects of the disclosure include methods of initiating NFR1-mediated root nodule symbiosis signaling including: introducing a genetic alteration via the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments that has a modified kinase C-terminus region.
Still further aspects of the disclosure include methods of initiating ROS signaling including: introducing a genetic alteration via the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments that has a modified kinase N-terminus region.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has a method of initiating NFR1-mediated root nodule symbiosis signaling or ROS signaling, the plant is a plant cell.
An additional aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 4 corresponding to amino acids 303 to 325 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 4 corresponding to amino acids 305-327 of SEQ ID NO: 1, and optionally further aligning the JM zone 4 of the candidate receptor to SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (b) modifying the first JM zone 4 by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in a second JM zone 4; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 4 has been substituted with corresponding amino acid residues from the second JM zone 4.
A further aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 2 corresponding to amino acids 256 to 281 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 2 corresponding to amino acids 256-280 of SEQ ID NO: 1; (b) modifying the first JM zone 2 by substituting inserting, deleting, or substituting one or more amino acid residues in the first JM zone 2 with corresponding amino acid residues that are different in a second JM zone 2; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 2 has been substituted with corresponding amino acid residues from the second JM zone 2.
Yet another aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 3 corresponding to amino acids 282 to 302 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 3 corresponding to amino acids 281-304 of SEQ ID NO: 1; (b) modifying the first JM zone 3 by substituting inserting, deleting, or substituting one or more amino acid residues in the first JM zone 3 with corresponding amino acid residues that are different in a second JM zone 3; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 3 has been substituted with corresponding amino acid residues from the second JM zone 3.
Still another aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first kinase C-terminus region corresponding to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first kinase C-terminus region corresponding to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 of SEQ ID NO: 1; (b) modifying the first kinase C-terminus region by substituting one or more amino acid residues in the first kinase C-terminus region with corresponding amino acid residues that are different in a second kinase C-terminus region; and (c) generating the modified plant LysM receptor polypeptide wherein the first kinase C-terminus region has been substituted with corresponding amino acid residues from the second kinase C-terminus region.
An additional aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first kinase N-terminus region corresponding to amino acids D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first kinase N-terminus region corresponding to amino acids A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 of SEQ ID NO: 8; (b) modifying the first kinase N-terminus region by substituting one or more amino acid residues in the first kinase N-terminus region with corresponding amino acid residues that are different in a second kinase N-terminus region; and (c) generating the modified plant LysM receptor polypeptide wherein the first kinase N-terminus region has been substituted with corresponding amino acid residues from the second kinase N-terminus region.
Further embodiments of the preceding aspects, which may be combined with any of the preceding embodiments that has methods of generating a modified plant LysM receptor polypeptide, include the modified plant LysM receptor polypeptide produced by any of the methods of any one of the preceding embodiments, or a combination thereof.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The following description sets forth exemplary methods, parameters, and the like. It should be recognized, however, that such description is not intended as a limitation on the scope of the present disclosure but is instead provided as a description of exemplary embodiments.
An aspect of the disclosure includes a modified plant LysM receptor polypeptide including a first JM zone 4, wherein the first JM zone 4 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor. As used herein, the terms zone 4 and JM-B are interchangeable. In a further embodiment of this aspect, the first JM zone 4, the second JM zone 4, or both correspond to amino acids 305 to 327 when aligned to SEQ ID NO: 1 or correspond to amino acids 303 to 325 when aligned to SEQ ID NO: 8. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4, or the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In yet another embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the second JM zone 4 is able to initiate NFR1-mediated root nodule symbiosis signaling, ROS signaling, or different signaling than the first JM zone 4. In another embodiment of this aspect, the second JM zone 4 is able to initiate NFR1-mediated root nodule symbiosis signaling. In an additional embodiment of this aspect, the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60; and second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In a further embodiment of this aspect, the second JM zone 4 is able to initiate ROS signaling. In still another embodiment of this aspect, the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8.
A further aspect of the disclosure includes a modified plant non-NFR1 LysM receptor polypeptide engineered for NFR1-mediated root nodule symbiosis signaling including a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling. In an additional embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In yet another embodiment of this aspect, the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide further includes a first kinase C-terminus region, wherein the first kinase C-terminus region has been modified as compared to the amino acid sequence of the corresponding unmodified plant LysM receptor polypeptide by substitution of one or more amino acids in the first kinase C-terminus region with the corresponding amino acids from an NFR1 LysM receptor polypeptide.
An additional aspect of the disclosure includes a modified plant LysM receptor polypeptide with enhanced ROS signaling including a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling. In a further embodiment of this aspect, the first JM zone 4 is modified by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4, or the first JM zone 4 is modified by substituting three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in the second JM zone 4. In an additional embodiment of this aspect, substitution includes deletion of an amino acid not found in the second JM zone 4 and insertion of an amino acid found in the second JM zone 4 but not in the first JM zone 4. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the substituted amino acid residues are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein substitution optionally includes deletion of an amino acid not found in the second JM zone 2 and insertion of an amino acid found in the second JM zone 2 but not in the first JM zone 2. In an additional embodiment of this aspect, the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the first JM zone 2 and/or the second JM zone 2 includes SEQ ID NO: 11, SEQ ID NO: 58, or SEQ ID NO: 4.
In an additional embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein substitution optionally includes deletion of an amino acid not found in the second JM zone 3 and insertion of an amino acid found in the second JM zone 3 but not in the first JM zone 3. In an additional embodiment of this aspect, the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the first JM zone 3 and/or the second JM zone 3 includes SEQ ID NO: 12, SEQ ID NO: 59, or SEQ ID NO: 5.
Yet another aspect of the disclosure includes a modified plant non-NFR1 LysM receptor polypeptide engineered for NFR1-mediated root nodule symbiosis signaling including the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase C-terminus region and insertion of an amino acid found in the second kinase C-terminus region but not in the first kinase C-terminus region. In a further embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In a further embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Still another aspect of the disclosure includes a modified plant non-CERK6 LysM receptor polypeptide engineered for immune signaling including a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase N-terminus region and insertion of an amino acid found in the second kinase N-terminus region but not in the first kinase N-terminus region. In an additional embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In yet another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant non-CERK6 LysM receptor polypeptide is able to initiate ROS signaling.
In a further embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling, the modified plant LysM receptor polypeptide further includes the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase C-terminus region and insertion of an amino acid found in the second kinase C-terminus region but not in the first kinase C-terminus region. In an additional embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, 5544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In still another embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling, the modified plant LysM receptor polypeptide further includes a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase N-terminus region and insertion of an amino acid found in the second kinase N-terminus region but not in the first kinase N-terminus region. In a further embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In still another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified LysM receptor polypeptide is able to initiate ROS signaling.
In still another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant non-NFR1 LysM receptor polypeptide, the modified plant non-NFR1 LysM receptor polypeptide further includes the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase C-terminus region and insertion of an amino acid found in the second kinase C-terminus region but not in the first kinase C-terminus region. In a further embodiment of this aspect, the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, I497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase C-terminus region includes amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8. In yet another embodiment of this aspect, the first kinase C-terminus region is modified by substituting one or more amino acids of T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, or G621 of SEQ ID NO: 8 with one or more amino acids of 1469, N473, T475, Q477, 1497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, or V622 of SEQ ID NO: 1. In still another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified plant LysM receptor polypeptide with enhanced ROS signaling, the modified plant LysM receptor polypeptide further includes a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor. In a further embodiment of this aspect, which may be combined in any of the preceding embodiments, substitution includes deletion of an amino acid not found in the second kinase N-terminus region and insertion of an amino acid found in the second kinase N-terminus region but not in the first kinase N-terminus region. In an additional embodiment of this aspect, the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8. In a further embodiment of this aspect, which may be combined with any one of the preceding embodiments, the first kinase N-terminus region includes D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1. In yet another embodiment of this aspect, the first kinase N-terminus region is modified by substituting one or more amino acids of D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, or L456 of SEQ ID NO: 1 with one or more amino acids of A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, or Y454 of SEQ ID NO: 8. In still another embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified LysM receptor polypeptide is able to initiate ROS signaling.
In an additional embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide further includes a first extracellular domain, wherein the first extracellular domain is modified as compared to the amino acid sequence of the corresponding unmodified plant LysM receptor polypeptide. In a further embodiment of this aspect, the first extracellular domain is modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, and wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. The extracellular domain is critical for receptor specificity. For example, the extracellular domain of NFR1 is critical for recognition of rhizobial LCOs, while the extracellular domain of CERK6 is critical for recognition of pathogen COs. The modified LysM receptors of any of the preceding embodiments can be modified in their ectodomain as disclosed in U.S. patent application Ser. No. 17/324,354 (U.S. Pat. App. Pub. No. US-2021-03663200-A1), which is incorporated by reference for its disclosure of modifications to LysM receptor domains for engineering of recognition of LCOs. The modified LysM receptors of any of the preceding embodiments can also be modified in their ectodomain as disclosed in and U.S. patent application Ser. No. 17/267,240 (U.S. Pat. App. Pub. NO. US-2021-0233608-A1), which is incorporated by reference for its disclosure of modifications to LysM receptor domains for engineering of recognition of LCOs.
An additional aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 4 corresponding to amino acids 303 to 325 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 4 corresponding to amino acids 305-327 of SEQ ID NO: 1, and optionally further aligning the JM zone 4 of the candidate receptor to SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (b) modifying the first JM zone 4 by substituting at least two or at least three amino acid residues in the first JM zone 4 with corresponding amino acid residues that are different in a second JM zone 4; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 4 has been substituted with corresponding amino acid residues from the second JM zone 4.
A further aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 2 corresponding to amino acids 256 to 281 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 2 corresponding to amino acids 256-280 of SEQ ID NO: 1; (b) modifying the first JM zone 2 by substituting inserting, deleting, or substituting one or more amino acid residues in the first JM zone 2 with corresponding amino acid residues that are different in a second JM zone 2; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 2 has been substituted with corresponding amino acid residues from the second JM zone 2.
Yet another aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first JM zone 3 corresponding to amino acids 282 to 302 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first JM zone 3 corresponding to amino acids 281-304 of SEQ ID NO: 1; (b) modifying the first JM zone 3 by substituting inserting, deleting, or substituting one or more amino acid residues in the first JM zone 3 with corresponding amino acid residues that are different in a second JM zone 3; and (c) generating the modified plant LysM receptor polypeptide wherein the first JM zone 3 has been substituted with corresponding amino acid residues from the second JM zone 3.
Still another aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first kinase C-terminus region corresponding to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 of SEQ ID NO: 8, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first kinase C-terminus region corresponding to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 of SEQ ID NO: 1; (b) modifying the first kinase C-terminus region by substituting one or more amino acid residues in the first kinase C-terminus region with corresponding amino acid residues that are different in a second kinase C-terminus region; and (c) generating the modified plant LysM receptor polypeptide wherein the first kinase C-terminus region has been substituted with corresponding amino acid residues from the second kinase C-terminus region.
An additional aspect of the disclosure includes methods of generating a modified plant LysM receptor polypeptide, including: (a) aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 1 to identify a first kinase N-terminus region corresponding to amino acids D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 of SEQ ID NO: 1, or aligning an amino acid sequence of a candidate receptor to SEQ ID NO: 8 to identify a first kinase N-terminus region corresponding to amino acids A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 of SEQ ID NO: 8; (b) modifying the first kinase N-terminus region by substituting one or more amino acid residues in the first kinase N-terminus region with corresponding amino acid residues that are different in a second kinase N-terminus region; and (c) generating the modified plant LysM receptor polypeptide wherein the first kinase N-terminus region has been substituted with corresponding amino acid residues from the second kinase N-terminus region.
Further embodiments of the preceding aspects, which may be combined with any of the preceding embodiments that has methods of generating a modified plant LysM receptor polypeptide, include the modified plant LysM receptor polypeptide produced by any of the methods of any one of the preceding embodiments, or a combination thereof.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate ROS signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant non-NFR1 LysM receptor polypeptide. In a further embodiment of this aspect, the modified plant non-NFR1 LysM receptor polypeptide includes a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) the first kinase C-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase C-terminus region with the corresponding amino acids from a C-terminus region of a NFR1 LysM receptor polypeptide or wherein plant non-NFR1 LysM receptor polypeptide lacks a first C-terminus region and the plant non-NFR1 LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second C-terminus region into the corresponding site when aligning the plant non-NFR1 LysM receptor and the plant NFR1 LysM receptor, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, I501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant non-NFR1 LysM receptor polypeptide is able to initiate NFR1-mediated root nodule symbiosis signaling.
Some aspects of the disclosure include a genetically modified plant or part thereof including the modified plant LysM receptor polypeptide of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide with enhanced ROS signaling. In a further embodiment of this aspect, the modified plant LysM receptor polypeptide includes a first JM zone 4, wherein the first JM zone 4 was modified by substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling, and wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; and optionally further includes: (i) a first JM zone 2, wherein the first JM zone 2 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 2 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 2 with the corresponding amino acids from the second JM zone 2 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (ii) a first JM zone 3, wherein the first JM zone 3 has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 3 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids from the second JM zone 3 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iii) a first kinase N-terminus region, wherein the first kinase N-terminus region has been modified by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the first kinase N-terminus region with the corresponding amino acids from a second N-terminus region of a CERK6 LysM receptor polypeptide or wherein the non-CERK6 plant LysM receptor polypeptide lacks the first N-terminus region and the non-CERK6 plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, ten percent or more of the amino acids, twenty percent or more of the amino acids, thirty percent or more of the or more amino acids, forty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, fifty percent or more of the or more amino acids, sixty percent or more of the or more amino acids, seventy percent or more of the or more amino acids, eighty percent or more of the amino acids, ninety percent or more of the amino acids, or all amino acids of the second N-terminus region into the corresponding site when aligning the non-CERK6 plant LysM receptor and the CERK6 LysM receptor, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, 5416, 5417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, 5336, E345, R346, A347, M354, K358, A362, R369, 5382, 1383, F393, E395, 5400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (iv) a first extracellular domain modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In an additional embodiment of this aspect, which may be combined with any one of the preceding embodiments, the modified plant LysM receptor polypeptide is able to initiate ROS signaling.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has a genetically modified plant or part thereof, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof. In yet another embodiment of the preceding aspects, which may be combined with any of the preceding embodiments, the plant is selected from the group of cassava (e.g., manioc, yucca, Manihot esculenta), yam (e.g., Dioscorea rotundata, Dioscorea alata, Dioscorea trifida, Dioscorea sp.), sweet potato (e.g., Ipomoea batatas), taro (e.g., Colocasia esculenta), oca (e.g., Oxalis tuberosa), corn (e.g., maize, Zea mays), rice (e.g., indica rice, japonica rice, aromatic rice, glutinous rice, Oryza sativa, Oryza glaberrima), wild rice (e.g., Zizania spp., Porteresia spp.), barley (e.g., Hordeum vulgare), sorghum (e.g., Sorghum bicolor), millet (e.g., finger millet, fonio millet, foxtail millet, pearl millet, barnyard millets, Eleusine coracana, Panicum sumatrense, Panicum milaceum, Setaria italica, Pennisetum glaucum, Digitaria spp., Echinocloa spp.), teff (e.g., Eragrostis tej), oat (e.g., Avena sativa), triticale (e.g., X Triticosecale Wittmack, Triticosecale schlanstedtense Wittm., Triticosecale neoblaringhemii A. Camus, Triticosecale neoblaringhemii A. Camus), rye (e.g., Secale cereale, Secale cereanum), wheat (e.g., common wheat, spelt, durum, einkorn, emmer, kamut, Triticum aestivum, Triticum spelta, Triticum durum, Triticum urartu, Triticum monococcum, Triticum turanicum, Triticum spp.), Trema spp. (e.g., Trema cannabina, Trema cubense, Trema discolor, Trema domingensis, Trema integerrima, Trema lamarckiana, Trema micrantha, Trema orientalis, Trema philippinensis, Trema strigilosa, Trema tomentosa, Trema levigata), apple (e.g., Malus domestica, Malus pumila, Pyrus malus), pear (e.g., Pyrus communis, Pyrus×bretschneideri, Pyrus pyrifolia, Pyrus sinkiangensis, Pyrus pashia, Pyrus spp.), plum (e.g., Mirabelle, greengage, damson, Prunus domestica, Prunus salicina, Prunus mume), apricot (e.g., Prunus armeniaca, Prunus brigantine, Prunus mandshurica), peach (e.g., Prunus persica), almond (e.g., Prunus dulcis, Prunus amygdalus), walnut (e.g., Persian walnut, English walnut, black walnut, Juglans regia, Juglans nigra, Juglans cinerea, Juglans californica), strawberry (e.g., Fragaria×ananassa, Fragaria chiloensis, Fragaria virginiana, Fragaria vesca), raspberry (e.g., European red raspberry, black raspberry, Rubus idaeus L., Rubus occidentalis, Rubus strigosus), blackberry (e.g., evergreen blackberry, Himalayan blackberry, Rubus fruticosus, Rubus ursinus, Rubus laciniatus, Rubus argutus, Rubus armeniacus, Rubus plicatus, Rubus ulmifolius, Rubus allegheniensis, Rubus subgenus Eubatus sect. Moriferi & Ursini), red currant (e.g., white currant, Ribes rubrum), black currant (e.g., cassis, Ribes nigrum), gooseberry (e.g., Ribes uva-crispa, Ribes grossulari, Ribes hirtellum), melon (e.g., watermelon, winter melon, casabas, cantaloupe, honeydew, muskmelon, Citrullus lanatus, Benincasa hispida, Cucumis melo, Cucumis melo cantalupensis, Cucumis melo inodorus, Cucumis melo reticulatus), cucumber (e.g., slicing cucumbers, pickling cucumbers, English cucumber, Cucumis sativus), pumpkin (e.g., Cucurbita pepo, Cucurbita maxima), squash (e.g., gourd, Cucurbita argyrosperma, Cucurbita ficifolia, Cucurbita maxima, Cucurbita moschata), grape (e.g., Vitis vinifera, Vitis amurensis, Vitis labrusca, Vitis mustangensis, Vitis riparia, Vitis rotundifolia), bean (e.g., Phaseolus vulgaris, Phaseolus lunatus, Vigna angularis, Vigna radiate, Vigna mungo, Phaseolus coccineus, Vigna umbellate, Vigna acontifolia, Phaseolus acutifolius, Vicia faba, Vicia faba equine, Phaseolus spp., Vigna spp.), soybean (e.g., soy, soya bean, Glycine max, Glycine soja), pea (e.g., Pisum spp., Pisum sativum var. sativum, Pisum sativum var. arvense), pea (e.g., Pisum spp., Pisum sativum var. sativum, Pisum sativum var. arvense), chickpea (e.g., garbanzo, Bengal gram, Cicer arietinum), cowpea (e.g., Vigna unguiculata), pigeon pea (e.g., Arhar/Toor, cajan pea, Congo bean, gandules, Caganus cajan), lentil (e.g., Lens culinaris), Bambara groundnut (e.g., earth pea, Vigna subterranea), lupin (e.g., Lupinus spp.), pulses (e.g., minor pulses, Lablab purpureaus, Canavalia ensiformis, Canavalia gladiate, Psophocarpus tetragonolobus, Mucuna pruriens var. utilis, Pachyrhizus erosus), Medicago spp. (e.g., Medicago sativa, Medicago truncatula, Medicago arborea), Lotus spp. (e.g., Lotus japonicus), forage legumes (e.g., Leucaena spp., Albizia spp., Cyamopsis spp., Sesbania spp., Stylosanthes spp., Trifolium spp., Vicia spp.), indigo (e.g., Indigofera spp., Indigofera tinctoria, Indigofera suffruticosa, Indigofera articulata, Indigofera oblongifolia, Indigofera aspalthoides, Indigofera suffruticosa, Indigofera arrecta), legume trees (e.g., locust trees, Gleditsia spp., Robinia spp., Kentucky coffeetree, Gymnocladus dioicus, Acacia spp., Laburnum spp., Wisteria spp.), or hemp (e.g., cannabis, Cannabis sativa).
Further aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling or that has a modified plant non-NFR1 LysM receptor polypeptide, including introducing a genetic alteration to the plant including a first nucleic acid sequence encoding the modified plant LysM receptor polypeptide or the modified plant non-NFR1 LysM receptor polypeptide. In an additional embodiment of this aspect, the nucleic acid sequence is operably linked to a promoter, wherein the promoter is a root specific promoter, an inducible promoter, a constitutive promoter, or a combination thereof. In a further embodiment of this aspect, the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In yet another embodiment of this aspect, the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter. In yet another embodiment of this aspect, the nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence is operably linked to an endogenous promoter, and wherein the endogenous promoter is a root specific promoter.
Additional aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to initiate NFR1-mediated root nodule symbiosis signaling or that has a modified plant non-NFR1 LysM receptor polypeptide, including genetically modifying the plant or part thereof by transforming the plant or part thereof with one or more gene editing components that target an endogenous nuclear genome sequence encoding an endogenous plant LysM receptor polypeptide or plant non-NFR1 LysM receptor polypeptide to genetically modify a first JM zone 4 by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and optionally further including: (i) genetically modifying a first JM zone 2 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide; (ii) genetically modifying a first JM zone 3 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide; (iii) genetically modifying a first kinase C-terminus region by substitution of one or more amino acids in the first kinase C-terminus region with the corresponding amino acids from a second kinase C-terminus region from a NFR1 LysM receptor polypeptide; and/or (iv) genetically modifying a first extracellular domain by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In a further embodiment of this aspect, the one or more gene editing components include a ribonucleoprotein complex that targets the nuclear genome sequence; a vector including a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector including a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (OND), wherein the OND targets the nuclear genome sequence; or a vector CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence.
Further aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling or a modified plant LysM receptor polypeptide with enhanced ROS signaling, including introducing a genetic alteration to the plant including a first nucleic acid sequence encoding the modified plant LysM receptor polypeptide or the modified plant LysM receptor polypeptide with enhanced ROS signaling. In an additional embodiment of this aspect, the nucleic acid sequence is operably linked to a promoter, wherein the promoter is a root specific promoter, an inducible promoter, a constitutive promoter, or a combination thereof. In a further embodiment of this aspect, the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In yet another embodiment of this aspect, the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter. In yet another embodiment of this aspect, the nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence is operably linked to an endogenous promoter, and wherein the endogenous promoter is a root specific promoter.
Additional aspects of the present disclosure relate to methods of producing the genetically modified plant or part thereof of any one of the preceding embodiments that has a modified plant LysM receptor polypeptide including a first JM zone 4 modified by insertion, deletion, or substitution of one or more amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 able to ROS signaling or a modified plant LysM receptor polypeptide with enhanced ROS signaling, including genetically modifying the plant or part thereof by transforming the plant or part thereof with one or more gene editing components that target an endogenous nuclear genome sequence encoding an endogenous plant LysM receptor polypeptide to genetically modify a first JM zone 4 by insertion, deletion, or substitution of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from a second JM zone 4 of a second plant LysM receptor polypeptide or wherein the plant LysM receptor polypeptide lacks a first JM zone 4 and the plant LysM receptor polypeptide has been modified by insertion of one or more amino acids, two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, seven or more amino acids, nine or more amino acids, eleven or more amino acids, fifteen or more amino acids, twenty or more amino acids, or all amino acids in the first JM zone 4 with the corresponding amino acids from the second JM zone 4 into the corresponding site when aligning the plant LysM receptor and the second plant LysM receptor, and optionally to further including: (i) genetically modifying a first JM zone 2 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide; (ii) genetically modifying a first JM zone 3 by insertion, deletion, or substitution of one or more amino acids in the first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide; (iii) genetically modifying a first kinase N-terminus region by substitution of one or more amino acids in the first kinase N-terminus region with the corresponding amino acids from a second kinase N-terminus region from a NFR1 LysM receptor polypeptide; and/or (iv) genetically modifying a first extracellular domain by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain. In a further embodiment of this aspect, the one or more gene editing components include a ribonucleoprotein complex that targets the nuclear genome sequence; a vector including a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector including a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (OND), wherein the OND targets the nuclear genome sequence; or a vector CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence.A control as described herein can be a control sample or a reference sample from a wild-type, an azygous, or a null-segregant plant, species, or sample or from populations thereof. A reference value can be used in place of a control or reference sample, which was previously obtained from a wild-type, azygous, or null-segregant plant, species, or sample or from populations thereof or a group of a wild-type, azygous, or null-segregant plant, species, or sample. A control sample or a reference sample can also be a sample with a known amount of a detectable composition or a spiked sample.
A further aspect of the disclosure includes an expression vector, isolated DNA molecule, or recombinant nucleic acid including a modified plant LysM receptor polypeptide including a modified JM zone 4 domain, a modified JM zone 3 domain, a modified JM zone 2 domain, a modified kinase C-terminus region, and/or a modified extracellular domain operably linked to at least one expression control sequence. In an additional embodiment of this aspect, (i) the modified JM zone 4 was modified by substitution of one or more amino acids in a first JM zone 4 with the corresponding amino acids from a second JM zone 4 from an NFR1 LysM receptor polypeptide with NFR1-mediated root nodule symbiosis signaling, wherein the first JM zone 4 includes SEQ ID NO: 13, is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or conservative substitutions thereof, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, or includes SEQ ID NO: 60, and/or wherein the second JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (ii) the modified JM zone 2 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (iii) the modified JM zone 3 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iv) the modified kinase C-terminus region was modified by substitution of one or more amino acids in a first kinase C-terminus region with the corresponding amino acids from a second kinase C-terminus region from a NFR1 LysM receptor polypeptide, wherein the first kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, and/or wherein the second kinase C-terminus region corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1; and/or (v) the modified extracellular domain was modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain.
An additional aspect of the disclosure includes an expression vector, isolated DNA molecule, or recombinant nucleic acid including a modified plant LysM receptor polypeptide including a modified JM zone 4 domain, a modified JM zone 3 domain, a modified JM zone 2 domain, a modified kinase N-terminus region, and/or a modified extracellular domain operably linked to at least one expression control sequence. In a further embodiment of this aspect, (i) the modified JM zone 4 was modified by substitution of one or more amino acids in a first JM zone 4 with the corresponding amino acids from a second JM zone 4 from a LysM receptor polypeptide with ROS signaling, and wherein the first JM zone 4 includes SEQ ID NO: 6, or is selected from the group of SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, and SEQ ID NO: 52, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions, and/or wherein the second JM zone 4 includes SEQ ID NO: 13, or is selected from the group of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, and SEQ ID NO: 40, or optionally any of the preceding amino acid sequences with one, two, or three amino acid substitutions, insertions and/or deletions; (ii) the modified JM zone 2 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 2 with the corresponding amino acids from a second JM zone 2 of a second plant LysM receptor polypeptide, and wherein the first JM zone 2, the second JM zone 2, or both correspond to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or correspond to amino acids 256 to 281 when aligned to SEQ ID NO: 8; (iii) the modified JM zone 3 was modified by insertion, deletion, or substitution of one or more amino acids in a first JM zone 3 with the corresponding amino acids from a second JM zone 3 of a second plant LysM receptor polypeptide, and wherein the first JM zone 3, the second JM zone 3, or both correspond to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or correspond to amino acids 282 to 302 when aligned to SEQ ID NO: 8; (iv) the modified kinase N-terminus region was modified by substitution of one or more amino acids in a first kinase N-terminus region with the corresponding amino acids from a second kinase N-terminus region from a CERK6 LysM receptor polypeptide, wherein the first kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, and/or wherein the second kinase N-terminus region corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8; and/or (v) the modified extracellular domain was modified by substituting one or more amino acids of the first extracellular domain with one or more amino acids of a second extracellular domain, wherein the second extracellular domain has a different affinity, selectivity, and/or specificity for oligosaccharides than the first extracellular domain.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has an expression vector, isolated DNA molecular, or recombinant nucleic acid, the at least one expression control sequence includes a promoter selected from the group of a root specific promoter, a constitutive promoter, or a combination thereof. In an additional embodiment of this aspect, the promoter is a root specific promoter, and wherein the promoter is selected from the group of a NFR1 promoter, a NFR5 promoter, a LYK3 promoter, a CERK6 promoter, a NFP promoter, a Lotus japonicus NFR5 promoter (SEQ ID NO: 109), a Lotus japonicus NFR1 promoter (SEQ ID NO: 151), a Lotus japonicus CERK6 promoter (SEQ ID NO: 111), a Medicago truncatula NFP promoter (SEQ ID NO: 110), a Medicago truncatula LYK3 promoter (SEQ ID NO: 112), a maize metallothioneine promoter, a chitinase promoter, a maize ZRP2 promoter, a tomato LeExt1 promoter, a glutamine synthetase soybean root promoter, a RCC3 promoter, a rice antiquitin promoter, a LRR receptor kinase promoter, or an Arabidopsis pCO2 promoter. In a further embodiment of this aspect, the promoter is constitutive promoter, and wherein the promoter is selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a maize ubiquitin promoter, a polyubiquitin promoter, a vein mosaic cassava virus promoter, or an Arabidopsis UBQ10 promoter.
Some aspects of the disclosure include a bacterial cell or an Agrobacterium cell including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments.
Further aspects of the disclosure include a genetically modified plant, plant part, plant cell, or seed including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments.
Additional aspects of the disclosure include a composition or kit including the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments, the bacterial cell or Agrobacterium cell of any one of the preceding embodiments, or the genetically modified plant, plant part, plant cell, or seed of any one of the preceding embodiments.
Yet further aspects of the disclosure include methods of initiating NFR1-mediated root nodule symbiosis signaling including: introducing a genetic alteration via the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments that has a modified kinase C-terminus region.
Still further aspects of the disclosure include methods of initiating ROS signaling including: introducing a genetic alteration via the expression vector, isolated DNA molecule, or recombinant nucleic acid of any one of the preceding embodiments that has a modified kinase N-terminus region.
In a further embodiment of the preceding aspects, which may be combined with any of the preceding embodiments that has a method of initiating NFR1-mediated root nodule symbiosis signaling or ROS signaling, the plant is a plant cell.
LysM receptors may be defined as proteins that contain three tandem LysM domains in their extracellular region, namely LysM1, LysM2, and LysM3, which are present in this order on the protein sequence and separated by CxC motifs. The LysM1 domain is located toward the N-terminal end of the protein sequence, and is preceded by an N-terminal signal peptide. Moving into the cell, the LysM domains are followed by a single-pass transmembrane (TM) domain, a juxtamembrane (JM) domain, and an intracellular kinase or pseudokinase domain. Most plant LysM receptors, including NFR1, contain an intracellular kinase domain. Examples of LysM-RLK receptors can be NFP, NFR5, LYK10, LYR3, LYK4, LYK3, NFR1, lykX (LysM-RLK), CERK1, and other receptors known to one of skill in the art.
As shown in
The intracellular kinase domain (KD) includes multiple zones as well.
Lohmann et al., 2010 presents additional characteristics of NFR1-type receptors (Lohmann GV, Shimoda Y, Nielsen MW, Jergensen FG, Grossmann C, Sandal N, Serensen K, Thirup S, Madsen LH, Tabata S, Sato S, Stougaardi J, Radutoiu S. Evolution and regulation of the Lotus japonicus LysM receptor gene family. Mol Plant Microbe Interact. 2010; 23(4):510-21). In short, NFR1-type receptors are translated from multi-exon genes. Further, NFR1-type receptors contain all the subdomains typical for an intracellular kinase. NFR1-type receptors may also be referred to as LYK3-type receptors, as Lotus japonicus NFR1 and Medicago truncatula LYK3 both belong to this receptor type. An alignment of NFR1-type LysM receptors is shown in
The present disclosure provides the crystal structures of Lotus japonicus CERK6/303-599 D460N and Medicago truncatula LYK3/302-597 D459N with AMP-PNP (phosphoaminophosphonic acid-adenylate ester) in large table form, which further illustrate the embodiments. These structures are hereby incorporated by reference, and are described in Table I: CERK6 (for Lotus japonicus CERK6/303-599 D460N) and Table II: LYK3 (for Medicago truncatula LYK3/302-597 D459N with AMP-PNP), respectively. Further provided in the Examples of the present disclosure is exploration of the crystal structures containing these residues, along with descriptions of these residues' interactions with the kinase N-lobe.
Plant breeding begins with the analysis of the current germplasm, the definition of problems and weaknesses of the current germplasm, the establishment of program goals, and the definition of specific breeding objectives. The next step is the selection of germplasm that possess the traits to meet the program goals. The selected germplasm is crossed in order to recombine the desired traits and through selection, varieties or parent lines are developed. The goal is to combine in a single variety or hybrid an improved combination of desirable traits from the parental germplasm. These important traits may include higher yield, field performance, improved fruit and agronomic quality, resistance to biological stresses, such as diseases and pests, and tolerance to environmental stresses, such as drought and heat.
Each breeding program should include a periodic, objective evaluation of the efficiency of the breeding procedure. Evaluation criteria vary depending on the goal and objectives, but should include gain from selection per year based on comparisons to an appropriate standard, overall value of the advanced breeding lines, and number of successful cultivars produced per unit of input (e.g., per year, per dollar expended, etc.). Promising advanced breeding lines are thoroughly tested and compared to appropriate standards in environments representative of the commercial target area(s) for three years at least. The best lines are candidates for new commercial cultivars; those still deficient in a few traits are used as parents to produce new populations for further selection. These processes, which lead to the final step of marketing and distribution, usually take five to ten years from the time the first cross or selection is made.
The choice of breeding or selection methods depends on the mode of plant reproduction, the heritability of the trait(s) being improved, and the type of cultivar used commercially (e.g., F1 hybrid cultivar, inbred cultivar, etc.). For highly heritable traits, a choice of superior individual plants evaluated at a single location will be effective, whereas for traits with low heritability, selection should be based on mean values obtained from replicated evaluations of families of related plants. The complexity of inheritance also influences the choice of the breeding method. Backcross breeding is used to transfer one or a few genes for a highly heritable trait into a desirable cultivar (e.g., for breeding disease-resistant cultivars), while recurrent selection techniques are used for quantitatively inherited traits controlled by numerous genes, various recurrent selection techniques are used. Commonly used selection methods include pedigree selection, modified pedigree selection, mass selection, and recurrent selection.
Pedigree selection is generally used for the improvement of self-pollinating crops or inbred lines of cross-pollinating crops. Two parents which possess favorable, complementary traits are crossed to produce an F1. An F2 population is produced by selfing one or several F1s or by intercrossing two F1s (sib mating). Selection of the best individuals is usually begun in the F2 population; then, beginning in the F3, the best individuals in the best families are selected. Replicated testing of families, or hybrid combinations involving individuals of these families, often follows in the F4 generation to improve the effectiveness of selection for traits with low heritability. At an advanced stage of inbreeding (i.e., F6 and F7), the best lines or mixtures of phenotypically similar lines are tested for potential release as new cultivars.
Mass and recurrent selections can be used to improve populations of either self- or cross-pollinating crops. A genetically variable population of heterozygous individuals is either identified or created by intercrossing several different parents. The best plants are selected based on individual superiority, outstanding progeny, or excellent combining ability. The selected plants are intercrossed to produce a new population in which further cycles of selection are continued.
Backcross breeding (i.e., recurrent selection) may be used to transfer genes for a simply inherited, highly heritable trait into a desirable homozygous cultivar or line that is the recurrent parent. The source of the trait to be transferred is called the donor parent. The resulting plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the donor parent. After the initial cross, individuals possessing the phenotype of the donor parent are selected and repeatedly crossed (backcrossed) to the recurrent parent. The resulting plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the donor parent.
The single-seed descent procedure in the strict sense refers to planting a segregating population, harvesting a sample of one seed per plant, and using the one-seed sample to plant the next generation. When the population has been advanced from the F2 to the desired level of inbreeding, the plants from which lines are derived will each trace to different F2 individuals. The number of plants in a population declines each generation due to failure of some seeds to germinate or some plants to produce at least one seed. As a result, not all of the F2 plants originally sampled in the population will be represented by a progeny when generation advance is completed.
In addition to phenotypic observations, the genotype of a plant can also be examined. There are many laboratory-based techniques available for the analysis, comparison and characterization of plant genotype; among these are Isozyme Electrophoresis, Restriction Fragment Length Polymorphisms (RFLPs), Randomly Amplified Polymorphic DNAs (RAPDs), Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), Sequence Characterized Amplified Regions (SCARs), Amplified Fragment Length Polymorphisms (AFLPs), Simple Sequence Repeats (SSRs, which are also referred to as Microsatellites), Fluorescently Tagged Inter-simple Sequence Repeats (ISSRs), Single Nucleotide Polymorphisms (SNPs), Genotyping by Sequencing (GbS), and Next-generation Sequencing (NGS).
Molecular markers, or “markers”, can also be used during the breeding process for the selection of qualitative traits. For example, markers closely linked to alleles or markers containing sequences within the actual alleles of interest can be used to select plants that contain the alleles of interest. The use of markers in the selection process is often called genetic marker enhanced selection or marker-assisted selection. Methods of performing marker analysis are generally known to those of skill in the art.
Mutation breeding may also be used to introduce new traits into plant varieties. Mutations that occur spontaneously or are artificially induced can be useful sources of variability for a plant breeder. The goal of artificial mutagenesis is to increase the rate of mutation for a desired characteristic. Mutation rates can be increased by many different means including temperature, long-term seed storage, tissue culture conditions, radiation (such as X-rays, Gamma rays, neutrons, Beta radiation, or ultraviolet radiation), chemical mutagens (such as base analogs like 5-bromo-uracil), antibiotics, alkylating agents (such as sulfur mustards, nitrogen mustards, epoxides, ethyleneamines, sulfates, sulfonates, sulfones, or lactones), azide, hydroxylamine, nitrous acid or acridines. Once a desired trait is observed through mutagenesis the trait may then be incorporated into existing germplasm by traditional breeding techniques. Details of mutation breeding can be found in Principles of Cultivar Development: Theory and Technique, Walter Fehr (1991), Agronomy Books, 1 (https://lib.dr.iastate.edu/agron_books/1).
The production of double haploids can also be used for the development of homozygous lines in a breeding program. Double haploids are produced by the doubling of a set of chromosomes from a heterozygous plant to produce a completely homozygous individual. For example, see Wan, et al., Theor. Appl. Genet., 77:889-892, 1989.
Additional non-limiting examples of breeding methods that may be used include, without limitation, those found in Principles of Plant Breeding, John Wiley and Son, pp. 115-161 (1960); Principles of Cultivar Development: Theory and Technique, Walter Fehr (1991), Agronomy Books, 1 (https://lib.dr.iastate.edu/agron_books/1), which are herewith incorporated by reference.
One aspect of the present disclosure provides transgenic plant cells, plant parts, or plants including a modified juxtamembrane (JM) zone 4, and optionally further including a modified JM zone 2, a modified JM zone 3, a modified extracellular domain, and/or a modified kinase C-terminus region or a modified kinase N-terminus region. The modified LysM receptors of the present disclosure are either able to able to initiate NFR1-mediated root nodule symbiosis signaling or able to initiate ROS signaling. In addition, the present disclosure provides isolated DNA molecules of vectors and gene editing components used to produce transgenic plants of the present disclosure.
The JM zone 4 corresponds to amino acids 305 to 327 when aligned to SEQ ID NO: 1 or corresponds to amino acids 303 to 325 when aligned to SEQ ID NO: 8. In an additional embodiment of this aspect, the first JM zone 4 is selected from the group of SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 60, SEQ ID NO: 6, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, or SEQ ID NO: 52. In another embodiment of this aspect, the first JM zone 4 includes substituted amino acid residues that are selected from amino acid residues corresponding to M306, A308, and K320 of SEQ ID NO: 1 or from amino acid residues corresponding to T304, D306, and T318 of SEQ ID NO: 8. In a further embodiment of this aspect, the residues of JM zone 4 may be modified (e.g., by insertion, deletion, alteration, etc.) in such a way that the modified residues do not correspond to the residues of a different JM zone 4 from a different LysM receptor.
The JM zone 2 corresponds to amino acids 256 to 280 when aligned to SEQ ID NO: 1 or corresponds to amino acids 256 to 281 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the JM zone 2 includes SEQ ID NO: 11, SEQ ID NO: 58, or SEQ ID NO: 4. In a further embodiment of this aspect, the residues of JM zone 2 may be modified (e.g., by insertion, deletion, alteration, etc.) in such a way that the modified residues do not correspond to the residues of a different JM zone 2 from a different LysM receptor.
The JM zone 3 corresponds to amino acids 281 to 304 when aligned to SEQ ID NO: 1 or corresponds to amino acids 282 to 302 when aligned to SEQ ID NO: 8. In yet another embodiment of this aspect, the JM zone 3 includes SEQ ID NO: 12, SEQ ID NO: 59, or SEQ ID NO: 5. In a further embodiment of this aspect, the residues of JM zone 3 may be modified (e.g., by insertion, deletion, alteration, etc.) in such a way that the modified residues do not correspond to the residues of a different JM zone 3 from a different LysM receptor.
The modified extracellular domain (ectodomain) can be modified as disclosed in U.S. patent application Ser. No. 17/324,354 (U.S. Pat. App. Pub. No. US-2021-03663200-A1) or in U.S. patent application Ser. No. 17/267,240 (U.S. Pat. App. Pub. NO. US-2021-0233608-A1).
The kinase C-terminus region corresponds to amino acids T467, S471, S473, P475, G477, V496, V500, Y510, D517, 1519, V520, S523, S525, 1526, T527, D528, G538, V539, S541, Q542, P543, T546, E547, D548, Q555, D559, V563, R567, M569, A573, K574, Q578, Q582, 1590, T600, D602, W604, V606, G607, F609, N612, N614, V616, M619, and G621 when aligned to SEQ ID NO: 8, or corresponds to amino acids 1469, N473, T475, Q477, 1497, 1501, F511, N518, V520, L521, G524, L526, V527, A528, E529, E539, A540, N542, K543, S544, C547, D548, A549, P556, E560, E564, L568, 1570, G574, R575, R579, L583, L591, L601, E603, C605, D607, E608, S610, S613, T615, 1617, L620, and V622 when aligned to SEQ ID NO: 1.
The kinase N-terminus region corresponds to D328, A338, K347, K348, T349, V356, T360, C364, H371, C384, V385, H395, D397, G402, Y404, H406, K410, E411, S416, S417, A425, V445, and L456 when aligned to SEQ ID NO: 1, or corresponds to A326, S336, E345, R346, A347, M354, K358, A362, R369, S382, 1383, F393, E395, S400, H402, R404, R408, D409, A414, T415, S423, 1443, and Y454 when aligned to SEQ ID NO: 8.
Transformation and generation of genetically altered monocotyledonous and dicotyledonous plant cells is well known in the art. See, e.g., Weising, et al., Ann. Rev. Genet. 22:421-477 (1988); U.S. Pat. No. 5,679,558; Agrobacterium Protocols, ed: Gartland, Humana Press Inc. (1995); Wang, et al. Acta Hort. 461:401-408 (1998), and Broothaerts, et al. Nature 433:629-633 (2005). The choice of method varies with the type of plant to be transformed, the particular application, and/or the desired result. The appropriate transformation technique is readily chosen by the skilled practitioner.
Any methodology known in the art to delete, insert or otherwise modify the cellular DNA (e.g., genomic DNA and organelle DNA) can be used in practicing the compositions, methods, and processes disclosed herein. As an example, the CRISPR/Cas-9 system and related systems (e.g., TALEN, ZFN, ODN, etc.) may be used to insert a heterologous gene to a targeted site in the genomic DNA or substantially edit an endogenous gene to express the heterologous gene or to modify the promoter to increase or otherwise alter expression of an endogenous gene through, for example, removal of repressor binding sites or introduction of enhancer binding sites. For example, a disarmed Ti plasmid, containing a genetic construct for deletion or insertion of a target gene, in Agrobacterium tumefaciens can be used to transform a plant cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using procedures described in the art, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and published European Patent application (“EP”) 0242246. Ti-plasmid vectors each contain the gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example in EP 0233247), pollen mediated transformation (as described, for example in EP 0270356, PCT publication WO 85/01856, and U.S. Pat. No. 4,684,611), plant RNA virus-mediated transformation (as described, for example in EP 0 067 553 and U.S. Pat. No. 4,407,956), liposome-mediated transformation (as described, for example in U.S. Pat. No. 4,536,475), and other methods such as the methods for transforming certain lines of corn (e.g., U.S. Pat. No. 6,140,553; Fromm et al., Bio/Technology (1990) 8, 833-839); Gordon-Kamm et al., The Plant Cell, (1990) 2, 603-618), rice (Shimamoto et al., Nature, (1989) 338, 274-276; Datta et al., Bio/Technology, (1990) 8, 736-740), and the method for transforming monocots generally (PCT publication WO 92/09696). For cotton transformation, the method described in PCT patent publication WO 00/71733 can be used. For soybean transformation, reference is made to methods known in the art, e.g., Hinchee et al. (Bio/Technology, (1988) 6, 915) and Christou et al. (Trends Biotech, (1990) 8, 145) or the method of WO 00/42207.
Genetically altered plants of the present disclosure can be used in a conventional plant breeding scheme to produce more genetically altered plants with the same characteristics, or to introduce the genetic alteration(s) in other varieties of the same or related plant species. Seeds, which are obtained from the altered plants, preferably contain the genetic alteration(s) as a stable insert in chromosomal DNA or as modifications to an endogenous gene or promoter. Plants including the genetic alteration(s) in accordance with this disclosure include plants including, or derived from, root stocks of plants including the genetic alteration(s) of this disclosure, e.g., fruit trees or ornamental plants. Hence, any non-transgenic grafted plant parts inserted on a transformed plant or plant part are included in this disclosure.
Genetic alterations of the disclosure, including in an expression vector or expression cassette, which result in the expression of an introduced gene or altered expression of an endogenous gene will typically utilize a plant-expressible promoter. A ‘plant-expressible promoter’ as used herein refers to a promoter that ensures expression of the genetic alteration(s) of this disclosure in a plant cell. Examples of constitutive promoters that are often used in plant cells are the cauliflower mosaic (CaMV) 35S promoter (Kay et al. Science, 236, 4805, 1987), the minimal CaMV 35S promoter (Benfey & Chua, Science, (1990) 250, 959-966), various other derivatives of the CaMV 35S promoter, the figwort mosaic virus (FMV) promoter (Richins, et al., Nucleic Acids Res. (1987) 15:8451-8466)the maize ubiquitin promoter (Christensen & Quail, Transgenic Res, 5, 213-8, 1996), the polyubiquitin promoter (Ljubql, Maekawa et al. Mol Plant Microbe Interact. 21, 375-82, 2008), the vein mosaic cassava virus promoter (International Application WO 97/48819), and the Arabidopsis UBQ10 promoter, Norris et al. Plant Mol. Biol. 21, 895-906, 1993).
Additional examples of promoters directing constitutive expression in plants are known in the art and include: the strong constitutive 35S promoters (the “35S promoters”) of the cauliflower mosaic virus (CaMV), e.g., of isolates CM 1841 (Gardner et al., Nucleic Acids Res, (1981) 9, 2871-2887), CabbB S (Franck et al., Cell (1980) 21, 285-294) and CabbB JI (Hull and Howell, Virology, (1987) 86, 482-493); promoters from the ubiquitin family (e.g., the maize ubiquitin promoter of Christensen et al., Plant Mol Biol, (1992) 18, 675-689), the gos2 promoter (de Pater et al., The Plant J (1992) 2, 834-844), the emu promoter (Last et al., Theor Appl Genet, (1990) 81, 581-588), actin promoters such as the promoter described by An et al. (The Plant J, (1996) 10, 107), the rice actin promoter described by Zhang et al. (The Plant Cell, (1991) 3, 1155-1165); promoters of the figwort mosaic virus (FMV) (Richins, et al., Nucleic Acids Res. (1987) 15:8451-8466), promoters of the Cassava vein mosaic virus (WO 97/48819; Verdaguer et al., Plant Mol Biol, (1998) 37, 1055-1067), the pPLEX series of promoters from Subterranean Clover Stunt Virus (WO 96/06932, particularly the S4 or S7 promoter), an alcohol dehydrogenase promoter, e.g., pAdhlS (GenBank accession numbers X04049, X00581), and the TR1′ promoter and the TR2′ promoter (the “TR1′ promoter” and “TR2′ promoter”, respectively) which drive the expression of the 1′ and 2′ genes, respectively, of the T DNA (Velten et al., EMBO J, (1984) 3, 2723-2730).
Alternatively, a plant-expressible promoter can be a tissue-specific promoter, i.e., a promoter directing a higher level of expression in some cells or tissues of the plant, e.g., in root epidermal cells or root cortex cells. In preferred embodiments, LysM receptor promoters will be used. Non-limiting examples include NFR1 promoters, NFR5 promoters, LYK3 promoters, NFP promoters, the Lotus japonicus NFR5 promoter (SEQ ID NO: 109), the Lotus japonicus NFR1 promoter (SEQ ID NO: 151), the Medicago truncatula NFP promoter (SEQ ID NO: 110), the Lotus japonicus CERK6 promoter (SEQ ID NO: 111), and the Medicago truncatula LYK3 promoter (SEQ ID NO: 112). In additional preferred embodiments, root specific promoters will be used. Non-limiting examples include the promoter of the maize metallothioneine (De Framond et al, FEBS 290, 103.-106, 1991 Application EP 452269), the chitinase promoter (Samac et al. Plant Physiol 93, 907-914, 1990), the glutamine synthetase soybean root promoter (Hirel et al. Plant Mol. Biol. 20, 207-218, 1992), the RCC3 promoter (PCT Application WO 2009/016104), the rice antiquitin promoter (PCT Application WO 2007/076115), the LRR receptor kinase promoter (PCT application WO 02/46439), the maize ZRP2 promoter (U.S. Pat. No. 5,633,363), the tomato LeExt1 promoter (Bucher et al. Plant Physiol. 128, 911-923, 2002), and the Arabidopsis pCO2 promoter (Heidstra et al, Genes Dev. 18, 1964-1969, 2004). These plant promoters can be combined with enhancer elements, they can be combined with minimal promoter elements, or can comprise repeated elements to ensure the expression profile desired.
Examples of constitutive promoters that are often used in plant cells are the cauliflower mosaic (CaMV) 35S promoter (Kay et al. Science, 236, 4805, 1987), and various derivatives of the promoter, virus promoter vein mosaic cassava (International Application WO 97/48819), the maize ubiquitin promoter (Christensen & Quail, Transgenic Res, 5, 213-8, 1996), polyubiquitin (Ljubql, Maekawa et al. Mol Plant Microbe Interact. 21, 375-82, 2008) and Arabidopsis UBQ10 (Norris et al. Plant Mol. Biol. 21, 895-906, 1993).
In some embodiments, further genetic alterations to increase expression in plant cells can be utilized. For example, an intron at the 5′ end or 3′ end of an introduced gene, or in the coding sequence of the introduced gene, e.g., the hsp70 intron. Other such genetic elements can include, but are not limited to, promoter enhancer elements, duplicated or triplicated promoter regions, 5′ leader sequences different from another transgene or different from an endogenous (plant host) gene leader sequence, 3′ trailer sequences different from another transgene used in the same plant or different from an endogenous (plant host) trailer sequence.
An introduced gene of the present disclosure can be inserted in host cell DNA so that the inserted gene part is upstream (i.e., 5′) of suitable 3′ end transcription regulation signals (i.e., transcript formation and polyadenylation signals). This is preferably accomplished by inserting the gene in the plant cell genome (nuclear or chloroplast). Preferred polyadenylation and transcript formation signals include those of the nopaline synthase gene (Depicker et al., J. Molec Appl Gen, (1982) 1, 561-573), the octopine synthase gene (Gielen et al., EMBO J, (1984) 3:835-845), the SCSV or the Malic enzyme terminators (Schunmann et al., Plant Funct Biol, (2003) 30:453-460), and the T DNA gene 7 (Velten and Schell, Nucleic Acids Res, (1985) 13, 6981-6998), which act as 3′ untranslated DNA sequences in transformed plant cells. In some embodiments, one or more of the introduced genes are stably integrated into the nuclear genome. Stable integration is present when the nucleic acid sequence remains integrated into the nuclear genome and continues to be expressed (i.e., detectable mRNA transcript or protein is produced) throughout subsequent plant generations. Stable integration into the nuclear genome can be accomplished by any known method in the art (e.g., microparticle bombardment, Agrobacterium-mediated transformation, CRISPR/Cas9, electroporation of protoplasts, microinjection, etc.).
The term recombinant or modified nucleic acids refers to polynucleotides which are made by the combination of two otherwise separated segments of sequence accomplished by the artificial manipulation of isolated segments of polynucleotides by genetic engineering techniques or by chemical synthesis. In so doing one may join together polynucleotide segments of desired functions to generate a desired combination of functions.
As used herein, the term “overexpression” refers to increased expression (e.g., of mRNA, polypeptides, etc.) relative to expression in a wild type organism (e.g., plant) as a result of genetic modification and can refer to expression of heterologous genes at a sufficient level to achieve the desired result such as increased yield. In some embodiments, the increase in expression is a slight increase of about 10% more than expression in wild type. In some embodiments, the increase in expression is an increase of 50% or more (e.g., 60%, 70%, 80%, 100%, etc.) relative to expression in wild type. In some embodiments, an endogenous gene is upregulated. In some embodiments, an exogenous gene is upregulated by virtue of being expressed. Upregulation of a gene in plants can be achieved through any known method in the art, including but not limited to, the use of constitutive promoters with inducible response elements added, inducible promoters, high expression promoters (e.g., PsaD promoter) with inducible response elements added, enhancers, transcriptional and/or translational regulatory sequences, codon optimization, modified transcription factors, and/or mutant or modified genes that control expression of the gene to be upregulated in response to a stimulus such as cytokinin signaling.
Where a recombinant nucleic acid is intended for expression, cloning, or replication of a particular sequence, DNA constructs prepared for introduction into a host cell will typically include a replication system (e.g., vector) recognized by the host, including the intended DNA fragment encoding a desired polypeptide, and can also include transcription and translational initiation regulatory sequences operably linked to the polypeptide-encoding segment. Additionally, such constructs can include cellular localization signals (e.g., plasma membrane localization signals). In preferred embodiments, such DNA constructs are introduced into a host cell's genomic DNA, chloroplast DNA or mitochondrial DNA.
In some embodiments, a non-integrated expression system can be used to induce expression of one or more introduced genes. Expression systems (expression vectors) can include, for example, an origin of replication or autonomously replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary processing information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Signal peptides can also be included where appropriate from secreted polypeptides of the same or related species, which allow the protein to cross and/or lodge in cell membranes, cell wall, or be secreted from the cell.
Selectable markers useful in practicing the methodologies disclosed herein can be positive selectable markers. Typically, positive selection refers to the case in which a genetically altered cell can survive in the presence of a toxic substance only if the recombinant polynucleotide of interest is present within the cell. Negative selectable markers and screenable markers are also well known in the art and are contemplated by the present disclosure. One of skill in the art will recognize that any relevant markers available can be utilized in practicing the compositions, methods, and processes disclosed herein.
Screening and molecular analysis of recombinant strains of the present disclosure can be performed utilizing nucleic acid hybridization techniques. Hybridization procedures are useful for identifying polynucleotides, such as those modified using the techniques described herein, with sufficient homology to the subject regulatory sequences to be useful as taught herein. The particular hybridization techniques are not essential to this disclosure. As improvements are made in hybridization techniques, they can be readily applied by one of skill in the art. Hybridization probes can be labeled with any appropriate label known to those of skill in the art. Hybridization conditions and washing conditions, for example temperature and salt concentration, can be altered to change the stringency of the detection threshold. See, e.g., Sambrook et al. (1989) vide infra or Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, NY, N.Y., for further guidance on hybridization conditions.
Additionally, screening and molecular analysis of genetically altered strains, as well as creation of desired isolated nucleic acids can be performed using Polymerase Chain Reaction (PCR). PCR is a repetitive, enzymatic, primed synthesis of a nucleic acid sequence. This procedure is well known and commonly used by those skilled in this art (see Mullis, U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki et al. (1985) Science 230:1350-1354). PCR is based on the enzymatic amplification of a DNA fragment of interest that is flanked by two oligonucleotide primers that hybridize to opposite strands of the target sequence. The primers are oriented with the 3′ ends pointing towards each other. Repeated cycles of heat denaturation of the template, annealing of the primers to their complementary sequences, and extension of the annealed primers with a DNA polymerase result in the amplification of the segment defined by the 5′ ends of the PCR primers. Because the extension product of each primer can serve as a template for the other primer, each cycle essentially doubles the amount of DNA template produced in the previous cycle. This results in the exponential accumulation of the specific target fragment, up to several million-fold in a few hours. By using a thermostable DNA polymerase such as the Taq polymerase, which is isolated from the thermophilic bacterium Thermus aquaticus, the amplification process can be completely automated. Other enzymes which can be used are known to those skilled in the art.
Nucleic acids and proteins of the present disclosure can also encompass homologues of the specifically disclosed sequences. Homology (e.g., sequence identity) can be 50%-100%. In some instances, such homology is greater than 80%, greater than 85%, greater than 90%, or greater than 95%. The degree of homology or identity needed for any intended use of the sequence(s) is readily identified by one of skill in the art. As used herein percent sequence identity of two nucleic acids is determined using an algorithm known in the art, such as that disclosed by Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the BLASTN, BLASTP, and BLASTX, programs of Altschul et al. (1990) J. Mol. Biol. 215:402-410. BLAST nucleotide searches are performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences with the desired percent sequence identity. To obtain gapped alignments for comparison purposes, Gapped BLAST is used as described in Altschul et al. (1997) Nucl. Acids. Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (BLASTN and BLASTX) are used. See www.ncbi.nih.gov. One of skill in the art can readily determine in a sequence of interest where a position corresponding to amino acid or nucleic acid in a reference sequence occurs by aligning the sequence of interest with the reference sequence using the suitable BLAST program with the default settings (e.g., for BLASTP: Gap opening penalty: 11, Gap extension penalty: 1, Expectation value: 10, Word size: 3, Max scores: 25, Max alignments: 15, and Matrix: blosum62; and for BLASTN: Gap opening penalty: 5, Gap extension penalty:2, Nucleic match: 1, Nucleic mismatch−3, Expectation value: 10, Word size: 11, Max scores: 25, and Max alignments: 15).
Preferred host cells are plant cells. Recombinant host cells, in the present context, are those which have been genetically modified to contain an isolated nucleic molecule, contain one or more deleted or otherwise non-functional genes normally present and functional in the host cell, or contain one or more genes to produce at least one recombinant protein. The nucleic acid(s) encoding the protein(s) of the present disclosure can be introduced by any means known to the art which is appropriate for the particular type of cell, including without limitation, transformation, lipofection, electroporation or any other methodology known by those skilled in the art.
“Isolated”, “isolated DNA molecule” or an equivalent term or phrase is intended to mean that the DNA molecule or other moiety is one that is present alone or in combination with other compositions, but altered from or not within its natural environment. For example, nucleic acid elements such as a coding sequence, intron sequence, untranslated leader sequence, promoter sequence, transcriptional termination sequence, and the like, that are naturally found within the DNA of the genome of an organism are not considered to be “isolated” so long as the element is within the genome of the organism and at the location within the genome in which it is naturally found. However, each of these elements, and subparts of these elements, would be “isolated” from its natural setting within the scope of this disclosure so long as the element is not within the genome of the organism in which it is naturally found, the element is altered from its natural form, or the element is not at the location within the genome in which it is naturally found. Similarly, a nucleotide sequence encoding a protein or any naturally occurring variant of that protein would be an isolated nucleotide sequence so long as the nucleotide sequence was not within the DNA of the organism from which the sequence encoding the protein is naturally found in its natural location or if that nucleotide sequence was altered from its natural form. A synthetic nucleotide sequence encoding the amino acid sequence of the naturally occurring protein would be considered to be isolated for the purposes of this disclosure. For the purposes of this disclosure, any transgenic nucleotide sequence, i.e., the nucleotide sequence of the DNA inserted into the genome of the cells of a plant, alga, fungus, or bacterium, or present in an extrachromosomal vector, would be considered to be an isolated nucleotide sequence whether it is present within the plasmid or similar structure used to transform the cells, within the genome of the plant or bacterium, or present in detectable amounts in tissues, progeny, biological samples or commodity products derived from the plant or bacterium.
Having generally described the compositions, methods, and processes of this disclosure, the same will be better understood by reference to certain specific examples, which are included herein to further illustrate the disclosure and are not intended to limit the scope of the invention as defined by the claims.
The present disclosure is described in further detail in the following examples which are not in any way intended to limit the scope of the disclosure as claimed. The attached figures are meant to be considered as integral parts of the specification and description of the disclosure. The following examples are offered to illustrate, but not to limit the claimed disclosure. Examples 1-7 describe the identification of zone 4 of the NFR1 juxtamembrane domain (NFR1 zone 4=SEQ ID NO: 6) as a determinant of root nodule symbiosis and target for engineering in non-legume receptors. Example 8 describes the crystal structures of CERK6 and LYK3 proteins. Example 9 describes biochemical analyses of native, mutant, and chimeric NFR1-CERK6 proteins. Examples 10-12 describe the identification of the N terminal region of CERK6 as being critical for immunity. Example 13 describes investigating the role of zone 4 in activating symbiotic signaling. Example 14 describes experiments investigating the role of the CERK6 kinase domains in root hair infection thread formation. Example 15 describes experiments investigating ROS production after treatment with Nod factors in lines expressing NFR1 variants containing CERK6 kinase. Example 16 describes experiments investigating whether NFR1 variants containing CERK6 kinase form inhibitory dimers with the native CERK6. Example 17 describes experiments investigating the NFR1 zone 4 biochemical properties that are required for symbiotic signaling. Example 18 describes experiments investigating the effects on nodulation of mutations in the NFR1 activation loop (AL). Example 19 describes experiments investigating the role of CERK6 juxtamembrane in immune signaling. Example 20 describes experiments investigating the structure of the CERK6 activation loop in the absence of the ligand.
Example 1 describes that zone 4 of the NFR1 protein from Lotus japonicus (LjNFR1) juxtamembrane domain (JM; NFR1 zone 4=SEQ ID NO: 6) is essential for nodule organogenesis.
A level 2 pIV10 expression vector (Márquez, A. J., and J. Stougaard, eds. 2005. Lotus Japonicus Handbook. Dordrecht: Springer) containing an expression cassette of nuclear localized tripleYFP driven by the constitutive promoter pUbi (pUbi::tYFP-NLS) serving as transformation marker was used for Hairy Root transformation. The chimeric constructs or chimeric variants were positioned downstream of pNFR1. Golden gate cloning (Weber, Ernst, Carola Engler, Ramona Gruetzner, Stefan Werner, and Sylvestre Marillonnet. 2011. “A Modular Cloning System for Standardized Assembly of Multigene Constructs” edited by J. Peccoud. PLoS ONE 6(2):e16765. doi: 0.1371/journal.pone.0016765.) was used to create the different receptor versions. The modules utilized to assemble the receptor chimeras were the following: LjCERK6 and LjNFR1 (genomic sequences containing introns): Ectodomain (EC), Transmembrane/Juxtamembrane domain (TM/JM), TM/JM zones 1-4 (TM/JM zone 1 is part of the TM (NFR1 zone 1=SEQ ID NO: 3; CERK6 zone 1=SEQ ID NO: 10), while zones 2-4 are part of the JM (NFR1 zone 2=SEQ ID NO: 4; NFR1 zone 3=SEQ ID NO: 5; NFR1 zone 4=SEQ ID NO: 6; CERK6 zone 2=SEQ ID NO: 11; CERK6 zone 3=SEQ ID NO: 12; CERK6 zone 4=SEQ ID NO: 13), and Kinase domain.
E. coli TOP10 (ThermoFisher Scientific) were used for molecular cloning and grown in LB medium at 37° C. Agrobacterium rhizogenes strain AR1193 was used for hairy root transformations. Agrobacterium strains were grown in LB medium at 28° C. M. loti strain R7A dsRED, used for nodulation assay, was grown in YMB at 28 degrees C.
The Lotus japonicus ecotype Gifu wild type and mutant line nfr1-1 pNin-gus were used for nodulation assays. Plants were grown at 21° C. (16/8-hour light/dark conditions).
For scarification, Lotus seeds were either immersed in sulfuric acid for 15 minutes or treated with sandpaper. Seeds were then surface sterilized for 3 minutes with 3% sodium hypochlorite, washed 3 times in ddH2O and dispersed on wet filter paper for germination. Three-day-old seedlings were transferred to square agar plates solidified with 0.8% Gelrite (Duchefa Biochemi) supplemented with 12 Gamborg's B5 nutrient solution (Duchefa Biochemi). A. rhizogenes AR1193 strains carrying the construct of interest were grown for three days on LB Agar containing Ampicillin, Rifampicin, and Spectinomycin (final concentration of each antibiotic was 100 μg/ml). For each construct the cells grown on one plate were resuspended in 4 mL YMB (5 g/L mannitol, 0.5 g/L yeast extract, 0.5 g/L K2HPO4, 0.2 g/L MgSO4·7H2O, 0.1 g NaCl, pH=6.8). The bacterial suspension was then used to transform 6-day-old seedlings using a 1 mL syringe with a needle (Sterican® Ø0.40×20 mm), punching the hypocotyl and placing a droplet on the root emerging from the wound. Square plates containing the transformed seedlings were sealed and left in the dark for one day and then moved to 21° C. under 16/8-hour light/dark conditions. After three weeks, primary roots were removed and seedlings with transformed roots were transferred to Magenta™ vessels (Sigma-Aldrich) filled with lightweight expanded clay aggregate (Leca®, 2-4 mm; Saint-Gobain Weber A/S) supplemented with 80 mL nitrogen-free 1/4×B&D nutrient solution.
If the roots were inoculated, nodules were counted five weeks after the transfer to Magenta™ vessels (Sigma-Aldrich) and inoculation with Rhizobia (equivalent to nine weeks after hairy root transformations). All nodulation assays in roots transformed using hairy root transformation occurred in Magenta™ vessels (Sigma-Aldrich). Pictures were acquired with a Leica M165FC Fluorescent Stereo Microscope equipped with the Leica DFC310 FX digital color camera.
A Kruskal-Wallis test (Kruskal et al. (1952) J Am Stat Assoc 47, 583-621) followed by Dunn's post-hoc test (Dunn (1964) Technometrics 6, 241-252) was used for statistical analysis of nodule counts.
Plant hairy roots were incubated in the GUS buffer consisting of: NaPO4 (pH 7.0) 100 mM, EDTA (pH 8.0) 10 mM, K Ferricyanide 1 mM, K Ferrocyanide 1 mM, Triton X-100 0.1%, X-Gluc (5-bromo-4-chloro-3-indolyl-beta-D-glucuronic acid, cyclohexylammonium salt) 0.5 mg/ml at 37° C. overnight in the dark. The roots were washed with 70% EtOH and kept at 4° C. for several days after inspection.
To identify whether NFR1 and CERK6 are expressed in the same root cells, data on single cell root transcriptomics (provided in: Frank et al. (2023) Single-cell analysis identifies genes facilitating rhizobium infection in Lotus japonicus. Nat Commun 14, 7171) were analyzed.
Previous studies investigating which regions of the NFR1 and CERK6 receptors contributed to immune or symbiotic signaling identified that the region containing transmembrane domain (TM) and juxtamembrane domain (JM) was crucial for root nodulation (see
These chimeric constructs each contained the ectodomain of NFR1 and the kinase of CERK6 coupled to different NFR1-CERK6 chimeric versions of TM/JM (see
In order to determine if the chimeric receptors had the capacity to activate symbiotic signaling and root nodulation, Lotus japonicus nfr1-1 mutant plants expressing a pNIN:GUS symbiotic marker were used as background for hairy root transformation, and nodule formation was assessed at 35 dpi with M. loti R7A dsRed (
This full-dissection study, in which the TM/JM region was dissected in 4 zones and the role of each part was studied, identified that only constructs containing the LjNFR1 zone 4 were able to activate pNIN:GUS expression and induce nodule formation (see
Detailed microscopic investigation revealed that M. loti infection of nodules and NIN promoter expression had been induced (pNin:GUS expression) in all roots expressing the complementing constructs (
It was hypothesized that these results could be explained if i) zone 4 of CERK6 has an inhibitory role for root nodule symbiosis (RNS), or ii) CERK6 zone 4 and kinase, together, represent a signaling module that is not favorable for symbiosis signaling.
To investigate whether the CERK6 JM zone 4 alone had an inhibitory role in root nodule symbiosis, a NFR1 receptor variant chimera was constructed as above, but in which zone 4 was replaced by the corresponding region of CERK6 and tested for the ability to initiate nodule organogenesis in nfr1-1 pNIN:GUS mutant line. All tested plants formed functional and infected nodules similar to the control full-length NFR1 construct (
The following example describes experiments demonstrating that chimeric receptors containing LjCERK6 zone 4 and kinase localized at the plasma membrane of tobacco leaf cells, indicating that the combination of CERK6 zone 4 and kinase impaired the receptor's functionality but not its expression and/or localization.
A pICH binary vector backbone (Weber et al. 2011) was used for N. benthamiana localization assays, containing a plasma membrane localized AtPIP2A-mCherry driven by pUbi (pUbi::PMmCherry) serving as a localization control. The chimeric constructs were cloned downstream of the β35s and were fused with a module encoding eYFP and terminated by T35s. The modules utilized to assembly the receptor chimeras were the following: LjCERK6 and LjNFR1 (genomic sequences containing introns): Ectodomain (EC), Transmembrane/Juxtamembrane domain (TM/JM), TM/JM zones 1-4 (TM/JM zone 1 is part of the TM (NFR1 zone 1=SEQ ID NO: 3; CERK6 zone 1=SEQ ID NO: 10), while zones 2-4 are part of the JM (NFR1 zone 2=SEQ ID NO: 4; NFR1 zone 3=SEQ ID NO: 5; NFR1 zone 4=SEQ ID NO: 6; CERK6 zone 2=SEQ ID NO: 11; CERK6 zone 3=SEQ ID NO: 12; CERK6 zone 4=SEQ ID NO: 13), and Kinase domain.
E. coli TOP10 (ThermoFisher Scientific) cells were used for molecular cloning and grown in LB medium at 37° C. Agrobacterium tumefaciens line AGL1 was used for tobacco infiltration and was grown in histidine-enriched LB or/and LA at 28 degrees C.
Plant materials and Nicotiana benthamiana infiltration [0180] 3 week-old Nicotiana benthamiana plants were used for infiltration. A. tumefaciens AGL1 cells carrying the respective construct was cultured in LB medium overnight at 28 degrees C. at 180 rpm. The inoculi were centrifuged and the pellet resuspended in MQ sterile water to an OD600=0.6-0.8. The bacteria suspension was used to infiltrate 3-4 week-old N. benthamiana plants with a 1 mL syringe
At 2 days post infiltration, tobacco leaf discs were depicted utilizing Zeiss LSM780 confocal microscope with the following excitation/emission parameters: a) YFP: 514/520-560 nm, b) mCherry 561/570-600 nm and c) bright field. The photos were edited using Fiji (ImageJ).
The lack of complementation of the nfr1 mutant by chimeric receptors containing the CERK6 zone 4 and CERK6 kinase (see Example 1,
In order to investigate if nonfunctional receptor chimeras properly folded and localized at the plasma membrane, a heterologous overexpression system was used to co-express mCherry-tagged AtPIP2A, which localizes to plasma membranes, with YFP-tagged chimeric receptors in tobacco (Nicotiana benthamiana) leaves.
NFR1 and CERK6 receptor proteins are normally found in low abundance in L. japonicus roots, and tagged receptors expressed from native NFR1 and CERK6 promoters are not detectable by fluorescence-based microscopy. Because overexpression of LjCERK6 kinase has been shown to induce cell death (Bozsoki et al. (2020) Science 369(6504), 663-670), a kinase-dead mutant of CERK6 kinase (K35IN) was used in the chimeras in this assay in place of the wild-type CERK6 kinase domain described in Example 1. Co-expression and subsequent co-localization microscopy of YFP-tagged chimeras with mCherry-tagged AtPIP2A experiments were performed to determine if the YFP-tagged chimeras co-localized with mCherry-tagged AtPIP2A at the plasma membrane.
Five YFP-tagged receptors were expressed separately in the co-expression system described above: The full-length native LjNFR1 tagged with YFP (top row in
The following example describes in-depth phylogenomic and functional analyses demonstrating that residues in LjNFR1 JM zone 4 were critical for symbiotic signaling. In order to dissect if the entire zone 4 of NFR1 is important, or if individual amino acids are sufficient or critical for root nodule symbiosis, alignments of NFR1 and CERK6 homologs were analyzed, and chimeric constructs containing amino acid swaps were constructed and analyzed.
A level 2 pIV10 expression vector 9 (Marquez, A. J., and J. Stougaard, eds. 2005. Lotus Japonicus Handbook. Dordrecht: Springer) was used for Hairy Root transformation, containing an expression cassette of nuclear localized tripleYFP driven by the constitutive promoter pUbi (pUbi::tYFP-NLS) serving as transformation marker. The chimeric constructs or chimeric variants were positioned downstream of pNFR1. Golden gate cloning took place utilizing T4 ligase and BpiI or BsaI (ThermoFisherScientific) depending on the level of the cloning modules (Weber et al. 2011). The modules utilized to assembly the receptor chimeras were the following: LjCERK6 and LjNFR1 (genomic sequences containing introns): Ectodomain (EC), Transmembrane/Juxtamembrane domain (TM/JM), TM/JM zones 1-4 (TM/JM zone 1 is part of the TM (NFR1 zone 1=SEQ ID NO: 3; CERK6 zone 1=SEQ ID NO: 10), while zones 2-4 are part of the JM (NFR1 zone 2=SEQ ID NO: 4; NFR1 zone 3=SEQ ID NO: 5; NFR1 zone 4=SEQ ID NO: 6; CERK6 zone 2=SEQ ID NO: 11; CERK6 zone 3=SEQ ID NO: 12; CERK6 zone 4=SEQ ID NO: 13)), Kinase domain and mutagenized zone 4.
Plant hairy roots were incubated in the GUS buffer consisting of: NaPO4 (pH 7.0) 100 mM, EDTA (pH 8.0) 10 mM, K Ferricyanide 1 mM, K Ferrocyanide 1 mM, Triton X-100 0.1%, X-Gluc (5-bromo-4-chloro-3-indolyl-beta-D-glucuronic acid, cyclohexylammonium salt) 0.5 mg/ml at 37 degree C. overnight in the dark. The roots were washed with 70% EtOH and kept at 4 degrees C. for several days after inspection.
Alignment and phylogenetic trees were created with CLC Main Workbench, Qiagen. Phylogenetic trees: Tree construction method: Neighbor joining, Protein distance measure: Jukes-Cantor Bootstrap analysis: 100 replicates.
Bacteria and culture conditions, plant materials, N. benthamiana infiltration, and other methods were conducted as described in Example 1 unless stated otherwise.
Alignments of NFR1 and CERK6 homologs in 17 nodulating and 2 non-nodulating species were analyzed to identify conserved residues (
These differences in amino acids have consequences on the polarity and charge of this particular region (
Additional studies further dissected the role of individual amino acids of NFR1 zone 4 in symbiosis. A construct containing the EC and TM/JM of NFR1 coupled to the kinase of CERK6 (
Nodule formation measurements indicated that nfr1-1 pNin:GUS plants expressing the chimeric constructs carrying the M311V, Q316D, K320T, or N323D amino acid swaps were able to complement the nfr1-1 mutant similarly to the NFR1 full-length control (as indicated by the lack of a significant reduction in the number of total nodules in these amino acid swap constructs compared to the NFR1 full-length control), whereas constructs carrying the M306T or A308D amino acid swaps were unable to complement the nfr1-1 mutant (as indicated by a significant reduction in the number of total nodules compared to the NFR1 full-length control) (
A further investigation following GUS staining of the roots for visualization of NIN promoter expression revealed that roots expressing the A308D or K320T amino acid swaps induced a larger number of nodule primordia when compared to the rest of the analyzed constructs, as statistical analysis showed significant difference in the number of nodule primordia (see lowercase letter c in
The following example describes that residue motifs in LJNFR1 JM zone 4 are critical for symbiotic signaling. Example 3 describes that individual residues in NFR1 JM zone 4 have an impact on organogenesis and nodule infection (see, e.g.,
Cloning of LjNFR1 zone 4 chimeras, bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays, and other methods were conducted as described in Example 1 unless stated otherwise.
The construct with the EC and TM/JM of NFR1 coupled with the kinase of CERK6 (Chimera A) was used as a basis for the construction of the receptor variants. For each variant, three-residue combinations were changed from the residues found in NFR1 to those found in CERK6 (see
Chimera A containing the substitutions Q316D, K320T and N323D in the αB-helix of zone 4 had very low efficiency in formation of nodules compared with the full-length NFR1, with only 4 out of 38 characterized plants that contained the substitutions Q316D, K320T and N323D found able to produce nodules (at a maximum of about 5 nodules per plant) (
Next, the role of individual amino acids of NFR1 zone 4 in symbiosis was dissected. Chimera A (containing the EC and TM/JM from NFR1 and the K domain from CERK6) was used this time to individually swap the six residues from NFR1 to CERK6 (see schematic representation in
The following example describes gain-of-function experiments indicating that particular residues in LJNFR1 JM zone 4 are sufficient to induce symbiotic signaling. Example 3 showed that substitution of the JM M306 or A308 residues from the residues found in NFR1 to those found in CERK6 impairs organogenesis and root hair infection (see
Cloning of LjNFR1 zone 4 chimeras, bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays and other methods were conducted as described in Example 1 unless stated otherwise.
Receptor variants were designed as shown in
None of the individual substitutions tested (T340M, D360A, V309M, T318K, and D321N) was sufficient to restore root nodule symbiosis in the nfr1-1 mutant background (
The following example describes experiments demonstrating that particular residue motifs in LJNFR1 JM zone 4 were sufficient to induce functional nodule organogenesis. In order to determine the minimal JM zone 4 motif that would allow NFR1 to engage both organogenesis and root infection, the functionality of additional receptor variants was tested.
Cloning of LjNFR1 zone 4 chimeras, bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays and other methods were conducted as described in Example 3 unless stated otherwise.
To determine the minimal zone 4 motif that allows NFR1 to engage both in root hair infection and nodule organogenesis, the functionality of receptor variants in which combinations of CERK6 zone 4 residues were substituted by the corresponding NFR1 were tested. The chimera containing the EX and TM/JM zones 1-3 from NFR1 and JM zone 4 and the K domain from CERK6 (hereinafter the “CC”) chimera was used as a basis to construct the receptor variants (see schematic representation in
Chimera carrying both the T304M and D306A substitutions had the ability to induce nodule organogenesis and nodule infection in 9 out of 23 tested plants, whereas chimera carrying the T304M, D306A, and T318K substitutions showed an improved complementation capacity, with 15 out of 23 tested plants nodulating (
The following example describes experiments demonstrating that a nodulation signature enables intracellular RLK4 to induce root nodule symbiosis signaling in Lotus japonicus roots. Materials and Methods Cloning of LjNFR1 zone 4 chimeras
A level 2 pIV10 expression vector (Mirquez, A. J., and J. Stougaard, eds. 2005. Lotus Japonicus Handbook. Dordrecht: Springer) was used for Hairy Root transformation, containing an expression cassette of nuclear localized tripleYFP driven by the constitutive promoter pUbi (pUbi::tYFP-NLS) serving as transformation marker. The chimeric constructs or chimeric variants were positioned downstream of pNFR1. Golden gate cloning (Weber et al. (2011) PLoS ONE 6(2):e16765) was used for creating the different receptor versions. The modules utilized to assembly the receptor chimeras were the following: HvRLK4 and LjNFR1 (coding sequence (CDS) for RLK4, and genomic sequences containing introns for NFR1): Ectodomain (EC), Transmembrane/Juxtamembrane domain (TM/JM), TM/JM zones 1-4 (TM/JM zone 1 is part of the TM (NFR1 zone 1=SEQ ID NO: 3; HvRLK4 zone 1=SEQ ID NO: 57), while zones 2-4 are part of the JM (NFR1 zone 2=SEQ ID NO: 4; NFR1 zone 3=SEQ ID NO: 5; NFR1 zone 4=SEQ ID NO: 6; HvRLK4 zone 2=SEQ ID NO: 58; HvRLK4 zone 3=SEQ ID NO: 59; HvRLK4 zone 4=SEQ ID NO: 60)), and Kinase domain.
Alignments were created with CLC Main Workbench, Qiagen. P Protein distance measure: Jukes-Cantor.
Lotus japonicus Gifu and cerk6 mutant plants were transformed with Agrobacterium rhizogenes containing an empty vector (“IYFP_EV”) or chimeric CERK6-NFR1 or RLK4-NFR1 receptor constructs. Plants with hairy roots were grown in magenta containers containing Leca substrate and B&D medium with KNO3 3 mM for 21 days. Only the transformed hairy roots were kept (detected by fluorescence under UV light since the transgenic roots have tripleYFP incorporated into their genome) and cut into 1-2 cm pieces. Equal amounts of the cut roots were collected into white 96-well flat-bottomed polystyrene plates (Greiner Bio-One) with the goal of having one hairy root in one well. They were kept for overnight incubation in sterile MQ water. The next day, for the ROS measurement, the water in every well was replaced by reaction Master Mix consisting of: 0.5 mM L-023 (Wako Chemicals), 5 μg/mL horseradish peroxidase (Sigma), and 0.1 mg/ml chitin. Luminescence in the 30-40 min assays was recorded with a Varioskan Flash Multimode Reader (Thermo Scientific) as Relative Luminescence Units (RLU). In one repetition, at least three wells were measured for each treatment for every genotype.
Bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays and other methods were conducted as described in Example 1 unless stated otherwise.
HvRLK4, an RLK LysM receptor polypeptide from barley (Hordeum vulgare) was chosen as an engineering target because its kinase domain shares 70% and 78% identity with LJNFR1 and LjCERK6, respectively (
In the variants tested in this Example, domains from NFR1 and RLK4 were combined into chimeric receptors. Receptor variants were designed as shown in
As with the chimeric receptors containing the kinase domain of CERK6 discussed in the Examples above, chimeric receptors containing either the RLK4 KD or TM/JM in combination with the remaining regions of NFR1 were able to restore nodulation in the nfr1-1 background (
Alignment of the NFR1, CERK6 and RLK4 protein sequences (see
The alignment in
In order to test the functional impact of combining different regions from the JM of RLK4 with zone 4 of NFR1, four additional NFR1-RLK4 chimeric receptor constructs were created (
Together, these results indicated that zone 4 of NFR1 maintained its symbiotic determinacy when embedded in the intracellular region of RLK4, but that its full properties were dependent and enhanced by the remaining regions of the NFR1 TM/JM.
The following example describes the crystal structures of the kinase domain of CERK6 and LYK3 proteins. Structures for LjCERK6/303-599 D460N and MtLYK3/302-597 D459N with AMP-PNP (phosphoaminophosphonic acid-adenylate ester) have been solved. AMP-PNP is an ATP analogue.
Lotus japonicus CERK6 D460N kinase (residues 303-599) and Medicago truncatula LYK3 D459N (residues 302-597) were codon optimized for expression in E. coli (Genscript) and N-terminally fused with a histidine tag, an arginine tag, a SUMO tag and a 3C protease cleavage site.
For expression, Rosetta 2 (Sigma-Aldrich) competent cells were transformed with the constructs and grown to OD600=0.6 in LB medium supplemented with the relevant antibiotics at 370 C with shaking. Then the culture was chilled on ice for 30 minutes before expression was induced by adding 0.2 mM IPTG and incubating at 190 C with shaking for 20 hours. The cells were harvested at 4400 g for 15 minutes, resuspended in LB medium and pelleted again at 3050 g for 10 minutes. The pellet was resuspended in lysis buffer (50 mM Tris-HCL pH 8.0, 500 mM NaCl, 10% glycerol, 10 mM imidazole, 5 mM BME) and lysed by sonication. Cleared supernatant was subjected to nickel immobilized metal affinity chromatography (Ni-IMAC) on a 1 ml Protino Ni-NTA column (Macherey-Nagel) equilibrated in lysis buffer. The sample was loaded onto the column using a peristaltic pump, then the column was washed with 20 column volumes 20 mM imidazole before it was eluted in 12 ml 200 mM imidazole. 3C protease and,-phosphatase was added to the eluate in an approximately 1:50 molar ratio and it was dialyzed against 1 L of dialysis buffer (50 mM Tris-HCL pH 8.0, 250 mM NaCl, 5% glycerol, 5 mM BME, 1 mM MnC12) overnight at 4° C. The dialysis product was subjected to another Ni-IMAC to remove the protease and the digested tags before the protein was purified in a final gel filtration on a Superdex75 increase 10/300 GL column or a Superdex200 increase 10/300 GL column (GE Healthcare) equilibrated in gel filtration buffer (25 mM Tris-HCL pH 8.0, 150 mM NaCl, 5 mM BME). All purification steps were analyzed by SDS-PAGE and elution fractions pooled accordingly.
Crystals from LjCERK6/303-599 D460N and MtLYK3/302-597 with AMP-PNP were obtained in a sitting drop vapor diffusion setup. The following crystallization conditions were used for each protein: LjCERK6/303-599 D460N: 0.1 M imidazole pH 8.0, 5% PEG 8000, 10 mg/ml protein; MtLYK3/302-597 D459N with AMP-PNP: 0.2 M sodium formate, 0.1 M bis-tris propane pH 8.5, 20% w/v PEG 3350, 3 mM AMP-PNP.
Crystals were fished and cryo protected as follows: LjCERK6/303-599 D460N: 25% ethylene glycol in mother liquor; MtLYK3/302-597 D459N with AMP-PNP: 20% glycerol in mother liquor.
The synchrotron data for MtLYK3/302-597 D459N with AMP-PNP were collected at beamline P13 operated by EMBL Hamburg at the PETRA III storage ring (DESY, Hamburg, Germany). The data for LjCERK6/303-599 D460N were collected at beamline BioMAX at the MAX IV storage ring (Lund, Sweden).
For both structures, the phase problem was solved by molecular replacement using Phaser (McCoy et al. (2007) J Appl Crystallogr 40, 658-674). Refinement was performed in PHENIX (Liebschner et al. (2019) Acta Cryst. D75, 861-877) and manual model building was carried out in Coot (Emsley et al. (2004) Acta Crystallogr. D60, 2126-2132). Figures of the crystal structures were prepared using PyMol Molecular Graphics system ver. 2.3.2 (Schr6dinger, LCC).
Preliminary structures of CERK6 from Lotus japonicus (LjCERK6) and LYK3 from Medicago truncatula (MtLYK3) with AMP-PNP are shown in
Most of the JM zone 4 was a well-defined alpha helix that was part of the kinase core fold. This region was annotated as aB in
The following example describes biochemical analyses of recombinant chimeric NFR1-CERK6 proteins.
The chimeras as well as apo NFR1 kinase and apo CERK6 kinase were tested on nano DSF to assess their stability and binding capacity. They were furthermore tested for kinase activity in a radioactive kinase assay and run on native PAGE.
Thermal stability of purified proteins was assayed by nano DSF using a Tycho instrument (NanoTemper Technologies) in Tycho NT.6 Capillaries (NanoTemper Technologies). The ratio between the intrinsic fluorescence intensities at 330 nm and 350 nm was measured over a temperature gradient (35-90° C.) yielding a fluorescence transition curve from which an inflection temperature could be derived. Protein was assayed at a 10 μM concentration, while nucleotide and magnesium was added at a 1 mM concentration. Samples were incubated 10 minutes on ice before measurements.
4 μg of each protein were incubated with 200 nCi [γ32-P]ATP (PerkinElmer) in 5 mM Tris-HCl pH 8.0, 30 mM NaCL, 5 mM MgCl2 and 20 mM cold ATP at room temperature for 1 hour. The samples were then separated on a 15% SDS-PAGE gel which was stained with InstantBlue (Expedion) before it was was left to expose overnight on a TranScreen HE (BioMax). The phosphor plates were scanned with a Typhoon FLA 9500 (Cytiva).
Polyacrylamide gel electrophoresis (native PAGE) was performed using self-cast 12% gels and ran on a Mini-PROTEAN tetra cell system (Bio-rad) at 4° C. 4 μg protein was loaded in each lane and the gel was run at 130 V for 3-4 hours.
In vitro biochemical experiments were completed to assess general stability, binding capacity and activity of the kinase domains of the chimeras purified from E. coli.
Nano-DSF experiments were performed on the intracellular part of the following chimeric receptors to test their general stability and their ability to bind nucleotide (
The stability and nucleotide binding capabilities of Chimera A, Chimera C, and Chimera D was seen both from the raw nano-DSF data (
From size exclusion chromatography and SDS-PAGE gels, the purified protein was concluded to be of high purity. To assess if the protein was homogenous, it was run on a native PAGE gel (
The intracellular domains of the chimeras were tested for kinase activity together with the intracellular domains of CERK6, NFR1, and NFR1 D462N (negative control). Each protein was tested for autophosphorylation and for transphosphorylation of MBP (
The following example describes phylogenetic analyses revealing that certain motifs in LysM RLK homologs are conserved while others are variable.
NFR1 and CERK6 of Lotus japonicus share high homology, with the full length proteins being 65% identical, and the kinase domains being 80% identical. They also belong to the same subfamily of LysM receptors with active kinase (NFR1 clade; shown in pink in
For the analysis, a collection of DNA and protein data was used. Sequences were retracted with blasting from the NCBI and an internal database. As queries, the ectodomain, transmembrane and juxtamembrane domains of LjNFR1 genomic or protein sequence were used. Homologues from Fabales order (i.e. homologues from the species Abrus precatorius, Arachis hypogaea, Cajanus cajan, Casaus castanospermum, Cercis spp., Cicer arietinum, Glycine max, Lupinus angustifolius, Medicago truncatula, Nissolia schottii, Pisum sativum, Chamaechrista spp., Phaseolus vulgaris, Mimosa pudica, and Trifolium spp.) were used for the study. In addition, the following protein sequences from non-Fabales species were added to the analysis: HvRLK4, MesCERKa, MesCERKb, MesLYKI, PanLYK3.1, and PanLYK3.2.
Upon gene annotations (exons/introns recognition and annotation) and translation of the coding sequences into protein, protein alignments and phylogenetic trees were created utilizing the CLC Main Workbench (Qiagen). Filtering of the different accessions was performed by taking into consideration the following criteria: a) retracted genes had to contain 11-13 exons, b) retracted or in silico translated proteins had to contain three LysM domains separated by CXC motif, c) a transmembrane domain, and d) an active kinase domain (containing DFG motif and activation loop). Alignment and phylogenetic trees were created with CLC Main Workbench, Qiagen. Phylogenetic trees: Tree construction method: Neighbor joining, Protein distance measure: Jukes-Cantor Bootstrap analysis: 100 replicates.
The model in
After applying the filtering steps described in the phylogenetic analyses methods above, the sequences were categorized into five groups based on their homology with L. japonicus NFR1, CERK6, LYS7, NFRe and LYS2 or the presence or absence of protein motifs that characterize the previous receptors (such as the lack of the “YAQ” motif in NFRe and Lys2) (
Next, the candidate residues or regions containing possible determinants of specificity for NFR1 and CERK6 were identified from the protein alignments. A phylogenetic tree containing the NFR1 and CERK6 homologs in Fabales and closest relatives from non-Fabales species was constructed (
Separate protein alignments were made for NFR1 (Fabales) homologs, CERK6 (Fabales) homologs, and RLKs (non-Fabales) homologs, respectively, and the corresponding 70% consensus sequences were identified. The 70% consensus represented the protein sequence for which, for each position, a given amino acid was conserved in more than 70% of the aligned accessions, whereas “X” was noted for the residues that varied in more than 70% of the accessions (
Alignment of the three 70% consensus protein sequences (
The following example describes experiments investigating the role of the NFR1 and CERK6 kinase domains in symbiotic signaling.
A level 2 pIV10 expression vector (Márquez, A. J., and J. Stougaard, eds. 2005. Lotus Japonicus Handbook. Dordrecht: Springer) was used for Hairy Root transformation, containing an expression cassette of nuclear localized tripleYFP driven by the constitutive promoter pUbi (pUbi::tYFP-NLS) serving as transformation marker. The chimeric constructs or chimeric variants were positioned downstream of pNFR1. Golden gate cloning (Weber et al. 2011) was used to create different receptor versions. The modules utilized to assembly the receptor chimeras were the following: LjCERK6 and LjNFR1 (genomic sequences containing introns): Ectodomain (EC), Transmembrane/Juxtamembrane domain (TM/JM), TM/JM zones 1-4 (TM/JM zone 1 is part of the TM (NFR1 zone 1=SEQ ID NO: 3; CERK6 zone 1=SEQ ID NO: 10), while zones 2-4 are part of the JM (NFR1 zone 2=SEQ ID NO: 4; NFR1 zone 3=SEQ ID NO: 5; NFR1 zone 4=SEQ ID NO: 6; CERK6 zone 2=SEQ ID NO: 11; CERK6 zone 3=SEQ ID NO: 12; CERK6 zone 4=SEQ ID NO: 13)), Kinase domain and mutagenized kinase domain.
Crystallography was performed for receptors corresponding to constructs designated 27 and 29, according to methodology in Example 8.
Modeling for homology between crystallographic models was performed according to methodology in Example 10.
Bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays and other methods were conducted as described in Example 1 unless stated otherwise.
A series of additional chimeric NFR1 receptor versions were created, in which each contained both zone 4 and the kinase domain from CERK6, but in which the sequences of the kinase surfaces (zones A through G, as identified in
A series of further variants were created based on the schemes shown in
Example 1 described experiments showing that chimeric NFR1 receptors containing the both zone 4 and the kinase domain from CERK6 were not able to complement symbiosis in the nfr1-1 mutant, but that chimeric NFR1 receptors containing zone 4 of CERK6 and kinase of NFR1 were symbiotically proficient receptors (see, e.g.,
NFR1 Kinase Zones, Individually or in Combination with Zone D, are not Sufficient to Induce Signaling
None of the tested receptor versions with individual kinase domain zones exchanged from CERK6 to NFR1 were able to induce nodule organogenesis at 35 days post infection after inoculation with M. loti R7a (see data sets labeled “Zone A”, “Zone B”, “Zone C”, “Zone D”, “Zone E”, “Zone F”, and “Zone G” in
Surprisingly, not even the chimeras containing the Activation Loop (zone A or zone D+A) from NFR1 were able to induce nodulation (
Nodulation experiments on the additional variants shown in
The following example describes experiments investigating the role of the NFR1, HvRLK4, and CERK6 kinase domains in immune signaling.
Lotus japonicus Gifu and cerk6 mutant plants were transformed with Agrobacterium rhizogenes containing the respective empty vector or chimeric CERK6-NFR1 receptor constructs. Plants with hairy roots were grown in magenta containers containing Leca substrate and B&D medium with KNO3 3 mM for 21 days. Only the transformed hairy roots were kept (detected by fluorescence under UV light since the transgenic roots have tripleYFP incorporated into their genome) and cut into 1-2 cm pieces. Equal amounts of the cut roots were collected into white 96-well flat-bottomed polystyrene plates (Greiner Bio-One) trying to put one hairy root into one well. They were kept for overnight incubation in sterile MQ water. The next day, for the ROS measurement, the water in every well was replaced by reaction Master Mix consisting of: 0.5 mM L-023 (Wako Chemicals), 5 μg/mL horseradish peroxidase (Sigma), and 0.1 mg/ml chitin. Luminescence in the 30-40 min assays was recorded with a Varioskan Flash Multimode Reader (Thermo Scientific). In one repetition, at least three wells were measured for each treatment for every genotype.
Cloning of CERK6 receptor variants containing the NFR1 kinase zones or the HvRLK4 ectodomain and membrane domains, bacteria and culture conditions, plant materials, Hairy Root transformation, nodulation assays and other methods were conducted as described in Example 11 unless stated otherwise.
Previous studies (e.g., Bozsoki et al., (2017) PNAS E8118-E8127; Bozsoki et al. (2020) Science 369, 663-670) showed that the NFR1 receptor, unlike CERK6, was not able to induce an immune response. This was explained by the fact that the NFR1 ectodomain does not recognize chitin, and its kinase lost the capacity to activate an immune signaling pathway. The CERK6 kinase domain, on the other hand, had both the capacity to activate symbiotic signaling (such as when combined with remaining NFR1 regions, as in, for example,
A series of CERK6 receptor variants containing the NFR1 kinase zones (see annotation in
Individual NFR1 Kinase Zones Function in Immune Signaling with Different Efficiencies
Experiments were undertaken to determine if specific regions of CERK6 that vary compared to NFR1 were critical in immune signaling. A series of ten CERK6 receptor variants containing NFR1 kinase zones (see
The chimeric receptors containing zone A (Activation Loop or AL) or C (C-terminal tail) of NFR1 were still able to induce reactive oxygen species (ROS), as indicated by an increase in Relative Luminescence Units (RLU) reported within 30 minutes upon 0.1 mg/ml chitin application (
Next, the impact of individual zones D, E, F, and G from NFR1 were each tested (
Combinations of NFR1 Kinase Zones Function in Immune Signaling with Different Efficiencies
The four chimeras containing NFR1 zones in different combinations were tested for their ability to induce ROS production (
Collectively, these results showed that signaling from the CERK6 kinase domain was highly sensitive to changes, given that exchanging individual or combined surfaces with those from NFR1 (C, D, E, F, G, D+B, D+C;
Individual CERK6 Kinase Zones, or CERK6 Kinase Zones in Combination with CERK6 Kinase Zones D or G, are not Sufficient to Induce Immune Signaling when Coupled with the Remaining Zones from the NFR1 Kinase Domain.
Additional experiments were undertaken to explore if specific regions of CERK6 that vary from NFR1 were sufficient to induce immune signaling. Following the same logic as above, a chimeric CERK6 receptor containing the NFR1 kinase domain, which was not able to activate immune signaling due to the presence of NFR1 kinase (
These chimeric kinase receptors were expressed in cerk6 mutants, and transformed roots were exposed to chitin to monitor receptor functionality. Production of ROS in the tested and control roots was monitored after chitin treatment for up to 30 minutes. Roots of wild-type plants and those of cerk6 mutants expressing the full-length CERK6 (
Thus, a series of experiments was designed to explore the impact of exchanging larger surfaces in the kinase domain to those of CERK6. The variant named C326-467N469-623 had the EC and TM/JM of CERK6 and a chimeric K domain with the N-terminus of the K domain from CERK6 and the C-terminus of the K domain from NFR1. The receptor variant named N328-469C467-622 had the EC and TM/JM of CERK6 and a chimeric K domain with the N-terminus of the K domain from NFR1 and the C-terminus of the K domain from CERK6. The border between N- and C- termini is highlighted in
Interestingly, the intracellular variant of C326-467N469-623 but not N328-469C467-622 coupled to CERK6 ectodomain (receptors 31 and 32) was also permissive for chitin immune signalling (
In comparing performance between RLK4 and CERK6 receptor variants with luminescence-based assays, it was found that a CERK6 chimeric construct containing a HvRLK4 kinase domain was able to induce a response to chitin, whereas HvRLK4 normally does not (
Following up on the testing of constructs with chimeric receptor zones described above, the JM-B, or zone 4, region of the entire CERK6 receptor was replaced with the corresponding region of NFR1 and compared its signalling capacity to CERK6 (
The following example describes experiments investigating the role of the JM-B motif, or the zone 4 motif, on stability of the kinase.
Cloning, plant cultivation, hairy root transformation, Agrobacterium growth conditions, crystal production, and crystal analysis were as described in the above Examples.
The intracellular domains of Lotus japonicus CERK6 (residues 303-599) and Lotus japonicus NFR1 (residues 263-623) (
The structures revealed a classic kinase fold, and were striking in their similarity (RMSD=0.724 Å) despite the fact that LYK3 were crystallized with a nucleotide and CERK6 was not. When examining a superimposition of the two structures, it was found that the largest difference between them was located at the N-lobe where LYK3 is slightly rotated compared to CERK6. It is though that this rotation might be induced by the nucleotide binding.
The JM-B has previously been considered as a part of the juxtamembrane, which is usually depicted as a flexible region connecting the catalytic kinase domain to the transmembrane domain and the ectodomain. However, these CERK6 and LYK3 structures revealed JM-B to be clearly defined in the density as a loop structure followed by a helix positioned above the anti-parallel beta sheet (b2-b5) of the kinase N-lobe. Importantly, residues in the JM-B that were conserved (across both classes of chitin and Nod factor receptors) interacted with residues in the N-lobe (
Three of JM-B residues that differ between CERK6 and NFR1 are located in the N-terminal loop, while the remaining three were on the helix, sidechains being surface exposed. In contrast to the rest of the JM-B residues, none of these made contact to the N-lobe of the kinase. This enabled receptor engineering without risk of compromising the structural integrity of the chimeras.
The following example describes experiments investigating the role of the CERK6 kinase domains in root hair formation.
Cloning, plant cultivation, hairy root transformation, and Agrobacterium growth conditions were as described in the above Examples. Nodulation assays were conducted as described in Example 11 unless stated otherwise.
Infection thread numbers were quantified at 35 days post inoculation with M. loti MAFF using a Zeiss Axioplan microscope with 546/575-640 excitation/emission (nm) parameters for detecting M. loti bacteria.
Nod factor receptors control both the developmental processes that take place during root nodule symbiosis, root nodule organogenesis, and root hair infection via infection thread formation and elongation. These two programs are tightly coordinated and the absence or impairment of one has an impact on the other (Miri et al. (2019) New Phytologist, 222(3):1523-37; Yano et al. (2008), PNAS 105(51):20540-45). Expression of NFR1 in the epidermis is critical for initiation of the root hair infection program. The Lotus nfr1 mutant is unable to initiate root hair deformation, bacteria attachment to the root, and the subsequent formation of micro-colonies in response to rhizobial infection (Radutoiu et al. (2003) Nature 425(6958):585-92).
Moreover, a constitutively active receptor complex formed by the nanobody-linked NFR1-NFR5 complex has the capacity to activate root nodule organogenesis but has a major negative impact on root infection (Rubsam et al. (2023) Science 379(6629):272-77). Together, these indicate that signaling from NFR1 and NFR5 needs to be tightly regulated.
In this study, it was identified that a chimera that contained the kinase domain from CERK6 and the EC and TM/JM from NFR1 had a reduced efficiency for root nodule organogenesis. Further, a chimera that contained the kinase domain and JM zone 4 from CERK6 and the EC and TM/JM zones 1-3 from NFR1 was nonfunctional compared to full-length NFR1 (
A significant reduction in IT formation was observed for plants expressing the construct containing the kinase domain from CERK6 and the EC and TM/JM from NFR1, while no ITs were observed when the CERK6 kinase was coupled with the CERK6 zone 4 and the EC and TM/JM zones 1-3 from NFR1 (
Potential interaction of the JM-B and C-terminal region with inhibitor(s) for symbiosis and/or inhibition of interaction with a downstream partner was investigated by testing whether a GFP nanobody-tagged receptor 6, which was symbiotically inactive (
The following example describes experiments investigating ROS production after treatment with Nod factors in lines expressing NFR1 variants containing CERK6 kinase.
Materials and methods were as described in the above Examples. For ROS measurement, 0.1 mg/ml chitin or 10−8M. loti R7A Nod factor were used.
The CERK6 kinase is essential for immune signaling resulting in robust ROS production in response to chitin in Lotus roots (Bozsoki et al. (2020) Science, 369(6504):663-70). In the present study, NFR1 chimeric receptors that contain the CERK6 kinase have been used to rescue the symbiotic phenotype in nfr1 roots, and they were identified to have a reduced functionality (
The tested constructs included: one chimeric construct with the EC and TM/JM from NFR1 and the K domain from CERK6; one chimeric construct with the EC from NFR1 and the TM/JM and K domains from CERK6; one chimeric construct with the EC and TM/JM zones 1-3 from NFR1 and JM zone 4 and the K domain from CERK6; one chimeric construct with the EC and JM zone 4 from NFR1 and TM/JM zones 1-3 and the K domain from CERK6; and one chimeric construct with the EC and K domains from NFR1 and the TM/JM from CERK6, all expressed under the NFR1 promotor. An additional negative control containing the EC and TM/JM domains from NFR1 and the CERK6 kinase domain with the kinase-dead (K351N) mutation was also included in this assay.
It was found that none of the tested roots, irrespective of the expressed construct, was able to induce ROS within 30 minutes after 10−8M M. loti R7A NF application. By contrast, all roots responded normally to chitin, since native CERK6 was present in the nfr1 mutant (
The following example describes experiments investigating whether NFR1 variants containing CERK6 kinase form inhibitory dimers with the native CERK6.
Materials and methods were as described in the above Examples unless noted otherwise.
To test the hypothesis that NFR1 or CERK6 form homo- or heterodimers with each other, an in vitro co-immunoprecipitation assay is created for receptors expressed in N. benthamiana leaves, using a binary vector for transient transformation, in which chimeric receptors or receptor variants are tagged with eGFP on the C-terminal and expressed with a 35s promotor and 35s terminator, or tagged with mCherry on the C-terminal and expressed with a Ubiquitin (Ubi) promoter and NOS terminator. Fusion of the receptor variants with variable fluorescent tags, in this case GFP or mCherry, provides the opportunity to detect co-immunoprecipitated receptor molecules by performing Western blot analyses with the use of different antibodies.
RLK receptors form homo- and heteromeric complexes at the plasma membrane in order to activate their kinase. Often the formation of such complexes is forced by interactions between intracellular domains (Maeda et al. (2018) Biophys. J., 114(4):893-903; Petutschnig et al. (2010) JBC, 285(37):28902-11; Rubsam et al. (2023) Science, 379(6629):272-77). It has not been yet proven that NFR1 or CERK6 forms homodimers for signaling transduction in planta but there is evidence that they interact with their respective homomers to promote phosphorylation (Madsen et al. (2011) The Plant J., 65(3):404-17).
All the receptor variants were expressed and localized in the plasma membrane (
It was then hypothesized that NFR1 and CERK6 can form heterodimers through their intracellular domains, which might inhibit the function of NFR1 in symbiosis. To investigate whether NFR1-CERK6 chimeras inhibit the function of native NFR1 expressed in the Gifu wild type and thus lead to impaired nodulation, the chimeras were expressed in Gifu and nodulation events were assessed 35 dpi with M. loti R7A dsRed. All transformed Gifu roots induced nodulation with the same efficiency (
The following example describes experiments investigating the NFR1 zone 4 biochemical properties that are required for symbiotic signaling.
Materials and methods were as described in the above Examples unless noted otherwise.
The present study underlined the importance of NFR1 M306 and A308 in root nodule symbiosis signaling. In the presence of CERK6 kinase, substitutions of the NFR1 methionine or alanine with the corresponding threonine or aspartic acid from CERK6 impaired nodulation, probably due to a shift from a non-polar to a polar state. Methionine residues can act as “ROS sensors” by having the property of being oxidized to methionine sulfoxide (MetO), and therefore, they can regulate kinase function in the cytoplasm (Aledo (2019) Protein Science 28(10):1785-96.). In the reduced form, methionine can establish interactions with aromatic residues (phenylalanine, tyrosine, and tryptophan) that are present in close proximity, both inter- and intramolecularly, and can regulate their phosphorylation (Lewis et al. (2016) Nature Chemical Biology 12(10):860-66; Reid, Lindley, and Thornton (1985) FEBS Letters 190(2):209-13; Valley et al. (2012) JBC 287(42):34979-91; Veredas, Cant6n, and Aledo (2017) Scientific Reports 7(1):40403). Alanine is a small, non-polar residue that plays a significant role in the stabilization of protein conformations, giving a hydrophobic character to the corresponding protein region (Rohl, Fiori, and Baldwin (1999) PNAS 96(7):3682-87).
The role of M306 and A308 during symbiosis was examined by introducing mutagenized forms of Chimera A in the nfr1 mutant and testing for genetic complementation (see schematic representation in
All four substitutions of the M306 resulted in a diminished symbiotic capacity when compared to Chimera A. The number of nodulating plants, as well as the number of nodules per plant was reduced (
The role of A308 was investigated through mutation to aspartic acid (A308D), glutamic acid (A308E), or leucine (A308L) substitutions. Aspartic acid and glutamic acid, with the first being present in CERK6, are negatively charged residues; these substitutions were designed to investigate if the hydrophobic character of A308 is essential for signaling. The substitution to leucine, a larger and hydrophobic residue, was designed to investigate if the small size of A308 is crucial for its function.
Results from the expression of receptor variants carrying the three A308 substitutions in nfr1-1 pNIN:GUS roots revealed a lower nodulation tendency than that of Chimera A (
Aspartic acid and glutamic acid are two very similar negatively charged residues, with the only obvious difference being their size, since aspartic acid has one fewer methyl group than glutamic acid. The D-K salt bridge appears to be stronger than the E-K in vitro, as the smaller size of aspartic acid's side chain stabilizes the interaction (Smith and Scholtz (1998) Biochemistry 37(1):33-40). If the stable interaction between zone 4 and the kinase of CERK6 is inhibitory for nodulation, this would explain why the A308E substitution did not have a dramatic impact on nodule formation.
The following example describes experiments investigating the effects on nodulation of mutations in the NFR1 activation loop (AL).
Materials and methods were as described in the above Examples unless noted otherwise.
Codon mutagenesis on the Activation loop (AL) was performed utilizing the Q5® Site-Directed Mutagenesis Kit with primer pairs designed with the NEBaseChanger online tool following the manufacturer's manual. The mutated clones were transformed into chemical competent E. coli TOP10.
In the MtLYK3 structure, part of the AL appeared to form an α-helix that was parallel to the regulatory αC-helix (
It was hypothesized that the helical conformation of the NFR1 AL was inhibitory for nodulation and that release of this conformation might lead to a constitutive active kinase. To test this hypothesis, hydrophobic interface between two parallel helixes were disrupted by substituting leucine468, which was located in the AL, with a positive-charged arginine. The mutated NFR1 receptor was then expressed in nfr1 roots, and its efficiency in nodule formation with or without inoculation with M. loti R7A was evaluated. No spontaneous nodules were formed in the uninoculated transformed root, indicating that the L468R mutation did not create a constitutively active kinase form that engages in signaling transduction in absence of NF (
The following example describes experiments investigating the role of CERK6 juxtamembrane in immune signaling.
Materials and methods were as described in the above Examples unless noted otherwise.
A previously published study (Bozsoki et al. (2020) Science 369(6504):663-70) utilizing chimera-based approaches to investigate the role of NFR1 and CERK6 intracellular domains identified that the CERK6 core kinase was essential for immune signaling, since its substitution with the corresponding NFR1 kinase failed to initiate the immune pathway that leads to ROS production. In the same study, it was reported that when the transmembrane and juxtamembrane domains of CERK6 are exchanged with the corresponding regions in NFR1, this leads to a low intensity of ROS response (see
These results could be explained by two hypotheses: either the symbiotic NFR1 kinase lost immune-triggering traits during evolution, or it evolved an inhibitory mechanism by a cooperation between the kinase and the TM/JM domain to assure that no robust ROS responses will be induced during NF signaling. To further investigate the role of intracellular CERK6 and NFR1 regions in immunity, additional experiments were performed using chimera-swap approaches.
CERK6 is ubiquitously expressed in all tissues regardless infection from pathogens, and induces ROS production within minutes upon chitin perception (Bozsoki et al. (2017) PNAS 114(38):E8118-27; Lohmann et al. (2010) MPMI 23(4):510-21). This suggests that the receptor is present in the plasma membrane in a resting state that rapidly can be reverted to an active state to induce a robust defense mechanism in response to pathogen invasion. Results from the present study suggest that the existence of an allosteric mechanism between CERK6 zone 4 and the kinase (K) (see
To test if substitution of CERK6 JM regions with the corresponding NFR1 lead to a constitutive active CERK6 or increases the ROS-response intensity upon perception of chitin, a series of swap-domain chimeras between CERK6 and NFR1 TM/JM zones coupled with the EC and K of CERK6 were constructed and expressed in the cerk6 mutant roots. Efficiency of the different constructs was evaluated in a Luminescence-based assay for ROS detection within 30 minutes upon chitin octamer (C08) application in different concentrations. None of the tested constructs or the Gifu wild type and CERK6 complementation were able to induce ROS production with the lower concentrations of C08, (C08-10 and C08-12), (
The following example describes experiments investigating the structure of the CERK6 activation loop in the absence of the ligand.
Materials and methods were as described in the above Examples unless noted otherwise.
The CERK6 crystal structure revealed that, like in the case of MtLYK3, part of the AL forms an α-helix that was parallel to the regulatory αC-helix (
This phenotype has been observed before when the CERK6 receptor was overexpressed in cerk6 roots (unpublished data). Overexpression of CERK6 results in cell death, likely due to overactivation of the immune response (Bozsoki et al. (2017) PNAS 114(38):E8118-27). These data suggest that the helical conformation of the CERK6 AL was inhibitory for immune signaling by keeping the kinase in an “off” state in the absence of elicitor, and disruption of this conformation leads to a constitutively active kinase. CERK6 and LYK3 protomers were also captured in the crystals in an inactive state where the αC helix and the inhibitory helix of the activation loops kept each other in an “off” state by the salt bridge established between the glutamate E362 and lysine K464 (
This application claims the benefit of priority to U.S. Provisional 63/490,219, filed Mar. 14, 2023, which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63490219 | Mar 2023 | US |