The content of the following submission on ASCII text file is incorporated herein by reference in its entirety: a computer readable form (CRF) of the Sequence Listing (file name: 794542000440SEQLIST.txt, date recorded: Aug. 12, 2019, size: 298 KB).
The present disclosure relates to genetically altered LysM receptors. In particular, the present disclosure relates to a hydrophobic patch into the LysM2 domain which can increase affinity and/or selectivity for LCOs and by replacement of regions in the LysM1 domain with the corresponding regions of the LysM1 domain from a donor LysM receptor that can alter the affinity and/or selectivity for the oligosaccharide, particularly for LCOs, and can alter the specificity between LCOs when using regions from a high affinity and specificity LCO LysM receptor such as a legume NFR1 receptor. The present disclosure also relates to genetically altering LysM receptors in plants to include a hydrophobic patch or alter the hydrophobic patch and to genetically altering LysM receptors in plants by replacement of regions in the LysM2 domain.
Plants are exposed to a wide variety of microbes in their environment, both benign and pathogenic. To protect against the pathogenic microbes, plants have the ability to recognize specific molecular signals of the microbes through an array of receptors and, depending upon the pattern of the signals, can initiate an appropriate immune response. The molecular signals are derived from secreted materials, cell-wall components, and even cytosolic proteins of the microbes. Chitooligosaccharides (COs) are an important fungal molecular signal that plants recognize through the chitin receptors CEBiP, CERK1, LYK5, and CERK6 (previously called LYSE) found on the plasma membrane. These receptors are in the LysM class of receptors and recognize the size and the acetylation of COs from fungi. Lipo-chitooligosaccharides (LCOs) are another important molecular signal that can be found on both bacteria and fungi that are recognized by other LysM receptors.
In addition to benign and pathogenic microbes, some microbes can be beneficial to plants through association or symbiosis. Plants that enter into symbiotic relationships with certain nitrogen fixing bacteria and fungi need to be able to recognize the specific bacterial or fungal species to initiate the symbiosis while still being able to activate their immune systems to respond to other bacteria and fungi. One important mechanism that allows plants to recognize these specific bacteria or fungi is through specialized LysM receptors that have high affinity, high selectivity, and/or high specificity for the form of LCOs produced by the specific bacteria or fungi while LCOs from other bacteria and fungi are not recognized by these specialized LysM receptors.
Experimental and computational approaches have been used to identify a number of these specialized LysM receptors (also referred to as high affinity and specificity LCO receptors). As these receptors are required for recognizing symbiotic bacterial and fungal species, and for initiating symbiosis, these receptors represent an important component of any plant engineering strategy. Using these receptors, however, will not be particularly straightforward; transferring a specialized LysM receptor into a plant that does not currently have one may require codon optimization, the identification of suitable promoters, the use of targeting signals, and further engineering approaches needed to adapt exogenous sequences for optimal expression. Further, the number of these receptors that have been identified is currently limited.
Moreover, species that already have specialized LysM receptors, e.g., legumes, cannot be easily engineered with new specialized LysM receptors. Currently, legumes are limited to the specific bacterial or fungal species with which they form symbiotic associations. While legumes may have the benefit of existing symbiotic associations, their agricultural potential is limited. For example, legumes cannot currently be easily engineered to have different specificity for different symbiotic microbial species, which would allow legumes to better form associations with the bacterial or fungal species in different soils. Moreover, legumes cannot be easily engineered to have improved specialized LysM receptors. Further, legumes cannot currently be engineered to have synergistic symbiotic requirements with other crops grown in rotation with them. Editing approaches are needed for both the modification of endogenous LysM receptors into specialized LysM receptors able to perceive symbiotic bacterial and fungal species, and the modification of specialized LysM receptors into specialized LysM receptors with different specific recognition of symbiotic bacterial and fungal species.
In order to meet these needs, the present disclosure provides complementary means of modifying LysM receptors by introduction of a hydrophobic patch into the LysM2 domain which can increase affinity and/or selectivity for LCOs, and by replacement of regions in the LysM1 domain with the corresponding regions of the LysM1 domain from a donor LysM receptor that can alter the affinity and/or selectivity for the oligosaccharide, particularly for LCOs, and can alter the specificity between LCOs when using regions from a high affinity and specificity LCO LysM receptor such as a legume NFR1 receptor.
Certain aspects of the present disclosure relate to a modified plant LysM receptor comprising a LysM2 domain modified to comprise a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the modified LysM2 domain binds a lipo-chitooligosaccharide (LCO). In some embodiments, the modified LysM2 domain binds the LCO with higher affinity than the unmodified LysM2 domain. In some embodiments, the modified LysM2 domain binds the LCO with higher selectivity for the LCO than the unmodified LysM2 domain. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the LCO. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the lipid of the LCO. In some embodiments, the LCO is produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof.
In some embodiments of any of the above embodiments, the LysM receptor is selected from the group consisting of a LysM chitooligosaccharide (CO) receptor, a LysM LCO receptor, and a LysM peptidoglycan (PGN) receptor. In some embodiments, the hydrophobic patch is adjacent to a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is adjacent to a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the LysM receptor is not an exopolysaccharide (EPS) receptor.
In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having at least 70% sequence identity, at least 75% sequence identity, at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6). In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having amino acid sequence SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6).
In some embodiments of any of the above embodiments, the hydrophobic patch was generated by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, or combinations thereof. In some embodiments, the at least one amino acid was identified by an amino acid sequence alignment with a LysM2 domain from a LysM high affinity LCO receptor that naturally has a hydrophobic patch that interacts with LCO. In some embodiments, the at least one amino acid corresponds to an amino acid that is red highlighted in red in
In some aspects, the present disclosure relates to a modified plant LysM receptor comprising a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. In some embodiments, the first LysM1 domain is modified by substituting a first part of the first LysM1 domain with a third part of a second LysM1 domain and/or by substituting a second part of the first LysM1 domain with a fourth part of the second LysM1 domain. In some embodiments, the first LysM1 domain and the second LysM1 domain have different affinities, selectivities, and/or specificities for oligosaccharides and the modification of the first LysM1 domain alters the affinity, selectivity, and/or specificity to be more like the second LysM1 domain. In some embodiments, the first part and the third part correspond to SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI, SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and wherein the second part and the fourth part correspond to SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; SEQ ID NO:29 [Lotus NFR1 region IV 73-81], or LNDINIQSF. In some embodiments, the first LysM1 domain is selected from the group of SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/26-95], SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/25-95], or NFR1 DLALASYYILPGVFILQNITTFMQSEIVSSNDAITSYNKDKILNDINIQSFQRLNIPFP; and the second LysM1 domain is CERK6: ALAQASYYLLNGSNLTYISEIMQSSLLTKPEDIVSYNQDTIASKDSVQAGQRINVPFP. In some embodiments, the first part is selected from SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI; the second part is selected from SEQ ID NO:28 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; the third part is selected from SEQ ID NO:31 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and the fourth part is selected from SEQ ID NO:29 [Lotus NFR1 region IV 73-81] or LNDINIQSF. In some embodiments, the entire first LysM1 domain was replaced with the entire second LysM1 domain. In some embodiments, the modified LysM1 domain binds a lipo-chitooligosaccharide (LCO) produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified LysM1 domain binds an LCO with higher affinity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with higher selectivity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with altered specificity as compared to an unmodified LysM1 domain. In some embodiments, structural modelling was used to define the LysM1 domain and was used to identify the first part, the second part, the third part, and/or the fourth part for substitution. In some embodiments, the receptor of the above embodiments further contains a LysM2 domain modified to contain a hydrophobic patch as in any one of the previous embodiments relating to modifying the LysM2 domain.
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, comprising a nucleic acid sequence encoding the modified plant LysM receptor of any one of the preceding embodiments. In some embodiments, the modified plant LysM receptor has higher affinity, higher selectivity, and/or altered specificity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high affinity, high selectivity, and/or altered specificity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn, rice, barley, wheat, Trema spp., apple, pear, plum, apricot, peach, almond, walnut, strawberry, raspberry, blackberry, red currant, black currant, melon, cucumber, pumpkin, squash, grape, bean, pea, chickpea, cowpea, pigeon pea, lentil, Bambara groundnut, lupin, pulses, Medicago spp., Lotus spp., forage legumes, indigo, legume trees, or hemp. In some embodiments, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof.
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, comprising a first nucleic acid sequence encoding a modified plant LysM receptor where the LysM1 domain has been modified as in any of the preceding embodiments relating to modification to the LysM1 domain and a second nucleic acid sequence encoding a modified plant LysM receptor where the LysM2 domain has been modified to include a hydrophobic patch as in any of the preceding embodiments relating to modifications to the LysM2 domain. In some embodiments, the modified plant LysM receptor has higher affinity, higher selectivity, and/or altered specificity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high affinity, high selectivity, and/or altered specificity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the first nucleic acid or second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO: 24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn, rice, barley, wheat, Trema spp., apple, pear, plum, apricot, peach, almond, walnut, strawberry, raspberry, blackberry, red currant, black currant, melon, cucumber, pumpkin, squash, grape, bean, pea, chickpea, cowpea, pigeon pea, lentil, Bambara groundnut, lupin, pulses, Medicago spp., Lotus spp., forage legumes, indigo, legume trees, or hemp. In some embodiments, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof. In some embodiments, the plant part is a fruit, a kernel, or a grain.
In some aspects, the present disclosure relates to a pollen grain or an ovule of a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a protoplast from a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a tissue culture produced from protoplasts or cells from a genetically altered plant of any of the above embodiments relating to plants, wherein the cells or protoplasts are produced from a plant part selected from the group consisting of leaf, anther, pistil, stem, petiole, root, root primordia, root tip, fruit, seed, flower, cotyledon, hypocotyl, embryo, and meristematic cell.
In some aspects, the present disclosure relates to a method of producing the genetically altered plant of any one of the above embodiments relating to plants, comprising introducing a genetic alteration to the plant comprising the nucleic acid sequence. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO: 24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter (KAY et al. Science, 236, 4805, 1987), a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to an endogenous promoter. In some embodiments, the endogenous promoter is a root specific promoter.
Additional aspects of the present disclosure relate to a modified plant LysM receptor including a LysM2 domain modified to include a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the modified LysM2 domain binds a lipo-chitooligosaccharide (LCO). In some embodiments, the modified LysM2 domain binds the LCO with higher affinity than the unmodified LysM2 domain. In some embodiments, the modified LysM2 domain binds the LCO with higher selectivity for the LCO than the unmodified LysM2 domain. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the LCO. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the lipid of the LCO. In some embodiments, the LCO is produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof.
In some embodiments of any of the above embodiments, the LysM receptor is selected from the group of a LysM chitooligosaccharide (CO) receptor, a LysM LCO receptor, or a LysM peptidoglycan (PGN) receptor. In some embodiments, the hydrophobic patch is adjacent to a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is adjacent to a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the LysM receptor is not an exopolysaccharide (EPS) receptor.
In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having at least 70% sequence identity, at least 75% sequence identity, at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6). In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having amino acid sequence SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6).
In some embodiments of any of the above embodiments, the hydrophobic patch was generated by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, or combinations thereof. In some embodiments of any of the above embodiments, the hydrophobic patch was generated by modifying an existing hydrophobic patch in the unmodified LysM receptor. In some embodiments, the unmodified LysM receptor was modified by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, substituting at least one hydrophobic amino acid residue with another hydrophobic amino acid residue, or combinations thereof. In some embodiments, the at least one amino acid was identified by an amino acid sequence alignment with a LysM2 domain from a LysM high affinity LCO receptor that naturally has a hydrophobic patch that interacts with LCO. In some embodiments, the at least one amino acid corresponds to an amino acid that is in bold underline in
In some aspects, the present disclosure relates to a modified plant LysM receptor including a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. In some embodiments, the first LysM1 domain is modified by substituting a first part of the first LysM1 domain with a third part of a second LysM1 domain and/or by substituting a second part of the first LysM1 domain with a fourth part of the second LysM1 domain. In some embodiments, the first LysM1 domain and the second LysM1 domain have different affinities, selectivities, and/or specificities for oligosaccharides and the modification of the first LysM1 domain alters the affinity, selectivity, and/or specificity to be more like the second LysM1 domain. In some embodiments, the first part and the third part correspond to SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI, SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and wherein the second part and the fourth part correspond to SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; SEQ ID NO:29 [Lotus NFR1 region IV 73-81], or LNDINIQSF. In some embodiments, the first LysM1 domain is selected from the group of SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/26-95], SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/25-95], or NFR1 DLALASYYILPGVFILQNITTFMQSEIVSSNDAITSYNKDKILNDINIQSFQRLNIPFP; and the second LysM1 domain is CERK6: ALAQASYYLLNGSNLTYISEIMQSSLLTKPEDIVSYNQDTIASKDSVQAGQRINVPFP. In some embodiments, the first part is selected from SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI; the second part is selected from SEQ ID NO:28 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; the third part is selected from SEQ ID NO:31 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and the fourth part is selected from SEQ ID NO:29 [Lotus NFR1 region IV 73-81] or LNDINIQSF. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the first LysM1 domain is further modified by substituting a fifth part of the first LysM1 domain with a sixth part of a second LysM1 domain. In some embodiments, the first LysM1 domain is SEQ ID NO:115 [LysM1 domain Lotus NFR1; LjNFR1/32-89] or SEQ ID NO:106 [LysM1 domain Lotus NFR1; LjNFR1/31-89] and the second LysM1 domain is SEQ ID NO:114 [LysM1 domain Medicago LYK3; MtLYK3/31-89] or SEQ ID NO:105 [LysM1 domain Medicago LYK3; MtLYK3/30-89]. In some embodiments, wherein the fifth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/56-92], and wherein the sixth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62]. In some embodiments, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82]. In some embodiments, the first LysM1 domain is SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/31-89] and the second LysM1 domain is SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/32-89]. In some embodiments, the fifth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62], and the sixth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/59-62]. In some embodiments of any of the above embodiments including the first LysM1 domain being SEQ ID NO:33 and the second LysM1 domain being SEQ ID NO:32, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82].
In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the entire first LysM1 domain was replaced with the entire second LysM1 domain. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, either or both (i) 80% or fewer, 70% or fewer, 60% or fewer, 50% or fewer, 40% or fewer, 30% or fewer, or 20% or fewer of amino acid residues in the first LysM1 domain were substituted or deleted with the corresponding amino acid residues of the second LysM1 domain, and (ii) the entire LysM1 domain in the unmodified plant LysM receptor was not substituted with another entire LysM2 domain to generate the modified plant LysM receptor. In some embodiments, the modified LysM1 domain binds a lipo-chitooligosaccharide (LCO) produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof. In some embodiments, the modified LysM1 domain binds an LCO with higher affinity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with higher selectivity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with altered specificity as compared to an unmodified LysM1 domain. In some embodiments, structural modelling was used to define the LysM1 domain and was used to identify the first part, the second part, the third part, and/or the fourth part for substitution. In some embodiments, the unmodified plant LysM receptor was selected using the method of any one of the aspects of the present disclosure relating to such selection including any and all embodiments thereof and the second LysM2 domain is from the donor plant LysM receptor. In some embodiments, the receptor of the above embodiments further contains a LysM2 domain modified to contain a hydrophobic patch as in any one of the previous embodiments relating to modifying the LysM2 domain.
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, including a nucleic acid sequence encoding the modified plant LysM receptor of any one of the preceding embodiments. In some embodiments, the modified plant LysM receptor has higher affinity, higher selectivity, and/or altered specificity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high affinity, high selectivity, and/or altered specificity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn, rice, barley, wheat, Trema spp., apple, pear, plum, apricot, peach, almond, walnut, strawberry, raspberry, blackberry, red currant, black currant, melon, cucumber, pumpkin, squash, grape, bean, soybean, pea, chickpea, cowpea, pigeon pea, lentil, Bambara groundnut, lupin, pulses, Medicago spp., Lotus spp., forage legumes, indigo, legume trees, or hemp. In some embodiments, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof.
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, including a first nucleic acid sequence encoding a modified plant LysM receptor where the LysM1 domain has been modified as in any of the preceding embodiments relating to modification to the LysM1 domain and a second nucleic acid sequence encoding a modified plant LysM receptor where the LysM2 domain has been modified to include a hydrophobic patch as in any of the preceding embodiments relating to modifications to the LysM2 domain. In some embodiments, the modified plant LysM receptor has higher affinity, higher selectivity, and/or altered specificity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high affinity, high selectivity, and/or altered specificity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the first nucleic acid or second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO: 24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn, rice, barley, wheat, Trema spp., apple, pear, plum, apricot, peach, almond, walnut, strawberry, raspberry, blackberry, red currant, black currant, melon, cucumber, pumpkin, squash, grape, bean, soybean, pea, chickpea, cowpea, pigeon pea, lentil, Bambara groundnut, lupin, pulses, Medicago spp., Lotus spp., forage legumes, indigo, legume trees, or hemp. In some embodiments, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof. In some embodiments, the plant part is a fruit, a kernel, or a grain.
In some aspects, the present disclosure relates to a pollen grain or an ovule of a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a protoplast from a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a tissue culture produced from protoplasts or cells from a genetically altered plant of any of the above embodiments relating to plants, wherein the cells or protoplasts are produced from a plant part selected from the group of leaf, anther, pistil, stem, petiole, root, root primordia, root tip, fruit, seed, flower, cotyledon, hypocotyl, embryo, or meristematic cell.
In some aspects, the present disclosure relates to a method of producing the genetically altered plant of any one of the above embodiments relating to plants, including introducing a genetic alteration to the plant having the nucleic acid sequence. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO: 24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter (KAY et al. Science, 236, 4805, 1987), a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to an endogenous promoter. In some embodiments, the endogenous promoter is a root specific promoter.
In further aspects, the present disclosure relates to methods for selection of a target plant LysM receptor for modifying the target plant LysM receptor to have a desired receptor characteristic, wherein the method includes the steps of: a) providing a structural model, a molecular model, a surface characteristics model, and/or an electrostatic potential model of a donor plant LysM receptor having the desired receptor characteristic and two or more potential target plant LysM receptors; b) comparing each of the two or more potential target plant LysM receptors with the structural model, the molecular model, the surface characteristics model, and/or the electrostatic potential model of the donor plant LysM receptor, and/or comparing each of the two or more potential target plant LysM receptors with the donor plant LysM receptor using structural overlay; and c) selecting the potential target plant LysM receptor with a suitable match for the donor plant LysM receptor to be the target plant LysM receptor. In some embodiments, the criteria for determining that the potential target plant LysM receptor is a suitable match for the donor plant LysM receptor in step (c) are selected from the group of goodness of fit to template structure; similarity; phylogenetic relation; surface potential; coverage to template structure; GMQE, QMEAN, and Local Quality estimates from SWISS-Model; or any combination thereof. In some embodiments, the structural model of a donor plant LysM receptor is a protein crystal structure, a molecular model, a cryo-EM structure, and a NMR structure. In some embodiments, the donor plant LysM receptor model is of an entire ectodomain and the two or more potential target plant LysM receptor models are of entire ectodomains. In some embodiments, the donor plant LysM receptor model is of a LysM1 domain, a LysM2 domain, a LysM3 domain, or any combination thereof, and the two or more potential target plant LysM receptor models are of LysM1 domains, LysM2 domains, LysM3 domains, or any combination thereof.
In some embodiments, the donor plant LysM receptor is Medicago NFP, Medicago LYK3, Lotus NFR1, Lotus NFR5, Lotus LYS11, or Arabidopsis CERK1. In some embodiments, the two or more target plant LysM receptors are additionally compared to Lotus CERK6. In some embodiments, the two or more potential target plant LysM receptor polypeptides are all from the same plant species or plant variety. In some embodiments, the desired receptor characteristic is affinity, selectivity, and/or specificity for an oligosaccharide or class of oligosaccharides. In some embodiments, the desired receptor characteristic is binding kinetics for an oligosaccharide or class of oligosaccharides, wherein the binding kinetics include off-rate and on-rate. In some embodiments, the class of oligosaccharides is selected from the group of LCOs, COs, beta-glucans, cyclic-beta-glucans, exopolysaccharides, or optionally LPS. In some embodiments, the class of oligosaccharides is LCOs or COs. In some embodiments, the class of oligosaccharides is LCOs, optionally produced by a produced by a nitrogen-fixing bacteria optionally selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof; or optionally produced by a mycorrhizal fungi optionally selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, pr any combination thereof. In some embodiments, the LCOs are M. loti LCO, S. meliloti LCO-IV, or S. meliloti LCO-V.
In some embodiments, the method further includes step d) identifying one or more amino acid residues for modification in the target LysM receptor by comparing amino acid residues of a first oligosaccharide binding feature in the donor plant LysM receptor with the corresponding amino acid residues in the target plant LysM receptor, and optionally identifying one or more amino acid residues for modification in the target LysM receptor by comparing amino acid residues of a second oligosaccharide binding feature in the donor plant LysM receptor with the corresponding amino acid residues in the target plant LysM receptor. In some embodiments, the method further includes step e) generating a modified plant LysM receptor wherein the one or more amino acid residues in the first oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM; generating a modified plant LysM receptor wherein the one or more amino acid residues in the second oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM; or generating a modified plant LysM receptor wherein the one or more amino acid residues in the first oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM and the one or more amino acid residues in the second oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM. In some embodiments, the first oligosaccharide binding feature is a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the second oligosaccharide binding feature is a part of the LysM1 domain of the donor plant LysM receptor.
In additional aspects, the present disclosure relates to a a modified plant LysM receptor produced using any one of the preceding methods, wherein the modified plant LysM receptor includes a LysM2 domain modified to include a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the modified LysM2 domain binds a lipo-chitooligosaccharide (LCO). In some embodiments, the modified LysM2 domain binds the LCO with higher affinity than the unmodified LysM2 domain. In some embodiments, the modified LysM2 domain binds the LCO with higher selectivity for the LCO than the unmodified LysM2 domain. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the LCO. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the lipid of the LCO. In some embodiments, the LCO is produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof.
In some embodiments of any of the above embodiments, the LysM receptor is selected from the group of a LysM chitooligosaccharide (CO) receptor, a LysM LCO receptor, or a LysM peptidoglycan (PGN) receptor. In some embodiments, the hydrophobic patch is adjacent to a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is adjacent to a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the hydrophobic patch is within 30 Å, 20 Å, 10 Å, 7.5 Å, 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, or 1 Å of a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the LysM receptor is not an exopolysaccharide (EPS) receptor.
In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having at least 70% sequence identity, at least 75% sequence identity, at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6). In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having amino acid sequence SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6).
In some embodiments of any of the above embodiments, the hydrophobic patch was generated by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, or combinations thereof. In some embodiments of any of the above embodiments, the hydrophobic patch was generated by modifying an existing hydrophobic patch in the unmodified LysM receptor. In some embodiments, the unmodified LysM receptor was modified by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, substituting at least one hydrophobic amino acid residue with another hydrophobic amino acid residue, or combinations thereof. In some embodiments, the at least one amino acid was identified by an amino acid sequence alignment with a LysM2 domain from a LysM high affinity LCO receptor that naturally has a hydrophobic patch that interacts with LCO. In some embodiments, the at least one amino acid corresponds to an amino acid that is in bold underline in
In further aspects, the present disclosure relates to a a modified plant LysM receptor produced using any one of the preceding methods, wherein the modified plant LysM receptor includes a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. In some embodiments, the first LysM1 domain is modified by substituting a first part of the first LysM1 domain with a third part of a second LysM1 domain and/or by substituting a second part of the first LysM1 domain with a fourth part of the second LysM1 domain. In some embodiments, the first LysM1 domain and the second LysM1 domain have different affinities, selectivities, and/or specificities for oligosaccharides and the modification of the first LysM1 domain alters the affinity, selectivity, and/or specificity to be more like the second LysM1 domain. In some embodiments, the first part and the third part correspond to SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI, SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and wherein the second part and the fourth part correspond to SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; SEQ ID NO:29 [Lotus NFR1 region IV 73-81], or LNDINIQSF. In some embodiments, the first LysM1 domain is selected from the group of SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/26-95], SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/25-95], or NFR1 DLALASYYILPGVFILQNITTFMQSEIVSSNDAITSYNKDKILNDINIQSFQRLNIPFP; and the second LysM1 domain is CERK6: ALAQASYYLLNGSNLTYISEIMQSSLLTKPEDIVSYNQDTIASKDSVQAGQRINVPFP. In some embodiments, the first part is selected from SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI; the second part is selected from SEQ ID NO:28 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; the third part is selected from SEQ ID NO:31 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and the fourth part is selected from SEQ ID NO:29 [Lotus NFR1 region IV 73-81] or LNDINIQSF. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the first LysM1 domain is further modified by substituting a fifth part of the first LysM1 domain with a sixth part of a second LysM1 domain. In some embodiments, the first LysM1 domain is SEQ ID NO:115 [LysM1 domain Lotus NFR1; LjNFR1/32-89] or SEQ ID NO:106 [LysM1 domain Lotus NFR1; LjNFR1/31-89] and the second LysM1 domain is SEQ ID NO:114 [LysM1 domain Medicago LYK3; MtLYK3/31-89] or SEQ ID NO:105 [LysM1 domain Medicago LYK3; MtLYK3/30-89]. In some embodiments, wherein the fifth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/59-62], and wherein the sixth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62]. In some embodiments, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82]. In some embodiments, the first LysM1 domain is SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/31-89] and the second LysM1 domain is SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/32-89]. In some embodiments, the fifth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62], and the sixth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/59-62]. In some embodiments of any of the above embodiments including the first LysM1 domain being SEQ ID NO:33 and the second LysM1 domain being SEQ ID NO:32, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82].
In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the entire first LysM1 domain was replaced with the entire second LysM1 domain. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, either or both (i) 80% or fewer, 70% or fewer, 60% or fewer, 50% or fewer, 40% or fewer, 30% or fewer, or 20% or fewer of amino acid residues in the first LysM1 domain were substituted or deleted with the corresponding amino acid residues of the second LysM1 domain, and (ii) the entire LysM1 domain in the unmodified plant LysM receptor was not substituted with another entire LysM2 domain to generate the modified plant LysM receptor. In some embodiments, the modified LysM1 domain binds a lipo-chitooligosaccharide (LCO) produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof In some embodiments, the modified LysM1 domain binds an LCO with higher affinity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with higher selectivity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with altered specificity as compared to an unmodified LysM1 domain. In some embodiments, structural modelling was used to define the LysM1 domain and was used to identify the first part, the second part, the third part, and/or the fourth part for substitution. In some embodiments, the receptor of the above embodiments further contains a LysM2 domain modified to contain a hydrophobic patch as in any one of the previous embodiments relating to modifying the LysM2 domain. In some embodiments that may be combined with any of the preceding embodiments, the present disclosure related to a genetically altered plant or part thereof including the modified plant LysM receptor of any of the above embodiments.
The following description sets forth exemplary methods, parameters, and the like. It should be recognized, however, that such description is not intended as a limitation on the scope of the present disclosure but is instead provided as a description of exemplary embodiments.
Certain aspects of the present disclosure relate to a modified plant LysM receptor comprising a LysM2 domain modified to comprise a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the modified LysM2 domain binds a lipo-chitooligosaccharide (LCO). In some embodiments, the modified LysM2 domain binds the LCO with higher affinity than the unmodified LysM2 domain. In some embodiments, the modified LysM2 domain binds the LCO with higher selectivity for the LCO than the unmodified LysM2 domain. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the LCO. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the lipid of the LCO. In some embodiments, the LCO is produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof.
In some embodiments of any of the above embodiments, the LysM receptor is selected from the group consisting of a LysM chitooligosaccharide (CO) receptor, a LysM LCO receptor, and a LysM peptidoglycan (PGN) receptor. In some embodiments, the hydrophobic patch is adjacent to a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is within 30 Å, 29 Å, 28 Å, 27 Å, 26 Å, 25 Å, 24 Å, 23 Å, 22 Å, 21 Å, 20 Å, 19 Å, 18 Å, 17 Å, 16 Å, 15 Å, 14 Å, 13 Å, 12 Å, 11 Å, 10 Å, 9.5 Å, 9 Å, 8.5 Å, 8 Å, 7.5 Å, 7 Å, 6.5 Å, 6 Å, 5.5 Å, 5 Å, 4.5 Å, 4 Å, 3.5 Å, 3 Å, 2.5 Å, 2 Å, 1.5 Å, or 1 Å of a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is adjacent to a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the hydrophobic patch is within 30 Å, 29 Å, 28 Å, 27 Å, 26 Å, 25 Å, 24 Å, 23 Å, 22 Å, 21 Å, 20 Å, 19 Å, 18 Å, 17 Å, 16 Å, 15 Å, 14 Å, 13 Å, 12 Å, 11 Å, 10 Å, 9.5 Å, 9 Å, 8.5 Å, 8 Å, 7.5 Å, 7 Å, 6.5 Å, 6 Å, 5.5 Å, 5 Å, 4.5 Å, 4 Å, 3.5 Å, 3 Å, 2.5 Å, 2 Å, 1.5 Å, or 1 Å of a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the LysM receptor is not an exopolysaccharide (EPS) receptor.
In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide with at least 70% sequence identity, at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% sequence identity to SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6). In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having amino acid sequence SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LJCERK6).
In some embodiments of any of the above embodiments, the hydrophobic patch was generated by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, or combinations thereof. In some embodiments, the at least one amino acid was identified by an amino acid sequence alignment with a LysM2 domain from a LysM high affinity LCO receptor that naturally has a hydrophobic patch that interacts with LCO. In some embodiments, the at least one amino acid corresponds to an amino acid that is in bold in
In some aspects, the present disclosure relates to a modified plant LysM receptor comprising a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. In some embodiments, the first LysM1 domain is modified by substituting a first part of the first LysM1 domain with a third part of a second LysM1 domain and/or by substituting a second part of the first LysM1 domain with a fourth part of the second LysM1 domain. In some embodiments, the first LysM1 domain and the second LysM1 domain have different affinities and/or selectivities for oligosaccharides and the modification of the first LysM1 domain alters the affinity, selectivity, and/or specificity to be more like the second LysM1 domain. In some embodiments, the first part and the third part correspond to SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI, SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and wherein the second part and the fourth part correspond to SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; SEQ ID NO:29 [Lotus NFR1 region IV 73-81], or LNDINIQSF. In some embodiments, the first LysM1 domain is selected from the group of SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/26-95], SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/25-95], or NFR1 DLALASYYILPGVFILQNITTFMQSEIVSSNDAITSYNKDKILNDINIQSFQRLNIPFP; and the second LysM1 domain is CERK6: ALAQASYYLLNGSNLTYISEIMQSSLLTKPEDIVSYNQDTIASKDSVQAGQRINVPFP. In some embodiments, the first part is selected from SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI; the second part is selected from SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; the third part is selected from SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and the fourth part is selected from SEQ ID NO:29 [Lotus NFR1 region IV 73-81] or LNDINIQSF. In some embodiments, the entire first LysM1 domain was replaced with the entire second LysM1 domain. In some embodiments, the modified LysM1 domain binds a lipo-chitooligosaccharide (LCO) produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified LysM1 domain binds an LCO with higher affinity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with higher selectivity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with altered specificity as compared to an unmodified LysM1 domain. In some embodiments, structural modelling was used to define the LysM1 domain and was used to identify the first part, the second part, the third part, and/or the fourth part for substitution. In some embodiments, the receptor of the above embodiments further contains a LysM2 domain modified to contain a hydrophobic patch as in any one of the previous embodiments relating to modifications to the LysM2 domain.
Additional aspects of the present disclosure relate to a modified plant LysM receptor including a LysM2 domain modified to include a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the modified LysM2 domain binds a lipo-chitooligosaccharide (LCO). In some embodiments, the modified LysM2 domain binds the LCO with higher affinity than the unmodified LysM2 domain. In some embodiments, the modified LysM2 domain binds the LCO with higher selectivity for the LCO than the unmodified LysM2 domain. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the LCO. In some embodiments, the higher affinity or higher selectivity is due to the hydrophobic patch interacting with the lipid of the LCO. In some embodiments, the LCO is produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof.
In some embodiments of any of the above embodiments, the LysM receptor is selected from the group of a LysM chitooligosaccharide (CO) receptor, a LysM LCO receptor, or a LysM peptidoglycan (PGN) receptor. In some embodiments, the hydrophobic patch is adjacent to a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is within 30 Å, 29 Å, 28 Å, 27 Å, 26 Å, 25 Å, 24 Å, 23 Å, 22 Å, 21 Å, 20 Å, 19 Å, 18 Å, 17 Å, 16 Å, 15 Å, 14 Å, 13 Å, 12 Å, 11 Å, 10 Å, 9.5 Å, 9 Å, 8.5 Å, 8 Å, 7.5 Å, 7 Å, 6.5 Å, 6 Å, 5.5 Å, 5 Å, 4.5 Å, 4 Å, 3.5 Å, 3 Å, 2.5 Å, 2 Å, 1.5 Å, or 1 Å of a chitin binding motif if the LysM receptor is the LysM CO receptor or the LysM LCO receptor. In some embodiments, the hydrophobic patch is adjacent to a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the hydrophobic patch is within 30 Å, 29 Å, 28 Å, 27 Å, 26 Å, 25 Å, 24 Å, 23 Å, 22 Å, 21 Å, 20 Å, 19 Å, 18 Å, 17 Å, 16 Å, 15 Å, 14 Å, 13 Å, 12 Å, 11 Å, 10 Å, 9.5 Å, 9 Å, 8.5 Å, 8 Å, 7.5 Å, 7 Å, 6.5 Å, 6 Å, 5.5 Å, 5 Å, 4.5 Å, 4 Å, 3.5 Å, 3 Å, 2.5 Å, 2 Å, 1.5 Å, or 1Λ of a glycan binding motif if the LysM receptor is the LysM PGN receptor. In some embodiments, the LysM receptor is not an exopolysaccharide (EPS) receptor.
In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having at least 70% sequence identity, at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% sequence identity to SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6). In some embodiments of any of the above embodiments, the LysM receptor is a polypeptide having amino acid sequence SEQ ID NO:34 (i.e., Lotus CERK6; BAI79273.1_LjCERK6).
In some embodiments of any of the above embodiments, the hydrophobic patch was generated by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, or combinations thereof. In some embodiments of any of the above embodiments, the hydrophobic patch was generated by modifying an existing hydrophobic patch in the unmodified LysM receptor. In some embodiments, the unmodified LysM receptor was modified by deleting at least one non-hydrophobic amino acid residue, substituting at least one amino acid residue with a more hydrophobic amino acid, substituting at least one hydrophobic amino acid residue with another hydrophobic amino acid residue, or combinations thereof. In some embodiments, the at least one amino acid was identified by an amino acid sequence alignment with a LysM2 domain from a LysM high affinity LCO receptor that naturally has a hydrophobic patch that interacts with LCO. In some embodiments, the at least one amino acid corresponds to an amino acid that is in bold underline in
In some aspects, the present disclosure relates to a modified plant LysM receptor including a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. In some embodiments, the first LysM1 domain is modified by substituting a first part of the first LysM1 domain with a third part of a second LysM1 domain and/or by substituting a second part of the first LysM1 domain with a fourth part of the second LysM1 domain. In some embodiments, the first LysM1 domain and the second LysM1 domain have different affinities, selectivities, and/or specificities for oligosaccharides and the modification of the first LysM1 domain alters the affinity, selectivity, and/or specificity to be more like the second LysM1 domain. In some embodiments, the first part and the third part correspond to SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI, SEQ ID NO:28 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and wherein the second part and the fourth part correspond to SEQ ID NO:31 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; SEQ ID NO:29 [Lotus NFR1 region IV 73-81], or LNDINIQSF. In some embodiments, the first LysM1 domain is selected from the group of SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/26-95], SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/25-95], or NFR1 DLALASYYILPGVFILQNITTFMQSEIVSSNDAITSYNKDKILNDINIQSFQRLNIPFP; and the second LysM1 domain is CERK6: ALAQASYYLLNGSNLTYISEIMQSSLLTKPEDIVSYNQDTIASKDSVQAGQRINVPFP. In some embodiments, the first part is selected from SEQ ID NO:30 [Lotus CERK6 region II 43-53] or NGSNLTYISEI; the second part is selected from SEQ ID NO:28 [Lotus CERK6 region IV 74-82] or ASKDSVQAG; the third part is selected from SEQ ID NO:31 [Lotus NFR1 region II 41-52] or PGVFILQNITTF; and the fourth part is selected from SEQ ID NO:29 [Lotus NFR1 region IV 73-81] or LNDINIQSF. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the first LysM1 domain is further modified by substituting a fifth part of the first LysM1 domain with a sixth part of a second LysM1 domain. In some embodiments, the first LysM1 domain is SEQ ID NO:115 [LysM1 domain Lotus NFR1; LjNFR1/32-89] or SEQ ID NO:106 [LysM1 domain Lotus NFR1; LjNFR1/31-89] and the second LysM1 domain is SEQ ID NO:114 [LysM1 domain Medicago LYK3; MtLYK3/31-89] or SEQ ID NO:105 [LysM1 domain Medicago LYK3; MtLYK3/30-89]. In some embodiments, wherein the fifth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/56-92], and wherein the sixth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62]. In some embodiments, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82]. In some embodiments, the first LysM1 domain is SEQ ID NO:33 [LysM1 domain Medicago LYK3; MtLYK3/31-89] and the second LysM1 domain is SEQ ID NO:32 [LysM1 domain Lotus NFR1; LjNFR1/32-89]. In some embodiments, the fifth part is SEQ ID NO:46 [Medicago LYK3 region III 57-62; MtLYK3/57-62], and the sixth part is SEQ ID NO:53 [Lotus NFR1 region III 59-62; LjNFR1/59-62]. In some embodiments of any of the above embodiments including the first LysM1 domain being SEQ ID NO:33 and the second LysM1 domain being SEQ ID NO:32, the first LysM1 domain is modified by substituting a seventh part of the first LysM1 domain, wherein the seventh part spans the first part of the first LysM1 domain, the second part of the first LysM1 domain, and the fifth part of the first LysM1 domain, with an eighth part of the second LysM1 domain, wherein the eighth part spans the third part of the second LysM1 domain, the fourth part of the second LysM1 domain, and the sixth part of the second LysM1 domain. In some embodiments, the seventh part of the first LysM1 domain is SEQ ID NO:51 [Lotus NFR1 regions II-IV 41-82; LjNFR1/41-82], and the eighth part of the second LysM1 domain is SEQ ID NO:113 [Medicago LYK3 regions II-IV 40-82; MtLYK3/40-82] or SEQ ID NO:104 [Medicago LYK3 regions II-IV 41-82; MtLYK3/41-82].
In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, the entire first LysM1 domain was replaced with the entire second LysM1 domain. In some embodiments of any of the above embodiments including the first LysM1 domain being modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain, either or both (i) 80% or fewer, 79% or fewer, 78% or fewer, 77% or fewer, 76% or fewer, 75% or fewer, 74% or fewer, 73% or fewer, 72% or fewer, 71% or fewer, 70% or fewer, 69% or fewer, 68% or fewer, 67% or fewer, 66% or fewer, 65% or fewer, 64% or fewer, 63% or fewer, 62% or fewer, 61% or fewer, 60% or fewer, 59% or fewer, 58% or fewer, 57% or fewer, 56% or fewer, 55% or fewer, 54% or fewer, 53% or fewer, 52% or fewer, 51% or fewer, 50% or fewer, 49% or fewer, 48% or fewer, 47% or fewer, 46% or fewer, 45% or fewer, 44% or fewer, 43% or fewer, 42% or fewer, 41% or fewer, 40% or fewer, 39% or fewer, 38% or fewer, 37% or fewer, 36% or fewer, 35% or fewer, 34% or fewer, 33% or fewer, 32% or fewer, 31% or fewer, 30% or fewer, 29% or fewer, 28% or fewer, 27% or fewer, 26% or fewer, 25% or fewer, 24% or fewer, 23% or fewer, 22% or fewer, 21% or fewer, or 20% or fewer of amino acid residues in the first LysM1 domain were substituted or deleted with the corresponding amino acid residues of the second LysM1 domain, and (ii) the entire LysM1 domain in the unmodified plant LysM receptor was not substituted with another entire LysM2 domain to generate the modified plant LysM receptor. In some embodiments, the modified LysM1 domain binds a lipo-chitooligosaccharide (LCO) produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCO is produced by nitrogen-fixing bacteria selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof, or by mycorrhizal fungi selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof. In some embodiments, the modified LysM1 domain binds an LCO with higher affinity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with higher selectivity than an unmodified LysM1 domain. In some embodiments, the modified LysM1 domain binds LCOs with altered specificity as compared to an unmodified LysM1 domain. In some embodiments, structural modelling was used to define the LysM1 domain and was used to identify the first part, the second part, the third part, and/or the fourth part for substitution. In some embodiments, the unmodified plant LysM receptor was selected using the method of any one of the aspects of the present disclosure relating to such selection including any and all embodiments thereof and the second LysM2 domain is from the donor plant LysM receptor. In some embodiments, the receptor of the above embodiments further contains a LysM2 domain modified to contain a hydrophobic patch as in any one of the previous embodiments relating to modifying the LysM2 domain.
LysM receptors are a well known and well understood type of receptor. LysM receptors have three characteristic domains located in the ectodomain of the protein: LysM1, LysM2, and LysM3, which are present in this order on the protein sequence. The LysM1 domain is located toward the N-terminal end of the protein sequence, and is preceded by an N-terminal signal peptide as well as a C(x)xxxC motif. The LysM1 domain is separated from the LysM2 domain by a CxC motif, and the LysM2 domain is separated from the LysM3 domain by a CxC motif as well. The three LysM domains, as well as the C(x)xxxC and CxC motif are clearly shown in
As used in the present disclosure, the term “affinity” refers to affinity for LCOs generally. The LysM receptors of the present disclosure may contain a hydrophobic patch in their LysM2 domain. Without wanting to be limited to theory, it is believed that LysM receptors with the hydrophobic patch have higher affinity for LCOs as compared to LysM receptors without the hydrophobic patch, but LysM receptors with domain-swapped LysM1 domains would also provide higher affinity for LCOs and other agonists. Affinity can be measured using the methods described in the Examples below, and using other methods known in the art that measure binding kinetics, association, dissociation, and KD.
As used in the present disclosure, the term “selectivity” refers to the differentiation between different polysaccharide ligands, specifically between lipo-chitooligosaccharides (LCOs) as a class and other polysaccharide ligands, preferably chitooligosaccharides (COs). Without wanting to be limited to theory, it is believed that this hydrophobic patch confers selective recognition of LCOs over COs, and that therefore LysM receptors with the hydrophobic patch have increased selectivity as compared to LysM receptors without the hydrophobic patch. In addition, the LysM receptors with domain-swapped LysM1 domains should also have higher or altered selectivity depend upon the choice of the donor receptor.
As used in the present disclosure, the term “specificity” refers to the differentiation between different lipo-chitooligosaccharides (LCOs) produced by different nitrogen-fixing bacterial species and/or mycorrhizal fungi. The LysM receptors of the present disclosure may contain a LysM1 domain where regions (e.g., partial, entire) in the LysM1 domain have been replaced with the corresponding regions of the LysM1 domain from a donor LysM receptor. Without wanting to be limited to theory, it is believed that if the donor LysM receptor is a high affinity and specificity LCO LysM receptor such as a legume NFR1 receptor, this replacement can alter the specificity of the LysM receptor, but LysM receptors with a hydrophobic patch in the LysM2 domain may also provide specificity for specific LCOs. The LysM1 domain is clearly shown in
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, comprising a nucleic acid sequence encoding a modified plant LysM receptor of any one of the embodiments described in the section “Modified plant LysM receptors”. In some embodiments, the modified plant LysM receptor has higher selectivity and/or affinity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high selectivity and/or affinity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the root cell is a root cortex cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn (e.g., maize, Zea mays), rice (e.g., Oryza sativa, Oryza glaberrima, Zizania spp.), barley (e.g., Hordeum vulgare), wheat (e.g., common wheat, spelt, durum, Triticum aestivum, Triticum spelta, Triticum durum, Triticum spp.), Trema spp. (e.g., Trema cannabina, Trema cubense, Trema discolor, Trema domingensis, Trema integerrima, Trema lamarckiana, Trema micrantha, Trema orientalis, Trema philippinensis, Trema strigilosa, Trema tomentosa), apple (e.g., Malus pumila), pear (e.g., Pyrus communis, Pyrus×bretschneideri, Pyrus pyrifolia, Pyrus sinkiangensis, Pyrus pashia, Pyrus spp.), plum (e.g., prune, damson, bullaces, Prunus domestica, Prunus salicina), apricot (e.g., Prunus armeniaca, Prunus brigantina, Prunus mandshurica, Prunus mume, Prunus sibirica), peach (e.g., nectarine, Prunus persica), almond (e.g., Prunus dulcis, Prunus amygdalus), walnut (e.g., Persian walnut, English walnut, black walnut, Juglans regia, Juglans nigra, Juglans cinerea, Juglans californica), strawberry (e.g., Fragaria×ananassa, Fragaria chiloensis, Fragaria virginiana, Fragaria vesca), raspberry (e.g., European red raspberry, black raspberry, Rubus idaeus, Rubus occidentalis, Rubus strigosus), blackberry (e.g., evergreen blackberry, Himalayan blackberry, Rubus fruticosus, Rubus ursinus, Rubus laciniatus, Rubus argutus, Rubus armeniacus, Rubus plicatus, Rubus ulmifolius, Rubus allegheniensis), red currant (e.g., Ribes rubrum, Ribes spicatum, Ribesbes alpinum, Ribes schlechtendalii, Ribes multiflorum, Ribes petraeum, Ribes triste), black currant (e.g., Ribes nigrum), melon (e.g., watermelon, winter melon, casabas, cantaloupe, honeydew, muskmelon, Citrullus lanatus, Benincasa hispida, Cucumis melo cantalupensis, Cucumis melo inodorus, Cucumis melo reticulatus), cucumber (e.g., slicing cucumbers, pickling cucumbers, English cucumber, Cucumis sativus), pumpkin (e.g., Cucurbita pepo, Cucurbita maxima), squash (e.g., gourd, Cucurbita argyrosperma, Cucurbita ficifolia, Cucurbita maxima, Cucurbita moschata), grape (e.g., Vitis vinifera, Vitis amurensis, Vitis labrusca, Vitis mustangensis, Vitis riparia, Vitis rotundifolia), bean (e.g., Phaseolus vulgaris, Phaseolus lunatus, Vigna angularis, Vigna radiate, Vigna mungo, Phaseolus coccineus, Vigna umbellate, Vigna acontifolia, Phaseolus acutifolius, Vicia faba, Vicia faba equine, Phaseolus spp., Vigna spp.), soybean (e.g., soy, soya bean, Glycine max, Glycine soja), pea (e.g., Pisum spp., Pisum sativum var. sativum, Pisum sativum var. arvense), chickpea (e.g., garbanzo, Bengal gram, Cicer arietinum), cowpea (e.g., black-eyed pea, blackeye bean, Vigna unguiculata), pigeon pea (e.g., Arhar/Toor, cajan pea, Congo bean, gandules, Caganus cajan), lentil (e.g., Lens culinaris), Bambara groundnut (e.g., earth pea, Vigna subterranea), lupin (e.g., Lupinus spp.), pulses (e.g., minor pulses, Lablab purpureaus, Canavalia ensiformis, Canavalia gladiate, Psophocarpus tetragonolobus, Mucuna pruriens var. utilis, Pachyrhizus erosus), Medicago spp. (e.g., Medicago sativa, Medicago truncatula, Medicago arborea), Lotus spp. (e.g., Lotus japonicus), forage legumes (e.g., Leucaena spp., Albizia spp., Cyamopsis spp., Sesbania spp., Stylosanthes spp., Trifolium spp., Vicia spp.), indigo (e.g., Indigofera spp., Indigofera tinctoria, Indigofera suffruticosa, Indigofera articulata, Indigofera oblongifolia, Indigofera aspalthoides, Indigofera suffruticosa, Indigofera arrecta), legume trees (e.g., locust trees, Gleditsia spp., Robinia spp., Kentucky coffeetree, Gymnocladus dioicus, Acacia spp., Laburnum spp., Wisteria spp.), or hemp (e.g., cannabis, Cannabis sativa).
In some aspects, the present disclosure relates to a genetically altered plant or part thereof, comprising a first nucleic acid sequence encoding a modified plant LysM receptor where the LysM1 domain has been modified as in any of the preceding embodiments relating to modification of the LysM1 domain and a second nucleic acid sequence encoding a modified plant LysM receptor where the LysM2 domain has been modified to include a hydrophobic patch as in any of the preceding embodiments relating to modification of the LysM2 domain. In some embodiments, the modified plant LysM receptor has higher selectivity and/or affinity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high selectivity and/or affinity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the root cell is a root cortex cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the first nucleic acid or second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn (e.g., maize, Zea mays), rice (e.g., Oryza sativa, Oryza glaberrima, Zizania spp.), barley (e.g., Hordeum vulgare), wheat (e.g., common wheat, spelt, durum, Triticum aestivum, Triticum spelta, Triticum durum, Triticum spp.), Trema spp. (e.g., Trema cannabina, Trema cubense, Trema discolor, Trema domingensis, Trema integerrima, Trema lamarckiana, Trema micrantha, Trema orientalis, Trema philippinensis, Trema strigilosa, Trema tomentosa), apple (e.g., Malus pumila), pear (e.g., Pyrus communis, Pyrus×bretschneideri, Pyrus pyrifolia, Pyrus sinkiangensis, Pyrus pashia, Pyrus spp.), plum (e.g., prune, damson, bullaces, Prunus domestica, Prunus salicina), apricot (e.g., Prunus armeniaca, Prunus brigantina, Prunus mandshurica, Prunus mume, Prunus sibirica), peach (e.g., nectarine, Prunus persica), almond (e.g., Prunus dulcis, Prunus amygdalus), walnut (e.g., Persian walnut, English walnut, black walnut, Juglans regia, Juglans nigra, Juglans cinerea, Juglans californica), strawberry (e.g., Fragaria×ananassa, Fragaria chiloensis, Fragaria virginiana, Fragaria vesca), raspberry (e.g., European red raspberry, black raspberry, Rubus idaeus, Rubus occidentalis, Rubus strigosus), blackberry (e.g., evergreen blackberry, Himalayan blackberry, Rubus fruticosus, Rubus ursinus, Rubus laciniatus, Rubus argutus, Rubus armeniacus, Rubus plicatus, Rubus ulmifolius, Rubus allegheniensis), red currant (e.g., Ribes rubrum, Ribes spicatum, Ribesbes alpinum, Ribes schlechtendalii, Ribes multiflorum, Ribes petraeum, Ribes triste), black currant (e.g., Ribes nigrum), melon (e.g., watermelon, winter melon, casabas, cantaloupe, honeydew, muskmelon, Citrullus lanatus, Benincasa hispida, Cucumis melo cantalupensis, Cucumis melo inodorus, Cucumis melo reticulatus), cucumber (e.g., slicing cucumbers, pickling cucumbers, English cucumber, Cucumis sativus), pumpkin (e.g., Cucurbita pepo, Cucurbita maxima), squash (e.g., gourd, Cucurbita argyrosperma, Cucurbita ficifolia, Cucurbita maxima, Cucurbita moschata), grape (e.g., Vitis vinifera, Vitis amurensis, Vitis labrusca, Vitis mustangensis, Vitis riparia, Vitis rotundifolia), bean (e.g., Phaseolus vulgaris, Phaseolus lunatus, Vigna angularis, Vigna radiate, Vigna mungo, Phaseolus coccineus, Vigna umbellata, Vigna acontifolia, Phaseolus acutifolius, Vicia faba, Vicia faba equine, Phaseolus spp., Vigna spp.), soybean (e.g., soy, soya bean, Glycine max, Glycine soja), pea (e.g., Pisum spp., Pisum sativum var. sativum, Pisum sativum var. arvense), chickpea (e.g., garbanzo, Bengal gram, Cicer arietinum), cowpea (e.g., black-eyed pea, blackeye bean, Vigna unguiculata), pigeon pea (e.g., Arhar/Toor, cajan pea, Congo bean, gandules, Caganus cajan), lentil (e.g., Lens culinaris), Bambara groundnut (e.g., earth pea, Vigna subterranea), lupin (e.g., Lupinus spp.), pulses (e.g., minor pulses, Lablab purpureaus, Canavalia ensiformis, Canavalia gladiate, Psophocarpus tetragonolobus, Mucuna pruriens var. utilis, Pachyrhizus erosus), Medicago spp. (e.g., Medicago sativa, Medicago truncatula, Medicago arborea), Lotus spp. (e.g., Lotus japonicus), forage legumes (e.g., Leucaena spp., Albizia spp., Cyamopsis spp., Sesbania spp., Stylosanthes spp., Trifolium spp., Vicia spp.), indigo (e.g., Indigofera spp., Indigofera tinctoria, Indigofera suffruticosa, Indigofera articulata, Indigofera oblongifolia, Indigofera aspalthoides, Indigofera suffruticosa, Indigofera arrecta), legume trees (e.g., locust trees, Gleditsia spp., Robinia spp., Kentucky coffeetree, Gymnocladus dioicus, Acacia spp., Laburnum spp., Wisteria spp.), or hemp (e.g., cannabis, Cannabis sativa). In some embodiments, the plant part is a leaf, a stem, a root, a root primordia, a flower, a seed, a fruit, a kernel, a grain, a cell, or a portion thereof. In some embodiments, the plant part is a fruit, a kernel, or a grain.
In some aspects, the present disclosure relates to a pollen grain or an ovule of a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a protoplast from a genetically altered plant of any of the above embodiments relating to plants.
In some aspects, the present disclosure relates to a tissue culture produced from protoplasts or cells from a genetically altered plant of any of the above embodiments relating to plants, wherein the cells or protoplasts are produced from a plant part selected from the group consisting of leaf, anther, pistil, stem, petiole, root, root primordia, root tip, fruit, seed, flower, cotyledon, hypocotyl, embryo, and meristematic cell.
Certain aspects of the present disclosure relate to a method of producing the genetically altered plant of any one of the above embodiments relating to plants as described in the section “Genetically altered plants and seeds”, comprising introducing a genetic alteration to the plant comprising the nucleic acid sequence. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter (KAY et al. Science, 236, 4805, 1987), a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is inserted into the genome of the plant so that the nucleic acid sequence, the first nucleic acid sequence, and/or the second nucleic acid sequence is operably linked to an endogenous promoter. In some embodiments, the endogenous promoter is a root specific promoter.
In some aspects, the present disclosure relates to a method of producing a genetically altered plant able to recognize LCOs, comprising the steps of: introducing a genetic alteration to the plant comprising the provision of an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized, thereby enabling the plant to recognize LCOs.
In some aspects, the present disclosure relates to a method of producing a genetically altered plant able to recognize LCOs, comprising the steps of: introducing a genetic alteration to the plant comprising the provision of an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized with high affinity, high selectivity, and/or high specificity, thereby enabling the plant to recognize LCOs with high affinity, high selectivity, and/or high specificity.
In some aspects, the present disclosure relates to a method of producing a genetically altered plant able to recognize LCOs produced by a specific nitrogen-fixing bacterial species and/or a specific mycorrhizal fungal species, comprising the steps of: introducing a genetic alteration to the plant comprising the provision of an ability for LCOs produced by the specific nitrogen-fixing bacterial species and/or the specific mycorrhizal fungal species to be recognized with altered specificity, thereby enabling the plant to recognize LCOs with altered specificity. In some embodiments, the genetic alteration allows the genetically altered plant to recognize a different specific nitrogen-fixing bacterial species and/or specific mycorrhizal fungal species as compared to a plant without the genetic alteration. In some embodiments, the genetically altered plant is able to be grown in different agricultural conditions (e.g., different soils containing different symbiotic microbial species, etc.). In some embodiments, the genetic alteration allows the genetically altered plant to be grown in different agricultural conditions containing specific bacterial strains producing LCOs detected with high specificity, sensitivity, and/or selectivity by the genetically altered plant. In some embodiments, the bacterial strains are added as a seed coating or as a soil inoculum. In some embodiments, the genetically altered plant is able to be grown with different crop species (e.g., different crop rotations, etc.).
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs, comprising the steps of: providing a seed with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized, wherein the seed produces a plant with the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi; cultivating the plant under conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil.
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs with high affinity, high selectivity, and/or high specificity, comprising the steps of: providing a seed with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized with high affinity, high selectivity, and/or high specificity, wherein the seed produces a plant with the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high affinity, high selectivity, and/or high specificity; cultivating the plant under conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high affinity, high selectivity, and/or high specificity results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil.
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs with altered specificity, comprising the steps of: providing a seed with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized with altered specificity, wherein the seed produces a plant with the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high specificity; cultivating the plant under conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with altered specificity results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil. In some embodiments, the genetic alteration allows the genetically altered plant to recognize a different specific nitrogen-fixing bacterial species and/or specific mycorrhizal fungal species as compared to a plant without the genetic alteration. In some embodiments, the genetically altered plant is able to be grown in different agricultural conditions (e.g., different soils containing different symbiotic microbial species, etc.). In some embodiments, the genetically altered plant is able to be grown with different crop species (e.g., different crop rotations, etc.).
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs, comprising the steps of: providing a tissue culture or protoplast with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized; regenerating the tissue culture or protoplast into a plantlet; growing the plantlet into a plant, wherein the plant has the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi; transplanting the plant into conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil.
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs with high affinity, high selectivity, and/or high specificity, comprising the steps of: providing a tissue culture or protoplast with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized with high affinity, high selectivity, and/or high specificity, regenerating the tissue culture or protoplast into a plantlet; growing the plantlet into a plant, wherein the plant has the ability to recognize LCOs produced by produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high affinity, high selectivity, and/or high specificity; transplanting the plant into conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high affinity, high selectivity, and/or high specificity results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil.
In some aspects, the present disclosure relates to a method of cultivating a plant with the ability to recognize LCOs with altered specificity, comprising the steps of: providing a tissue culture or protoplast with one or more genetic alterations that provide an ability for LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi to be recognized with altered specificity, regenerating the tissue culture or protoplast into a plantlet; growing the plantlet into a plant, wherein the plant has the ability to recognize LCOs produced by produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with high specificity; transplanting the plant into conditions where the ability to recognize LCOs produced by nitrogen-fixing bacteria and/or mycorrhizal fungi with altered specificity results in increased growth, yield, and/or biomass, as compared to a plant grown under the same conditions that lacks the one or more genetic alterations. In some embodiments, the plant is cultivated in nutrient-poor soil. In some embodiments, the genetic alteration allows the genetically altered plant to recognize a different specific nitrogen-fixing bacterial species and/or specific mycorrhizal fungal species as compared to a plant without the genetic alteration. In some embodiments, the genetically altered plant is able to be grown in different agricultural conditions (e.g., different soils containing different symbiotic microbial species, etc.). In some embodiments, the genetically altered plant is able to be grown with different crop species (e.g., different crop rotations, etc.).
In some embodiments of any of the above methods, the ability to recognize LCOs is conferred by a nucleic acid sequence encoding a modified plant LysM receptor of any one of the embodiments described in the section “Modified plant LysM receptors”. In some embodiments, the modified plant LysM receptor has higher selectivity and/or affinity for LCOs than the unmodified plant LysM receptor and the expression of the modified plant LysM receptor allows the plant or part thereof to recognize LCOs with high selectivity and/or affinity. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria or by mycorrhizal fungi. In some embodiments, the LCOs are produced by nitrogen-fixing bacteria selected from the group consisting of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., and any combination thereof, or by or by mycorrhizal fungi selected from the group consisting of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, and any combination thereof. In some embodiments, the modified polypeptide is localized to a plant cell plasma membrane. In some embodiments, the plant cell is a root cell. In some embodiments, the root cell is a root epidermal cell. In some embodiments, the root cell is a root cortex cell. In some embodiments, the modified polypeptide is expressed in a developing plant root system. In some embodiments, the nucleic acid sequence is operably linked to a promoter. In some embodiments, the promoter is a root specific promoter. In some embodiments, the promoter is selected from the group of a NFR1/LYK3/CERK6 or NFR5/NFP promoter, the Lotus NFR5 promoter (SEQ ID NO:24) and the Lotus NFR1 promoters (SEQ ID NO:25) the maize allothioneine promoter, the chitinase promoter, the maize ZRP2 promoter, the tomato LeExt1 promoter, the glutamine synthetase soybean root promoter, the RCC3 promoter, the rice antiquitine promoter, the LRR receptor kinase promoter, or the Arabidopsis pCO2 promoter. In some embodiments, the promoter is a constitutive promoter optionally selected from the group of the CaMV35S promoter, a derivative of the CaMV35S promoter, the maize ubiquitin promoter, the trefoil promoter, a vein mosaic cassava virus promoter, or the Arabidopsis UBQ10 promoter. In some embodiments, the plant is selected from the group of corn (e.g., maize, Zea mays), rice (e.g., Oryza sativa, Oryza glaberrima, Zizania spp.), barley (e.g., Hordeum vulgare), wheat (e.g., common wheat, spelt, durum, Triticum aestivum, Triticum spelta, Triticum durum, Triticum spp.), Trema spp. (e.g., Trema cannabina, Trema cubense, Trema discolor, Trema domingensis, Trema integerrima, Trema lamarckiana, Trema micrantha, Trema orientalis, Trema philippinensis, Trema strigilosa, Trema tomentosa), apple (e.g., Malus pumila), pear (e.g., Pyrus communis, Pyrus xbretschneideri, Pyrus pyrifolia, Pyrus sinkiangensis, Pyrus pashia, Pyrus spp.), plum (e.g., prune, damson, bullaces, Prunus domestica, Prunus salicina), apricot (e.g., Prunus armeniaca, Prunus brigantina, Prunus mandshurica, Prunus mume, Prunus sibirica), peach (e.g., nectarine, Prunus persica), almond (e.g., Prunus dulcis, Prunus amygdalus), walnut (e.g., Persian walnut, English walnut, black walnut, Juglans regia, Juglans nigra, Juglans cinerea, Juglans californica), strawberry (e.g., Fragaria×ananassa, Fragaria chiloensis, Fragaria virginiana, Fragaria vesca), raspberry (e.g., European red raspberry, black raspberry, Rubus idaeus, Rubus occidentalis, Rubus strigosus), blackberry (e.g., evergreen blackberry, Himalayan blackberry, Rubus fruticosus, Rubus ursinus, Rubus laciniatus, Rubus argutus, Rubus armeniacus, Rubus plicatus, Rubus ulmifolius, Rubus allegheniensis), red currant (e.g., Ribes rubrum, Ribes spicatum, Ribesbes alpinum, Ribes schlechtendalii, Ribes multiflorum, Ribes petraeum, Ribes triste), black currant (e.g., Ribes nigrum), melon (e.g., watermelon, winter melon, casabas, cantaloupe, honeydew, muskmelon, Citrullus lanatus, Benincasa hispida, Cucumis melo cantalupensis, Cucumis melo inodorus, Cucumis melo reticulatus), cucumber (e.g., slicing cucumbers, pickling cucumbers, English cucumber, Cucumis sativus), pumpkin (e.g., Cucurbita pepo, Cucurbita maxima), squash (e.g., gourd, Cucurbita argyrosperma, Cucurbita ficifolia, Cucurbita maxima, Cucurbita moschata), grape (e.g., Vitis vinifera, Vitis amurensis, Vitis labrusca, Vitis mustangensis, Vitis riparia, Vitis rotundifolia), bean (e.g., Phaseolus vulgaris, Phaseolus lunatus, Vigna angularis, Vigna radiate, Vigna mungo, Phaseolus coccineus, Vigna umbellate, Vigna acontifolia, Phaseolus acutifolius, Vicia faba, Vicia faba equine, Phaseolus spp., Vigna spp.), soybean (e.g., soy, soya bean, Glycine max, Glycine soja), pea (e.g., Pisum spp., Pisum sativum var. sativum, Pisum sativum var. arvense), chickpea (e.g., garbanzo, Bengal gram, Cicer arietinum), cowpea (e.g., black-eyed pea, blackeye bean, Vigna unguiculata), pigeon pea (e.g., Arhar/Toor, cajan pea, Congo bean, gandules, Caganus cajan), lentil (e.g., Lens culinaris), Bambara groundnut (e.g., earth pea, Vigna subterranea), lupin (e.g., Lupinus spp.), pulses (e.g., minor pulses, Lablab purpureaus, Canavalia ensiformis, Canavalia gladiate, Psophocarpus tetragonolobus, Mucuna pruriens var. utilis, Pachyrhizus erosus), Medicago spp. (e.g., Medicago sativa, Medicago truncatula, Medicago arborea), Lotus spp. (e.g., Lotus japonicus), forage legumes (e.g., Leucaena spp., Albizia spp., Cyamopsis spp., Sesbania spp., Stylosanthes spp., Trifolium spp., Vicia spp.), indigo (e.g., Indigofera spp., Indigofera tinctoria, Indigofera suffruticosa, Indigofera articulata, Indigofera oblongifolia, Indigofera aspalthoides, Indigofera suffruticosa, Indigofera arrecta), legume trees (e.g., locust trees, Gleditsia spp., Robinia spp., Kentucky coffeetree, Gymnocladus dioicus, Acacia spp., Laburnum spp., Wisteria spp.), or hemp (e.g., cannabis, Cannabis sativa).
One embodiment of the present invention provides a genetically altered plant or plant cell comprising one or more modified endogenous plant genes. For example, the present disclosure provides plants with genetically altered LysM receptors modified to include a hydrophobic patch or alter the hydrophobic patch in the LysM2 domain and plants with genetically altered LysM receptors modified by replacing regions in the LysM1 domain with corresponding donor LysM1 domain regions. Plants with these modified receptors can have increased affinity, selectivity, and/or specificity for LCOs.
Certain aspects of the present disclosure relate to methods for selection of a target plant LysM receptor for modifying the target plant LysM receptor to have a desired receptor characteristic, wherein the method includes the steps of: a) providing a structural model, a molecular model, a surface characteristics model, and/or an electrostatic potential model of a donor plant LysM receptor having the desired receptor characteristic and two or more potential target plant LysM receptors; b) comparing each of the two or more potential target plant LysM receptors with the structural model, the molecular model, the surface characteristics model, and/or the electrostatic potential model of the donor plant LysM receptor, and/or comparing each of the two or more potential target plant LysM receptors with the donor plant LysM receptor using structural overlay; and c) selecting the potential target plant LysM receptor with a suitable match for the donor plant LysM receptor to be the target plant LysM receptor. In some embodiments, the criteria for determining that the potential target plant LysM receptor is a suitable match for the donor plant LysM receptor in step (c) are selected from the group of goodness of fit to template structure; similarity; phylogenetic relation; surface potential; coverage to template structure; GMQE, QMEAN, and Local Quality estimates from SWISS-Model; or any combination thereof. In some embodiments, the structural model of a donor plant LysM receptor is a protein crystal structure, a molecular model, a cryo-EM structure, and a NMR structure. In some embodiments, the donor plant LysM receptor model is of an entire ectodomain and the two or more potential target plant LysM receptor models are of entire ectodomains. In some embodiments, the donor plant LysM receptor model is of a LysM1 domain, a LysM2 domain, a LysM3 domain, or any combination thereof, and the two or more potential target plant LysM receptor models are of LysM1 domains, LysM2 domains, LysM3 domains, or any combination thereof.
In some embodiments, the donor plant LysM receptor is Medicago NFP, Medicago LYK3, Lotus NFR1, Lotus NFR5, Lotus LYS11, or Arabidopsis CERK1. In some embodiments, the two or more target plant LysM receptors are additionally compared to Lotus CERK6. In some embodiments, the two or more potential target plant LysM receptor polypeptides are all from the same plant species or plant variety. In some embodiments, the desired receptor characteristic is affinity, selectivity, and/or specificity for an oligosaccharide or class of oligosaccharides. In some embodiments, the desired receptor characteristic is binding kinetics for an oligosaccharide or class of oligosaccharides, wherein the binding kinetics include off-rate and on-rate. In some embodiments, the class of oligosaccharides is selected from the group of LCOs, COs, beta-glucans, cyclic-beta-glucans, exopolysaccharides, or optionally LPS. In some embodiments, the class of oligosaccharides is LCOs or COs. In some embodiments, the class of oligosaccharides is LCOs, optionally produced by a produced by a nitrogen-fixing bacteria optionally selected from the group of Mesorhizobium loti, Mesorhizobium huakuii, Mesorhizobium mediterraneum, Mesorhizobium ciceri, Mesorhizobium spp., Rhizobium mongolense, Rhizobium tropici, Rhizobium etli phaseoli, Rhizobium giardinii, Rhizobium leguminosarum optionally R. leguminosarum trifolii, R. leguminosarum viciae, and R. leguminosarum phaseoli, Burkholderiales optionally symbionts of Mimosa, Sinorhizobium meliloti, Sinorhizobium medicae, Sinorhizobium fredii, Sinorhizobium NGR234, Azorhizobium caulinodans, Bradyrhizobium japonicum, Bradyrhizobium elkanii, Bradyrhizobium liaonginense, Frankia spp., or any combination thereof; or optionally produced by a mycorrhizal fungi optionally selected from the group of Acaulosporaceae spp., Diversisporaceae spp., Gigasporaceae spp., Pacisporaceae spp., Funneliformis spp., Glomus spp., Rhizophagus spp., Sclerocystis spp., Septoglomus spp., Claroideoglomus spp., Ambispora spp., Archaeospora spp., Geosiphon pyriformis, Paraglomus spp., other species in the division Glomeromycota, or any combination thereof. In some embodiments, the LCOs are M. loti LCO, S. meliloti LCO-IV, or S. meliloti LCO-V.
In some embodiments, the method further includes step d) identifying one or more amino acid residues for modification in the target LysM receptor by comparing amino acid residues of a first oligosaccharide binding feature in the donor plant LysM receptor with the corresponding amino acid residues in the target plant LysM receptor, and optionally identifying one or more amino acid residues for modification in the target LysM receptor by comparing amino acid residues of a second oligosaccharide binding feature in the donor plant LysM receptor with the corresponding amino acid residues in the target plant LysM receptor. In some embodiments, the method further includes step e) generating a modified plant LysM receptor wherein the one or more amino acid residues in the first oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM; generating a modified plant LysM receptor wherein the one or more amino acid residues in the second oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM; or generating a modified plant LysM receptor wherein the one or more amino acid residues in the first oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM and the one or more amino acid residues in the second oligosaccharide binding feature of the target plant LysM receptor have been substituted with corresponding amino acid residues from the donor LysM. In some embodiments, the first oligosaccharide binding feature is a hydrophobic patch on the surface of the LysM2 domain. In some embodiments, the second oligosaccharide binding feature is a part of the LysM1 domain of the donor plant LysM receptor. In some embodiments, the modified LysM receptor is an endogenous LysM receptor modified using one or more gene editing components. In some embodiments, the one or more gene editing components target a nuclear genome sequence operably linked to a nucleic acid encoding an endogenous LysM receptor (e.g., a soybean LysM receptor). A further embodiment of this aspect includes the one or more gene editing components being selected from the group of a ribonucleoprotein complex that targets the nuclear genome sequence; a vector comprising a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector comprising a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (ODN), wherein the ODN targets the nuclear genome sequence; or a vector comprising a CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence.
Additional aspects of the present disclosure relate to a modified plant LysM receptor produced using any one of the above methods, wherein the modified plant LysM receptor includes a LysM2 domain modified to comprise a hydrophobic patch on the surface of the LysM2 domain. Further aspects of the present disclosure relate to a modified plant LysM receptor produced using any one of the above methods, wherein the modified plant LysM receptor includes a first LysM1 domain modified to replace at least part of the first LysM1 domain with at least part of a second LysM1 domain. The modified plant LysM receptors of the present disclosure may be used to produce the genetically altered plant of any one of the above embodiments relating to plants as described in the section “Genetically altered plants and seeds”
Transformation and generation of genetically altered monocotyledonous and dicotyledonous plant cells is well known in the art. See, e.g., Weising, et al., Ann. Rev. Genet. 22:421-477 (1988); U.S. Pat. No. 5,679,558; Agrobacterium Protocols, ed: Gartland, Humana Press Inc. (1995); and Wang, et al. Acta Hort. 461:401-408 (1998). The choice of method varies with the type of plant to be transformed, the particular application and/or the desired result. The appropriate transformation technique is readily chosen by the skilled practitioner.
Any methodology known in the art to delete, insert or otherwise modify the cellular DNA (e.g., genomic DNA and organelle DNA) can be used in practicing the inventions disclosed herein. For example, a disarmed Ti plasmid, containing a genetic construct for deletion or insertion of a target gene, in Agrobacterium tumefaciens can be used to transform a plant cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using procedures described in the art, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and published European Patent application (“EP”) 0242246. Ti-plasmid vectors each contain the gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example in EP 0233247), pollen mediated transformation (as described, for example in EP 0270356, PCT publication WO 85/01856, and U.S. Pat. No. 4,684,611), plant RNA virus-mediated transformation (as described, for example in EP 0 067 553 and U.S. Pat. No. 4,407,956), liposome-mediated transformation (as described, for example in U.S. Pat. No. 4,536,475), and other methods such as the methods for transforming certain lines of corn (e.g., U.S. Pat. No. 6,140,553; Fromm et al., Bio/Technology (1990) 8, 833 839); Gordon-Kamm et al., The Plant Cell, (1990) 2, 603 618) and rice (Shimamoto et al., Nature, (1989) 338, 274 276; Datta et al., Bio/Technology, (1990) 8, 736 740) and the method for transforming monocots generally (PCT publication WO 92/09696). For cotton transformation, the method described in PCT patent publication WO 00/71733 can be used. For soybean transformation, reference is made to methods known in the art, e.g., Hinchee et al. (Bio/Technology, (1988) 6, 915) and Christou et al. (Trends Biotech, (1990) 8, 145) or the method of WO 00/42207.
Genetically altered plants of the present invention can be used in a conventional plant breeding scheme to produce more genetically altered plants with the same characteristics, or to introduce the genetic alteration(s) in other varieties of the same or related plant species. Seeds, which are obtained from the altered plants, preferably contain the genetic alteration(s) as a stable insert in chromosomal or organelle DNA or as modifications to an endogenous gene or promoter. Plants comprising the genetic alteration(s) in accordance with the invention include plants comprising, or derived from, root stocks of plants comprising the genetic alteration(s) of the invention, e.g., fruit trees or ornamental plants. Hence, any non-transgenic grafted plant parts inserted on a transformed plant or plant part are included in the invention.
Introduced genetic elements, whether in an expression vector or expression cassette, which result in the expression of an introduced gene will typically utilize a plant-expressible promoter. A ‘plant-expressible promoter’ as used herein refers to a promoter that ensures expression of the genetic alteration(s) of the invention in a plant cell. Examples of promoters directing constitutive expression in plants are known in the art and include: the strong constitutive 35S promoters (the “35S promoters”) of the cauliflower mosaic virus (CaMV), e.g., of isolates CM 1841 (Gardner et al., Nucleic Acids Res, (1981) 9, 2871 2887), CabbB S (Franck et al., Cell (1980) 21, 285 294) and CabbB J I (Hull and Howell, Virology, (1987) 86, 482 493); promoters from the ubiquitin family (e.g., the maize ubiquitin promoter of Christensen et al., Plant Mol Biol, (1992) 18, 675-689), the gos2 promoter (de Pater et al., The Plant J (1992) 2, 834-844), the emu promoter (Last et al., Theor Appl Genet, (1990) 81, 581-588), actin promoters such as the promoter described by An et al. (The Plant J, (1996) 10, 107), the rice actin promoter described by Zhang et al. (The Plant Cell, (1991) 3, 1155-1165); promoters of the Cassava vein mosaic virus (WO 97/48819, Verdaguer et al. (Plant Mol Biol, (1998) 37, 1055-1067), the pPLEX series of promoters from Subterranean Clover Stunt Virus (WO 96/06932, particularly the S4 or S7 promoter), an alcohol dehydrogenase promoter, e.g., pAdh1S (GenBank accession numbers X04049, X00581), and the TR1′ promoter and the TR2′ promoter (the “TR1′ promoter” and “TR2′ promoter”, respectively) which drive the expression of the 1′ and 2′ genes, respectively, of the T DNA (Velten et al., EMBO J, (1984) 3, 2723 2730).
Alternatively, a plant-expressible promoter can be a tissue-specific promoter, i.e., a promoter directing a higher level of expression in some cells or tissues of the plant, e.g., in root epidermal cells or root cortex cells. Examples of constitutive promoters that are often used in plant cells are the cauliflower mosaic (CaMV) 35S promoter (KAY et al. Science, 236, 4805, 1987), and various derivatives of the promoter, virus promoter vein mosaic cassava (International Application WO 97/48819), the maize ubiquitin promoter (CHRISTENSEN & QUAIL, Transgenic Res, 5, 213-8, 1996), trefoil (Ljubql, MAEKAWA et al. Mol Plant Microbe Interact. 21, 375-82, 2008) and Arabidopsis (UBQ10, Norris et al. Plant Mol. Biol. 21, 895-906, 1993).
In preferred embodiments, root specific promoters will be used. Non-limiting examples include the promoter of the maize allothioneine (DE FRAMOND et al, FEBS 290, 103-106, 1991 Application EP 452269), the chitinase promoter (SAMAC et al. Plant Physiol 93, 907-914, 1990), the glutamine synthetase soybean root promoter (HIREL et al. Plant Mol. Biol. 20, 207-218, 1992), the RCC3 promoter (PCT Application WO 2009/016104), the rice antiquitine promoter (PCT Application WO 2007/076115), the LRR receptor kinase promoter (PCT application WO 02/46439), the maize ZRP2 promoter (U.S. Pat. No. 5,633,363), the tomato LeExt1 promoter (Bucher et al. Plant Physiol. 128, 911-923, 2002), and the Arabidopsis pCO2 promoter (HEIDSTRA et al, Genes Dev. 18, 1964-1969, 2004). These plant promoters can be combined with enhancer elements, they can be combined with minimal promoter elements, or can comprise repeated elements to ensure the expression profile desired.
In some embodiments, genetic elements to increase expression in plant cells can be utilized. For example, an intron at the 5′ end or 3′ end of an introduced gene, or in the coding sequence of the introduced gene, e.g., the hsp70 intron. Other such genetic elements can include, but are not limited to, promoter enhancer elements, duplicated or triplicated promoter regions, 5′ leader sequences different from another transgene or different from an endogenous (plant host) gene leader sequence, 3′ trailer sequences different from another transgene used in the same plant or different from an endogenous (plant host) trailer sequence.
An introduced gene of the present invention can be inserted in host cell DNA so that the inserted gene part is upstream (i.e., 5′) of suitable 3′ end transcription regulation signals (e.g., transcript formation and polyadenylation signals). This is preferably accomplished by inserting the gene in the plant cell genome (nuclear or chloroplast). Preferred polyadenylation and transcript formation signals include those of the nopaline synthase gene (Depicker et al., J. Molec Appl Gen, (1982) 1, 561-573), the octopine synthase gene (Gielen et al., EMBO J, (1984) 3:835 845), the SCSV or the Malic enzyme terminators (Schunmann et al., Plant Funct Biol, (2003) 30:453-460), and the T DNA gene 7 (Velten and Schell, Nucleic Acids Res, (1985) 13, 6981 6998), which act as 3′ untranslated DNA sequences in transformed plant cells. In some embodiments, one or more of the introduced genes are stably integrated into the nuclear genome. Stable integration is present when the nucleic acid sequence remains integrated into the nuclear genome and continues to be expressed (e.g., detectable mRNA transcript or protein is produced) throughout subsequent plant generations. Stable integration into and/or editing of the nuclear genome can be accomplished by any known method in the art (e.g., microparticle bombardment, Agrobacterium-mediated transformation, CRISPR/Cas9, electroporation of protoplasts, microinjection, etc.).
The term recombinant or modified nucleic acids refers to polynucleotides which are made by the combination of two otherwise separated segments of sequence accomplished by the artificial manipulation of isolated segments of polynucleotides by genetic engineering techniques or by chemical synthesis. In so doing one may join together polynucleotide segments of desired functions to generate a desired combination of functions.
As used herein, the terms “overexpression” and “upregulation” refer to increased expression (e.g., of mRNA, polypeptides, etc.) relative to expression in a wild type organism (e.g., plant) as a result of genetic modification. In some embodiments, the increase in expression is a slight increase of about 10% more than expression in wild type. In some embodiments, the increase in expression is an increase of 50% or more (e.g., 60%, 70%, 80%, 100%, etc.) relative to expression in wild type. In some embodiments, an endogenous gene is overexpressed. In some embodiments, an exogenous gene is overexpressed by virtue of being expressed. Overexpression of a gene in plants can be achieved through any known method in the art, including but not limited to, the use of constitutive promoters, inducible promoters, high expression promoters (e.g., PsaD promoter), enhancers, transcriptional and/or translational regulatory sequences, codon optimization, modified transcription factors, and/or mutant or modified genes that control expression of the gene to be overexpressed.
Where a recombinant nucleic acid is intended for expression, cloning, or replication of a particular sequence, DNA constructs prepared for introduction into a host cell will typically comprise a replication system (e.g. vector) recognized by the host, including the intended DNA fragment encoding a desired polypeptide, and can also include transcription and translational initiation regulatory sequences operably linked to the polypeptide-encoding segment. Additionally, such constructs can include cellular localization signals (e.g., plasma membrane localization signals). In preferred embodiments, such DNA constructs are introduced into a host cell's genomic DNA, chloroplast DNA or mitochondrial DNA.
In some embodiments, a non-integrated expression system can be used to induce expression of one or more introduced genes. Expression systems (expression vectors) can include, for example, an origin of replication or autonomously replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary processing information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Signal peptides can also be included where appropriate from secreted polypeptides of the same or related species, which allow the protein to cross and/or lodge in cell membranes, cell wall, or be secreted from the cell.
Selectable markers useful in practicing the methodologies of the invention disclosed herein can be positive selectable markers. Typically, positive selection refers to the case in which a genetically altered cell can survive in the presence of a toxic substance only if the recombinant polynucleotide of interest is present within the cell. Negative selectable markers and screenable markers are also well known in the art and are contemplated by the present invention. One of skill in the art will recognize that any relevant markers available can be utilized in practicing the inventions disclosed herein.
Screening and molecular analysis of recombinant strains of the present invention can be performed utilizing nucleic acid hybridization techniques. Hybridization procedures are useful for identifying polynucleotides, such as those modified using the techniques described herein, with sufficient homology to the subject regulatory sequences to be useful as taught herein. The particular hybridization techniques are not essential to the subject invention. As improvements are made in hybridization techniques, they can be readily applied by one of skill in the art. Hybridization probes can be labeled with any appropriate label known to those of skill in the art. Hybridization conditions and washing conditions, for example temperature and salt concentration, can be altered to change the stringency of the detection threshold. See, e.g., Sambrook et al. (1989) vide infra or Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, NY, N.Y., for further guidance on hybridization conditions.
Additionally, screening and molecular analysis of genetically altered strains, as well as creation of desired isolated nucleic acids can be performed using Polymerase Chain Reaction (PCR). PCR is a repetitive, enzymatic, primed synthesis of a nucleic acid sequence. This procedure is well known and commonly used by those skilled in this art (see Mullis, U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki et al. (1985) Science 230:1350-1354). PCR is based on the enzymatic amplification of a DNA fragment of interest that is flanked by two oligonucleotide primers that hybridize to opposite strands of the target sequence. The primers are oriented with the 3′ ends pointing towards each other. Repeated cycles of heat denaturation of the template, annealing of the primers to their complementary sequences, and extension of the annealed primers with a DNA polymerase result in the amplification of the segment defined by the 5′ ends of the PCR primers. Because the extension product of each primer can serve as a template for the other primer, each cycle essentially doubles the amount of DNA template produced in the previous cycle. This results in the exponential accumulation of the specific target fragment, up to several million-fold in a few hours. By using a thermostable DNA polymerase such as the Taq polymerase, which is isolated from the thermophilic bacterium Thermus aquaticus, the amplification process can be completely automated. Other enzymes which can be used are known to those skilled in the art.
Nucleic acids and proteins of the present invention can also encompass homologues of the specifically disclosed sequences. Homology (e.g., sequence identity) can be 50%-100%. In some instances, such homology is greater than 80%, greater than 85%, greater than 90%, or greater than 95%. The degree of homology or identity needed for any intended use of the sequence(s) is readily identified by one of skill in the art. As used herein percent sequence identity of two nucleic acids is determined using an algorithm known in the art, such as that disclosed by Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (1990) J. Mol. Biol. 215:402-410. BLAST nucleotide searches are performed with the NBLAST program, score=100, wordlength=12, to obtain nucleotide sequences with the desired percent sequence identity. To obtain gapped alignments for comparison purposes, Gapped BLAST is used as described in Altschul et al. (1997) Nucl. Acids. Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (NBLAST and XBLAST) are used. See www.ncbi.nih.gov.
Preferred host cells are plant cells. Recombinant host cells, in the present context, are those which have been genetically modified to contain an isolated nucleic molecule, contain one or more deleted or otherwise non-functional genes normally present and functional in the host cell, or contain one or more genes to produce at least one recombinant protein. The nucleic acid(s) encoding the protein(s) of the present invention can be introduced by any means known to the art which is appropriate for the particular type of cell, including without limitation, transformation, lipofection, electroporation or any other methodology known by those skilled in the art.
Having generally described this invention, the same will be better understood by reference to certain specific examples, which are included herein to further illustrate the invention and are not intended to limit the scope of the invention as defined by the claims.
The present disclosure is described in further detail in the following examples which are not in any way intended to limit the scope of the disclosure as claimed. The attached figures are meant to be considered as integral parts of the specification and description of the disclosure. The following examples are offered to illustrate, but not to limit the claimed disclosure.
The following example describes the structural characterization of the Medicago NFP protein ectodomain.
Expression and purification of Medicago NFP ectodomain: The Medicago truncatula NFP ectodomain (residues 28-246) was codon-optimized for insect cell expression (Genscript, Piscataway, USA) and cloned into the pOET4 baculovirus transfer vector (Oxford Expression Technologies). The native NFP signal peptide (residues 1-27, predicted by SignalP 4.1) was replaced with the AcMNPV gp67 signal peptide to facilitate secretion and a hexa-histidine tag was added to the C-terminus. Recombinant baculoviruses were produced in Sf9 cells (Spodoptera frugiperda) using the FlashBac Gold kit (Oxford Expression technologies) according to the manufacturer's instructions with Lipofectin (ThermoFisher Scientific) as a transfection reagent. Protein expression was performed as follows. Suspension-cultured Sf9 cells were maintained with shaking at 299 K in serum-free MAX-XP (BD-Biosciences, discontinued) or HyClone SFX (GE Healthcare) medium supplemented with 1% Pen-Strep (10000 U/ml, Life technologies) and 1% CD lipid concentrate (Gibco). Protein expression was induced by adding recombinant passage 3 virus once the Sf9 cells reached a cell density of 1.0*10{circumflex over ( )}6 cells/ml. After 5-7 days of expression, medium supernatant containing NFP ectodomains was harvested by centrifugation. This was followed by an overnight dialysis step against 50 mM Tris-HCl pH 8, 200 mM NaCl at 277 K. The NFP ectodomain was enriched by two subsequent steps of Ni-IMAC purification (HisTrap excel/HisTrap HP, both GE Healthcare). For crystallography, N-glycans were removed using the endoglycosidase PNGase F (1:15 (w/w), room temperature, overnight). As a final purification step, NFP ectodomains was purified by SEC on a Superdex 200 10/300 or HiLoad Superdex 200 16/600 (both GE Healthcare) in phosphate buffered saline at pH 7.2 supplemented to a total of 500 mM NaCl (for binding assays) or 50 mM Tris-HCl, 200 mM NaCl (for crystallography). NFP ectodomain elutes as a single, homogeneous peak corresponding to a monomer.
Crystallization and structure determination: Crystals of deglycosylated NFP ectodomain were obtained using a vapour diffusion setup at 3-5 mg/ml in 0.2 M Na-acetate, 0.1 M Na-cacodylate pH 6.5, and 30% (w/v) PEG-8000. Crystals were cryoprotected in their crystallization condition by supplementing with 5% (w/v) PEG-400 before being snap-frozen in liquid nitrogen. Complete diffraction data to 2.85 Å resolution was obtained at the MaxLab 1911-3 beamline. The phase problem was solved by molecular replacement using Phaser from the PHENIX suite with a homology model based on the AtCERK1 ectodomain structure (PDB coordinates 4EBZ) as a search model. Model building and refinement was done using COOT and the PHENIX suite, respectively. The output pdb filled structural model was generated and its electrostatic surface potential was calculated using the PDB2PQR and APBS webservers (PMID: 21425296). The results were visualized in PyMol using APBS tools 2.1 (DeLano, W. L. (2002). PyMOL. DeLano Scientific, San Carlos, Calif., 700.).
The structure of Medicago NFP was determined by molecular replacement using a homology model based on the inner low B-factor scaffold of AtCERK1. The complete structure of NFP (residues 33-233) was built this way, including four N-glycosylations that were clearly resolved in the 2.8 Å electron density map. NFP forms a compact structure where three classical βααβ LysM domains are tightly interconnected and stabilized by 3 conserved disulfide bridges (C3-C104, C47-C166 and C102-C164) (
The following example describes the use of a structurally-guided approach to identify important residues in NFP for LCO perception. After identifying important residues, NFP point mutations were created, and tested using ligand-binding assays.
Structurally-guided residue identification: The NFP ectodomain was structurally aligned to ligand-bound CERK1. Then, the electrostatic surface potential was mapped to the previously-developed structure of the NFP ectodomain. The predicted ligand-binding location and electrostatic surface potential are depicted in
Creation of NFP point mutations: The NFP leucine residues L147 and L154 were replaced with aspartate residues. Aspartate is similar in size to leucine, but negatively charged where leucine is hydrophobic. Point mutants of NFP were engineered using site-directed mutagenesis. In particular, a double-mutated NFP was engineered where the leucine residues L147 and L154 were replaced with aspartate residues to create the mutant NFP L147D L154D. Point mutated versions of NFP were expressed and purified as described in Example 1.
NFP mutant binding assays: The binding assay using NFP wild type (WT) protein was replicated seven times, while the binding assay using the NFP mutant NFP L147D L154D was replicated four times. A summary of results is shown in
Biolayer interferometry (BLI): Binding of NFP WT and NFP L147D/L154D mutant to S. meliloti LCO-IV was measured on an Octet RED 96 system (Pall ForteBio). S. meliloti LCO consists of a tetrameric/pentameric N-acetylglucosamine backbone that is 0-sulfated on the reducing terminal residue, 0-acetylated on the non-reducing terminal residue, and mono-N-acylated by unsaturated C16 acyl groups. Biotinylated ligand conjugates were immobilized on streptavidin biosensors (kinetic quality, Pall ForteBio) at a concentration of 125-250 nM for 5 minutes. The binding assays were replicated 7 times for the NFP WT, and 4 times for the NFP L147D/L154D mutant. Data analysis was performed in GraphPad Prism 6 software (GraphPad Software, Inc.). Equilibrium dissociation constants derived from the steady-state were determined by applying a non-linear regression (one site, specific binding) to the response at equilibrium plotted against the protein concentration. Kinetic parameters were determined by non-linear regression (association followed by dissociation) on the subtracted data. Results are shown in
To test the contribution of these two residues to LCO binding, both residues were replaced with similarly sized but negatively charged aspartate residues to produce NFP L147D L154D. Interestingly, the double mutated NFP L147D L154D ectodomain bound S. meliloti LCO-IV with approximately two times lower affinity; Kd of 48.0±1.0 μM (
Biochemical analysis of LCO binding to the hydrophobic patch mutant reveals that purified L147D/L154D NFP-ECD bound S. meliloti LCO-IV with 13-fold lower affinity (Kd of 166.7±4.2 μM) compared to WT NFP-ECD (
The binding kinetics of AtCERK1 binding to chitin fragments were measured as a comparison. As shown in
Together, the data provided evidence that the hydrophobic patch in NFP (shown in
To confirm the biochemical observations described in the previous examples, next a complementation test was performed in Medicago nfp mutants using hairy root transformation.
Complementation assay: Construct assembly, plant growth conditions, hairy root transformations, nodulation and ROS assays were generally conducted as described in Bozsoki et al. (2017) (Bozsoki Z, Cheng J, Feng F, Gysel K, Vinther M, Andersen K R, Oldroyd G, Blaise M, Radutoiu S, Stougaard J (2017) Receptor-mediated chitin perception in legume roots is functionally separable from Nod factor perception. Proc Natl Acad Sci 114: E8118-E8127). A general schematic of the construct is provided in
These complementation experiments were repeated using S. medicae inoculation, which has been reported to nodulate Medicago with higher efficiency. The results shown in
The following example describes functional characterization of the Lotus LCO receptor NFR1 and the Lotus CO receptor CERK6. This was done using domain swaps, and by measuring nodulation and defense (reactive oxygen species, ROS) responses to assess complementation.
Complementation assay: Construct assembly, plant growth conditions, hairy root transformations, nodulation and ROS assays were generally conducted as described in Bozsoki et al. (2017) (Bozsoki Z, Cheng J, Feng F, Gysel K, Vinther M, Andersen K R, Oldroyd G, Blaise M, Radutoiu S, Stougaard J (2017) Receptor-mediated chitin perception in legume roots is functionally separable from Nod factor perception. Proc Natl Acad Sci 114: E8118-E8127). A general schematic of the construct is provided in
Additional experiments, also depicted in
The following example describes the structural characterization of the Lotus CERK6 protein ectodomain.
Modelling: The target LysM receptor amino acid sequence (Lotus CERK6) was aligned with a known receptor sequence (Medicago NFP). Then, the LysM1-3 domains of the target sequence were used as an input in SWISS-MODEL (Biasini 2014). The structural coordinate file (.pdb) of the Medicago NFP crystal structure as template file in SWISS-MODEL (Biasini 2014), and the modelling program was run using the command ‘Build Model’. The electrostatic surface potential of the output target (.pdb) model generated with SWISS-MODEL was calculated using PDB2PQR & APBS webservers (PMID: 21425296) and visualized in PyMol using APBS tools 2.1 (DeLano, W. L. 2002). The 3D structure of the Lotus CERK6 ectodomain is depicted in
The 3D structure of CERK6 shows that region II and region IV are located adjacent to each other as shown in
The following example describes functional characterization of the Lotus LCO receptor NFR1 and the Medicago LCO receptor LYK3. This was done using domain swaps, and by measuring nodulation to assess complementation.
Complementation assay: Construct assembly, plant growth conditions, hairy root transformations, nodulation and ROS assays were generally conducted as described in Bozsoki et al. (2017) (Bozsoki Z, Cheng J, Feng F, Gysel K, Vinther M, Andersen K R, Oldroyd G, Blaise M, Radutoiu S, Stougaard J (2017) Receptor-mediated chitin perception in legume roots is functionally separable from Nod factor perception. Proc Natl Acad Sci 114: E8118-E8127). A general schematic of the construct is provided in
The tested chimeric receptors in Medicago are depicted as block diagrams in
The results from
Overall, these results indicated that the LysM1 domain was essential for recognizing those LCOs produced by the cognate N-fixing bacterial strain of a legume species. When the chimeric receptors were expressed in M. truncatula, the regions II, III, and IV of the LysM1 domain were identified as particularly important for this recognition. Replacing regions II and IV were sufficient to obtain a loss of recognition. Replacing regions II, III, and IV were required to obtain gain of recognition for S. meliloti LCO and optimal functionality of the receptor.
The following example describes engineering of the Lotus receptor LYS11 (LjLYS11) to specifically perceive LCOs. This was done using domain swaps, by measuring ligand binding, and by measuring nodulation to assess complementation.
LjLYS11 ectodomain production and purification: The LjLYS11 ectodomain (residues 26-234; SEQ ID NO:60) was codon-optimized for insect cell expression (Genscript, Piscataway, USA) and cloned into the pOET4 baculovirus transfer vector (Oxford Expression Technologies). The native LjLYS11 signal peptide was replaced with the gp64 signal peptide (SEQ ID NO:59) to facilitate secretion and a hexa-histidine (6×His; SEQ ID NO:61) tag was added to the C-terminus (LAYS11-ecto (26-234), N-term gp64, C-term 6His=SEQ ID NO:56). Recombinant baculoviruses were produced in Sf9 cells (Spodoptera frugiperda) using the FlashBac Gold kit (Oxford Expression technologies) according to the manufacturer's instructions with Lipofectin (ThermoFisher Scientific) as a transfection reagent. Protein expression was performed as follows. Suspension-cultured Sf9 cells were maintained with shaking at 299 K in serum-free MAX-XP (BD-Biosciences, discontinued) or HyClone SFX (GE Healthcare) medium supplemented with 1% Pen-Strep (10000 U/ml, Life technologies) and 1% CD lipid concentrate (Gibco). Protein expression was induced by adding recombinant passage 3 virus once the Sf9 cells reached a cell density of 1.0*10{circumflex over ( )}6 cells/ml. After 5-7 days of expression, medium supernatant containing LjLYS11 ectodomains was harvested by centrifugation. This was followed by an overnight dialysis step against 50 mM Tris-HCl pH 8, 200 mM NaCl at 277 K. The LjLYS11 ectodomain was enriched by two subsequent steps of Ni-IMAC purification (HisTrap excel/HisTrap HP, both GE Healthcare). For crystallography experiments, N-glycans were removed using the endoglycosidase PNGase F (1:15 (w/w), room temperature, overnight). As a final purification step, LjLYS11 ectodomain was purified by SEC on a Superdex 200 10/300 or HiLoad Superdex 200 16/600 (both GE Healthcare) in phosphate buffered saline at pH 7.2 supplemented to a total of 500 mM NaCl (for binding assays) or 50 mM Tris-HCl, 200 mM NaCl (for crystallography).
Biolayer interferometry (BLI): Binding of LjLYS11 ectodomain and domain-swapped versions of LjLYS11 ectodomain to ligands was measured on an Octet RED 96 system (Pall ForteBio). The ligands used were CO5 chitin oligomer (corresponding to the backbone of S. meliloti LCO-V), M. loti LCO, and S. meliloti LCO. S. meliloti LCO consists of a tetrameric/pentameric N-acetylglucosamine backbone that is O-sulfated on the reducing terminal residue, 0-acetylated on the non-reducing terminal residue, and mono-N-acylated by unsaturated C16 acyl groups. M. loti LCO is a pentameric N-acetylglucosamine with a cis-vaccenic acid and a carbamoyl group at the non-reducing terminal residue together with a 2,4-O-acetylfucose at the reducing terminal residue. Biotinylated ligand conjugates were immobilized on streptavidin biosensors (kinetic quality, Pall ForteBio) at a concentration of 125-250 nM for 5 minutes. Data analysis was performed in GraphPad Prism 6 software (GraphPad Software, Inc.). Equilibrium dissociation constants derived from the steady-state were determined by applying a non-linear regression (one site, specific binding) to the response at equilibrium plotted against the protein concentration. Kinetic parameters were determined by non-linear regression (association followed by dissociation) on the subtracted data. The tested chimeric receptors are depicted as block diagrams in
Complementation assay: The complementation assay was done as in Example 6. The tested chimeric receptors are depicted as block diagrams in
Based on modelling and crystal structure determination of LjLYS11 ectodomain (
Next, it was tested whether stringent and specific LCO recognition could be engineered. For these tests, LjLYS11 ectodomains were engineered to contain parts of LjNFR5 receptors. Either the entire LysM2 or key residues from the LysM2 hydrophobic patch from LjLYS11 were replaced with the corresponding regions QLGDSYD (SEQ ID NO:63) and GV (SEQ ID NO:64) from LjNFR5, and ligand binding of these chimeric ectodomains was measured. As shown in
Then, chimeric receptors were tested in planta. For these tests, the same chimeric LjLYS11 ectodomains were used (the entire LysM2, or key residues from LysM2 from LjLYS11 were replaced with the corresponding regions from LjNFR5) or the entire LjLYS11 ectodomain (LysM1, LysM2, and LysM3) was used, and these were attached to the transmembrane domain (wavy shape in schematic of
Interestingly, the chimeric LjLYS11/LjNFR5 ectodomains had different LCO binding kinetics with slow on/off rates that resembled the binding kinetics of M. truncatula NFP. As shown in
Taken together, the results seen with chimeric LjLYS11/LjNFR5 ectodomains show that LysM2 engineering can create receptors with higher stringency toward LCO as well as higher specificity toward LCO.
The following example describes homology modelling in barley (H. vulgare) to identify target LysM receptors for use in engineering.
Modelling: Homology modelling was performed with SWISS-MODEL (Biasini, M. et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 42, W252-W258 (2014)). For barley RLK10 (HvRLK10), the crystal structure of Medicago NFP served as the template model onto which the amino acid sequence of the target receptor was mapped. For barley RLK4 (HvRLK4), the crystal structure of Medicago LYK3 served as the template model onto which the amino acid sequence of the target receptor was mapped. The output pdb filled structural model was generated and its electrostatic surface potential was calculated using the PDB2PQR and APBS webservers (PMID: 21425296). The results were visualized in PyMol using APBS tools 2.1 (DeLano, W. L. (2002). PyMOL. DeLano Scientific, San Carlos, Calif., 700).
Expression and purification of ectodomain: The HvRLK10 ectodomain (residues 25-231; SEQ ID NO:66) was codon-optimized for insect cell expression (Genscript, Piscataway, USA) and cloned into the pOET4 baculovirus transfer vector (Oxford Expression Technologies). The native HvRLK10 signal peptide was replaced with the gp64 signal peptide (SEQ ID NO:59) to facilitate secretion and a hexa-histidine (6×HIS; SEQ ID NO:61) tag was added to the C-terminus (HvRLK10-ecto (25-231), N-term gp64, C-term 6His=SEQ ID NO:65). The HvRLK4 ectodomain (residues 27-228; SEQ ID NO:68) was codon-optimized for insect cell expression (Genscript, Piscataway, USA) and cloned into the pOET4 baculovirus transfer vector (Oxford Expression Technologies). The native HvRLK4 signal peptide was replaced with the gp64 signal peptide (SEQ ID NO:59) to facilitate secretion and a hexa-histidine (6×HIS; SEQ ID NO:61) tag was added to the C-terminus (RLK4-ecto (27-228), N-term gp64, C-term 6His=SEQ ID NO:67). Recombinant baculoviruses were produced in Sf9 cells (Spodoptera frugiperda) using the FlashBac Gold kit (Oxford Expression technologies) according to the manufacturer's instructions with Lipofectin (ThermoFisher Scientific) as a transfection reagent.
Protein expression was performed as follows. Suspension-cultured Sf9 cells were maintained with shaking at 299 K in serum-free MAX-XP (BD-Biosciences, discontinued) or HyClone SFX (GE Healthcare) medium supplemented with 1% Pen-Strep (10000 U/ml, Life technologies) and 1% CD lipid concentrate (Gibco). Protein expression was induced by adding recombinant passage 3 virus once the Sf9 cells reached a cell density of 1.0*10{circumflex over ( )}6 cells/ml. After 5-7 days of expression, medium supernatant containing HvRLK10 ectodomain or HvRLK4 ectodomain was harvested by centrifugation. This was followed by an overnight dialysis step against 50 mM Tris-HCl pH 8, 200 mM NaCl at 277 K. The HvRLK10 ectodomain or HvRLK4 ectodomain was enriched by two subsequent steps of Ni-IMAC purification (HisTrap excel/HisTrap HP, both GE Healthcare).
Biolayer interferometry (BLI): Binding of HvRLK10 ectodomain or HvRLK4 ectodomain to ligands was measured on an Octet RED 96 system (Pall ForteBio). The ligands used were CO5 chitin oligomer (corresponding to the backbone of S. meliloti LCO-V), M. loti LCO, and S. meliloti LCO. S. meliloti LCO consists of a tetrameric/pentameric N-acetylglucosamine backbone that is O-sulfated on the reducing terminal residue, 0-acetylated on the non-reducing terminal residue, and mono-N-acylated by unsaturated C16 acyl groups. M. loti LCO is a pentameric N-acetylglucosamine with a cis-vaccenic acid and a carbamoyl group at the non-reducing terminal residue together with a 2,4-O-acetylfucose at the reducing terminal residue. Biotinylated ligand conjugates were immobilized on streptavidin biosensors (kinetic quality, Pall ForteBio) at a concentration of 125-250 nM for 5 minutes. Data analysis was performed in GraphPad Prism 6 software (GraphPad Software, Inc.). Equilibrium dissociation constants derived from the steady-state were determined by applying a non-linear regression (one site, specific binding) to the response at equilibrium plotted against the protein concentration. Kinetic parameters were determined by non-linear regression (association followed by dissociation) on the subtracted data.
Homology modelling of all ten barley LysM receptor-like kinases (RLKs) was done using the Medicago NFP structure as a template. Of the barley LysM RLKs, HvRLK10 was the receptor that was closest to Medicago NFP and modelled the best using this approach.
To experimentally validate this prediction, the HvRLK10 ectodomain was expressed and purified for use in binding experiments (ectodomain schematic shown at top of
In addition, homology modelling of all ten barley LysM RLKs was done using the Medicago LYK3 structure as a template. HvRLK4 was the receptor that was closest to Medicago LYK3 and modelled the best using this approach.
To experimentally validate this prediction, the HvRLK4 ectodomain was expressed and purified for use in binding experiments (ectodomain schematic shown at top of
Both HvRLK10 and HvRLK4 were initially identified by homology modelling, and then confirmed to be LCO receptors by biochemical characterization. HvRLK10 and HvRLK4 therefore represent promising target receptors for engineering in barley particularly for engineering receptors that recognize LCOs in a manner similar to the donor receptors used to select them, Medicago NFP and Medicago LYK3, respectively.
Overall, these results show that the homology modelling approach can be used to identify LCO receptors of both the NFP/NFR5 type and the NFR1/LYK3 type specifically and that this method may be used to identify good target LysM receptor to modify to alter a desired receptor characteristic to be that of the donor LysM receptor used to select the target LysM receptor.
One of skill in the art would have no difficulty applying the teachings of this disclosure to genetically alter LysM receptors to include a hydrophobic patch or alter an existing hydrophobic patch. Exemplary steps would be:
1. Align the target LysM receptor amino acid sequence with one or more known LCO receptor sequences (See, e.g.,
Applying this step to the HvLysM-RLK2/37-247 sequence produced the following amino acid sequence:
2. Use the LysM1-3 domain amino acid sequence as the input sequence to be modeled in an appropriate molecular modeling program such as SWISS-MODEL (Biasini 2014). SWISS-MODEL can be readily accessed at swissmodel.expasy.org under interactive#structure.
3. Input the structural template to the molecular modelling program, for example from a structural coordinate file (e.g., a pdb format file).
The HvLysM-RLK2/37-247 LysM1-3 domain amino acid sequence was entered into SWISS-MODEL as was the Medicago NFP receptor ectodomain crystal structure .pdb file (the atomic coordinates are reproduced at the end of the specification). The SWISS-MODEL program was run by the command ‘Build Model’. The Medicago NFP receptor ectodomain crystal structure was chosen as it has a known hydrophobic patch. One of skill in the art can readily select others based upon the teachings in this specification.
4. Optionally create an electrostatic surface potential of the target model and structurally align with a structure with chitin (or glycan) bound to the LysM2 domain to align the ligand binding grooves.
An electrostatic surface potential of the output target (.pdb) model generated with SWISS-MODEL was calculated using PDB2PQR & APBS webservers (PMID: 21425296) and visualized in PyMol using APBS tools 2.1 (DeLano, W. L. 2002). The AtCERK1 ectodomain structure (PDB coordinates 4EBZ) which has the chitin bound in the structure was aligned to the target model in PyMol. One of skill in the art would readily understand the position of the chitin binding domain as the LysM chitin binding motif is defined structurally in Liu et al. Science 2012 for AtCERK1. This aligned the chitin (C04) ligand in the LysM2 ligand binding groove of the target model.
5. Select the residues from the alignment in the target model that align with the known hydrophobic patch.
From the sequence alignment (1), structural alignment of the target model with the crystal structure of Medicago NFP and the electrostatic surface potential information (5) the hydrophobic patch was identified (with the placed chitin from AtCERK1 as reference for locating the CO binding groove as shown in (
This application claims the benefit of U.S. Provisional Application No. 62/718,282, filed Aug. 13, 2018, which is hereby incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2019/071705 | 8/13/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62718282 | Aug 2018 | US |