The present disclosure generally relates to methods of conferring resistance to nematodes in soybeans.
BACKGROUND OF THE DISCLOSURE
Soybean cyst nematode (SCN, Heterodera glycines Ichinohe) is the most devastating pest among plant-parasitic nematode species in the United States and worldwide. Annual soybean yield losses caused by this pest in the United States alone were estimated at $1.5 billion [Wrather & Koenning]. The deployment of SCN resistance soybean varieties is the most efficient management manner to control the nematodes damage in soybean production areas. In past decades, many efforts have been made to evaluate the USDA Soybean Germplasm Collection for new sources of resistance to SCN. Over 100 plant introductions (PIs), including common accessions PI 88788, ‘Peking’ (PI 548402), and PI 437654 were identified as resistant to different SCN HG Types [Concibido et al; Arelli et al., 2000; Arelli et al., 1997]. Among these, PI 437654 and PI 567516C were highly resistant to multiple SCN races [Vuong et al.; Wu et al.; Arelli et al., 2009; Brucker et al.].
To date, only two major sources of resistance have been commonly employed in soybean breeding programs, which are derived from soybean lines PI 88788 and ‘Peking’ [Concibido et al.]. PI 88788 has eight copies at the Rhg1 locus and is the primary source used in commercial breeding programs to battle SCN damage. More than 90% of SCN resistant cultivars are derived from this single source. A survey conducted in 2005 [Niblack et al.] showed that 83% of the soybean fields in Illinois were infested with SCN and 70% of these have adapted to PI 88788, resulting in a reduction of the effectiveness when using SCN resistant cultivars as a crop management tool [Niblack et al.]. It is now urgent for soybean growers to have alternative sources of SCN resistance to overcome the selection pressure and the SCN population shifts.
Recent advances in high-throughput genotyping and next-generation sequencing technologies provide researchers with new opportunities to analyze genome structure at a large and a fine scale [Wang et al.; Schmutz et al., 2014]. Re-sequencing of diverse genetic populations is a powerful approach for trait discovery and has been conducted in a variety of organisms, including humans [Telenti et al], animals [Choi et al.; Zhou et al., 2016; Rubin et al.], and several species thereof [Afolitos et al.; Varshney et al., 2017; Lam et al., 2011; Lam et al., 2010; Xu et al.]. Whole genome re-sequencing (WGRS) facilitates the identification of functional variations and provides a comprehensive catalog of genome wide polymorphism in closely related accessions. It also overcomes the limitation of missing data compared to other genotyping technologies [Jackson et al.]. Importantly, the data from WGRS provides a high resolution of the variation within populations, thus enabling marker-assisted breeding, gene mapping, and the identification of phenotype-genotype relationships. In humans, WGRS of diverse human populations aided the development of HapMap and facilitated the identification of common genetic variations [Gibbs et al.]. In crops such as rice [Huang et al.; Yano et al.], tomato [Aflitos et al.], soybean [Lam et al., 2010], chickpea [Varshney et al., 2013], pigeonpea [Varshney et al., 2017] and maize [Gore et al.], the detailed analysis of re-sequencing data provided a catalog of genetic variants, such as single nucleotide polymorphisms (SNPs) and copy number variation (CNV), across the genome. Furthermore, this information has been used to identify genomic regions that are expected to play an important role during domestication and selection. Importantly, CNVs are an important component of genetic variation because they influence gene expression, phenotypic variation and adaptation by disturbing genes and altering gene dosage [Sebat et al.; Shlien & Malkin; Redon et al.]. In humans, CNVs are associated with cancer risk factors, neurological functions, regulation of cell growth and metabolism [Sebat et al.].
In soybean, a large number of wild accessions, landraces, and varieties have recently been re-sequenced to provide useful information about the genome structure and enable the discovery of new genes [Lam et al., 2010; Zhou et al., 2015; Qi et al.; Schmutz et al., 2010; Li et al.; Valliyodan et al.]. Moreover, the development of soybean high-density markers from large sequencing data sets provides a powerful tool for whole genome prediction and selection applications [Patil et al., 2016]. In the case of SCN resistance, remarkable progress has been made since the cloning of the resistance genes that reside in the two major loci, Rhg 1 and Rhg4 [Liu et al., 2012; Cook et al., 2012; Liu et al., 2017; Lakhssassi et al.]. However, the mechanism of SCN broad-based resistance and the interaction of these two loci in the soybean accessions are still unclear and warrant further investigation.
One embodiment of the present disclosure is a transgenic soybean plant resistant to soybean cyst nematode (SCN) comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-. The transgenic soybean plant may have increased SCN resistance compared to a control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a transgenic soybean plant resistant to soybean cyst nematode (SCN) comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H. The transgenic soybean plant may have increased SCN resistance compared to a control soybean plant lacking the second polynucleotide. The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number.
Another embodiment of the present disclosure is a plant of an agronomically elite soybean variety comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-. The plant may have increased soybean cyst nematode (SCN) resistance compared to a control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a plant of an agronomically elite soybean variety comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H. The plant may have increased soybean cyst nematode (SCN) resistance compared to a control soybean plant lacking the second polynucleotide. The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number.
Another embodiment of the present disclosure is a method of increasing soybean cyst nematode (SCN) resistance of a soybean plant comprising transforming the soybean plant with a first DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-. The transformed soybean plant may have increased SCN resistance compared to a control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a method of increasing soybean cyst nematode (SCN) resistance of a soybean plant comprising transforming the soybean plant with a first DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H. The transformed soybean plant may have increased SCN resistance compared to a control soybean plant lacking the second polynucleotide. The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number.
Another embodiment of the present disclosure is a DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in a soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-.
Another embodiment of the present disclosure is a DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in soybean operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity. The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H. The DNA construct may be constructed such that a soybean plant transformed with the DNA construct may have increased expression, an altered expression pattern, or an increased copy number of the second polynucleotide compared to a control soybean plant that has not been transformed with the DNA construct.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure. The present disclosure may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein. However, those of skill in the art will understand that the drawings, described below, are for illustrative purposes only. The drawings are not intended to limit the scope of the present teachings in any way.
The patent or patent application files contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
FIG. 20A1, FIG. 20A2, and
A sequence listing is being submitted herewith by electronic submission and is hereby incorporated by reference.
SEQ ID NO:1 is a nucleotide sequence for Essex Glyma.08g108900 (Serine Hydroxymetyhltransferase) DNA promoter.
SEQ ID NO:2 is a nucleotide sequence for Essex Glyma.08g108900 (Serine Hydroxymethyltransferase) protein.
SEQ ID NO:3 is a nucleotide sequence for Williams 82 Glyma.18g022500 (alpha Soluble NSF attachment protein) DNA promoter.
SEQ ID NO:4 is a nucleotide sequence for Essex Glyma.18g022500 (alpha Soluble NSF attachment protein) protein.
One embodiment of the present disclosure is a transgenic soybean plant resistant to soybean cyst nematode (SCN) comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The transgenic soybean plant may also comprise a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in the soybean plant operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSF attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L288I, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The transgenic soybean plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the first polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The transgenic soybean plant may have increased SCN resistance compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least 2 SCN races as compared to the control soybean plant lacking the first polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a transgenic soybean plant resistant to soybean cyst nematode (SCN) comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity ay comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The transgenic soybean plant may also comprise a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in soybean operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSF attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L2881, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The transgenic soybean plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the second polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The transgenic soybean plant may have increased SCN resistance compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least 2 SCN races as compared to the control soybean plant lacking the second polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the second polynucleotide.
A further embodiment of the disclosed technology is a plant part of any of the transgenic soybean plants described above.
Another embodiment of the present disclosure is a plant of an agronomically elite soybean variety comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The plant may also comprise a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in the soybean plant operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSF attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L288I, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the first polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The plant may have increased soybean cyst nematode (SCN) resistance compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least 2 SCN races as compared to the control soybean plant lacking the first polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a plant of an agronomically elite soybean variety, comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 at least 11, at least 12, at least 13, at least 14, or at least 15.
The plant may also comprise a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in soybean operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSH attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L2881, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the second polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The plant may have increased soybean cyst nematode (SCN) resistance compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least 2 SCN races as compared to the control soybean plant lacking the second polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the second polynucleotide.
A further embodiment of the disclosed technology is a plant part of any of the plants described above.
Another embodiment of the present disclosure is a method of increasing soybean cyst nematode (SCN) resistance of a soybean plant comprising transforming the soybean plant with a first DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The method may comprise further transforming the soybean plant with a second DNA construct comprising a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in the soybean plant operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSF attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L2881, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The soybean plant may be simultaneously transformed with the first DNA construct and the second DNA construct. The soybean plant may be transformed separately with the first DNA construct and the second DNA construct. The soybean plant may be transformed first with the first DNA construct then transformed with the second DNA construct. The soybean plant may be transformed first with the second DNA construct then transformed with the first DNA construct.
The transformed soybean plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the first polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The transformed soybean plant may have increased SCN resistance compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the first polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least 2 SCN races as compared to the control soybean plant lacking the first polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the first polynucleotide.
Another embodiment of the present disclosure is a method of increasing soybean cyst nematode (SCN) resistance of a soybean plant comprising transforming the soybean plant with a first DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in the soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The second polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The second polynucleotide may have a copy number of at least 2. Alternatively, the second polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The method may comprise further transforming the soybean plant with a second DNA construct comprising a third polynucleotide encoding an alpha soluble NSF attachment protein promoter that functions in the soybean plant operably linked to a fourth polynucleotide encoding a polypeptide having alpha soluble NSF attachment protein activity.
The third polynucleotide may comprise SEQ ID NO: 3, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The third polynucleotide may comprise one or more mutations of SEQ ID NO: 3 selected from the group consisting of: C1161A, C1082A, C1044A, C1025T, A1016C, T997A, C970A, C970-, G829T, G825T, A815C, A363T, T336C, G334A, T328C, T327A, C267G, T157G, T83A, C57T, and T36A.
The polypeptide having alpha soluble NSF attachment protein activity may comprise SEQ ID NO: 4, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having alpha soluble NSF attachment protein activity may comprise one or more mutations of SEQ ID NO: 4 selected from the group consisting of: A111D, Q203K, D208E, I238V, E285Q, D286Y, D286H, D287E, +287A, +287V, L288I, and +288T.
The fourth polynucleotide may have increased expression, an altered expression pattern, or an increased copy number. The fourth polynucleotide may have a copy number of at least 2. Alternatively, the fourth polynucleotide may have a copy number of at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15.
The soybean plant may be simultaneously transformed with the first DNA construct and the second DNA construct. The soybean plant may be transformed separately with the first DNA construct and the second DNA construct. The soybean plant may be transformed first with the first DNA construct then transformed with the second DNA construct. The soybean plant may be transformed first with the second DNA construct then transformed with the first DNA construct.
The transformed soybean plant may have a grain yield of at least about 90%, at least about 94%, at least about 98%, at least about 100%, at least about 105%, or at least about 110% as compared to a control soybean plant lacking the second polynucleotide. For example, the grain yield can be from about 90% to about 110%, from about 94% to about 110%, from about 100% to about 110%, or from about 105% to about 110% as compared to a control soybean plant lacking the first polynucleotide.
The transformed soybean plant may have increased SCN resistance compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, or at least about 1000% decrease in susceptibility to SCN as compared to the control soybean plant lacking the second polynucleotide.
The increased SCN resistance may comprise a decrease in susceptibility to at least two SCN races as compared to the control soybean plant lacking the second polynucleotide. Alternatively, the increased SCN resistance may comprise a decrease in susceptibility to at least 3 SCN races, at least 4 SCN races, at least 5 SCN races, at least 6 SCN races, at least 7 SCN races, at least 8 SCN races, at least 9 SCN races, or at least 10 SCN races as compared to the control soybean plant lacking the second polynucleotide.
Another embodiment of the present disclosure is a DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in a soybean plant operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The first polynucleotide may comprise SEQ ID NO: 1, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The first polynucleotide may comprise one or more mutations of SEQ ID NO: 1 selected from the group consisting of: A3959T, G3726C, A3444T, C3147T, A3130C, T3037C, G2999C, C2998T, T2979C, C2846T, G2475T, A2420G, C2416T, +2323T, T2051A, G2050C, A1606G, T1523-, G1164A, T1156A, A403C, C380T, A338T, T329A, T313C, T225G, T225-, A133G, A133-, G28T, and G28-.
Another embodiment of the present disclosure is a DNA construct comprising a first polynucleotide encoding a serine hydroxymethyltransferase promoter that functions in soybean operably linked to a second polynucleotide encoding a polypeptide having serine hydroxymethyltransferase activity.
The polypeptide having serine hydroxymethyltransferase activity may comprise SEQ ID NO: 2, or a sequence at least 95% identical thereto, or a full-length complement thereof, or a functional fragment thereof. The polypeptide having serine hydroxymethyltransferase activity may comprise one or more mutations of SEQ ID NO: 2 selected from the group consisting of: 1107F, P200R, P200-, N459Y, and N459H.
The DNA construct may be constructed such that a soybean plant transformed with the DNA construct may have increased expression, an altered expression pattern, or an increased copy number of the second polynucleotide compared to a control soybean plant that has not been transformed with the DNA construct.
The amino acid sequences and nucleic acid sequences described herein may contain various mutations. Mutations may include insertions, substitutions, and deletions. Insertions are written as follows: (+)(amino acid/nucleic acid sequence position number)(inserted amino acid/nucleic acid base). For example, +287A would mean an insertion of an alanine residue after position 287 in the corresponding amino acid sequence. Substitutions are written as follows: (amino acid/nucleic acid base to be replaced)(amino acid/nucleic acid sequence position number)(substituted amino acid/nucleic acid base). For example, C1082A would mean a substitution of an adenine base instead of a cytosine base at position 1082 in the corresponding nucleic acid sequence. Deletions are written as follows: (amino acid/nucleic acid base to be deleted)(amino acid/nucleic acid sequence position number)(-). For example, C970- would mean a deletion of the cytosine base normally located at position 970 in the corresponding nucleic acid sequence.
The amino acid sequences and nucleic acid sequences described herein may contain mutations at various sequence positions. Sequence positions may be written a variety a ways for convenience. More specifically, sequence positions may be written from either the beginning of the sequence as a positive position number, or from the end of the sequence as a negative number. Sequence positions may be converted easily between a positive notation and a negative notation by comparing to the sequence length and either adding or subtracting the sequence length. For example, a promoter containing 10 nucleic acid bases with a mutation from cytosine to adenine at the second position from the start of the sequence may be written as C2A. Alternatively, this mutation may be written as C(−9)A, −9C/A, or in a similar fashion denoting the negative position number.
The following definitions and methods are provided to better define the present disclosure and to guide those of ordinary skill in the art in the practice of the present disclosure. Unless otherwise noted, terms are to be understood according to conventional usage by those of ordinary skill in the relevant art.
The term “agronomically elite” refers to a genotype that has a culmination of many distinguishable traits such as emergence, vigor, vegetative vigor, disease resistance, seed set, standability, and threshability, which allows a producer to harvest a product of commercial significance.
An “allele” refers to one of two or more alternative forms of a genomic sequence at a given locus on a chromosome.
The term “chimeric” is understood to refer to the product of the fusion of portions of two or more different polynucleotide molecules. “Chimeric promoter” is understood to refer to a promoter produced through the manipulation of known promoters or other polynucleotide molecules. Such chimeric promoters can combine enhancer domains that can confer or modulate gene expression from one or more promoters or regulatory elements, for example, by fusing a heterologous enhancer domain from a first promoter to a second promoter with its own partial or complete regulatory elements. Thus, the design, construction, and use of chimeric promoters according to the methods disclosed herein for modulating the expression of operably linked polynucleotide sequences are encompassed by the present disclosure.
Novel chimeric promoters can be designed or engineered by a number of methods. For example, a chimeric promoter may be produced by fusing an enhancer domain from a first promoter to a second promoter. The resultant chimeric promoter may have novel expression properties relative to the first or second promoters. Novel chimeric promoters can be constructed such that the enhancer domain from a first promoter is fused at the 5′ end, at the 3′ end, or at any position internal to the second promoter.
A “construct” is generally understood as any recombinant nucleic acid molecule such as a plasmid, cosmid, virus, autonomously replicating nucleic acid molecule, phage, or linear or circular single-stranded or double-stranded DNA or RNA nucleic acid molecule, derived from any source, capable of genomic integration or autonomous replication, comprising a nucleic acid molecule where one or more nucleic acid molecule has been operably linked.
A construct of the present disclosure can contain a promoter operably linked to a transcribable nucleic acid molecule operably linked to a 3′ transcription termination nucleic acid molecule. In addition, constructs can include but are not limited to additional regulatory nucleic acid molecules from, e.g., the 3′-untranslated region (3′ UTR). Constructs can include but are not limited to the 5′ untranslated regions (5′ UTR) of an mRNA nucleic acid molecule, which can play an important role in translation initiation and can also be a genetic component in an expression construct. These additional upstream and downstream regulatory nucleic acid molecules may be derived from a source that is native or heterologous with respect to the other elements present on the promoter construct.
“Expression vector”, “vector”, “expression construct”, “vector construct”, “plasmid”, or “recombinant DNA construct” is generally understood to refer to a nucleic acid that has been generated via human intervention, including by recombinant means or direct chemical synthesis, with a series of specified nucleic acid elements that permit transcription or translation of a particular nucleic acid in, for example, a host cell. The expression vector can be part of a plasmid, virus, or nucleic acid fragment. Typically, the expression vector can include a nucleic acid to be transcribed operably linked to a promoter.
The term “genotype” means the specific allelic makeup of a plant.
The terms “heterologous DNA sequence”, “exogenous DNA segment” or “heterologous nucleic acid,” as used herein, each refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling. The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides. A “homologous” DNA sequence is a DNA sequence that is naturally associated with a host cell into which it is introduced.
“Highly stringent hybridization conditions” are defined as hybridization at 65° C. in a 6× SSC buffer (i.e., 0.9 M sodium chloride and 0.09 M sodium citrate). Given these conditions, a determination can be made as to whether a given set of sequences will hybridize by calculating the melting temperature (Tm) of a DNA duplex between the two sequences. If a particular duplex has a melting temperature lower than 65° C. in the salt conditions of a 6× SSC, then the two sequences will not hybridize. On the other hand, if the melting temperature is above 65° C. in the same salt conditions, then the sequences will hybridize. In general, the melting temperature for any hybridized DNA:DNA sequence can be determined using the following formula: Tm=81.5° C.+16.6(logio[Na+])+0.41(fraction G/C content)−0.63(% formamide)−(600/1). Furthermore, the Tm of a DNA:DNA hybrid is decreased by 1-1.5° C. for every 1% decrease in nucleotide identity (see Sambrook and Russel, 2006).
The term “introgressed,” when used in reference to a genetic locus, refers to a genetic locus that has been introduced into a new genetic background. Introgression of a genetic locus can thus be achieved through plant breeding methods and/or by molecular genetic methods. Such molecular genetic methods include, but are not limited to, various plant transformation techniques and/or methods that provide for homologous recombination, non-homologous recombination, site-specific recombination, and/or genomic modifications that provide for locus substitution or locus conversion.
The term “linked,” when used in the context of nucleic acid markers and/or genomic regions, means that the markers and/or genomic regions are located on the same linkage group or chromosome.
A “marker” means a detectable characteristic that can be used to discriminate between organisms. Examples of such characteristics include, but are not limited to, genetic markers, biochemical markers, metabolites, morphological characteristics, and agronomic characteristics.
A “marker gene” refers to any transcribable nucleic acid molecule whose expression can be screened for or scored in some way.
Certain genetic markers useful in the present disclosure include “dominant” or “codominant” markers. “Codominant” markers reveal the presence of two or more alleles (two per diploid individual). “Dominant” markers reveal the presence of only a single allele. The presence of the dominant marker phenotype (e.g., a band of DNA) is an indication that one allele is present in either the homozygous or heterozygous condition. The absence of the dominant marker phenotype (e.g., absence of a DNA band) is merely evidence that “some other” undefined allele is present. In the case of populations where individuals are predominantly homozygous and loci are predominantly dimorphic, dominant and codominant markers can be equally valuable. As populations become more heterozygous and multiallelic, codominant markers often become more informative of the genotype than dominant markers.
“Operably-linked” or “functionally linked” refers preferably to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a regulatory DNA sequence is said to be “operably linked to” or “associated with” a DNA sequence that codes for an RNA or a polypeptide if the two sequences are situated such that the regulatory DNA sequence affects expression of the coding DNA sequence (i.e., that the coding sequence or functional RNA is under the transcriptional control of the promoter). Coding sequences can be operably-linked to regulatory sequences in sense or antisense orientation. The two nucleic acid molecules may be part of a single contiguous nucleic acid molecule and may be adjacent. For example, a promoter is operably linked to a gene of interest if the promoter regulates or mediates transcription of the gene of interest in a cell.
The term “phenotype” means the detectable characteristics of a cell or organism that can be influenced by gene expression.
The term “plant” can include plant cells, plant protoplasts, plant cells of tissue culture from which a plant can be regenerated, plant calli, plant clumps and plant cells that are intact in plants or parts of plants such as pollen, flowers, seeds, leaves, stems, and the like. Each of these terms can apply to a soybean “plant”. Plant parts (e.g., soybean parts) include, but are not limited to, pollen, an ovule and a cell.
The term “population” means a genetically heterogeneous collection of plants that share a common parental derivation.
A “promoter” is generally understood as a nucleic acid control sequence that directs transcription of a nucleic acid. An inducible promoter is generally understood as a promoter that mediates transcription of an operably linked gene in response to a particular stimulus. A promoter can include necessary nucleic acid sequences near the transcription start site, such as, in the case of a polymerase II type promoter, a TATA element. A promoter can optionally include distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription.
A “quantitative trait locus (QTL)” is a chromosomal location that encodes for alleles that affect the expressivity of a phenotype.
A “transcribable nucleic acid molecule” as used herein refers to any nucleic acid molecule capable of being transcribed into a RNA molecule. Methods are known for introducing constructs into a cell in such a manner that the transcribable nucleic acid molecule is transcribed into a functional mRNA molecule that is translated and therefore expressed as a protein product. Constructs may also be constructed to be capable of expressing antisense RNA molecules, in order to inhibit translation of a specific RNA molecule of interest. For the practice of the present disclosure, conventional compositions and methods for preparing and using constructs and host cells are well known to one skilled in the art (Sambrook and Russel, 2006; Ausubel et al.; Sambrook and Russel, 2001; Elhai and Wolk).
The “transcription start site” or “initiation site” is the position surrounding a nucleotide that is part of the transcribed sequence, which is also defined as position+1. With respect to this site all other sequences of the gene and its controlling regions can be numbered. Downstream sequences (i.e., further protein encoding sequences in the 3′ direction) can be denominated positive, while upstream sequences (mostly of the controlling regions in the 5′ direction) can be denominated as negative.
The term “transformation” refers to the transfer of a nucleic acid fragment into the genome of a host cell, resulting in genetically stable inheritance. Host cells containing the transformed nucleic acid fragments are referred to as “transgenic” cells, and organisms comprising transgenic cells are referred to as “transgenic organisms”.
“Transformed,” “transgenic,” and “recombinant” refer to a host cell or organism such as a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome as generally known in the art. Known methods of polymerase chain reaction (PCR) include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially mismatched primers, and the like. The term “untransformed” refers to normal cells that have not been through the transformation process.
The terms “variety” and “cultivar” mean a group of similar plants that by their genetic pedigrees and performance can be identified from other varieties within the same species.
“Wild-type” refers to a virus or organism found in nature without any known mutation.
In some embodiments, numbers expressing quantities of ingredients, properties such as molecular weight, reaction conditions, and so forth, used to describe and claim certain embodiments of the present disclosure are to be understood as being modified in some instances by the term “about.” In some embodiments, the term “about” is used to indicate that a value includes the standard deviation of the mean for the device or method being employed to determine the value. In some embodiments, the numerical parameters set forth in the written description and attached claims are approximations that can vary depending upon the desired properties sought to be obtained by a particular embodiment. In some embodiments, the numerical parameters should be construed in light of the number of reported significant digits and by applying ordinary rounding techniques. Notwithstanding that the numerical ranges and parameters setting forth the broad scope of some embodiments of the present disclosure are approximations, the numerical values set forth in the specific examples are reported as precisely as practicable. The numerical values presented in some embodiments of the present disclosure may contain certain errors necessarily resulting from the standard deviation found in their respective testing measurements. The recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range. Unless otherwise indicated herein, each individual value is incorporated into the specification as if it were individually recited herein.
Nucleotide and/or amino acid sequence identity percent (%) is understood as the percentage of nucleotide or amino acid residues that are identical with nucleotide or amino acid residues in a candidate sequence in comparison to a reference sequence when the two sequences are aligned. To determine percent identity, sequences are aligned and, if necessary, gaps are introduced to achieve the maximum percent sequence identity. Sequence alignment procedures to determine percent identity are well known to those of skill in the art. Often publicly available computer software such as BLAST, BLAST2, ALIGN2 or Megalign (available from DNASTAR) software is used to align sequences. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full-length of the sequences being compared. When sequences are aligned, the percent sequence identity of a given sequence A to, with, or against a given sequence B (which can alternatively be phrased as a given sequence A that has or comprises a certain percent sequence identity to, with, or against a given sequence B) can be calculated as: percent sequence identity=X/Y100, where X is the number of residues scored as identical matches by the sequence alignment program's or algorithm's alignment of A and B and Y is the total number of residues in B. If the length of sequence A is not equal to the length of sequence B, the percent sequence identity of A to B will not equal the percent sequence identity of B to A.
In some embodiments, the terms “a,” “an,” “the,” and similar references used in the context of describing a particular embodiment (especially in the context of certain claims) can be construed to cover both the singular and the plural, unless specifically noted otherwise. When used in conjunction with the word “comprising” or other open language in the claims, the words “a” and “an” denote “one or more,” unless specifically noted.
In some embodiments, the term “or” as used herein, including the claims, is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive.
The terms “comprise,” “have” and “include” are open-ended linking verbs. Any forms or tenses of one or more of these verbs, such as “comprises,” “comprising,” “has,” “having,” “includes” and “including,” are also open-ended. For example, any method that “comprises,” “has” or “includes” one or more steps is not limited to possessing only those one or more steps and can also cover other unlisted steps. Similarly, any composition or device that “comprises,” “has” or “includes” one or more features is not limited to possessing only those one or more features and can cover other unlisted features.
All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g. “such as”) provided with respect to certain embodiments herein is intended merely to better illuminate the present disclosure and does not pose a limitation on the scope of the present disclosure otherwise claimed. No language in the specification should be construed as indicating any non-claimed element essential to the practice of the present disclosure.
Groupings of alternative elements or embodiments of the present disclosure disclosed herein are not to be construed as limitations. Each group member can be referred to and claimed individually or in any combination with other members of the group or other elements found herein. One or more members of a group can be included in, or deleted from, a group for reasons of convenience or patentability. When any such inclusion or deletion occurs, the specification is herein deemed to contain the group as modified thus fulfilling the written description of all Markush groups used in the appended claims.
All publications, patents, patent applications, and other references cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application or other reference was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Citation of a reference herein shall not be construed as an admission that such is prior art to the present disclosure.
Having described the present disclosure in detail, it will be apparent that all of the compositions and methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this disclosure have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and methods and in the steps or in the sequence of steps of the methods described herein without departing from the concept, spirit and scope of the disclosure. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the disclosure as defined by the appended claims. Furthermore, it should be appreciated that all examples in the present disclosure are provided as non-limiting examples.
The following non-limiting examples are provided to further illustrate the present disclosure. It should be appreciated by those of skill in the art that the techniques disclosed in the examples that follow represent approaches the inventors have found function well in the practice of the present disclosure, and this can be considered to constitute examples of modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments that are disclosed and still obtain a like or similar result without departing from the spirit and scope of the present disclosure.
As described further below, WGRS data from a diverse panel of 106 soybean accessions was utilized, including wild accessions, exotic germplasm, breeding lines, and varieties, to investigate the two major SCN resistance loci using genome data mining approaches. These efforts provide new insight into the interconnectedness of haplotype compatibility, copy number variation (CNV), promoter variation and gene expression with broad-based SCN resistance.
One hundred and six (106) soybean accessions and ‘Forrest’ indicator lines in the present study were evaluated for resistance to different HG Types of SCN. Homogenous nematode populations of races PA1 (HG Type 2.5.7), PA2 (HG Type 1.2.5.7), PA3 (HG Type 0), PA5 (HG Type 2.5.7), and PA14 (HG Type 1.3.5.6.7) have been maintained at the University of Missouri for more than 30 generations. The SCN bioassays were performed in a greenhouse at the University of Missouri following a well-established method [Arelli et al., 1997]. Briefly, soybean seeds were germinated in paper pouches for 3-4 days and were then transplanted into PVC tubes (100 cm3) (one plant per tube). The tubes were filled with steam pasteurized sandy soil and packed into plastic containers prior to transplanting. Each container held 25 tubes and was suspended over water baths maintained at 27±1° C. Five plants of each indicator line were arranged in a randomized complete block design. Two days after transplanting, each plant was inoculated with 2000±25 SCN eggs. Thirty days post inoculation, nematode cysts were washed from the roots of each plant and counted using a fluorescence-based imaging system [Brown et al.]. The female index (FI %) was estimated to evaluate the response of each plant to each race of SCN using the following formula: FI (%)=(average number of female cyst nematodes on a given individual/ average number of female nematodes on the susceptible check)×100. The FI values for all 106 lines are shown in
The 106 soybean germplasm lines sequenced at approximately 17× genome coverage were utilized for mapping and detection of allelic variants [Valliyodan et al.]. The paired—end resequencing reads were mapped to the soybean reference genome, Williams 82 version 2 (W82.a2.v1.1) with BWA as described previously [Zhou et al., 2015; Valliyodan et al.]. SNP and Indels detection was performed using Genome Analysis Toolkit (GATK, V3.4.0) [McKenna et al.] and SAMTools. For Indel calling, insertions and deletions shorter than or equal to 6 bp were taken into consideration. CNV were detected according to depth distribution of each line [Zhou et al., 2015]. Regions were regarded as CNVs if their minimum length was greater than 2 kb and their mean depth was less than half of the sequence depth or more than double of the sequence depth. The initial and final minimum probability to merge the adjacent breakpoint were set to 0.5 and 0.8, respectively. Additionally, CNV of indicator lines was visualized using GenomeBrowse. Haplotype analysis of the Rhg1 and Rhg4 loci was performed using a pipeline as previously described [Patil et al., 2016]. Briefly, SNP haplotypes were examined by generating map and genotype data files and clustering pictorial output for the Rhgl and Rhg4 genomic regions were visualized using FLAPJACK [Milne et al.]. The SNP identified from each line were clustered based on Neighbor-Joining (NJ) tree output and SNPs were further analyzed for possible synonymous/non-synonymous variation by translation into amino acid sequences. The SNP diversity, average pairwise divergence within population (Ow), Watterson's estimator (θw), and FST were estimated as previously described [Valliyodan et al.].
Comparative genomic hybridizations (CGH) assay was adapted as described [McHale et al.; Dobbels et al.]. The Taqman assay and digital PCR were performed as previously described [Kadam et al.; Wan et al.]. Briefly, 20 μl reaction was prepared, consisting of 10 μl 2× master reaction mix (Life Technologies, Mass., USA), 1 μl assay mix (18 μM Forward and 18 μM reverse primers+5 μM probe), 1 μl DNA, and 9 μl ddH2O. A 14.5 μl of the PCR mixture was loaded onto a QuantStudio™ 3D Digital PCR 20K Chip. The chip was covered with immersion fluid, a lid was applied, the assembly was filled with immersion fluid, and the loading port was sealed according to the manufacturer's instructions. The chips were loaded into the Dual Flat Block GeneAmpR PCR System 9700 (Life Technologies, Waltham, Mass., USA), and PCR was performed using the following conditions: 96 ° C. for 10 min; 60 ° C. for 2 min and 98 ° C. for 30 seconds, for 39 cycles; 60 ° C. for 2 min; 10 ° C. for storage. The Digital PCR 20K Chip was read using the QuantStudio™ 3D Digital PCR Chip Reader, and the data was analyzed using the QuantStudio™ 3D AnalysisSuiteTM Software (Thermo Fisher Scientific, Waltham, Mass., USA).
Aliquots of the genomic DNA samples isolated for whole-genome resequencing were used in PCR reactions. The PCR reactions were conducted using PrimeSTAR GXL DNA Polymerase (Takara Bio USA, Inc., formerly known as Clontech Laboratories, Mountain View, Calif., USA), according to the manufacturer's instructions.
Homology modeling of a putative GmSNAP18 and GmSHMT08 protein structure was conducted as previously described [Liu et al., 2017]. To induce and map the corresponding existing natural mutations (haplotypes) between the susceptible and resistant soybeans lines of the GmSHMT08 protein, the structural editing tool from UCSF Chimera package was employed. Additionally, the impact of catalytic activity of the enzyme homodimerization, tetramerization and/or substrate binding was studied. Approximately 5.0 angstroms containing all atoms/bonds of any residue surrounding the mutated residue has been selected first and shown in the model to study all possible residue interactions. Next, the rotamers tool was used to mutate the three residues and study their possible impact on protein activity and/or structure.
Three-day old soybean seedlings of different indicator lines were germinated and inoculated with freshly hatched second-stage juveniles of SCN race PA3, PAS and PA14 as previously described [Rambani et al.]. Three biological samples of inoculated and non-inoculated root tissues were collected at 2 days' post inoculation and used for RNA extraction and qPCR analysis. Total RNA was isolated using Qiagen RNeasy Plant Mini Kit (cat #74904) from root samples collected two days after SCN infection. Total RNA was DNase treated and purified using Turbo DNA-free Kit (QAmbion/Life Technologies AM1907). RNA was quantified using Nanodrop 1000 (V3.7), then a total of 400 nanograms of treated RNA was used to generate cDNA using the cDNA synthesis Kit (Thermoscript, Life Technologies, #11146-025), with random hexamers. About 1/10th of a 20 microliter reverse transcription reaction was used in gene specific qPCR with the Power SYBR® Green PCR Master Mix Kit (Applied Biosystems™ #4368706). Primers used in this study were described previously [Rambani et al.]. For each line, RNA from three biological replicates were used for quantification and then normalized using the deltadelta Cq method with Ubiquitin used as a reference gene (ΔCq=Cq(TAR)−Cq(REF). Each gene's expression was exponentially transformed to the expression level using the formula (ΔCq Expression=2·ΔCq). Each sample was run in parallel with a control in which RT was not included in the cDNA synthesis reaction.
Methods proceeded according to Examples 1-6, unless described otherwise.
In soybean, the SCN resistance QTL on chromosomes 18 (Rhg1) and 8 (Rhg4) are the two major QTL that have been identified and reported in several publications [Vuong et al.; Liu et al., 2012; Cook et al., 2012]. To investigate the sequence diversity and disequilibrium of the Rhg1 and Rhg4 loci, 1-Mb regions on either side of these loci were analyzed in 106 WGRS lines representing >96% of the sequence diversity [Valliyodan et al.]. The value of θπ, θw, and Tajima's D were estimated for related regions using sliding windows of 50kb extreme allele frequency differentiation over extended linked regions was observed. As the location neared the Rhg1 locus, On increased greatly in the 100-kb region (
Methods proceeded according to Examples 1-6, unless described otherwise.
The genetic diversity at SCN resistance loci provided an opportunity to obtain an overview of the haplotype variation at both the Rhg1 and Rhg4 loci. As reported earlier, three genes (Glyma.18g022400, Glyma.18g022500 and Glyma.18g022600) at the Rhg1 locus together confer resistance to SCN in PI 88788 [Cook et al., 2012]. Despite a high number of sequence polymorphisms found within each Rhg1 repeat in SCN-resistant lines, the SNPs that cause an altered amino acid sequence (non-synonymous) were identified only in the Glyma.18g022500 (GmSNAP18) gene (
Similar to the Rhg1 locus, analysis of the sequence variation, CNV, and haplotypes at the Rhg4 locus encompassing three genes (Glyma. 08g108800, Glyma.08g108900 and Glyma.08g109000) was performed. The gene Glyma.08g108900, encoding Serine hydroxymethyltransferase (GmSHMT08), showed three nonsynonymous SNPs associated with the SCN reaction (
To further confirm the CNV estimated using WGRS data of both Rhg1 and Rhg4 loci (
Methods proceeded according to Examples 1-6, unless described otherwise.
Haplotype analysis revealed that only three non-synonymous SNPs at the GmSHMT08 gene showed a strong association with both CNV of Rhg4 loci and SCN resistance (
The protein homodimers play a critical role in catalysis and regulation through the formation of stable interfaces [Karthikraja et al.]. The homodimer-homodimer interface of the GmSHMT08 protein at P13OR (corresponding to P200R in
Methods proceeded according to Examples 1-6, unless described otherwise.
Based on the WGRS information, the genomic region surrounding the cloned Rhg4 gene GmSHMT08 [Liu et al., 2012] appeared to be duplicated in at least 11 of the 106 sequenced genomes (
Since this 24.8-kb length is rather close to the estimated 30-kb duplicated region, it was speculated that the ends of this 24.8-kb region were likely close to the junction between two neighboring repeats. If this is the case, it may be possible to amplify by PCR this junction region in the lines with duplications using two outward end primers of the 24.8-kb region as depicted graphically in
Methods proceeded according to Examples 1-6, unless described otherwise.
The presence of CNV for the Rhg1 locus is common (or frequent) when compared to the Rhg4 locus (
Based on epistatic interactions of the GmSNAP18 and GmSHMT08, the 106 soybean lines were grouped into six categories that showed strong associations between genotypic variation (CNV and non-synonymous changes) and nematode susceptibility/resistance phenotypes (
Methods proceeded according to Examples 1-6, unless described otherwise.
These Examples have shown that resistant alleles contain either nine or three natural point mutations in the GmSNAP18 and GmSHMT08 proteins, respectively, when compared to the susceptible alleles. Out of the 106 lines examined, 14 lines carry resistant alleles at both the Rhgl-a and the Rhg4-a/Rhg4-c haplotypes, corresponding to the Peking'-type of resistance. However, the other 30 SCN resistant lines, corresponding to both ‘Cloud’- and PI 88788-type of resistance, carry the resistant Rhg1-a (11 lines), Rhg1-b (8 lines), and Rhgl-b1 (11 lines) haplotype, but all contain the Rhg4-b susceptible allele. Interestingly, PI 407729 carries both susceptible alleles at the Rhg1-c and the Rhg4-b loci, but exhibited resistance to all five races. In order to gain more insight into SCN resistance in this line, a haplotype analysis clustering of all the 106 lines at the promoter level of both genes was performed (
Similarly, the haplotype analysis of GmSNAP18 promoter (˜1.5 Kb) showed that the majority of the resistant lines carry a specific promoter haplotype (
Methods proceeded according to Examples 1-6, unless described otherwise.
To gain more insight into the impact of the identified CNV on both the GmSNAP18 and GmSHMT08 transcripts, qRT-PCR analysis was carried out in a number of lines representing different subgroups. Based on the haplotype combinations and CNV, five indicator lines including ‘Essex’, ‘Peking’, PI 437654, PI 090763, and PI 88788 were selected, and screened in the presence and absence of the nematode infection (
Recently, it has been shown that GmSNAP18 transcripts were induced in ‘Forrest’ (carrying the Rhg1-a and Rhg4-a haplotypes) and PI 88788 (carrying the Rhg1-b and Rhg4-b haplotypes) in response to SCN infection, whereas the susceptible line ‘Essex’ (carrying the Rhg1-c and Rhg4-b haplotypes) showed very low mRNA levels of GmSNAP18 [Liu et al., 2017]. In Forrest, GmSNAP18 transcripts showed about 2-fold upregulation in SCN-infected roots compared to non-infected roots at 3 and 5 days post SCN infection (dpi). Similarly, in PI 88788 GmSNAP18 transcripts showed 2-fold upregulation in SCN infected root compared to non-infected control at 5 dpi. GmSHMT08 transcripts were also found to be induced in both ‘Forrest’ and PI 88788 soybean lines [Kandoth et al.]. Similarly, the expression of ‘Essex’, ‘Peking’, and PI 436754 in response to infection by three SCN races (PA3, PA5, and PA14) at 2 dpi was investigated. The analysis demonstrated that GmSNAP18 transcripts (underlying Rhgl-a haplotype) were induced in the presence of the three nematode races in both ‘Peking’ and PI 436754 (
Methods proceeded according to Examples 1-6, unless described otherwise.
Soybean germplasm provides a wide range of SCN resistance that is controlled by natural variants (SNP and CNV) at two major loci, Rhg1 and Rhg4. In these Examples, high-quality deep sequencing information (˜15× genome coverage) for the Rhg1 and Rhg4 loci were utilized and haplotypes associated with SCN resistance to five races were identified. Haplotype analysis also identified SNPs associated with CNV. The CNV of the Rhg1 alleles, which carries 2 to 10 copies across different soybean varieties, is a well-known phenomenon [Lee et al.; Cook et al., 2014]. It is not surprising that nearly identical results for CNV of the Rhg1 locus were obtained, which is also related to the SCN-resistant efficacy, as previously reported. It was interesting, however, that increased copy number of the Rhg4 gene was observed in 11 soybean lines, ranging from 1.2 to 4.3 copies. The copy number increases were confirmed using different molecular platforms, including Digital-PCR, Taqman assay and CGH. Furthermore, a tandem repeat structure at the Rhg4 locus was also confirmed. A sequence of 35.7-kb was found duplicated at the Rhg4 locus in ‘Peking’, PI 437654 and PI 438489B. The duplicated region contains four genes, including the cloned Rhg4 gene, which encodes a serine hydroxymethyltransferase (SHMT). This new discovery provides a new insight for the SCN resistance mechanism at the Rhg4 locus.
During the last decade, many studies examined segmental duplication and genome re-sequencing applications, with a special focus on the identification of CNVs [Zarrei et al.; Sharp et al.; de Koning et al.]. In fact, deletions and duplications are considered to be major contributions to the genome variability, playing important roles in generating variation among many traits, including disease phenotypes. Many studies explored the human genomes for genetic disorders and identified a range of variants [Inoue & Lupski; Perry et al., 2007; Myers; Albertini et al.; Macdonald et al.]. However, CNV is an important type of structural variation because of its varied evolutionary impacts, stimulating genomic rearrangements, and gene dosage effects [Olsen & Wendel; Moore & Purugganan; Flagel & Wendel]. Different types of CNV have been observed in diverse organisms, including humans and chimpanzees [Perry et al., 2008], rats [Aitman et al.], Arabidopsis [DeBolt], extremophile crucifer [Dassanayake et al.] and Plasmodium falciparum [Heinberg et al.]. In soybean, it has been previously reported that copy number of three genes together, at the Rhgl-b locus, encoding a Soluble NSF Attachment Protein (a-SNAP), an Amino Acid Transporter (AAT), and a Wound-Inducible domain (WI12), mediates nematode resistance in soybean PI 88788 type of resistance [Cook et al., 2012; Bayless et al., 2018]. These Examples provide strong evidence that CNV of GmSHMT08 at the Rhg4 locus also plays a significant role in SCN resistance. Interestingly, mutations in human SHMT have been linked to a wide range of diseases [Maddocks et al.; Skibola et al.; Lim et al.]. Moreover, an shmt knockout mutant was shown to induce apoptosis in lung cancer cells by causing uracil misincorporation [Paone et al.]. Therefore, the findings on SHMT allelic variation in these Examples may have implications beyond the field of plant pathology, as similar variants may be important within the field of pharmacogenomics due to SHMT's involvement in human cancer.
These Examples demonstrated that the resistant allele contains three critical spontaneously occurring natural point mutations resulting in four amino acid changes; I37F (0.94%), P13OR (15.1%), N358Y (11.32), and Y358H (1.88%) at the GmSHMT08 protein when compared to the susceptible alleles. Homology modeling suggests that these point mutations may impair the key regulatory property of the encoded GmSHMT08 enzyme, including subunit associations (Dimerization and tetramerization), PLP cofactor and substrate binding, and catalytic site. The altered enzyme may further influence the folate homeostasis in soybean root cells, and ultimately restrict the growth of cyst nematodes in susceptible soybean lines, as has been suggested previously [Liu et al., 2012]. The current study demonstrated that the resistant Rhg4 allele was detected in 13.2% of the sequenced soybean lines representing the USDA Soybean Germplasm Collection, including ‘Peking’. Additionally, it has been reported that overexpression of Rhg4-‘Peking’ in roots of SCN-susceptible cultivar ‘Williams 82’ greatly reduced nematode parasitism [Matthews et al.].
Methods were according to Examples 1-6, unless described otherwise.
Since the discovery of SCN resistance QTL, most of the varieties in the U.S. trace back to ‘Peking’- and/or PI 88788-type of resistance. Due to the effectiveness of the high copy Rhg1 from PI 88788 source, it was frequently utilized (over 95%) by breeders to develop elite cultivars. However, limited variation, especially at the Rhg4 locus was captured in the recent breeding programs. The effectiveness of PI 88788-type resistance is breaking down due to continuous cropping of soybean varieties derived from PI 88788. Another reason could be that the Rhg1-type of resistance was sufficient at the time of development. However, due to virulence and adaptation of SCN populations, the high copy Rhg1 is not sufficient to confer broad-based resistance unless a new epistatically interacting (additive) resistant haplotype is substituted. The lack of genetic diversity and/or the right combination of resistant haplotypes has led to a widespread shift towards virulence in SCN populations. Analysis from these Examples showed that susceptibility phenotypes associated with low copies of Rhg1 could be overcome by incorporating Rhg4 alleles.
The 106 WGRS set contains 57 elites, 44 landraces, and 7 wild soybean lines [Valliyodan et al.]. None of the elite lines carry multiple copies at the Rhg4 locus and most of the lines (49/57) were highly susceptible to two or more SCN races (
Methods proceeded according to Examples 1-6, unless described otherwise.
It has been reported that the interaction of two or more alleles (epistasis) plays a major role in an organism's resistance to diseases and pests [Nagel; Bayless et al., 2016]. The Rhg1 GmSNAP18 protein interacts with NSF (N-ethylmaleimide-sensitive factor) protein and disturbs vesicle trafficking [Bayless et al., 2018; Bayless et al., 2016]. It is also well-documented that epistasis occurs in ‘Peking’-derived SCN resistance, in which the ‘Peking’-type Rhg1-a has high efficacy when the ‘Peking’-type Rhg4 is also present [Brucker et al.]. However, until now the genetic basis underlying high efficacy resistance was unknown. The present study shows that all the 106 soybean lines were grouped into six categories based on the genomic variation of Rhg1 and Rhg4 loci (
Interestingly, among SCN resistant PIs characterized in the present study, PI 407729, did not carry any known SCN resistance loci (Rhg4 or Rhg1) but still showed resistance to multiple SCN races. This can be explained, in part, by the presence of the SNP in the GmSHMT08+ promoter. These variations may correspond to trans-acting elements that can regulate other novel genes involved in SCN resistance beside classic Rhg1 and Rhg4 loci, and hence warrants further promoter analysis and gene functional characterization. Furthermore, genetic mapping of the PI 407729 resistant QTL may reveal a previously unknown SCN resistance locus, conferring a unique mode of resistance. Results obtained from the current study demonstrated that broad-based resistance to multiple SCN races requires very specific haplotypes of the Rhg1 and Rhg4 loci at the promoter, amino acid sequences and CNV. In fact, the type of interaction between the different alleles confers resistance to a given race that is haplotype-dependent. This study shows that having more copies of GmSHMT08 provides more transcript abundance, therefore reinforcing the resistance to SCN. Similar observations have been also revealed in the case of the GmSNAP18 gene.
The genetic basis for broad-based resistance to multiple races elucidated in the present study will greatly benefit soybean breeders in the development of SCN-resistance varieties. In addition, it will also help to select parental lines to design future crosses and trait introgressions. The SNP marker assays associated with CNV and SNP/indels can be used to stack multi-copy of the Rhgl-b (PI88788-type of resistance) or Rhg4 (‘Peking’ type resistance) for breeding purposes and it will provide more sources for broad-spectrum SCN resistance.
In summary, results obtained from the Examples reveal several new discoveries. (1) The Rhg4 locus is a highly repeated region similar to the Rhg1 locus, likely consisting of a 35.7-kb tandem repeat unit. Eleven lines with resistance to multiple races of SCN exhibited a CNV of 2.1 to 4.3 copies of Rhg4 coupled with a ‘Peking’-type Rhgl-a with copy numbers ranging from 1.9 to 3.5. (2) The lines with PI 88788-type Rhgl-b haplotypes required greater than 5.6 copies to confer resistance to SCN races 3 and 14, regardless of the Rhg4 haplotype. (3) When GmSNAP18 copy number dropped below 5.6 copies, a Peking type GmSHMT08 haplotype was required to ensure resistance to SCN pointing to a novel mechanism of epistasis between the GmSNAP18 and GmSHMT08 involving minimum requirements for copy numbers at both loci. (4) ‘Cloud’-type Rhg1 performed better than ‘PI 88788’-type Rhg1 and required less GmSNAP18 copy numbers to confer SCN resistance. (5) When soybean lines cumulated more copies of the GmSHMT08 gene, they acquired broad resistance to SCN. (6) Soybean lines with low CNV (1 to 3 copies) of Peking'-type Rhgl-a but lacking Rhg4 allele showed resistance only to SCN race 5. (7) Both Rhg1 and Rhg4 loci were in strong LD with the surrounding regions of the genome. (8) Expression analysis showed that transcript abundance of the GmSHMT08 in root tissue correlates with more copies of the Rhg4 locus, reinforcing the resistance to SCN. (9) Haplotype analysis of the GmSHMT08 and GmSNAP18 promoters provide an additional layer of the resistance mechanism. These findings provide new insight into epistatsis, haplotype compatibility, Copy Number Variants, promoter variation, and its impact on broad-based disease resistance.
Functional analysis was performed on the GmSHMT08 promoter carrying the four SNPs at four positions within the 2 Kb promoter F-GmSHMT08-ProΔ-757 TIA, F-GmSHMT08-ProΔ-1355 T/C, F-GmSHMT08-ProΔ-1785 T/C, F-GmSHMT08-ProΔ-1877 T/- independently. The F-GmSHMT08-ProΔ-757 T/A, ProΔ-1355 T/C, −1785 T/C, −1877 T/- construct carries all the four SNPs. Each construct contained the endogenous GmSHMT08 promoter, in addition to the GmSHMT08 coding sequence as shown in
A ExF12 (Essex x Forrest) RIL line carrying the resistant GmSNAP1+ allele but the susceptible GmSHMT08− allele [Lakhssassi et al.] has been used for the soybean composite root transformation.
ExF12 presented 97 cysts on average; therefore, it was completely susceptible to SCN.
As expected, susceptible ExF12 transgenic hairy root carrying the F-GmSHMT08-Pro::GmSHMT08-CDS (positive control) decreased the number of SCN cysts to nearly 11 in transgenic soybean roots. Both the GmSHMT08-WT endogenous promoter and the GmSHMT08-WT CDS responded to SCN infections and the ExF12 line become resistant to SCN.
Interestingly, when the construct carried the four susceptible SNPs at the Forrest GmSHMT08-Pro; F-GmSHMT08-ProΔ-757 T/A, −1355 T/C, −1785 T/C, −1877 T/-, the screened transgenic ExF12 lines presented 67 cysts in average, and therefore was susceptible to SCN. This suggests that at least one, two, three or the four SNPs tested on the F-GmSHMT08-Pro may be responsible for the observed susceptibility.
When tested independently, transgenic ExF12 lines' expressing the following independent constructs: F-GmSHMT08-ProΔ-757 T/A, F-GmSHMT08-ProΔ-1355 T/C, F-GmSHMT08-ProΔ-1785 T/C showed decreased in cyst number with 2, 4, and 3 cysts in average, respectively.
Interestingly, transgenic ExF12 lines expressing the F-GmSHMT08-ProΔ-1877 T/- construct presented 42 cysts on average, and therefore was susceptible to SCN. This directly points to the role of the SNP at position −1877 T/- (corresponding to the loss of the MADS SQUAMOSA-box TFBS) in SCN susceptibility/resistance.
Full data on cyst number present in tested lines with various GmSHMT08 promoter mutations is shown in
In total, 10 MADS SQUAMOSA-box Transcription Factor Binding Sites (TFBS) are present at the GmSHMT08 promoter of soybean susceptible lines. Five MADS SQUAMOSA-box were on the positive (+) strand, and the other five were present on the negative (−) strand (see
Five MADS SQUAMOSA-box were on the positive (+) strand.
Five MADS SQUAMOSA-box were on the negative (−) strand.
Within the 2 Kb GmSHMT08 promoter, the INDEL at position −1877 T/- was the only SNP that resulted to the loss of the MADS SQUAMOSA-box TFBS in resistant lines. All the other observed SNPs did not impact the presence of their corresponding TFBS between SCN resistant and susceptible lines.
Aflitos S et al. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing. The Plant Journal (2014), 80: 136-148.
Aitman TcJ et al. Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans. Nature (2006), 439: 851-855.
Albertini AcM et al. On the formation of spontaneous deletions: The importance of short sequence homologies in the generation of large deletions. Cell (1982), 29: 319-328.
Anderson JcE et al. A roadmap for functional structural variants in the soybean genome. G3: Genes, Genomes, Genetics (2014), 4: 1307-1318.
Arelli AcP et al. Soybean germplasm resistant to Races 1 and 2 of Heterodera glycines. Crop Science (1997), 37: 1367-1369.
Arelli PcR et al. Soybean reaction to Races 1 and 2 of Heterodera glycines. Crop Science (2000), 40: 824-826.
Arelli PcR et al. Inheritance of resistance in soybean PI 567516C to LY1 nematode population infecting cv. Hartwig. Euphytica (2009), 165: 1-4.
Ausubel et al. Short Protocols in Molecular Biology, 5th ed., Current Protocols, 2002.
Bayless A M et al. Disease resistance through impairment of α-SNAP-NSF interaction and vesicular trafficking by soybean Rhgl. Proceedings of the National Academy of Sciences (2016), 113: E7375-E7382.
Bayless A M et al. An atypical N-ethylmaleimide sensitive factor enables the viability of nematode-resistant Rhg1 soybeans. Proceedings of the National Academy of Sciences (2018), 115: E4512-E4521.
Brown S et al. A high-throughput automated technique for counting females of Heterodera glycines using a fluorescence-based imaging system. Journal of Nematology (2010), 42: 201-206.
Brucker E et al. Rhg1 alleles from soybean PI 437654 and PI 88788 respond differentially to isolates of Heterodera glycines in the greenhouse. Theoretical and Applied Genetics (2005), 111: 44-49.
Choi J-W et al. Whole-genome resequencing analyses of five pig breeds, including Korean wild and native, and three European origin breeds. DNA Research (2015), 22: 259-267.
Concibido V C et al. A decade of QTL mapping for cyst nematode resistance in soybean. Crop Science (2004), 44: 1121-1131.
Cook D E et al. Copy number variation of multiple genes at Rhg1 mediates nematode resistance in soybean. Science (2012), 338: 1206-1209.
Cook D E et al. Distinct copy number, coding sequence, and locus methylation patterns underlie Rhgl-mediated soybean resistance to soybean cyst nematode. Plant Physiology (2014), 165: 630-647.
Dassanayake M et al. The genome of the extremophile crucifer Thellungiella parvula. Nature Genetics (2011), 43: 913-918.
de Koning A P et al. Repetitive elements may comprise over two-thirds of the human genome. PLoS Genetics (2011), 7: e1002384.
DeBolt S. Copy number variation shapes genome diversity in Arabidopsis over immediate family generational scales. Genome Biology and Evolution (2010), 2: 441-453.
Dobbels A A et al. An induced chromosomal translocation in soybean disrupts a KASI ortholog and is associated with a high-sucrose and low-oil seed phenotype. G3: Genes, Genomes, Genetics (2017), 7: 1215-1223.
Elhai and Wolk. Conjugal Transfer of DNA to Cyanobacteria. Methods in Enzymology (1988), 167: 747-754.
Flagel L E & Wendel J F. Gene duplication and evolutionary novelty in plants. New Phytologist (2009), 183: 557-564.
Gardner M et al. Genetics and adaptation of soybean cyst nematode to broad spectrum soybean resistance. G3: Genes, Genomes, Genetics (2017), 7: 835-841.
Gibbs R A et al. The international HapMap project. Nature (2003), 426: 789-796.
Gore MA et al. A first-generation haplotype map of maize. Science (2009), 326: 1115-1117.
Heinberg A et al. Direct evidence for the adaptive role of copy number variation on antifolate susceptibility in Plasmodium falciparum. Molecular Microbiology (2013), 88: 702-712.
Huang X et al. Resequencing rice genomes: an emerging new era of rice genomics. Trends in Genetics (2013), 29: 225-232.
Inoue K & Lupski J R. Molecular mechanisms for genomic disorders. Annual Review of Genomics and Human Genetics (2002), 3: 199-242.
Jackson S A et al. Sequencing crop genomes: approaches and applications. New Phytologist (2011), 191: 915-925.
Kadam S et al. Genomic-assisted phylogenetic analysis and marker development for next generation soybean cyst nematode resistance breeding. Plant Science (2016), 242: 342-350.
Kandoth P K et al. Systematic mutagenesis of serine hydroxymethyltransferase reveals essential role in nematode resistance. Plant Physiology (2017), 175: 1370-1380.
Karthikraja V et al. Types of interfaces for homodimer folding and binding. Bioinformation (2009), 4: 101-111.
Lakhssassi N et al. Characterization of the soluble NSF attachment protein gene family identifies two members involved in additive resistance to a plant pathogen. Scientific Reports (2017), 7: 45226.
Lam H M et al. Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nature Genetics (2010), 42: 1053-1059.
Lam H M et al. Addendum: Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nature Genetics (2011), 43: 387-387.
Lee T G et al. Evolution and selection of Rhgl, a copy-number variant nematode-resistance locus. Molecular Ecology (2015), 24: 1774-1791.
Li Y H et al. De novo assembly of soybean wild relatives for pan-genome analysis of diversity and agronomic traits. Nature Biotechnology (2014), 32: 1045-1052.
Lim U et al. Polymorphisms in cytoplasmic serine hydroxymethyltransferase and methylenetetrahydrofolate reductase affect the risk of cardiovascular disease in men. Journal of Nutrition (2005), 135: 1989-1994.
Liu S M et al. A soybean cyst nematode resistance gene points to a new mechanism of plant resistance to pathogens. Nature (2012), 492: 256-260.
Liu S et al. The soybean GmSNAP18 gene underlies two types of resistance to soybean cyst nematode. Nature Communications (2017), 8: 14822.
Macdonald M A et al. A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington's disease chromosomes. The Huntington's Disease Collaborative Research Group. Cell (1993), 72: 971-983.
Maddocks O D et al. Serine metabolism supports the methionine cycle and DNA/RNA methylation through de novo ATP synthesis in cancer cells. Molecular Cell (2016), 61: 210-221.
Matthews B F et al. Engineered resistance and hypersusceptibility through functional metabolic studies of 100 genes in soybean to its major pathogen, the soybean cyst nematode. Planta (2013), 237: 1337-1357.
McHale L K et al. Structural variants in the soybean genome localize to clusters of biotic stress-response genes. Plant Physiology (2012), 159: 1295-1308.
McKenna A et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research (2010), 20: 1297-1303.
Milne I et al. Flapjack-graphical genotype visualization. Bioinformatics (2010), 26: 3133-3134.
Moore R C & Purugganan M D. The evolutionary dynamics of plant duplicate genes. Current Opinion in Plant Biology (2005), 8: 122-128.
Myers R. Huntington's disease genetics. NeuroRx (2004), 1: 255-262.
Nagel R L. Epistasis and the genetics of human diseases. Comptes Rendus Biologies (2005), 328: 606-615.
Niblack T L et al. Soybean cyst nematode in Illinois from 1990 to 2006: Shift in virulence phenotype of field populations. Journal of Nematology (2006), 38: 285-285.
Olsen K M & Wendel J F. A bountiful harvest: genomic insights into crop domestication phenotypes. Annual Review of Plant Biology (2013), 64: 47-70.
Paone A et al. SHMT1 knockdown induces apoptosis in lung cancer cells by causing uracil misincorporation. Cell Death & Disease (2014), 5: e1525.
Patil G et al. Soybean (Glycine max) SWEET gene family: insights through comparative genomics, transcriptome profiling and whole genome re-sequence analysis. BMC Genomics (2015), 16:520.
Patil G et al. Genomic-assisted haplotype analysis and the development of high-throughput SNP markers for salinity tolerance in soybean. Scientific Reports (2016), 6: 19199.
Perry G H et al. Diet and the evolution of human amylase gene copy number variation. Nature Genetics (2007), 39: 1256-1260.
Perry G H et al. Copy number variation and evolution in humans and chimpanzees. Genome Research (2008), 18: 1698-1710.
Qi X P et al. Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing. Nature Communications (2014), 5: 4340.
Rambani A et al. The methylome of soybean roots during the compatible interaction with the soybean cyst nematode, Heterodera glycines. Plant Physiology (2015), 168: 1364-1377.
Redon R et al. Global variation in copy number in the human genome. Nature (2006), 444: 444-454.
Rubin C-J et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature (2010), 464: 587-591.
Sambrook and Russel. Molecular Cloning: A Laboratory Manual, 3rd ed. Cold Spring Harbor Laboratory Press, 2001.
Sambrook and Russel. Condensed Protocols from Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 2006.
Schmutz J et al. Genome sequence of the palaeopolyploid soybean. Nature (2010), 465: 120-120. [Corrigendum of Schmutz J et al. Genome sequence of the palaeopolyploid soybean. Nature (2010), 463: 178-183.]
Schmutz J et al. A reference genome for common bean and genome-wide analysis of dual domestications. Nature Genetics (2014), 46: 707-713.
Sebat J et al. Large-scale copy number polymorphism in the human genome. Science (2004), 305: 525-528.
Sharp A J et al. Segmental duplications and copy-number variation in the human genome. American Journal of Human Genetics (2005), 77: 78-88.
Shlien A & Malkin D. Copy number variations and cancer. Genome Medicine (2009), 1: 62.
Skibola C F et al. Polymorphisms in the thymidylate synthase and serine hydroxymethyltransferase genes and risk of adult acute lymphocytic leukemia. Blood (2002), 99: 3786-3791.
Song Q et al. Construction of high resolution genetic linkage maps to improve the soybean genome sequence assembly Glyma1.01. BMC Genomics (2016), 17: 33.
Song Q et al. Genetic characterization of the soybean nested association mapping population. The Plant Genome (2017), 10: 10.3835.
Telenti A et al. Deep sequencing of 10,000 human genomes. Proceedings of the National Academy of Sciences (2016), 113: 11901-11906.
Valliyodan B et al. Landscape of genomic diversity and trait discovery in soybean. Scientific Reports (2016), 6: 23598.
Varshney R K et al. Whole-genome resequencing of 292 pigeonpea accessions identifies genomic regions associated with domestication and agronomic traits. Nature Genetics (2017), 49: 1082-1088.
Varshney R K et al. Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement. Nature Biotechnology (2013), 31: 240-246.
Vuong T D et al. Novel quantitative trait loci for broad-based resistance to soybean cyst nematode (Heterodera glycines Ichinohe) in soybean PI 567516C. Theoretical and Applied Genetics (2010), 121: 1253-1266.
Wan Jet al. Application of Digital PCR in the Analysis of Transgenic Soybean Plants. Advances in Bioscience and Biotechnology (2016), 7: 403-417.
Wang L H et al. Genome sequencing of the high oil crop sesame provides insight into oil biosynthesis. Genome Biology (2014), 15: R39.
Wrather J A & Koenning S R. Estimates of disease effects on soybean yields in the United States 2003 to 2005. Journal of Nematology (2006), 38: 173-180.
Wu X et al. Q T L, additive and epistatic effects for SCN resistance in PI 437654. Theoretical and Applied Genetics (2009), 118: 1093-1105.
Xu X et al. Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nature Biotechnology (2012), 30: 105-111.
Yano K et al. Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice. Nature Genetics (2016), 48: 927-934.
Zarrei M et al. A copy number variation map of the human genome. Nature Reviews Genetics (2015), 16: 172-183.
Zhou Z K et al. Resequencing 302 wild and cultivated accessions identifies genes related to domestication and improvement in soybean. Nature Biotechnology (2015), 33: 408-414.
Zhou X et al. Population genomics reveals low genetic diversity and adaptation to hypoxia in snub-nosed monkeys. Molecular Biology and Evolution (2016), 33: 2670-2681.
This application claims the benefit of U.S. Provisional Application Ser. No. 62/791,637, filed Jan. 11, 2019, the entire disclosure of which is incorporated herein by reference.
This invention was made with government support under grant number S1066 awarded by the United States Department of Agriculutre, National Institute of Food and Agriculture. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62791637 | Jan 2019 | US |