The present invention relates to the field of plant biology and biotechnology. Specifically, the present invention relates to a method of plant breeding in order to identify plants by means of molecular markers, with higher resistance to diseases, more specifically plants of the genus Glycine and fungal diseases.
Soybean belongs to the botanical genus Glycine, more precisely to the family Fabaceae (legumes). Some 727 genera and 19,325 species are recognized (LEWIS, G. P.; SCHRIRE, B. D.; MACKINDER, B. A.; LOCK, J. M. Legumes of the World. Royal Botanic Gardens, Kew. p. 577, 2005) representing one of the largest families of Angiosperms and also one of the leading ones from an economic point of view.
This family has a cosmopolitan distribution and its main characteristic, although there are exceptions, is the vegetable-type fruit (pod). In addition, it ranges from tree species to annual herbaceous species, many of great economic importance, primarily, to feed (soy, beans, among others).
In addition, representatives of this family still have great ecological importance, as they are well adapted to the first colonization and exploitation of diverse environments, mainly due to their associations with nitrogen-fixing bacteria or with ectomycorrhizae. Bacteria of the genus Rhizobium, located in root nodules found in many species, convert atmospheric nitrogen into ammonia, a soluble form that can be used by other plants, resulting in species extremely valuable as suppliers of natural fertilizers (LEWIS, G. P. Legumes of Bahia. Royal Botanic Gardens, Kew. p. 369, 1987).
Soy (Glycine max) is one of the most important representatives of the Fabaceae family. In the 1970s, soy became consolidated as the main crop in Brazilian agribusiness. The producer has used all means to increase the use of technology, in order to reduce their costs, increase their productivity, and thereby improve their profitability. Thus, soybean productivity jumped from 2,823 kg/ha in the 2006/07 harvest, to 3,394 kg/ha in the 2017/18 harvest, a 20% increase (Monitoring Brazilian grain harvest, v. 6-2018/19 Crop-Tenth survey, Brasilia). The most recent data show that soy generates revenues of R$148.6 billion in 2018 and the highest revenue in exports, having reached US$40 billion in the same year (Cleonice de Carvalho, et al. Brazilian soybean yearbook 2019. Santa Cruz do Sul: Editora Gazeta Santa Cruz, P. 14, 2019).
The worldwide demand for quality animal protein, especially poultry, is continuously increasing around the world (HENCHION, M.; McCARTHY, M.; RESCONI, V. C.; TROY, D. Meat consumption: trends and quality matter. Meat Science, v.98, p.561-568, 2014.). Thus, this growing demand also generates an increase in the demand for protein meals used in the manufacture of animal feed, usually derived from soybeans (Embrapa (2011) Soybean Production Technologies, Central Region of Brazil 2012 and 2013. Londrina PR. Embrapa Soja).
World consumption of soybeans in crop year 2019/20 is projected to increase to 352 million tons, up from 345 million tons consumed in 2018/19 (Cleonice de Carvalho, et al. Brazilian soybean yearbook 2019. Santa Cruz do Sul: Editora Gazeta Santa Cruz, P. 14, 2019).
Furthermore, the area under soybean cultivation grew when comparing the period 2017/18 with 2018/19 from 124.52 million hectares to 125.64 million hectares (USDA, Global Market Analysis, February 2020).
Due to the economic importance of soybean in the Brazilian agricultural scenario, soybean breeding programs aim to develop cultivars that are more productive and resistant to diseases and pests present in the different regions of Brazil. A key part of the success of breeding programs for the selection of resistant genotypes lies in the use of inoculum sources (fungal isolates) representative of local diversity with known virulence spectrum and aggressiveness (Bermejo, Gabriela Rastelli. Genetic diversity of Brazilian isolates of Phakopsora pachyrhizi (Sydow & Sydow)/Gabriela Rastelli Bermejo; orientation Mayra Costa da Cruz Gallo de Carvalho-Bandeirantes: State University of Northern Parana, 2016).
In this scenario, improving soybean for resistance or tolerance to various pathogens is crucial to decrease constraining factors and maximize productivity. Among the pathogens, the fungus Corynespora cassiicola stands out (Berk. & M. A. Curtis) C. T. Wei, the etiological agent of the disease known as target spot. It is considered one of the most economically important diseases for soybean production in Brazil, especially in the Cerrado region (Almeida AMR, Ferreira L P, Yorinori J T, Silva J F V, Henning A A, Godoy C V, Costamilan L M, Meyer M C (2005) Soybean diseases. In: Kimati H, Amorim L, Rezende J A M, Bergamin Filho A, Camargo L E A (Eds.). Handbook of Plant Pathology—Vol. 2. Diseases of Cultivated Plants. 4. ed. Sao Paulo SP. Editora Agronômica Ceres. pp. 570-588).
The aforementioned fungus is found in virtually all soybean-growing regions of Brazil. Believed to be native and with the ability to infect a large number of plant species, such as cotton, increasing its adaptability in areas where soybean-cotton crop succession is performed (GALBIERI, R.; ARAÚJO, D. C. E. B.; KOBAYASTI, L.; GIROTTO, L.; MATOS, J. N.; MARANGONI, M. S.; ALMEIDA, W. P.; MEHTA, Y. R. Corynespora leaf blight of cotton in Brazil and its management. American Journal of Plant Sciences 5: 3805-3811. 2014).
This microorganism can survive on crop remains and infected seeds, which is one form of dissemination. It is estimated that the disease can cause a yield reduction of 24%, with variations between 8-42% in soybean crops with high disease pressure (GALBIERI, R.; ARAÚJO, D. C. E. B.; KOBAYASTI, L.; GIROTTO, L.; MATOS, J. N.; MARANGONI, M. S.; ALMEIDA, W. P.; MEHTA, Y. R. Corynespora leaf blight of cotton in Brazil and its management. American Journal of Plant Sciences 5: 3805-3811. 2014)
Severe but sporadic outbreaks have been observed in the cooler regions of the South and in the high Cerrados regions. Susceptible cultivars can suffer complete premature defoliation, pod rot, and stalk spotting. Through infection in the pod, the fungus can reach the seed and thus be spread to other areas. Infection, in the suture region of the developing pods, can result in necrosis, pod splitting, and germination or rotting of the still-green kernels (Embrapa (2011) Soybean Production Technologies, Central Region of Brazil 2012 and 2013. Londrina PR. Embrapa Soja.).
Conditions of high relative humidity and mild temperatures are favorable for leaf infection. The most common symptoms are leaf spots, with a yellowish halo and dark punctuation in the center, which cause severe defoliation. Stains also occur on the stem and pod. The fungus can infect roots, causing root rot and intense sporulation (Henning et al., 2005, supra).
In this sense, in general, infection by this pathogen can be observed in all parts of the plants above ground (GALBIERI, R.; ARAÚJO, D. C. E. B.; KOBAYASTI, L.; GIROTTO, L.; MATOS, J. N.; MARANGONI, M. S.; ALMEIDA, W. P.; MEHTA, Y. R. Corynespora leaf blight of cotton in Brazil and its management. American Journal of Plant Sciences, v.5, p. 3805-3811, 2014; 2. HARTMAN, G. L.; RUPE, J. C.; SIKORA, E. J.; DOMIER, L. L.; DAVIS, J. A.; STEFFEY, K. L. Compendium of soybean diseases and pests. In: HARTMAN et al. (Ed.). 5th. ed. The American Phytopathological Society, St. Louis, Mo. Paul, Minn. 201p., 2015).
The progress of target spot in the field is slower compared to Asian rust, but once the disease is established, it is difficult to control. The recommended management strategies for this disease are: rotation with non-host crops, seed treatment, chemical control at correct doses and intervals, and use of resistant cultivars. However, the lack of information on the reaction of soybean cultivars to this disease makes its management difficult, and chemical control is used as one of the most viable alternatives (MEYER, M.; GODOY, C.; VENANCIO, W.; TERAIVIOTO, A. Balanced management. Cultivar Magazine, v.165, p.03-0′7, 2013). In the case of chemical control, the association of multisite fungicides should always be recommended and the management should always begin in a preventive manner. The use of fungicide alone and in a curative manner can eliminate more sensitive populations of the fungus, increasing the frequency of the less sensitive (Teramoto, A.; Meyer, M. C.; Suassuna, N. D.; Cunha, M. G. In vitro sensitivity of Corynespora cassiicola isolated from soybean to fungicides and field chemical control of target spot. Summa Phytopathologica, v.43, n.4, p.281-289, 2017).
The genetic architecture for disease resistance has been established by several associative mapping studies, which point to a monogenic or polygenic character, depending on the type of interaction between pathogen and host. The same studies allowed the identification of DNA polymorphisms at the major effector loci associated with resistance responses. In this context, associative mapping studies are of great use for plant breeding programs by making it possible to map loci and gain knowledge about the position of a gene and its adjacent region. Furthermore, these studies allow the interpretation of possible resistance mechanisms and the prediction of the inheritance of the trait in controlled crosses, in addition to contributing to synteny or comparative mapping analysis and gene cloning (Xuehui Huang and Bin Han, Natural Variations and Genome-Wide Association Studies in Crop Plants, Annual Review of Plant Biology, 65: 531-551, 2014)
Linear mixed models have been developed and applied in associative mapping to reduce the number of false-positive associations caused by population structure and relationship (YU, J. M.; PRESSOIR, G.; BRIGGS, W. H.; VROH BI, I.; YAMASAKI, M.; DOEBLEY, J. F.; MCMULLEN, M. D.; GAUT, B. S.; NIELSEN, D. M.; HOLLAND, J. B.; KRESOVICH, S.; BUCKLER, E. S. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics, v.38, p.203-208, 2006; ZHANG, Z.; ERSOZ, E.; LAI, C.-Q.; TODHUNTER, R. J.; TIWARI, H. K.; GORE, M. A.; BRADBURY, P. J.; YU, J.; ARNETT, D. K.; ORDOVAS, J. M.; BUCKLER, E. S. Mixed linear model approach adapted for genome-wide association studies. Nature Genetics, v.42, p.355-360, 2010.).
Molecular markers have been used in identifying polymorphisms associated with disease resistance. In breeding programs, the marker-assisted selection approach (SAM) has been widely used because it allows the identification of disease resistance or other characteristics already in the early stages and early stages of plant development.
Using SAM, unfavorable alleles can be eliminated or greatly reduced in the first few generations, which allows for the evaluation and selection of an optimal number of plants in the field. In another application, SAM can facilitate the introgression of favorable alleles from resistance sources into elite strains (Shi, Z., Liu, S., Noe, J. et al. SNP identification and marker assay development for high-throughput selection of soybean cyst nematode resistance. BMC Genomics 16, 314 (2015). https://doi.org/10.1186/s12864-015-1531-3).
Resistant cultivars are usually developed by transferring resistance alleles from germplasm, often unadapted, to elite cultivars. Due to the wide genetic variability of fungal species and their constant adaptations, the emergence of new isolates that challenge the genetic resistance already introduced in elite cultivars is common. Thus, it is essential to explore a broad genetic base in germplasm to ensure the longevity of resistance (ALZATE-MARIN, A L.; CERVIGNI, G. D. L.; MOREIRA, M. A; (2005) Marker assisted selection in the development of disease resistant plants, with emphasis on common bean and soybean. Brazilian Phytopathology. v.30, no.4, p.333-342).
In this context, broad genome association studies are of great use for plant breeding programs because they allow the mapping of loci that control qualitative or quantitative traits (QTLs—Quantitative Trait Loci), and for providing knowledge about the position of a gene and its adjacent region. Furthermore, such studies allow the interpretation of evolutionary mechanisms and the prediction of progeny from controlled crossings, as well as contributing to the analysis of synteny or genetic mapping and gene cloning.
A genetic map is a graphical representation of a genome (or a part of a genome, such as a single chromosome) where the distances between reference points on the chromosome are measured by the recombination frequencies between these points. A genetic reference point can be any one of a variety of known polymorphic markers, for example, but not limited to molecular markers, such as SSR-type markers (Simple Sequence Repeats) RFLP-type markers (Restriction Fragment Length Polymorphism) or SNP-type markers (Single nucleotide polymorphism). Also, sSR-type markers can be derived from genomic or expressed nucleic acids (for example, ESTs (Expressed sequence tags)).
Gene-associated markers or QTLs, once mapped and evaluated for influence on phenotypic variation, can be used for SAM, which makes the process of choosing a particular genotype fast and efficient, making it a tool of great contribution to plant breeding (Collins, P J, et al, Marker assisted breeding for disease resistance in Crop Plants. Biotechnologies of Crop Improvement, v3, 41-47, 2018).
Recently, marker-assisted selection has increased the efficiency of traditional soybean breeding programs. Furthermore, the availability of integrated linkage maps of the soybean genome containing increasing densities of public soybean markers has facilitated soybean genetic mapping and SAM applications (Cregan et al. (1999) “An Integrated Genetic Linkage Map of the Soybean Genome” Crop Sci. 39:1464-1490).
SNPs (Single nucleotide polymorphism) are markers that consist of a differentiated shared sequence based on a single nucleotide.
SNPs between homologous DNA fragments and small insertions and deletions (indels), known collectively as single nucleotide polymorphisms (SNPs) have been shown to be the most abundant source of DNA polymorphisms in humans (Kwok P.-Y., Deng Q., Zakeri H., Nickerson D. A., 1996 Increasing the information content of STS-based genome maps: identifying polymorphisms in mapped STSs. Genomics 31: 123-126; Y. L. Zhu, Q. J. Song, D. L. Hyten, C. P. Van Tassell, L. K. Matukumalli, D. R. Grimm, S. M. Hyatt, E. W. Fickus, N. D. Young and P. B. Cregan Genetics Mar. 1, 2003 vol. 163 no. 3 1123-1134).
SNPs are suitable for developing high-throughput and easy-to-automate genotyping methods because most SNPs are biallelic, thus simplifying genotyping approaches and analyses. (Lin C H, Yeakley J M, McDaniel T K, Shen R (2009) Medium- to high-throughput SNP genotyping using VeraCode microbeads. Methods Mol Biol 496: 129-142; Yoon M S, Song Q J, Choi I Y, Specht J E, Hyten D L, et al. (2007) BARCSoySNP23: a panel of 23 selected SNPs for soybean cultivar identification. Theor Appl Genet 114: 885-899). Based on SNP analysis and bioinformatics tools, linkage disequilibrium and haplotype analysis can be quantified. Furthermore, another point to be considered is that the use of molecular markers for assisted improvement, including SNPs, detects genetic information without interference from the environment, in transcribed and non-transcribed regions, bringing the advantage of the possibility of eliminating or reducing the need for time-consuming and laborious phytopathological analyses. The breeder can identify individuals carrying markers linked to the allele of interest, as disease resistance, resulting in time and resource savings (ALZATE-MARIN, A L.; CERVIGNI, G. D. L.; MOREIRA, M. A; (2005) Marker assisted selection in the development of disease resistant plants, with emphasis on common bean and soybean. Brazilian Phytopathology. v.30, no.4, p.333-342).
Currently, the main form of control of target spot is through the use of fungicides. However, fungicides from the carboxamide chemical group have been reducing their control efficiency probably due to the presence of resistant isolates of Corynespora cassiicola to methyl-benzimidazole-carbamate fungicides (MBC) (GODOY, C. V.; UTIAMADA, C. M.; MEYER, M. C.; CAMPOS, H. D.; PIMENTA, C. B.; JACCOUD-FILHO, D. S. Efficiency of fungicides for the control of target spot, Corynespora cassiicola, in the 2013/14 crop: summarized results of cooperative trials. Londrina: Embrapa Soja, 2014. 6p. (Embrapa Soja. Technical Circular 104).
Thus, there is a need to use complementary methods for effective disease management, such as genetic resistance in cultivars. Despite the economic importance of soybeans and the threat of target spot, so far, there are no scientific publications describing sources (genotypes) for disease resistance, much less studies of genetic inheritance, description of resistance genes/locus and neither studies on the location of possible resistance genes to Corynespora cassiicola.
The present invention identifies soybean genome SNPs associated with soybean resistance to the fungus Corynespora cassiicola and discloses a method for identifying and selecting plants resistant to this pathogen. In addition, it also reveals a method for introgression into plants of resistance alleles to the fungus Corynespora cassiicola in soybean.
The advantages of the invention will be evident in the description of the invention provided herein.
In one aspect, the invention relates to a method for identifying, distinguishing and selecting plants of the genus Glycine, resistant or susceptible, to target spot caused by the fungus Corynespora cassiicola which comprises:
In one embodiment of the method, one or more markers are located in the genomic region of the genes or in the ranges of the genes Glyma.17g224300 (SEQ ID NO: 1), Glyma.17g223800 (SEQ ID NO: 2), Glyma.17g223900 (SEQ ID NO: 3), Glyma.17g224000 (SEQ ID NO: 4), Glyma.17g224100 (SEQ ID NO: 5), Glyma.17g224200 (SEQ ID NO: 6), Glyma.17g224500 (SEQ ID NO: 8), Glyma.17g224600 (SEQ ID NO: 9), Glyma.17g224700 (SEQ ID NO: 10), Glyma.17g224800 (SEQ ID NO: 11), Glyma.17g224900 (SEQ ID NO: 12), Glyma.17g225000 (SEQ ID NO: 13), Glyma.17g225100 (SEQ ID NO: 14), Glyma.17g225200 (SEQ ID NO: 15), Glyma.17g225300 (SEQ ID NO: 16), Glyma.17g225400 (SEQ ID NO: 17), Glyma.17g225500 (SEQ ID NO: 18). In a preferred embodiment, markers are located in the genomic region of genes or in the ranges of genes selected from the group consisting of Glyma.17G224300 (SEQ ID NO: 1), Glyma.17G224400 (SEQ ID NO: 7) and Glyma.17G224500 (SEQ ID NO: 8) and even more preferentially, said marker is a SNP selected from the group consisting of ss715627273, ss715627288, ss715627282, ss715627290, ss715627293, ss715627289, ss715627296, ss715627297, ss715627265, ss715627264, ss715627310, ss715627276, ss715627274, ss715627280 and ss715627279, or combinations thereof, or any other molecular marker in a range up to 5 cM or 1 Mbp from said group, even more preferably said marker is a SNP selected from the group consisting of ss715627288, ss715627273 and ss715627282, or combinations thereof, or any other molecular marker within 5 cM or 1 Mbp of that group.
In one form of embodiment, the method comprises identifying the markers by any amplification methodologies, or by use of probes, or by any type of sequencing (e.g. tGBS or directed sequencing).
In another form of embodiment, the method the plant of the genus Glycine is Glycine max.
In another aspect, the invention relates to a method of introgressing into plants of the genus Glycine alleles of resistance to target spot caused by the fungus Corynespora cassiicola, comprising:
(a) Crossing parents of plants of the genus Glycine identified by the method as defined in any of claims 1 to 6 with other parents lacking said resistance;
(b) Select progenies possessing markers associated with increased resistance or reduced susceptibility to Corynespora cassiicola by the method as defined in claim 1; e
(c) Backcross in one or more cycles the selected progenies with the recurrent genitor to develop new progenies.
In a further aspect, the invention relates to a nucleic acid molecule capable of hybridizing with any of the SEQ ID NO: 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, or 33, or subsequences thereof having at least 15 consecutive nucleotides, or sequences with at least 90% sequence identity. [0042] In a further aspect, the invention also relates to the use of a nucleic acid molecule as defined above in the methods of the invention.
In a further aspect, included in the invention is a detection kit comprising at least two nucleic acid molecules as defined above.
In a further aspect, the invention relates to a method for genotyping target Glycine plants resistant to target spot, comprising analyzing the presence in the DNA of the target plant for one or more markers associated with target spot resistance, selected from the group consisting of ss715627273, ss715627288, ss715627282, ss715627290, ss715627293, ss715627289, ss715627296, ss715627297, ss715627265, ss715627264, ss715627310, ss715627276, ss715627274, ss715627280 and ss715627279, or combinations thereof.
In a further aspect, the invention relates to a target spot resistant Glycine plant obtained by an introgression method as defined above.
Unless defined differently, all technical and scientific terms used herein have the same meaning as understood by a person skilled in the subject matter to which the invention pertains. The terminology used in describing the invention is intended to describe particular embodiments only, and does not intend to limit the scope of the teachings. Unless otherwise stated, all numbers expressing quantities, percentages and proportions, and other numerical values used in the descriptive report and claims, should be understood as being modified in all cases by the term “about”. Thus, unless otherwise stated, the numerical parameters shown in the descriptive report and in the claims are approximations that may vary, depending on the properties to be obtained.
The practice of the present invention will employ, unless otherwise indicated, conventional methods of chemistry, biochemistry, and recombinant DNA techniques, within the skill of the art. Such techniques are explained fully in the literature. Take a look, e.g. Fundamental Virology, 2nd Edition, vols. I & II (B. N. Fields and D. M. Knipe, eds.); T. E. Creighton, Proteins: Structures and Molecular Properties (W.H. Freeman and Company, 1993); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current edition); Sambrook, et al, Molecular Cloning: A Laboratory Manual (2nd Edition, 1989) Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.).
The following terms are defined, and may be used within the scope of the present invention in order to facilitate general understanding.
Gene: the basic physical and functional unit of heredity, being composed of DNA and capable of being transcribed into RNA. Some genes act as instructions for polypeptides;
QTL: quantitative Trait Loci, which refers to a quantitative trait locus. It is a locus that correlates with the variation of a quantitative trait in the phenotype of a population of organisms;
Locus: refers to a position or location that a particular gene or any other genetic element or factor contributing to a trait occupies in a chromosome of a given species.
Allele: variant forms of a given gene, which occupy the same region on homologous chromosomes, affecting the same trait, but in a different way. The same gene can have several alleles;
Chromosome: is an organized package of DNA found in the nucleus of the cell that can contain several genes;
Genotype: refer to the alleles, or variant forms of a gene, that are understood by an organism;
Genetic map: It is a graphical representation of a genome or a part of a genome, such as a single chromosome. It is a description of the genetic linkage relationships between loci on one or more chromosomes in a given species. For each genetic map, the distances between loci are measured by the recombination frequencies between them. Recombination between loci can be detected using a variety of markers;
Linkage disequilibrium: is defined in the context of the invention as the relative frequency of gamete types in a population of many individuals in a single generation. If the frequency of an allele A is p, a is p′, B is q and b is q′, then the expected frequency (without linkage disequilibrium) of genotype AB is pq, Ab is pq′, aB is p′q and ab is p′q′. Any deviation from the expected frequency is called a linkage disequilibrium Two loci are said to be “genetically linked” when they are in linkage disequilibrium.
Genetic linkage: refers to a trait association in inheritance due to the location of genes in close proximity on the same chromosome, measured by the percentage of recombination between loci (centi-Morgan, cM). The distances between loci are usually measured by the recombination frequency between loci on the same chromosome. The further apart two loci are from each other, the more likely it is that recombination will occur between them. Conversely, if two loci are close together, a recombination is less likely to happen between them. As a rule, 1 centi-Morgan is equal to 1% recombination between loci. When a QTL can be indicated by multiple markers, the genetic distance between markers at the ends (flankers) is indicative of the size of the QTL. For purposes of this invention, “genetically linked to a marker” can be considered that the marker is not more than 10 cM apart, preferably 5 cM, more preferably 2 cM and even more preferably 1 cM of the genetic determinant that confers resistance.
Molecular markers: are DNA fragments that are associated with a specific region of the genome, which can be monitored. They refer, in other words, to indicators that are used in methods to visualize differences in nucleic acid sequences. Marker molecules can take the form of short DNA sequences, as a sequence involving a single nucleotide polymorphism, where a single base pair change occurs. They can also take the form of longer DNA sequences, such as microsatellites, with 10 to 60 base pairs.
Germplasm: refers to the totality of genotypes in a population. It can also refer to plant material, for example a group of plants that are repositories of several alleles.
Resistance: refers to the ability of a plant to restrict the growth and development of a specific pathogen and/or the resulting signal/symptom, when compared to susceptible plants under similar environmental conditions and pathogen pressure. Includes both partial resistance and full resistance to infection (for example, infection by a pathogen that causes target spotting). A resistant plant will show no or few symptoms of the disease. A susceptible plant can either be a non-resistant plant or have lower levels of resistance to infection compared to a resistant plant.
Introgression: refers to natural or artificial processes in which genomic regions of one species, variety or cultivar are transferred to the genome of another species, variety or cultivar by crossing over. The process can optionally be completed by backcrossing between an individual and its recurrent parent.
Crossover: refers to the fusion of gametes via pollination to produce an offspring, including both self-fecundation (when pollen and ovule are from the same plant) or cross-fertilization (when pollen and egg are from different plants).
Marker assisted selection (SAM): is a process by which phenotypes are selected on the basis of molecular genotypes. Marker assisted selection includes the use of molecular markers to identify plants or populations that possess the genotype of interest in breeding programs.
PCR (polymerase chain reaction): refers to a method of producing relatively large quantities of specific regions of DNA, allowing various analyses based on these regions.
PCR Initiators (“primers”): relatively small fragments of single-stranded DNA used in the PCR amplification of specific regions of DNA.
Probe: refers to molecules or atoms that are able to recognize and bind to a specific target molecule, allowing detection of the target molecule. In particular, for purposes of this invention, “probe” refers to a sequence of labeled DNA or RNA that can be used to detect and/or quantify a complementary sequence by molecular hybridization.
The following detailed description refers to genetic markers and related methods for identification of such markers, genotyping of plants of the genus Glycine, and methods for marker-assisted breeding of these plants.
The loci pertaining to the present invention comprise bounded genomic sequences comprising one or more molecular markers, including a polymorphism identified in Table 5, Table 7 or Table 8, as shown in the SEQ ID NOS: 19 a 33, or is adjacent to one or more of these polymorphisms.
In one aspect of the invention, isolated nucleic acid sequences are provided (oligonucleotides) that are capable of hybridizing to the polymorphic loci of the present invention. In certain embodiments, for example, that come from initiators, such molecules comprise at least 15 nucleotide bases. Molecules useful as primers can hybridize under high-stringency conditions to one or more strands of a DNA segment at a polymorphic locus of the invention. Primers for DNA amplification are provided in pairs, i.e., forward primers (or F)” or “reverse (or R)”. One primer will be complementary to one DNA strand at the locus and the other primer will be complementary to the other DNA strand at the locus, i.e. preferentially, sequences that are at least 90% included, more preferably 95%, or 100% identical to a sequence as described in SEQ ID Nos: 19 to 48, or to sub-sequences of at least 15 nucleotides. Furthermore, it is understood that such primers can hybridize to a sequence at the locus that is distant from the polymorphism, for example, at least 5, 10, 20, 50, 100, 200, 500 or even about 1,000,000 nucleotides away from the polymorphism. The design of an initiator of the invention will depend on factors well known in the art, for example, avoiding a repetitive sequence.
In addition to this, it should be remembered here that, although preferred functions may be mentioned in relation to some oligonucleotides, it is obvious that a given oligonucleotide may assume several functions, and may be used in different forms in accordance with the present invention. As the person skilled in the art knows, in some situations, a primer can be used as a probe and vice versa, as well as being applicable in hybridization procedures, detection etc. Thus, it is noted that products according to the present invention, especially, inter alia, oligonucleotides, are not limited to the uses shown here, but rather, the uses should be interpreted broadly, independent of the use indicated here. Furthermore, when an oligonucleotide is described as being useful as a probe that can bind to an amplicon, the subject matter expert also understands that the complementary sequence of this oligonucleotide is equally useful as a probe to bind to the same amplicon. The same is true for the sequences described as useful as primers. Additionally, It is also obvious that any initiator suitable for a multiplex protocol can also, within the meaning and scope of the present invention, be used in a singleplex protocol. The same applies to a suitable primer for a real-time PCR protocol, that can be used in a conventional PCR protocol, within the meaning of the present invention.
The person skilled in the art, in this regard, understands that the oligonucleotides of the present invention, i.e., the primers and probes, need not be completely complementary to a part of the target sequence. The primer can exhibit sufficient complementarity to hybridize with the target sequence and perform the intrinsic functions of a primer. The same applies to a probe, that is, a probe can exhibit sufficient complementarity to hybridize with the target sequence and perform the intrinsic functions of a probe. Therefore, a primer or a probe in one embodiment need not be completely complementary to the target sequence. In one embodiment, the primer or probe can hybridize or ring with a part of the target to form a double strand. The conditions for hybridization of a nucleic acid are described by Joseph Sambrook et al. Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2001) and Haymes et al. Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985).
In another aspect of the invention, is the kit comprising at least two primers as described above.
Another aspect of the nucleic acid molecules of the invention are the hybridization probes. In one embodiment, such probes are oligonucleotides comprising at least 15 nucleotide bases and a detectable marker. The purpose of such molecules is to hybridize, for example, under high-stringency conditions, to a DNA strand in a segment of nucleotide bases that includes or is adjacent to a polymorphism of interest. Such oligonucleotides are preferentially at least 90%, more preferentially 95% identical to the sequence of a segment of Glycine DNA at a polymorphic locus, or to a fragment of it comprising at least 15 nucleotide bases. But specifically, the polymorphic locus is selected from the group consisting of SEQ ID NO: 19-33.
The detectable marker can be a radioactive element or a dye. In preferred aspects, the hybridization probe still comprises a fluorescent marker and a quencher, for example, for use in hybridization assays such as Taqman® assays, available from AB Biosystems. In this case, the detectable marker and the quencher are located at opposite ends. For SNP detection assays, it is useful to provide such markers and quenchers in pairs, for example, where each molecule for detection of a polymorphism has a distinct fluorescent marker and quencher, different for each polymorphism.
More specifically, with respect to the TaqMan™ probe, an oligonucleotide, whose 5′ terminal region is modified with a fluorophore and the 3′ terminal region is modified with a quencher, is added to the PCR reaction. It is also understood that it is possible to bind the fluorophore in the 3′ terminal region and the quencher in the 5′ terminal region. The reaction products are detected by fluorescence generated after the 5′ exonuclease activity->3′ of DNA polymerase. The fluorophores, which refer to fluorescent compounds that emit light with the excitation by light having a shorter wavelength than the light that is emitted, can be, but are not limited to, FAM, TAMRA, VIC, JOE, TET, HEX, ROX, RED610, RED670, NED, Cy3, Cy5, and Texas Red. The quenchers can be, but are not limited to, 6-TAMRA, BHQ-1,2,3 and MGB-NFQ. The choice of the fluorophore-quencher pair can be made so that the excitation spectrum of the quencher has an overlap with the emission spectrum of the fluorophore. One example is the FAM-TAMRA pair, FAM-MGB, VIC-MGB and so on. An expert on the subject will know how to recognize other appropriate pairs.
It is not necessary that there be complete complementarity between the sequences, as long as the differences do not completely impair the ability of the molecules to form a double-stranded structure. Therefore, for a nucleic acid molecule to be able to serve as a primer or probe, it must be sufficiently complementary in sequence to allow the formation of a double-stranded structure under the hybridization conditions used.
In a preferred embodiment, a nucleic acid molecule will hybridize to a segment of Glycine DNA shown in SEQ ID NO: 1 to 33.
SNPs are the result of a variation in sequence and new polymorphisms can be detected by sequencing genomic DNA or cDNA molecules.
In one aspect, polymorphisms in a genome can be determined by comparing the cDNA sequence of different strains. Although the detection of polymorphisms by cDNA sequence comparison is relatively convenient, the evaluation of the cDNA sequence does not allow information about the position of the introns in the corresponding genomic DNA. In addition, polymorphisms in the non-coding sequence cannot be identified from the cDNA. This can be a disadvantage, for example when using cDNA-derived polymorphisms as markers for genomic DNA genotyping. More efficient genotyping assays can be designed if the scope of polymorphisms includes those present in the single non-coding sequence.
Genomic DNA sequencing is more useful than cDNA for identifying and detecting polymorphisms. Polymorphisms in a genome can be determined by comparing the genomic DNA sequence of different strains. However, the genomic DNA of higher eukaryotes usually contains a large fraction of repetitive sequence and transposons. Genomic DNA can be sequenced more efficiently if the coding/unique fraction is enriched by subtracting or eliminating repetitive sequences.
There are several well-known strategies in the technique that can be employed to enrich the sample in coding sequences/unique sequences. Examples of these include the use of enzymes that are sensitive to cytosine methylation, the use of the McrBC endonuclease to cleave the repetitive sequence and the printing of microarrays of genomic libraries that are then hybridized with repetitive sequence probes.
A method for reducing repetitive DNA comprises constructing reduced representation libraries by separating the repetitive sequence of genomic DNA fragments from at least two varieties of a species, fractioning the separated genomic DNA fragments based on nucleotide sequence size, and comparing the sequence of fragments in a fraction to determine polymorphisms. More particularly, these methods for identifying polymorphisms in genomic DNA comprise digesting the total genomic DNA of at least two variants of a eukaryotic species with a methylation-sensitive endonuclease to provide a pool of digested DNA fragments. The average nucleotide length of the fragments is shorter for DNA regions characterized by a lower percentage of 5-methylated cytosine. Such fragments are separable, e.g. by gel electrophoresis, on the basis of nucleotide length. A fraction of DNA with shorter than average nucleotide length is separated from the digested DNA pool. DNA sequences in a fraction are compared to identify polymorphisms. Compared to the coding sequence, The repetitive sequence is most likely to comprise 5-methylated cytosine, e.g. in the -CG- and -CNG-sequence segments. In one mode of the method, genomic DNA from at least two different inbred varieties of a Glycine is digested with a methylation-sensitive endonuclease selected from the group consisting of enzymes such as Aci I, Apa I, Age I, Bsr FI, BssHII, Eag I, Eae I, Hha I, HinP II, Hpa II, Msp I, MspMII, Nar I, Not I, Pst I, Pvu I, Sac II, Sma I, Stu I and Xho I to provide a physically separated pool of digested DNA, for example by gel electrophoresis. Fractions of comparable size of DNA are obtained from the digested DNA of each of the aforementioned enzymes. DNA molecules from the comparable fractions are inserted into vectors or isolated to construct reduced representation libraries of genomic DNA clones that are sequenced and compared to identify polymorphisms.
Another method for enrichment of coding sequences/single sequence consists of constructing reduced representation libraries (using methylation-sensitive enzymes or not) by printing microarrays of the library on a nylon membrane, followed by hybridization with probes made from repetitive elements known to be present in the library. The repetitive sequence elements are identified and the library is reorganized by choosing only the negative clones. Such methods provide reduced representation genomic DNA segments of a plant that has genomic DNA comprising DNA regions with relatively higher levels of methylated cytosine and DNA regions with relatively lower levels of methylated cytosine.
In addition, microarrays can be used (DNA chip) of soy available in the technique, such as SoySNP50K (Song Q, Hyten DL, Jia G, Quigley CV, Fickus EW, Nelson RL, et al. (2013) Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean. PLoS ONE 8(1): e54985). This panel has been widely exploited for soybean genetic studies, allowing the identification of associations between SNPs and disease resistance, among other traits.
Polymorphisms in DNA sequences can be detected by a variety of methods well known in the art. DNA samples include, but are not limited to, the genotypes shown in Table 1.
For example, methods to detect SNPs and Indels include single base extension methods (SBE). Examples of SBE methods include, but are not limited to, those disclosed in U.S. Pat. Nos. 6,004,744; 6,013,431; 5,595,890; 5,762,876; and 5,945,283. SBE methods are based on extending a nucleotide primer that is immediately adjacent to a polymorphism to incorporate a detectable nucleotide residue after primer extension. In certain embodiments, the SBE method uses three synthetic oligonucleotides. Two of the oligonucleotides serve as PCR primers and are complementary to the sequence of the soybean genomic DNA site that flanks a region containing the polymorphism to be tested. After amplification of the soybean genome region containing the polymorphism, the PCR product is mixed with the third oligonucleotide (called the extension initiator), which is designed to hybridize to the amplified DNA immediately adjacent to the polymorphism in the presence of DNA polymerase and two differentially labeled dideoxynucleoside triphosphates. If polymorphism is present in the template, one of the labeled didesoxynucleosidetriphosphates can be added to the primer at a single base chain length. The allele present is then inferred by determining which of the two differential markers was added to the extension primer. Homozygous samples will result in the incorporation of only one of the two marked bases e, therefore, only one of the two markers will be detected. Heterozygous samples have both alleles present and therefore direct the incorporation of both markers (on different molecules of the extension primer) and, therefore, both markers will be detected.
In a preferred method for detecting polymorphisms, SNPs and Indels can be detected by methods disclosed in U.S. Pat. Nos. 5,210,015; 5,876,930; and 6,030,787 in which an oligonucleotide probe is used with a fluorescent dye at 5′ and a quencher at 3′ from the probe. When the probe is intact, the proximity of the fluorescent dye to the quencher results in suppression of the fluorescence of the fluorescent dye, e.g. by Forster-type energy transfer. During PCR, the forward and reverse primers hybridize to a specific sequence of the target DNA that flanks a polymorphism while the hybridization probe hybridizes to the polymorphism-containing sequence in the amplified PCR product. In the subsequent PCR cycle, DNA polymerase with 5→3′ exonuclease activity breaks the probe and separates the fluorescent dye from the quencher, resulting in increased fluorescence of the fluorescent dye.
A useful test is available from AB Biosystems as the Taqman® test, which employs four synthetic oligonucleotides in a single reaction that simultaneously amplifies soybean genomic DNA, discriminates the alleles present, and directly provides a signal for discrimination and detection. Two of the four oligonucleotides serve as PCR primers and generate a PCR product that encompasses the polymorphism to be detected. Two others are allele-specific fluorescence resonance energy transfer probes (FRET). In the trial, two FRET probes with different fluorescent reporter dyes are used, where a single dye is incorporated into an oligonucleotide that can ring with high specificity with only one of the two alleles. Useful reporter dyes include, among others, 6-carboxy-4,7,2 ‘,7’-tetrachlorofluorecein (TET)2′-chloro-7′-phenyl-1,4-dichloro-6-carboxyfluorescein (VIC) and 6-carboxyfluorescein phosphoramidite (FAM). A useful inhibitor is 6-carboxy-N, N, N′, N′-tetramethyl-rhodamine (TAMRA). Also, the 3′ end of each FRET probe is chemically blocked so that it cannot act as a PCR primer. A third fluorophore used as a passive reference is also present, for example rhodamine X (ROX) to help with subsequent normalization of the relevant fluorescence values (correcting volumetric errors in the reaction set-up). The amplification of the genomic DNA is started. During each PCR cycle, FRET probes bind in an allele-specific manner to the templates of DNA molecules. The ringed FRET probes (but not the non-ringed ones) are degraded by TAQ DNA polymerase as the enzyme meets the 5′ end of the ringed probe, thereby releasing the fluorophore from the vicinity of its quencher. After PCR, the fluorescence of each of the two fluorescents, as well as the passive reference, is determined fluorometrically. The normalized fluorescence intensity for each of the two dyes will be proportional to the amounts of each allele initially present in the sample e, therefore, the genotype of the sample can be inferred.
PCR primers are designed (a) to have a size of about 15 to 25 bases and sequences that hybridize at the polymorphic locus, (b) has a melting temperature in the range 57° C. to 60° C., corresponding to a ringing temperature of 52° C. to 55° C., (c) produces a product that includes the polymorphic site and typically has a size ranging from 75 to 250 base pairs. However, there are PCR techniques that allow amplification of larger fragments of 1000 or more base pairs. Primers are preferably located at the locus so that the polymorphic site is at least 1 base away from the 3′ end of each primer. However, it is understood that PCR primers can be up to 1000 base pairs or more away from the polymorphism and still provide amplification of a corresponding DNA fragment containing the polymorphism that can be used in soybean genotyping assays.
Directed sequencing techniques can be applied for polymorphism detection. The development of increasingly inexpensive and rapid sequencing technologies has led to the facilitation of large-scale detection of polymorphisms in various model and non-model plant species (Kumar S, Banks TW, Cloutier S. SNP Discovery through Next-Generation Sequencing and Its Applications. International journal of plant genomics vol. 2012 (2012): 831460). The development and improvement of freely available, open-source bioinformatics software has accelerated the discovery of SNPs. It is worth noting that the facilitation of whole genome sequencing has led to the discovery of several million SNPs in different organisms.
Polymorphisms at the loci of this invention can be used to identify associations of markers and target-spot resistance that are inferred from statistical analysis of genotypic and phenotypic data from members of a population
Various types of statistical analyses can be used to infer the association of markers and resistance to target spot from phenotype/genotype data, but a basic idea is to detect molecular markers, i.e., polymorphisms, for which alternative genotypes have significantly different average phenotypes. For example, if a given marker locus “A” has three alternative genotypes (AA, Aa and aa) and if these three classes of individuals have significantly different phenotypes, then we will infer that locus “A” is associated with the desired characteristic. The significance of differences in phenotype can be tested by various types of standard statistical tests, such as linear regression of genotypes of molecular markers in the phenotype or analysis of variance (ANOVA). The statistical software packages available on the market, commonly used to do this type of analysis include linear mixed models (MLM) developed by the MVP packages (YIN et al., 2018) GAPIT (TANG et al., 2016) and FarmCPU (LIU et al., 2016) with the Emma matrix algorithms (MVP) and VanRaden (GAPIT and FarmCPU). When many molecular markers are tested simultaneously, an adjustment, such as the Bonferroni correction, is made to the level of significance necessary to declare an association.
Often, the goal of an association study is not simply to detect associations of markers and desired traits, but to estimate the locations of genes that affect the trait directly in relation to the locations of the markers. In a simple approach to this goal, a comparison is made between marker locations of the magnitude of the difference between alternative genotypes or the level of significance of this difference. It is inferred that the trait genes are located closer to the marker(s) that have the largest associated genotypic difference. The genetic linkage of additional marker molecules can be established by a genetic mapping model, as, without limitation, the flanking marker model reported by Lander et al. (Lander et al. 1989 Genetics, 121: 185-199) and interval mapping, based on maximum likelihood methods, and implemented in the software package MAPMAKER/QTL (Lincoln and Lander, mapping Genes Controlling Quantitative Traits Using MAPMAKER/QTL, Whitehead Institute for Biomedical Research, Massachusetts, (1990).) Additional software includes Qgene, Version 2.23 (1996) Department of Plant Breeding and Biometrics, 266 Emerson Hall, Cornell University, Ithaca, N.Y.).
A maximum likelihood estimate is calculated (MV) for the presence of a marker, together with a MV that assumes no QTL effect, to avoid false positives. A log 10 of an odds ratio (“odds ratio” or LOD) is then calculated as: LOD=log 10 (MV for the presence of a QTL/MV without QTL bound). The LOD score essentially indicates how much more likely the data is to arise assuming the presence of a QTL versus in its absence. The LOD limit value to avoid a false positive with a given confidence, for example 95%, depends on the number of markers and the length of the genome.
For the development of the present invention, a set of genotypes was used (as per table 1) which were inoculated with isolates of Corynespora cassiicola that showed virulence considered high and intermediate (table 2). These genotypes were evaluated for resistance to target spot, resistant genotypes were selected as described in Table 4. Included within the scope and for the purposes of the present invention are all genotypes considered resistant and highly resistant, which can be used in breeding programs as sources of resistance to target spot. More preferentially are the genotypes considered highly resistant, selected from the group consisting of PI 71506, PI 153230, PI 567310B, PI 587802, PI 587860, PI 407999-1 and PI 548984.
In another aspect of the invention, the polymorphism at the sites of the invention is mapped on the soybean genome as a physical map of the soybean genome comprising positions on the map of two or more polymorphisms, as indicated in Tables 5, 7 and 8.
More specifically, the present invention describes the identification of genetic markers (SNPs or combinations of two or more SNPs) that can be used to identify alleles associated with resistance or tolerance to target spot in plants. More specifically, markers are present in a 110-kpb interval on chromosome 17 of G. max, associated with target spot resistance.
When a locus has been located in close proximity to molecular markers, these markers can be used to select improved aspects of the trait without the need for phenotypic analysis in each selection cycle. In marker assisted breeding and marker assisted selection, the associations between loci and markers are initially established through mapping analysis. In the same process, it is determined which alleles of the molecular markers are linked to favorable alleles of the locus/loci being studied. Subsequently, alleles of the markers associated with favorable locus/loci alleles are selected in the population. This procedure will improve the “value” of the trait to be selected, in this case resistance to the target spot, provided there is a sufficiently close link between markers and the locus involved in resistance. The degree of linkage required depends on the number of generations of selection because, in each generation, there is an opportunity to break the association by recombination.
There are a few ways to quantify the level of efficiency of molecular markers for selecting genotypes of interest. One of the main ways is in the use of accuracy calculations and type I and II error rates. Accuracy is a measure that shows how effective a marker is in detecting resistant and susceptible individuals. This calculation is used as a way to accurately indicate how close a genotypic result is to the phenotypic data for the trait under study. High accuracy values indicate high efficiency in the selection of individuals using molecular markers. Type I and II error rates, on the other hand, are measures that quantify possible flaws in the correlation of phenotypic and genotypic data. Type I errors, also called false-positive, are results in which the genotypic data indicate the presence of a resistance allele, while the phenotypic data suggest that the samples analyzed are susceptible to the trait. In contrast, type II, or false-negative errors, demonstrate the genotypic presence of susceptible alleles in samples with disease resistance phenotypes. Low Type I and II error values decrease the probability of eliminating resistant and susceptible materials, respectively, by using molecular markers (Maldonado dos Santos, J. V., Ferreira, E. G. C., Passianotto, A. L. d. L. et al (2019). Association mapping of a locus that confers southern stem canker resistance in soybean and SNP marker development. BMC Genomics 20, 798; Bruna Bley Brumer. Morphological, molecular and pathogenic characterization of Diaporthe aspalathi isolates and validation of SNPs markers associated with stem canker resistance in soybean. Master's Dissertation. Universidade Estadual de Londrina-UEL-PR-2016; Adriano Consoni Camolese. Phytophthora root rot in soybean: Identification of a recessive resistance gene and validation of SNPs for use in molecular marker assisted selection. Master's Dissertation. State University of Londrina-UEL-PR-2015).
Associations between specific marker alleles and favorable alleles can also be used to predict which types of progeny may segregate from a given cross. This prediction can allow the selection of appropriate parents for generation populations from which new combinations of favorable alleles are assembled to produce a new pure lineage. For example, if strain A has marker alleles previously associated with favorable alleles at locations 1, 20, and 31, while strain B has marker alleles associated with favorable effects at locations 15, 27, and 29, a new strain can be developed by crossing A×B and selecting progenies that have favorable alleles at all 6 loci.
Molecular markers are used to accelerate the introgression of genes or chromosomal segments into new genetic backgrounds (that is, in a diverse range of germplasm). Simple introgression involves crossing a donor line of a new trait to an elite line and, then select and backcross F1 plants repeatedly to the elite parent (recurrent) while selecting the maintenance of the gene of interest/chromosome segment. Over several generations of backcrossing, the genetic background of the original line is gradually replaced by the genetic background of the elite through recombination and segregation. This process can be accelerated by selecting the alleles of the recurrent parent through molecular markers. This approach is known as marker-assisted backcrossing.
Finally, it is possible to establish a “fingerprint” or fingerprint of a lineage, as the combination of alleles in a set of two or more marker loci. High density fingerprints can be used to establish and trace the identity of germplasm, which has utility in establishing a database of trait-marker associations to benefit a soybean breeding program, as well as protecting the intellectual property of the germplasm.
Thus, according to a first aspect of the invention, the present invention provides methods for identifying and selecting plants resistant to a fungal disease comprising the steps of:
Preferably, the method is directed toward identification of plants of the genus Glycine, more specifically plants of the species Glycine max.
Preferentially, resistance to the fungus is resistance to Corynespora cassiicola, the etiologic agent of target spot.
Obtaining a nucleic acid sample from a plant can be accomplished by standard DNA isolation methods well known in the art, as described supra.
Analysis for the presence of markers can be done by PCR, probes, or sequencing. In one form of embodiment, the nucleic acid molecules (PCR primers and probes) comprise sequences from SEQ ID Nos: 19-48, or sub-sequences of these that are at least 15 nucleotides in length. Also included in the scope of the invention are sequences that are at least 90% identical to SEQ ID Nos: 19-48 or their sub-sequences.
With respect to fungal disease, the method of the present invention preferably relates to the fungus Corynespora cassiicola, which causes the disease called Target Spot, and resistance or tolerance to said disease is conferred by a locus or QTL.
Preferably, the marker is a SNP-type marker (Single nucleotide polymorphism).
A marker corresponds to an amplification product generated by the amplification of a nucleic acid from Glycine sp., for example by polymerase chain reaction (PCR) using two primers. In this context, “molecular marker” refers to an indicator that is used in methods to visualize differences in characteristics of nucleic acid sequences (polymorphisms). A molecular marker “linked to” or “associated with” a gene capable of providing resistance to target spot can therefore refer to SNPs.
Furthermore, the markers can also be detected by using probes or targeted sequencing (tGBS).
Detection of a molecular marker may, in some embodiments, comprise the use of one or more primer sets that can be used to produce one or more amplification products. In a first embodiment, such primer sets can hybridize to a part of the nucleotide sequences as shown in SEQ ID Nos: 19 a 33 (Table 10) or sub-sequences of these that are at least 15 nucleotides in length. Still, they are included in the scope of the invention, sequences that are at least 90% identical to SEQ ID Nos: 19-48 or its subsequences.
In another embodiment of the present invention, the markers are located in the genes or ranges of the Glyma.17g224300 genes (SEQ ID NO: 1), Glyma.17g223800 (SEQ ID NO: 2), Glyma.17g223900 (SEQ ID NO: 3), Glyma.17g224000 (SEQ ID NO: 4), Glyma.17g224100 (SEQ ID NO: 5), Glyma.17g224200 (SEQ ID NO: 6), Glyma.17g224400 (SEQ ID NO: 7), Glyma.17g224500 (SEQ ID NO: 8), Glyma.17g224600 (SEQ ID NO: 9), Glyma.17g224700 (SEQ ID NO: 10), Glyma.17g224800 (SEQ ID NO: 11), Glyma.17g224900 (SEQ ID NO: 12), Glyma.17g225000 (SEQ ID NO: 13), Glyma.17g225100 (SEQ ID NO: 14), Glyma.17g225200 (SEQ ID NO: 15), Glyma.17g225300 (SEQ ID NO: 16), Glyma.17g225400 (SEQ ID NO: 17), Glyma.17g225500 (SEQ ID NO: 18) present on chromosome 17 of Glycine max.
In a third embodiment of the present invention, markers are preferably located in the adjacent regions of the selected genes of the group consisting of Glyma.17g224300 (SEQ ID NO: 1), Glyma.17g224400 (SEQ ID NO: 7) and Glyma.17g224500 (SEQ ID NO: 8) present on chromosome 17 of Glycine max.
In a fourth embodiment of the present invention, the markers are SNPs selected from the group consisting of ss715627273, ss715627288, ss715627282, ss715627290, ss715627293, ss715627289, ss715627296, ss715627297, ss715627265, ss715627264, ss715627310, ss715627276, ss715627274, ss715627280 and ss715627279, or combinations thereof.
In a fifth embodiment of the present invention, the SNPs are preferably ss715627288, ss715627273 and ss715627282.
In a sixth embodiment of the present invention, the plant is preferably of the species Glycine max.
In a further aspect, the present invention relates to a method of introgressing into plants of the genus Glycine alleles of resistance to target spot caused by the fungus Corynespora cassiicola, comprising the steps of:
In a further aspect, the present invention relates to a method for genotyping target Glycine plants resistant to target spot, comprising analyzing the presence in the DNA of the target plant for one or more markers associated with resistance to target spot, selected from the group consisting of ss715627273, ss715627288, ss715627282, ss715627290, ss715627293, ss715627289, ss715627296, ss715627297, ss715627265, ss715627264, ss715627310, ss715627276, ss715627274, ss715627280 and ss715627279, or combinations thereof. In a further aspect, the invention comprises commercial or customized kits comprising such nucleic acid molecules.
In a further aspect, the invention comprises a method for genotyping target Glycine plants resistant to target spot, comprising analyzing the presence in the DNA of the target plant for one or more markers associated with target spot resistance, selected from the group consisting of ss715627273, ss715627288, ss715627282, ss715627290, ss715627293, ss715627289, ss715627296, ss715627297, ss715627265, ss715627264, ss715627310, ss715627276, ss715627274, ss715627280 and ss715627279, or combinations thereof.
Preferably, the present invention relates to methods for producing a commercial variety resistant to Corynespora cassiicola from susceptible varieties, comprising performing the above introgression method using conventional breeding techniques. The present invention is further described by the examples below, which are intended only to exemplify one of the innumerable ways of carrying out the invention, however, without limiting its scope.
A total of 520 soybean genotypes were evaluated in this study. These are Glycine max accessions from various centers of origin, with most originating from Asia (62.5%) and America (23.4%). The list of samples used in this study can be seen in Table 1.
Seventeen isolates of Corynespora cassiicola were selected from the Holder's mycoteca that showed virulence considered high and intermediate, obtained in studies conducted on the Holder's premises. The isolates are described in Table 2.
Corynespora cassiicola isolates used in this work.
1Isolates preserved at Castelani;
Pure cultures of the fungi were obtained on BDA medium (potato-dextrose-agar) for 7 days. A repetition of each isolate was taken from the plate and mixed in a container, adding 100 mL of water, and proceeding with grinding in a blender for about 30 s. The solution obtained was filtered through a 20-mesh sieve. The residue that was retained on the sieve was discarded, and an aliquot was taken from the conidia suspension mix to count the spores. The final spore count of the suspension was 1750 conidia/mL.
The materials selected for this study were planted in the greenhouse to evaluate disease resistance, with a total of four samples per genotype. Two months after planting, the genotypes were inoculated with the bulk of the 17 Corynespora cassiicola isolates. Initially, twenty liters of spore suspension were prepared and sprayed with the aid of a backpack pump over the leaf area of the plants. Two inoculations were carried out, with an interval of 5 days. The inoculations were performed in the late afternoon, with leaf wetting on the five days following inoculations.
As a way to evaluate the disease response, two assessments were performed. First the average severity score was evaluated. For this, we used the diagrammatic scale developed by Soares and collaborators (2009) (SOARES, R. M.; GODOY, C. V.; OLIVEIRA, M. C. N. Diagrammatic scale for assessing the severity of target spot of soybean. Tropical Plant Pathology, v.34, p. 333-338, 2009) with some modifications (
After the two evaluations, the genotypes with Highly Resistant/Immune reaction were selected (AR) or Resistance (R) the target spot and with lesion size ranging from 0 to 2 mm for a new planting. The purpose of this new evaluation was to confirm the resistance or whether there was any leakage during the test. To that end, ten seeds of each genotype were planted in 8 L pots containing soil:sand, in a 3:1 ratio. As susceptible standard we used the cultivar NA 5909 and some genotypes with Susceptible reaction (S) or Highly Susceptible (AS) of the first trial. Again a spore suspension was prepared with spore count/mL and proceeded with spraying/first inoculation, in the greenhouse at the V2 stage.
The second inoculation occurred 4 days after the first inoculation. Inoculations were performed in the late afternoon, and leaf wetting was maintained for five days after inoculations. The evaluation was performed 20 days after the last inoculation, by determining the average severity score and lesion size (Table 3).
A total of 83 genotypes showed resistance to the action of the pathogen. Of these, seven materials were highly resistant to target spot: PI 71506, PI 153230, PI 567310B, PI 587802, PI 587860, PI 407999-1 and PI 548984. These materials can be worked on in breeding programs as sources of resistance to the target spot. In contrast, 616 materials showed susceptibility to the disease, of which 67 were highly susceptible. The classification of the materials as to their resistance to target spot can be seen in Table 4.
The panel chosen for this analysis was SoySNP50K (Song Q, Hyten D L, Jia G, Quigley C V, Fickus E W, Nelson R L, et al. (2013) Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean. PLoS ONE 8(1): e54985. https://doi.org/10.1371/journal.pone.0054985). This panel has genotyping data for all the materials evaluated in this work. Beyond this, has a broad coverage of the soybean genome, with 42,080 SNPs distributed across the 20 soybean chromosomes.
With the phenotypic and genotypic data from the samples used in this experiment, an associative analysis was developed in search of SNPs linked to target spot resistance. For this, linear mixed models were used (MLM) developed by the MVP packages (YIN et al., 2018) GAPIT (TANG et al., 2016) and FarmCPU (LIU et al., 2016) with the Emma matrix algorithms (MVP) and VanRaden (GAPIT and FarmCPU). In addition, a principal component analysis was performed with a value of 3. A cut-off line of 0.05 for the P value was chosen in order to determine the most significant SNPs in this analysis.
Through associative analysis, it was possible to identify a region on chromosome 17 linked to resistance to target spot (
A large block in linkage disequilibrium with 110 kpb was observed, in which 14 of the 15 SNPs are found (
The three most significant SNPs lie in a 27 kpb range within the identified block. In this range, three genes are present: Glyma.17G224300, Glyma.17G224400 and Glyma.17G224500. The SNP with the highest p-value is located at position 37,772,369 and is a nonsynonymous mutation under an exon of the Glyma.17G224500 gene, a protein kinase of the LRR type. The second SNP was identified at 1,868 bp downstream of the Glyma.17G224400 gene, an LTR-like gag polypeptide. Finally, the third SNP was identified at position 37,744,962 and is under an intron of the Glyma.17g224300 gene, a protein kinase of the LRR type. When using the haplotype of the three SNPs, a filtering with selection of the samples with higher resistance was observed (Table 8).
With the results obtained, it was detected that the SNP ss715627273 showed a genotype selection efficiency of 84.33%. When this SNP was compared together with other allelic variations, it was observed that there was not such a relevant increase in selection efficiency, nor in the decrease of error percentages (Table 8). This result demonstrates that the mark can be used alone to select resistant individuals.
A segregant population was developed by crossing BRSMG 68 (Winner) (resistant to C. cassiicola) and NA 5909 RG (susceptible to C. cassiicola). This population was advanced to the F3 generation, which a progeny test was performed on each individual inferring its F2:3. A total of 96 individuals were preliminarily evaluated phenotypically, in a greenhouse experiment, with four randomized blocks with 5 replicates per family. The same inoculation and evaluation methodology was used (scale of notes) described above (Soares et al., 2009).
The generated results were analyzed using an analysis of variance (ANOVA) and showed that there was a significant difference between the phenotyped families (Table 9). With the results obtained, an analysis of the inheritance of the trait was performed and the segregation hypotheses for the 3:1 trait were verified (a recessive gene), 13:3 (one dominant and one recessive gene) e 55:9 (two dominant and one recessive gene). To confirm the results, a larger number of families will be evaluated in future analyses.
Finally, the three markers with the highest p-values were synthesized via Taqman technology and amplified in the 96 families of the segregating population. The results showed that all three markers had a high effect on disease resistance (
Number | Date | Country | Kind |
---|---|---|---|
BR 102020009417 3 | May 2020 | BR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/BR2020/050353 | 9/2/2020 | WO |