The present disclosure relates to plants resistant to diseases, in particular to Brassica plants resistant to clubroot disease.
Clubroot is a widespread disease that causes major economic losses and has emerged as serious threat in many Brassica growing areas globally and particularly in North America. Clubroot disease is caused by Plasmodiophora brassicae, a soil-borne, root-infecting protist pathogen and phylogenetical intermediate between a fungus and bacteria. P. brassicae infection leads to swollen roots or ‘galls’ that hijack the host water and nutrient supplies, causing wilting, death and loss of yield. Management of clubroot is challenging because of two unique attributes of P. brassicae. The organism has very short life cycles and can produce multiple generations within a season. Second, each infected gall produces billions of spores that can survive in soil for many years and, in some cases, more than 15 years. Local spread of spores can be facilitated by wet conditions, but most dispersal of the pathogen is caused by transportation of infested soil or compost, e.g., on tools, equipment or plant material. P. brassicae has a wide host range in the Brassica family including numerous weed species.
There are currently no effective fungicides for the widespread control of clubroot. In the absence of effective chemical control options, developing sources of genetic resistance has the most potential for protecting Brassica from clubroot. Clubroot resistance, mostly qualitative and race-specific, exists in some Brassica vegetables such as rutabaga, turnips, and cabbages, including in Chinese cabbage (Yoshikawa. 1983. Japan Agricultural Research Quarterly, 17:6-11). Chinese cabbage F1 hybrids with this resistance have been shown to have good protection against clubroot, although a small number of races have been able to break through this resistance. To date, more than 10 loci have been identified that contribute to clubroot resistance, these include: CRa, CRb, CRc, CRk (Matsumoto et al. 1998. J Jpn Soc Hortic Sci 74:367-373; Piao et al. 2004. Theor Appl Genet 108:1458-1465; Sakamoto et al. 2008. Theo Appl Genet 117:759-767), Crr1, Crr2, Crr3, Crr4 (Suwabe et al. 2003. Theo Appl Genet 107:997-1002; Suwabe et al. 2006. Genetics 173:309-319; Hirai et al. 2004. Theor Appl Genet 108:639-643), CRd (Pang et al. 2018. Front Plant Sci 9:822), PbBa3.1, PbBa3.2, PbBa3.3, PbBa1.1, PbBa8.1 (Chen et al. 2013. PLoS ONE 8(12):e85307), Rcr1 (Chu et al. 2014. BMC Genomics 15(1):1166), Rcr4, Rcr8, and Rcr9 (Yu et al. 2017. Sci Rep 7(1):4516).
Nonetheless different subgroups or races of clubroot pathogen have been identified that exhibit virulence against plants having loci associated with a particular clubroot resistance. Additionally, repeated plantings of Brassica plants having the same (single or multiple) clubroot resistance loci may lead to the diminution and/or complete loss of effectiveness due to selection pressure for pathogens that overcome these genetic sources of resistance. This is of particular concern when varieties with clubroot resistant loci are challenged by high pathogen loads, which increases the probability for evolving new races that are virulent even for plants having those loci. Therefore, in order to mitigate the problem of evolving pathogen resistance and to protect against a broader spectrum of pathogens, there is a need and desire to identify, introgress, and track new sources of clubroot resistance in Brassica species, particularly for the commercially significant species such as Brassica napus.
Disclosed herein are genetic marker alleles, methods, and assays for identifying and tracking clubroot resistance loci on Brassica chromosome N8. The markers, methods, and assays are based, at least in part, on discoveries generated by an extensive and intensive genetic screening effort to identify new markers and/or sources of clubroot disease. The disclosed markers are tightly linked to the resistance loci CrB8, CrG8, CrE8, CrM8, and CrI8 described herein. The disclosed markers appear to be uniquely specific to resistant donor lines disclosed herein and/or are so rare in publicly available germplasm that they have not been previously identified as being linked to clubroot resistance.
The disclosed CrB8, CrG8, CrE8, CrM8, and CrI8 markers are suitable for high-throughput marker assisted selection. In certain examples, the markers are particularly suited for the identification of loci that are rare or particularly unusual. Additionally, the disclosed markers are suitable for the identification and introgression of clubroot resistance in inbred germplasm for each loci on chromosome N8 and can be used to generate hybrid clubroot resistant Brassica plants and seed.
Provided is a method of identifying a Brassica plant, cell, or germplasm comprising a clubroot disease resistance locus by obtaining a sample of nucleic acid from a Brassica plant, cell, or germplasm and screening the sample for a molecular marker allele, or a haplotype of molecular marker alleles, linked to one or more of the following clubroot resistance loci: (1) CrB8 located on chromosome N8 interval flanked by and including 12.94 cM and 16.44 cM, (2) CrG8 located on chromosome N8 interval flanked by and including 13.94 cM and 14.07 cM, (3) CrE8 located on chromosome N8 interval flanked by and including 12.87 cM and 13.98 cM, (4) CrM8 located on chromosome N8 interval flanked by and including 13.2 cM and 13.38 cM, or (5) CrI8 located on chromosome N8 interval flanked by and including 13.2 cM and 13.7 cM. As disclosed herein, the CrB8 locus corresponds to physical position 10,656,081 to position 13,303,318 of chromosome 8 (Chr 8); the CrG8 locus corresponds to physical position 11,124,294 to position 11,338,475 of Chr 8; the CrE8 locus corresponds to the physical position 10,146,787 to position 11,793,943 of Chr 8; the CrM8 locus corresponds to position 10,959,267 to position 11,159,261 of Chr 8; and the CrI8 locus corresponds to position 10,986,309 to position 11,191,524 on Chr 8 of a B. napus reference genome. Examples of single nucleotide polymorphism (SNP) markers that correspond to resistance (RES) and susceptibility (SUS) alleles for clubroot disease are identified in Tables 1-8. The probe sequences disclosed in Tables 1-8 comprises sequence flanking each of these SNPs. Many of the probe sequences (including the bolded and underlined SNP nucleotide) displayed in Tables 1-8 correspond to the genomic strand sequence complementary to that shown in the columns for the corresponding RES and SUS alleles. Thus for every method disclosed herein for a particular SNP nucleotide or flanking maker sequence, it is understood that the disclosed method also includes the SNP nucleotide or flanking sequence, respectively, on the complementary strand.
In some examples, the method of identifying a Brassica plant, cell, or germplasm comprising a clubroot disease resistance locus comprises screening for at least one of the following molecular marker alleles (e.g., a haplotype that includes two or more of the following marker alleles): a CrB8 resistance marker allele identified in Table 1 or Table 2 herein; a CrG8 resistance marker allele identified in Table 3 herein; a CrE8 resistance marker allele identified in Table 4 or Table 5 herein; a CrM8 resistance marker allele identified in Table 6 or Table 7 herein; or a CrI8 resistance marker allele identified in Table 8 herein. Thus, for example, the method can include screening for a haplotype comprising (A) 2, 3, 4, 5, 6 or more resistance marker alleles in Table 1 or Table 2 herein; (B) 2, 3, 4, 5, 6 or more resistance marker alleles in Table 3 herein; (C) 2, 3, 4, 5, 6 or more resistance marker alleles identified in Table 4 or Table 5 herein; (D) 2, 3, 4, 5, 6 or more resistance marker alleles in Table 6 or Table 7 herein; or (E) 2, 3, 4, 5, 6 or more marker resistance alleles in Table 8 herein.
Additionally, the disclosed method can include screening for one or more CrB8 resistance alleles identified in Table 1 or Table 2 herein in combination with one or more CrG8 resistance allele identified in Table 3 herein, one or more CrE8 resistance allele identified in Table 4 or Table 5 herein, one or more CrM8 alleles identified in Table 6 or Table 7 herein, or one or more CrI8 resistance alleles identified in Table 8 herein. Or the method can include screening for one or more CrG8 resistance allele identified in Table 3 herein in combination with one or more CrB8 resistance alleles identified in Table 1 or Table 2 herein, one or more CrE8 resistance allele identified in Table 4 or Table 5 herein, one or more CrM8 alleles identified in Table 6 or Table 7 herein, or one or more CrI8 resistance alleles identified in Table 8 herein. Or the method can include screening for one or more CrE8 resistance allele identified in Table 4 or Table 5 herein in combination with one or more CrB8 resistance alleles identified in Table 1 or Table 2 herein, one or more CrG8 resistance allele identified in Table 3 herein, one or more CrE8 resistance allele identified in Table 4 or Table 5 herein, one or more CrM8 alleles identified in Table 6 or Table 7 herein, or one or more CrI8 resistance alleles identified in Table 8 herein. Or the method can include screening for one or more CrM8 alleles identified in Table 6 or Table 7 herein in combination with one or more CrB8 resistance alleles identified in Table 1 or Table 2 herein, one or more CrG8 resistance allele identified in Table 3 herein, one or more CrE8 resistance allele identified in Table 4 or Table 5 herein, or one or more CrI8 resistance alleles identified in Table 8 herein. Or the method can include screening for one or more CrI8 resistance alleles identified in Table 8 herein in combination with one or more CrB8 resistance alleles identified in Table 1 or Table 2 herein, one or more CrG8 resistance allele identified in Table 3 herein, one or more CrE8 resistance allele identified in Table 4 or Table 5 herein, or one or more CrM8 alleles identified in Table 6 or Table 7 herein.
In some examples, the method of identifying a Brassica plant, cell, or germplasm comprising a clubroot disease resistance locus comprises screening for at least one of the following resistance marker alleles: (1) N101BWO-001-Q001 (SEQ ID NO:23), N101T3M-001-Q001 (SEQ ID NO:30), N101T3P-001-Q001(SEQ ID NO:33), or N101T3R-001-Q001 (SEQ ID NO:37) allele linked to CrB8; (2) N100C6A-001-Q001 (SEQ ID NO:44) allele linked to CrG8; (3) N100CJT-001-Q001 (SEQ ID NO:180), N101T3T-001-Q001 (SEQ ID NO:219), or N101T3U-001-Q001 (SEQ ID NO:222) allele linked to CrE8; (4) N100CDD-001-Q001 (SEQ ID NO:262), N101T3X-001-Q001 (SEQ ID NO:275), N101T3Y-001-Q001 (SEQ ID NO:278), or N101T41-001-Q001 (SEQ ID NO:282) allele linked to CrM8; (5) N101TOT-001-Q003 (SEQ ID NO:302) allele linked to CrI8.
Moreover, each of the methods for identifying a Brassica plant, cell, or germplasm comprising a clubroot disease resistance locus disclosed herein can further include selecting the Brassica plant, cell, or germplasm thereof based on the presence of the molecular marker allele or a haplotype of molecular marker alleles linked to the clubroot resistance locus. Thus, provided herein is a method of selecting a plant identified by any of the methods disclosed herein as having one or more CrB8 resistance allele identified in Table 1 or Table 2 herein; one or more CrG8 resistance allele identified in Table 3 herein; one or more CrE8 resistance allele identified in Table 4 or Table 5 herein; one or more CrM8 allele identified in Table 6 or Table 7 herein; or one or more CrI8 resistance allele identified in Table 8 herein. The disclosed selection methods are particularly useful for identifying and selecting such a Brassica plant, cell, or germplasm from a plurality (e.g., in a breeding population). Accordingly the disclosed methods can be used for marker assisted selection and/or introgression of the CrB8, CrG8, CrE8, CrM8, and CrI8 loci disclosed herein.
For example, disclosed herein is a method of introducing (e.g., introgressing) at least one clubroot resistance locus into a Brassica plant by crossing a first parent Brassica plant comprising at least one clubroot resistance locus with a second Brassica plant to produce progeny plants, which can be screened for the presence or absence of one or more CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance locus using any of the screening methods disclosed herein. Thus progeny plants having at least one molecular marker allele or a haplotype that includes two or more of marker alleles identified in Tables 1-8 can be identified using any of the marker allele screening methods disclosed herein (optionally, such method can include screening for the presence of one or more susceptibility alleles disclosed in Tables 1-8 that corresponds to the one or more screened-for resistance alleles and removing or discarding plants having the susceptibility allele instead of the screened-for resistance allele). The introgression method can then include selecting one or more progeny plants having the CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance locus that is screened for. In particular examples, the introgression method can further include crossing the selected one or more progeny plants with the second parent Brassica plant to produce backcross progeny plants. Such backcross progeny plants can be screened for the presence or absence of the CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance marker alleles to thereby identify and select backcross progeny plants having a CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance locus. The selected backcross progeny plant can itself be backcrossed to the second parent Brassica plant to produce further backcross progeny plants, which can be screened as described to enable selection of further backcross progeny plants having a CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance locus. Such backcrossing, screening, and selection can be repeated for two, three, four, five, six or more generations to introgress the CrB8, CrG8, CrE8, CrM8, or CrI8 clubroot disease resistance locus into the genetic background of the second parent Brassica plant.
The disclosure can be more fully understood from the following detailed description and Sequence Listing, which form a part of this application. The sequence descriptions and sequence listing attached hereto comply with the rules governing nucleotide and amino acid sequence disclosures in patent applications as set forth in 37 C.F.R. §§1.821 and 1.825. The sequence descriptions comprise the three letter codes for amino acids as defined in 37 C.F.R. §§ 1.821 and 1.825, which are incorporated herein by reference. When one strand of each nucleic acid sequence is shown, the complementary strand is understood to be included by any reference to the displayed strand.
Terms used in the claims and specification are defined as set forth below unless otherwise specified. It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.
An “allele” is one of several alternative forms of a gene occupying a given locus on a chromosome. When all the alleles present at a given locus on a chromosome are the same, that plant is “homozygous” at that locus. If the alleles present at a given locus on a chromosome differ, that plant is “heterozygous” at that locus.
An “amplicon” is amplified nucleic acid, e.g., a nucleic acid that is produced by amplifying a template nucleic acid by any available amplification method (e.g., polymerase chine reaction (PCR), ligase chain reaction (LCR), transcription, or the like).
“Backcrossing” refers to the process whereby hybrid progeny plants are repeatedly crossed back to one of the parents. In a backcrossing scheme, the “donor” parent refers to the parental plant with the desired gene or locus to be introgressed. The “recipient” parent (used one or more times) or “recurrent” parent (used two or more times) refers to the parental plant into which the gene or locus is being introgressed. Backcrossing has been widely used to introduce new traits into plants. See e.g., Jensen, N., Ed. Plant Breeding Methodology, John Wiley & Sons, Inc., 1988. In a typical backcross protocol, the original variety of interest (recurrent parent) is crossed to a second variety (non-recurrent parent) that carries a gene of interest to be transferred. The resulting progeny from this cross are then crossed again to the recurrent parent, and the process is repeated until a plant is obtained wherein essentially all of the desired morphological and physiological characteristics of the recurrent plant are recovered in the converted plant, in addition to the transferred gene from the nonrecurrent parent.
“Brassica” refers to any one of Brassica napus (AACC, 2n=38), Brassica juncea (AABB, 2n=36), Brassica carinata (BBCC, 2n=34), Brassica rapa (syn. B. campestris) (AA, 2n=20), Brassica oleracea (CC, 2n=18) or Brassica nigra (BB, 2n=16).
The term “cross” (or “crossed”) refers to the fusion of gametes via pollination to produce progeny (e.g., cells, seeds, and plants). This term encompasses both sexual crosses (i.e., the pollination of one plant by another) and selfing (i.e., self-pollination, for example, using pollen and ovule from the same plant).
The term “elite line” means any line that has resulted from breeding and selection for superior agronomic performance. An elite plant is any plant from an elite line.
The term “gene” (or “genetic element”) may refer to a heritable genomic DNA sequence with functional significance. A gene includes a nucleic acid fragment that expresses a functional molecule such as, but not limited to, a specific protein, including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence, as well as intervening intron sequences. The term “gene” may also be used to refer to, for example and without limitation, a cDNA and/or an mRNA encoded by a heritable genomic DNA sequence.
The term “genome” as it applies to a prokaryotic and eukaryotic cell or organism cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondria, or plastid) of the cell.
A “genomic sequence” or “genomic region” is a segment of a chromosome in the genome of a cell that is present on either side of the target site or, alternatively, also comprises the target site or a portion thereof. An “endogenous genomic sequence” refers to genomic sequence within a plant cell.
As used herein, “gene” includes a nucleic acid fragment or sequence that expresses a functional molecule such as, but not limited to, a specific protein coding sequence and regulatory elements, such as those preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence.
A “genomic locus” as used herein refers to the genetic or physical location on a chromosome of a gene.
The term “genotype” refers to the physical components, i.e., the actual nucleic acid sequence at one or more loci in an individual plant.
The term “germplasm” refers to genetic material of or from an individual plant or group of plants (e.g., a plant line, variety, and family), or a clone derived from a plant or group of plants. A germplasm may be part of an organism or cell, or it may be separate (e.g., isolated) from the organism or cell. In general, germplasm provides genetic material with a specific molecular makeup that is the basis for hereditary qualities of the plant. As used herein, “germplasm” refers to cells of a specific plant; seed; tissue of the specific plant (e.g., tissue from which new plants may be grown); and non-seed parts of the specific plant (e.g., leaf, stem, pollen, and cells). Thus, “germplasm” is used herein synonymously with “genetic material” and may be used to refer to seed (or other plant material) from which a plant may be propagated. A “germplasm bank” may refer to an organized collection of different seed or other genetic material (wherein each genotype is uniquely identified) from which a known cultivar may be cultivated, and from which a new cultivar may be generated. In embodiments, a germplasm utilized in a method or plant as described herein is from a canola line or variety. In particular examples, a germplasm is seed of the canola line or variety. In particular examples, a germplasm is a nucleic acid sample from the Brassica line or variety.
A “haplotype” is the genotype of an individual at a plurality of genetic loci, i.e. a combination of alleles. Typically, the genetic loci described by a haplotype are physically and genetically linked, i.e., on the same chromosome segment.
The terms “increased” or “improved” in connection with “clubroot resistance” is used herein to refer to plants having increased growth, productivity, and/or reduction in root size or number of root nodules, relative a plant that is susceptible (lacking resistance) to clubroot disease, when grown in a field comprising Plasmodiophora brassicae.
The term “introgression” refers to the transmission of an allele at a genetic locus into a genetic background. For example, introgression of a specific allele can involve a sexual cross between two parents of the same species, where at least one of the parents has the specific allele in its genome, to thereby transfer the allele to at least one progeny. Progeny comprising the specific allele form may be repeatedly backcrossed to a line having a desired genetic background. Backcross progeny may be selected for the specific allele form, so as to produce a new variety wherein the specific allele form has been fixed in the progeny's genetic background. In some embodiments, introgression of a specific allele may occur by recombination between two donor genomes (e.g., in a fused protoplast), where at least one of the donor genomes has the specific allele in its genome. Introgression may involve transmission of a specific allele that may be, for example, a selected allele form of a marker allele, a QTL, and/or a transgene.
As used herein an “isolated” biological component (such as a nucleic acid or protein) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs (i.e., other chromosomal and extra-chromosomal DNA and RNA, and proteins), while effecting a chemical or functional change in the component. For example and without limitation, a nucleic acid may be isolated from a chromosome by breaking chemical bonds connecting the nucleic acid to the remaining DNA in the chromosome and/or the other material previously associated with the nucleic acid in its cellular milieu (e.g., the nucleus). Nucleic acid molecules and proteins that have been “isolated” include nucleic acid molecules and proteins that are enriched or purified . The term also embraces nucleic acids and proteins prepared by recombinant expression in a host cell, as well as chemically-synthesized nucleic acid molecules, proteins, and peptides.
“Marker-assisted selection” (MAS) is a process by which phenotypes are selected based on marker genotypes. Marker assisted selection can include the use of genetic markers to identify plants for inclusion in and/or removal from a breeding program or planting. A molecular marker allele that demonstrates linkage disequilibrium with a desired phenotypic trait (e.g., a QTL) provides a useful tool for the selection of the desired trait in a plant population. Components for implementing a MAS approach include the creation of a dense (information rich) genetic map of molecular markers in the plant germplasm; the detection of at least one QTL based on statistical associations between marker and phenotypic variability; the definition of a set of particular useful marker alleles based on the results of the QTL analysis; and the use and/or extrapolation of this information to the current set of breeding germplasm to enable marker-based selection decisions to be made.
The closer a particular marker is to a gene that encodes a polypeptide that contributes to a particular phenotype (whether measured in terms of genetic or physical distance), the more tightly-linked is the particular marker to the phenotype. In view of the foregoing, it will be appreciated that the closer (whether measured in terms of genetic or physical distance) that a marker is linked to a particular gene, the more likely the marker is to segregate with that gene (e.g., a clubroot disease resistance marker disclosed herein) and its associated phenotype (e.g., clubroot disease resistance disclosed herein). Thus, the tightly linked genetic markers for clubroot resistance disclosed herein can be used in MAS programs to identity Brassica varieties that have or can generate progeny that have increased clubroot resistance (relative to parental varieties and/or otherwise isogenic plants lacking that clubroot disease resistance marker), to identify individual plants comprising this clubroot disease resistance trait, and to breed this trait into other Brassica varieties to improve their clubroot disease resistance. Marker-assisted selection is discussed in more detail in a subsection hereinbelow.
A “marker set” or a “set” of markers or probes refers to a specific collection of markers (or data derived therefrom) that may be used to identify individuals comprising a trait of interest. Thus, a set of markers linked to clubroot resistance may be used to identify a Brassica plant comprising one the clubroot disease resistance loci disclosed herein. Data corresponding to a marker set (or data derived from the use of such markers) may be stored in an electronic medium. While each marker in a marker set is useful in the identification of individuals comprising a trait of interest, subsets of markers in a set (i.e., some but not necessarily all of the markers in a marker set) can be used to effectively identify individuals comprising the trait of interest disclosed herein, i.e., one of the clubroot disease resistance loci disclosed herein.
A “modified gene” is a gene that has been mutated or altered through human intervention. Such a “modified” gene has a sequence that differs from the sequence of the corresponding non-modified gene by at least one nucleotide addition, deletion, or substitution. A “modified” plant is a plant comprising a modified gene or deletion.
As used herein the term “native gene” refers to a gene as it is found in its natural endogenous location operably linked to its own regulatory sequences, which have not been altered by human intervention. In the context of this disclosure, a “modified” gene is not a native gene.
As used herein, a “nucleic acid molecule” is a polymeric form of nucleotides, which can include both sense and anti-sense strands of RNA, cDNA, genomic DNA, recombinant and synthetic forms and mixed polymers of the above. A nucleotide refers to a ribonucleotide, deoxynucleotide, or a modified form of either type of nucleotide. As used herein “nucleic acid molecule” is synonymous with the terms “nucleic acid”, “nucleotide sequence”, “nucleic acid sequence”, and “polynucleotide.” The term includes single- and double-stranded forms of DNA or RNA. A nucleic acid molecule can refer to either or both naturally occurring and modified nucleotides linked together by naturally occurring and/or non-naturally occurring nucleotide linkages. Nucleic acid molecules may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications, such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., peptides), intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids, etc.). The term “nucleic acid molecule” also includes any topological conformation, including single-stranded, double stranded, partially duplexed, triplexed, hairpinned, circular, and padlocked conformations. An “endogenous nucleic acid sequence” refers to a nucleic acid sequence within a plant cell, (e.g. an endogenous allele of a native gene present within the genome of a Brassica plant cell).
The term “single-nucleotide polymorphism” (SNP) refers to a DNA sequence variation occurring when a single nucleotide in the genome (or other shared sequence) differs between members of a species or paired chromosomes in an individual. In some examples, markers linked to a clubroot disease resistance locus disclosed herein are SNP markers. Recent high-throughput genotyping technologies such as GoldenGate® and INFINIUM® assays (Illumina, San Diego, Calif.) may be used in accurate and quick genotyping methods by multiplexing SNPs from 384-plex to >100,000-plex assays per sample.
As used herein, “phenotype” means the detectable characteristics (e.g. clubroot disease resistance) of a cell or organism which can be influenced by genotype.
As used herein, the term “plant material” refers to any processed or unprocessed material derived, in whole or in part, from a plant. For example, and without limitation, a plant material may be a plant part, a seed, a fruit, a leaf, a root, a plant tissue, a plant tissue culture, a plant explant, or a plant cell.
As used herein, the term “plant” may refer to a whole plant, a cell or tissue culture derived from a plant, and/or any part of any of the foregoing. Thus, the term “plant” encompasses, for example and without limitation, whole plants; plant components and/or organs (e.g., leaves, stems, and roots); plant tissue; seed; and a plant cell. A plant cell may be, for example and without limitation, a cell in and/or of a plant, a cell isolated from a plant, and a cell obtained through culturing of a cell isolated from a plant. Thus, the term Brassica “plant” may refer to, for example and without limitation, a whole Brassica plant; multiple Brassica plants; Brassica plant cell(s); Brassica plant protoplast; Brassica tissue culture (e.g., from which a Brassica plant can be regenerated); Brassica plant callus; Brassica plant parts (e.g., seed, flower, cotyledon, leaf, stem, bud, root, and root tip); and Brassica plant cells that are intact in a Brassica plant or in a part of a Brassica plant.
As used herein, a plant or Brassica “line” refers to a group of plants that display little genetic variation (e.g., no genetic variation) between individuals for at least one trait. Inbred lines may be created by several generations of self-pollination and selection or, alternatively, by vegetative propagation from a single parent using tissue or cell culture techniques. As used herein, the terms “cultivar,” “variety,” and “type” are synonymous, and these terms refer to a line that is used for commercial production.
Trait or phenotype: The terms “trait” and “phenotype” are used interchangeably herein. For the purposes of the present disclosure, traits of particular interest are the clubroot disease resistance traits associated with each of the clubroot disease resistance loci disclosed herein.
A “variety” or “cultivar” is a plant line that can be used for commercial production and which is distinct and uniform in its characteristics when propagated. In the case of a hybrid variety or cultivar, the parental lines are distinct, stable, and uniform in their characteristics.
Detection of Disclosed Markers. Each of the markers for the CrB8, CrG8, CrE8, CrM8, and CrI8 loci disclosed herein can be detected by any suitable method for detecting genetic polymorphisms. Suitable methods of detection include nucleotide amplification and/or sequencing of the genetic material, e.g., nucleic acid or genomic DNA sequencing that reveals the presence for a disease resistance marker allele disclosed herein for the CrB8, CrG8, CrE8, CrM8, and CrI8 loci. See Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, and Table 8 (Tables 1-8) disclosing clubroot disease resistance markers alleles for each of the loci disclosed herein.
The clubroot disease resistance marker alleles can be identified and distinguished from susceptible allele using allele-specific amplification and PCR-based amplification assays such as TaqMan, rhAmp-SNP, KASPar, and molecular beacons. Such an assay can include the use of one or more probes that detect the marker allele in (i) nucleic acid that is isolated from a plant or (ii) an amplicon that is selectively amplified by amplification of nucleic acid isolated from a plant. Optionally, such an assay can further include an additional set of primers and/or one or more probes that detect the presence of a clubroot susceptible (e.g., wildtype) allele and thereby determine the zygosity (or even the absence) of clubroot resistance loci disclosed herein.
Additional methods for genotyping and detecting a resistant marker allele for the CrB8, CrG8, CrE8, CrM8, and CrI8 loci disclosed herein (or a linked marker) include but are not limited to, hybridization, primer extension, oligonucleotide ligation, nuclease cleavage, minisequencing and coded spheres. Such methods are reviewed in publications including Gut, 2001, Hum. Mutat. 17:475; Shi, 2001, Clin. Chem. 47:164; Kwok, 2000, Pharmacogenomics 1:95; Bhattramakki and Rafalski, “Discovery and application of single nucleotide polymorphism markers in plants”, in PLANT GENOTYPING: THE DNA FINGERPRINTING OF PLANTS (CABI Publishing, Wallingford 2001). A wide range of commercially available technologies utilize these and other methods to interrogate the allele disclosed herein (or a linked marker), including Masscode™ (Qiagen, Germantown, Md.), Invader® (Hologic, Madison, Wis.), SnapShot® (Applied Biosystems, Foster City, Calif.), Taqman® (Applied Biosystems, Foster City, Calif.) and Infinium Bead Chip™ and GoldenGate™ allele-specific extension PCR-based assay (Illumina, San Diego, Calif).
In particular example, detecting a disclosed maker can include nucleic acid sequencing, nucleic acid amplification, or the combined amplification and nucleic acid sequencing of the marker allele and 5 bp or more, 10 bp or more, 15 bp or more, 20 bp or more, 30 bp or more, 40 bp or more, 50 bp or more, 60 bp or more, 70 bp or more, 80 bp or more, 90 bp or more, 100 bp or more, 110 bp or more, 120 bp or more, 130 bp or more, 140 bp or more, 150 bp or more, 175 bp or more, 200 bp or more, 250 bp or more, 300 bp or more, 350 bp or more, 400 bp or more, 450 bp or more, 500 bp or more, 550 bp or more, or 600 bp or more of flanking sequence that are (i) upstream of (i.e., located 5′ to) the relevant marker allele and/or (ii) downstream of (i.e., located 3′ to) the relevant marker allele. Thus, in particular examples, the disclosed marker can be detected by amplifying nucleic acid (e.g., genomic DNA) sequence to produce an amplicon comprising one or more of the marker allele sequences identified in Tables 1-8 herein. Primers suitable for amplification of each marker are disclosed Tables 1-8. Additionally, the markers disclosed herein can be detected by nucleotide sequencing of nucleic acids such as genomic DNA (e.g., by first amplifying genomic sequence and sequencing the resulting amplicon) comprising a resistance marker allele sequence identified in Tables 1-8 for each of the disclosed CrB8, CrG8, CrE8, CrM8, and CrI8 loci, respectively.
Other methods of detecting the marker allele for the CrB8, CrG8, CrE8, CrM8, and CrI8 loci disclosed herein include single base extension (SBE) methods, which involve the extension of a nucleotide primer that is adjacent to a polymorphism to incorporate a detectable nucleotide residue upon extension of the primer through the polymorphism, e.g., extension through the marker allele disclosed herein.
Methods of detecting the marker allele for the CrB8, CrG8, CrE8, CrM8, and CrI8 loci disclosed herein also include LCR; and transcription-based amplification methods (e.g., SNP detection, SSR detection, RFLP analysis, and others). Useful techniques include hybridization of a probe nucleic acid to a nucleic acid corresponding to a marker allele disclosed herein, or a linked marker (e.g., an amplified nucleic acid produced using a genomic canola DNA molecule as a template). Hybridization formats including, for example and without limitation, solution phase; solid phase; mixed phase; and in situ hybridization assays may be useful for allele detection in particular embodiments. An extensive guide to hybridization of nucleic acids is discussed in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes (Elsevier, N.Y. 1993).
Many detection methods (including amplification-based and sequencing-based methods) may be readily adapted to high throughput analysis in some examples, for example, by using available high throughput sequencing methods, such as sequencing by hybridization.
Detecting each of the CrB8, CrG8, CrE8, CrM8, and CrI8 loci (or marker allele therefor) disclosed herein can be done using nucleotide sequencing products, amplicons, or probes comprising detectable labels. Detectable labels suitable for use include any composition that can be detected by spectroscopic, radioisotopic, photochemical, biochemical, immunochemical, electrical, optical, or chemical means. Thus, a particular allele of a SNP may be detected using, for example, autoradiography, fluorography, or other similar detection techniques, depending on the particular label to be detected. Useful labels include biotin (for staining with labeled streptavidin conjugate), magnetic beads, fluorescent dyes, radiolabels, enzymes, luminescent or phosphorescent indicators, and colorimetric labels. Other labels include ligands that bind to antibodies or specific binding targets labeled with fluorophores, chemiluminescent agents, and enzymes. In some examples the detection techniques disclosed herein include the use of fluorescent dyes (e.g. FAM, VIC, TET, FITC, TRITC, Texas Red, etc.) with or without a quencher (BHQ1 or DABsyl).
Molecular markers can be used in a variety of plant breeding applications (e.g. see Staub et al. (1996) Hortscience 31: 729-741; Tanksley (1983) Plant Molecular Biology Reporter. 1: 3-8). A molecular marker that demonstrates linkage with a locus affecting a desired phenotypic trait provides a useful tool for the selection of the trait in a plant population. This is particularly true where the phenotype is hard to assay. Since DNA marker assays are less laborious and take up less physical space than field phenotyping, much larger populations can be assayed, increasing the chances of finding a recombinant with the target segment from the donor line moved to the recipient line. Thus, marker-assisted selection (MAS) has been used to significantly increase the efficiency of plant breeding at least in part by improving the efficiency of backcrossing and gene introgression.
The closer the linkage between marker and locus, the more useful the marker, as recombination is less likely to occur between the marker and the genomic feature that causes the trait, which can result in false positives. Having flanking markers on both sides of a locus decreases the chances that false positive selection will occur as a double recombination event would be needed. Generally, it is most preferred to have a marker within or at the genomic locus (e.g., within the gene or at the mutation that causes the phenotype) itself, so that recombination cannot occur between the marker and the causal gene or mutation. In some embodiments, the methods disclosed herein produce a marker in a disease resistance gene, wherein the gene was identified by inferring genomic location from clustering of conserved domains or a clustering analysis.
When a gene is introgressed by MAS, it is not only the gene that is introduced but also the flanking regions (Gepts (2002). Crop Sci; 42: 1780-1790). This is referred to as “linkage drag.” In the case where the donor plant is highly unrelated to the recipient plant, these flanking regions carry additional genes that may code for agronomically undesirable traits. This “linkage drag” may also result in reduced yield or other negative agronomic characteristics even after multiple cycles of backcrossing into the elite line. This is also sometimes referred to as “yield drag.” The size of the flanking region can be decreased by additional backcrossing, although this is not always successful, as breeders do not have control over the size of the region or the recombination breakpoints (Young et al. (1998) Genetics 120:579-585). In classical breeding it is usually only by chance that recombinations are selected that contribute to a reduction in the size of the donor segment (Tanksley et al. (1989). Biotechnology 7: 257-264). Even after 20 backcrosses in backcrosses of this type, one may expect to find a sizeable piece of the donor chromosome still linked to the gene being selected. With markers however, it is possible to select those rare individuals that have experienced recombination near the gene of interest. In 150 backcross plants, there is a 95% chance that at least one plant will have experienced a crossover within 1 cM of the gene, based on a single meiosis map distance. Markers will allow unequivocal identification of those individuals. With one additional backcross of 300 plants, there would be a 95% chance of a crossover within 1 cM single meiosis map distance of the other side of the gene, generating a segment around the target gene of less than 2 cM based on a single meiosis map distance. This can be accomplished in two generations with markers, while it would have required on average 100 generations without markers (See Tanksley et al., supra). When the exact location of a gene is known, flanking markers surrounding the gene can be utilized to select for recombinations in different population sizes. For example, in smaller population sizes, recombinations may be expected further away from the gene, so more distal flanking markers would be required to detect the recombination.
Important components to the implementation of MAS are: (i) defining the population within which the marker-trait association will be determined, which can be a segregating population, or a random or structured population; (ii) monitoring the segregation or association of polymorphic markers relative to the trait, and determining linkage or association using statistical methods; (iii) defining a set of desirable markers based on the results of the statistical analysis, and (iv) the use and/or extrapolation of this information to the current set of breeding germplasm to enable marker-based selection decisions to be made. The markers described in this disclosure, as well as other marker types such as SSRs and FLPs, can be used in marker assisted selection protocols.
SSRs can be defined as relatively short runs of tandemly repeated DNA with lengths of 6 bp or less (Tautz (1989) Nucleic Acid Research 17: 6463-6471; Wang et al. (1994) Theoretical and Applied Genetics, 88:1-6) Polymorphisms arise due to variation in the number of repeat units, probably caused by slippage during DNA replication (Levinson and Gutman (1987) Mol Biol Evol 4: 203-221). The variation in repeat length may be detected by designing PCR primers to the conserved non-repetitive flanking regions (Weber and May (1989) Am J Hum Genet. 44:388-396). SSRs are highly suited to mapping and MAS as they are multi-allelic, codominant, reproducible and amenable to high throughput automation (Rafalski et al. (1996) Generating and using DNA markers in plants. In: Non-mammalian genomic analysis: a practical guide. Academic press. pp 75-135).
Various types of SSR markers can be generated, and SSR profiles can be obtained by gel electrophoresis of the amplification products. Scoring of marker genotype is based on the size of the amplified fragment.
Various types of FLP markers can also be generated. Most commonly, amplification primers are used to generate fragment length polymorphisms. Such FLP markers are in many ways similar to SSR markers, except that the region amplified by the primers is not typically a highly repetitive region. Still, the amplified region, or amplicon, will have sufficient variability among germplasm, often due to insertions or deletions, such that the fragments generated by the amplification primers can be distinguished among polymorphic individuals, and such indels are known to occur frequently in maize (Bhattramakki et al. (2002). Plant Mol Biol 48, 539-547; Rafalski (2002b), supra).
SNP markers detect single base pair nucleotide substitutions. Of all the molecular marker types, SNPs are the most abundant, thus having the potential to provide the highest genetic map resolution (Bhattramakki et al. 2002 Plant Molecular Biology 48:539-547). SNPs can be assayed at an even higher level of throughput than SSRs, in a so-called ‘ultra-high-throughput’ fashion, as SNPs do not require large amounts of DNA and automation of the assay may be straight-forward. SNPs also have the promise of being relatively low-cost systems. These three factors together make SNPs highly attractive for use in MAS. Several methods are available for SNP genotyping, including but not limited to, hybridization, primer extension, oligonucleotide ligation, nuclease cleavage, minisequencing, and coded spheres. Such methods have been reviewed in: Gut (2001) Hum Mutat 17 pp. 475-492; Shi (2001) Clin Chem 47, pp. 164-172; Kwok (2000) Pharmacogenomics 1, pp. 95-100; and Bhattramakki and Rafalski (2001) Discovery and application of single nucleotide polymorphism markers in plants. In: R. J. Henry, Ed, Plant Genotyping: The DNA Fingerprinting of Plants, CABI Publishing, Wallingford. A wide range of commercially available technologies utilize these and other methods to interrogate SNPs including Masscode™ (Qiagen), INVADER®. (Third Wave Technologies) and Invader PLUS®, SNAPSHOT®. (Applied Biosystems), TAQMAN®. (Applied Biosystems) and BEADARRAYS®. (Illumina).
A number of SNPs together within a sequence, or across linked sequences, can be used to describe a haplotype for any particular genotype (Ching et al. (2002), BMC Genet. 3:19 pp Gupta et al. 2001, Rafalski (2002b), Plant Science 162:329-333). Haplotypes can be more informative than single SNPs and can be more descriptive of any particular genotype. For example, a single SNP may be allele “T” for a specific line or variety with disease resistance, but the allele ‘T’ might also occur in the breeding population being utilized for recurrent parents. In this case, a haplotype, e.g. a combination of alleles at linked SNP markers, may be more informative. Once a unique haplotype has been assigned to a donor chromosomal region, that haplotype can be used in that population or any subset thereof to determine whether an individual has a particular gene. See, for example, WO2003054229. Using automated high throughput marker detection platforms makes this process highly efficient and effective.
Many of the markers presented herein can readily be used as single nucleotide polymorphic (SNP) markers to select for clubroot resistance. Using PCR, the primers are used to amplify DNA segments from individuals (preferably inbred) that represent the diversity in the population of interest. The PCR products are sequenced directly in one or both directions. The resulting sequences are aligned and polymorphisms are identified. The polymorphisms are not limited to single nucleotide polymorphisms (SNPs), but also include indels, CAPS, SSRs, and VNTRs (variable number of tandem repeats). Specifically, with respect to the fine map information described herein, one can readily use the information provided herein to obtain additional polymorphic SNPs (and other markers) within the region amplified by the primers disclosed herein. Markers within the described map region can be hybridized to BACs or other genomic libraries, or electronically aligned with genome sequences, to find new sequences in the same approximate location as the described markers.
In addition to SSR's, FLPs and SNPs, as described above, other types of molecular markers are also widely used, including but not limited to expressed sequence tags (ESTs), SSR markers derived from EST sequences, randomly amplified polymorphic DNA (RAPD), and other nucleic acid based markers.
Isozyme profiles and linked morphological characteristics can, in some cases, also be indirectly used as markers. Even though they do not directly detect DNA differences, they are often influenced by specific genetic differences. However, markers that detect DNA variation are far more numerous and polymorphic than isozyme or morphological markers (Tanksley (1983) Plant Molecular Biology Reporter 1:3-8).
Sequence alignments or contigs may also be used to find sequences upstream or downstream of the specific markers listed herein. These new sequences, close to the markers described herein, are then used to discover and develop functionally equivalent markers. For example, different physical and/or genetic maps are aligned to locate equivalent markers not described within this disclosure but that are within similar regions. These maps may be within the species, or even across other species that have been genetically or physically aligned.
In general, MAS uses polymorphic markers that have been identified as having a significant likelihood of co-segregation with a trait such as the clubroot disease resistance traits disclosed herein. Such markers are presumed to map near a gene or genes that give the plant its disease resistant phenotype, and are considered indicators for the desired trait, or markers. Plants are tested for the presence of a desired allele in the marker, and plants containing a desired genotype at one or more loci are expected to transfer the desired genotype, along with a desired phenotype, to their progeny. Thus, plants with clubroot disease resistance may be selected for by detecting one or more marker alleles, and in addition, progeny plants derived from those plants can also be selected. Hence, a plant containing a desired genotype in a given chromosomal region (i.e. a genotype associated with disease resistance) is obtained and then crossed to another plant. The progeny of such a cross would then be evaluated genotypically using one or more markers and the progeny plants with the same genotype in a given chromosomal region would then be selected as having disease resistance.
The markers disclosed herein can be used alone or in combination (i.e. as haplotype) to select for a favorable clubroot resistance locus. For example, each SNP having the resistance allele disclosed in Table 1 (e.g., N101BW0-001-Q001 having the “A” allele at position 10 of SEQ ID NO:1) can be used alone or in combination with another SNP resistance allele (e.g., the N101BW0-001-Q001 having “T” allele and N101BW2-001-Q001 having the “T” allele at position 12 of SEQ ID NO:5), or a combination thereof.
The skilled artisan would expect that there might be additional polymorphic sites at marker loci in and around a chromosome marker identified by the methods disclosed herein, wherein one or more polymorphic sites is in linkage disequilibrium (LD) with an allele at one or more of the polymorphic sites in the haplotype and thus could be used in a marker assisted selection program to introgress a gene allele or genomic fragment of interest. Two particular alleles at different polymorphic sites are said to be in LD if the presence of the allele at one of the sites tends to predict the presence of the allele at the other site on the same chromosome (Stevens, Mol. Diag. 4:309-17 (1999)). The marker loci can be located within 5 cM, 2 cM, or 1 cM (on a single meiosis based genetic map) of the disease resistance trait QTL.
The skilled artisan would understand that allelic frequency (and hence, haplotype frequency) can differ from one germplasm pool to another. Germplasm pools vary due to maturity differences, heterotic groupings, geographical distribution, etc. As a result, SNPs and other polymorphisms may not be informative in some germplasm pools.
While the invention has been particularly shown and described with reference to a preferred embodiment and various alternate embodiments, it will be understood by persons skilled in the relevant art that various changes in form and details can be made therein without departing from the spirit and scope of the invention. For instance, while the particular examples below may illustrate the methods and embodiments described herein using a specific plant, the principles in these examples may be applied to any plant. Therefore, it will be appreciated that the scope of this invention is encompassed by the embodiments of the inventions recited herein and in the specification rather than the specific examples that are exemplified below. All cited patents and publications referred to in this application are herein incorporated by reference in their entirety, for all purposes, to the same extent as if each were individually and specifically incorporated by reference.
The following are examples of specific aspects of the invention. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the invention in any way.
Example 1: Screening for Disease Resistance. Corteva Agriscience conducted a large, nearly decade-long research program to identify new, major genetic sources of disease resistance in Brassica. This effort included large-scale genetic screens of Brassica napus (winter oilseed rape and canola), Brassica napus vegetable form (rutabaga) and Brassica rapa (Chinese cabbage and stubble turnip) species which share common genomes. Extensive inter-specific pre-breeding was carried out to introgress resistance gene sources, eliminate linkage drag, characterize their efficacy in different genetic backgrounds, and locate their genomic positions by linkage mapping. One product of this effort was the identification of the genomic hot spots and proprietary markers for clubroot resistance disclosed in the following Examples.
Example 2: Clubroot resistance locus CrB8. Major clubroot resistance locus CrB8 was identified and its genetic position was located to the interval flanked by and including 12.94 cM and 16.44 cM on chromosome N8. One source of this resistance locus has been identified in SW Rebus spring turnip rape from Sweden (see e.g., Tanhuanpaa et al., 2016, Genome 59(1): 11-21). The physical position of CrB8 was mapped using proprietary genomic maps to the locus corresponding to nucleotide position 10,656,081 to position 13,303,318 of chromosome N8 of a non-proprietary Brassica napus reference genome. Gene markers were identified within the chromosomal interval and then converted to TaqMan™ (Thermo Fisher, Waltham, MA) assays. CrB8 marker name (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 1. These assays were tested on a canola diversity panel comprised of approximately 350 elite lines and hybrids representing the genetic diversity of the proprietary germplasm. Assays were also tested on a canola donor panel comprised of clubroot resistant donor lines. The purpose of both canola panel screenings was to confirm donor specificity of the markers. TaqMan™ markers were also tested on two proprietary DH mapping population to confirm marker-trait association. Finally, the TaqMan™ markers were tested on two F2 mapping populations to validate the markers' technical performance. In Table 1 and in Tables 2-8 herein, the single nucleotide polymorphism SNP for each resistance and susceptibility allele sequence is indicated by bold and underlined text.
Each TaqMan™ assay for this Example (as well as the remaining Examples 3-8 herein) was performed using 13.6 μl of a primer probe mixture (18 μM of each probe, 4 μM of each primer) and 1000 μl of master mix from ToughMix™ kit (Quanta Beverly, Mass.). A liquid handler dispensed 1.3 μl of the mix onto a 1536 well plate containing ˜6 ng of dried DNA. The plate was sealed with a laser sealer and thermocycled in a Hydrocycler device (LGC Genomic Limited, Middlesex, United Kingdom) under the following conditions: 94° C. for 15 min, 40 cycles of 94° C. for 30 secs, 60° C. for 1 min. PCR products are measured using at wavelengths 485 (FAM) and 520 (VIC) by a Pherastar™ plate reader (BMG Labtech, Offenburg, Germany). The values are normalized against ROX and plotted and scored on scatterplots utilizing the Kraken™ software.
Marker N101BW0-001-Q001 was found to be particularly tightly linked to resistance locus CrB8 and was uniquely specific to resistant donor lines.
Additional TaqMan™ markers were designed based on whole genome sequencing (WGS) data (Table 2). All markers were located within a 300 kb segment that does not include any of the markers identified in Table 1. Allele specificity was assessed using in silico WGS reads of the clubroot resistant donor and elite inbred susceptible germplasm. The selected markers can be used together as a haplotype. Donor specificity of the markers was determined using in silico WGS read data. Each marker (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 2.
The clubroot resistance markers in Tables 1 and 2 are very tightly linked to the CrB8 locus; each has an LOD score of 30 or greater. Furthermore, each of the markers was tested in two different mapping populations. Each test populations included at least 180 individuals. In both test populations, each of the marker alleles listed in Tables 1 and 2 demonstrated 100% association with the clubroot resistance and clubroot susceptibility traits.
Example 3: Clubroot resistance locus CrG8. Another major clubroot resistance locus CrG8 was identified and located to chromosomal interval flanked by and including 13.94 cM and 14.07 cM of chromosome N8. One source of this resistance locus has been identified in Gelria R European turnip (see, e.g., Hirai, M., 2006, Breeding Science 54: 223-229). The physical position of CrG8 was mapped using proprietary genomic maps to the locus corresponding to nucleotide position 11,124,294 to position 11,338,475 of chromosome N8 of a non-proprietary Brassica napus reference genome. Genetic markers located within the chromosomal interval were converted to TaqMan™ assays. Each CrG8 marker (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 3. These assays were tested on a canola diversity panel comprised of approximately 350 elite lines and hybrids representing the genetic diversity of the proprietary germplasm and a clubroot donor panel comprised of clubroot resistant donor lines. The purpose of the panel screenings was to confirm donor specificity of the markers. The TaqMan™ markers were also tested on a proprietary DH mapping population to confirm the marker-trait association and four proprietary F2 mapping populations to evaluate the markers' technical performance.
Marker N100C6A-001-Q001 was found to be particularly tightly linked to resistance locus CrG8 and was uniquely specific to resistant donor lines.
The clubroot resistance markers in Table 3 are very tightly linked to the CrG8 locus; each has an LOD score of 30 or greater. Furthermore, each of the markers was tested in two different mapping populations. Each test populations included at least 180 individuals. In both test populations, each of the marker alleles listed in Tables 1 and 2 demonstrated 100% association with the clubroot resistance and clubroot susceptibility traits.
Example 4: Clubroot resistance locus CrE8. An additional major clubroot resistance locus, CrE8 was identified and its genetic position was located to interval flanked by and including 12.87 cM and 13.98 cM on chromosome N8. The physical position of CrE8 was mapped using proprietary maps to the locus corresponding to nucleotide position 10,966,500 to 11,124,403 of chromosome N8 of a non-proprietary Brassica napus reference genome. Genetic markers linked to CrE8 were converted to TaqMan™ assays. Each CrE8 marker (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 4. The assays and tested on a canola diversity panel comprised of approximately 350 elite lines and hybrids representing the genetic diversity of the proprietary germplasm. The purpose of panel screening was to confirm donor specificity of the markers. TaqMan™ markers were also tested on two proprietary DH mapping populations to confirm marker-trait association and an F2 mapping population to validate the markers' technical performance. None of the markers overlap with publicly available 56 k array markers available from Illumina (Madison, Wisc. USA).
Marker N100CJT-001-Q001 was found to be particularly tightly linked to resistance locus CrE8 and was uniquely specific for resistant donors.
Additional TaqMan™ markers were designed based on WGS data (Table 5). All markers are located within a 300 kb segment that does not include any of the markers identified in Table 4. Allele specificity was assessed using in silico WGS reads of the donor and elite inbred germplasm. The selected markers can be used together as a haplotype. Each marker's name (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 5.
The clubroot resistance markers in Tables 4 and 5 are very tightly linked to the CrE8 locus; each has an LOD score of 30 or greater. Furthermore, each of the markers was tested in two different mapping populations. Each test populations included at least 180 individuals. In both test populations, each of the marker alleles listed in Tables 4 and 5 demonstrated 100% association with the clubroot resistance and clubroot susceptibility phenotype.
Example 5: Clubroot resistance locus CrM8. One more major clubroot resistance locus CrM8 was identified and its genetic position located to the interval flanked by and including 13.2 cM and 13.38 cM on chromosome N8. One source of this resistance locus has been identified in the Brassica napus variety Mendel (see e.g., Fredua-Agyeman et al., 2016, Euphytica 211: 201-213). The physical position of CrM8 was mapped using proprietary maps to the locus corresponding to nucleotide position 10,959,267 to position 11,159,261 of chromosome N8 of a non-proprietary Brassica napus reference genome. Genetic markers located within this chromosomal interval were converted to TaqMan™ assays. Each marker name (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 6. The assays were tested on a Brassica napus diversity panel comprised of approximately 350 elite lines and hybrids representing the genetic diversity of the proprietary germplasm and a clubroot donor panel comprising clubroot resistant donor lines. The purpose of the panel screenings was to confirm donor specificity of the markers. TaqMan™ markers were also tested on two proprietary DH mapping population to confirm the marker-trait association and four F2 mapping populations to validate the markers' technical performance. None of the markers overlap with markers on publicly available 56 k array from Illumina.
Marker N100CDD-001-Q001 was found to be particularly tightly linked to resistance locus CrM8 and was uniquely specific for resistant donor lines.
Additional TaqMan™ markers were designed based on WGS data (Table 7). All markers were located within a 300 kb segment that does not include any of the markers identified in Table 6. Allele specificity was assessed using in silico WGS reads of the donor and elite inbred germplasm. The selected markers can be used together as a haplotype. Each additional CrM8 marker's name (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 7.
The clubroot resistance markers in Tables 6 and 7 are very tightly linked to the CrM8 locus; each has an LOD score of 30 or greater. Furthermore, each of the markers was tested in two different mapping populations. Each test populations included at least 180 individuals. In both test populations, each of the marker alleles listed in Tables 6 and 7 demonstrated 100% association with the clubroot resistance and clubroot susceptibility phenotype.
Example 6: Clubroot resistance locus CrI8. Yet another major clubroot resistance locus, CrI8, was identified and its genetic position was located to the interval flanked by and including 13.2 cM and 13.7 cM on chromosome N8. CrI8 was mapped using proprietary maps to the locus corresponding to nucleotide position 10,986,309 to position 11,500,321 of chromosome N8 of a non-proprietary Brassica napus reference genome. Genetic markers located within the chromosomal interval were converted to TaqMan™ assays. Each marker's name (NAME), physical position (POS), its resistance allele (RES) and susceptible allele (SUS), SEQ ID NO, and corresponding sequences for assay primers and probes are described in Table 8. The assays were tested on a Brassica napus (canola/oilseed) diversity panel comprised of approximately 350 elite lines and hybrids representing the genetic diversity of the proprietary germplasm and a clubroot donor panel comprised of clubroot resistant donor lines. The purpose of the panel screenings was to confirm donor specificity of the markers. TaqMan™ markers were also tested on a proprietary DH mapping population to confirm the marker-trait association and a proprietary F2 mapping population to validate the technical performance of the TaqMan™ assays.
Marker N101T0T-001-Q003 was found to be particularly tightly linked to resistance locus CrI8 and was uniquely specific for resistant donor lines.
The clubroot resistance markers in Table 8 are very tightly linked to the CrI8 locus; each has an LOD score of 30 or greater. Furthermore, each of the markers was tested in two different mapping populations. Each test populations included at least 180 individuals. In both test populations, each of the marker alleles listed in Table 8 demonstrated 100% association with the clubroot resistance and clubroot susceptibility phenotype.
This Application claims the benefit of U.S. Provisional Application 63/142,717, filed on Jan. 28, 2021, which is incorporated by reference herein in its entirety. The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII formatted sequence listing with a file named 8541-WO-PCT_ST25, created on Jan. 19, 2022 and having a size of 65 kilobytes, which is filed concurrently with the specification. The sequence listing comprised in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/013522 | 1/24/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63142717 | Jan 2021 | US |