The instant application contains a sequence listing, which has been submitted in XML file format by electronic submission and is hereby incorporated by reference in its entirety. The XML file, created on Mar. 14, 2024, is named P14293US01.xml and is 119,772 bytes in size.
Disclosed herein are novel plants, plant parts, and nucleotide sequences in soybean varieties comprising a mutated FT1a, JAG1, BS1, BS2, and/or TFL1b gene, along with methods of making the same by growing a soybean plant or lot, and methods of using the same.
Agriculture is an essential industry for the global economy and the United States in particular. Soybean (Glycine max) is an important legume crop worldwide due to its ability to fix atmospheric nitrogen. Soybeans serve as a major source of animal feed protein and soybean oil has uses in a wide variety of industries, including the food and beverage, biodiesel, and other industries.
Soybean sustainability is a priority for farmers worldwide. Farming practices such as water and nutrient management help farmers improve efficiencies, boost crop productivity, conserve water, enrich soil quality, improve nutrient efficiencies of the soil, and produce sustainable soybean crops. The benefits of bioengineering for soybean farmers include increased yields and extreme weather hardiness.
Disclosed herein are soybean plant cells comprising loss-of-function allele(s) of the endogenous soybean FT1a gene of SEQ ID NO: 3, JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1b gene of SEQ ID NO: 22, and/or an allelic variant thereof. Also provided are soybean plant parts comprising the aforementioned soybean plant cell, including stems, roots, leaves, flowers, pods, and seeds. Also provided are soybean seed lots comprising the seed. Also provided are soybean plants comprising the aforementioned soybean plant cells.
Also provided are biological samples comprising a nucleic acid containing loss-of-function allele(s) of the soybean FT1a gene of SEQ ID NO: 3, JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1b gene of SEQ ID NO: 22, and/or an allelic variant thereof. Polynucleotides comprising the loss-of-function allele of the soybean FT1a gene set forth in SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID NO: 8 are provided. Polynucleotides encoding the polypeptides of SEQ ID NO: 6, SEQ ID NO: 9, and SEQ ID NO: 10 are also provided. In some embodiments, the aforementioned polynucleotides are isolated.
Also disclosed are methods of producing a soybean seed lot comprising: (i) growing a population of soybean plants comprising the aforementioned soybean plant; and (ii) harvesting seed from the population of soybean plants of step (i) at maturity. Methods of producing a soybean crop comprising planting the aforementioned seed lot are provided.
Guide RNA molecules comprising a spacer RNA molecule which target exon 1 of the FT1a gene of SEQ ID NO: 3 or an allelic variant thereof are provided. Guide RNA molecules comprising a spacer RNA encoded by SEQ ID NO: 11 are also provided.
Also disclosed are methods for generating the aforementioned soybean plant cells, soybean plant parts, and soybean plants are provided. In some embodiments, the methods comprise introducing loss-of-function allele(s) in the endogenous soybean FT1a gene of SEQ ID NO: 3, JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1b gene of SEQ ID NO: 22, and/or an allelic variant thereof. In some embodiments, the methods comprise (i) screening a population of soybean plant cells, parts, or plants for the presence of a loss-of-function allele in the endogenous soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof; and (ii) isolating a soybean plant cell, soybean plant part, or soybean plant comprising the loss-of-function allele of the soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof.
Methods for determining whether a soybean plant cell, plant part, or plant comprises a loss-of-function allele of the endogenous soybean FT1a gene of SEQ ID NO: 3, JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1b gene of SEQ ID NO: 22, and/or an allelic variant thereof are provided. In certain embodiments, the methods comprise analyzing a polynucleotide comprising a portion of SEQ ID NO: 3 or an allelic variant thereof or analyzing an RNA encoded by a portion of SEQ ID NO: 3 or an allelic variant thereof from the plant cell, plant part, or plant, wherein an insertion, deletion, and/or substitution of one or more nucleotides in said polynucleotide or RNA is indicative of the presence of the loss-of-function allele. In certain embodiments, the methods comprise analyzing a polypeptide encoded by SEQ ID NO: 3, a portion thereof, or an allelic variant thereof from the soybean plant cell, plant part, or plant, wherein an insertion, deletion, and/or substitution of one or more amino acid residues of the polypeptide or a change in the biologic or biochemical activity of the polypeptide is indicative of the presence of the loss-of-function allele.
The phrase “allelic variant” as used herein refers to a polynucleotide or polypeptide sequence variant that occurs in a particular gene at particular locus in a different strain, variety, or isolate of a given organism.
As used herein, the phrase “amorphic allele” refers to an allele of a gene having no gene activity in comparison to the wild-type allele of the gene. Amorphic alleles are also known as null alleles.
The term “and/or” where used herein is to be taken as specific disclosure of each of the two specified features or components with or without the other. Thus, the term “and/or” as used in a phrase such as “A and/or B” herein is intended to include “A and B,” “A or B,” “A” (alone), and “B” (alone). Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
As used herein, the phrase “biological sample” refers to either intact or non-intact (e.g., milled soybean seed or soybean plant tissue, chopped soybean plant tissue, lyophilized tissue) soybean plant tissue. It may also be an extract comprising intact or non-intact seed or soybean plant tissue. The biological sample can comprise flour, meal, syrup, oil, starch, and cereals manufactured in whole or in part to contain soybean plant by-products. In certain embodiments, the biological sample is “non-regenerable” (i.e., incapable of being regenerated into a soybean plant or soybean plant part).
As used herein, the terms “correspond,” “corresponding,” and the like, when used in the context of an nucleotide position, mutation, and/or substitution in any given polynucleotide (e.g., an allelic variant of SEQ ID NO: 3) with respect to the reference polynucleotide sequence (e.g., SEQ ID NO: 3) all refer to the position of the nucleotide in the given sequence that has identity to the nucleotide in the reference nucleotide sequence when the given polynucleotide is aligned to the reference polynucleotide sequence using a pairwise alignment algorithm (e.g., CLUSTAL O 1.2.4 with default parameters).
As used herein, the terms “Cpf1” and “Cas12a” are used interchangeably to refer to the same RNA dependent DNA endonuclease (RdDe).
As used herein, the phrase “elite soybean” refers to soybean plant or part thereof (e.g., seed) which has undergone breeding to provide one or more trait improvements (e.g., desirable agronomic performance (typically commercial production) or superior grain quality). In some cases, an elite line can be an agronomically or otherwise superior line or variety that has resulted from several or many cycles of breeding and selection for one or more trait improvements (e.g., superior agronomic performance or superior grain quality). Similarly, “elite soybean germplasm” is a germplasm resulting from breeding and selection for desirable agronomic performance (typically commercial production). Such germplasm may be agronomically superior germplasm, derived from and/or capable of giving rise to a plant with superior agronomic performance, such as an existing or newly developed elite line of soybean. Elite crop plant lines include plants which are an essentially homozygous, e.g., inbred or doubled haploid. Elite crop plants can include inbred lines used as is or used as pollen donors or pollen recipients in breeding (e.g., used to produce F1 plants). Elite crop plants can include inbred lines which are selfed to produce non-hybrid cultivars or varieties. Elite crop plants can include hybrid F1 progeny of a cross between two distinct elite inbred or doubled haploid plant lines.
As used herein, the phrase “endogenous gene” refers to the native form of a gene unit in its natural location in the genome of an organism.
As used herein, the term “expression” refers to the production of a functional end-product (e.g., an mRNA, guide RNA, or a protein) in either precursor or mature form.
As used herein, the phrase “hypomorphic allele” refers to an allele of a gene with less gene activity than a wild-type allele but more gene activity than an amorphic allele.
As used herein, the terms “include,” “includes,” and “including” are to be construed as at least having the features to which they refer while not excluding any additional unspecified features.
As used herein, the term “isomorphic allele” refers to an allele of a gene having wild-type gene activity.
The term “isolated” as used herein means having been removed from its natural environment.
As used herein, the term “introduced” means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods. Thus, “introduced” in the context of inserting a nucleic acid fragment (e.g., a recombinant DNA construct/expression construct) into a cell, means “transfection” or “transformation” or “transduction” and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., nuclear chromosome, plasmid, plastid, chloroplast, or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
As used herein, a “loss-of-function allele” can include an amorphic allele or a hypomorphic allele of a gene.
As used herein, the phrase “mutated BS1 gene” or “mBS1 gene” refers to an endogenous soybean BS1 gene or allelic variant thereof comprising a loss-of-function allele (e.g., an amorphic allele) of the soybean BS1 gene or an allelic variant thereof. The phrase “mutated BS2 gene” or “mBS2 gene” refers to an endogenous soybean BS2 gene or allelic variant thereof comprising a loss-of-function (e.g., amorphic) allele of the soybean BS2 gene or allelic variant thereof.
As used herein, a “non-natural” or “non-naturally occurring” mutation or allele refers to a mutation or allele in a gene which is generated via human intervention or descended from the mutation generated via human intervention. Non-limiting examples of human intervention which can be used to generate a non-naturally occurring mutation or allele include mutagenesis (e.g., chemical mutagenesis, ionizing radiation mutagenesis), mutagenesis followed by DNA sequence-based screening and selection (TILLING), and targeted genetic modifications (e.g., CRISPR-based methods, TALEN-based methods, zinc finger-based methods).
As used herein, the term “plant” includes reference to an immature or mature whole soybean plant, including a plant from which seed or grain or anthers have been removed. Any seed or embryo that will produce the plant is also considered to be the soybean plant.
As used herein, the term “mutated FT1a gene” or “ft1a gene” refer to an endogenous soybean FT1a gene comprising a loss-of-function allele. The term “ft1a protein” refers to a protein encoded by an endogenous soybean FT1a gene comprising a loss-of-function allele.
As used herein, the term “plant” includes a whole soybean plant and any descendant, cell, tissue, part, or parts of the plant. The term “plant” thus includes reference to an immature or mature whole soybean plant, including a plant from which seed or grain or anthers have been removed.
The term “plant part” include any part(s) of a plant, including, for example and without limitation: seed (including mature seed and immature seed); grain; stover; a plant cutting; a plant cell; a plant cell culture; or a plant organ (e.g., pollen, embryos, pods; flowers, fruits, shoots, leaves, roots, stems, and explants). A plant tissue or plant organ may be a seed, protoplast, callus, or any other group of plant cells that is organized into a structural or functional unit. A plant cell or tissue culture may be capable of regenerating a plant having the physiological and morphological characteristics of the plant from which the cell or tissue was obtained, and of regenerating a plant having substantially the same genotype as the plant. Regenerable cells in a plant cell or tissue culture may be embryos, protoplasts, meristematic cells, callus, pollen, leaves, anthers, roots, root tips, flowers, or stalks. In contrast, some plant cells are not capable of being regenerated to produce plants and are referred to herein as “non-regenerable” plant cells.
As used herein, the term “variety” refers a group of similar plants that by one or more structural features, genetic features, and/or performance can be distinguished from other varieties within the same species. In certain embodiments, the term variety refers to the botanical taxonomic designation whereby variety is ranked below species or subspecies, as well as the legal definition whereby the term “variety” refers to a commercial plant that is protected under the terms outlined in the International Convention for the Protection of New Varieties of Plants.
To the extent to which any of the preceding definitions is inconsistent with definitions provided in any patent or non-patent reference incorporated herein by reference, any patent or non-patent reference cited herein, or in any patent or non-patent reference found elsewhere, it is understood that the preceding definition will be used herein.
The present disclosure provides for soybean plant cells, plant parts including seed, plants, seed lots, and biological samples comprising a mutated FT1a gene (i.e., comprising a loss-of-function allele of the endogenous FT1a gene), mutated JAG1 gene, mutated BS1 gene, mutated BS2 gene, and/or mutated TFL1b gene. These soybean plants and parts can be utilized for human food, livestock feed, as a raw material in industry, or as breeding material for development of other soybean varieties.
The target endogenous FT1a gene comprises the genomic DNA of SEQ ID NO: 3 and allelic variants thereof located on soybean chromosome 18. The endogenous soybean FT1a gene is located at nucleotides 57,922,912 to 57,928,648 of chromosome 18 of the Glycine max Williams 82genome assembly version 4 (Wm82.a4.v1; Glyma.18G298900 on the world wide web internet site “soybase.org”; Grant et al. Nucl. Acids Res. (2010) 38 (suppl 1): D843-D846. doi: 10.1093/nar/gkp798). Alternative splicing of FT1a gene transcripts result in the mRNA splice variant 1 (Glyma.18G298900.1) as shown in
Soybean plant cells, plant parts, and plants comprising a loss-of-function allele of the soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof are provided. In certain embodiments, the soybean plant cells, plant parts, and plants comprising a loss-of-function allele of the soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof can further comprise a loss-of-function allele in any one or more of a JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1b gene of SEQ ID NO: 22, and/or an allelic variant thereof. In certain embodiments, one or more of the loss-of-function alleles is a non-natural allele. Examples of loss-of-function alleles can include a deletion, an insertion, and/or a substitution of one or more nucleotides of the endogenous FT1a gene. The insertion, deletion, and/or substitution can be made anywhere in the FT1a gene including, for example, in the promoter region, an exon, an intron, and/or the untranslated regions (5′ UTR or 3′ UTR). In certain embodiments, the loss-of-function allele comprises a deletion, insertion, and/or substitution in the coding region of the FT1a gene. In certain embodiments, a loss-of-function allele of the FT1a gene can comprise a deletion of the entire coding region or any portion of the coding region required for biological activity. In certain embodiments, the loss-of-function allele comprises a deletion, insertion, and/or substitution of one or more nucleotides of exon 1 of mRNA splice variant 1 and 2 (i.e., nucleotides 229 to 429 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), exon 2 of mRNA splice variant 1 and 2 (i.e., nucleotides 596 to 657 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), exon 3 of mRNA splice variant 1 and 2 (i.e., nucleotides 3643 to 3683 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), or in nucleotides corresponding to those of exon 4 of mRNA splice variant 1 (i.e., nucleotides 4999 to 5225 or nucleotides 5104 to 5225 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3) of the FT1a gene of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, the loss-of-function allele comprises a deletion or substitution of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nucleotides of the endogenous soybean FT1a gene of SEQ ID NO: 3 located at nucleotide 331 to 356 of SEQ ID NO: 3 or an allelic variant thereof.
In certain embodiments, the loss-of-function allele comprises a deletion, an insertion, and/or substitution that results in a frameshift mutation and/or a nonsense mutation in the coding region of the FT1a gene. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion of any number of nucleotides that are not divisible by 3 in an exon of the FT1a gene. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion of 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 97, 98, 100, 101, 103, 104, 106, 107, 109, 110, 112, 113, 115, 116, 118, 119, 121, 122, 124, 125, 127, 128, 130, 131, 133, 134, 136, 137, 139, 140, 142, 143, 145, 146, 148, 149, 151, 152, 154, 155, 157, 158, 160, 161, 163, 164, 166, 167, 169, 170, 172, 173, 175, 176, 178, 179, 181, 182, 184, 185, 187, 188, 190, 191, 193, 194, 196, 197, 199, or 200 nucleotides of the endogenous soybean FT1a gene of SEQ ID NO: 3 located at nucleotide 229 to 429 (i.e., the first exon) of SEQ ID NO: 3 and result in a frameshift mutation.
In certain embodiments, the frameshift mutation occurs at nucleotides corresponding to one or more of nucleotides 229 to 429 of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, the frameshift mutation occurs at nucleotides corresponding to one or more of nucleotides 340 to 343 of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, mutated FT1a genes comprising the loss-of-function allele with a frameshift mutation can comprise the nucleotide sequence of SEQ ID NO: 4 or SEQ ID NO: 5 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 4 or SEQ ID NO: 5 can comprise a nucleotide sequence having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 4 or SEQ ID NO: 5. In certain embodiments, mutated FT1a genes comprising the loss-of-function allele with a frameshift mutation can encode the polypeptide comprising the amino acid sequence of SEQ ID NO: 6 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 6 can comprise an amino acid sequence having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 6. In certain embodiments, the loss-of-function alleles of the FT1a gene can comprise a deletion of an FT1a gene set forth in SEQ ID NO: 13, 21, or an allelic variant thereof comprising the deletion and having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 13 or 21. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion of an FT1a gene set forth in Table 2. In certain embodiments, the deletion in the FT1a gene set forth in Table 2 is a deletion generated by a gRNA comprising a spacer RNA encoded by SEQ ID NO: 11 having the deletion coordinates set forth in column 1 of Table 2. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion in an FT1a gene corresponding to deleted nucleotides of SEQ ID NO: 3 set forth in column 3 of Table 2. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion in an allelic variant of SEQ ID NO: 3 having at least 90%, 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 3, wherein the deletion corresponds to a deletion of nucleotides in SEQ ID NO: 3 set forth in Table 2. In certain embodiments, the deletion of nucleotides of SEQ ID NO: 3 set forth in column 3 of Table 2 occurs in genomic DNA of SEQ ID NO: 3 or an allelic variant thereof having at least 90%, 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 3.
In certain embodiments, the loss-of-function allele comprises an internal deletion that preserves the reading frame of the encoded FT1a proteins while removing at least one, two, or three codons, thus resulting mutant ft1a proteins lacking at least one, two, or three amino acid residues. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion of any number of nucleotides that are divisible by 3 in an exon of the FT1a gene. In certain embodiments, loss-of-function alleles of the FT1a gene can comprise a deletion of 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, 93, 96, 99, 102, 105, 108, 111, 114, 117, 120, 123, 126, 129, 132, 135, 138, 141, 144, 147, 150, 153, 156, 159, 162, 165, 168, 171, 174, 177, 180, 183, 186, 189, 192, 195, 198, or 201 nucleotides of the endogenous soybean FT1a gene of SEQ ID NO: 3 located at nucleotide 229 to 429 (i.e., within the first exon of mRNA splice forms 1 and 2) of SEQ ID NO: 3 and preserve the reading frame. In certain embodiments, the loss-of-function allele comprises an internal deletion comprising at least nucleotides corresponding to at least nucleotides 343 to 351 of SEQ ID NO: 3 or an allelic variant thereof which preserves the reading frame. In certain embodiments, the loss-of-function allele comprises an internal deletion comprising at least nucleotides encoding amino acids corresponding to N39 to C41 of SEQ ID NO: 1 and SEQ ID NO: 2 or an allelic variant thereof which preserves the reading frame. In these embodiments, the loss-of-function allele can comprise an internal deletion of nucleotides encoding amino acids corresponding to N39 to C41 of SEQ ID NO: 1 and 2 and further comprise deletions of nucleotides encoding P2, R3, S4, T5, D6, P7, L8, V9, I10, G11, G12, V13, I14, G15, D16, V17, L18, E19, P20, F21, T22, S23, S24, V25, S26, M27, G28, I29, V30, Y31, N32, N33, C34, P35, Q36, V37, I38, E42, L43, K44, P45, S46, K47, I48, L49, N50, R51, P52, R53, I54, E55, I56, G57, G58, D59, D60, L61, R62, T63, F64, Y65, T66, and/or L67 of SEQ ID NO: 1, SEQ ID NO: 2, or an allelic variant thereof while preserving the reading frame. In certain embodiments, mutated FT1a genes comprising the loss-of-function allele with an internal deletion can comprise the nucleotide sequence of SEQ ID NO: 7 or SEQ ID NO: 8 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 7 or SEQ ID NO: 8 can comprise a nucleotide sequence having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 7 or SEQ ID NO: 8. In certain embodiments, mutated FT1a genes comprising the loss-of-function allele with an internal deletion can encode the polypeptide comprising the amino acid sequence of SEQ ID NO: 9 and/or SEQ ID NO: 10 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 9 and/or SEQ ID NO: 10 can comprise an amino acid sequence having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 9 and/or SEQ ID NO: 10.
The present disclosure provides for the soybean plant cells, plant parts including seed, plants, and biological samples comprising a mutated JAG1 (mJAG1) gene comprising a loss-of-function mutation of that gene and a loss-of-function mutation in any one or more of an FT1a, BS1, BS2, and/or TFL1b gene. In certain embodiments, one or more of the loss-of-function mutation is a non-natural mutation or allele. These loss-of-function mutations in the mJAG1 gene can be amorphic (null) or hypomorphic alleles of the JAG1 gene, can be homozygous, and include JAG1 mutations disclosed in WO2023/183772 and U.S. Provisional Application 63/269,663, which are each incorporated herein by reference in their entireties. The target endogenous JAG1 gene comprises the genomic DNA of SEQ ID NO: 18 and allelic variants thereof located on soybean chromosome 20. The endogenous soybean JAG1 gene is located at nucleotides 35827671 to 35830107 of chromosome 20 of the Glycine max Wm82.a2.v1 set forth in the https internet site “phytozome-next.jgi.doe.gov/report/transcript/Gmax_Wm82_a2_v1/Glyma.20G116200.1.” Allelic variants of an endogenous soybean JAG1 gene include variants which encode JAG1 proteins having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity to the JAG1 protein encoded by SEQ ID NO: 18. Allelic variants of an endogenous soybean JAG1 gene also include variants which comprise genomic DNA having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity to SEQ ID NO: 18. In certain embodiments, mJAG1 genes can encode the polypeptide encoded by the mutated jag1 gene of SEQ ID NO: 19, SEQ ID NO: 20, or an allelic variant thereof. In certain embodiments, allelic variants of the mJAG1 genes of SEQ ID NO: 19 or 20 can comprise a nucleic acid sequence comprising, consisting essentially of, or consisting of the deletions in the JAG1 gene set forth in Table 3, in SEQ ID NO: 19, in SEQ ID NO: 20,
The present disclosure also provides for the soybean plant cells, plant parts including seed, plants, and biological samples comprising an mBS1 or mBS2 gene. These mBS1 or mBS2 plants and parts can be utilized for human food, livestock feed, as a raw material in industry, or as breeding material for development of other soybean varieties. The target endogenous BS1 comprises the genomic DNA of SEQ ID NO: 25 and allelic variants thereof located on soybean chromosome 10. The endogenous soybean BS1 gene is located at nucleotides 46,684,005 to 46,690,867 of chromosome 10 of the Glyma.Wm82.a1 (Gmax1.01) physical map set forth in the https internet site “spybase.org.” The target endogenous soybean BS2 gene comprises the genomic DNA of SEQ ID NO: 26 and allelic variants thereof located on soybean chromosome 20. The endogenous soybean BS2 gene is located at about nucleotides 37,798,675 to 37,804,788 of chromosome 20 of the Glyma.Wm82.a1 (Gmax1.01) physical map set forth in the https internet site “spybase.org.” The orthologous BS1 gene of the legume Medicago truncatula has been identified as group II member of the TIFY family of transcription regulators (Ge et al. Proc Natl Acad Sci USA. 2016, 113(44):12414-12419, doi: 10.1073/pnas.1611763113; Bai et al. Genomics 2011, 98(2):128-136; Vanholme et al., Trends Plant Sci. 2007, 12(6):239-244). Allelic variants of an endogenous soybean BS1 or BS2 gene include variants which encode BS1 or BS2 proteins having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity to SEQ ID NO: 27 or 28, respectively. Allelic variants of an endogenous soybean BS1 or BS2 gene also include variants which comprise genomic DNA having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity to SEQ ID NO: 25 or SEQ ID NO: 26, respectively. In certain embodiments, amorphic alleles of the BS1 or BS2 gene can comprise a deletion of the entire BS1 or BS2 protein coding sequence or any portion of the BS1 or BS2 protein coding sequence required for biological activity. In certain embodiments, one or more of the loss-of-function or amorphic alleles are non-natural alleles. In certain embodiments, amorphic alleles of the BS1 gene can comprise a deletion in 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, or 26 nucleotides of the endogenous soybean BS1 gene of SEQ ID NO: 25 located at nucleotide 890 to 916 of SEQ ID NO: 25 and result in a frameshift mutation. In certain embodiments, amorphic alleles of the BS1 gene can comprise a deletion in 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, or 26 nucleotides the endogenous soybean BS2 gene of SEQ ID NO: 26 gene located at nucleotide 1572 to 1598 of SEQ ID NO: 26 and result in a frameshift mutation. In certain embodiments, mBS2 genes comprising the amorphic allele with a frameshift mutation can encode the polypeptide comprising the amino acid sequence of SEQ ID NO: 29 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 29 can comprise an amino acid sequence having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity across the entire length of SEQ ID NO: 29. In certain embodiments, mBS2 genes comprising the amorphic allele with a frameshift mutation can encode the polypeptide comprising the amino acid sequence of SEQ ID NO: 30 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 30 can comprise an amino acid sequence having at least 95%, 96%, or 98% sequence identity across the entire length of SEQ ID NO: 30.
In certain embodiments, soybean plants homozygous for an mBS1 or mBS2 gene can yield seed lots wherein the number of seeds per kilogram of seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene is decreased in comparison to the number of seeds per kilogram harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 and a BS2 gene). In certain embodiments, the number of seeds per kilogram of seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene is decreased by up to about 5%, 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 22%, 24%, 26%, 28%, or 30% in comparison to the number of seeds per kilogram harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 and BS2 gene). In certain embodiments, the number of seeds per kilogram of seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene is decreased by about 5% to any one of about 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 22%, 24%, 26%, 28%, or 30% in comparison to the number of seeds per kilogram harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 and BS2 gene). In certain embodiments, the average weight per 1000 seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene is increased by up to about 2%, 4%, 6%, 8%, 9%, 10%, 12% 15%, 18%, 20%, 22%, 24%, 26%, 28%, or 30% in comparison to the weight per 1000 seeds harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 and BS2 gene). In certain embodiments, the weight per 1000 seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene is increased by 5% to any of 6%, 8%, 9%, 10%, 12% 15%, 18%, 20%, 22%, 24%, 26%, 28%, or 30% in comparison to the weight per 1000 seeds harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 or BS2 gene). In certain embodiments, the aforementioned mBS1 gene is an amorphic allele of the BS1 gene comprising a deletion in 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, or 26 base pairs in the endogenous soybean BS1 gene of SEQ ID NO: 25 located at nucleotide 890 to 916 of SEQ ID NO: 25 or in the corresponding nucleotides of an allelic variant thereof and result in a frameshift mutation. In certain embodiments, the mBS1 gene comprises a deletion set forth in SEQ ID NO: 32 or the corresponding deletion in an allelic variant of SEQ ID NO: 32. In certain embodiments, the aforementioned mBS2 gene is an amorphic allele of the BS1 gene comprising a deletion in 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, or 26 base pairs in the endogenous soybean BS2 gene of SEQ ID NO: 26 gene located at nucleotide 1572 to 1598 of SEQ ID NO: 26 or in the corresponding nucleotides of an allelic variant thereof and result in a frameshift mutation. In certain embodiments, the mBS2 gene comprises a deletion set forth in SEQ ID NO: 31, 33, 34, or 35 or the corresponding deletion in an allelic variant of SEQ ID NO: 32: 31, 33, 34, or 35. In certain embodiments of any of the aforementioned seed lots, the mBS2 gene encodes the polypeptide comprising the amino acid sequence of SEQ ID NO: 29 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 29 can comprise an amino acid sequence having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity across the entire length of SEQ ID NO: 29. In certain embodiments of any of the aforementioned seed lots, the mBS2 gene encodes the polypeptide comprising the amino acid sequence of SEQ ID NO: 30 or an allelic variant thereof. In certain embodiments, such allelic variants of SEQ ID NO: 30 can comprise an amino acid sequence having at least 95%, 96%, 98%, 99%, or 99.5% sequence identity across the entire length of SEQ ID NO: 30. In certain embodiments, soybean plants homozygous for an mBS1 gene or an mBS2 gene provided herein can exhibit seed yields which are equivalent to or essentially the same as seed yields of corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 and BS2 gene). In certain embodiments, the mBS1 or mBS2 gene is generated with a gRNA comprising a spacer region encoded by SEQ ID NO: 36. In certain embodiments, the content of one or more amino acids is increased in the seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene in comparison to the content of those amino acids in seeds harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 or BS2 gene). In certain embodiments, the content of one or more amino acids selected from the group consisting of aspartate, threonine, serine, glutamate, proline, glycine, alanine, valine, isoleucine, leucine, phenylalanine, histidine, lysine, and arginine is increased in the seeds harvested from the soybean plant homozygous for the mBS1 or mBS2 gene in comparison to the content of those amino acids in seeds harvested from corresponding control soybean plant lacking the mBS1 or mBS2 gene (e.g., a wild-type soybean plant homozygous for a wild-type BS1 or BS2 gene).
In certain embodiments, soybean plants comprising a loss-of-function allele in one or more of an endogenous soybean FT1a, JAG1, BS1, and/or BS2 gene can further comprise a loss-of-function (e.g., amorphic) allele in the soybean TFL1b gene which is an ortholog of the Arabidopsis Terminal Flower 1 gene. The soybean TFL1b ortholog is found in the https internet database “soybase.org” as Glyma.19G194300 and also known as the soybean stem growth habit gene Dtl (Liu et al. Plant Physiol. 2010 May; 153(1):198-210. doi: 10.1104/pp. 109.150607. Epub 2010 Mar. 10. PMID: 20219831; PMCID: PMC2862436). The TFL1b wild-type genomic DNA is provided as SEQ ID NO: 22, the TFL1b wild-type coding sequence is provided as SEQ ID NO: 23, and the TFL1b protein is provided as SEQ ID NO: 24. Loss-of-function mutations in the TFL1b gene can be obtained by gene editing techniques (e.g., by use of CRISPR/Cas9, CRISPR/Cas12, TALEN, or aZFN-mediated site-specific mutagenesis of the TFL1b gene). In certain embodiments, the loss-of-function mutations in the TFL1b gene are non-natural. In certain embodiments, the hypomorphic mutations in the TFL1b gene include those disclosed in US Patent Application publication US20230235350A1, which is incorporated herein by reference in its entirety.
In certain embodiments, the yield of the soybean plant comprising a loss-of-function allele of the endogenous soybean FT1a, JAG1, BS1, BS2, and/or TFL1b gene is increased in comparison to the yield of a wild-type control soybean plant lacking the loss-of-function allele. Increased yield of the soybean plant can be measured in a number of ways, including pod count per plant, seed count per plant, total harvested seed weight per plant, or total harvested seed weight per unit area (e.g., seed weight per acre or seed weight per hectare). In certain embodiments, the increased yield can result from an improved response to stress, including an abiotic stress (e.g., drought, heat, cold, and/or salt stress).
In certain embodiments, the pod count per soybean plant comprising the loss-of-function allele in the FT1a gene and optionally a loss-of-function mutation in a JAG1, BS1, BS2, and/or TFL1b gene is increased in comparison to the pod count per plant for a wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the pod count per plant is increased by at least about 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% in comparison to the pod count per plant from the corresponding wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the seed count per plant comprising the loss-of-function allele in the FT1a gene is increased in comparison to the seed count per plant for a wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the seed count per plant comprising the loss-of-function allele in the FT1a gene is increased by at least about 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% in comparison to the seed count per plant from the corresponding wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the total harvested seed weight per plant comprising the loss-of-function allele in the FT1a gene is increased in comparison to the total harvested seed weight per plant for a wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the total harvested seed weight per plant is increased by at least about 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% in comparison to the total harvested seed weight per plant from the corresponding wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the total harvested seed weight per unit area for soybean plants comprising the loss-of-function allele in the FT1a gene is increased in comparison to the total harvested seed weight per unit area for a wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the total harvested seed weight per unit area for soybean plants comprising the loss-of-function allele in the FT1a gene is increased by at least about 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12%, 15%, 18%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% in comparison to the total harvested seed weight per unit area from the corresponding wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, the average weight of 1000 seeds obtained from the soybean plant is equivalent to or essentially the same as the average weight of 1000 seeds obtained from a wild-type control soybean plant lacking the loss-of-function allele.
In certain embodiments, the pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area are increased when the soybean plant comprising the loss-of-function allele in the FT1a gene and optionally a loss-of-function mutation in a JAG1, BS1, BS2, and/or TFL1b gene is grown under stress in comparison to pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area for a wild-type control soybean plant lacking the loss-of-function allele grown under stress. Non-limiting examples of stresses include drought, cold, heat, salt, shade, nutrient deficiency, high planting density, and the presence of pests or pathogens. In certain embodiments, the stress comprises an abiotic stress. In certain embodiments, the abiotic stress comprises drought, cold, heat, salt, stress. In these embodiments, the pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area can be increased when the plant comprising the loss-of-function allele in the FT1a gene is grown under drought stress in comparison to pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area for a wild-type control soybean plant lacking the loss-of-function allele grown under drought stress.
In certain embodiments, the pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area are increased in comparison to a control (e.g., check) when the soybean plant comprising the loss-of-function allele in the FT1a gene and optionally a loss-of-function mutation in a JAG1, BS1, BS2, and/or TFL1b gene is grown for a full growing season. In certain embodiments, the seed plant comprising the loss-of-function allele in the FT1a gene are planted at or after (e.g., on or within a week) of the earliest initial planting date provided by the USDA Risk Management Agency for the maturity zone where they are planted. Non-limiting examples of Risk Management Agency (RMA) replant crop insurance dates can range from about April 1 in the southeastern United States to about May 5 in northern sections of the midwestern United States (see the https internet site “soybeanresearchinfo.com/wp-content/uploads/2022/01/2700-003-23_Planting-Date-V1.pdf”). In certain embodiments, at least 50%, 70%, 80%, or 90% of plants in the soybean crop comprising the loss-of-function allele in the FT1a gene have 95% of their pods at full maturity color when harvested. Full maturity color is variety-dependent and can be gray, tan, or brown. In certain embodiments, the soybean crop comprising the loss-of-function allele in the FT1a gene is a full season variety for the soybean maturity group zone where it is grown and wherein the seed are harvested at or after a full growing season for the full season variety. Soybeans comprising the loss-of-function allele in the FT1a gene can fall into any of the 13 maturity group designations ranging from 000, 00, 0, or 1 to X. Maturity groups can also be represented by Arabic numbers and tenths (e.g., “5.8”). Soybean maturity groups 00 to VIII are typically grown in the U.S. (See https internet site “soybeanresearchinfo.com/research-highlight/delineating-optimal-soybean-maturity-groups-across-the-united-states/”).
Soybean seed lots comprising the soybean seeds comprising the loss-of-function allele in the FT1a, JAG1, BS1, BS2, and/or TFL1b gene(s) are provided. In certain embodiments, soybean plants comprising the mutated FT1a gene can yield seed lots wherein the average weight of 1000 seeds in the seed lot is equivalent to or essentially the same as the average weight of 1000 seeds in a control seed lot obtained from a wild-type control plant lacking the loss-of-function allele in the FT1a gene (e.g., a wild-type soybean plant homozygous for a wild-type FT1a gene). In certain embodiments, the average number of seeds per kilogram of seeds in the seed lot is equivalent to or essentially the same as the average number of seeds per kilogram of seeds in a control seed lot obtained from a wild-type control soybean plant lacking the loss-of-function allele in the FT1a gene. In certain embodiments, the seed lot is packaged in lots comprising about 50 to 60 pounds (i.e., about 22.7 to 27.2 kilograms) of seeds.
Also provided are polynucleotides comprising any of the aforementioned mutated FT1a, JAG1, BS1, BS2, and/or TFL1b genes or fragments thereof. In certain embodiments, polynucleotides comprising the sequence of SEQ ID NO: 4, SEQ ID NO 5, SEQ ID NO 7, or SEQ ID NO 8 or allelic variants thereof are provided. In certain embodiments, the allelic variants of SEQ ID NO: 4, SEQ ID NO 5, SEQ ID NO 7, or SEQ ID NO 8 will comprise sequences having at least 95%, 96, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 4, SEQ ID NO 5, SEQ ID NO 7, or SEQ ID NO 8 with the proviso that the sequences are not identical to across their entire length to SEQ ID NO: 3. In certain embodiments, the polynucleotides encode a polypeptide comprising the amino acid sequence of SEQ ID NO: 6, SEQ ID NO 9, or SEQ ID NO 10 or an allelic variant thereof. In certain embodiments, the encoded allelic variant will comprise a polypeptide having at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, or 99.9% sequence identity across the entire length of SEQ ID NO: 6, SEQ ID NO 9, or SEQ ID NO 10 with the proviso that the sequences are not identical to SEQ ID NO: 1 or SEQ ID NO 2. In certain embodiments, the polynucleotide is an isolated polynucleotide.
Biological samples and soybean by-products comprising any of the aforementioned or otherwise disclosed polynucleotides including those set forth in the sequence listing, Table 2, and/or Table 3 are also provided. In certain embodiments, the by-products are processed products are made from the soybean plants of the disclosure or their seeds, including: (a) soybean seed meal (defatted or non-defatted); (b) extracted soybean proteins, oils, sugars, syrups, and starches; (c) soy fermentation products; (d) soybean based animal feed or human food products (e.g., feed and food comprising soybean seed meal (defatted or non-defatted) and other ingredients (e.g., other cereal grains, other seed meal, other protein meal, other oil, other starch, other sugar, a binder, a preservative, a humectant, a vitamin, and/or mineral); (e) a pharmaceutical; (f) raw or processed biomass (e.g., cellulosic and/or lignocellulosic material; silage); and (g) various industrial products.
Methods of using the soybean plants, seeds, and seed lots of the disclosure to produce soybean by-products are also provided. Such methods will typically include at least one processing step of cleaning, cracking, flaking, crushing, macerating, pressing, extracting, expelling, and/or extruding the seed.
This disclosure is also directed to methods for producing a soybean plant having a loss-of-function allele of the endogenous soybean FT1a, JAG1, BS1, BS2, and/or TFL1b gene by crossing a first parent soybean plant with a second parent soybean plant wherein the first or second parent soybean plant comprises the loss-of-function allele. Further, both the first and second parent soybean plants can comprise the loss-of-function allele. Any such methods using a soybean plant comprising the loss-of-function allele are part of this disclosure: selfing, backcrosses, hybrid production, crosses to populations, and the like. All plants produced using a soybean plant comprising the loss-of-function allele as a parent are within the scope of this disclosure, including plants derived from a soybean plant having the loss-of-function allele. Also provided are the F1 progeny soybean plants produced from the crossing of a soybean plant comprising the loss-of-function allele with any other soybean plant, F1 seed, and various parts of the F1 soybean plant. The following describes breeding methods that can be used with soybean plants of the disclosure in the development of further soybean plants. One such embodiment is a method for developing a progeny soybean plant in a soybean plant breeding program comprising: obtaining the soybean plant, or its parts, comprising a loss-of-function allele of the endogenous soybean FT1a gene and utilizing said plant or plant parts as a source of breeding material; and selecting a progeny plant having the loss-of-function allele. Breeding steps that can be used in the soybean plant breeding program include pedigree breeding, backcrossing, mutation breeding, and recurrent selection. In conjunction with these steps, techniques such as restriction fragment polymorphism enhanced selection, genetic marker enhanced selection (for example SNP or SSR markers), and the making of double haploids can be utilized.
Field crops are bred through techniques that take advantage of the plant's method of pollination. A soybean plant of the disclosure can be self-pollinated, sib-pollinated, or cross pollinated to create a pedigree soybean plant. A plant is self-pollinated if pollen from one flower is transferred to the same or another flower of the same plant. A plant is sib-pollinated when individuals within the same family or variety are used for pollination. A plant is cross-pollinated if the pollen comes from a flower on a different plant from a different family or variety. The terms “cross-pollination” and “out-cross” as used herein do not include self-pollination or sib-pollination. Soybean plants (Glycine max) are recognized to be naturally self-pollinated plants which, while capable of undergoing cross-pollination, rarely do so in nature. Insects are reported by some researchers to carry pollen from one soybean plant to another and it generally is estimated that less than one percent of soybean seed formed in an open planting can be traced to cross-pollination, i.e., less than one percent of soybean seed formed in an open planting is capable of producing F1 hybrid soybean plants.
Any other suitable breeding, selection, or growing methods may be used. Choice of the particular breeding or selection method will vary depending on environmental factors, population size, and the like.
In certain embodiments, soybean plant cells, plant parts (e.g., seeds), and plants comprising at least one mutation in the endogenous soybean FT1a, JAG1, BS1, BS2, and/or TFL1b gene and a transgenic locus are provided. In certain embodiment, the at least one mutation in the endogenous soybean FT1a, JAG1, BS1, BS2, and/or TFL1b is combined with one or more soybean GM events providing tolerance to any one or a combination of glyphosate-based, glufosinate-based, HPPD inhibitor-based, sulfonylurea- or imidazolinone-based, AHAS- or ALS-inhibiting and/or auxin-type (e.g., dicamba, 2,4-D) herbicides and/or an insect resistance trait. GM events that can be combined with the mutations disclosed herein include Event EE-GM3 (aka FG-072, MST-FGØ72-3, described in WO2011063411, USDA-APHIS Petition 09-328-01p), Event SYHTOH2 (aka 0H2, SYN-ØØØH2-5, described in WO2012/082548 and 12-215-01p), Event DAS-68416-4 (aka Enlist Soybean, described in WO2011/066384 and WO2011/066360, USDA-APHIS Petition 09-349-Olp), Event DAS-44406-6 (aka Enlist E3, DAS-444Ø6-6, described in WO2012/075426 and USDA-APHIS 11-234-01p), Event MON87708 (dicamba-tolerant event of Roundup Ready 2 Xtend Soybeans, described in WO2011/034704 and USDA-APHIS Petition 10-188-01p, MON-87708-9), Event MON89788 (aka Genuity Roundup Ready 2 Yield, described in WO2006/130436 and USDA-APHIS Petition 06-178-01p), Event 40-3-2 (aka Roundup Ready, GTS 40-3-2, MON-Ø4Ø32-6, described in USDA-APHIS Petition 93-258-01), Event A2704-12 (aka LL27, ACS-GMØØ5-3, described in WO2006108674 and USDA-APHIS Petition 96-068-01p), Event 127 (aka BPS-CV127-9, described in WO2010/080829), Event A5547-127 (aka LL55, ACS-GMØØ6-4, described in WO2006108675 and in USDA-APHIS Petition 96-068-01p), event MON87705 (MON-877Ø5-6, Vistive Gold, published PCT patent application WO2010/037016, USDA-APHIS Petition 09-201-01p), or event DP305423 (aka DP-3Ø5423-1, published PCT patent application WO2008/054747, USDA-APHIS Petition 06-354-01p), or EE-GM5 is combined with a combination of the following events: Event MON98788×MON87708 (aka Roundup Ready 2 Xtend Soybeans, MON-877Ø8-9×MON-89788-1), Event HOS×Event 40-3-2 (aka Plenish High Oleic Soybeansx Roundup Ready Soybeans), Event EE-GM3×EE-GM2 (aka FG-072×LL55, described in WO2011063413), Event MON 87701×MON 89788 (aka Intacta RR2 Pro Soybean, MON-877Ø1-2×MON-89788-1), DAS-81419-2×DAS-44406-6 (aka Conkesta™ Enlist E3™ Soybean, DAS-81419-2×DAS-444Ø6-6), Event DAS-68416-4×Event MON 89788 (aka Enlist™ RoundUp Ready® 2 Soybean, DAS-68416-4×MON-89788-1), Event MON-87769-7×Event MON-89788-1 (aka Omega-3×Genuity Roundup Ready 2 Yield Soybeans), Event MON 87705×Event MON 89788 (aka Vistive Gold, MON-877Ø5-6×MON-89788-1), or Event MON87769×Event MON89788 (aka Omega-3×Genuity Roundup Ready 2 Yield Soybeans, MON-87769-7×MON-89788-1), where all published PCT patent applications or US national stages thereof are incorporated herein by reference in there entireties. Also provided herein are soybean plant cells, plant parts (e.g., seeds), and plants comprising at least one mutation in the endogenous soybean FT1a, JAG1, BS1, BS2, and/or TFL1b and a modification of any of the aforementioned transgenic events or transgenic events. Modifications of the transgenic events include those disclosed in: WO2022/026375, WO2022/026379, WO2022/026390, WO2022/026395, WO2022/026403; US Patent Applic. Pub. Nos. US20220030822 and US20230250441; and U.S. Pat. No. 11,242,534, which are each incorporated herein by reference in their entireties.
Methods of producing a soybean seed lot comprising: (i) growing a population of soybean plants comprising a mutated FT1a, JAG1, BS1, BS2, and/or TFL1b gene to maturity; and (ii) harvesting seed from the population of soybean plants of step (i) at maturity, thereby producing the soybean seed lot, wherein the soybean plants are homozygous for the mutated FT1a gene. In certain embodiments, the seed lot is packaged in lots comprising about 50 to 60 pounds (i.e., about 22.7 to 27.2 kilograms).
Also provided herein are methods of treating the soybean seeds and seed lots of the disclosure and the resultant treated seeds and seed lots. Seeds can be treated with such fertilizers, biological agents, nematicides, insecticides, and fungicides by methods including in-furrow applications or by coating (e.g., with a drum coater, rotary coater, tumbling drum, fluidized bed, and/or spouted bed apparatus). Methods and compositions including various binders, fillers, film coats, and active ingredients such as fertilizers, surfactants, plant growth regulators, crop desiccants, fungicides, bacteriocides, bacteriostats, insecticides, and insect repellants for coating seeds that can be adapted for use with seeds provided herein are disclosed in U.S. Pat. No. 10,745,578, which is incorporated herein by reference in its entirety.
The disclosure also provides a method of making a soybean plant comprising a mutated FT1a, JAG1, BS1, BS2, and/or TFL1b gene. In certain embodiments, the methods can comprise making a deletion, an insertion and/or a substitution which results in a mutated FT1a, JAG1, BS1, BS2, and/or TFL1b gene. Gene editing molecules of use in methods provided herein include molecules capable of introducing a double-strand break (“DSB”) or single-strand break (“SSB”) at a specific site or sequence in a double-stranded DNA, such as in genomic DNA or in a target gene located within the genomic DNA as well as accompanying guide RNA. In certain embodiments, the loss-of-function allele results from introduction of a DSB at a target site in the FT1a gene (e.g., SEQ ID NO: 3 or an allelic variant thereof), JAG1 gene, BS1 gene, BS2 gene, and/or the TFL1b gene to induce non-homologous end joining (NHEJ) at the site of the break followed by recovery of desired loss-of-function alleles. In certain embodiments, the loss-of-function allele results from introduction of a DSB at a target site in the FT1a gene (e.g., SEQ ID NO: 3 or an allelic variant thereof) followed by homology-directed repair (HDR), microhomology-mediated end joining (MMEJ), or NHEJ to introduce a desired donor or other DNA template polynucleotide at the DSB, followed by recovery of the desired loss-of-function allele. Examples of such gene editing molecules include: (a) a nuclease comprising an RNA-guided nuclease, an RNA-guided DNA endonuclease or RNA directed DNA endonuclease (RdDe), a class 1 CRISPR type nuclease system, a type II Cas nuclease, a Cas9, a nCas9 nickase, a type V Cas nuclease, a Cas12a nuclease, a nCas12a nickase, a Cas12d (CasY), a Cas12e (CasX), a Cas12b (C2c1), a Cas12c (C2c3), a Cas12i, a Cas12j, a Cas14, an engineered nuclease, a codon-optimized nuclease, a zinc-finger nuclease (ZFN) or nickase, a transcription activator-like effector nuclease (TAL-effector nuclease or TALEN) or nickase (TALE-nickase), an Argonaute, and a meganuclease or engineered meganuclease; (b) a polynucleotide encoding one or more nucleases capable of effectuating site-specific alteration (including introduction of a DSB or SSB) of a target nucleotide sequence; (c) a guide RNA (gRNA) for use with an RNA-guided nuclease, or a DNA encoding a gRNA for use with an RNA-guided nuclease; (d) optionally donor DNA template polynucleotides suitable for insertion at a break in genomic DNA by homology-directed repair (HDR) or microhomology-mediated end joining (MMEJ); and (e) optionally other DNA templates (e.g., dsDNA, ssDNA, or combinations thereof) suitable for insertion at a break in genomic DNA (e.g., by non-homologous end joining (NHEJ).
In certain embodiments, the mutated FT1a, JAG1. BS1, BS2, and/or TFL1b gene and plant cells, parts including seeds, and plants comprising the mutated FT1a. BS1. BS2, and/or TFL1b gene are generated by CRISPR technology. CRISPR technology for editing the genes of eukaryotes is disclosed in US Patent Application Publications 2016/0138008A1 and US2015/0344912A1, and in U.S. Pat. Nos. 8,697,359, 8,771,945, 8,945,839, 8,999,641, 8,993,233, 8,895,308, 8,865,406, 8,889,418, 8,871,445, 8,889,356, 8,932,814, 8,795,965, and 8,906,616. Cpf1 endonuclease and corresponding guide RNAs and PAM sites are disclosed in US Patent Application Publication 2016/0208243 A1. Plant RNA promoters for expressing CRISPR guide RNA and plant codon-optimized CRISPR Cas9 endonuclease are disclosed in International Patent Application PCT/US2015/018104 (published as WO 2015/131101 and claiming priority to U.S. Provisional Patent Application 61/945,700). Methods of using CRISPR technology for genome editing in plants are disclosed in US Patent Application Publications US 2015/0082478A1 and US 2015/0059010A1 and in International Patent Application PCT/US2015/038767 A1 (published as WO 2016/007347 and claiming priority to U.S. Provisional Patent Application 62/023,246). All of the patent publications referenced in this paragraph are incorporated herein by reference in their entirety. In certain embodiments, an RNA-guided endonuclease that leaves a blunt end following cleavage of the target site is used. Blunt-end cutting RNA-guided endonucleases include Cas9, Cas12c, Cas12i, and Cas 12h (Yan et al., 2019). In certain embodiments, an RNA-guided endonuclease that leaves a staggered single stranded DNA overhanging end following cleavage of the target site following cleavage of the target site is used. Staggered-end cutting RNA-guided endonucleases include Cas12a, Cas12b, and Cas12e.
Guide RNA molecules comprising a spacer RNA molecule which targets the FT1a gene of SEQ ID NO: 3 or an allelic variant thereof are provided. In certain embodiments, the spacer RNA molecule targets a portion of exon 1 (i.e., nucleotides 229 to 429 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), exon 2 (i.e., nucleotides 596 to 657 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), exon 3 (i.e., nucleotides 3643 to 3683 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3), or nucleotides corresponding to exon 4 (i.e., nucleotides 4999 to 5225 or nucleotides 5104 to 5225 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3) of the FT1a gene of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, the spacer RNA molecule comprises the RNA encoded by SEQ ID NO: 11. Guide RNAs comprising a spacer RNA molecule encoded by SEQ ID NO: 11 can be used in conjunction with a Cas12a nuclease to generate mutated FT1a genes which: (i) comprise a deletion in the endogenous FT1a gene of SEQ ID NO: 3 or an allelic variant thereof; (ii) comprise deletions of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nucleotides of the endogenous soybean FT1a gene of SEQ ID NO: 3 located at nucleotide 331 to 356 of SEQ ID NO: 3 or in an equivalent position of an allelic variant of SEQ ID NO: 3; (iii) comprise the sequence of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 8 or allelic variants thereof; or SEQ ID NO: 4 or SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 8; or (iv) encode the polypeptides of SEQ ID NO: 6, SEQ ID NO: 9, or SEQ ID NO: 10 or allelic variants thereof. Guide RNAs which target the JAG1 gene include type V Cas nuclease guide RNAs comprising the spacer sequence encoded by SEQ ID NO: 17 and guide RNAs disclosed in WO2023/183772 and U.S. Provisional Application 63/269,663, which are each incorporated herein by reference in their entireties. Guide RNAs which target the BS1 and/or BS2 gene include V Cas nuclease guide RNAs comprising the spacer sequence encoded by SEQ ID NO: 36 and disclosed in U.S. Patent Application Ser. No. 63/490,375, which is incorporated herein by reference in its entirety. Guide RNAs which target the TFL1b gene include those disclosed in US Patent Application publication US20230235350A1, which is incorporated herein by reference in its entirety.
CRISPR-type genome editing can be adapted for use in the plant cells and methods provided herein in several ways. CRISPR elements, e.g., gene editing molecules comprising CRISPR endonucleases and CRISPR guide RNAs including single guide RNAs or guide RNAs in combination with tracrRNAs or scoutRNA, or polynucleotides encoding the same, are useful in effectuating genome editing without remnants of the CRISPR elements or selective genetic markers occurring in progeny. In certain embodiments, the CRISPR elements are provided directly to the eukaryotic cell (e.g., soybean plant cells), systems, methods, and compositions as isolated molecules, as isolated or semi-purified products of a cell free synthetic process (e.g., in vitro translation), or as isolated or semi-purified products of in a cell-based synthetic process (e.g., such as in a bacterial or other cell lysate). In certain embodiments, soybean plants or soybean plant cells used in the systems, methods, and compositions provided herein can comprise a transgene that expresses a CRISPR endonuclease (e.g., a Cas9, a Cpf1-type or other CRISPR endonuclease). In certain embodiments, one or more CRISPR endonucleases with unique PAM recognition sites can be used. Guide RNAs (sgRNAs or crRNAs and a tracrRNA) to form an RNA-guided endonuclease/guide RNA complex which can specifically bind sequences in the gDNA target site that are adjacent to a protospacer adjacent motif (PAM) sequence. The type of RNA-guided endonuclease typically informs the location of suitable PAM sites and design of crRNAs or sgRNAs. G-rich PAM sites, e.g., 5′-NGG are typically targeted for design of crRNAs or sgRNAs used with Cas9 proteins. Examples of PAM sequences include 5′-NGG (Streptococcus pyogenes), 5′-NNAGAA (Streptococcus thermophilus CRISPR1), 5′-NGGNG (Streptococcus thermophilus CRISPR3), 5′-NNGRRT or 5′-NNGRR (Staphylococcus aureus Cas9, SaCas9), and 5′-NNNGATT (Neisseria meningitidis). T-rich PAM sites (e.g., 5′-TTN or 5′-TTTV, where “V” is A, C, or G) are typically targeted for design of crRNAs or sgRNAs used with Cas12a proteins. In some instances, Cas12a can also recognize a 5′-CTA PAM motif. Other examples of potential Cas12a PAM sequences include TTN, CTN, TCN, CCN, TTTN, TCTN, TTCN, CTTN, ATTN, TCCN, TTGN, GTTN, CCCN, CCTN, TTAN, TCGN, CTCN, ACTN, GCTN, TCAN, GCCN, and CCGN (wherein N is defined as any nucleotide). Cpf1 endonuclease and corresponding guide RNAs and PAM sites are disclosed in US Patent Application Publication 2016/0208243 A1, which is incorporated herein by reference for its disclosure of DNA encoding Cpf1 endonucleases and guide RNAs and PAM sites.
In certain embodiments, the mutated FT1a gene and plant cells, parts including seeds, and plants comprising the mutated FT1a gene are generated by use of zinc finger nucleases or zinc finger nickases. Zinc-finger nucleases are site-specific endonucleases comprising two protein domains: a DNA-binding domain, comprising a plurality of individual zinc finger repeats that each recognize between 9 and 18 base pairs, and a DNA-cleavage domain that comprises a nuclease domain (typically Fok1). The cleavage domain dimerizes in order to cleave DNA; therefore, a pair of ZFNs are required to target non-palindromic target polynucleotides. In certain embodiments, zinc finger nuclease and zinc finger nickase design methods which have been described (Urnov et al. (2010) Nature Rev. Genet., 11:636-646; Mohanta et al. (2017) Genes vol. 8,12: 399; Ramirez et al. Nucleic Acids Res. (2012); 40(12): 5560-5568; Liu et al. (2013) Nature Communications, 4: 2565) can be adapted for use in the methods set forth herein. The zinc finger binding domains of the zinc finger nuclease or nickase provide specificity and can be engineered to specifically recognize any desired target DNA sequence. The zinc finger DNA binding domains are derived from the DNA-binding domain of a large class of eukaryotic transcription factors called zinc finger proteins (ZFPs). The DNA-binding domain of ZFPs typically contains a tandem array of at least three zinc “fingers” each recognizing a specific triplet of DNA. A number of strategies can be used to design the binding specificity of the zinc finger binding domain. One approach, termed “modular assembly”, relies on the functional autonomy of individual zinc fingers with DNA. In this approach, a given sequence is targeted by identifying zinc fingers for each component triplet in the sequence and linking them into a multifinger peptide. Several alternative strategies for designing zinc finger DNA binding domains have also been developed. These methods are designed to accommodate the ability of zinc fingers to contact neighboring fingers as well as nucleotide bases outside their target triplet. Typically, the engineered zinc finger DNA binding domain has a novel binding specificity, compared to a naturally occurring zinc finger protein. Engineering methods include, for example, rational design and various types of selection. Rational design includes, for example, the use of databases of triplet (or quadruplet) nucleotide sequences and individual zinc finger amino acid sequences, in which each triplet or quadruplet nucleotide sequence is associated with one or more amino acid sequences of zinc fingers which bind the particular triplet or quadruplet sequence. See, e.g., U.S. Pat. Nos. 6,453,242 and 6,534,261, both incorporated herein by reference in their entirety. Exemplary selection methods (e.g., phage display and yeast two-hybrid systems) can be adapted for use in the methods described herein. In addition, enhancement of binding specificity for zinc finger binding domains has been described in U.S. Pat. No. 6,794,136, incorporated herein by reference in its entirety. In addition, individual zinc finger domains may be linked together using any suitable linker sequences. Examples of linker sequences are publicly known, e.g., see U.S. Pat. Nos. 6,479,626; 6,903,185; and 7,153,949, incorporated herein by reference in their entirety. The nucleic acid cleavage domain is non-specific and is typically a restriction endonuclease, such as Fok1. This endonuclease must dimerize to cleave DNA. Thus, cleavage by Fok1 as part of a ZFN requires two adjacent and independent binding events, which must occur in both the correct orientation and with appropriate spacing to permit dimer formation. The requirement for two DNA binding events enables more specific targeting of long and potentially unique recognition sites. Fok1 variants with enhanced activities have been described and can be adapted for use in the methods described herein; see, e.g., Guo et al. (2010) J. Mol. Biol., 400:96-107.
In certain embodiments, the mutated FT1a gene and plant cells, parts including seeds, and plants comprising the mutated FT1a gene are generated by use of TAL-effector nucleases or TALENs. Transcription activator like effectors (TALEs) are proteins secreted by certain Xanthomonas species to modulate gene expression in host plants and to facilitate the colonization by and survival of the bacterium. TALEs act as transcription factors and modulate expression of resistance genes in the plants. Recent studies of TALEs have revealed the code linking the repetitive region of TALEs with their target DNA-binding sites. TALEs comprise a highly conserved and repetitive region consisting of tandem repeats of mostly 33 or 34 amino acid segments. The repeat monomers differ from each other mainly at amino acid positions 12 and 13. A strong correlation between unique pairs of amino acids at positions 12 and 13 and the corresponding nucleotide in the TALE-binding site has been found. The simple relationship between amino acid sequence and DNA recognition of the TALE binding domain allows for the design of DNA binding domains of any desired specificity. TALEs can be linked to a non-specific DNA cleavage domain to prepare genome editing proteins, referred to as TAL-effector nucleases or TALENs. As in the case of ZFNs, a restriction endonuclease, such as Fok1, can be conveniently used. Methods for use of TALENs in plants have been described and can be adapted for use in the methods described herein, see Mahfouz et al. (2011) Proc. Natl. Acad. Sci. USA, 108:2623-2628; Mahfouz (2011) GM Crops, 2:99-103; and Mohanta et al. (2017) Genes vol. 8,12: 399). TALE nickases have also been described and can be adapted for use in methods described herein (Wu et al.; Biochem Biophys Res Commun. (2014); 446(1):261-6; Luo et al; Scientific Reports 6, Article number: 20657 (2016)).
Various treatments can be used for delivery of gene editing molecules and/or other molecules to a plant cell. In certain embodiments, one or more treatments is employed to deliver the gene editing or other molecules (e.g., comprising a polynucleotide, polypeptide or combination thereof) into a plant cell, e.g., through barriers such as a cell wall, a plasma membrane, a nuclear envelope, and/or other lipid bilayer. In certain embodiments, a polynucleotide-, polypeptide-, or RNP (ribonucleoprotein)-containing composition comprising the molecules are delivered directly, for example by direct contact of the composition with a plant cell. Aforementioned compositions can be provided in the form of a liquid, a solution, a suspension, an emulsion, a reverse emulsion, a colloid, a dispersion, a gel, liposomes, micelles, an injectable material, an aerosol, a solid, a powder, a particulate, a nanoparticle, or a combination thereof can be applied directly to a plant, plant part, plant cell, or plant explant (e.g., through abrasion or puncture or otherwise disruption of the cell wall or cell membrane, by spraying or dipping or soaking or otherwise directly contacting, by microinjection). For example, a plant cell or plant protoplast is soaked in a liquid genome editing molecule-containing composition. In certain embodiments, the composition is delivered using negative or positive pressure, for example, using vacuum infiltration or application of hydrodynamic or fluid pressure. In certain embodiments, the composition is introduced into a plant cell or plant protoplast, e.g., by microinjection or by disruption or deformation of the cell wall or cell membrane, for example by physical treatments such as by application of negative or positive pressure, shear forces, or treatment with a chemical or physical delivery agent such as surfactants, liposomes, or nanoparticles; see, e.g., delivery of materials to cells employing microfluidic flow through a cell-deforming constriction as described in US Published Patent Application 2014/0287509, incorporated by reference in its entirety herein. Other techniques useful for delivering the composition to a eukaryotic cell, plant cell or plant protoplast include: ultrasound or sonication; vibration, friction, shear stress, vortexing, cavitation; centrifugation or application of mechanical force; mechanical cell wall or cell membrane deformation or breakage; enzymatic cell wall or cell membrane breakage or permeabilization; abrasion or mechanical scarification (e.g., abrasion with carborundum or other particulate abrasive or scarification with a file or sandpaper) or chemical scarification (e.g., treatment with an acid or caustic agent); and electroporation. In certain embodiments, the composition is provided by bacterially mediated (e.g., Agrobacterium sp., Rhizobium sp., Sinorhizobium sp., Mesorhizobium sp., Bradyrhizobium sp., Azotobacter sp., Phyllobacterium sp.) transfection of the plant cell or plant protoplast with a polynucleotide encoding the genome editing molecules (e.g., RNA dependent DNA endonuclease, RNA dependent DNA binding protein, RNA dependent nickase, ABE, or CBE, and/or guide RNA); see, e.g., Broothaerts et al. (2005) Nature, 433:629-633). Any of these techniques or a combination thereof are alternatively employed on a plant explant, plant part or tissue or intact plant (or seed) from which a plant cell is optionally subsequently obtained or isolated; in certain embodiments, the composition is delivered in a separate step after the plant cell has been isolated.
In certain embodiments, the methods for generating the soybean plant cell, soybean plant parts, or soybean plants comprise: (i) screening a population of soybean plant cells, parts, or plants for the presence of a loss-of-function allele in the endogenous soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof; and (ii) isolating a soybean plant cell, soybean plant part, or soybean plant comprising a loss-of-function allele of the soybean FT1a gene of SEQ ID NO: 3 or an allelic variant thereof.
In certain embodiments, the population of soybean plant cells, parts, or plants which are screened for the presence of a loss-of-function allele in the endogenous soybean FT1a gene of SEQ ID NO: 3 are first pre-screened by screening of phenotypic characteristics plants having loss-of-function mutations in an FT1a gene of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, such phenotypic characteristics include increased pod count per plant; seed count per plant; and/or total harvested seed weight per plant in comparison to pod count per plant; seed count per plant; and/or total harvested seed weight per plant for a wild-type control soybean plant lacking the loss-of-function allele. In certain embodiments, such phenotypic characteristics include increased pod count per plant; seed count per plant; and/or total harvested seed weight per plant in comparison to pod count per plant; seed count per plant; and/or total harvested seed weight per plant for a wild-type control soybean plant lacking the loss-of-function allele where the screened and control plants are grown under stress conditions (e.g., abiotic stress including drought, cold, heat, or salt stress). In certain embodiments, plants exhibiting one or more of the aforementioned phenotypic characteristics are then subjected to screening for the presence of a loss-of-function allele in the endogenous soybean FT1a gene of SEQ ID NO: 3 and soybean plants comprising loss-of-function alleles in the endogenous soybean FT1a gene of SEQ ID NO: 3 are identified and/or selected.
In certain embodiments, the population of soybean plant cells, parts, or plants which are screened for the presence of a loss-of-function allele in the endogenous soybean FT1a gene of SEQ ID NO: 3, JAG1 gene of SEQ ID NO: 18, BS1 gene of SEQ ID NO: 25, BS2 gene of SEQ ID NO: 26, TFL1B gene of SEQ ID NO: 22, and/or an allelic variant thereof have been subjected to one or more mutagenesis treatments. Loss-of-function alleles of the endogenous soybean FT1a gene can be generated by mutagenesis methods known in the art, such as chemical mutagenesis or radiation mutagenesis. Suitable chemical mutagens include ethyl methanesulfonate (EMS), sodium azide, methylnitrosourea (MNU), and diepoxybutane (DEB). Suitable radiation includes x-rays, fast neutron radiation, and gamma radiation.
Soybean plant cells, parts, or plants comprising a loss-of-function allele of the endogenous FT1a, JAG1, BS1, BS2, and/or TFL1b gene can be generated using mutagenesis and identified by TILLING (Targeting Induced Local Lesions IN Genomes) or identified using EcoTILLING. TILLING is a general reverse genetics technique that uses mutagenesis methods to create libraries of mutagenized individuals that are later subjected to high throughput screens for the discovery of mutations. In addition to allowing efficient detection of induced mutations, high-throughput TILLING technology is ideal for the detection of natural mutations. EcoTILLING is a method that uses TILLING techniques to look for natural mutations in individuals (Barkley and Wang. Current genomics vol. 9,4 (2008): 212-26. doi:10.2174/138920208784533656). Identified mutations can then be introduced into desirable genetic backgrounds by crossing the mutant with a plant of the desired genetic background and performing a suitable number of backcrosses to cross out the originally undesired parent background. A more detailed description of methods and compositions for TILLING are disclosed in US Patent Application Publication 2004/0053236 A1, which is incorporated herein by reference in its entirety and can be adapted for use in the methods provided herein for identifying soybean plant cells, parts, or plants comprising a loss-of-function allele of the endogenous FT1a gene.
In certain embodiments, the screening comprises analyzing pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area in one or more candidate plants or one or more candidate plant populations. In these embodiments, an increase in pod count per plant, seed count per plant, total harvested seed weight per plant, and/or total harvested seed weight per unit area in comparison to a wild-type control soybean plant lacking the loss-of-function allele is indicative of a soybean plant cell, soybean plant part, or soybean plant comprising the loss-of-function allele. In certain embodiments, the screening is conducted on a population of plants grown under stress. Suitable examples of stress conditions include drought, salt, cold, heat, salt, shade, nutrient deficiency, high planting density, and the presence of pests or pathogens.
Methods for determining whether a soybean plant cell, plant part, or plant comprises a loss-of-function allele of the endogenous soybean FT1a gene are provided. Methods for determining the presence or absence of the loss-of-function allele can be used in, for example, breeding programs for identification, selection, introgression, and the like.
In certain embodiments, the methods comprise analyzing a polynucleotide comprising a portion of SEQ ID NO: 3 or an allelic variant thereof or analyzing an RNA encoded by a portion of SEQ ID NO: 3 or an allelic variant thereof from the plant cell, plant part, or plant. In certain embodiments, an insertion, deletion, and/or substitution of one or more nucleotides in the polynucleotide or RNA is indicative of the presence of the loss-of-function allele. Detection of the loss-of-function allele in a nucleic acid sample (e.g., DNA, RNA, or cDNA) can be achieved by any combination of nucleic acid amplification (e.g., PCR amplification), hybridization, sequencing, and/or mass-spectrometry based techniques. In certain embodiments, such detection is achieved by amplification and/or hybridization-based detection methods using a primer (e.g., selective amplification primers) and/or probe (e.g., capable of selective hybridization or generation of a specific primer extension product) which specifically recognizes the FT1a gene (e.g., a portion of SEQ ID NO: 3 or an allelic variant thereof). Such primers and/or probes can comprise or consist of about 15, 20, 25, 30, 40, 45 or 50 more contiguous nucleotides of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, the primers or probes can comprise or consist of about 10 to 50 contiguous nucleotides, about 10 to 40 contiguous nucleotides, about 10 to 30 contiguous nucleotides or about 15 to 30 contiguous nucleotides of SEQ ID NO: 3 or an allelic variant thereof. In certain embodiments, the hybridization probes (e.g., polynucleotides comprising at least about 15 to 30 base pairs of SEQ ID NO: 3 or an allelic variant thereof) can comprise detectable labels (e.g., fluorescent, radioactive, epitope, and chemiluminescent labels). In certain embodiments, the FT1a gene can be directly sequenced using nucleic acid sequencing technologies, including whole genome sequencing.
In certain embodiments, the methods comprise analyzing a polypeptide encoded by SEQ ID NO: 3, a portion thereof, or an allelic variant thereof from the soybean plant cell, plant part, or plant. In certain embodiments, an insertion, deletion, and/or substitution of one or more amino acid residues of the polypeptide or a change in the biologic or biochemical activity of the polypeptide is indicative of the presence of the loss-of-function allele. Detection of the loss-of-function allele based on the polypeptide can be determined by methods well known in the art such as activity assays, western blots using antibodies capable of specifically binding the polypeptide, enzyme-linked immunosorbent assays (ELISA), radioimmunoassays (RIA), immunohistochemistry, immunocytochemistry, immunofluorescence, and the like.
In certain optional embodiments, the soybean plant cells disclosed herein are non-regenerable soybean plant cells. In certain optional embodiments provided herein, the soybean plant cells, soybean plant propagules (e.g., a seed, seedling, ovule, embryo, pollen, root, stem, leaf, shoot, explant, or callus), and soybean plants provided herein are not produced by an exclusively biological process. In certain optional embodiments provided herein, the methods for producing soybean plant cells, soybean plant propagules (e.g., a seed, seedling, ovule, embryo, pollen, root, stem, leaf, shoot, explant, or callus), and soybean plants provided herein are not exclusively biological processes.
The following non-limiting numbered embodiments also form part of the present disclosure:
The present disclosure also provides for soybean plants and plant parts comprising a mutated BS1 or mutated BS2 gene which can optionally further comprise a loss-of-function allele of an FT1a, JAG1, and/or TFL1b gene, along with methods of making and using the same. Additional non-limiting embodiments of the disclosure are provided herein as follows:
A vector was created to transform soybean plants and disrupt the open reading frame of the GmFTA1a gene (Glyma10g38970; SEQ ID NO: 3) through CRISPR-mediated gene editing. A CRISPR guide RNA comprising a crRNA fused to a spacer RNA encoded by SEQ ID NO: 11 was designed to target exon 1 of the Glycine max FT1a gene.
Genotypes recovered included −10:13D and −4:9D. The −10:13D genotype has a 13 bp deletion (SEQ ID NO: 5) resulting in a frameshift mutation and introducing a premature stop codon. This −10:13D genotype is predicted to encode a truncated 84 amino acid polypeptide (SEQ ID NO: 6). The −4:9D genotype has a 9 bp deletion (SEQ ID NO: 8) predicted to encode a polypeptide with a 3 amino acid internal deletion (SEQ ID NOs: 9 and 10).
Seeds of the homozygous −4:9D mutant line SENF2228 and the homozygous −10:13D mutant line SENF2229 were increased and planted in rows in the field along with checks and lines having unrelated edits. Data on total yield from field trials showed a tendency towards higher yield per plot for SENF2228 and SENF2229 compared to adjacently grown wild types, null segregants, and some unrelated mutants.
Some field-grown plants were also subjected to more detailed phenotyping. SENF2228 and SENF2229 showed a consistent tendency towards higher yield when compared to controls grown at comparable densities. This trend was apparent in yield components such as pod count, seed count, and total seed weight, though the individual seed weight was comparable to that of null segregant and wild type checks.
Seeds of the homozygous −4:9D mutant line SENF2228 and the homozygous −10:13D mutant line SENF2229 were increased and planted in rows in the field along with a NINF1170 check. The results of the field test are shown in Table 1. Phenotypic characteristics of SENF2228, SENF2229 and comparison variety NINF1170 are shown in Table 1.
1 “Seed Yield” refers to the yield adjusted to 12% moisture in bushels/acre of the grain at harvest.
2 “Plants Per Acre” refers to the estimated number of soybean plants per acre.
3 “Plant height” refers to the plant height taken from soil level to the apical node on the main stem of the plant at maturity and is measured in inches.
4 “Time to Flowering” refers to the number of days from planting when 50% of the plants have at least one open flower at any node on the main stem
5 “Time to Beginning Pod” refers to the number of days from planting when 50% of plants had pods 3/16 inch long at one of the four uppermost nodes on the main stem with a fully developed leaf.
6 “Time to Beginning Seed” refers to the number of days from planting when 50% of the plants have seed ⅛ inch long in a pod at one of the four uppermost nodes on the main stem with a fully developed leaf.
7 “Time to Full Maturity” refers to the number of days from planting when 50% of the plants had 95% of their pods reach full maturity color.
Additional soybean plants comprising loss-of-function mutations in the FT1a gene were generated essentially as described in Example 1 using a gRNA comprising the spacer RNA encoded by SEQ ID NO: 11 and a type V Cas nuclease. Soybean plants generated included the SENF2225 and SENF2226 plants which contain the −16:41D deletion in the FT1a gene shown in
Additional deletions in the soybean FT1a gene in a variety of different soybean genotypes generated with the gRNA comprising the spacer RNA encoded by SEQ ID NO: 11 and the type V Cas nuclease are provided in Table 2, which is incorporated herein by reference in its entirety. Additional deletions in the soybean JAG1 gene in a variety of different soybean genotypes generated with the gRNA comprising the spacer RNA encoded by SEQ ID NO: 17 and the type V Cas nuclease are provided in Table 3, which is incorporated herein by reference in its entirety. The left-most column of Tables 2 and 3 labelled “variant_ID” provides the coordinates of the different FT1a and JAG1 gene deletions (respectively) relative to the cleavage site in the soy FT1a gene DNA specified by the gRNA comprising the spacer RNA encoded by SEQ ID NO: 11 and relative to the cleavage site in the soy JAG1 gene specified by the gRNA comprising the spacer RNA encoded by SEQ ID NO: 17 (respectively). The cleavage site for the type V Cas nuclease (e.g., a Cas12 nuclease) is located at a position in the gRNA spacer (e.g., encoded by SEQ ID NO: 11 or 17) that is 19 nucleotides away from the PAM site. Different deletions in the target FT1a and JAG1 genes can fall either 5′ or 3′ to the cleavage site specified by the gRNA. In the variant_ID of column 1, the first number represents either the number of deleted nucleotides 5′ of the cleavage site (negative (−) values) or 3′ of the cleavage site (positive values (+)) for deletions which extend either 5′ or 3′ of the cleavage site. The cleavage site in the FT1a gene specified by the spacer encoded by SEQ ID NO: 11 is located between nucleotides 346 (“−1”) and 347 (“+1”) in the FT1a SEQ ID NO: 3 reference sequence. The cleavage site in the JAG1 gene specified by the spacer encoded by SEQ ID NO: 17 is located between nucleotides 425 (“−1”) and 426 (“+1”) in the JAG1 SEQ ID NO: 18 reference sequence. The second number in the variant_ID represents the total number of nucleotides which have been deleted. This nomenclature used in Tables 2 and 3 is illustrated in
The SENF2225 and SENF2226 plants also contained loss-of-function mutations in the soybean JAG1 gene as shown in
Field tests on the SENF2225 and SENF2226 soybean plants were performed at various locations in the midwestern United States during two growing seasons as described in Tables 4-11. The second growing season results are summarized in Table 12. The check (control lacking the FT1a and JAG1 mutations) is NINF1170.
33 ± 2.6
34 ± 3.6
63 ± 0.9
The plasmid pIN1512 was created to transform soybean plants and disrupt the open reading frame of the of GmBS1 (Glyma10g38970; SEQ ID NO: 25) and/or GmBS2 (Glyma20g28840; SEQ ID NO: 26) genes through CRISPR-mediated gene editing. This plasmid was constructed using the strategy and techniques described by Čtermák et al., 2017, The Plant Cell. 29 (6) 1196-1217; DOI: 10.1105/tpc.16.00922). The pIN1512vector has the following two functional expression cassettes between the right and left T-DNA border. A dicot ubiquitin gene promoter and 5′ untranslated region (UTR) drives the expression a CRISPR-Cas nuclease transcript. The Cas gene had a SV40 nuclear localization signal (NLS) fused to the 5′ end and a nucleoplasmin NLS fused to the 3′ end. The coding sequence was followed by an Arabidopsis thaliana heat shock gene terminator. Another expression cassette is made up of an Arabidopsis thaliana U6-26 promoter driving the expression of a CRISPR guide RNA comprising a crRNA fused to a spacer RNA (encoded by SEQ ID NO: 36) designed to target the Glycine max BS1 or BS2 gene and followed by an RNA polymerase iii termination signal.
The other functional elements of the pIN1512 vector were derived from a standard Agrobacterium binary transformation plasmid that can replicate in both Escherichia coli and Agrobacterium tumefaciens. They are a T-DNA right border sequence followed by an expression cassette to confer glyphosate resistance to the transgenic plants. This cassette consisted of the Arabidopsis thaliana Ubiquitin 10 gene promoter and 5′ UTR, Agrobacterium sp. strain CP4 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene and the Pisum sativum rib-1,5-bisphospate carboxylase (rbcS) small subunit gene terminator. The insertion site was followed by a T-DNA left border sequence.
The plasmid pIN1340 was transformed into Agrobacterium tumefaciens EHA105 (Hood et al., 1993, Transgenic Research. 2: 208-218. doi:10.1007/BF01977351) by electroporation following standard techniques. Frozen glycerol stocks were prepared for use in plant transformation.
Transgenic T0 soybean events were made by Agrobacterium-mediated transformation with vector pIN1512. Sterilized soybean seeds were imbibed in water overnight, and explants were prepared as mature cotyledon halves with trimmed hypocotyls. The explants went through the typical transformation and regeneration steps of infection and co-cultivation, shoot induction and elongation and selection, rooting, and transplanting to soil to produce T1 seeds (see, for example, Li et al, Optimization of Agrobacterium-Mediated Transformation in Soybean (2017) Frontiers in Plant Science v8 Article 246; Pareddy et al. Transgenic Res. 2020 June; 29(3):267-281. doi: 10.1007/s11248-020-00198-8).
T0 plants were grown and genotyped by amplicon sequencing (AmpSeq).
T0 regenerants with chimeric edited tissues were analyzed in the greenhouse. T0 plants comprising chimeric edited tissue containing: (i) no edits in the two BS1 or two BS2 genes (i.e., an editing dosage of 0 in the total of four BS1 and BS2 genes); (ii) an edit (e.g., deletion) in one of the two BS1 or in one of the two BS2 genes (i.e., an editing dosage of 1); (iii) an edit in two of the total of four BS1 and BS2 genes (i.e., an editing dosage of 2); (iv) an edit in of 3 of the total of four BS1 and BS2 genes (i.e., an editing dosage of 3); or (v) an edit in all four of the BS1 and BS2 genes (i.e., an editing dosage of 4) were analyzed. The editing dosage for BS1 and BS2 was correlated to a phenotypic score (“Pheno Score”) as shown in
T1 soybean plants collected from T0 plants described in Example 2 were designated as families (Fam1, Fam2, Fam3, Fam4, and Fam5), and their harvested seed weight was analyzed (
120 T1 seeds were planted from T1 Family 3 and resultant plants were assayed for presence of the transgene and edits in the BS1 or BS2 gene. Selected transgene null plants were analyzed and only the −1:5D BS2 deletion allele (SEQ ID NO: 31) was transmitted. 15 transgene null plants which were heterozygous for the −1:5D BS2 deletion allele (SEQ ID NO: 31) and 7 transgene null plants homozygous for the −1:5D BS2 deletion allele (SEQ ID NO: 31) were identified.
Seeds from these 7 T1 −1:5D BS2 deletion allele (SEQ ID NO: 32) homozygous edited plants were collected and planted in the field. Unedited NINF1170 and null NINF1170 soybean plants were used as checks (i.e., controls). Planting was at 30″ spacing; 15 ft single rows, with a target plant per acre (PPA) of about 110,000. Most phenotypes were on a plot basis. For intensive phenotyping, 2 ft of plants from a row (about 10 plants) were used.
Data was collected at the plot level for agronomic phenotypes (yield, plant height, maturity date, PPA, plot wt., moisture, lodging, etc.); 1000 seed weight (5 subsamples), and seeds/lb (calculated from 1000 seed weight). Data was collected at a plant level for pod count per plant, seed count per plant, total weight of seeds per plant, and single seed weight (average per plant). Data for the field experiment are summarized in Table 13 and
The transgene null plants homozygous for the −1:5D BS2 deletion allele (SEQ ID NO: 31) exhibited increases in the average weight of 1000 seed, average seed weight, and average of single seed weight in comparison the wild-type checks. The transgene null plants homozygous for the −1:5D BS2 deletion allele (SEQ ID NO: 31) also exhibited a decrease in the average number of seeds per pound of seeds in comparison the wild-type checks.
A homozygous mutant for the −1:5D BS2 allele (SENF2104; SEQ ID NO: 31) was grown out in the field alongside null segregants (SENF2104N) and non-edited (wild-type) plants of the same genetic background (NINF1170). The weight per 100 seeds averaged from multiple field trials is shown in Table 14.
Field plots were planted at eleven locations in Iowa, Ohio, Kansas, Nebraska, Indiana, and Illinois. The results consistently indicated, as shown in Table 15, that double BS2 null mutants (SENF2104) had seeds of higher weight as compared to wild-type plants (NINF1170) and null segregants (SENF2104N).
A description of the biological sequences which are provided in the co-filed sequence listing and also referred to in column 3 of the accompanying Tables 2 and 3 are provided in Table 16.
All cited patents and patent publications referred to in this application are incorporated herein by reference in their entirety. All of the materials and methods disclosed and claimed herein can be made and used without undue experimentation as instructed by the above disclosure and illustrated by the examples. Although the materials and methods of this disclosure have been described in terms of embodiments and illustrative examples, it will be apparent to those of skill in the art that substitutions and variations can be applied to the materials and methods described herein without departing from the concept, spirit, and scope of the disclosure. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope, and concept of the disclosure as encompassed by the embodiments of the disclosures recited herein and the specification and appended claims.
This U.S. non-provisional patent application claims the benefit of U.S. provisional patent application Ser. No. 63/490,375, filed Mar. 15, 2023, and is a bypass continuation-in-part of international patent application PCT/US24/10705, filed Jan. 8, 2024, which claims the benefit of U.S. provisional patent application Ser. No. 63/479,312, filed Jan. 10, 2023, the entire contents of which including the specification, claims, figures, and sequence listing are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
63490375 | Mar 2023 | US | |
63479312 | Jan 2023 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US24/10705 | Jan 2024 | WO |
Child | 18606199 | US |