A computer readable text file, entitled “056100-5093-WO-SequenceListing.txt,” created on or about 12 Mar. 2014 with a file size of about 923 kb contains the sequence listing for this application and is hereby incorporated by reference in its entirety.
Field of the Invention
The present invention relates to methods of increasing the levels of at least one sugar in developing seeds in a plant, with the methods comprising inserting an exogenous nucleic acid, which codes for at least one sugar transporter protein (SWEET protein), into a plant cell to create a transgenic plant cell, and subjecting the transgenic plant cell to conditions that promote expression of the at least one SWEET protein during seed development. The methods results in transgenic plant seeds, and transgenic plants that produce seed, where the levels of at least one sugar are increased as compared to seeds from non-transgenic plants of the same species grown under the same conditions.
Background of the Invention
Yield potential is determined by the efficiency with which plants intercept light, harness it as chemical energy and ultimately make storage products in harvest organs. Sugars are a dominant currency in these transactions, yet the path from the arrival of sucrose at the terminal phloem endings that enter developing seeds and the subsequent transfer and conversion steps that leads to seed filling are among the least understood parts of the energy conversion chain.
In most plants sucrose is the major form of carbohydrate translocated from source to sink tissues. Sucrose is synthesized predominantly in leaf cells via a pair of enzymes, sucrose phosphate synthase and sucrose phosphate phosphatase, and is then exported into the apoplasm by sucrose transporters of the SWEET family and subsequently imported into the vasculature with the help of sucrose/H+ co-transporters of the SUT family. It is assumed that the driving force for sucrose translocation in the phloem is created by active import of sucrose into the veins, thereby creating an osmotic gradient and pressure driven flow and that SWEETs feed the SUTs. One of the least understood areas of carbon allocation is phloem unloading and specifically the transfer of sugars from the maternal phloem to the developing embryo and endosperm. In legumes, post-phloem unloading is assumed to occur symplasmically, via plasmodesmata, followed by efflux of sucrose from the seed coat via an elusive efflux transport mechanism. The developing legume embryo takes up sucrose with the help of sucrose/H+ cotransporters of the SUT family. Overexpression of SUT1 in developing embryos of pea led to increased sucrose influx, indicating that there is potential for increasing yield through increasing active influx into the embryo in large seed dicots. Accumulation of carbohydrates in the embryo is further driven by enzymatic conversion of sucrose to hexoses and activated hexoses via invertases and sucrose synthase as well as by consuming these products by synthesis of starch and other storage compounds.
Sucrose-metabolizing enzymes such as cell wall invertase (Mn1) in the basal endosperm transfer layer (BETL) and sucrose synthase (SuSy) in the endosperm also play crucial roles in carbon transfer. This two-step degradation is indicative of re-synthesis of sucrose in the endosperm before conversion into starch. However to date, and despite its pivotal role in determining yield, the path of sugar transfer and metabolism in maize kernels remains somewhat unclear. Little is known about membrane transporters that drive accumulation of sugars in this important organ.
Metabolism and transport are closely coupled at the cellular, subcellular, tissue, and whole organism level. While most modeling of metabolic and transport networks in plant systems have been focused on the cellular level, models at the tissue level that integrate transport and metabolic production, consumption and storage are well established for mammalian systems. Brain, heart and liver models have successfully integrated multiple transported metabolites undergoing metabolism through linked metabolic steps of several pathways in several tissue compartments inside and between cells. Established theoretical frameworks, together with modern computing hardware and software tools, allow numerical solution and testing of models that capture key features of tissue level transport and transformation of substrates and products.
Several published plant studies have integrated transport, metabolism and/or storage to varying degrees. Detailed modeling of spatial and developmental auxin transport and signaling has elegantly illuminated hormonal regulation of meristem growth. Published models of sucrose transport, metabolism and storage in sugarcane led to the identification of control points, and a target for increasing flux to sucrose was identified and experimentally validated by transgenic manipulation.
The present invention relates to methods of increasing the levels of at least one sugar in developing seeds in a plant, with the methods comprising inserting an exogenous nucleic acid, which codes for at least one sugar transporter protein (SWEET protein), into a plant cell to create a transgenic plant cell, and subjecting the transgenic plant cell to conditions that promote expression of the at least one SWEET protein during seed development. The methods results in transgenic plant seeds, and transgenic plants that produce seed, where the levels of at least one sugar are increased as compared to seeds from non-transgenic plants of the same species grown under the same conditions.
The present invention relates to methods of increasing the levels of at least one sugar in developing seeds in a plant, with the methods comprising inserting an exogenous nucleic acid, which codes for at least one sugar transporter protein (SWEET protein), into a plant cell to create a transgenic plant cell, and subjecting the transgenic plant cell to conditions that promote expression of the at least one SWEET protein during seed development. The methods results in transgenic plant seeds, and transgenic plants that produce seed, where the levels of at least one sugar are increased as compared to seeds from non-transgenic plants of the same species grown under the same conditions.
The SWEET proteins, in general, belong to the PFAM family “MtN3_slv” (Accession No. PF03083). See pfam.sanger.ac.uk, which is a database of protein families that are determined and represented by multiple sequence alignments and hidden Markov models (HMMs). In one embodiment of the present invention, the SWEET transporter proteins utilized in the methods, constructs, plants and plant seeds of the present invention are uniporters, which is a well-known term in the art that means a protein that facilitates transport through facilitated diffusion, i.e., the molecules being transported are being transported with the solute gradient. Uniporters do not typically utilize energy for movement of the molecules they transport, other than harnessing the solute gradient.
SWEET proteins are well-known in the art, and their primary amino acid structures can be found in a variety of databases including but not limited to plant membrane protein databases such as aramemnon.botanik.uni-koeln.de, C. elegans protein databases such as www.wormbase.org, and even in human transporter databases, such as www.tcdb.org. In general SWEETs have a characteristic modular structure that is different from other sugar transporters. For example, SWEETs have a different three-dimensional structure from lac permease, yeast hexose transporters, human GLUTs or human SGLTs. The basic unit of a SWEET transporter is a domain composed of three transmembrane domains (TMs). In bacteria, proteins with 3 TMs have to form at least one dimer to create a sugar transporting pore. The eukaryotic versions of the SWEET proteins contain a repeat of this subunit, which is separated by an additional TM domain. This additional TM domain (“TM4”) is not conserved amongst family members, thus the specific amino acid sequence of this domain is not critical to proper functioning across the kingdom of SWEET proteins. This additional TM4 domain serves as an inversion linker that puts the two repeat units of 3 TMs into a parallel configuration, which is how the dimer is formed with the bacterial protein. This 7 TM structure is unique from all other known sugar transporters. That the animal versions of these SWEET proteins as well as bacterial proteins from this same family all transport sugars is indicative that the plant version of these SWEET proteins sugar transporters.
Members of the SWEET transporter superfamily are defined both by conserved amino acid sequences and structural features. For example, all SWEETs are composed of 7 TM divided in two conserved MtN3/saliva motifs embedded in the tandem 3 TM repeat unit, which is connected by a central TM helix that is less conserved, indicating that this central TM serves as a linker. The resulting structure has been described as the 3-1-3 TM SWEET structure.
The first TM domain on average is predicted to be composed of 23 amino acids, but could vary between 20 and 25. Within this TM domain there are at least 4 highly conserved amino acids: G, P, T and F.
The second TM domain on average is predicted to be composed of 19 amino acids, but could vary between 16 and 23. Within this TM domain there are at least 3 highly conserved amino acids: P, Y and Y.
The third TM domain on average is predicted to be composed of 23 amino acids, but could vary between 20 and 25. Within this TM domain there are at least 3 highly conserved amino acids: T, N and G.
The fifth TM domain on average is predicted to be composed of 23 amino acids, but could vary between 20 and 25. Within this TM domain there are at least 3 highly conserved amino acids: G, P and L.
The fifth loop, linking together TM 5 and 6, has 2 highly conserved amino acids: V and T.
The sixth TM domain on average is predicted to be composed of 23 amino acids, but could vary between 19 and 25. Within this TM domain there are at least 7 highly conserved amino acids: S, V, M, P, L, S and Y.
The sixth loop, linking together TM 6 and 7, has a highly conserved amino acid: D.
The seventh TM domain on average is predicted to be composed of 23 amino acids, but could vary between 20 and 25. Within this TM domain there are at least 5 highly conserved amino acids: P, N, G, Q and Y.
Both sugar transport and the seven TM three-dimensional structure are the two key features for this superfamily of proteins. Despite the great variability in size or sequence, and despite the broad number of organisms from which they can be isolated, all SWEETs tested using different heterologous systems have shown sugar transport function.
In one embodiment of the present invention, the SWEET transporter proteins utilized in the methods, constructs, plants and plant seeds of the present invention are sucrose or hexose uniporters. A hexose uniporter is, as the name implies, a transporter protein that transports hexose sugars, e.g., cyclic hexoses, aldohexoses and ketohexoses. Examples of sucrose or hexose uniporters that may be utilized in the methods, constructs, plants and plant seeds of the present invention include but are not limited to glucose uniporters and fructose uniporters.
In general, SWEETs from a particular species of plant can be categorized into clades, or groups, based on amino acid sequence similarity. In maize, for example there are four clades of SWEET proteins based on sequence similarity within each Glade. For example, Clade I in Zea mays contains SWEETS 1a, 1b, 2, 3a and 3b; Clade II contains SWEETs 4a, 4b, 4d, 6a and 6b; Clade III contains SWEETs 11, 12a, 12b, 13a, 13b, 13c, 14a, 14b, 15a and 15b; Clade IV contains SWEETs 16a, 16b and 17. The number of the specific SWEET protein in maize is used to reflect the phylogenetic relationship to Arabidopsis SWEETs, e.g., SWEET11 in maize is most closely related, by sequence comparison, to SWEET 11 in Arabidopsis, and smaller letters are used to indicate a possible gene amplification relative to Arabidopsis.
Accordingly, the numbering of the SWEET proteins, e.g., SWEET 1, SWEET 2, etc., refers to the amino acid sequence of that specific SWEET protein as derived from Arabidopsis thaliana, as well as orthologs in other species, based on amino acid sequence comparison. Thus, although the gene and protein nomenclature refers to genes and proteins identified in The Arabidopsis Information Resource (TAIR) database, which is available on the worldwide web at www.arabidopsis.org, it is understood that the invention is not limited to genes and proteins only in Arabidposis and that the invention encompasses orthologs of genes in other species. For example, it is understood that the methods, constructs, plants and plant seeds of the present invention utilizing the transporter(s) encoded by the genes AtSweet1-At1G21460, AtSweet2-At3G14770, AtSweet3-At5G53190, AtSweet4-At3G28007, AtSweet5-At5G62850, AtSweet6-At1G66770, AtSweet7-At4G10850, AtSweet8-At5G40260, AtSweet9-At2G39060, AtSweet10-At5G50790, AtSweet11-At3G48740, AtSweet12-At5G23660, AtSweet13-At5G50800, AtSweet14-At4G25010, AtSweet15-At5G13170, AtSweet16-At3G16690 and AtSweet17-At4G15920 in Arabidopsis (accession numbers following the gene name, e.g., “At1G21460,” refer accession numbers from the TAIR database, as described above) to can be applied to methods, constructs, plants and plant seeds utilizing the transporter(s) encoded by the orthologous genes in another species. As used herein, orthologous genes are genes from different species that perform the same or similar function and are believed to descend from a common ancestral gene and thus share a certain amount of amino acid identities in their sequence. Often, proteins encoded by orthologous genes have similar or nearly identical amino acid sequence identities to one another, and the orthologous genes themselves have similar nucleotide sequences, particularly when the redundancy of the genetic code is taken into account. Thus, by way of example, the ortholog of a sucrose transporter in Arabidopsis would be a sucrose transporter in another species of plant, regardless of the amino acid sequence of the two proteins.
In specific embodiments, the SWEET transporter proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from crops plants, such as a food crops, feed crops or biofuels crops. Exemplary important crops may include corn, wheat, soybean, cotton and rice. Crops also include corn, wheat, barley, triticale, soybean, cotton, millet, sorghum, sugarcane, sugar beet, potato, tomato, grapevine, citrus (orange, lemon, grapefruit, etc), lettuce, alfalfa, common bean, fava bean and strawberries, sunflowers and rapeseed, cassava, miscanthus and switchgrass. Other examples of plants include but are not limited to an African daisy, African violet, alfalfa, almond, anemone, apple, apricot, asparagus, avocado, azalea, banana and plantain, beet, bellflower, black walnut, bleeding heart, butterfly flower, cacao, caneberries, canola, carnation, carrot, cassava, diseases, chickpea, cineraria, citrus, coconut palm, coffee, common bean, maize, cotton, crucifers, cucurbit, cyclamen, dahlia, date palm, douglas-fir, elm, English walnut, flax, Acanthaceae, Agavaceae, Araceae, Araliaceae, Araucariacea, Asclepiadaceae, Bignoniaceae, Bromeliaceae, Cactaceae, Commelinaceae, Euphobiaceae, Gentianaceae, Gesneriaceae, Maranthaceae, Moraceae, Palmae, Piperaceae, Polypodiaceae, Urticaceae, Vitaceae, fuchsia, geranium, grape, hazelnut, hemp, holiday cacti, hop, hydrangea, impatiens, Jerusalem cherry, kalanchoe, lettuce, lentil, lisianthus, mango, mimulus, monkey-flower, mint, mustar, oats, papaya, pea, peach and nectarine, peanut, pear, pearl millet, pecan, pepper, Persian violet, pigeonpea, pineapple, pistachio, pocketbook plant, poinsettia, potato, primula, red clover, rhododendron, rice, rose, rye, safflower, sapphire flower, spinach, strawberry, sugarcane, sunflower, sweetgum, sweet potato, sycamore, tea, tobacco, tomato, verbena, and wild rice.
Based on the description of the amino acid sequences of SWEET transporters disclosed herein, one of skill could easily identify any SWEET transporter from virtually any plant species. Once identified, one of skill in the art can use readily available methods for isolating the coding sequence of the identified SWEET protein from a given species to produce nucleic acids encoding the desired SWEET proteins.
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from Zea mays. Examples of nucleic acid sequences and/or amino acid sequences of the SWEET proteins include but are not limited to ZmSweet1a-GRMZM2G039365, ZmSweet1b-GRMZM2G153358, ZmSweet2-GRMZM2G324903, ZmSweet3a-GRMZM2G179679, ZmSweet3b-GRMZM2G060974, ZmSweet4a-GRMZM2G000812, ZmSweet4b-GRMZM2G144581, ZmSweet4d-GRMZM2G137954, ZmSweet6a-GRMZM2G157675, ZmSweet6b-GRMZM2G416965, ZmSweet11-GRMZM2G368827, ZmSweet12a-GRMZM2G133322, ZmSweet12b-GRMZM2G099609, ZmSweet13a-GRMZM2G173669, ZmSweet13b-GRMZM2G021706, ZmSweet13c-GRMZM2G179349, ZmSweet14a-GRMZM2G094955, ZmSweet14b-GRMZM2G015976, ZmSweet15a-GRMZM2G168365, ZmSweet15b-GRMZM5G872392, ZmSweet16a-GRMZM2G106462, ZmSweet16b-GRMZM2G111926, ZmSweet17-GRMZM2G107597. Accession numbers following the gene name, e.g., “GRMZM2G039365,” refer accession numbers from the Maize Genetics and Genomics database at www.maizegdb.org as described above.
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from Orya sativa. Examples of nucleic acid sequences and/or amino acid sequences of the SWEET proteins include but are not limited to OsSweet1a-Os01g65880, OsSweet1b-Os05g35140, OsSweet2a-Os01g36070, OsSweet2b-Os01g50460, OsSweet3a-Os05g12320, OsSweet3b-Os01g12130, OsSweet4-0s02g19820, OsSweet5-0s05g51090, OsSweet6a-Os01g42110, OsSweet6b-Os01g42090, OsSweet7a-Os09g08030, OsSweet7b-Os09g08440, OsSweet7c-Os12g07860, OsSweet7d-Os09g08490, OsSweet7e-Os09g08270, OsSweet11-0s08g42350, OsSweet12-Os03g22590, OsSweet13-Os12g29220, OsSweet14-Os11g31190, OsSweet15-Os02g30910, OsSweet16-Os03g22200. Accession numbers following the gene name, e.g., “Os01g65880,” refer accession numbers from the Greenphyl database (version 4) at www.greenphyl.org as described herein, or the TIGR database at ice.plantbiology.msu.edu.
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from Arabidopsis thaliana. Examples of nucleic acid sequences and/or amino acid sequences of the SWEET proteins include but are not limited to AtSweet1-At1G21460, AtSweet2-At3G14770, AtSweet3-At5G53190, AtSweet4-At3G28007, AtSweet5-At5G62850, AtSweet6-At1G66770, AtSweet7-At4G10850, AtSweet8-At5G40260, AtSweet9-At2G39060, AtSweet10-At5G50790, AtSweet11-At3G48740, AtSweet12-At5G23660, AtSweet13-At5G50800, AtSweet14-At4G25010, AtSweet15-At5G13170, AtSweet16-At3G16690, AtSweet17-At4G15920. Accession numbers following the gene name, e.g., “At5G23660,” refer accession numbers from the TAIR database as described above.
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from Medicago truncatula. Examples of nucleic acid sequences and/or amino acid sequences of the SWEET proteins include but are not limited to MtSWEET2b-AC235677_9, MtSWEET3c-Medtr1g028460, MtSWEET1a-Medtr1g029380, MtSWEET15a-Medtr2g007890, MtSWEET6-Medtr3g080990, MtSWEET1b-Medtr3g089125, MtSWEET3a-Medtr3g090940, MtSWEET3b-Medtr3g090950, MtSWEET13-Medtr3g098910, MtSWEET11-Medtr3g098930, MtSWEET4-Medtr4g106990, MtSWEET15b-Medtr5g067530, MtSWEET9a-Medtr5g092600, MtSWEET5a-Medtr6g007610, MtSWEET5c-Medtr6g007623, MtSWEET5d-Medtr6g007633, MtSWEET5b-Medtr6g007637, MtSWEET2c-Medtr6g034600, MtSWEET9b-Medtr7g007490, MtSWEET15d-Medtr7g405710, MtSWEET15c-Medtr7g405730, MtSWEET2a-Medtr8g042490, MtSWEET14-Medtr8g096310, MtSWEET12-Medtr8g096320, MtSWEET7-Medtr8g099730, MtSWEET16-Mtr.42164.1.S1. Accession numbers following the gene name, e.g., “Medtr1g028460,” refer accession numbers from the legume genome database at www.plantgrn.noble.org as described herein
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention are SWEET proteins from Glycine max. Examples of nucleic acid sequences and/or amino acid sequences of the SWEET proteins include but are not limited to GmSWEET1a-XP003526670, GmSWEET1b-Glyma13g09140, GmSWEET1c-Glyma14g27610, GmSWEET2-XP003540515, GmSWEET3a-XP003544116, GmSWEET3b-Glyma13g08190, GmSWEET3c-ACU24301, GmSWEET3d-Glyma04g41680, GmSWEET4-Glyma17g09840, GmSWEET5a-Glyma19g01280, GmSWEET5b-Glyma19g01270, GmSWEET6a-Glyma20g16160, GmSWEET6b-Glyma13g10560.1, GmSWEET7-Glyma08g02890, GmSWEET9a-XP00355271, GmSWEET9b-XP003552719, GmSWEET9c-Glyma08g48281, GmSWEET10a-XP003532478, GmSWEET10b-Glyma05g38340, GmSWEET10c-NP001237418, GmSWEET10d-XP003523161, GmSWEET10e-Glyma06g17540, GmSWEET11a-XP003532471, GmSWEET11b-Glyma05g38351, GmSWEET12a-Glyma04g37530, GmSWEET12b-XP003526939, GmSWEET15a-Glyma08g19580, GmSWEET15b-Glyma15g05470, GmSWEET15c-XP003524088, GmSWEET15d-XP003551863, GmSWEET15e-Glyma08g47561, GmSWEET15f-Glyma18g53930, GmSWEET16a-Glyma09g04840, GmSWEET16b-Glyma15g16030, GmSWEET17-Glyma19g42040. Accession numbers following the gene name, e.g., “Glyma19g42040,” refer accession numbers from the legume genome database at www.plantgrn.noble.org or the Phytozome database at www.photozome.net, as described herein.
In other embodiments, the methods, constructs, plants and plant seed of the present invention may comprise or comprise the use of at least one exogenous nucleic acid encoding a SWEET protein or variant thereof, wherein the exogenous nucleic acid encodes a SWEET or variant thereof comprising an amino acid sequence that is at least 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to any one of the amino acid sequences of SEQ ID NOs: 1-410. In another embodiment, the methods, constructs, plants and plant seed of the present invention may comprise or comprise the use of at least one exogenous nucleic acid encoding a SWEET protein or variant thereof, wherein the exogenous nucleic acid encodes a SWEET or variant thereof consists of an amino acid sequence that is at least 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to any one of the amino acid sequences of SEQ ID NOs: 1-410.
Arabidopsis
thaliana
Nicotiana
attenuata
Brassica rapa
Populus
trichocarpa
Lotus
japonicus
Oryza sativa
Zea mays
Medicago
truncatula
Glycine max
Mus musculus
Homo sapiens
Caenorhabditis
elegans
Xenopus laevis
Brady-
rhizobium
japonicum
The invention relates to isolated nucleic acids encoding a SWEET, or variant thereof, and to constructs, cells, host cells, plant tissue and plant seeds comprising these nucleic acids. The nucleic acids of the invention can be DNA or RNA. The nucleic acid molecules can be double-stranded or single-stranded RNA or DNA; single stranded RNA or DNA can be the coding, or sense, strand or the non-coding, or antisense, strand. In particular, the nucleic acids may encode any SWEET or variant thereof, as well as fusion proteins. For example, the nucleic acids of the invention include polynucleotide sequences that encode glutathione-S-transferase (GST) fusion protein, poly-histidine (e.g., His6), poly-HN, poly-lysine, hemagglutinin, HSV-Tag. If desired, the nucleotide sequence of the isolated nucleic acid can include additional non-coding sequences such as non-coding 3′ and 5′ sequences (including regulatory sequences, for example).
The nucleic acid molecules of the invention can be “isolated.” As used herein, an “isolated” nucleic acid molecule or nucleotide sequence is intended to mean a nucleic acid molecule or nucleotide sequence that is not flanked by nucleotide sequences normally flanking the gene or nucleotide sequence (as in genomic sequences) and/or has been completely or partially removed from its native environment, e.g., a cell, tissue. For example, nucleic acid molecules that have been removed or purified from cells are considered isolated. In some instances, the isolated material will form part of a composition, for example, a crude extract containing other substances, buffer system or reagent mix. In other circumstances, the material may be purified to near homogeneity, for example as determined by PAGE or column chromatography such as HPLC. Thus, an isolated nucleic acid molecule or nucleotide sequence can includes a nucleic acid molecule or nucleotide sequence which is synthesized chemically, using recombinant DNA technology or using any other suitable method. To be clear, a nucleic acid contained in a vector would be included in the definition of “isolated” as used herein. Also, isolated nucleotide sequences include recombinant nucleic acid molecules, e.g., DNA, RNA, in heterologous organisms, as well as partially or substantially purified nucleic acids in solution. “Purified,” on the other hand is well understood in the art and generally means that the nucleic acid molecules are substantially free of cellular material, cellular components, chemical precursors or other chemicals beyond, perhaps, buffer or solvent. “Substantially free” is not intended to mean that other components beyond the novel nucleic acid molecules are undetectable. The nucleic acid molecules of the present invention may be isolated or purified. Both in vivo and in vitro RNA transcripts of a DNA molecule of the present invention are also encompassed by “isolated” nucleotide sequences.
The invention also encompasses variations of the nucleotide sequences of the invention, such as those encoding functional fragments or variants of the polypeptides as described herein. Such variants can be naturally-occurring, or non-naturally-occurring, such as those induced by various mutagens and mutagenic processes. Intended variations include, but are not limited to, addition, deletion and substitution of one or more nucleotides which can result in conservative or non-conservative amino acid changes, including additions and deletions.
The invention described herein also relates to fragments of the isolated nucleic acid molecules described herein. The term “fragment” is intended to encompass a portion of a nucleotide sequence described herein which is from at least about 20 contiguous nucleotides to at least about 50 contiguous nucleotides or longer in length. Such fragments may be useful as probes and primers. In particular, primers and probes may selectively hybridize to the nucleic acid molecule encoding the polypeptides described herein. For example, fragments which encode polypeptides that retain activity, as described below, are particularly useful.
The invention also provides nucleic acid molecules that hybridize under high stringency hybridization conditions, such as for selective hybridization, to the nucleotide sequences described herein (e.g., nucleic acid molecules which specifically hybridize to a nucleotide sequence encoding polypeptides described herein and encode a modified growth factor isooherin). Hybridization probes include synthetic oligonucleotides which bind in a base-specific manner to a complementary strand of nucleic acid. Suitable probes include polypeptide nucleic acids, as described in Nielsen et al., Science, 254:1497-1500 (1991).
Such nucleic acid molecules can be detected and/or isolated by specific hybridization e.g., under high stringency conditions. “Stringency conditions” for hybridization is a term of art that refers to the incubation and wash conditions, e.g., conditions of temperature and buffer concentration, which permit hybridization of a particular nucleic acid to a second nucleic acid; the first nucleic acid may be perfectly complementary, i.e., 100%, to the second, or the first and second may share some degree of complementarity, which is less than perfect, e.g., 60%, 75%, 85%, 95% or more. For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity.
“High stringency conditions”, “moderate stringency conditions” and “low stringency conditions” for nucleic acid hybridizations are explained in Current Protocols in Molecular Biology, John Wiley & Sons, (1998)), which is incorporated by reference. The exact conditions which determine the stringency of hybridization depend not only on ionic strength, e.g., 0.2×SSC, 0.1×SSC of the wash buffers, temperature, e.g., room temperature, 42° C., 68° C., etc., and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Thus, high, moderate or low stringency conditions may be determined empirically.
By varying hybridization conditions from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions which will allow a given sequence to hybridize with the most similar sequences in the sample can be determined.
Exemplary conditions are described in Krause, M. H. and S. A. Aaronson, Methods in Enzymology, 200:546-556 (1991), which is incorporated by reference. Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each degree (° C.) by which the final wash temperature is reduced, while holding SSC concentration constant, allows an increase by 1% in the maximum extent of mismatching among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in Tm. Using these guidelines, the washing temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought. Exemplary high stringency conditions include, but are not limited to, hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1×SSC at 60° C. Example of progressively higher stringency conditions include, after hybridization, washing with 0.2×SSC and 0.1% SDS at about room temperature (low stringency conditions); washing with 0.2×SSC, and 0.1% SDS at about 42° C. (moderate stringency conditions); and washing with 0.1×SSC at about 68° C. (high stringency conditions). Washing can be carried out using only one of these conditions, e.g., high stringency conditions, washing may encompass two or more of the stringency conditions in order of increasing stringency. Optimal conditions will vary, depending on the particular hybridization reaction involved, and can be determined empirically.
Equivalent conditions can be determined by varying one or more of the parameters given as an example, as known in the art, while maintaining a similar degree of identity or similarity between the target nucleic acid molecule and the primer or probe used. Hybridizable nucleotide sequences are useful as probes and primers for identification of organisms comprising a nucleic acid of the invention and/or to isolate a nucleic acid of the invention, for example. The term “primer” is used herein as it is in the art and refers to a single-stranded oligonucleotide which acts as a point of initiation of template-directed DNA synthesis under appropriate conditions in an appropriate buffer and at a suitable temperature. The appropriate length of a primer depends on the intended use of the primer, but typically ranges from about 15 to about 30 nucleotides. Short primer molecules generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. A primer need not reflect the exact sequence of the template, but must be sufficiently complementary to hybridize with a template. The term “primer site” refers to the area of the target DNA to which a primer hybridizes. The term “primer pair” refers to a set of primers including a 5′ (upstream) primer that hybridizes with the 5′ end of the DNA sequence to be amplified and a 3′ (downstream) primer that hybridizes with the complement of the 3′ end of the sequence to be amplified.
The nucleic acids described herein can be amplified by methods known in the art. For example, amplification can be accomplished by the polymerase chain reaction (PCR). See PCR Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, NY, N.Y., 1992); PCR Protocols: A Guide to Methods and Applications (eds. Innis, et al., Academic Press, San Diego, Calif., 1990); Eckert et al., PCR Methods and Applications 1:17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Pat. No. 4,683,202, all of which are incorporated by reference. Other suitable amplification methods include the ligase chain reaction (LCR) (see Wu and Wallace, Genomics, 4:560 (1989), Landegren et al., Science, 241:1077 (1988), both of which are incorporated by reference), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA, 86:1173 (1989), incorporated by reference), and self-sustained sequence replication (Guatelli et al., Proc. Nat. Acad. Sci. USA, 87:1874 (1990) incorporated by reference) and nucleic acid based sequence amplification (NASBA).
The present invention also relates to vectors that include nucleic acid molecules of the present invention, host cells that are genetically engineered with vectors of the invention and the production of SWEETs or variants thereof by recombinant techniques.
The terms “peptide,” “polypeptide” and “protein” are used interchangeably herein. As used herein, an “isolated polypeptide” is intended to mean a polypeptide that has been completely or partially removed from its native environment. For example, polypeptides that have been removed or purified from cells are considered isolated. In addition, recombinantly produced polypeptides molecules contained in host cells are considered isolated for the purposes of the present invention. Moreover, a peptide that is found in a cell, tissue or matrix in which it is not normally expressed or found is also considered as “isolated” for the purposes of the present invention. Similarly, polypeptides that have been synthesized are considered to be isolated polypeptides. “Purified,” on the other hand is well understood in the art and generally means that the peptides are substantially free of cellular material, cellular components, chemical precursors or other chemicals beyond, perhaps, buffer or solvent. “Substantially free” is not intended to mean that other components beyond the peptides or variants thereof are undetectable.
In specific embodiments, the SWEET proteins used in methods, constructs, plants and plant seeds of the present invention may comprise or comprise the use of a protein or peptide with an amino acid sequence of any one or more of SEQ ID NOs: 1-410.
In other embodiments, the methods, constructs, plants and plant seed of the present invention may comprise or comprise the use of variants of a SWEET protein. In one embodiment, SWEET variants comprise an amino acid sequence that is at least 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to any one of the amino acid sequences of SEQ ID NOs: 1-410. In another embodiment, the SWEET variants consist of a peptide with an amino acid sequence that is at least 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to any one of the amino acid sequences of SEQ ID NOs: 1-410.
Arabidopsis thaliana
Oryza sativa
Zea mays
Citrus sinensis
Medicago truncatula
Triticum aestivum
Glycine max
Populus trichocarpa
Vitis vinifera
Brachypodium distachyon
Hordeum vulgare
Sorghum bicolor
Picea sitchensis
Physcomitrella patens
Amborella trichopoda
Aquilegia caerulea
Chlamydomonas reinhardtii
Lotus japonicus
Saccharum officinarum
Musa acuminata
Manioth esculenta
Cucumis sativus
Csativus|Cucsa.057980|Cucsa.057980.1
Csativus|Cucsa.057980|Cucsa.057980.2
Csativus|Cucsa.077130|Cucsa.077130.1
Csativus|Cucsa.077130|Cucsa.077130.2
Csativus|Cucsa.091060|Cucsa.091060.1
Csativus|Cucsa.098360|Cucsa.098360.1
Csativus|Cucsa.114740|Cucsa.114740.1
Csativus|Cucsa.114740|Cucsa.114740.2
Csativus|Cucsa.114740|Cucsa.114740.3
Csativus|Cucsa.134790|Cucsa.134790.1
Csativus|Cucsa.134800|Cucsa.134800.1
Csativus|Cucsa.157110|Cucsa.157110.1
Csativus|Cucsa.157120|Cucsa.157120.1
Csativus|Cucsa.181790|Cucsa.181790.1
Csativus|Cucsa.201980|Cucsa.201980.1
Csativus|Cucsa.252960|Cucsa.252960.1
Csativus|Cucsa.277610|Cucsa.277610.1
Csativus|Cucsa.277620|Cucsa.277620.1
Csativus|Cucsa.303950|Cucsa.303950.1
Csativus|Cucsa.339600|Cucsa.339600.1
Csativus|Cucsa.339610|Cucsa.339610.1
Csativus|Cucsa.349380|Cucsa.349380.1
Nicotiana attenuata
Phoenix dactylifera
Phaseolus vulgaris
Pvulgaris|Phvul.003G199300|Phvul.003G199300.1
Pvulgaris|Phvul.009G162700|Phvul.009G162700.1
Pvulgaris|Phvul.009G137700|Phvul.009G137700.1
Pvulgaris|Phvul.009G249700|Phvul.009G249700.1
Pvulgaris|Phvul.009G162900|Phvul.009G162900.1
Pvulgaris|Phvul.009G134300|Phvul.009G134300.1
Pvulgaris|Phvul.009G162800|Phvul.009G162800.1
Pvulgaris|Phvul.005G076300|Phvul.005G076300.2
Pvulgaris|Phvul.005G076300|Phvul.005G076300.1
Pvulgaris|Phvul.011G168100|Phvul.011G168100.1
Pvulgaris|Phvul.008G001100|Phvul.008G001100.1
Pvulgaris|Phvul.008G001200|Phvul.008G001200.1
Pvulgaris|Phvul.004G017200|Phvul.004G017200.1
Pvulgaris|Phvul.004G017400|Phvul.004G017400.1
Pvulgaris|Phvul.004G017300|Phvul.004G017300.1
Pvulgaris|Phvul.004G017100|Phvul.004G017100.1
Pvulgaris|Phvul.001G061900|Phvul.001G061900.1
Pvulgaris|Phvul.001G064300|Phvul.001G064300.1
Pvulgaris|Phvul.006G210800|Phvul.006G210800.1
Pvulgaris|Phvul.006G000600|Phvul.006G000600.1
Pvulgaris|Phvul.002G283800|Phvul.002G283800.1
Pvulgaris|Phvul.002G283900|Phvul.002G283900.1
Pvulgaris|Phvul.002G283900|Phvul.002G283900.2
Pvulgaris|Phvul.002G300900|Phvul.002G300900.1
Pvulgaris|Phvul.002G203600|Phvul.002G203600.1
Ricunus communis
Prunus persica
Ppersica|ppa017677m.g|ppa017677m
Ppersica|ppa010394m.g|ppa010394m
Ppersica|ppa010181m.g|ppa010181m
Ppersica|ppa020717m.g|ppa020717m
Ppersica|ppa024244m.g|ppa024244m
Ppersica|ppa009789m.g|ppa009789m
Ppersica|ppa021855m.g|ppa021855m
Ppersica|ppa014953m.g|ppa014953m
Ppersica|ppa018792m.g|ppa018792m
Ppersica|ppa010594m.g|ppa010594m
Ppersica|ppa023718m.g|ppa023718m
Ppersica|ppa021908m.g|ppa021908m
Ppersica|ppa015264m.g|ppa015264m
Ppersica|ppa010208m.g|ppa010208m
Ppersica|ppa009422m.g|ppa009422m
Ppersica|ppa010808m.g|ppa010808m
Ppersica|ppa017165m.g|ppa017165m
Ppersica|ppa019530m.g|ppa019530m
Ppersica|ppa021919m.g|ppa021919m
In additional embodiments, the peptide variants described herein are functional and capable of transporting at least one sugar when used in the methods, constructs, plants and plant seeds of the present invention. In some embodiments, the SWEET variants of the present invention have an enhanced ability to transport at least one sugar compared to the wild-type SWEET.
A polypeptide having an amino acid sequence at least, for example, about 95% “identical” to a reference an amino acid sequence, e.g., SEQ ID NO: 1, is understood to mean that the amino acid sequence of the polypeptide is identical to the reference sequence except that the amino acid sequence may include up to about five modifications per each 100 amino acids of the reference amino acid sequence. In other words, to obtain a peptide having an amino acid sequence at least about 95% identical to a reference amino acid sequence, up to about 5% of the amino acid residues of the reference sequence may be deleted or substituted with another amino acid or a number of amino acids up to about 5% of the total amino acids in the reference sequence may be inserted into the reference sequence. These modifications of the reference sequence may occur at the N-terminus or C-terminus positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among amino acids in the reference sequence or in one or more contiguous groups within the reference sequence.
As used herein, “identity” is a measure of the identity of nucleotide sequences or amino acid sequences compared to a reference nucleotide or amino acid sequence. In general, the sequences are aligned so that the highest order match is obtained. “Identity” per se has an art-recognized meaning and can be calculated using well known techniques. While there are several methods to measure identity between two polynucleotide or polypeptide sequences, the term “identity” is well known to skilled artisans (Carillo (1988) J. Applied Math. 48, 1073). Examples of computer program methods to determine identity and similarity between two sequences include, but are not limited to, GCG program package (Devereux (1984) Nucleic Acids Research 12, 387), BLASTP, ExPASy, BLASTN, FASTA (Atschul (1990) J. Mol. Biol. 215, 403) and FASTDB. Examples of methods to determine identity and similarity are discussed in Michaels (2011) Current Protocols in Protein Science, Vol. 1, John Wiley & Sons.
In one embodiment of the present invention, the algorithm used to determine identity between two or more polypeptides is BLASTP. In another embodiment of the present invention, the algorithm used to determine identity between two or more polypeptides is FASTDB, which is based upon the algorithm of Brutlag (1990) Comp. App. Biosci. 6, 237-245). In a FASTDB sequence alignment, the query and reference sequences are amino sequences. The result of sequence alignment is in percent identity. In one embodiment, parameters that may be used in a FASTDB alignment of amino acid sequences to calculate percent identity include, but are not limited to: Matrix=PAM, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the length of the subject amino sequence, whichever is shorter.
If the reference sequence is shorter or longer than the query sequence because of N-terminus or C-terminus additions or deletions, but not because of internal additions or deletions, a manual correction can be made, because the FASTDB program does not account for N-terminus and C-terminus truncations or additions of the reference sequence when calculating percent identity. For query sequences truncated at the N- or C-termini, relative to the reference sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminus to the reference sequence that are not matched/aligned, as a percent of the total bases of the query sequence. The results of the FASTDB sequence alignment determine matching/alignment. The alignment percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score can be used for the purposes of determining how alignments “correspond” to each other, as well as percentage identity. Residues of the reference sequence that extend past the N- or C-termini of the query sequence may be considered for the purposes of manually adjusting the percent identity score. That is, residues that are not matched/aligned with the N- or C-termini of the comparison sequence may be counted when manually adjusting the percent identity score or alignment numbering.
For example, a 90 amino acid residue query sequence is aligned with a 100 residue reference sequence to determine percent identity. The deletion occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment does not show a match/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the reference sequence (number of residues at the N- and C-termini not matched/total number of residues in the reference sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched (100% alignment) the final percent identity would be 90% (100% alignment—10% unmatched overhang). In another example, a 90 residue query sequence is compared with a 100 reference sequence, except that the deletions are internal deletions. In this case the percent identity calculated by FASTDB is not manually corrected, since there are no residues at the N- or C-termini of the subject sequence that are not matched/aligned with the query. In still another example, a 110 amino acid query sequence is aligned with a 100 residue reference sequence to determine percent identity. The addition in the query occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment may not show a match/alignment of the first 10 residues at the N-terminus. If the remaining 100 amino acid residues of the query sequence have 95% identity to the entire length of the reference sequence, the N-terminal addition of the query would be ignored and the percent identity of the query to the reference sequence would be 95%.
As used herein, the terms “correspond(s) to” and “corresponding to,” as they relate to sequence alignment, are intended to mean enumerated positions within the reference protein, e.g., wild-type SWEET4d, and those positions in the variant or ortholog SWEET4d that align with the positions with the reference protein. Thus, when the amino acid sequence of a subject SWEET is aligned with the amino acid sequence of a reference SWEET, the amino acids in the subject sequence that “correspond to” certain enumerated positions of the reference sequence are those that align with these positions of the reference sequence, e.g., SEQ ID NO: 2, but are not necessarily in these exact numerical positions of the reference sequence. Methods for aligning sequences for determining corresponding amino acids between sequences are described herein.
Variants resulting from insertion of the polynucleotide encoding a SWEET into an expression vector system are also contemplated. For example, variants (usually insertions) may arise from when the amino terminus and/or the carboxy terminus of a SWEET is/are fused to another polypeptide.
In another aspect, the invention provides deletion variants wherein one or more amino acid residues in a SWEET are removed. Deletions can be effected at one or both termini of the SWEET, or with removal of one or more non-terminal amino acid residues of the SWEET. Deletion variants, therefore, include all functional fragments of a particular SWEET.
Within the confines of the disclosed percent identity, the invention also relates to substitution variants of disclosed polypeptides of the invention. Substitution variants include those polypeptides wherein one or more amino acid residues of a SWEET are removed and replaced with alternative residues. In one aspect, the substitutions are conservative in nature; however, the invention embraces substitutions that are also non-conservative. Conservative substitutions for this purpose may be defined as set out in the tables below. Amino acids can be classified according to physical properties and contribution to secondary and tertiary protein structure. A conservative substitution is recognized in the art as a substitution of one amino acid for another amino acid that has similar properties. Exemplary conservative substitutions are set out in below.
Alternatively, conservative amino acids can be grouped as described in Lehninger (1975) Biochemistry, Second Edition; Worth Publishers, pp. 71-77, as set forth below.
And still other alternative, exemplary conservative substitutions are set out below.
It should be understood that the definition of peptides or polypeptides of the invention is intended to include polypeptides bearing modifications other than insertion, deletion, or substitution of amino acid residues. By way of example, the modifications may be covalent in nature, and include for example, chemical bonding with polymers, lipids, other organic and inorganic moieties. Such derivatives may be prepared to improve intracellular processing, the targeting capacity of the polypeptide for desired cells or tissues and the like. Similarly, the invention further embraces SWEETs or variants thereof that have been covalently modified to include one or more water-soluble polymer attachments such as polyethylene glycol, polyoxyethylene glycol or polypropylene glycol.
The plant cell(s) utilized in methods, constructs, plants and plant seeds of the present invention can be from any part or tissue of a plant including but not limited to the root, stem, leaf, seed, seedcoat, flower, fruit, anther, nectary, ovary, petal, tapetum, xylem, or phloem. If the genetically modified plant cell is comprised within a whole plant, the entire plant need not contain or express the genetic modification.
As described herein, the genetically modified plants and/or plant cells and/or plant seeds may be a plant or from a plant that is a dicot or monocot or gymnosperm. The plant may be crops, such as a food crops, feed crops or biofuels crops. Exemplary important crops may include corn, wheat, soybean, cotton and rice. Crops also include corn, wheat, barley, triticale, soybean, cotton, millet, sorghum, sugarcane, sugar beet, potato, tomato, grapevine, citrus (orange, lemon, grapefruit, etc), lettuce, alfalfa, common bean, fava bean and strawberries, sunflowers and rapeseed, cassava, miscanthus and switchgrass. Other examples of plants include but are not limited to an African daisy, African violet, alfalfa, almond, anemone, apple, apricot, asparagus, avocado, azalea, banana and plantain, beet, bellflower, black walnut, bleeding heart, butterfly flower, cacao, caneberries, canola, carnation, carrot, cassava, diseases, chickpea, cineraria, citrus, coconut palm, coffee, common bean, maize, cotton, crucifers, cucurbit, cyclamen, dahlia, date palm, douglas-fir, elm, English walnut, flax, Acanthaceae, Agavaceae, Araceae, Araliaceae, Araucariacea, Asclepiadaceae, Bignoniaceae, Bromeliaceae, Cactaceae, Commelinaceae, Euphobiaceae, Gentianaceae, Gesneriaceae, Maranthaceae, Moraceae, Palmae, Piperaceae, Polypodiaceae, Urticaceae, Vitaceae, fuchsia, geranium, grape, hazelnut, hemp, holiday cacti, hop, hydrangea, impatiens, Jerusalem cherry, kalanchoe, lettuce, lentil, lisianthus, mango, mimulus, monkey-flower, mint, mustar, oats, papaya, pea, peach and nectarine, peanut, pear, pearl millet, pecan, pepper, Persian violet, pigeonpea, pineapple, pistachio, pocketbook plant, poinsettia, potato, primula, red clover, rhododendron, rice, rose, rye, safflower, sapphire flower, spinach, strawberry, sugarcane, sunflower, sweetgum, sweet potato, sycamore, tea, tobacco, tomato, verbena, and wild rice.
The methods, constructs, plants and plant seeds of the present invention relate to increasing levels of sugar in developing seeds. The terms “sugar” is well known in the art and is used to mean a monosaccharide, a disaccharide, a trisaccharide, a tetrasaccharide or polysaccharide. The sugar or sugars measured may or may not be modified, such as being acetylated. Specifically, the sugars that are increased are selected from the groups consisting of sucrose, fructose, glucose, mannose and galactose. The sugars that are increased may or may not be part of more complex compounds, such as trisaccharides, e.g., raffinose, tetrasaccharides, e.g., stachyose or polysaccharides, e.g., amylose, amylopectin. The invention is not limited to the identity of the specific sugars that are increased in the seeds and plants of the present invention. Indeed, the SWEET transporters of the present invention predominantly transport hexoses, such as but not limited to glucose, mannose, fructose and galactose, as well as disaccharides, such as but not limited to sucrose, lactose, maltose, trehalose, cellobiose into the developing seed. Once inside the seed coat or developing seed coat, however, the seed may utilize these increased hexoses and/or disaccharides to then form more complex sugars. These more complex sugars that may be contained (increased) in the seed or developing seed include but are not limited to disaccharides, trisaccharides, e.g., raffinose, tetrasaccharides, e.g., stachyose or polysaccharides, e.g., amylose, amylopectin.
Thus, an “increase in glucose,” for example, is used herein to mean that the levels of glucose are increased over controls, regardless of whether the glucose is free glucose, i.e., occurs as a monosaccharide, or if the glucose subunit is part of a more complex compound, such as but not limited to disaccharides, trisaccharides, tetrasaccharides, or even polysaccharides. Similarly, an “increase in fructose,” for example, is used herein to mean that the levels of fructose are increased over controls, regardless of whether the fructose is free fructose, i.e., occurs as a monosaccharide, or if the fructose subunit is part of a more complex compound, such as but not limited to disaccharides, trisaccharides, tetrasaccharides, or even polysaccharides. Similarly, an “increase in sucrose,” for example, is used herein to mean that the levels of sucrose are increased over controls, regardless of whether the sucrose is free sucrose, i.e., occurs as a disaccharide, or if the fructose is part of a more complex compound, such as but not limited to trisaccharides, tetrasaccharides, or even polysaccharides. Given that the building blocks of di-, tri-, tetra- and polysaccharides are well known, and that methods are well established for analyzing sugar content in seeds, e.g., Hirst, E. L., et al., Biochem. J., 95:453-458 (1965), Steadman, K., et al., Ann. Botany, 77:667-674 (1996), Buckeridge, M. S., Plant Physiol., 154(3):1017-1023 (2010), all of which are incorporated by reference, one of skill in the art can readily ascertain if there is an increase in the level of sugar in a seed or developing seed compared to control seeds or control developing seeds. In select embodiments, methods of assessing or measuring levels of sugar and/or starch content in seeds include but are not limited to HPLC, NMR and mass spectroscopy.
As used herein, the phase “increase in the levels at least one sugar,” or “increase at least one sugar,” or some derivation thereof, means an increase in the levels of at least one specific, measured sugar in the seed or developing seed, as compared to control seed or control developing seed, even if levels of another sugar in the seed or developing seed may decrease or remain static. Of course, more than one specific, measured sugar may be increased as compared to control seed or control developing seed. In specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least one of at least, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least two of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least three of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least four of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least five of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least six of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least seven of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in at least eight of, glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose or cellobiose into the seed or developing seed. In other specific embodiments, the phrase “increase in the levels of at least one sugar” means an increase in glucose, mannose, fructose, galactose, sucrose, lactose, maltose, trehalose and cellobiose into the seed or developing seed.
As used herein, the term “seed” is used as it is in the art, i.e., an embryonic plant contained in a seed coat and is generated after fertilization and at least some growth within the maternal plant. A “developing seed” is an embryonic plant that has not completed its growth within the maternal plant, or it can be an embryonic plant around which the seed coat has not completely formed. For the purposes of measuring sugars in seeds or developing seeds as that relates to the present invention described herein, the seeds or developing seeds may or may not be contained within the maternal plant. For example, the seeds may be contained within or on a fruit of the plant, and the fruit may or may not be free of the maternal plant at harvest. The location and methods of isolating the seeds or developing seeds is irrelevant for the purposes of the present invention.
The methods, constructs, plants and plant seeds of the present invention relate to inserting an exogenous nucleic acid into a plant cell, wherein the nucleic acid codes for at least one SWEET transporter protein described herein. As used herein, the phrase “exogenous nucleic acid” is used to mean a nucleic acid that normally does not exist or occur in the genome of the plant cell. For example, at least one extra copy of nucleic acid encoding a wild-type SWEET transporter is an exogenous nucleic acid. Of course copies of nucleic acids encoding mutant SWEET transporters would also be considered an exogenous nucleic acid.
In one embodiment, the exogenous nucleic acid that codes for at least one SWEET transporter protein is derived from the same species (which includes being from the same or different subspecies within the same species) in which the exogenous nucleic acid is to be inserted. For example, a nucleic acid coding for the at least one SWEET transporter protein is a nucleic acid encoding the Zea mays SWEET transporter protein and the exogenous nucleic acid is being inserted into Zea mays plant cells. In another embodiment, the exogenous nucleic acid that codes for at least one SWEET transporter protein is derived from a different species in which the exogenous nucleic acid is to be inserted. For example, a nucleic acid coding for the at least one SWEET transporter protein is a nucleic acid encoding the Arabidopsis thaliana SWEET transporter protein and the exogenous nucleic acid is being inserted into Zea mays plant cells. In yet another embodiment, the exogenous nucleic acid that codes for at least one SWEET transporter protein is derived from a different genus in which the exogenous nucleic acid is to be inserted. For example, a nucleic acid coding for the at least one SWEET transporter protein is a nucleic acid encoding the Zea perennis SWEET transporter protein and the exogenous nucleic acid is being inserted into Zea mays plant cells.
Methods for the introduction or insertion of nucleic acid molecules into plants and plant cells are well-known in the art. For example, plant transformation may be carried out using Agrobacterium-mediated gene transfer, microinjection, electroporation or biolistic methods as it is, e.g., described in Potrykus and Spangenberg (Eds.), Gene Transfer to Plants. Springer Verlag, Berlin, N.Y., 1995. Therein, and in numerous other references available to one of skill in the art, useful plant transformation vectors, selection methods for transformed cells and tissue as well as regeneration techniques are described and can be applied to the methods of the present invention.
By inserting the exogenous nucleic acid into a plant cell, a transgenic plant is thus created. The methods generally involve inserting an exogenous nucleic acid into a plant cell. The insertion may be transient such that the inserted nucleic acid is not necessarily inherited to subsequence generations. In the alternative, the insertion may be stable or integrated such that the inserted nucleic acid is inherited to subsequence generations. Moreover, the plant cell into which the nucleic acids are inserted may be in culture or it may be part of a whole plant. For example transfection of nucleic acids into plant cells, as understood herein, includes introducing nucleic acids into plant protoplasts and allowing the protoplasts to develop into a callus, which is then allowed to grow into a mature plant. As used herein, the phrase “growing the transgenic plant cell into a mature plant” is used to mean using culture or non-culture growing conditions that allow the transfected plant cell(s) to develop into a whole plant which will contain the at least one copy of the nucleic acid encoding at least one SWEET transporter protein. In other embodiments, “growing the transgenic plant cell into a mature plant” includes introducing the nucleic acid into a portion of a plant, such as a leaf, embryo or portion thereof, and subsequently regenerating a whole plant (T0 generation) from the leaf, embryo or portion thereof. The T0 generation plants can subsequently be mated or crossed with other plants to produce T1, T2, T3, etc generations of plants. These other “mating plants” crossed with the T0 generation plants that are used to produce subsequent generations of transgenic plants (transgenic for the SWEET transporters described herein) may or may not be wild-type plants. In another embodiment, the mating plants crossed with the T0 generation plants that are used to produce subsequent generations of transgenic plants may or may not be transgenic plants themselves, including but not limited to another T0 generation plant that is transgenic for at least one SWEET transporter disclosed herein). Of course, if the mating plants used to grow the transgenic plant cells into mature transgenic plants are themselves transgenic, the mating plants can be transgenic for the or different protein or nucleic acid. This subsequent crossing or mating of the T0 generation plants into subsequent generations, e.g., T1, T2, T3, etc., is included and contemplated when the phrase “growing transgenic plant cells into a mature transgenic plant” is used herein.
Once the transgenic plant cells are created, the transgenic plant cell(s) may then grow into a transgenic seed-bearing plant using methods disclosed herein and well-established in the art. The seeds produced by the transgenic seed-bearing plants then are capable of producing seeds that have increased sugar content as compared to non-transgenic plants of the same species. As used herein, a “non-transgenic plant” indicates that the plant does not have the same exogenous nucleic acid (as determined by sequence identity) encoding the SWEET protein as the transgenic plants provided herein. Thus, a non-transgenic plant, as used herein, can be a wild-type plant or it may be transgenic for a different nucleic acid, protein, mutation, etc.
As used herein, the phrase “increased levels of sugar” or “the levels are increased” is used to mean that at least one specific sugar, as defined herein, is increased when compared to control levels.
Once levels of at least one sugar are measured or assessed, either directly or indirectly, these measured levels can then be compared to control levels of the least one sugar. Control levels of sugar(s) are levels that are deemed to be levels of sugars in seeds from a non-transgenic plant (as defined herein) from the same species as the transgenic plant and grown in similar, if not the same, conditions. To establish the measured sugar levels of a non-transgenic (“normal”) plant, an individual non-transgenic plant or group of non-transgenic plants may be analyzed to determine levels of the specific sugar in the seeds that the plant or plants typically produce. The methods, constructs, compositions, plants and plant seeds of the present invention do not necessarily require that one skilled in the art actually perform the analysis to determine control levels of the at least one sugar in plants, as such data may be readily accessible in the literature or such data may be provided.
Of course, measurements of normal measured sugar levels can fall within a range of values, and values that do not fall within this “normal range” are said to be outside the normal range. These measurements may or may not be converted to a value, number, factor or score as compared to measurements in the “normal range.” For example, a specific measured value that is above the normal range may be assigned a value or +1, +2, +3, etc., depending on the scoring system devised. The comparison of the measured sugar levels to control levels is to determine if the plant seeds have elevated levels of sugar over control levels of the same sugar in the non-transgenic plants grown in the similar, if not the same, conditions.
The levels of sugar in both control and transgenic seeds can be assessed in a seed or developing seed. In one embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants when the seeds or developing seeds are at roughly the same stage of development. For example, in one embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants at the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least two stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least three stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least four stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least five stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least six stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least seven stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants in at least eight stages selected from the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. In another embodiment, the levels of sugar are measured in seeds or developing seeds from transgenic and non-transgenic plants at the zygote stage of seed development, the pre-globular stage of seed development, the globular stage of seed development, the transition stage of seed development, the heart stage of seed development, the torpedo stage of seed development, the linear cotyledon stage of seed development, the bending cotyledon stage of seed development, or the maturation green stage of seed development. As understood herein, levels of sugar in seeds from transgenic plants are considered as “increased” over levels of sugar in seeds from non-transgenic plants if levels are higher in at least one of these stages of seed development.
As used herein, subject a transgenic plant cell or a transgenic plant to conditions that promote expression of the at least one SWEET transporter is understood to mean that the plant or plant cells are grown under conditions to allow expression of the exogenous nucleic acid. In many instances, such methods of subjecting a plant or plant cell to conditions to allow expression of the at least one SWEET transporter protein include normal growth (greenhouse, field, etc.) conditions. Such circumstances would include instances where the promoter used to drive expression of the nucleic acid encoding the SWEET transporter protein is not an inducible promoter, e.g., a constitutive or tissue specific promoter. In other embodiments, methods of subjecting a plant or plant cell to conditions to allow expression of the at least one SWEET transporter protein include providing a stimulus to the transgenic plant or plant cells to induce expression of the promoter that is operably linked to the nucleic acid encoding the at least one SWEET transporter protein. One of skill in the art will be able to readily recognize the conditions or stimuli that are necessary to induce a chosen inducible promoter to drive expression of a nucleic acid.
The nucleic acid encoding at least one SWEET transporter may be isolated. As used herein, the term isolated refers to molecules separated from other cell/tissue constituents (e.g. DNA or RNA) that are present in the natural source of the macromolecule. The term isolated may also refer to a nucleic acid or peptide that is substantially free of cellular material, viral material, and culture medium when produced by recombinant DNA techniques, or that is substantially free of chemical precursors or other chemicals when chemically synthesized. Moreover, an isolated nucleic acid may include nucleic acid fragments which are not naturally occurring as fragments and would not be found in the natural state.
The nucleic acids to be inserted into the plant cells may be part of an expression vector. An expression vector is one into which a desired nucleic acid sequence may be inserted by restriction and ligation such that it is operably joined or operably linked to regulatory sequences and may be expressed as an RNA transcript. Expression refers to the transcription and/or translation of an endogenous gene, transgene or coding region in a cell.
A coding sequence and regulatory sequences are operably joined when they are covalently linked in such a way as to place the expression or transcription of the coding sequence under the influence or control of the regulatory sequences. If it is desired that the coding sequences be translated into a functional protein, two DNA sequences are said to be operably joined if induction of a promoter in the 5′ regulatory sequences results in the transcription of the coding sequence and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a promoter region would be operably joined to a coding sequence if the promoter region were capable of affecting transcription of that DNA sequence such that the resulting transcript might be translated into the desired protein or polypeptide.
Vectors may further contain one or more promoter sequences. A promoter may include an untranslated nucleic acid sequence usually located upstream of the coding region that contains the site for initiating transcription of the nucleic acid. The promoter region may also include other elements that act as regulators of gene expression. In further embodiments of the invention, the expression vector contains an additional region to aid in selection of cells that have the expression vector incorporated. The promoter sequence is often bounded (inclusively) at its 3′ terminus by the transcription initiation site and extends upstream (5′ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site, as well as protein binding domains responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain “TATA” boxes and “CAT” boxes. A promoter also optionally includes distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription.
Activation of promoters may be specific to certain cells or tissues, for example by transcription factors only expressed in certain tissues, or the promoter may be ubiquitous and capable of expression in most cells or tissues.
A constitutive promoter is a promoter that is active under most environmental and developmental conditions. An inducible promoter is a promoter that is active under certain or specific environmental or developmental regulation. Any inducible promoter can be used, see, e.g., Ward et al. Plant Mol. Biol. 22:361-366, 1993. Exemplary inducible promoters include, but are not limited to, that from the ACEI system (responsive to copper) (Meft et al. Proc. Natl. Acad. Sci. USA 90:4567-4571, 1993, In2 gene from maize (responsive to benzenesulfonamide herbicide safeners) (Hershey et al. Mol. Gen. Genetics 227:229-237, 1991, and Gatz et al. Mol. Gen. Genetics 243:32-38, 1994) or Tet repressor from Tn10 (Gatz et al. Mol. Gen. Genetics 227:229-237, 1991). The inducible promoter may respond to an agent foreign to the host cell, see, e.g., Schena et al. PNAS 88: 10421-10425, 1991. Other promoters include but are not limited to waxy 1 (“wx1”) promoter active in starchy endosperm tissue, the BETL1 promoter, Esr6a and 6b promoters and the Miniature1 (Mn1) promoter.
The inserted exogenous nucleic acid encoding at least one SWEET transporter may be expressed in any location in the cell, including the cytoplasm, cell surface or subcellular organelles such as the nucleus, vesicles, ER, vacuole, etc. Methods and vector components for targeting the expression of proteins to different cellular compartments are well known in the art, with the choice dependent on the particular cell or organism in which the transporter is expressed. See, for instance, Okumoto et al. PNAS 102: 8740-8745, 2005, Fehr et al. J. Fluoresc. 14: 603-609, 2005. Transport of protein to a subcellular compartment such as the chloroplast, vacuole, peroxisome, glyoxysome, cell wall or mitochondrion or for secretion into the apoplast, may be accomplished by means of operably linking a nucleotide sequence encoding a signal sequence to the 5′ and/or 3′ region of a gene encoding the transporter. Targeting sequences at the 5′ and/or 3′ end of the structural gene may determine during protein synthesis and processing where the encoded protein is ultimately compartmentalized.
The presence of a signal sequence directs a polypeptide to either an intracellular organelle or subcellular compartment or for secretion to the apoplasm. The term targeting signal sequence refers to amino acid sequences, the presence of which in or appended to an expressed protein targets it to a specific subcellular localization. For example, corresponding targeting signals may lead to the secretion of the expressed SWEET transporter, e.g. from a bacterial host in order to simplify its purification. In one embodiment, targeting of the transporter may be used to affect the concentration of at least one sugar in a specific subcellular or extracellular compartment. Appropriate targeting signal sequences useful for different groups of organisms are known to the person skilled in the art and may be retrieved from the literature or sequence data bases.
If targeting to the plastids of plant cells is desired, a targeting signal peptide can be used. An example of a targeting signal peptide includes but is not limited to amino acid residues 1 to 124 of Arabidopsis thaliana plastidial RNA polymerase (AtRpoT 3) (Plant Journal 17: 557-561, 1999), the targeting signal peptide of the plastidic Ferredoxin:NADP+ oxidoreductase (FNR) of spinach (Jansen et al. Current Genetics 13: 517-522, 1988), the amino acid sequence encoded by the nucleotides −171 to 165 of the cDNA sequence disclosed therein, the transit peptide of the waxy protein of maize including or without the first 34 amino acid residues of the mature waxy protein (Klosgen et al. Mol. Gen. Genet. 217: 155-161, 1989), the signal peptides of the ribulose bisphosphate carboxylase small subunit (Wolter et al. PNAS 85: 846-850, 1988; Nawrath et al. PNAS 91: 12760-12764, 1994), the signal peptide of the NADP malat dehydrogenase (Gallardo et al. Planta 197: 324-332, 1995), the signal peptide of the glutathione reductase (Creissen et al. Plant J. 8: 167-175, 1995) or the signal peptide of the R1 protein (Lorberth et al. Nature Biotechnology 16: 473-477, 1998).
Targeting to the mitochondria of plant cells may be accomplished by using targeting signal peptides such as but not limited to amino acid residues 1 to 131 of Arabidopsis thaliana mitochondrial RNA polymerase (AtRpoT 1) (Plant Journal 17: 557-561, 1999) or the transit peptide described by Braun (EMBO J. 11: 3219-3227, 1992).
Targeting to the vacuole in plant cells may be achieved by using targeting signal peptides such as but not limited to the N-terminal sequence (146 amino acids) of the patatin protein (Sonnewald et al. Plant J. 1: 95-106, 1991), the signal sequences described by Matsuoka and Neuhaus (Journal of Exp. Botany 50: 165-174, 1999), Chrispeels and Raikhel (Cell 68: 613-616, 1992), Matsuoka and Nakamura (PNAS 88: 834-838, 1991), Bednarek and Raikhel (Plant Cell 3: 1195-1206, 1991) and/or Nakamura and Matsuoka (Plant Phys. 101: 1-5, 1993).
Targeting to the ER in plant cells may be achieved by using, e.g., the ER targeting peptide HKTMLPLPLIPSLLLSLSSAEF in conjunction with the C-terminal extension HDEL (Haselhoff, PNAS 94: 2122-2127, 1997). Targeting to the nucleus of plant cells may be achieved by using, e.g., the nuclear localization signal (NLS) of the tobacco C2 polypeptide QPSLKRMKIQPSSQP (SEQ ID NO: 411).
Targeting to the extracellular space may be achieved by using a transit peptide such as but not limited to the signal sequence of the proteinase inhibitor II-gene (Keil et al. Nucleic Acid Res. 14: 5641-5650, 1986, von Schaewen et al. EMBO J. 9: 30-33, 1990), of the levansucrase gene from Erwinia amylovora (Geier and Geider, Phys. Mol. Plant Pathol. 42: 387-404, 1993), of a fragment of the patatin gene B33 from Solanum tuberosum, which encodes the first 33 amino acids (Rosahl et al. Mol Gen. Genet. 203: 214-220, 1986) or of the one described by Oshima et al. (Nucleic Acids Res. 18: 181, 1990).
Additional targeting to the plasma membrane of plant cells may be achieved by fusion to a different transporter, preferentially to the sucrose transporter SUT1 (Riesmeier, EMBO J. 11: 4705-4713, 1992). Targeting to different intracellular membranes may be achieved by fusion to membrane proteins present in the specific compartments such as vacuolar water channels (γTIP) (Karlsson, Plant J. 21: 83-90, 2000), MCF proteins in mitochondria (Kuan, Crit. Rev. Biochem. Mol. Biol. 28: 209-233, 1993), triosephosphate translocator in inner envelopes of plastids (Flugge, EMBO J. 8: 39-46, 1989) and photosystems in thylacoids.
Targeting to the Golgi apparatus can be accomplished using the C-terminal recognition sequence K(X)KXX where “X” is any amino acid (Garabet, Methods Enzymol. 332: 77-87, 2001. Targeting to the peroxisomes can be done using the peroxisomal targeting sequence PTS I or PTS II (Garabet, Methods Enzymol. 332: 77-87, 2001).
SWEETs Involvement in Seed Filling
Although no SWEET involvement has been found in embryos, overexpression of other sugar transporters, such as the tonoplast monosaccharide transporter (TMT1), under the control of a constitutive cauliflower mosaic virus 35S promoter, has been shown to increase biomass of Arabidopsis seeds. See Wingenter, K., et al., Plant Physiol., 154(2): 665-677 (October 2010), which is incorporated by reference. In particular, Wingenter et al. showed that increasing expression of the TMT1 transporter increased lipid and protein content in Arabidopsis seeds. Specifically, Arabidopsis overexpressing TMT1 grew faster than wild-type plants on soil and in high-glucose (Glc)-containing liquid medium. Soil-grown TMT1 overexpressor mutants produced larger seeds and greater total seed yield, which was associated with increased lipid and protein content. These changes in seed properties were correlated with slightly decreased nocturnal CO2 release and increased sugar export rates from detached source leaves. Thus, increased TMT activity in Arabidopsis induced modified subcellular sugar compartimentation, altered cellular sugar sensing, affected assimilate allocation, increased the biomass of Arabidopsis seeds, and accelerated early plant development.
In other contexts, Rossi, G., et al., Microbial Cell Factories, 9:15 (March 2010) (doi:10.1186/1475-2859-9-15) also reports that yeast cells engineered to overexpress the hexose transporter HXT1 or HXT7, lead to increased in glucose uptake in the cells. In still other contexts, Wang et al. (2008) reports that the rice GIF1 (Grain Incomplete Filling 1) gene encoding a cell-wall invertase is required for carbon partitioning during early grain-filling. Ectopic expression of the cultivated GIF1 gene with the 35S or rice Waxy promoter resulted in smaller grains, whereas overexpression of GIF1, driven by its native promoter, increased grain production. These findings, together with the domestication signature, which were identified by comparing nucleotide diversity of the GIF1 loci between cultivated and wild rice, strongly suggest that GIF1 is a potential domestication gene and that such a domestication-selected gene can be used for further crop improvement.
Analysis of cell-specific expression in developing seeds is consistent with a role of several SWEETs in sugar import into developing seeds. Analysis of public databases and prior publications indicates that Arabidopsis SWEET1, 4, 5, 7 and 8 (Clade I and II hexose transporters), are expressed in seeds during seed maturation: SWEET1 and 7 in seed coat, SWEET8 in endosperm, and SWEETS in embryo. See Chen, L. Q., et al., Nature, 468:527-532 (2010), which is incorporated by reference. Moreover, SWEET10, 11, 12 and 15 (Clade III sucrose transporters) are expressed in seeds during maturation, specifically SWEET11, 12 and 15 in seed coat, SWEET10 in the chalazal seed coat, and SWEET11 and 15 in the endosperm. See Chen, L. Q., et al., Science, 335:207-211 (January 2012).
Analysis provided herein confirms that GFP-fusions of SWEET11, 12 and 15 are expressed in seed coat (
Consistent with these preliminary data that implicate SWEETs in seed filling in Arabidopsis, 12 out of the 22 SWEETs are highly expressed in maize kernels (See Maize eFP at bar.utoronto.ca/efp_maize/cgi-bin/efpWeb.cgi and QTELLER at qteller.com). Four Clade I/II hexose transporter SWEETs are highly expressed in seeds. Specifically, SWEET4b and SWEET4d are found both in embryo and endosperm, and SWEET4a and SWEET2 are expressed throughout the seed. Moreover, 8 sucrose transporting Clade III SWEETs are also expressed during seed maturation. Specifically, SWEET11, SWEET13b, SWEET13c and SWEET15b are expressed throughout the seed, SWEET14a, SWEET14b, SWEET15a and SWEET15b are expressed primarily in endosperm. Particularly, the three hexose transporters 4a,b and d (
In the W22 background, caryopses collapse, endosperm is greatly reduced and embryo size appears smaller. This smaller phenotype is reminiscent of the documented mnl phenotype. In view of the high expression of ZmSWEET4d in BETL, the likely cause for this smaller phenotype is a block in sugar uptake into BETL affecting downstream kernel filling. The current model implicates a minimal number of transport steps, however, multiple SWEET and SUT paralogs in both Arabidopsis and maize seeds were identified, indicating greater complexity in seed filling, with the possibility of additional transport steps.
Automating the extraction, curation and reduction of spectroscopic data has greatly accelerated analysis of 13C-labeling data. The use of a single 13C-labeled sample analysis of endosperm tissue from single kernels can discriminate among four individual sibling plants from the same generation of a non-transgenic maize line. The plants and cultured kernels were grown together and the seed weights and compositions (starch, protein, oil, or cell wall contents and major soluble metabolite levels) were not significantly different among seeds from each plant. The small metabolic flux differences revealed by 13C-labeling patterns are due to segregation of the non-transgenic genetic background. Differences in flux profiles among 4 transgenic lines with the same growth and composition have been noted. In silico simulations using steady state flux maps are also able to predict the labeling patterns accompanying modest changes in core metabolism. A 10% change in TCA cycle flux results in ˜1% change in total carbon allocation and is associated with a distinct labeling phenotype, which can be discriminated from wild-type and other altered metabolic flux patterns, each of which yields its own label signatures or fingerprints.
SWEETs Involvement in Nectar Production
Plants have evolved anatomical and physiological features to attract animals to promote pollination. Reproductive isolation as one mechanism for speciation, is thought to be enhanced in animal pollinated species relative to wind transfer of pollen. Floral traits, including animal pollination, floral nectar spurs, bilateral symmetry and dioecious sexual system, can alter subsequent species abundance within clades. When Gaston de Saporta, Joseph Hooker, Oswald Heer and Charles Darwin discussed the ‘abominable mystery’—the apparent rapid radiation of angiosperms and insects in the mid-Cretaceous—de Saporta suggested that the development and refinement of insect-assisted pollination through the coevolution of pollinators and flowering plants may have been key to pollinator and angiosperm diversification. However the molecular mechanism of nectar secretion has remained elusive.
Flowering plants have evolved intricate methods to secure efficient interaction with pollinators and, thereby, both successful reproduction and genetic diversity through cross-pollination. Central to this process is the nectar, which contains high amounts of sugars and volatile compounds that attract and reward pollinators as well as toxins that repel unwanted floral visitors and compel pollinators to optimize outcrossing rates. Nectar composition varies widely quantitatively and qualitatively between species, presumably because it is produced to reward different families of animals. Depending on the species, 8 to 80% (w/w) of nectar is comprised of sugars, the most prevalent of which are sucrose, glucose and fructose. Nectar differs in composition from phloem sap, which delivers sugars to nectaries and is dominated by the di- and tri-saccharides sucrose and raffinose. Angiosperm nectar is synthesized and secreted by specialized organs called nectaries. Plants invest significant amounts of energy into the formation of flowers, the production of nectaries, and the secretion of sugary nectar. For example, Nicotiana attenuata, a self-compatible, hawkmoth- and hummingbird-pollinated Asterid, produces nectar that contains sucrose, hexoses and numerous secondary metabolites including nicotine. Brassica rapa, comprising self-compatible and incompatible varieties, produces hexose-dominant sugar. Arabidopsis thaliana, a self-compatible, self-fertilizer, also develops functional nectaries that produce volatiles and secrete hexose-rich nectar. It remains unclear whether nectar production in self-fertilizing plants represents an evolutionary remnant or may function to secure the low rate of outcrossing. Thus, understanding the phylogeny and biochemistry of nectar secretion may help to elucidate the processes underlying diversification of angiosperms.
Despite the importance of nectar, its secretion process has remained a matter of debate, with few functional data on the transport mechanism. To identify a transporter responsible for nectar secretion, databases of candidate sugar transporters were searched for those transporters specifically expressed in nectaries with characteristics compatible with cellular sugar efflux. Members of the recently identified SWEET sugar transporter family appeared as prime candidates for a role in nectar secretion. SWEET11 and 12 sucrose transporters are known to be responsible for cellular efflux that is key to phloem loading and, therefore, for translocation of sucrose from photosynthetic tissue to heterotrophic tissue, such as roots, flowers and seeds. Previous studies had described a SWEET9 homolog in Petunia hybrida, PhNEC1, to be specifically expressed in nectaries, and developmental timing of PhNEC1 expression has previously been correlated inversely with nectar starch content, making this transporter a prime candidate for having a role in nectar secretion. SWEET9, a close relative of NEC1, is highly expressed in Arabidopsis nectaries.
Previous studies had identified a subfamily of SWEETs as a novel class of sucrose efflux transporters responsible for moving sucrose from phloem parenchyma, the first step of loading sucrose into the vascular conduits of the phloem. Microarray and RT-PCR analyses show that SWEET9, which shares ˜50% sequence identity with SWEET11 and 12, is specifically expressed in Arabidopsis nectaries. SWEET9 is the only SWEET highly expressed in nectaries, therefore it is conceivable that SWEET9 mediates sucrose or hexose transport for nectar production. Transport studies show that SWEET9 mediates uptake and efflux of sucrose as assayed in Xenopus oocytes. Sucrose transport activity of SWEET9 was further confirmed by coexpression of SWEET9 with a Förster Resonance Energy Transfer (FRET) sucrose sensor in human embryonic kidney cells. Together these results show that SWEET9 can mediate both uptake and efflux of sucrose, consistent with the facilitated diffusion mechanism of a sucrose uniporter. Some SWEET homologs had been shown to transport both sucrose and glucose, and although the present inventors have not been able to obtain conclusive data that exclude the possibility that SWEET9 also transports hexoses, hence SWEET9 may also function in hexose efflux.
To determine directly whether SWEET9 is involved in sugar secretion from nectaries, nectar secretion was examined in three independent T-DNA insertion mutant lines [atsweet9-1, sk225 (carries a T-DNA insertion in position −308 before start codon and had no detectable transcript levels), atsweet9-2, SALK_060256 (pos. −940 before start codon and had reduced transcript levels), atsweet9-3, SALK_202913C (pos. 779 after the start codon in exon 4 or +345 from the start codon in the cDNA, knockout line). If SWEET9 plays a role in sugar uptake into or efflux from nectaries, one may expect specific phenotypes in the mutants such as but not limited to reduced sugar content, or, if the sugar efflux creates the osmotic driving force for nectar secretion, the loss of fluid secretion. In Arabidopsis, nectar droplets accumulate inside the cups formed by sepals that surround the lateral nectaries of wild-type flowers. None of the sweet9 mutants produced detectable nectar droplets (
To test if SWEET9 activity is limiting for nectar secretion, nectar secretion was analyzed in transgenic lines expressing SWEET9-GFP fusions under its native promoter in wild-type background. The extra copies of SWEET9 in wild-type background showed increased nectar volume as judged by droplet size quantification (
Without being bound to theory, it is possible that SWEET9 could function in at least one of at least three ways, depending on its localization. First SWEET9 may facilitate sucrose efflux at the phloem strands near nectaries, it may facilitate sugar uptake into nectary parenchyma, and/or it may facilitate sugar efflux from nectary parenchyma delivering sugars to the nectarial apoplasm. Translational fusions with GUS and eGFP under control of the SWEET9 promoter were specifically expressed in floral nectaries (
To assess starch accumulation in sweet9-1, starch was stained with Lugol's iodine solution in fixed sections (
Since SWEET11 and 12 are plasma membrane-localized sucrose efflux transporters, we analyzed the subcellular localization of the SWEET9-promoter driven SWEET9-eGFP fusion. SWEET9-eGFP fusions localized both at the plasma membrane and the Golgi-like compartments (
Bioinformatic analysis from previous studies of gene expression in nectaries of Arabidopsis suggests that genes involved in sucrose biosynthesis are upregulated in nectaries, indicating that resynthesis of sucrose from starch drives sugar efflux via SWEET9. The data previous suggests that two sucrose phosphate synthase (SPS) genes, both of which encode key enzymes for sucrose biosynthesis, are induced to high levels in maturing nectaries. Indeed, the SPS1F and SPS2F genes are highly expressed in nectaries. Artificial microRNA inhibition of the expression of the two SPS genes leads to a loss of nectar secretion and altered starch accumulation, which mimics the phenotype of the sweet9 mutants (
To explore whether SWEET9 is also essential for sugar efflux from nectaries of other Brassicaceae, the ortholog of SWEET9 was identified in turnip flowers (Brassica rapa).
To test whether Asterids also use SWEET9 orthologs for nectar secretion, SWEET9 was identified in Nicotiana attenuata. NaSWEET9 was most highly expressed in nectaries, and expression was found to increase during nectary maturation (
A phylogenetic analysis tentatively traces the origin of SWEET9 to a point before the split of Eudicots (Asterids and Rosids;
A model for the nectar secretion mechanism is shown in
Microarray data show that the several proton-coupled sugar transporters including hexose transporting STPs are also expressed in nectaries (expressions is relatively low compared to SWEET9), indicating that these proton-coupled sugar transporters may serve as selective reuptake activities. The relative activities of cell wall invertase combined with selective reuptake activities may determine the final ratio of sucrose, fructose and glucose (
The observation of starch accumulation in mutant stems at the floral base emphasizes not only the significant energy investment involved in nectar production but also the lack of feedback regulation of sucrose delivery or translocation to other parts of the flower, even in self-pollinating plants such as Arabidopsis. That largely self-pollinating Arabidopsis has retained nectar production and produces volatiles and secretes sugary nectar to attract and reward potential pollinators suggests the importance of securing outcrossed progeny, even at a low rate. This outcrossing plays a role in coevolution and limits inbreeding depression. For highly self-pollinating species with no inbreeding depression, however, nectar sugar accumulation and sugar accumulation in floral stems of sweet9 or cwinv4 may attract pathogens and provide strong selection for reduced nectar production.
Here, the critical role of SWEET9 in nectar secretion has been shown by confirming its expression in nectaries, demonstrating its sucrose transport actions, and showing localization at both the plasma membrane and an intracellular compartment with features similar to the Golgi apparatus. Mutation of SWEET9 or nectary-expressed sucrose phosphate synthase genes led to complete loss of nectar secretion. Surprisingly, sugars delivered to defective nectaries accumulated in the stems at the floral base, indicating the lack of negative feedback on phloem delivery and the inability to relocate the sucrose efficiently. The function of SWEET9 in nectar secretion is conserved in Rosids and Asterids (the two major clades of core Eudicot species), by blocking its expression in A. thaliana, B. rapa and N. attenuata.
The Examples provided herein are meant for illustrative purposes of select embodiments of the present invention are not intended to limit the scope of the invention in any way.
Heterologous expression of SWEET9 from the three species in HEK293T cells and Xenopus oocytes was performed as established in the art. Insertion sites and reduced transcript levels were verified by PCR and qPCR. BrSWEET9 TILLING mutants were obtained from RevGen UK (John Innes Centre, Norwich, UK; revgenuk.jic.ac.uk/) via screening of previously described mutant populations. Wild-type N. attenuata lines were transformed by Agrobacterium tumefaciens (strain LBA 4404) to silence N. attenuata sweet9 (nasweet9). SPS1F and SPS2F were co-silenced via a single amiRNA targeting the mRNAs for both genes, and nectar secretion was evaluated using a compound microscope (Leica MZ6) by eye, and documented by photography. Starch was stained using potassium iodide. Flowers (stage 14˜15, at anthesis) were examined for starch accumulation by iodine—potassium iodide (IKI) staining (Jensen, 1962). Assay was performed following the protocol mentioned in Ruhlmann et al., 2009, which is incorporated by reference.
The Arabidopsis SWEET9 gene encodes for a nectary-specific sugar transporter. Total nectar glucose content ratio and nectar droplet size of AtSWEET9 overexpression lines (driven by its native promoter) vs wild-type, were evaluated. Nectar glucose content was evaluated in each line and showed higher glucose content relative to wild-type nectar (2.04-2.66 times higher). In the same overexpressor lines, the volume of the nectar droplets was also evaluated, showing an average of 31% larger nectar volume compared with the wild-type droplets.
To investigate if the SWEET4d sugar transport activity within the seed BETL could be a liming factor for the sugar accumulation into the maize endosperm, transgenic A188 plants were generated to express both (i) full-length cDNA of gene GRMZM2G137954_T01 (SWEET4d) under the control of the rice Actin promoter, as well as (ii) full-length gDNA of gene GRMZM2G137954_T01 (SWEET4d) using as promoter the native 2 kb of 5′UTR upstream the ATG.
The plasmid used for the production plants containing construct (i) from above (SWEET4d overexpressors using cDNA) contained the backbone of vector pSB11 (Ishida et al., 1996), a Basta resistance cassette (rice Actin promoter and intron, Bar gene, and Nos terminator) next to the right border, and the SWEET4d coding sequence (lacking a stop codon) fused with the fluorescent protein GFP, under the control of the rice Actin promoter next to the left border. The plasmid used for the production of plants containing construct (ii) above (SWEET4d-overexpressors using genomic DNA) contained the backbone of vector pSB11 (Ishida et al., 1996), a Basta resistance cassette (rice Actin promoter and intron, Bar gene, and Nos terminator) next to the right border, and the SWEET4d full-length gDNA sequence (lacking a stop codon) fused with the fluorescent protein GFP, under the control of SWEET4d native promoter (2 kb) promoter next to the left border.
Agrobacterium-mediated transformation of maize inbred line A188 was based on a published protocol (Ishida et al., 2007). For each transformation event, the number of T-DNA insertions was evaluated by qRT-PCR, and the integrity of the transgene was verified by PCR.
References—all of which are incorporated by reference.
Part of the work performed during development of this invention utilized U.S. Government funds under Department of Energy Grant No. DE-FG02-04ER15542 and National Science Foundation Grant No. 0820730. The U.S. Government has certain rights in this invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US14/25310 | 3/13/2014 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61779066 | Mar 2013 | US |