The sequence listing file named “56202_B_seq_listing.txt”, which is 138,581,829 bytes (measured in MS-WINDOWS) which was electronically filed on a CD-ROM which was created on Sep. 28, 2009 is incorporated herein by reference.
Disclosed herein are recombinant DNA useful for providing enhanced traits to transgenic plants, seeds, pollen, plant cells and plant nuclei of such transgenic plants, methods of making and using such recombinant DNA, plants, seeds, pollen, plant cells and plant nuclei. Also disclosed are methods of producing hybrid corn seed comprising such recombinant DNA.
All genetic resources disclosed herein were directly obtained from sources that are currently common to the United States; the ancestral sources of each specific genetic material is unknown.
An aspect of this invention provides recombinant DNA constructs comprising polynucleotides characterized by an encoded protein having amino acids representing a protein family domain module as described in Table 10. Another aspect of this invention provides recombinant DNA constructs comprising polynucleotides characterized by an encoded protein with an amino acid sequence that is at least 90% identical to a corresponding consensus sequence defined in table 8. Yet another aspect of this invention provides recombinant DNA constructs comprising polynucleotides characterized by reference to SEQ ID NO:1-307 and the cognate proteins with amino acid sequences having reference to SEQ ID NO:308-614. The recombinant DNA constructs are useful for providing enhanced traits when stably integrated into the chromosomes and expressed in the nuclei of transgenic plants cells. In some aspects of the invention the recombinant DNA constructs, when expressed in a plant cell, provide for expression of cognate proteins. In those aspects of the invention, the recombinant DNA constructs for expressing cognate proteins are characterized by cognate amino acid sequences having a sequence selected from SEQ ID NOs: 308, 310, 312-315, 317-323, 325-343, 345, 347-349, 352-354, 356, 358-359, 366-372, 374-383, 389-392, 394, 396, 401-403, 405-412, 414, 417-424, 427-453, 455-473, 475, 488-501, 503-517, 519-531, 533-540, 542-543, and 546-614; having at least 90% identity over at least 90% of the length of a sequence selected from the group consisting of SEQ ID NOs: 308, 310, 312-315, 317-323, 325-343, 345, 347-349, 352-354, 356, 358-359, 366-372, 374-383, 389-392, 394, 396, 401-403, 405-412, 414, 417-424, 427-453, 455-473, 475, 488-501, 503-517, 519-531, 533-540, 542-543, and 546-614 or that are homologous to a sequence selected from the group consisting of SEQ ID NOs: 308, 310, 312-315, 317-323, 325-343, 345, 347-349, 352-354, 356, 358-359, 366-372, 374-383, 389-392, 394, 396, 401-403, 405-412, 414, 417-424, 427-453, 455-473, 475, 488-501, 503-517, 519-531, 533-540, 542-543, and 546-614.
In other aspects of the invention the recombinant DNA constructs provide for suppression of a native protein. In those other aspects of the invention the recombinant DNA constructs are characterized as being constructed with sense-oriented and anti-sense-oriented polynucleotides, e.g. polynucleotides derived from genes having SEQ ID NOs: 2, 4, 9, 17, 37, 39, 43-44, 48, 50, 53-58, 66, 77-81, 86, 88, 90-93, 97, 106, 108-109, 118-119, 147, 167, 169-180, 195, 211, 225, 234, or 237-238 or homologous genes. When the recombinant DNA construct is expressed in corn plants, the endogenous protein is a corn protein with an amino acid sequence of SEQ ID NO:316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 474, 476-487, 502, 532, 541, or 544-545 or the corn homolog of SEQ ID NOs:309, 311, 324, 373, 395, 454, or 518; when the recombinant DNA construct is expressed in soybean plants, the endogenous protein is a soybean protein with an amino acid sequence of SEQ ID NO: 309, 324, 373, 395, 518 or is a soybean homolog of SEQ ID NOs: 311, 316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 532, 541, or 544-545; and when the recombinant DNA construct is expressed in a plant other than a corn or a soybean plant, the endogenous protein is the other plant's endogenous protein that has an amino acid sequence homologous to SEQ ID NO: 309, 311, 316, 324, 344, 346, 350-351, 355, 357, 360-365, 373, 384-388, 393, 395, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 518, 532, 541, or 544-545.
In practical aspects of this invention the recombinant DNA constructs of the invention are stably integrated into the chromosome of a plant cell nucleus.
This invention also provides transgenic plant cells comprising the stably integrated recombinant DNA constructs of the invention, transgenic plants and seeds comprising a plurality of such transgenic plant cells and transgenic pollen of such plants. Such transgenic plants are selected from a population of transgenic plants regenerated from plant cells transformed with recombinant DNA constructs by screening transgenic plants for an enhanced trait as compared to control plants. The enhanced trait is one or more of enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
In another aspect of the invention the plant cells, plants, seeds, and pollen further comprise DNA expressing a protein that provides tolerance from exposure to an herbicide applied at levels that are lethal to a wild type plant cell.
This invention also provides methods for manufacturing non-natural, transgenic seed that can be used to produce a crop of transgenic plants with an enhanced trait resulting from expression of a stably-integrated recombinant DNA construct. More specifically, the method comprises (a) screening a population of plants for an enhanced trait and a recombinant DNA construct, where individual plants in the population can exhibit the trait at a level less than, essentially the same as or greater than the level that the trait is exhibited in control plants, (b) selecting from the population one or more plants that exhibit the trait at a level greater than the level that said trait is exhibited in control plants, (c) collecting seed from a selected plant, (d) verifying that the recombinant DNA is stably integrated in said selected plants, (e) analyzing tissue of a selected plant to determine the production or suppression of a protein having the function of a protein encoded by nucleotides in a sequence of one of SEQ ID NOs:1-307. In one aspect of the invention, the plants in the population further comprise DNA expressing a protein that provides tolerance to exposure to a herbicide applied at levels that are lethal to wild type plant cells and the selecting is affected by treating the population with the herbicide, e.g. a glyphosate, dicamba, or glufosinate compound. In another aspect of the invention the plants are selected by identifying plants with the enhanced trait. The methods are especially useful for manufacturing corn, soybean, cotton, canola, alfalfa, wheat, rice, sugarcane or sugar beet seed.
Another aspect of the invention provides a method of producing hybrid corn seed comprising acquiring hybrid corn seed from a herbicide tolerant corn plant which also has stably-integrated, recombinant DNA construct comprising a promoter that is (a) functional in plant cells and (b) is operably linked to DNA that encodes or suppresses a protein having the function of a protein encoded by nucleotides in a sequence of one of SEQ ID NOs:1-307. The methods further comprise producing corn plants from said hybrid corn seed, wherein a fraction of the plants produced from said hybrid corn seed is homozygous for said recombinant DNA, a fraction of the plants produced from said hybrid corn seed is hemizygous for said recombinant DNA, and a fraction of the plants produced from said hybrid corn seed has none of said recombinant DNA; selecting corn plants which are homozygous and hemizygous for said recombinant DNA by treating with an herbicide; collecting seed from herbicide-treated-surviving corn plants and planting said seed to produce further progeny corn plants; repeating the selecting and collecting steps at least once to produce an inbred corn line; and crossing the inbred corn line with a second corn line to produce hybrid seed.
Another aspect of the invention provides a method of selecting a plant comprising plant cells of the invention by using an immunoreactive antibody to detect the presence or absence of protein expressed or suppressed by recombinant DNA in seed or plant tissue. Yet another aspect of the invention provides anti-counterfeit milled seed having, as an indication of origin, plant cells of this invention.
Still other aspects of this invention relate to transgenic plants with enhanced water use efficiency or enhanced nitrogen use efficiency. For instance, this invention provides methods of growing a corn, cotton, soybean, or canola crop without irrigation water comprising planting seed having plant cells of the invention which are selected for enhanced water use efficiency. Alternatively methods comprise applying reduced irrigation water, e.g. providing up to 300 millimeters of ground water during the production of a corn crop. This invention also provides methods of growing a corn, cotton, soybean or canola crop without added nitrogen fertilizer comprising planting seed having plant cells of the invention which are selected for enhanced nitrogen use efficiency.
In the attached sequence listing:
SEQ ID NO:1-307 are nucleotide sequences of the coding strand of DNA for “genes” used in the recombinant DNA imparting an enhanced trait in plant cells, i.e. each represents a coding sequence for a protein;
SEQ ID NO: 308-614 are amino acid sequences of the cognate protein of the “genes” with nucleotide coding sequences 1-307;
SEQ ID NO: 615-36442 are amino acid sequences of homologous proteins;
SEQ ID NO: 36443 is a nucleotide sequence of a base plasmid vector useful for corn transformation;
SEQ ID NO: 36444 is a nucleotide sequence of a base plasmid vector useful for soybean and canola transformation;
SEQ ID NO: 36445 is a nucleotide sequence of a base plasmid vector useful for cotton transformation;
SEQ ID NO: 36446 is a nucleotide sequence of a base plasmid vector useful for co-transformation to produce gene stacks in corn;
SEQ ID NO: 36447-36478 are consensus amino acid sequences. Table 8 lists the protein SEQ ID NOs and their corresponding consensus SEQ ID NOs.
As used herein a “plant cell” means a plant cell that is transformed with stably-integrated, non-natural, recombinant DNA, e.g. by Agrobacterium-mediated transformation or by bombardment using microparticles coated with recombinant DNA or other means. A plant cell of this invention can be an originally-transformed plant cell that exists as a microorganism or as a progeny plant cell that is regenerated into differentiated tissue, e.g. into a transgenic plant with stably-integrated, non-natural recombinant DNA, or seed or pollen derived from a progeny transgenic plant.
As used herein a “transgenic plant” means a plant whose genome has been altered by the stable integration of recombinant DNA. A transgenic plant includes a plant regenerated from an originally-transformed plant cell and progeny transgenic plants from later generations or crosses of a transformed plant.
As used herein “recombinant DNA” means DNA which has been a genetically engineered and constructed outside of a cell including DNA containing naturally occurring DNA or cDNA or synthetic DNA.
As used herein “consensus sequence” means an artificial sequence of amino acids in a conserved region of an alignment of amino acid sequences of homologous proteins, e.g. as determined by a CLUSTALW alignment of amino acid sequence of homolog proteins.
As used herein a “homolog” means a protein in a group of proteins that perform the same biological function, e.g. proteins that belong to the same Pfam protein family and that provide a common enhanced trait in transgenic plants of this invention. Homologs are expressed by homologous genes. With reference to homologous genes, homologs include orthologs, i.e. genes expressed in different species that evolved from a common ancestral genes by speciation and encode proteins retain the same function, but do not include paralogs, i.e. genes that are related by duplication but have evolved to encode proteins with different functions. Homologous genes include naturally occurring alleles and artificially-created variants. Degeneracy of the genetic code provides the possibility to substitute at least one base of the protein encoding sequence of a gene with a different base without causing the amino acid sequence of the polypeptide produced from the gene to be changed. When optimally aligned, homolog proteins have at least 60% identity, 65% identity, 70% identity, 75% identity, 80%, identity, 85% identity, 90% identity, 95, 96, 97, 98, or 99% identity over the full length of a protein identified as being associated with imparting an enhanced trait when expressed in plant cells. In one aspect of the invention homolog proteins have an amino acid sequence that has at least 90% identity to a consensus amino acid sequence of proteins and homologs disclosed herein.
Homologs are identified by comparison of amino acid sequence, e.g. manually or by use of a computer-based tool using known homology-based search algorithms such as the suite of BLAST programs available from NCBI. A local sequence alignment program, e.g. BLAST, can be used to search a database of sequences to find similar sequences, and the summary Expectation value (E-value) used to measure the sequence base similarity. Because a protein hit with the best E-value for a particular organism may not necessarily be an ortholog, i.e. have the same function, or be the only ortholog, a reciprocal query is used to filter hit sequences with significant E-values for ortholog identification. The reciprocal query entails search of the significant hits against a database of amino acid sequences from the base organism that are similar to the sequence of the query protein. A hit can be identified as an ortholog, when the reciprocal query's best hit is the query protein itself or a protein encoded by a duplicated gene after speciation. A further aspect of the homologs encoded by DNA useful in the transgenic plants of the invention are those proteins that differ from a disclosed protein as the result of deletion or insertion of one or more amino acids in a native sequence.
Percent identity describes the extent to which the sequences of DNA or protein segments are invariant in an alignment of sequences, for example nucleotide sequences or amino acid sequences. An alignment of sequences is created by manually aligning two sequences, e.g. a stated sequence, as provided herein, as a reference, and another sequence, to produce the highest number of matching elements, e.g. individual nucleotides or amino acids, while allowing for the introduction of gaps into either sequence. An “identity fraction” for a sequence aligned with a reference sequence is the number of matching elements, divided by the full length of the reference sequence, not including gaps introduced by the alignment process into the reference sequence. “Percent identity” (“% identity”) as used herein is the identity fraction times 100.
“Pfam” is a large collection of multiple sequence alignments and hidden Markov models covering many common protein families, e.g. Pfam version 19.0 (December 2005) contains alignments and models for 8183 protein families and is based on the Swissprot 47.0 and SP-TrEMBL 30.0 protein sequence databases. See S. R. Eddy, “Profile Hidden Markov Models”, Bioinformatics 14:755-763, 1998. The Pfam database is currently maintained and updated by the Pfam Consortium. The alignments represent some evolutionary conserved structure that has implications for the proteids function. Profile hidden Markov models (profile HMMs) built from the protein family alignments are useful for automatically recognizing that a new protein belongs to an existing protein family even if the homology by alignment appears to be low.
Protein domains are identified by querying the amino acid sequence of a protein against Hidden Markov Models which characterize protein family domains (“Pfam domains”) using HMMER software, which is available from the Pfam Consortium. The HMMER software is also disclosed in patent application publication US 2008/0148432 A1 incorporated herein by reference. A protein domain meeting the gathering cutoff for the alignment of a particular Pfam domain is considered to contain the Pfam domain.
A “Pfam domain module” is a representation of Pfam domains in a protein, in order from N terminus to C terminus. In a Pfam domain module individual Pfam domains are separated by double colons “::”. The order and copy number of the Pfam domains from N to C terminus are attributes of a Pfam domain module. Although the copy number of repetitive domains is important, varying copy number often enables a similar function. Thus, a Pfam domain module with multiple copies of a domain should define an equivalent Pfam domain module with variance in the number of multiple copies. A Pfam domain module is not specific for distance between adjacent domains, but contemplates natural distances and variations in distance that provide equivalent function. The Pfam database contains both narrowly- and broadly-defined domains, leading to identification of overlapping domains on some proteins. A Pfam domain module is characterized by non-overlapping domains. Where there is overlap, the domain having a function that is more closely associated with the function of the protein (based on the E value of the Pfam match) is selected.
Once one DNA is identified as encoding a protein which imparts an enhanced trait when expressed in transgenic plants, other DNA encoding proteins with the same Pfam domain module are identified by querying the amino acid sequence of protein encoded by candidate DNA against the Hidden Markov Models which characterizes the Pfam domains using HMMER software. Candidate proteins meeting the same Pfam domain module are in the protein family and have cognate DNA that is useful in constructing recombinant DNA for the use in the plant cells of this invention. Hidden Markov Model databases for use with HMMER software in identifying DNA expressing protein with a common Pfam domain module for recombinant DNA in the plant cells of this invention are available from the Pfam Consortium (ftp.sanger.ac.uk/pub/databases/Pfam/) and are incorporated herein by reference.
The HMMER software and Pfam databases (version 23.0) were used to identify known domains in the proteins corresponding to amino acid sequence of SEQ ID NOs: 308-310, 312-313, 315, 317-332, 334-343, 345, 347-364, 366-372, 374, 382, 387-403, 406-412, 414-425, 427-448, 450-460, 462-465, 467-477, 479-482, 484-487, 493-511, 513-539, 542, 545-558, 560-578, 580-582, 584-598, 600-602, 604-608, 612-614. All DNA encoding proteins that have scores higher than the gathering cutoff disclosed in Table 11 by Pfam analysis disclosed herein can be used in recombinant DNA of the plant cells of this invention, e.g. for selecting transgenic plants having enhanced agronomic traits. The relevant Pfams modules for use in this invention, as more specifically disclosed below, are PHD::SET, CBFB_NFYA, 2-Hacid_dh_C, 60KD_IMP, A_thal—3526, AA_kinase, AA_kinase::NAD_binding—3::Homoserine_dh, AA_permease, Aa_trans, ABC_membrane::ABC_tran, ABC_tran, Acetate_kinase, Acid_phosphat_B, Acyl-ACP_TE, AlaDh_PNT_N::AlaDh_PNT_C, Amino_oxidase, Aminotran—1—2 Aminotran—5, Ammonium_transp, Ank::Ank::Ank::Ank::Ank::Ank::Ank::Ank, AP2, Arginase, Arginosuc_synth, AsnA, Asp, Asp_decarbox, ATP-sulfurylase, Auxin_inducible, BCCT, BPD_transp—2, BTB, bZIP—1, bZIP—1::MethyltransfD12, bZIP—2, bZIP—2::bZIP—1, C4dic_mal_tran, CBFB_NFYA, CCT, Clp_N::Clp_N::AAA::AAA—2::ClpB_D2-small, cNMP_binding::Crp, CSD, DEAD::Helicase_C, DEAD—2::DUF1227, Dehydrin Dimerisation::Methyltransf—2, DSPc, DUF1292, DUF506, DUF640, DUF647, DUF828::PH—2, eIF-5a, eRF1—1::eRF1—2::eRF1—3, FAD_binding—3::FHA, FAD_binding—4::ALO, FAD_binding—4::Lact-deh-memb, F-box::Tub, Fe-ADH, Form_Nir_trans, FTCD_N, Gal_Lectin, GATase—2::Asn_synthase, GATase—2::Glu_syn_centrat:Glu_synthase::GXGXG, GDPD, GH3, Gln-synt_N::Gln-synt_C, Globin::FAD_binding—6::NAD_binding—1, Glt_symporter, Glutaredoxin, Glutaredoxin::Glutaredoxin::Glutaredoxin, Glyco_transf—20::Trehalose_PPase, GTP1_OBG::MMR_HSR1::DUF1967, HLH, HMG_box Homeobox::HALZ, Homeobox:: START, HR_lesion, HSP20, HSP70, LIM, LIM::LIM, LisH::WD40::WD40::WD40::WD40::WD40::WD40::WD40 LRR—1::LRR—1::LRR—1::LRR—1::Pkinase_Tyr, LRRNT—2::LRR—1::LRR—1::LRR—1::LRR—1::LRR—1::LRR—1::LRR—1::LRR—1::LRR—1::L RR—1::LRR—1::LRR—1::LRR—1::Pkinase, LRRNT—2::LRR—1::Pkinase_Tyr, LysM, MBD, MethyltransfD 12. MIT::AAA::Vps4_C, Mito_carr::Mito_carr::Mito_carr, MMR_HSR1::KH—2, Monooxygenase_B, Myb_DNA-binding::Myb_DNA-binding, NAD_binding—1, NAD_binding—2, NAM, Ndr, NIR_SIR_ferr::NIR_SIR::NIR_SIR_ferr, Nitroreductase, NUDIX, OKR_DC—1::OKR_DC—1_C, OPT, p450, PAS—2::GAF::Phytochrome::PAS::PAS::HisKA::HATPase_c, PBP, Peptidase_C2::Calpain_III, Peptidase—510, peroxidase, Pkinase, Pkinase::NAF, PLAC8, PMEI, PDX::Homeobox, PP2C, PTA_PTB, PTR2 RCC1::RCC1::RCC1, Response_reg::Myb_DNA-binding, Ribosomal_L21p, RolB_RolC::Amino_oxidase, RRM—1, RRM—1::RRM—1, RRM—1::zf-CCHC RWP-RK::PB1, SBP, SBP_bac—3, Sina, SIS::CBS, SOUL, SPRY, SRF-TF, SRF-TF::K-box, SSF, Ssl1::C1—4, Sulfate_transp::STAS, Thg1::Thg1 Thiolase_N::Thiolase_C, Thioredoxin::Glutaredoxin::Glutaredoxin::Glutaredoxin, TPP_enzyme_N::TPP_enzyme_M::TPP_enzyme_C Transaldolase, tRNA_synt—1c_R1::tRNA_synt—1c_R2::tRNA-synt—1c::tRNA-synt—1c_C, ubiquitin, U-box, Usp, WD40::WD40, WRKY, WRKY::WRKY zf-B_box, zf-B_box::MethyltransfD12, zf-B_box::zf-B_box::CCT, zf-C3HC4, zf-CCCH, zf-CCHC::Plus-3, and zf-D of for which databases are included in the appended computer listing.
As used herein “promoter” means regulatory DNA for initializing transcription. A “plant promoter” is a promoter capable of initiating transcription in plant cells whether or not its origin is a plant cell, e.g. is it well known that Agrobacterium promoters are functional in plant cells. Thus, plant promoters include promoter DNA obtained from plants, plant viruses and bacteria such as Agrobacterium and Bradyrhizobium bacteria. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, or seeds. Such promoters are referred to as “tissue preferred”. Promoters that initiate transcription only in certain tissues are referred to as “tissue specific”. A “cell type” specific promoter primarily drives expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An “inducible” or “repressible” promoter is a promoter which is under environmental control. Examples of environmental conditions that may effect transcription by inducible promoters include anaerobic conditions, or certain chemicals, or the presence of light. Tissue specific, tissue preferred, cell type specific, and inducible promoters constitute the class of “non-constitutive” promoters. A “constitutive” promoter is a promoter which is active under most conditions.
As used herein “operably linked” means the association of two or more DNA fragments in a recombinant DNA construct so that the function of one, e.g. protein-encoding DNA, is controlled by the other, e.g. a promoter.
As used herein “expressed” means produced, e.g. a protein is expressed in a plant cell when its cognate DNA is transcribed to mRNA that is translated to the protein.
As used herein “suppressed” means decreased, e.g. a protein is suppressed in a plant cell when there is a decrease in the amount and/or activity of the protein in the plant cell. The presence or activity of the protein can be decreased by any amount up to and including a total loss of protein expression and/or activity.
As used herein a “control plant” means a plant that does not contain the recombinant DNA that imparts an enhanced trait. A control plant is used to identify and select a transgenic plant that has an enhanced trait. A suitable control plant can be a non-transgenic plant of the parental line used to generate a transgenic plant, i.e. devoid of recombinant DNA. A suitable control plant may in some cases be a progeny of a hemizygous transgenic plant line that does not contain the recombinant DNA, known as a negative segregant.
As used herein an “enhanced trait” means a characteristic of a transgenic plant that includes, but is not limited to, an enhance agronomic trait characterized by enhanced plant morphology, physiology, growth and development, yield, nutritional enhancement, disease or pest resistance, or environmental or chemical tolerance. In more specific aspects of this invention enhanced trait is selected from group of enhanced traits consisting of enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. In an important aspect of the invention the enhanced trait is enhanced yield including increased yield under non-stress conditions and increased yield under environmental stress conditions. Stress conditions may include, for example, drought, shade, fungal disease, viral disease, bacterial disease, insect infestation, nematode infestation, cold temperature exposure, heat exposure, osmotic stress, reduced nitrogen nutrient availability, reduced phosphorus nutrient availability and high plant density. “Yield” can be affected by many properties including without limitation, plant height, pod number, pod position on the plant, number of internodes, incidence of pod shatter, grain size, efficiency of nodulation and nitrogen fixation, efficiency of nutrient assimilation, resistance to biotic and abiotic stress, carbon assimilation, plant architecture, resistance to lodging, percent seed germination, seedling vigor, and juvenile traits. Yield can also be affected by efficiency of germination (including germination in stressed conditions), growth rate (including growth rate in stressed conditions), ear number, seed number per ear, seed size, composition of seed (starch, oil, protein) and characteristics of seed fill.
Increased yield of a transgenic plant of the present invention can be measured in a number of ways, including test weight, seed number per plant, seed weight, seed number per unit area (i.e. seeds, or weight of seeds, per acre), bushels per acre, tons per acre, or kilo per hectare. For example, corn yield may be measured as production of shelled corn kernels per unit of production area, for example in bushels per acre or metric tons per hectare, often reported on a moisture adjusted basis, for example at 15.5 percent moisture. Increased yield may result from improved utilization of key biochemical compounds, such as nitrogen, phosphorous and carbohydrate, or from improved responses to environmental stresses, such as cold, heat, drought, salt, and attack by pests or pathogens. Recombinant DNA used in this invention can also be used to provide plants having improved growth and development, and ultimately increased yield, as the result of modified expression of plant growth regulators or modification of cell cycle or photosynthesis pathways. Also of interest is the generation of transgenic plants that demonstrate enhanced yield with respect to a seed component that may or may not correspond to an increase in overall plant yield. Such properties include enhancements in seed oil, seed molecules such as protein and starch, oil components as may be manifest by an alterations in the ratios of seed components.
Recombinant DNA constructs are assembled using methods well known to persons of ordinary skill in the art and typically comprise a promoter operably linked to DNA, the expression of which provides the enhanced agronomic trait. Other construct components may include additional regulatory elements, such as 5′ leaders and introns for enhancing transcription, 3′ untranslated regions (such as polyadenylation signals and sites), DNA for transit or signal peptides.
Numerous promoters that are active in plant cells have been described in the literature. These include promoters present in plant genomes as well as promoters from other sources, including nopaline synthase (NOS) promoter and octopine synthase (OCS) promoters carried on tumor-inducing plasmids of Agrobacterium tumefaciens and the CaMV35S promoters from the cauliflower mosaic virus as disclosed in U.S. Pat. Nos. 5,164,316 and 5,322,938. Useful promoters derived from plant genes are found in U.S. Pat. No. 5,641,876 which discloses a rice actin promoter, U.S. Pat. No. 7,151,204 which discloses a maize chloroplast aldolase promoter and a maize aldolase (FDA) promoter, and US Patent Application Publication 2003/0131377 A1 which discloses a maize nicotianamine synthase promoter. These and numerous other promoters that function in plant cells are known to those skilled in the art and available for use in recombinant polynucleotides of the present invention to provide for expression of desired genes in transgenic plant cells.
Furthermore, the promoters may be altered to contain multiple “enhancer sequences” to assist in elevating gene expression. Such enhancers are known in the art. By including an enhancer sequence with such constructs, the expression of the selected protein may be enhanced. These enhancers often are found 5′ to the start of transcription in a promoter that functions in eukaryotic cells, but can often be inserted upstream (5′) or downstream (3′) to the coding sequence. In some instances, these 5′ enhancing elements are introns. Particularly useful as enhancers are the 5′ introns of the rice actin 1 (see U.S. Pat. No. 5,641,876) and rice actin 2 genes, the maize alcohol dehydrogenase gene intron, the maize heat shock protein 70 gene intron (U.S. Pat. No. 5,593,874) and the maize shrunken 1 gene. See also US Patent Application Publication 2002/0192813A1 which discloses 5′, 3′ and intron elements useful in the design of effective plant expression vectors.
In other aspects of the invention, sufficient expression in plant seed tissues is desired to affect improvements in seed composition. Exemplary promoters for use for seed composition modification include promoters from seed genes such as napin as disclosed in U.S. Pat. No. 5,420,034, maize L3 oleosin as disclosed in U.S. Pat. No. 6,433,252), zein Z27 as disclosed by Russell et al. (1997) Transgenic Res. 6(2):157-166), globulin 1 as disclosed by Belanger et al (1991) Genetics 129:863-872), glutelin 1 as disclosed by Russell (1997) supra), and peroxiredoxin antioxidant (Per1) as disclosed by Stacy et al. (1996) Plant Mol. Biol. 31(6):1205-1216.
Recombinant DNA constructs useful in this invention will also generally include a 3′ element that typically contains a polyadenylation signal and site. Well-known 3′ elements include those from Agrobacterium tumefaciens genes such as nos 3′, tml 3′, tmr 3′, tms 3′, ocs 3′, tr7 3′, for example disclosed in U.S. Pat. No. 6,090,627; 3′ elements from plant genes such as wheat (Triticum aesevitum) heat shock protein 17 (Hsp17 3′), a wheat ubiquitin gene, a wheat fructose-1,6-biphosphatase gene, a rice glutelin gene, a rice lactate dehydrogenase gene and a rice beta-tubulin gene, all of which are disclosed in US Patent Application Publication 2002/0192813 A1; and the pea (Pisum sativum) ribulose biphosphate carboxylase gene (rbs 3′), and 3′ elements from the genes within the host plant.
Constructs and vectors may also include a transit peptide for targeting of a gene to a plant organelle, particularly to a chloroplast, leucoplast or other plastid organelle. For descriptions of the use of chloroplast transit peptides see U.S. Pat. No. 5,188,642 and U.S. Pat. No. 5,728,925. For description of the transit peptide region of an Arabidopsis EPSPS gene useful in the present invention, see Klee, H. J. et al (MGG (1987) 210:437-442).
Recombinant DNA constructs for gene suppression can be designed for any of a number the well-known methods for suppressing transcription of a gene, the accumulation of the mRNA corresponding to that gene or preventing translation of the transcript into protein. Posttranscriptional gene suppression can be practically effected by transcription of RNA that forms double-stranded RNA (dsRNA) having homology to mRNA produced from a gene targeted for suppression.
Gene suppression can also be achieved by insertion mutations created by transposable elements may also prevent gene function. For example, in many dicot plants, transformation with the T-DNA of Agrobacterium may be readily achieved and large numbers of transformants can be rapidly obtained. Also, some species have lines with active transposable elements that can efficiently be used for the generation of large numbers of insertion mutations, while some other species lack such options. Mutant plants produced by Agrobacterium or transposon mutagenesis and having altered expression of a polypeptide of interest can be identified using the polynucleotides of the present invention. For example, a large population of mutated plants may be screened with polynucleotides encoding the polypeptide of interest to detect mutated plants having an insertion in the gene encoding the polypeptide of interest.
Transgenic plants may comprise a stack of one or more polynucleotides disclosed herein resulting in the production or suppression of multiple polypeptide sequences. Transgenic plants comprising stacks of polynucleotide sequences can be obtained by either or both of traditional breeding methods or through genetic engineering methods. These methods include, but are not limited to, breeding individual lines each comprising a polynucleotide of interest, transforming a transgenic plant comprising a gene disclosed herein with a subsequent gene, and co-transformation of genes into a single plant cell. Co-transformation of genes can be carried out using single transformation vectors comprising multiple genes or genes carried separately on multiple vectors.
Transgenic plants comprising or derived from plant cells of this invention transformed with recombinant DNA can be further enhanced with stacked traits, e.g. a crop plant having an enhanced trait resulting from expression of DNA disclosed herein in combination with herbicide and/or pest resistance traits. For example, genes of the current invention can be stacked with other traits of agronomic interest, such as a trait providing herbicide resistance, or insect resistance, such as using a gene from Bacillus thuringensis to provide resistance against lepidopteran, coliopteran, homopteran, hemiopteran, and other insects. Herbicides for which transgenic plant tolerance has been demonstrated and the method of the present invention can be applied include, but are not limited to, glyphosate, dicamba, glufosinate, sulfonylurea, bromoxynil and norflurazon herbicides. Polynucleotide molecules encoding proteins involved in herbicide tolerance are well-known in the art and include, but are not limited to, a polynucleotide molecule encoding 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) disclosed in U.S. Pat. Nos. 5,094,945; 5,627,061; 5,633,435 and 6,040,497 for imparting glyphosate tolerance; polynucleotide molecules encoding a glyphosate oxidoreductase (GOX) disclosed in U.S. Pat. No. 5,463,175 and a glyphosate-N-acetyl transferase (GAT) disclosed in US Patent Application Publication 2003/0083480 A1 also for imparting glyphosate tolerance; dicamba monooxygenase disclosed in US Patent Application Publication 2003/0135879 A1 for imparting dicamba tolerance; a polynucleotide molecule encoding bromoxynil nitrilase (Bxn) disclosed in U.S. Pat. No. 4,810,648 for imparting bromoxynil tolerance; a polynucleotide molecule encoding phytoene desaturase (crtl) described in Misawa et al, (1993) Plant J. 4:833-840 and in Misawa et al, (1994) Plant J. 6:481-489 for norflurazon tolerance; a polynucleotide molecule encoding acetohydroxyacid synthase (AHAS, aka ALS) described in Sathasiivan et al. (1990) Nucl. Acids Res. 18:2188-2193 for imparting tolerance to sulfonylurea herbicides; polynucleotide molecules known as bar genes disclosed in DeBlock, et al. (1987) EMBO J. 6:2513-2519 for imparting glufosinate and bialaphos tolerance; polynucleotide molecules disclosed in US Patent Application Publication 2003/010609 A1 for imparting N-amino methyl phosphonic acid tolerance; polynucleotide molecules disclosed in U.S. Pat. No. 6,107,549 for impartinig pyridine herbicide resistance; molecules and methods for imparting tolerance to multiple herbicides such as glyphosate, atrazine, ALS inhibitors, isoxoflutole and glufosinate herbicides are disclosed in U.S. Pat. No. 6,376,754 and US Patent Application Publication 2002/0112260. Molecules and methods for imparting insect/nematode/virus resistance are disclosed in U.S. Pat. Nos. 5,250,515; 5,880,275; 6,506,599; 5,986,175 and US Patent Application Publication 2003/0150017 A1.
Numerous methods for transforming chromosomes in a plant cell nucleus with recombinant DNA are known in the art and are used in methods of preparing a transgenic plant cell nucleus cell, and plant. Two effective methods for such transformation are Agrobacterium-mediated transformation and microprojectile bombardment. Microprojectile bombardment methods are illustrated in U.S. Pat. Nos. 5,015,580 (soybean); 5,550,318 (corn); 5,538,880 (corn); 5,914,451 (soybean); 6,160,208 (corn); 6,399,861 (corn); 6,153,812 (wheat) and 6,365,807 (rice) and Agrobacterium-mediated transformation is described in U.S. Pat. Nos. 5,159,135 (cotton); 5,824,877 (soybean); 5,463,174 (canola); 5,591,616 (corn); 5,846,797 (cotton); 6,384,301 (soybean), 7,026,528 (wheat) and 6,329,571 (rice), US Patent Application Publication 2004/0087030 A1 (cotton), and US Patent Application Publication 2001/0042257 A1 (sugar beet), all of which are incorporated herein by reference for enabling the production of transgenic plants. Transformation of plant material is practiced in tissue culture on a nutrient media, i.e. a mixture of nutrients that will allow cells to grow in vitro. Recipient cell targets include, but are not limited to, meristem cells, hypocotyls, calli, immature embryos and gametic cells such as microspores, pollen, sperm and egg cells. Callus may be initiated from tissue sources including, but not limited to, immature embryos, hypocotyls, seedling apical meristems, microspores and the like. Cells containing a transgenic nucleus are grown into transgenic plants.
In addition to direct transformation of a plant material with a recombinant DNA, a transgenic plant cell nucleus can be prepared by crossing a first plant having cells with a transgenic nucleus with recombinant DNA with a second plant lacking the transgenic nucleus. For example, recombinant DNA can be introduced into a nucleus from a first plant line that is amenable to transformation to transgenic nucleus in cells that are grown into a transgenic plant which can be crossed with a second plant line to introgress the recombinant DNA into the second plant line. A transgenic plant with recombinant DNA providing an enhanced trait, e.g. enhanced yield, can be crossed with transgenic plant line having other recombinant DNA that confers another trait, for example herbicide resistance or pest resistance, to produce progeny plants having recombinant DNA that confers both traits. Typically, in such breeding for combining traits the transgenic plant donating the additional trait is a male line and the transgenic plant carrying the base traits is the female line. The progeny of this cross will segregate such that some of the plants will carry the DNA for both parental traits and some will carry DNA for one parental trait; such plants can be identified by markers associated with parental recombinant DNA, e.g. marker identification by analysis for recombinant DNA or, in the case where a selectable marker is linked to the recombinant, by application of the selecting agent such as a herbicide for use with a herbicide tolerance marker, or by selection for the enhanced trait. Progeny plants carrying DNA for both parental traits can be crossed back into the female parent line multiple times, for example usually 6 to 8 generations, to produce a progeny plant with substantially the same genotype as one original transgenic parental line but for the recombinant DNA of the other transgenic parental line
In the practice of transformation DNA is typically introduced into only a small percentage of target plant cells in any one transformation experiment. Marker genes are used to provide an efficient system for identification of those cells that are stably transformed by receiving and integrating a recombinant DNA molecule into their genomes. Preferred marker genes provide selective markers which confer resistance to a selective agent, such as an antibiotic or a herbicide. Any of the herbicides to which plants of this invention may be resistant are useful agents for selective markers. Potentially transformed cells are exposed to the selective agent. In the population of surviving cells will be those cells where, generally, the resistance-conferring gene is integrated and expressed at sufficient levels to permit cell survival. Cells may be tested further to confirm stable integration of the exogenous DNA. Commonly used selective marker genes include those conferring resistance to antibiotics such as kanamycin and paromomycin (nptII), hygromycin B (aph IV), spectinomycin (aadA) and gentamycin (aac3 and aacC4) or resistance to herbicides such as glufosinate (bar or pat), dicamba (DMO) and glyphosate (aroA or EPSPS). Examples of such selectable markers are illustrated in U.S. Pat. Nos. 5,550,318; 5,633,435; 5,780,708 and 6,118,047. Markers which provide an ability to visually screen transformants can also be employed, for example, a gene expressing a colored or fluorescent protein such as a luciferase or green fluorescent protein (GFP) or a gene expressing a beta-glucuronidase or uidA gene (GUS) for which various chromogenic substrates are known.
Plant cells that survive exposure to the selective agent, or plant cells that have been scored positive in a screening assay, may be cultured in regeneration media and allowed to mature into plants. Developing plantlets regenerated from transformed plant cells can be transferred to plant growth mix, and hardened off, for example, in an environmentally controlled chamber at about 85% relative humidity, 600 ppm CO2, and 25-250 microeinsteins m−2 s−1 of light, prior to transfer to a greenhouse or growth chamber for maturation. Plants are regenerated from about 6 weeks to 10 months after a transformant is identified, depending on the initial tissue, and plant species. Plants may be pollinated using conventional plant breeding methods known to those of skill in the art and seed produced, for example self-pollination is commonly used with transgenic corn. The regenerated transformed plant or its progeny seed or plants can be tested for expression of the recombinant DNA and selected for the presence of enhanced agronomic trait.
Transgenic plants derived from transgenic plant cells having a transgenic nucleus of this invention are grown to generate transgenic plants having an enhanced trait as compared to a control plant and produce transgenic seed and haploid pollen of this invention. Such plants with enhanced traits are identified by selection of transformed plants or progeny seed for the enhanced trait. For efficiency a selection method is designed to evaluate multiple transgenic plants (events) comprising the recombinant DNA, for example multiple plants from 2 to 20 or more transgenic events. Transgenic plants grown from transgenic seed provided herein demonstrate improved agronomic traits that contribute to increased yield or other trait that provides increased plant value, including, for example, improved seed quality. Of particular interest are plants having enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
Table 1 provides a list of protein encoding DNA (“genes”) that are useful as recombinant DNA for production of transgenic plants with enhanced agronomic trait, the elements of Table 1 are described by reference to: “PEP SEQ ID NO” identifies an amino acid sequence from SEQ ID NO: 308 to 614.
“NUC SEQ ID NO” identifies a DNA sequence from SEQ ID NO:1 to 307.
“Gene ID” refers to an arbitrary identifier.
“Gene Name” denotes a common name for the protein encoded by the recombinant DNA preceded by the abbreviated genus and species as fully defined in the sequence listing. The + or − preceding the gene name indicates whether the protein is expressed (+) or suppressed (−) in plants to provide an enhanced trait.
Arabidopsis peptide
Selection Methods for Transgenic Plants with Enhanced Agronomic Trait
Within a population of transgenic plants each regenerated from a plant cell having a nucleus with recombinant DNA many plants that survive to fertile transgenic plants that produce seeds and progeny plants will not exhibit an enhanced agronomic trait. Selection from the population is necessary to identify one or more transgenic plant cells having a transgenic nucleus that can provide plants with the enhanced trait. Transgenic plants having enhanced traits are selected from populations of plants regenerated or derived from plant cells transformed as described herein by evaluating the plants in a variety of assays to detect an enhanced trait, e.g. enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. These assays also may take many forms including, but not limited to, direct screening for the trait in a greenhouse or field trial or by screening for a surrogate trait. Such analyses can be directed to detecting changes in the chemical composition, biomass, physiological properties, morphology of the plant. Changes in chemical compositions such as nutritional composition of grain can be detected by analysis of the seed composition and content of protein, free amino acids, oil, free fatty acids, starch or tocopherols. Changes in biomass characteristics can be made on greenhouse or field grown plants and can include plant height, stem diameter, root and shoot dry weights; and, for corn plants, ear length and diameter. Changes in physiological properties can be identified by evaluating responses to stress conditions, for example assays using imposed stress conditions such as water deficit, nitrogen deficiency, cold growing conditions, pathogen or insect attack or light deficiency, or increased plant density. Changes in morphology can be measured by visual observation of tendency of a transformed plant with an enhanced agronomic trait to also appear to be a normal plant as compared to changes toward bushy, taller, thicker, narrower leaves, striped leaves, knotted trait, chlorosis, albino, anthocyanin production, or altered tassels, ears or roots. Other selection properties include days to pollen shed, days to silking, leaf extension rate, chlorophyll content, leaf temperature, stand, seedling vigor, internode length, plant height, leaf number, leaf area, tittering, brace roots, stay green, stalk lodging, root lodging, plant health, barreness/prolificacy, green snap, and pest resistance. In addition, phenotypic characteristics of harvested grain may be evaluated, including number of kernels per row on the ear, number of rows of kernels on the ear, kernel abortion, kernel weight, kernel size, kernel density and physical grain quality.
Assays for screening for a desired trait are readily designed by those practicing in the art. The following illustrates useful screening assays for corn traits using hybrid corn plants. The assays can be readily adapted for screening other plants such as canola, cotton and soybean either as hybrids or inbreds.
Transgenic corn plants having nitrogen use efficiency are identified by screening in fields with three levels of nitrogen (N) fertilizer being applied, e.g. low level (0 N), medium level (80 lb/ac) and high level (180 lb/ac). Plants with enhanced nitrogen use efficiency provide higher yield as compared to control plants.
Transgenic corn plants having enhanced yield are identified by screening using progeny of the transgenic plants over multiple locations with plants grown under optimal production management practices and maximum weed and pest control. A useful target for improved yield is a 5% to 10% increase in yield as compared to yield produced by plants grown from seed for a control plant. Selection methods may be applied in multiple and diverse geographic locations, for example up to 16 or more locations, over one or more planting seasons, for example at least two planting seasons, to statistically distinguish yield improvement from natural environmental effects.
Transgenic corn plants having enhanced water use efficiency are identified by screening plants in an assay where water is withheld for a period to induce stress followed by watering to revive the plants. For example, a useful selection process imposes 3 drought/re-water cycles on plants over a total period of 15 days after an initial stress free growth period of 11 days. Each cycle consists of 5 days, with no water being applied for the first four days and a water quenching on the 5th day of the cycle. The primary phenotypes analyzed by the selection method are the changes in plant growth rate as determined by height and biomass during a vegetative drought treatment.
Transgenic corn plants having enhanced cold tolerance are identified by screening plants in a cold germination assay and/or a cold tolerance field trial. In a cold germination assay trays of transgenic and control seeds are placed in a growth chamber at 9.7° C. for 24 days (no light). Seeds having higher germination rates as compared to the control are identified as having enhanced cold tolerance. In a cold tolerance field trial plants with enhanced cold tolerance are identified from field planting at an earlier date than conventional Spring planting for the field location. For example, seeds are planted into the ground around two weeks before local farmers begin to plant corn so that a significant cold stress is exerted onto the crop, named as cold treatment. Seeds also are planted under local optimal planting conditions such that the crop has little or no exposure to cold condition, named as normal treatment. At each location, seeds are planted under both cold and normal conditions preferably with multiple repetitions per treatment.
Transgenic corn plants having seeds with increased protein and/or oil levels are identified by analyzing progeny seed for protein and/or oil. Near-infrared transmittance spectrometry is a non-destructive, high-throughput method that is useful to determine the composition of a bulk seed sample for properties listed in table 2.
Although the plant cells and methods of this invention can be applied to any plant cell, plant, seed or pollen, e.g. any fruit, vegetable, grass, tree or ornamental plant, the various aspects of the invention are preferably applied to corn, soybean, cotton, canola, alfalfa, wheat, rice, sugarcane, and sugar beet plants. In many cases the invention is applied to corn plants that are inherently resistant to disease from the Mal de R10Cuarto virus or the Puccina sorghi fungus or both.
The following examples are included to demonstrate aspects of the invention, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific aspects which are disclosed and still obtain a like or similar results without departing from the spirit and scope of the invention.
This example illustrates the construction of plasmids for transferring recombinant DNA into a plant cell nucleus that can be regenerated into transgenic plants.
A base corn transformation vector pMON93039, as set forth in SEQ ID NO:36443, illustrated in Table 3, is fabricated for use in preparing recombinant DNA for Agrobacterium-mediated transformation into corn tissue.
Agrobacterium
Arabidopsis EPSPS
Agrobacterium tumefaciens Ti
Agrobacterium
E. coli
To construct transformation vectors for expressing a protein identified in Table 1, primers for PCR amplification of the protein coding nucleotides are designed at or near the start and stop codons of the coding sequence, in order to eliminate most of the 5′ and 3′ untranslated regions. The protein coding nucleotides are inserted into the base vector in the gene of interest expression cassette at an insertion site, i.e. between the intron element (coordinates 1287-1766) and the polyadenylation element (coordinates 1838-2780).
To construct transformation vectors for suppressing a protein identified in Table 1, the amplified protein coding nucleotides are assembled in a sense and antisense arrangement and inserted into the base vector at the insertion site in the gene of interest expression cassette to provide transcribed RNA that will form a double-stranded RNA for RNA interference suppression of the protein. More specifically, the sense and anti-sense DNA is derived from an endogenous corn gene that expresses a corn protein with an amino acid sequence of SEQ ID NO: 316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 474, 476-487, 502, 532, 541, or 544-545 or the corn homolog of SEQ ID NOs:309, 311, 324, 373, 395, 454, or 518.
Vectors for use in transformation of soybean and canola tissue are prepared having the elements of expression vector pMON82053 (SEQ ID NO: 36444) as shown in Table 4 below.
Agrobacterium T-
Agrobacterium T-
E. coli plasmid ColE1.
To construct transformation vectors for expressing a protein identified in Table 1, primers for PCR amplification of the protein coding nucleotides are designed at or near the start and stop codons of the coding sequence, in order to eliminate most of the 5′ and 3′ untranslated regions. The protein coding nucleotides are inserted into the base vector in the gene of interest expression cassette at an insertion site, i.e. between the promoter element (coordinates 1-613) and the polyadenylation element (coordinates 688-1002).
To construct transformation vectors for suppressing a protein identified in Table 1, the amplified protein coding nucleotides are assembled in a sense and antisense arrangement and inserted into the base vector at the insertion site in the gene of interest expression cassette to provide transcribed RNA that will form a double-stranded RNA for RNA interference suppression of the protein. More specifically, for soybean the sense and anti-sense DNA is derived from an endogenous soybean gene that expresses a soybean protein with an amino acid sequence of SEQ ID NOs: 309, 324, 373, 395, 518 or is a soybean homolog of SEQ ID NOs: 311, 316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 532, 541, or 544-545, and for canola the sense and anti-sense DNA is derived from an endogenous canola gene that encodes the canola homolog of SEQ ID NOs: 309, 311, 316, 324, 344, 346, 350-351, 355, 357, 360-365, 373, 384-388, 393, 395, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 518, 532, 541, or 544-545.
Plasmids for use in transformation of cotton tissue are prepared with elements of expression vector pMON99053 (SEQ ID NO: 36445) as shown in Table 5 below.
Agrobacterium T-
tumefaciens Ti plasmid which functions to
Agrobacterium T-
E. coli plasmid ColE1.
To construct transformation vectors for expressing a protein identified in Table 1, primers for PCR amplification of the protein coding nucleotides are designed at or near the start and stop codons of the coding sequence, in order to eliminate most of the 5′ and 3′ untranslated regions. The protein coding nucleotides are inserted into the base vector in the gene of interest expression cassette at an insertion site, i.e. between the promoter element (coordinates 388-1091) and the polyadenylation element (coordinates 1165-1797).
To construct transformation vectors for suppressing a protein identified in Table 1, the amplified protein coding nucleotides are assembled in a sense and antisense arrangement and inserted into the base vector at the insertion site in the gene of interest expression cassette to provide transcribed RNA that will form a double-stranded RNA for RNA interference suppression of the protein. More specifically, the sense and anti-sense DNA is derived from an endogenous cotton gene that encodes the cotton homolog of SEQ ID NO: 309, 311, 316, 324, 344, 346, 350-351, 355, 357, 360-365, 373, 384-388, 393, 395, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 518, 532, 541, or 544-545.
A base corn transformation vector pMON96782, as set forth in SEQ ID NO: 36446, illustrated in Table 6, is fabricated for use in preparing recombinant DNA for Agrobacterium-mediated transformation into corn tissue.
Agrobacterium
Agrobacterium
E. coli
E. coli colE1 plasmid.
Primers for PCR amplification of protein coding nucleotides of the genes of interest are designed at or near the start and stop codons of the coding sequence, in order to eliminate most of the 5′ and 3′ untranslated regions. Protein coding regions of genes encoding a first and second protein of interest are amplified. The amplified region from the first gene of interest is cloned between nucleotides 1801 and 1834 of the base vector and the amplified region from the second gene of interest is cloned between nucleotides 3883 and 3918 of the base vector.
This example illustrates transformation methods useful in producing a transgenic nucleus in a corn plant cell, and the plants, seeds and pollen produced from a transgenic cell with such a nucleus having an enhanced trait, i.e. enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. A plasmid vector is prepared by cloning the DNA of SEQ ID NO:1 into the gene of interest expression cassette in the base vector for use in corn transformation of corn tissue provided in Example 1, Table 3.
For Agrobacterium-mediated transformation of corn embryo cells corn plants of a readily transformable line are grown in the greenhouse and ears are harvested when the embryos are 1.5 to 2.0 mm in length. Ears are surface sterilized by spraying or soaking the ears in 80% ethanol, followed by air drying Immature embryos are isolated from individual kernels on surface sterilized ears. Prior to inoculation of maize cells, Agrobacterium cells are grown overnight at room temperature Immature maize embryo cells are inoculated with Agrobacterium shortly after excision, and incubated at room temperature with Agrobacterium for 5-20 minutes. Immature embryo plant cells are then co-cultured with Agrobacterium for 1 to 3 days at 23° C. in the dark. Co-cultured embryos are transferred to selection media and cultured for approximately two weeks to allow embryogenic callus to develop. Embryogenic callus is transferred to culture medium containing 100 mg/L paromomycin and subcultured at about two week intervals. Transformed plant cells are recovered 6 to 8 weeks after initiation of selection.
For Agrobacterium-mediated transformation of maize callus immature embryos are cultured for approximately 8-21 days after excision to allow callus to develop. Callus is then incubated for about 30 minutes at room temperature with the Agrobacterium suspension, followed by removal of the liquid by aspiration. The callus and Agrobacterium are co-cultured without selection for 3-6 days followed by selection on paromomycin for approximately 6 weeks, with biweekly transfers to fresh media. Paromomycin resistant calli are identified about 6-8 weeks after initiation of selection.
To regenerate transgenic corn plants a callus of transgenic plant cells resulting from transformation and selection is placed on media to initiate shoot development into plantlets which are transferred to potting soil for initial growth in a growth chamber at 26° C. followed by a mist bench before transplanting to 5 inch pots where plants are grown to maturity. The regenerated plants are self-fertilized and seed is harvested for use in one or more methods to select seeds, seedlings or progeny second generation transgenic plants (R2 plants) or hybrids, e.g. by selecting transgenic plants exhibiting an enhanced trait as compared to a control plant.
The above process is repeated to produce multiple events of transgenic corn plant cells that are transformed with recombinant DNA from each of the genes identified in Table 1. Events are designed to produce in the transgenic cells one of the proteins identified in Table 1, except the proteins of SEQ ID NOs: 316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 474, 476-487, 502, 532, 541, and 544-545, and the corn homolog of SEQ ID NOs: 309, 311, 324, 373, 395, 454, and 518 which are suppressed. Progeny transgenic plants and seed of the transformed plant cells are screened for enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. From each group of multiple events of transgenic plants with a specific recombinant DNA from Table 1 the event that produces the greatest enhancement in yield, water use efficiency, nitrogen use efficiency, enhanced cold tolerance, enhanced seed protein and enhanced seed oil is identified and progeny seed is selected for commercial development.
This example illustrates plant transformation useful in producing a transgenic nucleus in a soybean plant cell, and the plants, seeds and pollen produced from a transgenic cell with such a nucleus having an enhanced trait, i.e. enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
For Agrobacterium mediated transformation, soybean seeds are imbided overnight and the meristem explants excised. The explants are placed in a wounding vessel. Soybean explants and induced Agrobacterium cells from a strain containing plasmid DNA with the gene of interest cassette and a plant selectable marker cassette are mixed no later than 14 hours from the time of initiation of seed imbibition, and wounded using sonication. Following wounding, explants are placed in co-culture for 2-5 days at which point they are transferred to selection media for 6-8 weeks to allow selection and growth of transgenic shoots. Resistant shoots are harvested approximately 6-8 weeks and placed into selective rooting media for 2-3 weeks. Shoots producing roots are transferred to the greenhouse and potted in soil. Shoots that remain healthy on selection, but do not produce roots are transferred to non-selective rooting media for an additional two weeks. Roots from any shoots that produce roots off selection are tested for expression of the plant selectable marker before they are transferred to the greenhouse and potted in soil.
The above process is repeated to produce multiple events of transgenic soybean plant cells that are transformed with recombinant DNA from each of the genes identified in Table 1. Events are designed to produce in the transgenic cells one of the proteins identified in Table 1, except the proteins of SEQ ID NOs: 309, 324, 373, 395, and 518 and the soybean homologs of SEQ ID NOs: 311, 316, 344, 346, 350-351, 355, 357, 360-365, 384-388, 393, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 532, 541, and 544-545, which are suppressed. Progeny transgenic plants and seed of the transformed plant cells are screened for enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced seed protein and enhanced seed oil. From each group of multiple events of transgenic plants with a specific recombinant DNA from Table 1 the event that produces the greatest enhancement in yield, water use efficiency, nitrogen use efficiency, enhanced cold tolerance, enhanced seed protein and enhanced seed oil is identified and progeny seed is selected for commercial development.
This example illustrates plant transformation useful in producing a transgenic nucleus in a cotton plant cell, and the plants, seeds and pollen produced from a transgenic cell with such a nucleus having an enhanced trait, i.e. enhanced water use efficiency, increased yield, enhanced nitrogen use efficiency and enhanced seed oil.
Transgenic cotton plants containing each recombinant DNA having a sequence of SEQ ID NO: 1 through SEQ ID NO: 307 are obtained by transforming with recombinant DNA from each of the genes identified in Table 1 using Agrobacterium-mediated transformation. The above process is repeated to produce multiple events of transgenic cotton plant cells that are transformed with recombinant DNA from each of the genes identified in Table 1. Events are designed to produce in the transgenic cells one of the proteins identified in Table 1, except the cotton homologs of the proteins of SEQ ID NOs: 309, 311, 316, 324, 344, 346, 350-351, 355, 357, 360-365, 373, 384-388, 393, 395, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 518, 532, 541, and 544-545 which are suppressed.
From each group of multiple events of transgenic plants with a specific recombinant DNA from Table 1 the event that produces the greatest enhancement in yield, water use efficiency, nitrogen use efficiency, enhanced cold tolerance, enhanced seed protein and enhanced seed oil is identified and progeny seed is selected for commercial development.
Progeny transgenic plants are selected from a population of transgenic cotton events under specified growing conditions and are compared with control cotton plants. Control cotton plants are substantially the same cotton genotype but without the recombinant DNA, for example, either a parental cotton plant of the same genotype that was not transformed with the identical recombinant DNA or a negative isoline of the transformed plant. Additionally, a commercial cotton cultivar adapted to the geographical region and cultivation conditions, i.e. cotton variety ST474, cotton variety FM 958, and cotton variety Siam L-23, are used to compare the relative performance of the transgenic cotton plants containing the recombinant DNA.
Transgenic cotton plants with enhanced yield and water use efficiency are identified by growing under variable water conditions. Specific conditions for cotton include growing a first set of transgenic and control plants under “wet” conditions, i.e. irrigated in the range of 85 to 100 percent of evapotranspiration to provide leaf water potential of −14 to −18 bars, and growing a second set of transgenic and control plants under “dry” conditions, i.e. irrigated in the range of 40 to 60 percent of evapotranspiration to provide a leaf water potential of −21 to −25 bars. Pest control, such as weed and insect control is applied equally to both wet and dry treatments as needed. Data gathered during the trial includes weather records throughout the growing season including detailed records of rainfall; soil characterization information; any herbicide or insecticide applications; any gross agronomic differences observed such as leaf morphology, branching habit, leaf color, time to flowering, and fruiting pattern; plant height at various points during the trial; stand density; node and fruit number including node above white flower and node above crack boll measurements; and visual wilt scoring. Cotton boll samples are taken and analyzed for lint fraction and fiber quality. The cotton is harvested at the normal harvest timeframe for the trial area. Enhanced water use efficiency is indicated by increased yield, improved relative water content, enhanced leaf water potential, increased biomass, enhanced leaf extension rates, and improved fiber parameters.
This example illustrates plant transformation useful in producing the transgenic canola plants of this invention and the production and identification of transgenic seed for transgenic canola having enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
Tissues from in vitro grown canola seedlings are prepared and inoculated with overnight-grown Agrobacterium cells containing plasmid DNA with the gene of interest cassette and a plant selectable marker cassette. Following co-cultivation with Agrobacterium, the infected tissues are allowed to grow on selection to promote growth of transgenic shoots, followed by growth of roots from the transgenic shoots. The selected plantlets are then transferred to the greenhouse and potted in soil. Molecular characterizations are performed to confirm the presence of the gene of interest, and its expression in transgenic plants and progenies. Progeny transgenic plants are selected from a population of transgenic canola events under specified growing conditions and are compared with control canola plants. Control canola plants are substantially the same canola genotype but without the recombinant DNA, for example, either a parental canola plant of the same genotype that is not transformed with the identical recombinant DNA or a negative isoline of the transformed plant.
Transgenic canola plant cells are transformed with each of the recombinant DNA identified in Table 1. The above process is repeated to produce multiple events of transgenic canola plant cells that are transformed with recombinant DNA from each of the genes identified in Table 1. Events are designed to produce in the transgenic cells one of the proteins identified in Table 1, except the canola homologs of the proteins of SEQ ID NOs: 309, 311, 316, 324, 344, 346, 350-351, 355, 357, 360-365, 373, 384-388, 393, 395, 397-400, 404, 413, 415-416, 425-426, 454, 474, 476-487, 502, 518, 532, 541, and 544-545 which are suppressed. Progeny transgenic plants and seed of the transformed plant cells are screened for enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced seed protein and enhanced seed oil. From each group of multiple events of transgenic plants with a specific recombinant DNA from Table 1 the event that produces the greatest enhancement in yield, water use efficiency, nitrogen use efficiency, enhanced cold tolerance, enhanced seed protein and enhanced seed oil is identified and progeny seed is selected for commercial development.
This example illustrates the identification of homologs of proteins encoded by the DNA identified in Table 1 which is used to provide transgenic seed and plants having enhanced agronomic traits. From the sequence of the homologs, homologous DNA sequence can be identified for preparing additional transgenic seeds and plants of this invention with enhanced agronomic traits.
An “All Protein Database” was constructed of known protein sequences using a proprietary sequence database and the National Center for Biotechnology Information (NCBI) non-redundant amino acid database (nr.aa). For each organism from which a polynucleotide sequence provided herein was obtained, an “Organism Protein Database” was constructed of known protein sequences of the organism; it is a subset of the All Protein Database based on the NCBI taxonomy ID for the organism.
The All Protein Database was queried using amino acid sequences provided herein as SEQ ID NO: 308 through SEQ ID NO: 614 using NCBI “blastp” program with E-value cutoff of 1e-8. Up to 1000 top hits were kept, and separated by organism names. For each organism other than that of the query sequence, a list was kept for hits from the query organism itself with a more significant E-value than the best hit of the organism. The list contains likely duplicated genes of the polynucleotides provided herein, and is referred to as the Core List. Another list was kept for all the hits from each organism, sorted by E-value, and referred to as the Hit List.
The Organism Protein Database was queried using polypeptide sequences provided herein as SEQ ID NO: 308 through SEQ ID NO: 614 using NCBI “blastp” program with E-value cutoff of 1e-4. Up to 1000 top hits were kept. A BLAST searchable database was constructed based on these hits, and is referred to as “SubDB”. SubDB is queried with each sequence in the Hit List using NCBI “blastp” program with E-value cutoff of 1e-8. The hit with the best E-value was compared with the Core List from the corresponding organism. The hit is deemed a likely ortholog if it belongs to the Core List, otherwise it is deemed not a likely ortholog and there is no further search of sequences in the Hit List for the same organism. For proteins of table 1 having an identified Pfam domain module, homology to reported homologs was further confirmed by searching the respective identified homologs for conservation of the Pfam domain module identified in the protein from table 1. Homologs from a large number of distinct organisms were identified and are reported below in table 7 with the SEQ ID NO of the DNA sequence corresponding to original protein query sequence and the identified homologs as [SEQ ID NO]: [Homolog SEQ ID N0s].
Recombinant DNA constructs are prepared using the DNA encoding each of the identified homologs and the constructs are used to prepare multiple events of transgenic corn, soybean, canola and cotton plants as illustrated in Examples 2-5. Plants are regenerated from the transformed plant cells and used to produce progeny plants and seed that are screened for enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. From each group of multiple events of transgenic plants with a specific recombinant DNA for a homolog the event that produces the greatest enhancement in yield, water use efficiency, nitrogen use efficiency, enhanced cold tolerance, enhanced seed protein and enhanced seed oil is identified and progeny seed is selected for commercial development.
This example illustrates the identification of consensus amino acid sequence for the proteins and homologs encoded by DNA that is used to prepare the transgenic seed and plants of this invention having enhanced agronomic traits.
ClustalW program was selected for multiple sequence alignments of the amino acid sequence of SEQ ID NO: 311, 314, 316, 333, 344, 346, 365, 373, 383-386, 404-405, 426, 449, 461, 466, 478, 488, 490-491, 512, 541, 543-544, 559, 579, 583, 603, and 610-611 and their homologs. Three major factors affecting the sequence alignments dramatically are (1) protein weight matrices; (2) gap open penalty; (3) gap extension penalty. Protein weight matrices available for ClustalW program include Blosum, Pam and Gonnet series. Those parameters with gap open penalty and gap extension penalty were extensively tested. On the basis of the test results, Blosum weight matrix, gap open penalty of 10 and gap extension penalty of 1 were chosen for multiple sequence alignment.
The consensus amino acid sequence can be used to identify DNA corresponding to the full scope of this invention that is useful in providing transgenic plants, for example corn and soybean plants with enhanced agronomic traits, for example improved nitrogen use efficiency, improved yield, improved water use efficiency and/or improved growth under cold stress, due to the expression in the plants of DNA suppressing a protein with amino acid sequence identical to the consensus amino acid sequence.
The SEQ ID NOs for the identified consensus sequences are reported in table 8 below and the full consensus sequences are provided in the attached sequence listing.
This example illustrates the identification of domain and domain module by Pfam analysis.
The amino acid sequence of the expressed proteins that are shown to be associated with an enhanced trait were analyzed for Pfam protein family against the current Pfam collection of multiple sequence alignments and hidden Markov models using the HMMER software. The Pfam protein domains and modules for the proteins of SEQ ID NOs: 308-310, 312-313, 315, 317-332, 334-343, 345, 347-364, 366-372, 374, 382, 387-403, 406-412, 414-425, 427-448, 450-460, 462-465, 467-477, 479-482, 484-487, 493-511, 513-539, 542, 545-558, 560-578, 580-582, 584-598, 600-602, 604-608, and 612-614 are shown in Tables 9, 10 and 11. The Hidden Markov model databases for the identified patent families are also available from the Pfam consortium (ftp.sanger.ac.uk/pub/databases/Pfam/) allowing identification of other homologous proteins and their cognate encoding DNA to enable the full breadth of the invention for a person of ordinary skill in the art. Certain proteins are identified by a single Pfam domain and others by multiple Pfam domains. The function of the identified Pfam domains in proteins providing an enhanced trait in plants was verified by searching identified homologs for the conservation of the identified Pfam domains. The score value for the identified Pfam domains in sequences from table 1 and the minimum score value for the Pfam domain between a protein from table 1 and its identified homologs are reported below in table 9.
This application claims benefit under 35 USC §119(e) of U.S. provisional application Ser. Nos. 61/101,722, filed Oct. 1, 2008, 61/139,164, filed Dec. 19, 2008, 61/148,438, filed Jan. 30, 2009, 61/155,950, filed Feb. 23, 2009, 61/164,664, filed Mar. 30, 2009, 61/182,785, filed Jun. 1, 2009 and 61/226,953, filed Jul. 20, 2009, each of which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2009/058908 | 9/30/2009 | WO | 00 | 9/16/2011 |
Number | Date | Country | |
---|---|---|---|
61101722 | Oct 2008 | US | |
61139164 | Dec 2008 | US | |
61148438 | Jan 2009 | US | |
61155950 | Feb 2009 | US | |
61164664 | Mar 2009 | US | |
61182785 | Jun 2009 | US | |
61226953 | Jul 2009 | US |