Genes and uses for plant improvement

Abstract
Transgenic seed for crops with improved traits are provided by trait-improving recombinant DNA in the nucleus of cells of the seed where plants grown from such transgenic seed exhibit one or more improved traits as compared to a control plant. Of particular interest are transgenic plants that have increased yield. The present invention also provides recombinant DNA molecules for expression of a protein, and recombinant DNA molecules for suppression of a protein.
Description
INCORPORATION OF SEQUENCE LISTING

Two copies of the sequence listing (Copy 1 and Copy 2) and a computer readable form (CRF) of the sequence listing, all on CD-R's, each containing the file named 38-21(53708)C_seqListing.txt, which is 95,642,000 bytes (measured in MS-WINDOWS) and was created on May 9, 2006, are incorporated herein by reference in their entirety.


INCORPORATION OF TABLES

Two copies of Table 2 on CD-R's, each containing the file named 38-21(53708)C_table2.doc, which is 324,000 bytes when measured in MS-WINDOWS) operating system, was created on May 9, 2006, and comprises 77 pages when viewed in MS Word® (program, are incorporated herein by reference in their entirety.


INCORPORATION OF COMPUTER PROGRAM LISTING

A Computer Program Listing with folders “hmmer-2.3.2” and “288pfamDir” is contained on a CD-R and is incorporated herein by reference in their entirety. Folder hmmer-2.3.2 contains the source code and other associated file for implementing the HMMer software for Pfam analysis. Folder 228pfamDir contains 288 Pfam Hidden Markov Models. Both folders were created on the disk on May 9, 2006, having a total size of 48,746,496 bytes when measured in MS-WINDOWS® (operating system.


FIELD OF THE INVENTION

Disclosed herein are transgenic plant cells, plants and seeds comprising recombinant DNA and methods of making and using such plant cells, plants and seeds


BACKGROUND OF THE INVENTION

Transgenic plants with improved traits such as improved yield, environmental stress tolerance, pest resistance, herbicide tolerance, modified seed compositions, and the like are desired by both farmers and consumers. Although considerable efforts in plant breeding have provided significant gains in desired traits, the ability to introduce specific DNA into plant genomes provides further opportunities for generation of plants with improved and/or unique traits. The ability to develop transgenic plants with improved traits depends in part on the identification of useful recombinant DNA for production of transformed plants with improved properties, e.g. by actually selecting a transgenic plant from a screen for such improved property.


SUMMARY OF THE INVENTION

This invention provides plant cell nuclei with recombinant that imparts enhanced agronomic traits in transgenic plants having the nuclei in their cells. Recombinant DNA in this invention is provided in a construct comprising a promoter that is functional in plant cells and that is operably linked to DNA that encodes a protein having at least one amino acid domain in a sequence that exceeds the Pfam gathering cutoff for amino acid sequence alignment with a protein domain family identified by a Pfam name in the group of Pfam domain names identified in Table 17. In more specific embodiments of the invention plant cells are provided which express a protein having amino acid sequence with at least 90% identity to a consensus amino acid sequence in the group of consensus amino acid sequences consisting of the consensus amino acid sequence constructed for SEQ ID NO: 426 and homologs thereof listed in Table 2 through the consensus amino acid sequence constructed for SEQ ID NO: 850 and homologs thereof listed in Table 2.


Amino acid sequences of homologs are SEQ ID NO: 851 through 33634. In even more specific embodiments of the invention the protein expressed in plant cells is a protein selected from the group of proteins identified in Table 1 by annotation to a related protein in Genbank and alternatively identified in Table 16 by identification of protein domain family.


Other aspects of the invention are specifically directed to transgenic plant cells, and transgenic plants comprising a plurality of plant cells with such nuclei, progeny transgenic seed, embryo and transgenic pollen from such plants. Such plant cell nuclei are selected from a population of transgenic plants regenerated from plant cells with a nucleus transformed with recombinant DNA by screening the transgenic plants in the population for an enhanced trait as compared to control plants that do not have the recombinant DNA in their nucleus, where the enhanced trait is enhanced water use efficiency, enhanced cold tolerance, enhanced heat tolerance, enhanced shade tolerance, enhanced tolerance to salt exposure, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil. In some aspects of the invention the recombinant DNA expresses a protein that imparts the enhanced trait; in other aspects of the invention the recombinant DNA expresses RNA for suppressing the level of an endogenous protein. In yet another aspect of the invention the nucleus of plant cells in plants, seeds, embryo and pollen further comprise DNA expressing a protein that provides tolerance from exposure to an herbicide applied at levels that are lethal to a wild type of said plant cell. Such tolerance is especially useful not only as an advantageous trait in such plants but is also useful in a selection step in the methods of the invention. In aspects of the invention the agent of such herbicide is a glyphosate, dicamba, or glufosinate compound.


Yet other aspects of the invention provide nuclei is cells of transgenic plants which are homozygous for the recombinant DNA and transgenic seed of the invention from corn, soybean, cotton, canola, alfalfa, wheat or rice plants. In other important embodiments for practice of various aspects of the invention in Argentina the recombinant DNA in the nucleus is provided in plant cells derived from corn lines that that are and maintain resistance to a virus such as the Mal de Rio Cuarto virus or a fungus such as the Puccina sorghi fungus or to both.


This invention also provides methods for manufacturing non-natural, transgenic seed that can be used to produce a crop of transgenic plants with an enhanced trait resulting from expression of stably-integrated, recombinant DNA in the nucleus of the plant cells. In some aspects of the invention the recombinant DNA can express a protein having at least one domain of amino acids in a sequence that exceeds the Pfam gathering cutoff for amino acid sequence alignment with a protein domain family identified by a Pfam name in the group of Pfam names identified in Table 17; in other aspects the recombinant DNA suppresses the level of a such a protein More specifically the method comprises (a) screening a population of plants for an enhanced trait and recombinant DNA, where individual plants in the population can exhibit the trait at a level less than, essentially the same as or greater than the level that the trait is exhibited in control plants which do not express the recombinant DNA; (b) selecting from the population one or more plants that exhibit the trait at a level greater than the level that said trait is exhibited in control plants; (c) verifying that the recombinant DNA is stably integrated in said selected plants; (d) analyzing tissue of a selected plant to determine the production of a protein having the function of a protein encoded by nucleotides in a sequence of one of SEQ ID NO:1-425; and (e) collecting seed from a selected plant. In one aspect of the invention the plants in the population further comprise DNA expressing a protein that provides tolerance to exposure to an herbicide applied at levels that are lethal to wild type plant cells and where the selecting is effected by treating the population with the herbicide, e.g. a glyphosate, dicamba, or glufosinate compound. In another aspect of the invention the plants are selected by identifying plants with the enhanced trait. The methods are especially useful for manufacturing corn, soybean, cotton, alfalfa, wheat or rice seed selected as having one of the enhanced traits described above.


Another aspect of the invention provides a method of producing hybrid corn seed comprising acquiring hybrid corn seed from a herbicide tolerant corn plant which also has a nucleus of this invention with stably-integrated, recombinant DNA The method further comprises producing corn plants from said hybrid corn seed, where a fraction of the plants produced from said hybrid corn seed is homozygous for said recombinant DNA, a fraction of the plants produced from said hybrid corn seed is hemizygous for said recombinant DNA, and a fraction of the plants produced from said hybrid corn seed has none of said recombinant DNA; selecting corn plants which are homozygous and hemizygous for said recombinant DNA by treating with an herbicide; collecting seed from herbicide-treated-surviving corn plants and planting said seed to produce further progeny corn plants; repeating the selecting and collecting steps at least once to produce an inbred corn line; and crossing the inbred corn line with a second corn line to produce hybrid seed.


Another aspect of the invention provides a method of selecting a plant comprising a nucleus of this invention in its plant cells by using an immunoreactive antibody to detect the presence of protein expressed by recombinant DNA in seed or plant tissue. Another aspect of the invention provides anti-counterfeit milled seed having, as an indication of origin, a nucleus of this invention with unique recombinant DNA.


Aspects of the invention relating to nucleus in plant cells having recombinant DNA for suppressing the expression of a protein are identified in Table 1 and Table 16.


More specific aspects of the invention provide plant cells having recombinant DNA for suppressing the expression of a protein having the function in a plant of the protein with amino acid sequence of SEQ ID NO: 426, 428, 429, 430, 524, 525, 541, 601, 602, 650, 651, 654, 655, 657, 660, 694, 698, 772, 801 or the corresponding Pfam identified in Table 16, i.e. Histone, WD40, NPH3, FHA, PB1, ADH_zinc_N, NAPRTase, ADK_lid, p450, B56, DUF231, C2, DUF568, WD40, F-box, Pkinase, Terpene_synth, respectively. Such suppression can be effected by any of a number of ways known in the art, e.g. anti-sense suppression, RNAi or mutation knockout and the like.


Another aspect of this invention relates to growing transgenic plants with enhanced water use efficiency or enhanced nitrogen use efficiency. For instance, this invention provides methods of growing a corn, cotton or soybean crop without irrigation water comprising planting seed having plant cells of the invention which are selected for enhanced water use efficiency. Alternatively methods comprise applying reduced irrigation water, e.g. providing up to 300 millimeters of ground water during the production of a corn crop. This invention also provides methods of growing a corn, cotton or soybean crop without added nitrogen fertilizer comprising planting seed having plant cells of the invention which are selected for enhanced nitrogen use efficiency. Alternatively methods comprise applying reduced amount of nitrogen input as compared to the conventional input during the production of a corn crop.


The various aspects of this invention are especially useful for transgenic plant cells in seeds and transgenic plants having any of the above-described enhanced traits in crop plants such as corn (maize), soybean, cotton, canola (rape), wheat, sunflower, sorghum, alfalfa, barley, millet, rice, tobacco, fruit and vegetable crops, and turfgrass.


The invention also provides recombinant DNA constructs comprising the DNA useful in the nuclei in plant cells for imparting enhanced traits in plants having those cells.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a consensus amino acid sequence of SEQ ID NO: 601 and homologs.



FIGS. 2 and 3 are plasmid maps.




DETAILED DESCRIPTION OF THE INVENTION

In the attached sequence listing:


SEQ ID NO:1-425 are nucleotide sequences of the coding strand of DNA for “genes” used in the recombinant DNA imparting an enhanced trait in plant cells, i.e. each represents a coding sequence for a protein;


SEQ ID NO:426-850 are amino acid sequences of the cognate protein of the “genes” with nucleotide coding sequence 1-425;


SEQ ID NO:851-33634 are amino acid sequences of homologous proteins;


SEQ ID NO:33635 is a consensus amino acid sequence.


SEQ ID NO:33636 is a nucleotide sequence of a plasmid base vector useful for corn transformation; and


SEQ ID NO:33637 is a DNA sequence of a plasmid base vector useful for soybean transformation.


The nuclei of this invention are identified by screening transgenic plants for one or more traits including improved drought stress tolerance, improved heat stress tolerance, improved cold stress tolerance, improved high salinity stress tolerance, improved low nitrogen availability stress tolerance, improved shade stress tolerance, improved plant growth and development at the stages of seed imbibition through early vegetative phase, and improved plant growth and development at the stages of leaf development, flower production and seed maturity.


“Gene” refers to chromosomal DNA, plasmid DNA, cDNA, synthetic DNA, or other DNA that encodes a peptide, polypeptide, protein, or RNA molecule, and regions flanking the coding sequences involved in the regulation of expression. In aspects of the invention where an improved trait is provided by expression of a protein, “gene” refers at least to coding nucleotide sequence for a protein or a function polypeptide fragment of a protein that imparts the trait. In aspects of the invention where an improved trait is provided by suppression of expression of an endogenous protein, “gene” refers to any part of the gene that can be a target for suppression.


“Transgenic seed” means a plant seed whose nucleus has been altered by the incorporation of recombinant DNA, e.g., by transformation as described herein. The term “transgenic plant” is used to refer to the plant produced from an original transformation event, or progeny from later generations or crosses of a plant to a transformed plant, so long as the progeny contains a nucleus with the recombinant DNA in its genome.


“Recombinant DNA” a polynucleotide having a genetically engineered modification introduced through combination of endogenous and/or exogenous elements in a transcription unit, manipulation via mutagenesis, restriction enzymes, and the like or simply by inserting multiple copies of a native transcription unit. Recombinant DNA may comprise DNA segments obtained from different sources, or DNA segments obtained from the same source, but which have been manipulated to join DNA segments which do not naturally exist in the joined form. A recombinant polynucleotide may exist outside of the cell, for example as a PCR fragment, or integrated into a genome, such as a plant genome.


“Trait” means a physiological, morphological, biochemical, or physical characteristic of a plant or particular plant material or cell. In some instances, this characteristic is visible to the human eye, such as seed or plant size, or can be measured by biochemical techniques, such as detecting the protein, starch, or oil content of seed or leaves, or by observation of a metabolic or physiological process, e.g., by measuring uptake of carbon dioxide, or by the observation of the expression level of a gene or genes, e.g., by employing Northern analysis, RT-PCR, microarray gene expression assays, or reporter gene expression systems, or by agricultural observations such as stress tolerance, yield, or pathogen tolerance.


A “control plant” is a plant without trait-improving recombinant DNA in its nucleus. A control plant is used to measure and compare trait improvement in a transgenic plant with such trait-improving recombinant DNA. A suitable control plant may be a non-transgenic plant of the parental line used to generate a transgenic plant herein. Alternatively, a control plant may be a transgenic plant that comprises an empty vector or marker gene, but does not contain the recombinant DNA that produces the trait improvement. A control plant may also be a negative segregant progeny of hemizygous transgenic plant. In certain demonstrations of trait improvement, the use of a limited number of control plants can cause a wide variation in the control dataset. To minimize the effect of the variation within the control dataset, a “reference” is used. As use herein a “reference” is a trimmed mean of all data from both transgenic and control plants grown under the same conditions and at the same developmental stage. The trimmed mean is calculated by eliminating a specific percentage, i.e., 20%, of the smallest and largest observation from the data set and then calculating the average of the remaining observation.


“Trait improvement” means a detectable and desirable difference in a characteristic in a transgenic plant relative to a control plant or a reference. In some cases, the trait improvement can be measured quantitatively. For example, the trait improvement can entail at least a 2% desirable difference in an observed trait, at least a 5% desirable difference, at least about a 10% desirable difference, at least about a 20% desirable difference, at least about a 30% desirable difference, at least about a 50% desirable difference, at least about a 70% desirable difference, or at least about a 100% difference, or an even greater desirable difference. In other cases, the trait improvement is only measured qualitatively. It is known that there can be a natural variation in a trait. Therefore, the trait improvement observed entails a change of the normal distribution of the trait in the transgenic plant compared with the trait distribution observed in a control plant or a reference, which is evaluated by statistical methods provided herein. Trait improvement includes, but is not limited to, yield increase, including increased yield under non-stress conditions and increased yield under environmental stress conditions. Stress conditions may include, for example, drought, shade, fungal disease, viral disease, bacterial disease, insect infestation, nematode infestation, cold temperature exposure, heat exposure, osmotic stress, reduced nitrogen nutrient availability, reduced phosphorus nutrient availability and high plant density.


Many agronomic traits can affect “yield”, including without limitation, plant height, pod number, pod position on the plant, number of internodes, incidence of pod shatter, grain size, efficiency of nodulation and nitrogen fixation, efficiency of nutrient assimilation, resistance to biotic and abiotic stress, carbon assimilation, plant architecture, resistance to lodging, percent seed germination, seedling vigor, and juvenile traits. Other traits that can affect yield include, efficiency of germination (including germination in stressed conditions), growth rate (including growth rate in stressed conditions), ear number, seed number per ear, seed size, composition of seed (starch, oil, protein) and characteristics of seed fill. Also of interest is the generation of transgenic plants that demonstrate desirable phenotypic properties that may or may not confer an increase in overall plant yield. Such properties include enhanced plant morphology, plant physiology or improved components of the mature seed harvested from the transgenic plant.


“Yield-limiting environment” means the condition under which a plant would have the limitation on yield including environmental stress conditions.


“Stress condition” means a condition unfavorable for a plant, which adversely affect plant metabolism, growth and/or development. A plant under the stress condition typically shows reduced germination rate, retarded growth and development, reduced photosynthesis rate, and eventually leading to reduction in yield. Specifically, “water deficit stress” used herein preferably refers to the sub-optimal conditions for water and humidity needed for normal growth of natural plants. Relative water content (RWC) can be used as a physiological measure of plant water deficit. It measures the effect of osmotic adjustment in plant water status, when a plant is under stressed conditions. Conditions which may result in water deficit stress include heat, drought, high salinity and PEG induced osmotic stress.


“Cold stress” means the exposure of a plant to a temperatures below (two or more degrees Celsius below) those normal for a particular species or particular strain of plant.


“Nitrogen nutrient” means any one or any mix of the nitrate salts commonly used as plant nitrogen fertilizer, including, but not limited to, potassium nitrate, calcium nitrate, sodium nitrate, ammonium nitrate. The term ammonium as used herein means any one or any mix of the ammonium salts commonly used as plant nitrogen fertilizer, e.g., ammonium nitrate, ammonium chloride, ammonium sulfate, etc.


“Low nitrogen availability stress” means a plant growth condition that does not contain sufficient nitrogen nutrient to maintain a healthy plant growth and/or for a plant to reach its typical yield under a sufficient nitrogen growth condition. For example, a limiting nitrogen condition can refers to a growth condition with 50% or less of the conventional nitrogen inputs. “Sufficient nitrogen growth condition” means a growth condition where the soil or growth medium contains or receives optimal amounts of nitrogen nutrient to sustain a healthy plant growth and/or for a plant to reach its typical yield for a particular plant species or a particular strain. One skilled in the art would recognize what constitute such soil, media and fertilizer inputs for most plant species.


“Shade stress” means a growth condition that has limited light availability that triggers the shade avoidance response in plant. Plants are subject to shade stress when localized at lower part of the canopy, or in close proximity of neighboring vegetation. Shade stress may become exacerbated when the planting density exceeds the average prevailing density for a particular plant species. The average prevailing densities per acre of a few examples of crop plants in the USA in the year 2000 were: wheat 1,000,000-1,500,000; rice 650,000-900,000; soybean 150,000-200,000, canola 260,000-350,000, sunflower 17,000-23,000 and cotton 28,000-55,000 plants per acre (Cheikh, e.g., (2003) U.S. Patent Application No. 20030101479).


“Increased yield” of a transgenic plant of the present invention is evidenced and measured in a number of ways, including test weight, seed number per plant, seed weight, seed number per unit area (i.e., seeds, or weight of seeds, per acre), bushels per acre, tons per acre, tons per acre, kilo per hectare. For example, maize yield can be measured as production of shelled corn kernels per unit of production area, e.g., in bushels per acre or metric tons per hectare, often reported on a moisture adjusted basis, e.g., at 15.5% moisture. Increased yield can result from improved utilization of key biochemical compounds, such as nitrogen, phosphorous and carbohydrate, or from improved tolerance to environmental stresses, such as cold, heat, drought, salt, and attack by pests or pathogens. Trait-improving recombinant DNA can also be used to provide transgenic plants having improved growth and development, and ultimately increased yield, as the result of modified expression of plant growth regulators or modification of cell cycle or photosynthesis pathways.


“Expression” means transcription of DNA to produce RNA. The resulting RNA may be without limitation mRNA encoding a protein, antisense RNA, or a double-stranded RNA for use in RNAi technology. Expression also refers to production of encoded protein from mRNA.


A “plant promoter” is a promoter capable of initiating transcription in plant cells whether or not its origin is a plant cell. Exemplary plant promoters include, but are not limited to, those that are obtained from plants, plant viruses, and bacteria which comprise genes expressed in plant cells such Agrobacterium or Rhizobium. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, or seeds. Such promoters are referred to as “tissue preferred”. Promoters which initiate transcription only in certain tissues are referred to as “tissue specific”. A “cell type” specific promoter primarily drives expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An “inducible” or “repressible” promoter is a promoter which is under environmental control. Examples of environmental conditions that may effect transcription by inducible promoters include anaerobic conditions, or certain chemicals, or the presence of light. Tissue specific, tissue preferred, cell type specific, and inducible promoters constitute the class of “non-constitutive” promoters. A “constitutive” promoter is a promoter which is active under most conditions. As used herein, “antisense orientation” includes reference to a polynucleotide sequence that is operably linked to a promoter in an orientation where the antisense strand is transcribed. The antisense strand is sufficiently complementary to an endogenous transcription product such that translation of the endogenous transcription product is often inhibited.


As used herein, “operably linked” refers to the association of two or more nucleic acid fragments on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., that the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in sense or antisense orientation.


A “consensus sequence” refers to an artificial, amino acid sequence of conserved parts of the proteins encoded by homologous genes, e.g., as determined by a CLUSTALW alignment of amino acid sequence of homolog proteins.


Homologous genes are genes which encode proteins with the same or similar biological function to the protein encoded by the second gene. Homologous genes may be generated by the event of speciation (see ortholog) or by the event of genetic duplication (see paralog). “Orthologs” refer to a set of homologous genes in different species that evolved from a common ancestral gene by specification. Normally, orthologs retain the same function in the course of evolution; and “paralogs” refer to a set of homologous genes in the same species that have diverged from each other as a consequence of genetic duplication. Thus, homologous genes can be from the same or a different organism. As used herein, “homolog” means a protein that performs the same biological function as a second protein including those identified by sequence identity search.


Percent identity refers to the extent to which two optimally aligned DNA or protein segments are invariant throughout a window of alignment of components, e.g., nucleotide sequence or amino acid sequence. An “identity fraction” for aligned segments of a test sequence and a reference sequence is the number of identical components which are shared by sequences of the two aligned segments divided by the total number of sequence components in the reference segment over a window of alignment which is the smaller of the full test sequence or the full reference sequence. “Percent identity” (“% identity”) is the identity fraction times 100. “% identity to a consensus amino acid sequence” is 100 times the identity fraction in a window of alignment of an amino acid sequence of a test protein optimally aligned to consensus amino acid sequence of this invention.


Arabidopsis” means plants of Arabidopsis thaliana.


“Pfam” refers to a large collection of multiple sequence alignments and hidden Markov models covering many common protein families, e.g. Pfam version 18.0 (August 2005) contains alignments and models for 7973 protein families and is based on the Swissprot 47.0 and SP-TREMBL 30.0 protein sequence databases. See S. R. Eddy, “Profile Hidden Markov Models”, Bioinformatics 14:755-763, 1998. Pfam is currently maintained and updated by a Pfam Consortium. The alignments represent some evolutionary conserved structure that has implications for the protein's function. Profile hidden Markov models (profile HMMs) built from the Pfam alignments are useful for automatically recognizing that a new protein belongs to an existing protein family even if the homology by alignment appears to be low. Once one DNA is identified as encoding a protein which imparts an enhanced trait when expressed in transgenic plants, other DNA encoding proteins in the same protein family are identified by querying the amino acid sequence of protein encoded by candidate DNA against the Hidden Markov Model which characterizes the Pfam domain using HMMER software, a current version of which is provided in the appended computer listing. Candidate proteins meeting the gathering cutoff for the alignment of a particular Pfam are in the protein family and have cognate DNA that is useful in constructing recombinant DNA for the use in the plant cells of this invention. Hidden Markov Model databases for use with HMMER software in identifying DNA expressing protein in a common Pfam for recombinant DNA in the plant cells of this invention are also included in the appended computer listing. The HMMER software and Pfam databases are version 18.0 and were used to identify known domains in the proteins corresponding to amino acid sequence of SEQ ID NO: 426 through SEQ ID NO: 850. All DNA encoding proteins that have scores higher than the gathering cutoff disclosed in Table 17 by Pfam analysis disclosed herein can be used in recombinant DNA of the plant cells of this invention, e.g. for selecting transgenic plants having enhanced agronomic traits. The relevant Pfams for use in this invention, as more specifically disclosed below, are Mito_carr, 6PGD, UBX, iPGM_N, WD40, Fer4, Enolase_C, DUF1639, PBP, PLAC8, Acyl-CoA_dh1, Isoamylase_N, Acyl-CoA_dh2, PC4, Sugar_tr, UCH, Enolase_N, HATPase_c, PRA-PH, Pkinase, SBP56, PEP-utilizers, SIS, PCI, DUF1644, Terpene_synth, Acyl-CoA_dh_M, Acyl-CoA_dh_N, Glutaminase, Lectin_legB, Dehydrin, MatE, Ank, 2-Hacid_dh_C, Chal_sti_synt_C, DUF1070, ATP-grasp2, Arginase, HABP4_PAI-RBP1, ABC2_membrane, DUF1723, Glyco_hydro1, MFS1, ARD, PDT, HMA, Pro_isomerase, Ferric_reduct, PRA-CH, Aa_trans, ACT, LisH, PGM_PMM_II, Spermine_synth, zf-MYND, LRRNT2, Ribul_P3_epim, PGM_PMM_IV, NPH3, DapB_C, TPR1, TPR2, FAE1_CUT1_RppA, K_trans, F-box, Cyclin_C, ADK, NUDIX, NIR_SIR, PEPCK_ATP, La, DapB_N, MtN3_slv, FMO-like, TIM, FKBP_C, PMEI, Peptidase_C12, Cyclin_N, DUF568, Methyltransf11, Methyltransf12, DUF1677, DnaJ_C, BRAP2, IF2_N, Carboxyl_trans, mTERF, Glyoxalase, TMEM14, Mlo, Beta_elim_lyase, Pyr_redox_dim, Glyco_transf8, Nicastrin, Flavodoxin1, Epimerase, PTPA, Lipase3, Pyr_redox2, GSHPx, ELM2, PGI, Aminotran12, ABC_tran, GRP, PGK, Oleosin, Sulfotransfer1, EXS, DUF1325, AMP-binding, Arm, NTP_transferase, LSM, Metalloenzyme, Molybdop_Fe4S4, MFAP1_C, Aminotran3, PHD, B56, DUF588, PSI_PsaF, zf-CCCH, HEAT, PALP, FH2, SapB1, Ammonium_transp, MannoseP_isomer, NOP5NT, SapB2, Pyr_redox, Pollen_allerg1, Asp, DUF662, FHA, YjeF_N, COX5C, GTP_EFTU_D2, Ion_trans2, PK, DUF231, FAD_binding-1, Hrf1, FAD_binding4, FAD_binding6, FAD_binding8, CBS, Smr, aPHC, DUF241, Brix, Ras, Acetyltransf1, NAF, SPX, Na_Ca_ex, C2, p450, PP2C, Histone, 2-Hacid_dh, SBF, CCT, BCNT, PK_C, Miro, CH, PfkB, ACP_syn_III_C, Sterol_desat, ADH_zinc_N, CS, Cys_Met_Meta_PP, Lactamase_B, Bromodomain, CDI, Linker_histone, DAO, Dicty_CAR, Aldo_ket_red, zf-AN1, Methyltransf 6, DUF1005, LEA2, NIR_SIR_ferr, DUF260, Oxidored_FMN, DUF26, Lectin_C, Pec_lyase_C, Nop, TB2_DP1_HVA22, ADH_N, YGGT, NAPRTase, NAD_binding1, DUF914, PGM_PMM_I, NAD_binding2, AICARFT_IMPCHas, Auxin_inducible, NAD_binding6, Anti-silence, RuBisCO_large, Response_reg, FeThRed_A, Di19, SNARE, PGM_PMM_III, Molydop_binding, efhand, zf-CCHC, GTP_EFTU, ARID, adh_short, Fibrillarin, RuBisCO_large_N, WWE, AA_permease, PABP, OMPdecase, RRM1, U-box, OPT, TBC, MGS, DUF786, 3Beta_HSD, zf-UBP, zf-A20, DPBB1, GDPD, PI-PLC-X, SEP, PI-PLC-Y, NOSIC, Glycolytic, SET, ADK_lid, Alpha-amylase, EB 1, PGAM, Abhydrolase1, Glyco_hydro14, Lung7-TM_R, Abhydrolase3, TCTP, GATase2, Gln-synt_C, 20G-FeII_Oxy, Pribosyltran, MIF, CoA_trans, RCC1, Pkinase_Tyr, MIP, DnaJ, HSCB_C, Trehalose_PPase, LRR1, Cupin2, LRR2, Glyco_hydro28, Yip1, Trp_syntA, Sedlin_N, SGS, Aldedh, CK_II_beta, zf-C3HC4, GIDA, PB 1, IMPDH, Carb_kinase, PurA, Molybdopterin, Nodulin-like, Tim17, Xan_ur_permease, Hist_deacetyl, RNA_pol_Rpb8, Agenet, Myb_DNA-binding, Glyoxal_oxid_N, Ribophorin_I, and FAE3-kCoA_syn1.


Recombinant DNA Constructs


The present invention provides recombinant DNA constructs comprising one or more polynucleotides disclosed herein for imparting one or more improved traits to transgenic plant when incorporated into the nucleus of the plant cells. Such constructs also typically comprise a promoter operatively linked to said polynucleotide to provide for expression in the plant cells. Other construct components may include additional regulatory elements, such as 5′ or 3′ untranslated regions (such as polyadenylation sites), intron regions, and transit or signal peptides. Such recombinant DNA constructs can be assembled using methods known to those of ordinary skill in the art.


In a preferred embodiment, a polynucleotide of the present invention is operatively linked in a recombinant DNA construct to a promoter functional in a plant to provide for expression of the polynucleotide in the sense orientation such that a desired protein or polypeptide fragment of a protein is produced. Also provided are embodiments wherein a polynucleotide is operatively linked to a promoter functional in a plant to provide for expression of gene suppression RNA to suppress the level of an endogenous protein.


Recombinant constructs prepared in accordance with the present invention also generally include a 3′ untranslated DNA region (UTR) that typically contains a polyadenylation sequence following the polynucleotide coding region. Examples of useful 3′ UTRs include those from the nopaline synthase gene of Agrobacterium tumefaciens (nos), a gene encoding the small subunit of a ribulose-1,5-bisphosphate carboxylase-oxygenase (rbcS), and the T7 transcript of Agrobacterium tumefaciens. Constructs and vectors may also include a transit peptide for targeting of a gene target to a plant organelle, particularly to a chloroplast, leucoplast or other plastid organelle. For descriptions of the use of chloroplast transit peptides, see U.S. Pat. No. 5,188,642 and U.S. Pat. No. 5,728,925, incorporated herein by reference.


Table 1 provides a list of genes that provide recombinant DNA that was used in a model plant to discover associated improved traits and that can be used with homologs to define a consensus amino acid sequence for characterizing recombinant DNA for use in the nuclei of this invention. An understanding of Table 1 is facilitated by the following description of the headings:


“NUC SEQ ID NO” refers to a SEQ ID NO. for particular DNA sequence in the Sequence Listing.


“PEP SEQ ID NO” refers to a SEQ ID NO. in the Sequence Listing for the amino acid sequence of a protein cognate to a particular DNA


“construct_id” refers to an arbitrary number used to identify a particular recombinant DNA construct comprising the particular DNA.


“Gene ID” refers to an arbitrary name used to identify the particular DNA. “orientation” refers to the orientation of the particular DNA in a recombinant DNA construct relative to the promoter.

TABLE 1NUCSEQPEP SEQIDIDConstructNONOGene IDIDorientation1426CGPG69910919ANTI-SENSE2427CGPG56711142SENSE3428CGPG26711310ANTI-SENSE4429CGPG117911866ANTI-SENSE5430CGPG95911937ANTI-SENSE6431CGPG215816218SENSE7432CGPG244617410SENSE8433CGPG186218401SENSE9434CGPG301418665SENSE10435CGPG167419141SENSE11436CGPG268019156SENSE12437CGPG357719621SENSE13438CGPG406519760SENSE14439CGPG392919857SENSE15440CGPG248870510SENSE16441CGPG301270605SENSE17442CGPG316270607SENSE18443CGPG60770812SENSE19444CGPG408470933SENSE20445CGPG391770939SENSE21446CGPG441471319SENSE22447CGPG18571446SENSE23448CGPG167971609SENSE24449CGPG27171649SENSE25450CGPG443471814SENSE26451CGPG219871945SENSE27452CGPG525372005SENSE28453CGPG523172026SENSE29454CGPG481572533SENSE30455CGPG485972637SENSE31456CGPG158972782SENSE32457CGPG182672794SENSE33458CGPG389972919SENSE34459CGPG561072983SENSE35460CGPG566573025SENSE36461CGPG569773050SENSE37462CGPG569573092SENSE38463CGPG486273338SENSE39464CGPG491073344SENSE40465CGPG654173536SENSE41466CGPG175673644SENSE42467CGPG492773662SENSE43468CGPG510673672SENSE44469CGPG516773733SENSE45470CGPG192473846SENSE46471CGPG520173863SENSE47472CGPG188474059SENSE48473CGPG508974205SENSE49474CGPG587074321SENSE50475CGPG588874337SENSE51476CGPG146174388SENSE52477CGPG674374412SENSE53478CGPG672274445SENSE54479CGPG8274522SENSE55480CGPG676174526SENSE56481CGPG678174576SENSE57482CGPG491474707SENSE58483CGPG584074743SENSE59484CGPG598074755SENSE60485CGPG174375202SENSE61486CGPG543475220SENSE62487CGPG582475226SENSE63488CGPG587975231SENSE64489CGPG594975239SENSE65490CGPG609675258SENSE66491CGPG621875270SENSE67492CGPG622675271SENSE68493CGPG751875355SENSE69494CGPG765475460SENSE70495CGPG687575847SENSE71496CGPG825975907SENSE72497CGPG822975927SENSE73498CGPG822475962SENSE74499CGPG192775984SENSE75500CGPG596276119SENSE76501CGPG537576209SENSE77502CGPG894976323SENSE78503CGPG894376346SENSE79504CGPG889676352SENSE80505CGPG896076360SENSE81506CGPG589176512SENSE82507CGPG726076575SENSE83508CGPG264776733SENSE84509CGPG699576741SENSE85510CGPG904676845SENSE86511CGPG904776857SENSE87512CGPG908376902SENSE88513CGPG907676913SENSE89514CGPG910876917SENSE90515CGPG910976929SENSE91516CGPG593377009SENSE92517CGPG633577022SENSE93518CGPG917477108SENSE94519CGPG912077125SENSE95520CGPG920077135SENSE96521CGPG926377219SENSE97522CGPG808777351SENSE98523CGPG38514810SENSE99524CGPG185714835ANTI-SENSE100525CGPG178815419ANTI-SENSE101526CGPG196616223SENSE102527CGPG190817903SENSE103528CGPG354518279SENSE104529CGPG355718284SENSE105530CGPG334018332SENSE106531CGPG343118349SENSE107532CGPG353018623SENSE108533CGPG311919551SENSE109534CGPG359419627SENSE110535CGPG403119785SENSE111536CGPG403619944SENSE112537CGPG238470125SENSE113538CGPG159870504SENSE114539CGPG56070830SENSE115540CGPG400270943SENSE116541CGPG142271711ANTI-SENSE117542CGPG442971962SENSE118543CGPG86272352SENSE119544CGPG225172413SENSE120545CGPG64672603SENSE121546CGPG256972676SENSE122547CGPG550772737SENSE123548CGPG554772742SENSE124549CGPG551772762SENSE125550CGPG579272993SENSE126551CGPG576673031SENSE127552CGPG577573091SENSE128553CGPG480973224SENSE129554CGPG487273340SENSE130555CGPG642073420SENSE131556CGPG640273489SENSE132557CGPG507373750SENSE133558CGPG509173751SENSE134559CGPG357073814SENSE135560CGPG434273855SENSE136561CGPG448373856SENSE137562CGPG152773929SENSE138563CGPG435174063SENSE139564CGPG475374072SENSE140565CGPG475774073SENSE141566CGPG480574075SENSE142567CGPG662474135SENSE143568CGPG316174313SENSE144569CGPG377074315SENSE145570CGPG667374427SENSE146571CGPG670274490SENSE147572CGPG2174511SENSE148573CGPG680174531SENSE149574CGPG15474532SENSE150575CGPG676274538SENSE151576CGPG676374550SENSE152577CGPG146774579SENSE153578CGPG616074644SENSE154579CGPG1574703SENSE155580CGPG582574733SENSE156581CGPG593674749SENSE157582CGPG597474752SENSE158583CGPG136674773SENSE159584CGPG739074896SENSE160585CGPG742174976SENSE161586CGPG744674991SENSE162587CGPG629575289SENSE163588CGPG147676003SENSE164589CGPG182176074SENSE165590CGPG697576137SENSE166591CGPG618976220SENSE167592CGPG886876301SENSE168593CGPG890976318SENSE169594CGPG895176347SENSE170595CGPG589276513SENSE171596CGPG717176566SENSE172597CGPG614276719SENSE173598CGPG721276757SENSE174599CGPG903476891SENSE175600CGPG927077208SENSE176601CGPG111212116ANTI-SENSE177602CGPG128413053ANTI-SENSE178603CGPG164014733SENSE179604CGPG213616132SENSE180605CGPG354218276SENSE181606CGPG169170851SENSE182607CGPG406770941SENSE183608CGPG533572134SENSE184609CGPG2772326SENSE185610CGPG344172978SENSE186611CGPG437573619SENSE187612CGPG517673737SENSE188613CGPG663774196SENSE189614CGPG66174546SENSE190615CGPG86974558SENSE191616CGPG615974643SENSE192617CGPG593175234SENSE193618CGPG628275278SENSE194619CGPG767175585SENSE195620CGPG157476063SENSE196621CGPG929477401SENSE197622CGPG243517808SENSE198623CGPG333618330SENSE199624CGPG361318409SENSE200625CGPG416819813SENSE201626CGPG651773580SENSE202627CGPG99373935SENSE203628CGPG663174124SENSE204629CGPG539374725SENSE205630CGPG131376403SENSE206631CGPG815077366SENSE207632CGPG166172414SENSE208633CGPG642773409SENSE209634CGPG490673723SENSE210635CGPG509273747SENSE211636CGPG614674634SENSE212637CGPG602274769SENSE213638CGPG588375232SENSE214639CGPG759675429SENSE215640CGPG824875965SENSE216641CGPG823375975SENSE217642CGPG157376061SENSE218643CGPG714176155SENSE219644CGPG894176322SENSE220645CGPG633776440SENSE221646CGPG900576828SENSE222647CGPG920877136SENSE223648CGPG434877303SENSE224649CGPG809977352SENSE225650CGPG61810913ANTI-SENSE226651CGPG25110925ANTI-SENSE227652CGPG63611357SENSE228653CGPG63911358SENSE229654CGPG28711432ANTI-SENSE230655CGPG89311927ANTI-SENSE231656CGPG112212050SENSE232657CGPG65712358ANTI-SENSE233658CGPG148913809SENSE234659CGPG212216877SENSE235660CGPG168318117ANTI-SENSE236661CGPG468571651SENSE237662CGPG472972449SENSE238663CGPG488072649SENSE239664CGPG568073107SENSE240665CGPG731774875SENSE241666CGPG490275079SENSE242667CGPG672075180SENSE243668CGPG585075228SENSE244669CGPG601075246SENSE245670CGPG748875375SENSE246671CGPG759275476SENSE247672CGPG767275502SENSE248673CGPG692375857SENSE249674CGPG698875873SENSE250675CGPG135775970SENSE251676CGPG591876115SENSE252677CGPG602076217SENSE253678CGPG703276287SENSE254679CGPG706976294SENSE255680CGPG890076305SENSE256681CGPG892376391SENSE257682CGPG629676438SENSE258683CGPG705376452SENSE259684CGPG713476563SENSE260685CGPG695177055SENSE261686CGPG699477059SENSE262687CGPG701977062SENSE263688CGPG702477063SENSE264689CGPG925577218SENSE265690CGPG927477256SENSE266691CGPG613077314SENSE267692CGPG807477344SENSE268693CGPG808177348SENSE269694CGPG17210443ANTI-SENSE270695CGPG248217814SENSE271696CGPG322218537SENSE272697CGPG325918831SENSE273698CGPG190019044ANTI-SENSE274699CGPG412819950SENSE275700CGPG2070221SENSE276701CGPG20171110SENSE277702CGPG342071718SENSE278703CGPG430871838SENSE279704CGPG524472087SENSE280705CGPG553872729SENSE281706CGPG373873302SENSE282707CGPG645473484SENSE283708CGPG653073594SENSE284709CGPG517573736SENSE285710CGPG575674016SENSE286711CGPG536174266SENSE287712CGPG670974479SENSE288713CGPG675574549SENSE289714CGPG676574574SENSE290715CGPG603974607SENSE291716CGPG615874642SENSE292717CGPG357875015SENSE293718CGPG752875380SENSE294719CGPG775675560SENSE295720CGPG766375765SENSE296721CGPG824975977SENSE297722CGPG223276052SENSE298723CGPG549476109SENSE299724CGPG594776116SENSE300725CGPG716076158SENSE301726CGPG626276232SENSE302727CGPG690976269SENSE303728CGPG888476303SENSE304729CGPG896576474SENSE305730CGPG586176529SENSE306731CGPG713876625SENSE307732CGPG724676759SENSE308733CGPG537877026SENSE309734CGPG916877131SENSE310735CGPG150514944SENSE311736CGPG361418410SENSE312737CGPG322018536SENSE313738CGPG306219540SENSE314739CGPG358019624SENSE315740CGPG372570353SENSE316741CGPG59271215SENSE317742CGPG441771320SENSE318743CGPG227673076SENSE319744CGPG497573666SENSE320745CGPG279073852SENSE321746CGPG497274080SENSE322747CGPG514074211SENSE323748CGPG1974510SENSE324749CGPG676974527SENSE325750CGPG677074539SENSE326751CGPG73774556SENSE327752CGPG678974577SENSE328753CGPG537474729SENSE329754CGPG597874753SENSE330755CGPG597974754SENSE331756CGPG744774908SENSE332757CGPG94875107SENSE333758CGPG605275254SENSE334759CGPG766175741SENSE335760CGPG826175931SENSE336761CGPG544776207SENSE337762CGPG706876293SENSE338763CGPG593876517SENSE339764CGPG710076559SENSE340765CGPG900376804SENSE341766CGPG629977018SENSE342767CGPG913577115SENSE343768CGPG914477128SENSE344769CGPG919477158SENSE345770CGPG920277159SENSE346771CGPG913977163SENSE347772CGPG41410923ANTI-SENSE348773CGPG107214836SENSE349774CGPG198615130SENSE350775CGPG291618214SENSE351776CGPG11170236SENSE352777CGPG431770639SENSE353778CGPG438770668SENSE354779CGPG449270674SENSE355780CGPG59770804SENSE356781CGPG34570821SENSE357782CGPG478672505SENSE358783CGPG499872824SENSE359784CGPG507673283SENSE360785CGPG510473292SENSE361786CGPG650273578SENSE362787CGPG663074112SENSE363788CGPG548474249SENSE364789CGPG539474278SENSE365790CGPG605874359SENSE366791CGPG209074393SENSE367792CGPG666474414SENSE368793CGPG770675530SENSE369794CGPG788475752SENSE370795CGPG696575867SENSE371796CGPG125675910SENSE372797CGPG229776101SENSE373798CGPG624676431SENSE374799CGPG903776832SENSE375800CGPG194116023SENSE376801CGPG205516424ANTI-SENSE377802CGPG242717807SENSE378803CGPG38670802SENSE379804CGPG39370817SENSE380805CGPG60970819SENSE381806CGPG402270908SENSE382807CGPG94272351SENSE383808CGPG180073087SENSE384809CGPG478373223SENSE385810CGPG325773709SENSE386811CGPG169673804SENSE387812CGPG198273819SENSE388813CGPG378073851SENSE389814CGPG660274156SENSE390815CGPG662174194SENSE391816CGPG549374252SENSE392817CGPG581174317SENSE393818CGPG590274328SENSE394819CGPG679174506SENSE395820CGPG677874540SENSE396821CGPG616674650SENSE397822CGPG373574718SENSE398823CGPG585474745SENSE399824CGPG745174956SENSE400825CGPG502475076SENSE401826CGPG605475255SENSE402827CGPG620775269SENSE403828CGPG762075432SENSE404829CGPG762375468SENSE405830CGPG777575686SENSE406831CGPG7375911SENSE407832CGPG210076064SENSE408833CGPG602676121SENSE409834CGPG726976193SENSE410835CGPG636176237SENSE411836CGPG699376281SENSE412837CGPG889976388SENSE413838CGPG692676557SENSE414839CGPG717276567SENSE415840CGPG712976624SENSE416841CGPG727676764SENSE417842CGPG903176855SENSE418843CGPG910576976SENSE419844CGPG908276985SENSE420845CGPG621277014SENSE421846CGPG920677112SENSE422847CGPG915177117SENSE423848CGPG912977138SENSE424849CGPG922477226SENSE425850CGPG935877418SENSE


Recombinant DNA


DNA for use in the present invention to improve traits in plants have a nucleotide sequence of SEQ ID NO:1 through SEQ ID NO:425, as well as the homologs of such DNA molecules. A subset of the DNA for gene suppression aspects of the invention includes fragments of the disclosed full polynucleotides consisting of oligonucleotides of 21 or more consecutive nucleotides. Oligonucleotides the larger molecules having a sequence selected from the group consisting of SEQ ID NO:1 through SEQ ID NO:425 are useful as probes and primers for detection of the polynucleotides used in the invention. Also useful in this invention are variants of the DNA. Such variants may be naturally occurring, including DNA from homologous genes from the same or a different species, or may be non-natural variants, for example DNA synthesized using chemical synthesis methods, or generated using recombinant DNA techniques. Degeneracy of the genetic code provides the possibility to substitute at least one base of the protein encoding sequence of a gene with a different base without causing the amino acid sequence of the polypeptide produced from the gene to be changed. Hence, a DNA useful in the present invention may have any base sequence that has been changed from the sequences provided herein by substitution in accordance with degeneracy of the genetic code.


Homologs of the genes providing DNA demonstrated as useful in improving traits in model plants disclosed herein will generally have significant identity with the DNA disclosed herein. DNA is substantially identical to a reference DNA if, when the sequences of the polynucleotides are optimally aligned there is about 60% nucleotide equivalence; more preferably 70%; more preferably 80% equivalence; more preferably 85% equivalence; more preferably 90%; more preferably 95%; and/or more preferably 98% or 99% equivalence over a comparison window. A comparison window is preferably at least 50-100 nucleotides, and more preferably is the entire length of the polynucleotide provided herein. Optimal alignment of sequences for aligning a comparison window may be conducted by algorithms; preferably by computerized implementations of these algorithms (for example, the Wisconsin Genetics Software Package Release 7.0-10.0, Genetics Computer Group, 575 Science Dr., Madison, Wis.).


The reference polynucleotide may be a full-length molecule or a portion of a longer molecule. Preferentially, the window of comparison for determining polynucleotide identity of protein encoding sequences is the entire coding region.


Proteins useful for imparting improved traits are entire proteins or at least a sufficient portion of the entire protein to impart the relevant biological activity of the protein. Proteins useful for generation of transgenic plants having improved traits include the proteins with an amino acid sequence provided herein as SEQ ID NO: 426 through SEQ ID NO: 850, as well as homologs of such proteins.


Homologs of the proteins useful in the invention are identified by comparison of the amino acid sequence of the protein to amino acid sequences of proteins from the same or different plant sources, e.g., manually or by using known homology-based search algorithms such as those commonly known and referred to as BLAST, FASTA, and Smith-Waterman. As used herein, a homolog is a protein from the same or a different organism that performs the same biological function as the polypeptide to which it is compared. An orthologous relation between two organisms is not necessarily manifest as a one-to-one correspondence between two genes, because a gene can be duplicated or deleted after organism phylogenetic separation, such as speciation. For a given protein, there may be no ortholog or more than one ortholog. Other complicating factors include alternatively spliced transcripts from the same gene, limited gene identification, redundant copies of the same gene with different sequence lengths or corrected sequence. A local sequence alignment program, e.g., BLAST, can be used to search a database of sequences to find similar sequences, and the summary Expectation value (E-value) used to measure the sequence base similarity. As a protein hit with the best E-value for a particular organism may not necessarily be an ortholog or the only ortholog, a reciprocal BLAST search is used in the present invention to filter hit sequences with significant E-values for ortholog identification. The reciprocal BLAST entails search of the significant hits against a database of amino acid sequences from the base organism that are similar to the sequence of the query protein. A hit is a likely ortholog, when the reciprocal BLAST's best hit is the query protein itself or a protein encoded by a duplicated gene after speciation. Thus, homolog is used herein to describe proteins that are assumed to have functional similarity by inference from sequence base similarity. The relationship of homologs with amino acid sequences of SEQ ID NO: 851 to SEQ ID NO: 33634 to the proteins with amino acid sequences of SEQ ID NO: to 426 to SEQ ID NO: 850 is found in the listing of Table 2.


Other functional homolog proteins differ in one or more amino acids from those of a trait-improving protein disclosed herein as the result of one or more of the well-known conservative amino acid substitutions, e.g., valine is a conservative substitute for alanine and threonine is a conservative substitute for serine. Conservative substitutions for an amino acid within the native sequence can be selected from other members of a class to which the naturally occurring amino acid belongs. Representative amino acids within these various classes include, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar amino acids such as glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; and (4) neutral nonpolar (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. Conserved substitutes for an amino acid within a native amino acid sequence can be selected from other members of the group to which the naturally occurring amino acid belongs. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Naturally conservative amino acids substitution groups are: valine-leucine, valine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, aspartic acid-glutamic acid, and asparagine-glutamine. A further aspect of the invention comprises proteins that differ in one or more amino acids from those of a described protein sequence as the result of deletion or insertion of one or more amino acids in a native sequence.


Homologs of the trait-improving proteins disclosed provided herein will generally demonstrate significant sequence identity. Of particular interest are proteins having at least 50% sequence identity, more preferably at least about 70% sequence identity or higher, e.g., at least about 80% sequence identity with an amino acid sequence of SEQ ID NO: 426 through SEQ ID NO: 850. Of course useful proteins also include those with higher identity, e.g., 90% to 99% identity. Identity of protein homologs is determined by optimally aligning the amino acid sequence of a putative protein homolog with a defined amino acid sequence and by calculating the percentage of identical and conservatively substituted amino acids over the window of comparison. The window of comparison for determining identity can be the entire amino acid sequence disclosed herein, e.g., the full sequence of any of SEQ ID NO: 426 through SEQ ID NO: 850.


Genes that are homologous to each other can be grouped into families and included in multiple sequence alignments. Then a consensus sequence for each group can be derived. This analysis enables the derivation of conserved and class- (family) specific residues or motifs that are functionally important. These conserved residues and motifs can be further validated with 3D protein structure if available. The consensus sequence can be used to define the full scope of the invention, e.g., to identify proteins with a homolog relationship. Thus, the present invention contemplates that protein homologs include proteins with an amino acid sequence that has at least 90% identity to such a consensus amino acid sequence sequences.


Promoters


Numerous promoters that are active in plant cells have been described in the literature. These include promoters present in plant genomes as well as promoters from other sources, including nopaline synthase (NOS) promoter and octopine synthase (OCS) promoters carried on tumor-inducing plasmids of Agrobacterium tumefaciens, caulimovirus promoters such as the cauliflower mosaic virus or Figwort mosaic virus promoters. For instance, see U.S. Pat. Nos. 5,858,742 and 5,322,938 which disclose versions of the constitutive promoter derived from cauliflower mosaic virus (CaMV35S), U.S. Pat. No. 5,378,619 which discloses a Figwort Mosaic Virus (FMV) 35S promoter, U.S. Pat. No. 6,437,217 which discloses a maize RS81 promoter, U.S. Pat. No. 5,641,876 which discloses a rice actin promoter, U.S. Pat. No. 6,426,446 which discloses a maize RS324 promoter, U.S. Pat. No. 6,429,362 which discloses a maize PR1 promoter, U.S. Pat. No. 6,232,526 which discloses a maize A3 promoter, U.S. Pat. No. 6,177,611 which discloses constitutive maize promoters, U.S. Pat. No. 6,433,252 which discloses a maize L3 oleosin promoter, U.S. Pat. No. 6,429,357 which discloses a rice actin 2 promoter and intron, U.S. Pat. No. 5,837,848 which discloses a root specific promoter, U.S. Pat. No. 6,084,089 which discloses cold inducible promoters, U.S. Pat. No. 6,294,714 which discloses light inducible promoters, U.S. Pat. No. 6,140,078 which discloses salt inducible promoters, U.S. Pat. No. 6,252,138 which discloses pathogen inducible promoters, U.S. Pat. No. 6,175,060 which discloses phosphorus deficiency inducible promoters, U.S. Patent Application Publication 2002/0192813A1 which discloses 5′, 3′ and intron elements useful in the design of effective plant expression vectors, U.S. patent application Ser. No. 09/078,972 which discloses a coixin promoter, U.S. patent application Ser. No. 09/757,089 which discloses a maize chloroplast aldolase promoter, and U.S. patent application Ser. No. 10/739,565 which discloses water-deficit inducible promoters, all of which are incorporated herein by reference. These and numerous other promoters that function in plant cells are known to those skilled in the art and available for use in recombinant polynucleotides of the present invention to provide for expression of desired genes in transgenic plant cells.


Furthermore, the promoters can include multiple “enhancer sequences” to assist in elevating gene expression. Such enhancers are known in the art. By including an enhancer sequence with such constructs, the expression of the selected protein may be enhanced. These enhancers often are found 5′ to the start of transcription in a promoter that functions in eukaryotic cells, but can often be inserted in the forward or reverse orientation 5′ or 3′ to the coding sequence. In some instances, these 5′ enhancing elements are introns. Deemed to be particularly useful as enhancers are the 5′ introns of the rice actin 1 and rice actin 2 genes. Examples of other enhancers that can be used in accordance with the invention include elements from the CaMV 35S promoter, octopine synthase genes, the maize alcohol dehydrogenase gene, the maize shrunken 1 gene and promoters from non-plant eukaryotes.


In some aspects of the invention it is preferred that the promoter element in the DNA construct be capable of causing sufficient expression to result in the production of an effective amount of a polypeptide in water deficit conditions. Such promoters can be identified and isolated from the regulatory region of plant genes that are over expressed in water deficit conditions. Specific water-deficit-inducible promoters for use in this invention are derived from the 5′ regulatory region of genes identified as a heat shock protein 17.5 gene (HSP17.5), an HVA22 gene (HVA22), a Rab17 gene and a cinnamic acid 4-hydroxylase (CA4H) gene (CA4H) of Zea maize. Such water-deficit-inducible promoters are disclosed in U.S. application Ser. No. 10/739,565, incorporated herein by reference.


In some aspects of the invention, sufficient expression in plant seed tissues is desired to effect improvements in seed composition. Exemplary promoters for use for seed composition modification include promoters from seed genes such as napin (U.S. Pat. No. 5,420,034), maize L3 oleosin (U.S. Pat. No. 6,433,252), zein Z27 (Russell et al., (1997)Transgenic Res. 6(2):157-166), globulin 1 (Belanger et al., (1991) Genetics 129:863-872), glutelin 1 (Russell (1997) supra), and peroxiredoxin antioxidant (Per 1) (Stacy et al., (1996) Plant Mol. Biol. 31(6):1205-1216).


In some aspects of the invention, preferential expression in plant green tissues is desired. Promoters of interest for such uses include those from genes such as SSU (Fischhoff, et al., (1992) Plant Mol. Biol. 20:81-93), aldolase and pyruvate orthophosphate dikinase (PPDK) (Taniguchi, et al., (2000) Plant Cell Physiol. 41(1):42-48).


Gene suppression includes any of the well-known methods for suppressing transcription of a gene or the accumulation of the mRNA corresponding to that gene thereby preventing translation of the transcript into protein. Posttranscriptional gene suppression is mediated by transcription of RNA that forms double-stranded RNA (dsRNA) having homology to a gene targeted for suppression. Suppression can also be achieved by insertion mutations created by transposable elements may also prevent gene function. For example, in many dicot plants, transformation with the T-DNA of Agrobacterium may be readily achieved and large numbers of transformants can be rapidly obtained. Also, some species have lines with active transposable elements that can efficiently be used for the generation of large numbers of insertion mutations, while some other species lack such options. Mutant plants produced by Agrobacterium or transposon mutagenesis and having altered expression of a polypeptide of interest can be identified using the polynucleotides of the present invention. For example, a large population of mutated plants may be screened with polynucleotides encoding the polypeptide of interest to detect mutated plants having an insertion in the gene encoding the polypeptide of interest.


Gene Stacking


The present invention also contemplates that the trait-improving recombinant DNA provided herein can be used in combination with other recombinant DNA to create plants with a multiple desired traits. The combinations generated can include multiple copies of any one or more of the recombinant DNA constructs. These stacked combinations can be created by any method, including but not limited to cross breeding of transgenic plants, or multiple genetic transformation.


Transformation Methods


Numerous methods for producing plant cell nuclei with recombinant DNA are known in the art and may be used in the present invention. Two commonly used methods for plant transformation are Agrobacterium-mediated transformation and microprojectile bombardment. Microprojectile bombardment methods are illustrated in U.S. Pat. No. 5,015,580 (soybean); U.S. Pat. No. 5,550,318 (corn); U.S. Pat. No. 5,538,880 (corn); U.S. Pat. No. 5,914,451 (soybean); U.S. Pat. No. 6,160,208 (corn); U.S. Pat. No. 6,399,861 (corn) and U.S. Pat. No. 6,153,812 (wheat) and Agrobacterium-mediated transformation is described in U.S. Pat. No. 5,159,135 (cotton); U.S. Pat. No. 5,824,877 (soybean); U.S. Pat. No. 5,591,616 (corn); and U.S. Pat. No. 6,384,301 (soybean), all of which are incorporated herein by reference. For Agrobacterium tumefaciens based plant transformation system, additional elements present on transformation constructs will include T-DNA left and right border sequences to facilitate incorporation of the recombinant polynucleotide into the plant genome.


In general it is preferred to introduce heterologous DNA randomly, i.e., at a non-specific location, in the genome of a target plant line. In special cases it may be useful to target heterologous DNA insertion in order to achieve site-specific integration, e.g., to replace an existing gene in the genome, to use an existing promoter in the plant genome, or to insert a recombinant polynucleotide at a predetermined site known to be active for gene expression. Several site specific recombination systems exist which are known to function implants include cre-lox as disclosed in U.S. Pat. No. 4,959,317 and FLP-FRT as disclosed in U.S. Pat. No. 5,527,695, both incorporated herein by reference.


Transformation methods of this invention are preferably practiced in tissue culture on media and in a controlled environment. “Media” refers to the numerous nutrient mixtures that are used to grow cells in vitro, that is, outside of the intact living organism. Recipient cell targets include, but are not limited to, meristem cells, callus, immature embryos and gametic cells such as microspores, pollen, sperm and egg cells. It is contemplated that any cell from which a fertile plant may be regenerated is useful as a recipient cell. Callus may be initiated from tissue sources including, but not limited to, immature embryos, seedling apical meristems, microspores and the like. Cells capable of proliferating as callus are also recipient cells for genetic transformation. Practical transformation methods and materials for making transgenic plants of this invention, e.g., various media and recipient target cells, transformation of immature embryos and subsequent regeneration of fertile transgenic plants are disclosed in U.S. Pat. Nos. 6,194,636 and 6,232,526 and U.S. patent application Ser. No. 09/757,089, which are incorporated herein by reference.


In practice DNA is introduced into only a small percentage of target cell nuclei in any one experiment. Marker genes are used to provide an efficient system for identification of those cells with nuclei that are stably transformed by receiving and integrating a transgenic DNA construct into their genomes. Preferred marker genes provide selective markers that confer resistance to a selective agent, such as an antibiotic or herbicide. Potentially transformed cells with a nucleus of the invention are exposed to the selective agent. In the population of surviving cells will be those cells where, generally, the resistance-conferring gene has been integrated and expressed at sufficient levels to permit cell survival. Cells may be tested further to confirm stable integration of the exogenous DNA in the nucleus. Useful selective marker genes include those conferring resistance to antibiotics such as kanamycin (nptII), hygromycin B (aph IV) and gentamycin (aac3 and aacC4) or resistance to herbicides such as glufosinate (bar or pat) and glyphosate (EPSPS). Examples of such selectable are illustrated in U.S. Pat. Nos. 5,550,318; 5,633,435; 5,780,708 and 6,118,047, all of which are incorporated herein by reference. Screenable markers which provide an ability to visually identify transformants can also be employed, e.g., a gene expressing a colored or fluorescent protein such as a luciferase or green fluorescent protein (GFP) or a gene expressing a beta-glucuronidase or uidA gene (GUS) for which various chromogenic substrates are known. It is also contemplated that combinations of screenable and selectable markers will be useful for identification of transformed cells. See PCT publication WO 99/61129 which discloses use of a gene fusion between a selectable marker gene and a screenable marker gene, e.g., an NPTII gene and a GFP gene.


Cells that survive exposure to the selective agent, or cells that have been scored positive in a screening assay, may be cultured in regeneration media and allowed to mature into plants. Developing plantlets can be transferred to soil less plant growth mix, and hardened off, e.g., in an environmentally controlled chamber at about 85% relative humidity, 600 ppm CO2, and 25-250 microeinsteins m−2s−1 of light, prior to transfer to a greenhouse or growth chamber for maturation. Plants are preferably matured either in a growth chamber or greenhouse. Plants are regenerated from about 6 wk to 10 months after a transformant is identified, depending on the initial tissue. During regeneration, cells are grown to plants on solid media at about 19 to 28° C. After regenerating plants have reached the stage of shoot and root development, they may be transferred to a greenhouse for further growth and testing. Plants may be pollinated using conventional plant breeding methods known to those of skill in the art and seed produced.


Progeny may be recovered from transformed plants and tested for expression of the exogenous recombinant polynucleotide. Useful assays include, for example, “molecular biological” assays, such as Southern and Northern blotting and PCR; “biochemical” assays, such as detecting the presence of RNA, e.g., double stranded RNA, or a protein product, e.g., by immunological means (ELISAs and Western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole regenerated plant.


Discovery of Trait-Improving Recombinant DNA


To identify nuclei with recombinant DNA that confer improved traits to plants, Arabidopsis thaliana was transformed with a candidate recombinant DNA construct and screened for an improved trait.



Arabidopsis thaliana is used a model for genetics and metabolism in plants. Arabidopsis has a small genome, and well-documented studies are available. It is easy to grow in large numbers and mutants defining important genetically controlled mechanisms are either available, or can readily be obtained. Various methods to introduce and express isolated homologous genes are available (see Koncz, e.g., Methods in Arabidopsis Research e.g., (1992), World Scientific, New Jersey, in “Preface”).


A two-step screening process was employed which comprised two passes of trait characterization to ensure that the trait modification was dependent on expression of the recombinant DNA, but not due to the chromosomal location of the integration of the transgene. Twelve independent transgenic lines for each recombinant DNA construct were established and assayed for the transgene expression levels. Five transgenic lines with high transgene expression levels were used in the first pass screen to evaluate the transgene's function in T2 transgenic plants. Subsequently, three transgenic events, which had been shown to have one or more improved traits, were further evaluated in the second pass screen to confirm the transgene's ability to impart an improved trait. The following Table 3 summarizes the improved traits that have been confirmed as provided by a recombinant DNA construct.


In particular, Table3 reports:


“PEP SEQ ID” which is the amino acid sequence of the protein cognate to the DNA in the recombinant DNA construct corresponding to a protein sequence of a SEQ ID NO. in the Sequence Listing.


“construct_id” is an arbitrary name for the recombinant DNA describe more particularly in Table 1.


“annotation” refers to a description of the top hit protein obtained from an amino acid sequence query of each PEP SEQ ID NO to GenBank database of the National Center for Biotechnology Information (ncbi). More particularly, “gi” is the GenBank ID number for the top BLAST hit.


“description” refers to the description of the top BLAST hit.


“e-value” provides the expectation value for the BLAST hit.


“identity” refers to the percentage of identically matched amino acid residues along the length of the portion of the sequences which is aligned by BLAST between the sequence of interest provided herein and the hit sequence in GenBank.


“traits” identify by two letter codes the confirmed improvement in a transgenic plant provided by the recombinant DNA. The codes for improved traits are:


“CK” which indicates cold tolerance improvement identified under a cold shock tolerance screen;


“CS” which indicates cold tolerance improvement identified by a cold germination tolerance screen;


“DS” which indicates drought tolerance improvement identified by a soil drought stress tolerance screen;


“PEG” which indicates osmotic stress tolerance improvement identified by a PEG induced osmotic stress tolerance screen;


“HS” which indicates heat stress tolerance improvement identified by a heat stress tolerance screen;


“SS” which indicates high salinity stress tolerance improvement identified by a salt stress tolerance screen;


“LN” which indicates nitrogen use efficiency improvement identified by a limited nitrogen tolerance screen;


“LL” which indicates attenuated shade avoidance response identified by a shade tolerance screen under a low light condition;


“PP” which indicates improved growth and development at early stages identified by an early plant growth and development screen;


“SP” which indicates improved growth and development at late stages identified by a late plant growth and development screen provided herein.

TABLE 3annotationPEP SEQe%ID NOvalueidentityGenBank iddescriptiontrait4261.00E−35100gi|21553646|histone H2ACK4276.00E−59100gi|7327824|ref|NP_195785.1|CKmacrophage migrationinhibitory factor familyprotein/MIF family protein[Arabidopsis thaliana]428099gi|28416663|ref|NP_180648.1|CKtransducin family protein/WD-40 repeat family protein4292.00E−78100gi|23198326|gb|AAN15690.1|unknownCKprotein [Arabidopsisthaliana]4300100gi|23297810|ref|NP_194910.2|CKLNphototropic-responsiveNPH3 family protein4318.00E−48100gi|12248017|ref|NP_173542.1| smallCKnuclear ribonucleoprotein,putative/snRNP, putative/Sm protein, putative4321.00E−126100gi|16323514|ref|NP_187673.1|CKCSPPHSdiadenosine 5′,5′′′-P1,P4-tetraphosphate hydrolase,putative [Arabidopsisthaliana]4330100gi|30679613|ref|NP_172113.1| DNA-CKSSPEGbinding bromodomain-containing protein Containssimilarity to a Ring3 protein4342.00E−48100gi|52354271|gb|AAC17827.1| similar toCKPPlate embryogenesis abundantproteins4350100gi|20466169|ref|NP_564623.2|CKsodium/calcium exchangerfamily protein/calcium-binding EF hand familyprotein4362.00E−87100gi|19310829|ref|NP_179396.1| histoneCKH1-3 (HIS1-3)4371.00E−99100gi|21593872|ref|NP_563973.1|CKlactoylglutathione lyasefamily protein/glyoxalase Ifamily protein4381.00E−15966gi|15220367|ref|NP_176890.1|F-boxCKCSHSfamily protein [Arabidopsisthaliana]439078gi|9757954|ref|NP_199024.1| expressedCKprotein [Arabidopsisthaliana]4405.00E−51100gi|21618078|ref|NP_176371.1|CKHSSSpostsynaptic protein-related[Arabidopsis thaliana]4410100gi|24030234|ref|NP_027420.1| zincCKCSSSLLPEGfinger (CCCH-type) familyprotein [Arabidopsisthaliana]4420100gi|21618116|ref|NP_567671.1| bileCKacid:sodium symporterfamily protein4431.00E−168100gi|16930405|ref|NP_850182.1| PURCKPPHSSSalpha-1 protein [Arabidopsisthaliana]444083gi|23397029|ref|NP_567709.1| 26SCKCSPPproteasome regulatorysubunit, putative (RPN7)445083gi|30349498|ref|NP_850094.1| CBL-CKSSPEGinteracting protein kinase 3(CIPK3)4460100gi|9294291|dbj|BAB02193.1|cytochromeCKPPp4504470100gi|7270365|emb|CAB80133.1|cyclinCKPPHSSSPEGdelta-34480100gi|31580811|gb|AAP51420.1|ferric-CKchelate reductase[Arabidopsis thaliana]449099gi|8439909|ref|NP_563791.1| proteinCKphosphatase 2C familyprotein/PP2C familyprotein4500100gi|15238701|ret|NP_197894.1|cytochromeCKDSP450 family protein[Arabidopsis thaliana]4511.00E−178100gi|21553567|gb|AAM62660.1|unknownCK[Arabidopsis thaliana]4520100gi|23198354|ref|NP_193140.1| selenium-CKHSbinding protein, putative[Arabidopsis thaliana]4530100gi|22655117|ref|NP_565411.2| calcium-CKCSHSSSLLdependent protein kinaseisoform 6 (CPK6)4544.00E−83100gi|21537110|ref|NP_197849.1| expressedCKLNprotein [Arabidopsisthaliana]4551.00E−122100gi|23397082|ref|NP_174418.1|CKPEGphotosystem I reactioncenter subunit III familyprotein4560100gi|30680112|ref|NP_179369.2|expressedCKprotein [Arabidopsisthaliana]4570100gi|4678321|ref|NP_190390.1| expressedCKprotein [Arabidopsisthaliana]4580100gi|3169180|ref|NP_179889.1| caseinCKkinase II alpha chain,putative [Arabidopsisthaliana]4590100gi|30725494|ref|NP_851182.1| proteinCKSSkinase family protein[Arabidopsis thaliana]4600100gi|26986957|ref|NP_742382.1|4-CKDSaminobutyrateaminotransferase[Pseudomonas putidaKT2440]461091gi|37526249|ref|NP_929593.1|4-CKaminobutyrateaminotransferase (gamma-amino-N-butyratetransaminase) (GABAtransaminase)462083gi|37527865|ref|NP_931210.1|glutamateCKsynthase [NADPH] smallchain (glutamate synthasebeta subunit) (NADPH-GOGAT) (GLTS beta chain)4630100gi|23308229|ref|NP_175649.1| glycosylCKPPhydrolase family 1 protein/beta-glucosidase, putative a4640100gi|30685484|ref|NP_849681.1|serine/threonineCKLLLNprotein phosphatase2A (PP2A) 55 kDaregulatory subunit B[Arabidopsis thaliana]465099gi|16080446|ref|NP_391273.1|phosphoglycerateCKPPkinase [Bacillussubtilis subsp. subtilis str.168]4660100gi|6729016|gb|AAF27012.1|putativeCKSAR DNA-binding protein-1[Arabidopsis thaliana]4670100gi|42569691|ref|NP_181253.2|transducinCKHSfamily protein/WD-40repeat family protein4685.00E−96100gi|38603990|ref|NP_177237.1| C2CKdomain-containing protein4690100gi|22136622|ref|NP_193273.1|CKPPcytochrome P450 familyprotein [Arabidopsisthaliana]4700100gi|22331742|ref|NP_190771.2|F-boxCKfamily protein/WD-40repeat family protein[Arabidopsis thaliana]4711.00E−162100gi|7021738|ref|NP_566517.1| expressedCKCSSSprotein [Arabidopsisthaliana]472098gi|10176804|dbj|BAB09992.1|serine/threonineCKSSLLprotein kinase-like[Arabidopsis thaliana]4730100gi|28827534|ref|NP_568971.1| leucine-CKrich repeat transmembraneprotein kinase, putative4740100gi|12321664|gb|AAG50866.1|proteinCKkinase, putative [Arabidopsisthaliana]4751.00E−177100gi|2651299|ref|NP_181590.1| proteinCKkinase family protein4760100gi|56381947|ref|NP_564161.1|CKSSphosphoglycerate/bisphosphoglyceratemutase familyprotein4770100gi|15075789|ref|NP_386871.1|CKPROBABLEPHOSPHOGLYCERATEKINASE PROTEIN478096gi|28872422|ref|NP_795041.1|glutamineCKCSsynthetase4790100gi|21595073|ref|NP_192046.1| mitogen-CKSSPEGactivated protein kinase,putative/MAPK, putative(MPK4)4800100gi|10174347|ref|NP_242595.1| glutamateCKsynthase (small subunit)4810100gi|16081086|ref|NP_391914.1|ornithineCKPPaminotransferase4820100gi|18400003|ref|NP_564469.1|WD-40CKHSSSLLPEGrepeat family protein[Arabidopsis thaliana]4830100gi|4559336|ref|NP_181783.1| proteinCKkinase family protein[Arabidopsis thaliana]4840100gi|11994319|ref|NP_188976.1| caseinCKCSPPSSPEGkinase, putative [Arabidopsisthaliana]4850100gi|23507789|ref|NP_176390.1|CKLNmitochondrial transcriptiontermination factor-related/mTERF-related [Arabidopsisthaliana]4861.00E−120100gi|10177463|dbj|BAB10854.1|unnamedCKLLprotein product [Arabidopsisthaliana]4870100gi|15221219|ref|NP_177573.1|proteinCKLLkinase, putative [Arabidopsisthaliana]4880100gi|15218854|ref|NP_174217.1|CBL-CKCSLLPEGinteracting protein kinase 18(CIPK18)4890100gi|22136780|ref|NP_189420.1|CKPPLLmonodehydroascorbatereductase, putative[Arabidopsis thaliana]4901.00E−132100gi|17104619|gb|AAL34198.1|putativeCKglutathione peroxidase[Arabidopsis thaliana]4910100gi|7268589|ref|NP_192114.1| sugarCKtransporter, putative[Arabidopsis thaliana]4920100gi|56381949|ref|NP_200733.2|sugarCKtransporter family protein[Arabidopsis thaliana]4937.00E−6083gi|31433365|ref|NP_922597.1|unknownCKSSLLprotein4941.00E−17179gi|34910712|ref|NP_916703.1|putativeCKnuclear RNA binding proteinA [Oryza sativa4951.00E−139100gi|51536564|ref|NP_189139.1|zinc fingerCKPP(C3HC4-type RING finger)family protein4961.00E−13698gi|22995109|ref|ZP_00039591.1|COG0149:CKPEGTriosephosphateisomerase497074gi|32488446|ref|XP_473139.1|CKOSJNBa0004N05.3 [Oryzasativa (japonica cultivar-group)]4980100gi|23126040|ref|ZP_00107950.1|COG0155:CKSulfite reductase, betasubunit (hemoprotein)[Nostoc punctiforme PCC73102]4990100gi|7287983|ref|NP_191594.1|CKPEGarmadillo/beta-catenin repeatfamily protein/F-boxfamily protein [Arabidopsisthaliana]5000100gi|9758604|dbj|BAB09237.1|beta-CKamylase [Arabidopsisthaliana]5010100gi|23463051|ref|NP_178065.1|inosine-5′-CKmonophosphatedehydrogenase [Arabidopsisthaliana]5021.00E−10077gi|55168250|gb|AAV44116.1|unknownCKprotein [Oryza sativa(japonica cultivar-group)]503083gi|50942175|ref|XP_480615.1|putativeCKaminoimidazolecarboximideribonucleotidetransformylase5041.00E−159100gi|10177927|ref|NP_199602.1|CKrespiratory burst oxidaseprotein D (RbohD)/NADPH oxidase5051.00E−17286gi|57899142|dbj|BAD87004.1|unknownCKprotein [Oryza sativa(japonica cultivar-group)]5060100gi|15223469|ref|NP_171679.1|proteinCKkinase family protein[Arabidopsis thaliana]5070100gi|3482918|ref|NP_172414.1|ATP-CKSScitrate synthase (ATP-citrate(pro-S-)-lyase/citratecleavage enzyme), putative5080100gi|23297150|ref|NP_567724.1|fibrillarinCK2 (FIB2) [Arabidopsisthaliana]5090100gi|7268631|ref|NP_193573.1|F-boxCKLLPEGfamily protein [Arabidopsisthaliana]5106.00E−8779gi|21553981|ref|NP_564267.1|peptidyl-CKprolyl cis-trans isomerasecyclophilin-type familyprotein [Arabidopsisthaliana]5111.00E−15168gi|34913784|ref|NP_918239.1|pectateCKlyase-like protein5124.00E−9175gi|53793523|dbj|BAD54684.1|unknownCKprotein [Oryza sativa(japonica cultivar-group)]5131.00E−4163gi|50948309|ref|XP_483682.1|putativeCKLLauxin induced protein5141.00E−5358gi|56202166|dbj|BAD73644.1|60SCKCSPEGribosomal protein L18A-like5157.00E−8593gi|34910948|dbj|BAB84492.1|nuclearCKmovement protein-like5160100gi|6513938|ref|NP_566157.1|nodulinCKfamily protein [Arabidopsisthaliana]5171.00E−171100gi|6957722|ref|NP_186908.1|delta 7-CKLLsterol-C5-desaturase,putative [Arabidopsisthaliana]5180100gi|17132051|ref|NP_486997.1|CKPPhypothetical protein alr2957[Nostoc sp. PCC 7120]519071gi|34911116|ref|NP_916905.1|putativeCKglyoxal oxidase5206.00E−2442gi|4803925|ref|NP_565499.1|expressedCKSSprotein [Arabidopsisthaliana]521081gi|50934555|ref|XP_476805.1|unknownCKSSLNPEGprotein5228.00E−94100gi|26453028|ref|NP_197956.1|expressedCKPPSSPEGprotein5231.00E−167100gi|9795142|emb|CAB67657.2|splicingCSPPfactor-like protein[Arabidopsis thaliana]ref|NP_190918.3|zincknuckle (CCHC-type)family protein [Arabidopsisthaliana]5241.00E−146100gi|6642639|ref|NP_187382.1|forkhead-CSPEGassociated domain-containing protein/FHAdomain-containing protein5250100gi|9279657|ref|NP_188451.1|CSPPHSLNocticosapeptide/Phox/Bem1p (PB1) domain-containingprotein [Arabidopsisthaliana]5260100gi|9759010|dbj|BAB09537.1|unnamedCSprotein product [Arabidopsisthaliana]5270100gi|18390671|ref|NP_563769.1|F-boxCSPPHSSSfamily protein5280100gi|12248033|ref|NP_172880.1|CSphytochrome kinase,putative [Arabidopsisthaliana]5290100gi|21554404|ref|NP_179646.1|DNAJCSPPSSheat shock family protein[Arabidopsis thaliana]5301.00E−111100gi|10177521|ref|NP_201441.1|dehydrinCSPEG(RAB18) rab18 protein -Arabidopsis thalianasp|P30185|DHR18_ARATHDehydrin Rab185310100gi|20260458|dbj|BAB70612.1|CSSSanthocyanin-relatedmembrane protein 15320100gi|21553568|ref|NP_849480.1|expressedCSPPPEGprotein5330100gi|21553616|ref|NP_568089.1|expressedCSprotein5341.00E−158100gi|21554409|ref|NP_174469.1|histidineCSbiosynthesis bifunctionalprotein (HISIE)[Arabidopsis thaliana]pir||T51812 phosphoribosyl-AMP cyclohydrolase (EC3.5.4.19)5351.00E−15668gi|50949063|ref|XP_493889.1|putativeCSprotein kinase [Oryza sativa]536076gi|31455393|emb|CAD92450.1|aminoCSSSacid permease 6 [Brassicanapus]5370100gi|6324421|sp|Q12068|GRE2_YEASTCSNADPH-dependentmethylglyoxal reductaseGRE2 (Genes de respuesta aestres protein 2)538099gi|18401915|ref|NP_566612.1|histoneCSdeacetylase family protein[Arabidopsis thaliana]5390100gi|30685903|ref|NP_851041.1|zinc fingerCSPEG(CCCH-type) family protein[Arabidopsis thaliana]5401.00E−13468gi|31432120|ref|NP_921503.1|putativeCSPPpolyprotein [Oryza sativa(japonica cultivar-group)]5410100gi|22136556|emb|CAB80464.1|CScinnamyl-alcoholdehydrogenase ELI3-2[Arabidopsis thaliana]5420100gi|7413528|ref|NP_196086.1|CScytochrome P450, putative[Arabidopsis thaliana]543099gi|42568440|ref|NP_199845.2|DNACSPEGrepair protein-related[Arabidopsis thaliana]5440100gi|30102478|ref|NP_195917.1|hydrolase,CSPEGalpha/beta fold familyprotein [Arabidopsisthaliana]5450100gi|21280881|ref|NP_176968.1|glycerateCSHSSSPEGdehydrogenase/NADH-dependent hydroxypyruvatereductase5460100gi|10177967|ref|NP_198904.1|WD-40CSPEGrepeat family protein/zfwd3protein (ZFWD3)[Arabidopsis thaliana]5470100gi|7269306|ref|NP_567705.1|ubiquitin-CSspecific protease 16, putative(UBP16)548099gi|6324066|sp|P53845|YN03_YEASTCSHypothetical 35.5 kDaprotein in PIK1-POL2intergenic region549099gi|6323074|ref|NP_013146.1|Microtubule-CSassociated protein (MAP)of the XMAP215/Dis1family; regulatesmicrotubule dynamicsduring spindle orientationand metaphase chromosomealignment; interacts withspindle pole bodycomponent Spc72p[Saccharomyces cerevisiae]5500100gi|6324990|ref|NP_015058.1|DicarboxylicCSPPamino acid permease,mediates high-affinity andhigh-capacity transport of L-glutamate and L-aspartate;also a transporter for Gln,Asn, Ser, Ala, and Gly[Saccharomyces cerevisiae]5510100gi|51013603|sp|P50947|YNJ7_YEASTCSPEGHypothetical 37.0 kDaprotein in RAS2-RPS7Bintergenic region5520100gi|6319413|ref|NP_009495.1|Shp1pCS[Saccharomyces cerevisiae]emb|CAA80789.1|YBLO515 [Saccharomycescerevisiae]emb|CAA84878.1|SHP15530100gi|22136824|ref|NP_197455.1|expressedCSPPprotein [Arabidopsisthaliana]5540100gi|22655036|ref|NP_566453.1|WD-40CSrepeat family protein[Arabidopsis thaliana]5550100gi|48771699|ref|ZP_00276041.1|COG1062:CSZn-dependent alcoholdehydrogenases, class III[Ralstonia metalliduransCH34]5560100gi|49176169|ref|NP_416429.3|D-cysteineCSdesulfhydrase, PLP-dependent enzyme cysteinedesulfhydrase, PLP-dependent enzyme5570100gi|9294512|gb|AAK21273.1|aberrantCSPPSSlateral root formation 5[Arabidopsis thaliana]ref|NP_566730.1|MATEefflux family protein[Arabidopsis thaliana]5580100gi|31711972|ref|NP_175641.1|proteinCSSSkinase family protein/C-type lectin domain-containing protein5598.00E−61100gi|21592962|ref|NP_564579.1|expressedCSSSprotein [Arabidopsisthaliana]5600100gi|30102452|ref|NP_195770.1|CSLLmitochondrial substratecarrier family protein[Arabidopsis thaliana]5610100gi|6056406|gb|AAF02870.1|HypotheticalCSprotein [Arabidopsisthaliana]5621.00E−143100gi|21689635|emb|CAB78986.1|lectin likeCSSSprotein [Arabidopsisthaliana]5630100gi|22135769|ref|NP_188127.1|CSoxidoreductase, zinc-bindingdehydrogenase familyprotein5646.00E−91100gi|21553433|ref|NP_199669.1|FK506-CSPEGbinding protein 2-2(FKBP15-2)/immunophilin/peptidyl-prolyl cis-transisomerase/rotamase5651.00E−151100gi|21592742|ref|NP_195534.1|expansin,CSPPputative (EXP20)5668.00E−64100gi|21537125|ref|NP_196975.1|expressedCSprotein [Arabidopsisthaliana]5671.00E−49100gi|16331623|ref|NP_442351.1|hypotheticalCSLNprotein ssl0353[Synechocystis sp. PCC6803]5680100gi|21618054|ref|NP_564383.1|ABCCStransporter family protein[Arabidopsis thaliana]5690100gi|7321037|ref|NP_192879.1|CSPPSSARID/BRIGHT DNA-binding domain-containingprotein/ELM2 domain-containing protein/Myb-like DNA-binding domain-containing protein5701.00E−44100gi|58299|emb|CAA48415.1|unnamedCSprotein product [syntheticconstruct]571099gi|10172815|dbj|BAB03922.1|glycineCSPEGbetaine aldehydedehydrogenase [Bacillushalodurans C-125]5721.00E−165100gi|3928103|ref|NP_181434.1|aquaporin,CSPPputative [Arabidopsisthaliana]573091gi|37524126|ref|NP_927470.1|phosphoenolpyruvateCScarboxykinase[ATP] [Photorhabdusluminescens subsp.laumondii TTO1]5740100gi|9293860|ref|NP_564217.1|lysine andCShistidine specific transporter,putative [Arabidopsisthaliana]5750100gi|10175358|ref|NP_243603.1|1-CSpyrroline-5-carboxylatedehydrogenase [Bacillushalodurans C-125]5760100gi|10175785|ref|NP_244029.1|pyruvateCSPPkinase [Bacillus haloduransC-125]5778.00E−45100gi|9758890|dbj|BAB09466.1|auxin-CSinduced protein-like[Arabidopsis thaliana]5781.00E−108100gi|17104761|ref|NP_193699.1|Ras-CSLNrelated GTP-binding protein,putative [Arabidopsisthaliana]5790100gi|5915859|sp|O22203|C98A3_ARATHCSCytochrome P450 98A35800100gi|6729015|ref|NP_187156.1|proteinCSSSPEGkinase family protein[Arabidopsis thaliana]5810100gi|21689671|ref|NP_195397.1|CSPPSStransporter-related[Arabidopsis thaliana]5820100gi|20148451|ref|NP_564797.1|flavin-CSSSPEGcontaining monooxygenasefamily protein/FMO familyprotein5830100gi|21280977|gb|AAM45011.1|putativeCSprotein kinase [Arabidopsisthaliana]5841.00E−11682gi|27452901|gb|AAO15285.1|PutativeCSdihydrodipicolinatereductase-like protein[Oryza sativa (japonicacultivar-group)]5851.00E−12399gi|23113896|ref|ZP_00099233.1|COG0036:CSPentose-5-phosphate-3-epimerase[Desulfitobacteriumhafniense DCB-2]5868.00E−9599gi|16330731||YCF3_SYNY3CSPhotosystem I assemblyprotein ycf35870100gi|21700831|ref|NP_191332.2|proteinCSDSSSPEGkinase, putative [Arabidopsisthaliana]5880100gi|4415911|gb|AAD20142.1|putativeCSpoly(A) binding protein[Arabidopsis thaliana]5890100gi|5733869|gb|AAD49757.1|Contains F-CSbox domain PF|00646.[Arabidopsis thaliana]5901.00E−149100gi|29824243|ref|NP_192170.1|tryptophanCSsynthase, alpha subunit,putative [Arabidopsisthaliana]5910100gi|26449849|ref|NP_198924.1|CSHSglycerophosphoryl diesterphosphodiesterase familyprotein592078gi|56783874|dbj|BAD81286.1|putativeCSdual specificity kinase 15930100gi|6321857|ref|NP_011933.1|Ssf1pCSDS[Saccharomyces cerevisiae]sp|P38789|SSF1_YEASTRibosome biogenesis proteinSSF15941.00E−18089gi|50906987|ref|XP_464982.1|lipase classCS3-like [Oryza sativa(japonica cultivar-group)]dbj|BAD21500.1|lipaseclass 3-like [Oryza sativa(japonica cultivar-group)]5950100gi|2558659|gb|AAB81672.1|putativeCSHSPEGprotein kinase [Arabidopsisthaliana]5960100gi|9758547|ref|NP_201503.1|expressedCSprotein [Arabidopsisthaliana]5970100gi|4678335|ref|NP_190376.1|L-CSSSgalactono-1,4-lactonedehydrogenase, putative5981.00E−173100gi|21553608|ref|NP_564800.1|expressedCSprotein [Arabidopsisthaliana]599079gi|50912531|ref|XP_467673.1|putativeCSDSSSPEGnicastrin [Oryza sativa(japonica cultivar-group)]600080gi|50898642|ref|XP_450109.1|nodulationCSPPSSreceptor kinase-like protein6015.00E−74100gi|30693828|ref|NP_850693.1|expressedDSprotein [Arabidopsisthaliana]6020100gi|25054896|ref|NP_179923.2|nicotinateDSphosphoribosyltransferasefamily protein/NAPRTasefamily protein [Arabidopsisthaliana]6031.00E−121100gi|6006854|gb|AAF00630.1|hypotheticalDSLNprotein [Arabidopsisthaliana]6040100gi|6862917|ref|NP_566273.1|heavy-DSmetal-associated domain-containing protein6050100gi|56382003|ref|NP_177794.1|12-DSoxophytodienoate reductase(OPR1) [Arabidopsisthaliana]6060100gi|21280825|gb|AAM45040.1|putativeDSAtMlo-h1 protein[Arabidopsis thaliana]607061gi|22136432|ref|NP_191438.2|glycosylDSSStransferase family 8 protein[Arabidopsis thaliana]608069gi|22136270|ref|NP_190235.1|DSarmadillo/beta-catenin repeatfamily protein/U-boxdomain-containing familyprotein6090100gi|4678377|ref|NP_193087.1|DSammonium transporter 1,member 1 (AMT1.1)6101.00E−14399gi|27754217|ref|NP_187413.1|expressedDSLNPEGprotein [Arabidopsisthaliana]6111.00E−123100gi|21554344|ref|NP_198627.1|ASF1-DSPPHSlike anti-silencing familyprotein [Arabidopsisthaliana]612077gi|25402857|pir||A86318proteinDSPPF15H18.11 [imported] -Arabidopsis thalianagb|AAF25996.1|F15H18.116131.00E−13999gi|16329950|ref|NP_440678.1|hypotheticalDSprotein slr1900[Synechocystis sp. PCC6803]6140100gi|30725526|gb|AAP37785.1|At4g24520DSSS[Arabidopsis thaliana]emb|CAB79362.1|NADPH-ferrihemoprotein reductaseATR1615098gi|3335345|gb|AAC27147.1|ContainsDSLLPEGsimilarity to ABCtransporter gb|1651790 fromSynechocystis sp.6161.00E−120100gi|38603932|ref|NP_193578.1|Ras-DSrelated GTP-binding protein,putative [Arabidopsisthaliana]6170100gi|15810337|ref|NP_565310.1|expressedDSPEGprotein [Arabidopsisthaliana]6180100gi|8978074|ref|NP_199518.1|proteinDSkinase, putative [Arabidopsisthaliana]619075gi|9294678|dbj|BAB03027.1|glutamine-DSfructose-6-phosphatetransaminase 2 [Arabidopsisthaliana]6200100gi|42567142|ref|NP_194265.2|EXSDSfamily protein/ERD1/XPR1/SYG1 familyprotein [Arabidopsisthaliana] gb|AAR99486.1|PHO1-like protein[Arabidopsis thaliana]6211.00E−15189gi|50938765|ref|XP_478910.1|serine/threonineDSSSPEGprotein kinase PKPA-like protein [Oryza sativa6220100gi|16323410|gb|AAD49980.1|Similar toHSgb|AF110333 PrMC3protein from Pinus radiataand is a member ofPF|00135 Carboxylesterasesfamily.6237.00E−86100gi|23397128|ref|NP_178620.1|glycine-HSSSrich protein (GRP)[Arabidopsis thaliana]6241.00E−104100gi|24417354|ref|NP_564833.1|expressedHSprotein [Arabidopsisthaliana]6251.00E−4857gi|21553397|ref|NP_180326.1|zincHSfinger (AN1-like) familyprotein [Arabidopsisthaliana]6260100gi|16130575|ref|NP_417147.1|succinate-HSsemialdehydedehydrogenase I, NADP-dependent [Escherichia coliK12]6270100gi|15218645|ref|NP_176713.1|cytochromeHSPEGP450, putative[Arabidopsis thaliana]6280100gi|16329269|ref|NP_439997.1|hypotheticalHSprotein slr0731[Synechocystis sp. PCC6803]629099gi|12322845|gb|AAG51407.1|putativeHScysteine synthase; 39489-37437[Arabidopsis thaliana]6301.00E−126100gi|28827686|gb|AAO50687.1|unknownHSPEGprotein [Arabidopsisthaliana]6311.00E−143100gi|23197714|ref|NP_196259.21|DNAJHSSSLLheat shock N-terminaldomain-containing protein6320100gi|21554091|gb|AAM63172.1|putativeLLintegral membrane protein[Arabidopsis thaliana]6330100gi|16329756|ref|NP_440484.1|formaldehydeLLdehydrogenase(glutathione)6340100gi|26450489|ref|NP_190148.1|LLtransducin family protein/WD-40 repeat family protein6350100gi|3980410|ref|NP_180485.1|lectinLLLNprotein kinase, putative[Arabidopsis thaliana]6361.00E−153100gi|6572061|ref|NP_190708.1|expressedLLLNprotein [Arabidopsisthaliana]6371.00E−141100gi|22328141|ref|NP_201402.2|heterogeneousLLLNnuclearribonucleoprotein, putative/hnRNP, putative6380100gi|15219657|ref|NP_176819.1|proteinLLkinase family protein[Arabidopsis thaliana]6395.00E−5355gi|34906632|ref|NP_914663.1|P0431G06.11LL[Oryza sativa (japonicacultivar-group)]6401.00E−169100gi|21674686|ref|NP_662751.1|6-LLphosphogluconatedehydrogenase,decarboxylating, putative[Chlorobium tepidum TLS]6411.00E−6996gi|32490260|ref|XP_473078.1|LLPEGOSJNBa0014K14.9 [Oryzasativa (japonica cultivar-group)]6420100gi|52354393|gb|AAU44517.1|hypotheticalLLPEGprotein AT4G22980[Arabidopsis thaliana]6432.00E−90100gi|9758347|ref|NP_200586.1|expressedLLLNprotein [Arabidopsisthaliana]644082gi|50946759|ref|XP_482907.1|putativeLLLNglycoprotein 3-alpha-L-fucosyltransferase6451.00E−13183gi|7269121|emb|CAB79230.1|predictedLLprotein [Arabidopsisthaliana]6461.00E−13088gi|50929089|ref|XP_474072.1|OSJNBb0079B02.14LLLN[Oryza sativa(japonica cultivar-group)]6470100gi|10176523|ref|NP_244766.1|acetyl-LLCoA:acetoacetyl-CoAtransferase [Bacillushalodurans C-125]6480100gi|34365735|ref|NP_566384.1|LLxanthine/uracil permeasefamily protein [Arabidopsisthaliana]6491.00E−84100gi|10177539|ref|NP_201459.1|expressedLLprotein [Arabidopsisthaliana]6501.00E−137100gi|21537398|ref|NP_201145.1|adenylateLNkinase [Arabidopsisthaliana]6510100gi|10177005|ref|NP_851136.1|LNcytochrome P450 familyprotein [Arabidopsisthaliana]652099gi|9758597|ref|NP_851196.1|outwardLNrectifying potassium channel(KCO1) [Arabidopsisthaliana]6531.00E−126100gi|20453405|ref|NP_180901.1|plastidLNdevelopmental protein DAG,putative [Arabidopsisthaliana]6540100gi|7378631|ref|NP_195967.1|LNserine/threonine proteinphosphatase 2A (PP2A)regulatory subunit B′(B′alpha)6550100gi|50058955|gb|AAT69222.1|hypotheticalLNprotein At2g30900[Arabidopsis thaliana]6563.00E−36100gi|26453058|] ref|NP_191804.1|LNexpressed protein[Arabidopsis thaliana]6570100gi|28973131|ref|NP_190313.1|LNphosphoinositide-specificphospholipase C familyprotein6580100gi|23297411|gb|AAN12963.1|enolase (2-LNphospho-D-glyceratehydroylase) [Arabidopsisthaliana]6591.00E−157100gi|22136938|ref|NP_566528.1|expressedLNprotein [Arabidopsisthaliana]6600100gi|9294186|ref|NP_566763.1|auxin-LNresponsive family protein[Arabidopsis thaliana]6610100gi|7269372|ref|NP_194252.1|LNtransporter, putative[Arabidopsis thaliana]6623.00E−93100gi|28393887|ref|NP_182046.1|auxin-LNresponsive protein-related[Arabidopsis thaliana]6630100gi|4725942|ref|NP_192980.1|trehalose-LN6-phosphate phosphatase,putative6640100gi|15074657|ref|NP_385829.1|LNPROBABLEAMINOTRANSFERASEPROTEIN6651.00E−17981gi|35215000|ref|NP_927371.1|LNglutathione dependentformaldehydedehydrogenase6661.00E−142100gi|23507799|ref|NP_565973.1|LOBLNdomain protein 16/lateralorgan boundaries domainprotein 16667099gi|48731145|ref|ZP_00264891.1|COG1012:LNNAD-dependent aldehydedehydrogenases[Pseudomonas fluorescensPfO-1]668099gi|15223033|ref|NP_177763.1|proteinLNkinase, putative [Arabidopsisthaliana]6690100gi|22137136|ref|NP_181639.1|RNALNrecognition motif (RRM)-containing protein6700100gi|50904773|ref|XP_463875.1|putativeLNiron-phytosiderophoretransporter protein yellowstripe 16715.00E−4048gi|15232064|ref|NP_189339.1|expressedLNprotein [Arabidopsisthaliana]672088gi|51535369|dbj|BAD37240.1|putativeLNphosphotyrosyl phosphataseactivator [Oryza sativa(japonica cultivar-group)]6738.00E−93100gi|21555349|ref|NP_190798.1|ATPLNsynthase D chain-related[Arabidopsis thaliana]6740100gi|9795597|ref|NP_173294.1|LNsulfotransferase familyprotein [Arabidopsisthaliana]6750100gi|42567054|ref|NP_194058.2|proteinLNkinase family protein[Arabidopsis thaliana]6760100gi|31711846|ref|NP_177412.1|cinnamyl-LNalcohol dehydrogenase,putative6770100gi|23308375|ref|NP_851141.1|RNALNrecognition motif (RRM)-containing protein6780100gi|7329658|emb|CAB82755.1|proteinLNkinase ATN1-like protein[Arabidopsis thaliana]6790100gi|22655386|ref|NP_197288.1|cationLNexchanger, putative (CAX7)[Arabidopsis thaliana]6801.00E−162100gi|20197230|gb|AAM14984.1|highLNaffinity K+ transporter(AtKUP1 AtKT1p)[Arabidopsis thaliana]6810100gi|15219169|ref|NP_175713.1|proteinLNkinase family protein[Arabidopsis thaliana]6820100gi|9758951|ref|NP_568809.2|proteinLNkinase family protein[Arabidopsis thaliana]6830100gi|7573368|ref|NP_196762.1|expressedLNprotein [Arabidopsisthaliana]6840100gi|20259017|ref|NP_200323.1|expressedLNprotein [Arabidopsisthaliana]6856.00E−80100gi|30793811|ref|NP_191519.1|DNA-LNPEGdirected RNA polymerase I,II, and III, putative6860100gi|7268609|ref|NP_193550.1|outwardLNrectifying potassiumchannel, putative (KCO6)6871.00E−11171gi|7270315|ref|NP_195093.1|L-LNgalactose dehydrogenase (L-GalDH) [Arabidopsisthaliana]6885.00E−98100gi|29824277|ref|NP_568016.1|expressedLNprotein [Arabidopsisthaliana]6891.00E−17069gi|7269325|emb|CAB79384.1|proteinLNkinase (AFC2) [Arabidopsisthaliana]6903.00E−9679gi|57899263|dbj|BAD87508.1|putativeLNcalcyclin-binding protein[Oryza sativa (japonicacultivar-group)]6911.00E−164100gi|11994637|emb|CAD44270.1|LNmonomeric G-protein[Arabidopsis thaliana]6921.00E−109100gi|7269377|ref|NP_194256.1|LNinvertase/pectinmethylesterase inhibitorfamily protein6931.00E−57100gi|7340686|ref|NP_195832.1|thylakoidLNmembrane one helix protein(OHP) [Arabidopsisthaliana]6940100gi|21689839|gb|AAM67563.1|putativePEGprotein transport proteinSEC12p [Arabidopsisthaliana]6951.00E−36100gi|10177015|ref|NP_199045.1|ubiquitinPEGfamily protein [Arabidopsisthaliana]6961.00E−106100gi|20466087|ref|NP_179457.1|zinc fingerPEG(C3HC4-type RING finger)family protein6970100gi|7267203|ref|NP_192355.1|aspartylPEGprotease family protein[Arabidopsis thaliana]6980100gi|46931342|ref|NP_850347.1|F-boxPEGfamily protein [Arabidopsisthaliana]699060gi|46931354|gb|AAT06481.1|At3g23540PEG[Arabidopsis thaliana]7001.00E−161100gi|21594556|gb|AAM66021.1|plasmaPEGmembrane intrinsic proteinSIMIP [Arabidopsisthaliana]7010100gi|902923|dbj|BAA07547.1|phosphoinositidePEGspecificphospholipase C[Arabidopsis thaliana]7020100gi|26449713|ref|NP_195154.2|transducinPEGfamily protein/WD-40repeat family protein703094gi|15232616|ref|NP_188176.1|phototropic-PEGresponsive NPH3 familyprotein [Arabidopsisthaliana]7040100gi|23296692|ref|NP_189177.1|forminPEGhomology 2 domain-containing protein/FH2domain-containing protein[Arabidopsis thaliana]7050100gi|51013899|ref|NP_015238.1|Ydc1pPEG[Saccharomyces cerevisiae]gb|AAB68212.1|Lpg21pgb|AAG22594.1|alkalineceramidase Ydc1p7060100gi|23297093|ref|NP_565508.1|fructose-PEGbisphosphate aldolase,putative [Arabidopsisthaliana]707099gi|6322722|ref|NP_012795.1|Pgm1pPEG[Saccharomyces cerevisiae]emb|CAA50895.1|phosphoglucomutase708099gi|10175338|ref|NP_243583.1|PEGglutaminase [Bacillushalodurans C-125]709094gi|7268277|ref|NP_193265.1|PEGcytochrome P450 familyprotein [Arabidopsisthaliana]7100100gi|6322296|sp|P38970|HAL5_YEASTPEGSerine/threonine-proteinkinase HAL57110100gi|17104779|ref|NP_564866.1|U-boxPEGdomain-containing protein712099gi|16080934|ref|NP_391762.1|aldehydePEGdehydrogenase [Bacillussubtilis subsp. subtilis str.168]713087gi|37524894|ref|NP_928238.1|glutamate-PEG1-semialdehyde 2,1-aminomutase [Photorhabdusluminescens subsp.laumondii TTO1]7140100gi|16077848|ref|NP_388662.1|trehalose-PEG6-phosphate hydrolase[Bacillus subtilis subsp.subtilis str. 168]7151.00E−81100gi|1922244|ref|NP_171654.1|latePEGembryogenesis abundantprotein, putative/LEAprotein, putative7161.00E−112100gi|7268505|ref|NP_193486.1|Ras-PEGrelated GTP-binding protein,putative [Arabidopsisthaliana]7173.00E−44100gi|21592597|ref|NP_563987.1|expressedPEGprotein [Arabidopsisthaliana] gb|AAF18498.1|Identical to gb|Y10291GAG1 protein7183.00E−9744gi|52353666|gb|AAU44232.1|hypotheticalPEGprotein [Oryza sativa(japonica cultivar-group)]7191.00E−17563gi|53850529|gb|AAC31235.1|PEGhypothetical protein[Arabidopsis thaliana]gb|AAT71954.1|7201.00E−13199gi|16330417|ref|NP_441145.1|hypotheticalPEGprotein sll1162[Synechocystis sp. PCC6803]7210100gi|17129862|ref|NP_484561.1|PEGfructokinase [Nostoc sp.PCC 7120]7220100gi|15293245|ref|NP_567109.1|COP9PEGsignalosome complexsubunit 1/CSN complexsubunit 1 (CSN1)/COP11protein (COP11)/FUSCAprotein (FUS6) [Arabidopsisthaliana]7230100gi|20259583|ref|NP_180431.1|beta-PEGketoacyl-CoA synthasefamily protein [Arabidopsisthaliana]7240100gi|20465963|ref|NP_189034.1|beta-PEGamylase, putative/1,4-alpha-D-glucanmaltohydrolase, putative[Arabidopsis thaliana]7254.00E−72100gi|32815911|ref|NP_201100.1|expressedPEGprotein [Arabidopsisthaliana]7261.00E−179100gi|8953402|ref|NP_196701.1|proteinPEGkinase-related [Arabidopsisthaliana]7270100gi|6522600|emb|CAB61965.1|1-PEGaminocyclopropane-1-carboxylic acid oxidase-likeprotein7281.00E−14585gi|57899179|dbj|BAD87231.1|membranePEGprotein-like [Oryza sativa(japonica cultivar-group)]729087gi|50909427|ref|XP_466202.1|putativePEGDNA J domain protein[Oryza sativa (japonicacultivar-group)]7300100gi|53828613|ref|NP_179479.1|proteinPEGkinase family protein[Arabidopsis thaliana]7311.00E−93100gi|23306374|ref|NP_200428.1|expressedPEGprotein [Arabidopsisthaliana]7321.00E−172100gi|56121884|ref|NP_178191.1|majorPEGintrinsic family protein/MIP family protein733098gi|24762199|gb|AAN64166.1|unknownPEGprotein [Arabidopsisthaliana]7340100gi|21230100|ref|NP_636017.1|phosphomannosePEGisomerase/GDP-mannose pyrophosphorylase7350100gi|10177277|dbj|BAB10630.1|glucose-6-PPphosphate isomerase,cytosolic [Arabidopsisthaliana]7363.00E−61100gi|21537079|ref|NP_564835.1|expressedPPHSprotein [Arabidopsisthaliana]7373.00E−91100gi|26452507|ref|NP_197149.1|PPPEGdimethylmenaquinonemethyltransferase familyprotein7380100gi|5734730|ref|NP_172592.1|glucosePPtransporter (STP1)[Arabidopsis thaliana]7391.00E−113100gi|5734748|ref|NP_564017.1|integralPPSSPEGmembrane family protein[Arabidopsis thaliana]7401.00E−153100gi|30693623|ref|NP_198871.2|expressedPPprotein [Arabidopsisthaliana]7410100gi|56381993|ref|NP_177188.1|PPSSPEGspermidine synthase 2(SPDSYN2)/putrescineaminopropyltransferase 27420100gi|4678359|emb|CAB41169.1|cytochromePPP450-like protein[Arabidopsis thaliana]7430100gi|15220055|ref|NP_173165.1|translationPPinitiation factor IF-2,chloroplast, putative[Arabidopsis thaliana]7441.00E−168100gi|21593654|ref|NP_181996.1|caseinPPLLLNPEGkinase II beta chain, putative[Arabidopsis thaliana]7450100gi|9758987|dbj|BAB09497.1|chloroplastPPnucleoid DNA-bindingprotein-like [Arabidopsisthaliana] ref|NP_199325.1|aspartyl protease familyprotein [Arabidopsisthaliana]7460100gi|12642918|ref|NP_180133.1|proteinPPSSphosphatase 2C, putative/PP2C, putative7470100gi|7270935|ref|NP_195661.1|PPcytochrome P450 familyprotein [Arabidopsisthaliana]7481.00E−138100gi|21554052|gb|AAM63133.1|deltaPPtonoplast integral proteindelta-TIP [Arabidopsisthaliana]749085gi|28872439|ref|NP_795058.1|phosphoglyceratePPmutase, 2,3-bisphosphoglycerate-independent7500100gi|25284116|pir||A95262probable formatePPPEGdehydrogenase (EC 1.2.1.2)alpha chain FdoG [imported] -SinoRhizobium meliloti(strain 1021) magaplasmidpSymA7511.00E−55100gi|7629997|ref|NP_190945.1|latePPSSPEGembryogenesis abundantprotein-related/LEAprotein-related7520100gi|23126243|ref|ZP_00108145.1|COG1523:PPSSType II secretorypathway, pullulanase PulAand related glycosidases[Nostoc punctiforme PCC73102]7530100gi|42563087|ref|NP_564975.2|CBSPPdomain-containing protein[Arabidopsis thaliana]7540100gi|17104597|ref|NP_566716.1|proteinPPSSLLkinase, putative [Arabidopsisthaliana]7550100gi|15810541|ref|NP_566876.3|proteinPPSSkinase family protein[Arabidopsis thaliana]7563.00E−8499gi|16331120|ref|NP_441848.1|hypotheticalPPprotein sll0359[Synechocystis sp. PCC6803]757099gi|4835235|emb|CAB42913.1|putativePPprotein [Arabidopsisthaliana] pir||T08405hypothetical proteinF18B3.120 - Arabidopsisthaliana7580100gi|18394553|ref|NP_564042.1|expressedPPprotein [Arabidopsisthaliana]759099gi|49176911|ref|NP_858035.1|mercuricPPreductase [unculturedbacterium]7600100gi|53794956|ref|ZP_00356067.1|COG4992:PPSSOrnithine/acetylornithineaminotransferase[Chloroflexus aurantiacus]7610100gi|6714476|ref|NP_186761.1|PPLLcystathionine gamma-synthase, chloroplast/O-succinylhomoserine (Thiol)-lyase (CGS)7620100gi|9755789|ref|NP_197266.1|expressedPPSSLLLNPEGprotein [Arabidopsisthaliana]7630100gi|15810649|pir||T48630 high affinityPPnitrate transporter-likeprotein - Arabidopsisthaliana7641.00E−129100gi|26451800|ref|NP_850901.1|expressedPPprotein [Arabidopsisthaliana]7654.00E−5891gi|50948565|ref|XP_483810.1|putativePPPEGBet1/Sft1-related SNARE(AtBS14a) [Oryza sativa7660100gi|7630064|emb|CAB88286.1|serine/threonine-PPspecific proteinkinase-like protein767092gi|50932239|ref|XP_475647.1|putativePPHSPEGglutaryl-CoA dehydrogenase7681.00E−10450gi|30684716|ref|NP_196906.2|expressedPPSSprotein [Arabidopsisthaliana]7692.00E−8789gi|50912633|ref|XP_467724.1|unknownPPPEGprotein [Oryza sativa(japonica cultivar-group)]7709.00E−1394gi|6682226|gb|AAF23278.1|unknownPPHSSSPEGprotein [Arabidopsisthaliana]7712.00E−9049gi|31430516|ref|NP_920131.1|putativePPMagnaporthe griseapathogenicity protein7720100gi|23297753|ref|NP_192381.1|calcium-SPHSPEGdependent protein kinase,putative/CDPK, putative7733.00E−54100gi|4417292|ref|NP_179791.1|expressedSPCKPPPEGprotein [Arabidopsisthaliana]7740100gi|20148423|ref|NP_177550.1|SPsulfotransferase familyprotein [Arabidopsisthaliana]7754.00E−88100gi|21554027|ref|NP_192830.1|SPSStranscriptional coactivatorp15 (PC4) family protein(KELP)7761.00E−100100gi|22136806|ref|NP_176726.1|floweringSPSSPEGlocus T protein (FT)[Arabidopsis thaliana]7776.00E−93100gi|21592464|ref|NP_565844.1|zinc fingerSPLN(AN1-like) family protein[Arabidopsis thaliana]7781.00E−101100gi|8778645|gb|AAF79653.1|F5O11.17SPLN[Arabidopsis thaliana]7792.00E−80100gi|28973123|ref|NP_196373.1|glycine-SPCKrich protein (GRP20)[Arabidopsis thaliana]7800100gi|11994377|ref|NP_188020.1|SPPPPEGexopolygalacturonase/galacturan 1,4-alpha-galacturonidase/pectinase7810100gi|20465689|ref|NP_200440.1|SPCKperoxisomal targeting signaltype 1 receptor (PEX5)7828.00E−51100gi|21536923|ref|NP_194774.1|glycine-SPCKrich protein [Arabidopsisthaliana]7831.00E−124100gi|30023686|ref|NP_197608.1|leucine-SPDSPEGrich repeat protein, putative[Arabidopsis thaliana]7843.00E−76100gi|26453292|ref|NP_563653.2|SPSSLNNPR1/NIM1-interactingprotein 1 (NIMIN-1)[Arabidopsis thaliana]7851.00E−90100gi|15220022|ref|NP_173727.1|C2SPCSPEGdomain-containing protein[Arabidopsis thaliana]7862.00E−97100gi|56479608|ref|NP_706078.2|hypoxanthineSPSSPEGphosphoribosyltransferase[Shigella flexneri 2a str.301] EDL933]7871.00E−13099gi|16331548|ref|NP_442276.1|hypotheticalSPSSprotein slr0630[Synechocystis sp. PCC6803]7880100gi|20259167|ref|NP_568786.1|proteinSPphosphatase 2C, putative/PP2C, putative7890100gi|6466946|ref|NP_974249.1|SPPEGprephenate dehydratasefamily protein [Arabidopsisthaliana]7901.00E−130100gi|20465721|ref|NP_174536.1|plastidSPCSPEGdevelopmental protein DAG,putative7911.00E−6384gi|10177620|dbj|BAB10767.1|unnamedSPSSprotein product [Arabidopsisthaliana]7921.00E−44100gi|58299|emb|CAA48415.1|unnamedSPCKPEGprotein product [syntheticconstruct]7931.00E−10854gi|3150411|ref|NP_180570.1|GCN5-SPCSrelated N-acetyltransferase(GNAT) family protein7940100gi|16331766|ref|NP_442494.1|aldehydeSPSSPEGdehydrogenase[Synechocystis sp. PCC6803]7951.00E−30100gi|7340711|emb|CAB82954.1|cytochromeSPCKCSPEGc oxidase subunit 5c-likeprotein [Arabidopsisthaliana]7961.00E−137100gi|5430750|ref|NP_175374.2|expressedSPCSHSPEGprotein [Arabidopsisthaliana]7971.00E−93100gi|19310667|ref|NP_188286.1|SPPPSStranslationally controlledtumor family protein[Arabidopsis thaliana]7980100gi|15293235|ref|NP_194825.1|CBL-SPLNinteracting protein kinase 6(CIPK6) [Arabidopsisthaliana]7991.00E−16562gi|4220476|gb|AAD12699.1|putativeSPribophorin I [Arabidopsisthaliana]8000100gi|20908080|ref|NP_200169.1|SSRabGAP/TBC domain-containing protein[Arabidopsis thaliana]8011.00E−6899gi|51449987|ref|NP_189210.2|SSPEGmyrcene/ocimene synthase,putative [Arabidopsisthaliana]8022.00E−94100gi|21592517|ref|NP_564266.1|expressedSSprotein [Arabidopsisthaliana]8030100gi|42572237|ref|NP_974213.1|ankyrinSSPEGrepeat family protein/regulator of chromosomecondensation (RCC1) familyprotein [Arabidopsisthaliana]8041.00E−115100gi|21593862|ref|NP_190632.1|kip-SSrelated protein 2 (KRP2)/cyclin-dependent kinaseinhibitor 2 (ICK2)/cdc2a-interacting protein[Arabidopsis thaliana]8050100gi|8843792|ref|NP_200680.1|PRLI-SSLNinteracting factor, putative[Arabidopsis thaliana]806090gi|21586474|gb|AAM55306.1|auxinSSinflux carrier protein[Medicago truncatula]807099gi|20799715|gb|Aref|NP_190468.1|SSAMP-dependent synthetaseand ligase family protein8080100gi|15293255|ref|NP_564424.1|PHDSSfinger family protein[Arabidopsis thaliana]8090100gi|24030409|ref|NP_194207.1|SSLNexpressed protein[Arabidopsis thaliana]8100100gi|6728973|ref|NP_186938.1|leucine-SSPEGrich repeat transmembraneprotein kinase, putative8111.00E−133100gi|27754697|gb|AAO22792.1|putativeSSPEGcytochrome coxidoreductase [Arabidopsisthaliana]812091gi|4539445|ref|NP_192805.1|SStranscription elongationfactor-related [Arabidopsisthaliana]8130100gi|23296600|ref|NP_568107.1|pseudo-SSresponse regulator 7(APRR7) [Arabidopsisthaliana]sp|Q93WK5|APRR7_ARATHTwo-component responseregulator-like APRR7(Pseudo-response regulator7)814099gi|16331392|ref|NP_442120.1|ribuloseSSbisphosphate carboxylaselarge subunit815099gi|16329673|ref|NP_440401.1|mercuricSSreductase [Synechocystis sp.PCC 6803]8160100gi|23506207|gb|AAN31115.1|At2g26250/SST1D16.11 [Arabidopsisthaliana] gb|AAG60062.1|putative beta-ketoacyl-CoAsynthase FIDDLEHEAD817099gi|6016708|ref|NP_186798.1|proteinSSkinase, putative [Arabidopsisthaliana]8180100gi|21593811|gb|AAM65778.1|putativeSSNADPH quinoneoxidoreductase [Arabidopsisthaliana]8190100gi|23125809|ref|ZP_00107727.1|COG0166:SSGlucose-6-phosphateisomerase [Nostocpunctiforme PCC 73102]8200100gi|16077353|ref|NP_388166.1|hypotheticalSSPEGprotein BSU02840[Bacillus subtilis subsp.subtilis str. 168]8211.00E−117100gi|7269793|ref|NP_194624.1|Rac-likeSSGTP-binding protein(ARAC7) [Arabidopsisthaliana]8221.00E−12896gi|27808548|gb|AAO24554.1|At1g61150SSPEG[Arabidopsis thaliana]823097gi|30684071|ref|NP_850128.1|proteinSSkinase family protein[Arabidopsis thaliana]8242.00E−7799gi|16330866|ref|NP_441594.1|hypotheticalSSLLprotein slr1273[Synechocystis sp. PCC6803]8251.00E−177100gi|23297050|ref|NP_194073.2|short-SSchaindehydrogenase/reductase(SDR) family protein8261.00E−109100gi|12083284|ref|NP_173437.1|Rac-likeSSGTP-binding protein(ARAC4)/Rho-like GTP-binding protein (ROP2)8270100gi|20465939|ref|NP_850393.1|sugarSStransporter family protein[Arabidopsis thaliana]828098gi|2661128|gb|AAC04613.1|arginaseSS[Glycine max] pir||T06222probable arginase (EC3.5.3.1) - soybeansp|O49046|ARGI_SOYBNArginase829064gi|40539114|gb|AAR87370.1|expressedSSprotein [Oryza sativa(japonica cultivar-group)]830091gi|52137615|emb|CAH40838.1|protein-SSO-fucosyltransferase 1[Saccharum officinarum]8310100gi|5733881|ref|NP_175261.1|G proteinSScoupled receptor-related[Arabidopsis thaliana]8328.00E−88100gi|15217930|ref|NP_176128.1|hypotheticalSSprotein [Arabidopsisthaliana]8331.00E−2975gi|20465895|ref|NP_974937.1|RNASSLNrecognition motif (RRM)-containing protein8345.00E−79100gi|3927837|ref|NP_180456.1|SSmitochondrial import innermembrane translocasesubunitTim17/Tim22/Tim23 familyprotein [Arabidopsisthaliana]8351.00E−115100gi|21554015|ref|NP_565766.1|glycolipidSStransfer protein-related[Arabidopsis thaliana]8361.00E−131100gi|26452816|ref|NP_193484.1|ubiquitinSScarboxyl-terminal hydrolase8371.00E−10299gi|10177705|ref|NP_199436.1|inwardSSLNrectifying potassium channel(KAT1)8381.00E−144100gi|7669950|ref|NP_190844.1|integralSSPEGmembrane Yip1 familyprotein [Arabidopsisthaliana]8390100gi|10177595|ref|NP_201509.1|proteinSSLNkinase family protein[Arabidopsis thaliana]8404.00E−27100gi|38566508|ref|NP_200137.1|SSarabinogalactan-protein,putative (AGP22)[Arabidopsis thaliana]8410100gi|18391117|ref|NP_563862.1|expressedSSPEGprotein [Arabidopsisthaliana]842092gi|50251322|dbj|BAD28194.1|putativeSSMFAP1 protein [Oryzasativa (japonica cultivar-group)]8431.00E−11295gi|3201969|gb|AAC19375.1|submergenceSSinduced protein 2A [Oryzasativa]8441.00E−7488gi|29371519|gb|AAO72703.1|unknownSS[Oryza sativa (japonicacultivar-group)]8450100gi|28416475|ref|NP_187911.1|SSPEGtransporter-related[Arabidopsis thaliana]8460100gi|21323473|ref|NP_599939.1|detergentSSLNsensitivity rescuer dtsR2[Corynebacteriumglutamicum ATCC 13032]847082gi|50928705|gb|AAQ14479.1|putativeSSaminotransferase [Oryzasativa]848099gi|3695005|gb|AAC63962.1|pyruvateSSdehydrogenase kinaseisoform 2; PDK2 [Zea mays]8490100gi|10175838|ref|NP_244081.1|SSPEGhypothetical protein BH3215[Bacillus halodurans C-125]8500100gi|33589668|ref|NP_564285.1|SSPEGcalmodulin-binding protein[Arabidopsis thaliana]


Trait Improvement Screens


DS-Improvement of drought tolerance identified by a soil drought stress tolerance screen: Drought or water deficit conditions impose mainly osmotic stress on plants. Plants are particularly vulnerable to drought during the flowering stage. The drought condition in the screening process disclosed in Example 1B started from the flowering time and was sustained to the end of harvesting. The present invention provides recombinant DNA that can improve the plant survival rate under such sustained drought condition. Exemplary recombinant DNA for conferring such drought tolerance are identified as such in Table 3. Such recombinant DNA may find particular use in generating transgenic plants that are tolerant to the drought condition imposed during flowering time and in other stages of the plant life cycle. As demonstrated from the model plant screen, in some embodiments of transgenic plants with trait-improving recombinant DNA grown under such sustained drought condition can also have increased total seed weight per plant in addition to the increased survival rate within a transgenic population, providing a higher yield potential as compared to control plants.


PEG-Improvement of drought tolerance identified by PEG induced osmotic stress tolerance screen: Various drought levels can be artificially induced by using various concentrations of polyethylene glycol (PEG) to produce different osmotic potentials (Pilon-Smits e.g., (1995) Plant Physiol. 107:125-130). Several physiological characteristics have been reported as being reliable indications for selection of plants possessing drought tolerance. These characteristics include the rate of seed germination and seedling growth. The traits can be assayed relatively easily by measuring the growth rate of seedling in PEG solution. Thus, a PEG-induced osmotic stress tolerance screen is a useful surrogate for drought tolerance screen. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in the PEG-induced osmotic stress tolerance screen can survive better drought conditions providing a higher yield potential as compared to control plants.


SS-Improvement of drought tolerance identified by high salinity stress tolerance screen: Three different factors are responsible for salt damages: (1) osmotic effects, (2) disturbances in the mineralization process, (3) toxic effects caused by the salt ions, e.g., inactivation of enzymes. While the first factor of salt stress results in the wilting of the plants that is similar to drought effect, the ionic aspect of salt stress is clearly distinct from drought. The present invention provides genes that help plants to maintain biomass, root growth, and/or plant development in high salinity conditions, which are identified as such in Table 3. Since osmotic effect is one of the major components of salt stress, which is common to the drought stress, trait-improving recombinant DNA identified in a high salinity stress tolerance screen can also provide transgenic crops with improved drought tolerance. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in a high salinity stress tolerance screen can survive better drought conditions and/or high salinity conditions providing a higher yield potential as compared to control plants.


HS-Improvement of drought tolerance identified by heat stress tolerance screen: Heat and drought stress often occur simultaneously, limiting plant growth. Heat stress can cause the reduction in photosynthesis rate, inhibition of leaf growth and osmotic potential in plants. Thus, genes identified by the present invention as heat stress tolerance conferring genes may also impart improved drought tolerance to plants. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in a heat stress tolerance screen can survive better heat stress conditions and/or drought conditions providing a higher yield potential as compared to control plants.


CK and CS-Improvement of tolerance to cold stress: Low temperature may immediately result in mechanical constraints, changes in activities of macromolecules, and reduced osmotic potential. In the present invention, two screening conditions, i.e., cold shock tolerance screen (CK) and cold germination tolerance screen (CS), were set up to look for transgenic plants that display visual growth advantage at lower temperature. In cold germination tolerance screen, the transgenic Arabidopsis plants were exposed to a constant temperature of 8° C. from planting until day 28 post plating. The trait-improving recombinant DNA identified by such screen are particular useful for the production of transgenic plant that can germinate more robustly in a cold temperature as compared to the wild type plants. In cold shock tolerance screen, the transgenic plants were first grown under the normal growth temperature of 22° C. until day 8 post plating, and subsequently were placed under 8° C. until day 28 post plating. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in a cold shock stress tolerance screen and/or a cold germination stress tolerance screen can survive better cold conditions providing a higher yield potential as compared to control plants.


Improvement of tolerance to multiple stresses: Different kinds of stresses often lead to identical or similar reaction in the plants. Genes that are activated or inactivated as a reaction to stress can either act directly in a way the genetic product reduces a specific stress, or they can act indirectly by activating other specific stress genes. By manipulating the activity of such regulatory genes, i.e., multiple stress tolerance genes, the plant can be enabled to react to different kinds of stresses. For examples, PEP SEQ ID NO: 459 can be used to improve both salt stress tolerance and cold stress tolerance in plants. Of particular interest, plants transformed with PEP SEQ ID NO: 440 can resist heat stress, salt stress and cold stress. In addition to these multiple stress tolerance genes, the stress tolerance conferring genes provided by the present invention may be used in combinations to generate transgenic plants that can resist multiple stress conditions.


PP-Improvement of early plant growth and development: It has been known in the art that to minimize the impact of disease on crop profitability, it is important to start the season with healthy vigorous plants. This means avoiding seed and seedling diseases, leading to increased nutrient uptake and increased yield potential. Traditionally early planting and applying fertilizer are the methods used for promoting early seedling vigor. In early development stage, plant embryos establish only the basic root-shoot axis, a cotyledon storage organ(s), and stem cell populations, called the root and shoot apical meristems, that continuously generate new organs throughout post-embryonic development. “Early growth and development” used herein encompasses the stages of seed imbibition through the early vegetative phase. The present invention provides genes that are useful to produce transgenic plants that have advantages in one or more processes including, but not limited to, germination, seedling vigor, root growth and root morphology under non-stressed conditions. The transgenic plants starting from a more robust seedling are less susceptible to the fungal and bacterial pathogens that attach germinating seeds and seedling. Furthermore, seedlings with advantage in root growth are more resistant to drought stress due to extensive and deeper root architecture. Therefore, it can be recognized by those skilled in the art that genes conferring the growth advantage in early stages to plants may also be used to generate transgenic plants that are more resistant to various stress conditions due to improved early plant development. The present invention provides such exemplary recombinant DNA that confer both the stress tolerance and growth advantages to plants, identified as such in Table 3, e.g., PEP SEQ ID NO: 529 which can improve the plant early growth and development, and impart salt and cold tolerance to plants. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in the early plant development screen can grow better under non-stress conditions and/or stress conditions providing a higher yield potential as compared to control plants.


SP-Improvement of late plant growth and development: “Late growth and development” used herein encompasses the stages of leaf development, flower production, and seed maturity. In certain embodiments, transgenic plants produced using genes that confer growth advantages to plants provided by the present invention, identified as such in Table 3, exhibit at least one phenotypic characteristics including, but not limited to, increased rosette radius, increased rosette dry weight, seed dry weight, silique dry weight, and silique length. On one hand, the rosette radius and rosette dry weight are used as the indexes of photosynthesis capacity, and thereby plant source strength and yield potential of a plant. On the other hand, the seed dry weight, silique dry weight and silique length are used as the indexes for plant sink strength, which are considered as the direct determinants of yield. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in the late development screen can grow better and/or have improved development during leaf development and seed maturation providing a higher yield potential as compared to control plants.


LL-Improvement of tolerance to shade stress identified in a low light screen: The effects of light on plant development are especially prominent at the seedling stage. Under normal light conditions with unobstructed direct light, a plant seeding develops according to a characteristic photomorphogenic pattern, in which plants have open and expanded cotyledons and short hypocotyls. Then the plant's energy is devoted to cotyledon and leaf development while longitudinal extension growth is minimized. Under low light condition where light quality and intensity are reduced by shading, obstruction or high population density, a seedling displays a shade-avoidance pattern, in which the seedling displays a reduced cotyledon expansion, and hypocotyls extension is greatly increased. As the result, a plant under low light condition increases significantly its stem length at the expanse of leaf, seed or fruit and storage organ development, thereby adversely affecting of yield. The present invention provides recombinant DNA that enable plants to have an attenuated shade avoidance response so that the source of plant can be contributed to reproductive growth efficiently, resulting higher yield as compared to the wild type plants. As demonstrated from the model plant screen, embodiments of transgenic plants with trait-improving recombinant DNA identified in a shade stress tolerance screen can have attenuated shade response under shade conditions providing a higher yield potential as compared to control plants. The transgenic plants generated by the present invention may be suitable for a higher density planting, thereby resulting increased yield per unit area.


LN-Improvement of Tolerance to Low Nitrogen Availability Stress


Nitrogen is a key factor in plant growth and crop yield. The metabolism, growth and development of plants are profoundly affected by their nitrogen supply. Restricted nitrogen supply alters shoot to root ratio, root development, activity of enzymes of primary metabolism and the rate of senescence (death) of older leaves. All field crops have a fundamental dependence on inorganic nitrogenous fertilizer. Since fertilizer is rapidly depleted from most soil types, it must be supplied to growing crops two or three times during the growing season. Enhanced nitrogen use efficiency by plants should enable crops cultivated under low nitrogen availability stress condition resulted from low fertilizer input or poor soil quality.


According to the present invention, transgenic plants generated using the recombinant nucleotides, which confer enhanced nitrogen use efficiency, identified as such in Table 3, exhibit one or more desirable traits including, but not limited to, increased seedling weight, greener leaves, increased number of rosette leaves, increased or decreased root length. One skilled in the art may recognize that the transgenic plants provided by the present invention with enhanced nitrogen use efficiency may also have altered amino acid or protein compositions, increased yield and/or better seed quality.


The transgenic plants of the present invention may be productively cultivated under low nitrogen growth conditions, i.e., nitrogen-poor soils and low nitrogen fertilizer inputs, which would cause the growth of wild type plants to cease or to be so diminished as to make the wild type plants practically useless. The transgenic plants also may be advantageously used to achieve earlier maturing, faster growing, and/or higher yielding crops and/or produce more nutritious foods and animal feedstocks when cultivated using nitrogen non-limiting growth conditions.


Stacked Traits: The present invention also encompasses transgenic plants with stacked engineered traits, e.g., a crop having an improved phenotype resulting from expression of a trait-improving recombinant DNA, in combination with herbicide and/or pest resistance traits. For example, genes of the current invention can be stacked with other traits of agronomic interest, such as a trait providing herbicide resistance, for example a RoundUp Ready® trait, or insect resistance, such as using a gene from Bacillus thuringensis to provide resistance against lepidopteran, coliopteran, homopteran, hemiopteran, and other insects. Herbicides for which resistance is useful in a plant include glyphosate herbicides, phosphinothricin herbicides, oxynil herbicides, imidazolinone herbicides, dinitroaniline herbicides, pyridine herbicides, sulfonylurea herbicides, bialaphos herbicides, sulfonamide herbicides and gluphosinate herbicides. To illustrate that the production of transgenic plants with herbicide resistance is a capability of those of ordinary skill in the art, reference is made to U.S. patent application publications 2003/0106096A1 and 2002/0112260A1 and U.S. Pat. Nos. 5,034,322; 5,776,760, 6,107,549 and 6,376,754, all of which are incorporated herein by reference. To illustrate that the production of transgenic plants with pest resistance is a capability of those of ordinary skill in the art reference is made to U.S. Pat. Nos. 5,250,515 and 5,880,275 which disclose plants expressing an endotoxin of Bacillus thuringiensis bacteria, to U.S. Pat. No. 6,506,599 which discloses control of invertebrates which feed on transgenic plants which express dsRNA for suppressing a target gene in the invertebrate, to U.S. Pat. No. 5,986,175 which discloses the control of viral pests by transgenic plants which express viral replicase, and to U.S. Patent Application Publication 2003/0150017 A1 which discloses control of pests by a transgenic plant which express a dsRNA targeted to suppressing a gene in the pest, all of which are incorporated herein by reference.


Once one recombinant DNA has been identified as conferring an improved trait of interest in transgenic Arabidopsis plants, several methods are available for using the sequence of that recombinant DNA and knowledge about the protein it encodes to identify homologs of that sequence from the same plant or different plant species or other organisms, e.g., bacteria and yeast. Thus, in one aspect, the invention provides methods for identifying a homologous gene with a DNA sequence homologous to any of SEQ ID NO: 1 through SEQ ID NO: 425, or a homologous protein with an amino acid sequence homologous to any of SEQ ID NO: 426 through SEQ ID NO: 850. In another aspect, the present invention provides the protein sequences of identified homologs for a sequence listed as SEQ ID NO: 851 through SEQ ID NO: 33634. In yet another aspect, the present invention also includes linking or associating one or more desired traits, or gene function with a homolog sequence provided herein.


The trait-improving recombinant DNA and methods of using such trait-improving recombinant DNA for generating transgenic plants with improved traits provided by the present invention are not limited to any particular plant species. Indeed, the plants according to the present invention may be of any plant species, i.e., may be monocotyledonous or dicotyledonous. Preferably, they will be agricultural useful plants, i.e., plants cultivated by man for purposes of food production or technical, particularly industrial applications. Of particular interest in the present invention are corn and soybean plants. The recombinant DNA constructs optimized for soybean transformation and recombinant DNA constructs optimized for corn transformation are provided by the present invention. Other plants of interest in the present invention for production of transgenic plants having improved traits include, without limitation, cotton, canola, wheat, sunflower, sorghum, alfalfa, barley, millet, rice, tobacco, fruit and vegetable crops, and turfgrass.


In certain embodiments, the present invention contemplates to use an orthologous gene in generating the transgenic plants with similarly improved traits as the transgenic Arabidopsis counterpart. Improved physiological properties in transgenic plants of the present invention may be confirmed in responses to stress conditions, for example in assays using imposed stress conditions to detect improved responses to drought stress, nitrogen deficiency, cold growing conditions, or alternatively, under naturally present stress conditions, for example under field conditions. Biomass measures may be made on greenhouse or field grown plants and may include such measurements as plant height, stem diameter, root and shoot dry weights, and, for corn plants, ear length and diameter.


Trait data on morphological changes may be collected by visual observation during the process of plant regeneration as well as in regenerated plants transferred to soil. Such trait data includes characteristics such as normal plants, bushy plants, taller plants, thicker stalks, narrow leaves, striped leaves, knotted phenotype, chlorosis, albino, anthocyanin production, or altered tassels, ears or roots. Other enhanced traits may be identified by measurements taken under field conditions, such as days to pollen shed, days to silking, leaf extension rate, chlorophyll content, leaf temperature, stand, seedling vigor, internode length, plant height, leaf number, leaf area, tillering, brace roots, stay green, stalk lodging, root lodging, plant health, barreness/prolificacy, green snap, and pest resistance. In addition, trait characteristics of harvested grain may be confirmed, including number of kernels per row on the ear, number of rows of kernels on the ear, kernel abortion, kernel weight, kernel size, kernel density and physical grain quality.


To confirm hybrid yield in transgenic corn plants expressing genes of the present invention, it may be desirable to test hybrids over multiple years at multiple locations in a geographical location where maize is conventionally grown, e.g., in Iowa, Illinois or other locations in the Midwestern United States, under “normal” field conditions as well as under stress conditions, e.g., under drought or population density stress.


Transgenic plants can be used to provide plant parts according to the invention for regeneration or tissue culture of cells or tissues containing the constructs described herein. Plant parts for these purposes can include leaves, stems, roots, flowers, tissues, epicotyl, meristems, hypocotyls, cotyledons, pollen, ovaries, cells and protoplasts, or any other portion of the plant which can be used to regenerate additional transgenic plants, cells, protoplasts or tissue culture. Seeds of transgenic plants are provided by this invention can be used to propagate more plants containing the trait-improving recombinant DNA constructs of this invention. These descendants are intended to be included in the scope of this invention if they contain a trait-improving recombinant DNA construct of this invention, whether or not these plants are selfed or crossed with different varieties of plants.


The various aspects of the invention are illustrated by means of the following examples which are in no way intended to limit the full breath and scope of claims.


EXAMPLES
Example 1
Identification of Recombinant DNA that Confers Improved Trait(s) to Plants

A. Expression Constructs for Arabidopsis Plant Transformation


Each gene of interest was amplified from a genomic or cDNA library using primers specific to sequences upstream and downstream of the coding region. Transformation vectors were prepared to constitutively transcribe DNA in either sense orientation (for enhanced protein expression) or anti-sense orientation (for endogenous gene suppression) under the control of an enhanced Cauliflower Mosaic Virus 35S promoter (U.S. Pat. No. 5,359,142) directly or indirectly (Moore, e.g., PNAS 95:376-381, 1998; Guyer, e.g., Genetics 149: 633-639, 1998; International patent application NO. PCT/EP98/07577). The transformation vectors also contain a bar gene as a selectable marker for resistance to glufosinate herbicide. The transformation of Arabidopsis plants was carried out using the vacuum infiltration method known in the art (Bethtold, e.g., Methods Mol. Biol. 82:259-66, 1998). Seeds harvested from the plants, named as T1 seeds, were subsequently grown in a glufosinate-containing selective medium to select for plants which were actually transformed and which produced T2 transgenic seed.


B. Soil Drought Tolerance Screen


This example describes a soil drought tolerance screen to identify Arabidopsis plants transformed with recombinant DNA that wilt less rapidly and/or produce higher seed yield when grown in soil under drought conditions


T2 seeds were sown in flats filled with Metro/Mix® 200 (The Scotts® Company, USA). Humidity domes were added to each flat and flats were assigned locations and placed in climate-controlled growth chambers. Plants were grown under a temperature regime of 22° C. at day and 20° C. at night, with a photoperiod of 16 hours and average light intensity of 170 μmol/m2/s. After the first true leaves appeared, humidity domes were removed. The plants were sprayed with glufosinate herbicide and put back in the growth chamber for 3 additional days. Flats were watered for 1 hour the week following the herbicide treatment. Watering was continued every seven days until the flower bud primordia became apparent, at which time plants were watered for the last time.


To identify drought tolerant plants, plants were evaluated for wilting response and seed yield. Beginning ten days after the last watering, plants were examined daily until 4 plants/line had wilted. In the next six days, plants were monitored for wilting response. Five drought scores were assigned according to the visual inspection of the phenotypes: 1 for healthy, 2 for dark green, 3 for wilting, 4 severe wilting, and 5 for dead. A score of 3 or higher was considered as wilted.


At the end of this assay, seed yield measured as seed weight per plant under the drought condition was characterized for the transgenic plants and their controls and analyzed as a quantitative response according to example 1M.


Two approaches were used for statistical analysis on the wilting response. First, the risk score was analyzed for wilting phenotype and treated as a qualitative response according to the example 1L. Alternatively, the survival analysis was carried out in which the proportions of wilted and non-wilted transgenic and control plants were compared over each of the six days under scoring and an overall log rank test was performed to compare the two survival curves using S-PLUS statistical software (S-PLUS 6, Guide to statistics, Insightful, Seattle, Wash., USA). Table 4 provides a list of recombinant DNA constructs that improve drought tolerance in transgenic plants.

TABLE 4PEPDroughtSeedTime toSEQscoreyieldwiltingIDConstructNominationDeltaP-DeltaP-Risk scoreP-NOIDIDOrientationmeanvaluemeanvaluemeanvalue60112116CGPG1112ANTI-0.1190.4960.7360.014−0.0261.000SENSE60213053CGPG1284ANTI-0.1220.5320.7850.028−0.1221.000SENSE60314733CGPG1640SENSE0.4690.016−0.2250.3630.2631.00060416132CGPG2136SENSE0.1490.5290.4720.046//60518276CGPG3542SENSE0.3200.0880.4330.2640.3621.00060670851CGPG1691SENSE0.2670.0650.4270.017//60770941CGPG4067SENSE0.2540.0560.5330.006//45071814CGPG4434SENSE0.2410.1860.7190.0200.3481.00060972326CGPG27SENSE0.3000.1490.6740.0360.1521.00060872134CGPG5335SENSE0.2390.1690.6130.0770.0101.00078372824CGPG4998SENSE0.2540.1800.6080.046//61072978CGPG3441SENSE0.4470.094−0.9730.071//61173619CGPG4375SENSE0.9900.022−0.2040.2830.9001.00061273737CGPG5176SENSE0.1620.2931.1260.0000.3331.00046073025CGPG5665SENSE0.0650.7310.4890.095//61374196CGPG6637SENSE0.1720.2301.5330.001//61474546CGPG661SENSE−0.0520.7270.7770.030−0.2261.00061574558CGPG869SENSE0.3440.072−0.1380.1170.0330.95561674643CGPG6159SENSE0.3690.042−1.2320.1250.5561.00061775234CGPG5931SENSE0.1570.038−0.0950.4790.0260.88161875278CGPG6282SENSE0.1100.2001.0270.0030.0570.78958775289CGPG6295SENSE0.6270.004−0.0740.2670.1960.88361975585CGPG7671SENSE0.4740.008−0.4520.259//62076063CGPG1574SENSE0.1050.6370.9850.0620.0231.00059376318CGPG8909SENSE0.5090.0091.0930.0050.4710.95259976891CGPG9034SENSE0.1860.1150.7960.0030.1281.00062177401CGPG9294SENSE−0.0720.6430.5540.061−0.0901.000


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference (p value, of the delta of a quantitative response or of the risk score of a qualitative response, is the probability that the observed difference between the transgenic plants and the reference occur by chance) If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


C. Heat Stress Tolerance Screen


Under high temperatures, Arabidopsis seedlings become chlorotic and root growth is inhibited. This example sets forth the heat stress tolerance screen to identify Arabidopsis plants transformed with the gene of interest that are more resistant to heat stress based on primarily their seedling weight and root growth under high temperature. T2 seeds were plated on ½ X MS salts, 1% phytagel, with 10 μg/ml BASTA (7 per plate with 2 control seeds; 9 seeds total per plate). Plates were placed at 4° C. for 3 days to stratify seeds. Plates were then incubated at room temperature for 3 hours and then held vertically for 11 additional days at temperature of 34° C. at day and 20° C. at night. Photoperiod was 16 h. Average light intensity was ˜140 μmol/m2/s. After 14 days of growth, plants were scored for glufosinate resistance, root length, final growth stage, visual color, and seedling fresh weight. A photograph of the whole plate was taken on day 14.


The seedling weight and root length were analyzed as quantitative responses according to example 1M. The final grow stage at day 14 was scored as success if 50% of the plants had reached 3 rosette leaves and size of leaves are greater than 1 mm (Boyes, e.g., (2001) The Plant Cell 13, 1499-1510). The growth stage data was analyzed as a qualitative response according to example 1L. Table 5 provides a list of recombinant DNA constructs that improve heat tolerance in transgenic plants.

TABLE 5RootGrowthSeedlinglengthstageweightPEPat day 14at day 14at day 14seqConstructNominationDeltaP-Risk scoreP-DeltaP-idIDIDOrientationmeanvaluemeanvaluemeanvalue77210923CGPG414ANTI-SENSE0.4430.032////52515419CGPG1788ANTI-SENSE0.3850.040−0.070  /0.4160.07652717903CGPG1908SENSE0.1250.091////62318330CGPG3336SENSE0.2740.1741.0140.3101.2790.01262418409CGPG3613SENSE0.1810.098////73618410CGPG3614SENSE0.3980.094////43217410CGPG2446SENSE0.1990.0670.2410.3040.8280.27162217808CGPG2435SENSE0.2660.040////43819760CGPG4065SENSE0.0150.771−0.111  /1.6110.00562519813CGPG4168SENSE0.1830.0880.6430.3440.8070.03844070510CGPG2488SENSE0.6070.0560.1710.1840.8630.02144370812CGPG607SENSE0.3190.0800.8820.3450.9340.05544771446CGPG185SENSE0.4270.1100.1840.1600.4700.06845272005CGPG5253SENSE0.2090.0980.3310.5861.1780.12254572603CGPG646SENSE0.4760.029−0.170  0.1491.2720.18561173619CGPG4375SENSE0.3680.0580.8530.2410.7460.10246773662CGPG4927SENSE0.2050.053////45372026CGPG5231SENSE0.5890.012////62673580CGPG6517SENSE−0.0770.602−0.195  /1.1060.05262874124CGPG6631SENSE0.2560.0550.4600.0130.5120.11062773935CGPG993SENSE0.2840.0891.1020.2621.0980.11548274707CGPG4914SENSE0.0770.6800.0990.7450.7730.05862974725CGPG5393SENSE0.3510.038////59176220CGPG6189SENSE0.3810.039////79675910CGPG1256SENSE0.5140.081////63076403CGPG1313SENSE0.1580.075−0.033  0.8640.2520.58159576513CGPG5892SENSE0.2810.051////63177366CGPG8150SENSE0.2590.044−0.116  0.0580.3540.36976777115CGPG9135SENSE0.2690.094////77077159CGPG9202SENSE0.1700.502−0.014  0.8870.3890.048


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


D. Salt Stress Tolerance Screen


This example sets forth the high salinity stress screen to identify Arabidopsis plants transformed with the gene of interest that are tolerant to high levels of salt based on their rate of development, root growth and chlorophyll accumulation under high salt conditions.


T2 seeds were plated on glufosinate selection plates containing 90 mM NaCl and grown under standard light and temperature conditions. All seedlings used in the experiment were grown at a temperature of 22° C. at day and 20° C. at night, a 16-hour photoperiod, an average light intensity of approximately 120 umol/m2. On day 11, plants were measured for primary root length. After 3 more days of growth (day 14), plants were scored for transgenic status, primary root length, growth stage, visual color, and the seedlings were pooled for fresh weight measurement. A photograph of the whole plate was also taken on day 14.


The seedling weight and root length were analyzed as quantitative responses according to example 1M. The final growth stage at day 14 was scored as success if 50% of the plants reached 3 rosette leaves and size of leaves are greater than 1 mm (Boyes, D.C., et al., (2001), The Plant Cell 13, 1499/1510). The growth stage data was analyzed as a qualitative response according to example 1L. Table 6 provides a list of recombinant DNA constructs that improve high salinity tolerance in transgenic plants

TABLE 6RootRootGrowthSeedlingPEPlengthlengthstageweightSEQat day 11at day 14at day 14at day 14IDConstructDeltaP-DeltaP-RSP-DeltaP-NOIDOrientationmeanvaluemeanvaluemeanvaluemeanvalue80016023SENSE0.3330.0070.3080.0050.4000.3110.4690.03380116424ANTI-0.1930.0590.1840.0330.3120.4700.3030.068SENSE52717903SENSE0.0940.0340.0280.4321.1090.2830.1850.04152918284SENSE0.1830.0630.2900.046−0.0430.7390.5840.10062318330SENSE0.5830.0180.5840.0220.6720.0631.2670.03953118349SENSE0.1910.0460.2180.107−0.0130.7260.1470.62880217807SENSE0.1620.1750.2380.0941.4120.0810.6060.04377518214SENSE0.2330.0670.3060.0320.6080.1280.8590.03543318401SENSE0.2290.1170.2610.1611.7210.1380.6910.05773919624SENSE0.1500.0710.0740.2850.0710.3480.1630.08653619944SENSE0.2000.0180.2840.1060.7040.3560.5780.03877670236SENSE0.2650.0000.2620.0191.3870.2800.6280.02744070510SENSE0.0370.7630.0210.7991.3290.0200.2200.01144170605SENSE0.3410.0580.3900.0360.8020.0020.6680.00880370802SENSE0.2440.0250.2210.0300.9480.2750.5400.03544370812SENSE0.2290.1500.2590.0360.6440.1810.4830.05480470817SENSE0.3460.0080.3220.0100.4170.1470.6130.05080570819SENSE0.0180.8460.1000.108−0.2500.0090.0620.09780670908SENSE0.2840.0700.3270.0280.9550.1580.9350.00244570939SENSE0.2410.0250.1620.0011.4140.1430.3760.05660770941SENSE0.6480.0080.6560.0023.7130.0061.0610.00974171215SENSE0.2530.0230.1930.0262.1550.0370.5610.01944771446SENSE0.2880.0140.3120.0141.4900.0880.7940.01180772351SENSE0.2340.0260.2130.0001.0410.1340.2470.06254572603SENSE0.6880.0070.5700.0070.8540.0701.0290.02845972983SENSE0.1130.1980.1190.0791.0720.5760.3040.08145372026SENSE0.2220.1490.2550.0543.7060.0060.7150.05381073709SENSE0.1800.0550.1730.0550.2090.3420.4930.11455773750SENSE0.1840.0370.1680.0210.1110.5850.5730.00255873751SENSE0.0390.7650.1290.1011.9610.0910.4870.05578673578SENSE0.1940.0040.1030.0080.3300.137−0.0170.92981373851SENSE0.6440.0050.5000.0003.0970.0260.9550.00578774112SENSE0.3900.0070.2570.0010.6360.1780.3370.15681474156SENSE0.3950.0010.3280.0040.8290.0790.4540.00081574194SENSE0.0620.3520.0570.2180.9950.0820.1730.04056273929SENSE0.2110.3110.2450.1250.8220.1110.6210.07181674252SENSE0.6780.0100.6040.0181.6270.0271.3300.02856974315SENSE−0.0870.327−0.0420.4720.0880.6000.2820.09481774317SENSE0.2270.0820.2050.1223.1900.0590.4240.01147674388SENSE0.1850.1120.1400.0841.7500.1100.5330.03281974506SENSE0.2110.0320.2200.0381.3290.2030.5250.01647974522SENSE0.2620.0220.1880.0760.5620.2650.1840.49482074540SENSE0.3180.0660.2190.0940.1370.6100.2990.31675274577SENSE0.2160.1440.1830.0411.1050.2960.5180.02080873087SENSE0.1250.4660.2000.0590.0670.3980.4460.04381874328SENSE0.0690.3090.0920.0050.5440.4420.3540.00082174650SENSE0.0150.6570.0080.9191.3070.0310.3520.02661474546SENSE0.3180.0030.3070.0011.7090.0710.5240.00582474956SENSE0.1940.0050.1990.0641.5530.0630.4550.02475174556SENSE0.0060.9520.1090.2120.3340.2830.4190.08079174393SENSE0.4100.0230.3840.0570.6560.0020.6960.02082675255SENSE−0.0020.9840.0330.6890.5180.5040.4900.09382775269SENSE0.1220.0950.0950.2781.5200.2920.5190.04658775289SENSE0.2180.0010.2130.0242.9260.1120.5430.02180973223SENSE0.3400.1400.2490.1790.4200.3510.7290.07378473283SENSE0.1280.2920.3200.200−0.0060.6820.03281173804SENSE0.1740.0690.0900.3280.1400.5360.0950.60955973814SENSE0.1350.1000.1850.0820.8000.2930.3750.10481273819SENSE0.3440.0170.3210.0212.1590.1170.6720.02747173863SENSE0.3350.0300.2510.0411.9050.2200.5630.05047274059SENSE0.2650.0390.2930.022−0.1460.0350.2980.16174674080SENSE0.1070.5090.1300.1331.1010.2930.5840.07148274707SENSE0.5440.0290.6230.0130.9660.1041.1220.00782274718SENSE0.2890.1110.3820.0210.6480.4040.6460.11258074733SENSE0.1520.1380.1290.0950.1900.6850.2200.33882374745SENSE0.1660.0350.1230.2401.1020.1000.5800.01458174749SENSE0.4440.0480.4780.0100.7960.1731.0760.00858274752SENSE0.2600.3100.4500.0880.4670.2840.9470.02275474753SENSE−0.0580.2350.0820.3260.3300.00575574754SENSE0.2300.0880.2280.0770.9660.3530.4070.11848474755SENSE0.0700.574−0.0200.4700.2650.2080.4550.07349375355SENSE0.2900.0570.3050.0000.4400.4250.3600.30182875432SENSE0.2490.0200.1780.1000.4750.1960.2160.45482975468SENSE0.1460.3440.1850.0541.3730.2680.3110.14179475752SENSE0.4020.0580.3940.0490.5210.3350.6720.02776075931SENSE0.1040.2390.0740.4331.1210.2090.3210.06983276064SENSE0.3490.0110.2700.0460.1520.3600.2500.29579776101SENSE0.5270.0640.5420.1070.7240.3671.4800.02483376121SENSE−0.0260.8490.0190.8292.8410.0590.0690.82283476193SENSE0.2620.0200.2540.0210.8340.0970.4410.05483576237SENSE0.3340.0980.3650.0470.7270.0420.7860.13783676281SENSE0.1670.1540.1810.1840.5720.0960.5980.09276276293SENSE0.0420.763−0.0440.7760.7690.2700.6100.01783175911SENSE0.1790.0410.1430.0831.5080.2660.4640.00183075686SENSE0.3780.0360.3580.0391.0390.0570.6660.01483776388SENSE0.2090.0540.1630.1360.8530.3670.4850.01383876557SENSE0.1650.0420.1040.0951.9430.0270.3950.03183976567SENSE−0.0460.5510.0670.1961.6850.1080.2730.05350776575SENSE0.1010.3020.1830.0670.1210.4960.2980.30684076624SENSE0.3990.0470.3670.0410.6520.1220.6220.06959776719SENSE0.2290.0510.1400.3170.7260.3300.4870.08384176764SENSE0.4820.0360.3760.0362.2590.1630.6700.12782575076SENSE0.1860.1390.2330.0162.1970.0620.5380.03184376976SENSE0.4550.0130.3750.0443.4990.0200.7370.00784476985SENSE0.3130.0500.2990.0373.3100.0410.5870.04884276855SENSE0.4630.0260.6360.0100.1880.0201.3340.00159976891SENSE0.3250.0000.3680.0030.4450.0870.7950.00384577014SENSE0.0050.889−0.0400.5970.0240.9320.2650.09852277351SENSE0.3920.0480.4540.0142.9120.0521.3400.00663177366SENSE0.2140.0070.1590.0170.2050.0760.3820.02984677112SENSE−0.0010.9860.0200.8011.6400.0200.5830.02584777117SENSE0.2190.0580.2340.1161.4060.1720.5160.07276877128SENSE0.2390.0200.3120.0190.3240.5230.4700.02852077135SENSE0.1530.0700.1330.0590.3860.5030.3440.06484877138SENSE0.1710.0300.2710.0121.2110.2560.4820.03577077159SENSE0.1880.1150.2200.1101.1760.3810.5890.03960077208SENSE0.3210.0750.2540.1540.1830.5780.5400.05952177219SENSE0.2970.0900.4130.0182.2990.0241.0100.01484977226SENSE0.1640.1930.0980.4131.2790.1420.2640.04362177401SENSE0.3170.0680.2740.0181.1340.2580.5020.21385077418SENSE0.2040.0960.2010.1100.3440.1840.6260.005


If <0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


E. Polyethylene Glycol (PEG) Induced Osmotic Stress Tolerance Screen


There are numerous factors, which can influence seed germination and subsequent seedling growth, one being the availability of water. Genes, which can directly affect the success rate of germination and early seedling growth, are potentially useful agronomic traits for improving the germination and growth of crop plants under drought stress. In this assay, PEG was used to induce osmotic stress on germinating transgenic lines of Arabidopsis thaliana seeds in order to screen for osmotically resistant seed lines.


T2 seeds were plated on BASTA selection plates containing 3% PEG and grown under standard light and temperature conditions. Seeds were plated on each plate containing 3% PEG, ½ X MS salts, 1% phytagel, and 10 μg/ml glufosinate. Plates were placed at 4° C. for 3 days to stratify seeds. On day 11, plants were measured for primary root length. After 3 more days of growth, i.e., at day 14, plants were scored for transgenic status, primary root length, growth stage, visual color, and the seedlings were pooled for fresh weight measurement. A photograph of the whole plate was taken on day 14.


Seedling weight and root length were analyzed as quantitative responses according to example 1M. The final growth stage at day 14 was scored as success or failure based on whether the plants reached 3 rosette leaves and size of leaves are greater than 1 mm. The growth stage data was analyzed as a qualitative response according to example 1L. Table 7 provides a list of recombinant DNA constructs that improve osmotic stress tolerance in transgenic plants.

TABLE 7RootRootGrowthSeedlingPEPlengthlengthstageweightSEQat day 11at day 14at day 14at day 14IDConstructDeltaP-DeltaP-RSP-DeltaP-NOIDOrientationmeanvaluemeanvaluemeanvaluemeanvalue69410443ANTI-0.1620.2510.1270.1183.4750.0220.5470.157SENSE77210923ANTI-0.2350.0330.3320.0103.4960.0200.2840.098SENSE52414835ANTI-0.1950.1340.1260.1731.4770.3620.3000.050SENSE77314836SENSE0.2100.1590.0370.7072.8670.0400.2050.33280116424ANTI-0.3090.0550.2010.2102.2610.1500.3750.012SENSE53018332SENSE0.1290.0950.1410.0282.2050.1340.0620.37973718536SENSE0.2140.0360.2280.0653.0620.0820.3230.15369618537SENSE0.2060.0300.1970.0342.6410.0760.3990.11269718831SENSE0.0020.9880.0300.8283.2430.050−0.1700.13469819044ANTI-0.3000.0820.2200.0062.4650.1130.2840.113SENSE69517814SENSE0.2150.0560.0570.5424.0000.0000.5730.00443318401SENSE0.2290.0170.1430.2521.6700.0650.5030.00453218623SENSE0.2460.0280.2050.0501.8360.0020.3560.00973919624SENSE0.0880.232−0.0110.8671.3170.0750.4580.13569919950SENSE0.2250.1190.2140.0723.3930.0300.3350.07870070221SENSE0.1480.1790.0760.1523.1830.0600.3510.08877670236SENSE0.0940.4890.1150.4142.3680.0330.1160.17944170605SENSE0.4230.0200.3550.0423.0870.0210.5880.07580370802SENSE0.1820.3450.0890.5963.2280.0530.1830.47778070804SENSE0.2440.0530.1080.2072.5140.2330.4340.01153970830SENSE0.2260.0300.0640.3870.9140.3540.2720.02844570939SENSE0.0860.396−0.0400.5083.3900.0310.3350.05870171110SENSE−0.2060.153−0.1640.2121.4100.018−0.3180.07974171215SENSE0.3370.0250.1520.0472.9790.0000.5660.03244771446SENSE0.2630.0030.1290.1552.6720.0570.5000.01170271718SENSE0.1970.0310.2570.0363.3100.0410.1890.13770371838SENSE0.2340.0390.2600.0163.1640.0630.0790.58370472087SENSE0.2420.0520.2160.0604.0000.0000.2990.17754372352SENSE0.1940.1850.2590.0531.6660.0690.2270.08054472413SENSE0.0820.2410.1580.0243.1960.058−0.0400.74854572603SENSE0.1780.1970.1800.2083.0930.0760.1270.48945572637SENSE0.2470.0020.1970.0173.2740.0460.3480.04154672676SENSE0.1410.0150.1810.0944.0000.0000.3010.01770572729SENSE0.4020.0170.2420.0012.1190.0900.5680.00578372824SENSE0.1660.2810.3490.0712.3550.1070.2780.21261072978SENSE0.1900.1120.3040.0321.3280.494−0.0470.65470673302SENSE0.4000.0010.3460.0263.4890.0210.4970.01074473666SENSE0.3610.0170.1770.2583.4880.0210.7570.00881073709SENSE0.2220.0410.1300.2582.1040.1060.4260.00970973736SENSE0.2750.0260.2100.0062.5360.1150.3780.11255173031SENSE0.0950.1090.2010.0333.1220.0190.1780.26270773484SENSE0.3650.1040.1850.3152.9800.0290.4890.01778673578SENSE0.1130.1180.1250.2092.5160.0830.1560.25170873594SENSE0.0870.6250.1830.2011.2300.0410.1400.23262773935SENSE0.3780.0000.2610.0373.4840.0210.4890.02371174266SENSE−0.0050.943−0.0500.6482.5710.0740.2870.07478974278SENSE0.1450.2990.1340.4523.6660.0080.3500.14079074359SENSE0.3650.0260.2910.1122.2200.0530.3510.03079274414SENSE0.3100.0350.2720.0542.5150.0370.3230.01547974522SENSE0.2240.0770.1690.0162.5470.0860.1780.33375074539SENSE0.3270.1020.1970.3713.3560.0120.4960.01182074540SENSE0.2900.0070.3760.0133.3500.0360.4580.09671474574SENSE0.2360.0960.1550.0532.4480.2550.1790.47571574607SENSE0.2740.0070.2850.0042.7830.0530.3410.00771674642SENSE0.3860.0220.2730.0892.3080.0090.3780.14161574558SENSE0.1840.0120.0530.5282.0520.0360.0970.25975174556SENSE0.2580.1860.1600.3622.2530.0420.4770.05948875231SENSE0.0350.5550.0610.5773.3500.0360.2530.05961775234SENSE0.0530.3100.0180.2403.2730.0460.1970.09658775289SENSE0.2260.0320.2450.0193.4580.0240.5060.05278573292SENSE−0.0030.9560.0080.7462.0880.0790.2740.12081173804SENSE0.3050.0510.2730.1702.2500.1500.3770.02171775015SENSE−0.0360.360−0.0650.1673.3620.0340.1110.22771074016SENSE0.1760.0190.2070.0072.8980.0350.2550.13356474072SENSE0.0250.5220.0270.6993.3040.0420.1090.27471274479SENSE0.0590.3620.1520.1842.2260.1300.2260.07657174490SENSE0.0020.9560.0970.4053.3320.0380.2680.13671374549SENSE0.1520.0270.0690.2543.3920.0310.2440.05648274707SENSE0.3220.0020.1480.0402.2280.1640.3920.02182274718SENSE0.2480.1760.1880.3203.1800.0610.6300.09158074733SENSE0.2070.0210.2160.0364.0000.0000.1560.06258274752SENSE−0.0200.679−0.0350.6161.1810.1820.1880.05448474755SENSE0.5170.0020.3860.0164.0000.0000.6860.00571875380SENSE0.0380.789−0.0450.7893.3360.0370.1350.40172075765SENSE0.2560.1820.1650.1852.4810.2440.4450.03579575867SENSE−0.4300.150−0.1920.3210.7080.082−0.3150.19371975560SENSE0.0890.0120.1070.2811.8050.1440.1250.41679475752SENSE0.2360.0220.2170.0032.8720.0360.3320.01772175977SENSE0.1480.0190.1690.0123.3540.0350.0690.61549975984SENSE0.3890.0490.2370.1992.5810.0980.5010.06472276052SENSE0.1330.1350.0950.2013.2910.0430.2740.26664276061SENSE0.0060.9450.0320.6011.8600.0020.0360.60672376109SENSE0.2840.0260.1620.0822.4270.2630.4810.00972476116SENSE0.2640.0660.2180.0181.9730.2200.3040.30972576158SENSE0.2300.0160.2790.0512.2170.160−0.0380.03472676232SENSE0.1700.1330.1590.0711.6260.2030.2940.06072776269SENSE0.2810.0300.3040.0343.1310.0690.2430.09649675907SENSE0.1960.0630.0630.3240.8750.4620.2590.12279675910SENSE0.1960.1350.2460.0252.5910.0690.3570.00964175975SENSE0.2850.1140.3070.2563.0320.0880.4850.06376276293SENSE0.2120.059−0.0180.6023.4560.0240.5250.05372876303SENSE−0.0230.6770.1680.0072.5560.0790.0760.17172976474SENSE0.2860.0380.2770.0463.5900.0130.4240.04763076403SENSE0.1630.0470.1100.0852.7680.1540.1850.31873076529SENSE0.2320.2130.3820.0402.9890.098−0.0840.59183876557SENSE0.0560.4610.0460.6753.3760.0330.2490.04559576513SENSE0.1200.0720.1260.0621.8670.2920.0350.46373176625SENSE0.0910.0420.1230.0113.1840.0600.2000.02450976741SENSE−0.0220.7600.2260.0182.1360.002−0.2310.02673276759SENSE−0.1050.298−0.1150.2332.6470.060−0.0370.67284176764SENSE0.0820.410−0.0220.8624.0000.0000.2300.04551476917SENSE0.3050.0730.2750.0663.2940.0430.3870.02776576804SENSE0.1230.1310.0720.4622.7660.0460.4680.02859976891SENSE0.3770.0140.3180.0584.0000.0000.6760.03684577014SENSE0.0100.6730.0990.0502.1380.175−0.0630.66873377026SENSE0.4580.0240.3290.0031.4100.1090.7520.03168577055SENSE0.6360.0260.5120.0164.0000.0000.5360.06152277351SENSE0.1370.0550.1160.0232.5090.0960.7380.00976777115SENSE0.3010.0320.2340.0624.0000.0000.8550.00573477131SENSE0.2510.0810.2110.0051.2400.3940.1690.34076977158SENSE0.2580.1490.2230.1934.0000.0000.5190.04577077159SENSE0.0540.713−0.1010.1073.0280.0890.4810.11252177219SENSE0.1870.2420.2070.1031.7280.2680.4190.02284977226SENSE0.3530.0330.3730.0164.0000.0000.5420.02262177401SENSE0.2640.0450.1740.1963.2880.0440.4040.00385077418SENSE0.2100.0890.1320.2223.0240.0900.4400.027


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference.


If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


F. Cold Shock Tolerance Screen


This example set forth a screen to identify Arabidopsis plants transformed with the genes of interest that are more tolerant to cold stress subjected during day 8 to day 28 after seed planting. During these crucial early stages, seedling growth and leaf area increase were measured to assess tolerance when Arabidopsis seedlings were exposed to low temperatures. Using this screen, genetic alterations can be found that enable plants to germinate and grow better than wild type plants under sudden exposure to low temperatures.


Eleven seedlings from T2 seeds of each transgenic line plus one control line were plated together on a plate containing ½ X Gamborg Salts with 0.8 Phytagel™, 1% Phytagel, and 0.3% Sucrose. Plates were then oriented horizontally and stratified for three days at 4° C. At day three, plates were removed from stratification and exposed to standard conditions (16 hr photoperiod, 22° C. at day and 20° C. at night) until day 8. At day eight, plates were removed from standard conditions and exposed to cold shock conditions (24 hr photoperiod, 8° C. at both day and night) until the final day of the assay, i.e., day 28. Rosette areas were measured at day 8 and day 28, which were analyzed as quantitative responses according to example 1M. Table 8 provides a list of recombinant nucleotides that improve cold shock stress tolerance in plants.

TABLE 8RosetteRosetteRosettePEPareaareaareaSEQat day 8at day 28differenceIDConstructNominationDeltaP-Risk scoreP-DeltaP-NOIDIDOrientationmeanvaluemeanvaluemeanvalue42610919CGPG699ANTI-0.4370.0360.2470.1080.2650.013SENSE42811310CGPG267ANTI-0.2240.5600.4970.2180.8320.055SENSE42911866CGPG1179ANTI-0.3540.0180.4220.0900.4280.073SENSE43011937CGPG959ANTI-−0.5100.004−0.0100.9390.3310.048SENSE77314836CGPG1072SENSE0.8620.0550.1870.3790.1470.61143217410CGPG2446SENSE0.2820.0760.3970.1700.5360.24643318401CGPG1862SENSE0.1340.5970.8010.0080.5820.00943418665CGPG3014SENSE1.2730.0261.1420.0031.2150.00442711142CGPG567SENSE0.9160.0530.2550.2340.0480.79243116218CGPG2158SENSE0.4430.2780.4750.0810.5010.07443519141CGPG1674SENSE0.9010.0020.9350.0000.9560.00043619156CGPG2680SENSE−0.4830.0020.6100.0020.5760.05243719621CGPG3577SENSE0.2310.3881.2650.0581.4860.05943819760CGPG4065SENSE0.7590.0430.8330.0290.9830.02143919857CGPG3929SENSE0.0600.9310.7660.0900.8530.09744070510CGPG2488SENSE0.5910.0100.1460.520−0.0090.97844170605CGPG3012SENSE0.4420.088−0.1670.612−0.6240.10744270607CGPG3162SENSE0.7470.2091.6880.0011.9290.00144370812CGPG607SENSE0.7670.1260.8230.0911.4420.14978170821CGPG345SENSE0.5230.0390.3800.0190.5840.07044470933CGPG4084SENSE0.5060.0711.3520.0091.6600.01244570939CGPG3917SENSE0.3100.0571.4730.0121.7870.01477970674CGPG4492SENSE0.0600.6251.6000.0131.9040.01244671319CGPG4414SENSE0.2920.2990.5160.0890.4940.09044771446CGPG185SENSE0.7400.0400.5080.0340.3040.05344871609CGPG1679SENSE0.3730.4081.1840.0761.3180.07344971649CGPG271SENSE0.6520.0370.6560.1571.0120.06045071814CGPG4434SENSE0.3110.0660.5480.0060.4980.00845171945CGPG2198SENSE−0.1400.2320.6030.0130.7550.01145272005CGPG5253SENSE0.3690.1801.1520.0121.4100.01578272505CGPG4786SENSE0.4870.0210.6320.0860.7140.07745472533CGPG4815SENSE0.0990.8170.2860.0870.1580.07745572637CGPG4859SENSE−0.2310.4610.6410.0340.6980.02545672782CGPG1589SENSE1.0870.0130.6550.0500.2740.20845772794CGPG1826SENSE0.3750.1500.6480.0760.5280.07445872919CGPG3899SENSE0.6280.039−0.1510.385−0.3460.15545972983CGPG5610SENSE0.7600.006−0.0770.605−0.4530.19546373338CGPG4862SENSE0.4730.3010.6000.0530.8330.03946473344CGPG4910SENSE0.3500.0740.3940.1700.2800.45346673644CGPG1756SENSE0.6370.0500.8930.0690.9680.00746773662CGPG4927SENSE0.4020.1331.5240.0051.7910.00246873672CGPG5106SENSE0.9430.0120.9010.0380.9910.03345372026CGPG5231SENSE0.4470.1040.6990.1240.7050.08446973733CGPG5167SENSE0.3080.3360.2950.0150.1970.10846073025CGPG5665SENSE−0.1390.2440.4410.0120.5850.00446273092CGPG5695SENSE0.1830.3961.0110.0350.9860.06946573536CGPG6541SENSE0.7670.0430.7310.0240.8260.02447374205CGPG5089SENSE0.4430.1980.8240.0780.7280.15947474321CGPG5870SENSE0.1320.2570.7150.0020.7890.00047574337CGPG5888SENSE1.1760.0100.9810.0030.9870.00147674388CGPG1461SENSE0.4890.1160.4800.0920.2970.28779274414CGPG6664SENSE1.1180.0370.5570.0210.3880.04047974522CGPG82SENSE0.9280.0131.3820.0181.3270.01948074526CGPG6761SENSE0.3150.4220.6790.0200.6040.01348174576CGPG6781SENSE0.6200.012−0.1640.143−0.4100.03246173050CGPG5697SENSE0.6460.2121.2840.0231.7510.01048575202CGPG1743SENSE0.4130.0120.4220.0070.2920.02648675220CGPG5434SENSE0.7910.0370.7780.0150.7870.02048775226CGPG5824SENSE0.7880.0070.5950.1190.6190.17348875231CGPG5879SENSE0.9760.0170.6630.0040.4960.02948975239CGPG5949SENSE1.0480.0110.5700.2420.5470.29049075258CGPG6096SENSE0.8080.0470.4350.1350.5850.02549175270CGPG6218SENSE0.5260.0110.7460.1410.8660.19349275271CGPG6226SENSE0.0920.6980.9630.0830.9820.12847073846CGPG1924SENSE0.8070.0320.1750.6230.0140.97347173863CGPG5201SENSE0.2850.4040.7400.0210.7970.01647274059CGPG1884SENSE0.3640.0140.1620.137−0.1310.42447774412CGPG6743SENSE0.1630.5220.6700.0260.6260.13147874445CGPG6722SENSE0.8050.0650.3610.1640.5840.14548274707CGPG4914SENSE0.5840.0050.4530.2480.2390.41548374743CGPG5840SENSE0.1870.2450.8430.0400.9760.04548474755CGPG5980SENSE0.9590.0830.6800.2050.7680.20649375355CGPG7518SENSE0.2680.0060.7150.0530.7990.03849475460CGPG7654SENSE0.2140.1200.6490.0530.7130.09479575867CGPG6965SENSE0.8170.0400.7070.0580.5880.12849775927CGPG8229SENSE0.3600.1630.8080.0170.9550.00749875962CGPG8224SENSE−0.3370.1670.3980.0610.7060.03049975984CGPG1927SENSE1.1410.0500.2200.0730.0760.20250076119CGPG5962SENSE0.2060.4451.0560.0111.3640.01550176209CGPG5375SENSE0.5980.0780.7370.1120.8340.07549675907CGPG8259SENSE0.7300.0020.3760.0230.4940.06549575847CGPG6875SENSE0.5680.1290.9270.0251.0620.03350276323CGPG8949SENSE0.5920.0420.7360.0490.8440.04850376346CGPG8943SENSE0.1500.6470.5880.0610.6560.04450476352CGPG8896SENSE0.4640.5511.1240.1091.3750.04950576360CGPG8960SENSE0.4420.2740.6460.0720.8300.03850776575CGPG7260SENSE0.4440.0300.8130.0231.1230.05050676512CGPG5891SENSE−0.0600.6690.4860.0050.7990.01450876733CGPG2647SENSE0.3040.3121.0280.0190.9760.03450976741CGPG6995SENSE1.0070.0400.3440.0080.3020.03351276902CGPG9083SENSE0.3430.0970.5510.0340.4960.08251376913CGPG9076SENSE0.5240.0210.2570.5300.1490.71951476917CGPG9108SENSE0.7280.0630.8040.0150.7790.03651576929CGPG9109SENSE0.3350.2200.6280.0310.8440.02751076845CGPG9046SENSE0.1230.7510.8770.0811.2320.02751176857CGPG9047SENSE1.0550.0531.1700.0231.2860.02051677009CGPG5933SENSE0.6040.0510.8530.1040.8640.12051777022CGPG6335SENSE0.3010.3560.4440.0240.5540.00152277351CGPG8087SENSE0.7490.0231.0230.0121.0310.03251877108CGPG9174SENSE0.6510.0250.7880.0200.7230.13351977125CGPG9120SENSE0.9450.0491.6410.0131.7940.01152077135CGPG9200SENSE0.6480.2531.2990.0191.4860.03452177219CGPG9263SENSE0.6360.0050.3000.3550.7630.003


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference (p value, of the delta of a quantitative response or of the risk score of a qualitative response, is the probability that the observed difference between the transgenic plants and the reference occur by chance) If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


G. Cold Germination Tolerance Screen


This example sets forth a screen to identify Arabidopsis plants transformed with the genes of interests are resistant to cold stress based on their rate of development, root growth and chlorophyll accumulation under low temperature conditions. T2 seeds were plated and all seedlings used in the experiment were grown at 8° C. Seeds were first surface disinfested using chlorine gas and then seeded on assay plates containing an aqueous solution of ½ X Gamborg's B/5 Basal Salt Mixture (Sigma/Aldrich Corp., St. Louis, Mo., USA G/5788), 1% Phytagel™ (Sigma-Aldrich, P-8169), and 10 ug/ml glufosinate with the final pH adjusted to 5.8 using KOH. Test plates were held vertically for 28 days at a constant temperature of 8° C., a photoperiod of 16 hr, and average light intensity of approximately 100 umol/m2/s. At 28 days post plating, root length was measured, growth stage was observed, the visual color was assessed, and a whole plate photograph was taken.


The root length at day 28 was analyzed as a quantitative response according to example 1M. The growth stage at day 7 was analyzed as a qualitative response according to example 1L. Table 9 provides a list of recombinant DNA constructs that improve cold stress tolerance in transgenic plants.

TABLE 9Root lengthGrowth stagePEPat day 28at day 28SEQConstructDeltaDeltaIDIDOrientationmeanP-valuemeanP-value52314810SENSE0.4290.0282.8220.04152414835ANTI-SENSE0.3070.1142.6580.05852515419ANTI-SENSE//3.3330.03852616223SENSE0.3210.0624.0000.00052717903SENSE//2.6670.05752818279SENSE0.1000.6223.3900.03152918284SENSE0.3430.1431.5330.09753018332SENSE0.2860.0322.5790.02653118349SENSE0.1530.0131.6940.02653770125SENSE0.5500.0202.9870.03443217410SENSE0.3460.0274.0000.00053218623SENSE0.2530.0482.1440.01153319551SENSE0.1090.0550.5110.47953419627SENSE0.2340.0070.9180.24043819760SENSE0.2750.0782.2490.14153519785SENSE0.1750.3013.1030.07453619944SENSE0.1070.2143.4230.02753870504SENSE−0.2040.1222.8930.03544170605SENSE−0.1350.3573.3220.03953970830SENSE0.0850.3033.3960.03044470933SENSE0.1910.0843.3710.03354070943SENSE0.1810.0112.9580.03254171711ANTI-SENSE0.0860.4923.3450.03654271962SENSE0.2510.0963.0250.09054372352SENSE0.2670.2003.3580.03554472413SENSE0.3160.0874.0000.00054572603SENSE0.7030.0153.2980.01154672676SENSE0.1640.0972.3600.01254772737SENSE0.1530.1272.9000.03554872742SENSE0.3200.0382.8680.12754972762SENSE0.9040.0152.6470.08755273091SENSE0.3570.0272.8580.03855473340SENSE0.3050.0151.6630.34645372026SENSE0.3650.0273.3590.03555773750SENSE0.3530.0324.0000.00055873751SENSE0.6680.0084.0000.00055173031SENSE0.2650.0054.0000.00055573420SENSE−0.0070.9102.1160.03155673489SENSE0.2660.0693.2970.04356774135SENSE0.0920.1341.5440.01455072993SENSE0.3050.0852.9260.03256273929SENSE0.2600.0184.0000.00056874313SENSE0.1570.0263.2800.04556974315SENSE0.1580.0124.0000.00079074359SENSE0.3410.0802.0620.10157074427SENSE0.1320.0971.9520.05157374531SENSE0.2420.0184.0000.00057474532SENSE0.2880.0793.3610.03457574538SENSE0.2500.2263.7370.00557674550SENSE0.2450.0434.0000.00057774579SENSE0.2550.0023.3460.03657874644SENSE0.1050.4922.7110.05258474896SENSE0.3810.0364.0000.00058574976SENSE0.1340.0382.3420.29358674991SENSE−0.1320.1171.7390.09048875231SENSE0.0420.0744.0000.00058775289SENSE0.1730.0534.0000.00055373224SENSE0.2450.0771.4390.01578573292SENSE0.1690.3632.7900.05555973814SENSE0.3050.0751.9760.05856073855SENSE0.6370.0343.2220.01556173856SENSE0.6500.0112.8950.05647173863SENSE0.1320.1343.3850.03257274511SENSE0.2650.0110.6670.42356374063SENSE0.2440.0054.0000.00056474072SENSE0.2450.0052.5360.22556574073SENSE0.3520.0653.0190.09156674075SENSE0.2400.0264.0000.00047874445SENSE0.2230.0732.6180.06457174490SENSE0.1430.0164.0000.00057974703SENSE0.4680.0243.1250.07058074733SENSE0.4870.0202.5040.08858174749SENSE0.5010.0263.4060.02958274752SENSE0.2760.0973.3890.03148474755SENSE0.1360.3922.8210.04558374773SENSE0.3760.0044.0000.00079575867SENSE0.4010.0614.0000.00079375530SENSE−0.0320.7602.3850.09858876003SENSE0.2390.0211.6110.31558976074SENSE0.2690.0332.6960.17459076137SENSE0.3160.0752.6240.08159176220SENSE0.0740.5162.5850.08979675910SENSE0.2250.0704.0000.00059276301SENSE0.1100.3082.6830.05559376318SENSE0.0520.6002.6920.07859476347SENSE0.0880.3623.2540.04959676566SENSE0.1440.0522.4080.11759576513SENSE0.3160.0193.3470.03659776719SENSE0.0990.0634.00059876757SENSE0.2510.0373.0420.08651476917SENSE0.1650.0551.8340.25459976891SENSE0.3820.0700.0001.00060077208SENSE0.4080.0342.8350.051


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


H. Shade Tolerance Screen


Plants undergo a characteristic morphological response in shade that includes the elongation of the petiole, a change in the leaf angle, and a reduction in chlorophyll content. While these changes may confer a competitive advantage to individuals, in a monoculture the shade avoidance response is thought to reduce the overall biomass of the population. Thus, genetic alterations that prevent the shade avoidance response may be associated with higher yields. Genes that favor growth under low light conditions may also promote yield, as inadequate light levels frequently limit yield. This protocol describes a screen to look for Arabidopsis plants that show an attenuated shade avoidance response and/or grow better than control plants under low light intensity. Of particular interest, we were looking for plants that didn't extend their petiole length, had an increase in seedling weight relative to the reference and had leaves that were more close to parallel with the plate surface.


T2 seeds were plated on glufosinate selection plates with ½ MS medium. Seeds were sown on ½ X MS salts, 1% Phytagel, 10 ug/ml BASTA. Plants were grown on vertical plates at a temperature of 22° C. at day, 20° C. at night and under low light (approximately 30 uE/m2/s, far/red ratio (655/665/725/735) ˜0.35 using PLAQ lights with GAM color filter #680). Twenty-three days after seedlings were sown, measurements were recorded including seedling status, number of rosette leaves, status of flower bud, petiole leaf angle, petiole length, and pooled fresh weights. A digital image of the whole plate was taken on the measurement day. Seedling weight and petiole length were analyzed as quantitative responses according to example 1M. The number of rosette leaves, flowering bud formation and leaf angel were analyzed as qualitative responses according to example 1L.


Table 10 provides a list of recombinant DNA constructs that improve shade tolerance in plants

TABLE 10LeafSeedlingPetioleangleweightlengthPEPat dayat dayat daySEQ232323IDConstructRSDeltaDeltaP-NOIDOrientationmeanP-valuemeanP-valuemeanvalue44170605SENSE//0.4240.0610.3520.03863272414SENSE0.0001.0000.3960.0670.2220.04746473344SENSE2.6670.0570.1790.4440.2890.05874473666SENSE//0.8380.0080.1420.44545372026SENSE0.0001.0000.4690.0420.3130.07363473723SENSE1.3330.423−0.1030.677−0.4060.08963573747SENSE2.0000.225−0.6050.002−0.5650.05063373409SENSE2.6670.0570.3650.0520.2830.01063674634SENSE0.6670.423−0.6360.109−0.8690.01761574558SENSE1.3330.1840.4860.0180.2310.10882474956SENSE1.3330.4230.1540.076−0.0520.28948675220SENSE0.0001.000−0.6440.230−0.7160.09948775226SENSE0.0001.000−0.6010.067−1.1420.08648875231SENSE0.6670.4230.4550.0390.2580.07263875232SENSE1.3330.423−1.9310.077−1.9920.03848975239SENSE0.6670.4230.1490.0750.0600.44456073855SENSE3.3330.038−0.2610.121−0.3610.21547274059SENSE0.0001.0000.4090.0660.3310.06248274707SENSE3.3330.0380.2690.0710.1360.01675474753SENSE0.0001.0000.3650.0790.2670.05763774769SENSE1.3330.184−0.1410.346−0.2210.02949375355SENSE1.3330.423−0.7350.069−0.5810.05863975429SENSE2.6670.057−0.4530.151−1.3190.03264075965SENSE0.6670.4230.0920.0860.1090.12264276061SENSE//0.3730.0750.1100.33064376155SENSE0.6670.423−0.4730.021−0.6080.00476176207SENSE2.6670.057−0.3740.060−0.3690.08864175975SENSE2.0000.2250.4050.0560.2230.04476276293SENSE2.6670.184−0.4120.052−0.7170.01864476322SENSE0.6670.423−0.5330.044−0.5590.04564576440SENSE//−0.4050.141−0.5250.04550976741SENSE0.6670.423−0.7930.027−0.8030.01851376913SENSE//0.3130.0730.3360.12964676828SENSE0.6670.423−0.1850.141−1.1890.00151777022SENSE0.0001.000−0.4950.282−0.4750.07964877303SENSE2.0000.225−0.2960.117−0.1760.03864977352SENSE1.3330.423−0.2980.127−0.3770.03663177366SENSE0.6670.4230.6110.0150.3270.04464777136SENSE1.3330.423−0.8240.076−0.5660.089


For “seeding weight” and “leaf angle”, if p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference with p<0.2.


For “petiole length”, if p<0.05 and delta<0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta<0, the transgenic plants showed a trend of trait improvement as compared to the reference.


I. Early Plant Growth and Development Screen


This example sets forth a plate based phenotypic analysis platform for the rapid detection of phenotypes that are evident during the first two weeks of growth. In this screen, we were looking for genes that confer advantages in the processes of germination, seedling vigor, root growth and root morphology under non-stressed growth conditions to plants. The transgenic plants with advantages in seedling growth and development were determined by the seedling weight and root length at day 14 after seed planting.


T2 seeds were plated on glufosinate selection plates and grown under standard conditions (˜100 uE/m2/s, 16 h photoperiod, 22° C. at day, 20° C. at night). Seeds were stratified for 3 days at 4° C. Seedlings were grown vertically (at a temperature of 22° C. at day 20° C. at night). Observations were taken on day 10 and day 14. Both seedling weight and root length at day 14 were analyzed as quantitative responses according to example 1M.


Table 11 provides a list recombinant DNA constructs that improve early plant growth and development.

TABLE 11PEPRoot lengthRoot lengthSeedling weightSEQat day 10at day 14at day 14IDConstructDeltaDeltaDeltaNOIDOrientationmeanP-valuemeanP-valuemeanP-value52314810SENSE0.1750.0270.0490.231−0.0890.33777314836SENSE0.1050.0070.0890.0290.1620.21773514944SENSE0.2050.2070.1750.0460.2590.22152515419ANTI-SENSE0.1820.0190.1250.0480.2120.15552717903SENSE0.4010.0080.1740.0110.3300.00552918284SENSE0.0310.7950.0160.7220.2230.04373718536SENSE0.1580.2220.1040.1180.1490.12273618410SENSE0.2220.2280.0780.6400.1410.08043217410SENSE0.1450.0520.0740.2460.0740.54253218623SENSE0.2470.1000.1330.0740.2280.03643418665SENSE0.2370.0250.1560.0570.2680.03373819540SENSE0.1150.3970.0850.2370.1300.03173919624SENSE0.3290.0570.1710.2690.4940.02474070353SENSE0.1360.0650.0860.3550.1400.17078070804SENSE0.4110.0660.2900.1470.4830.12944370812SENSE0.4380.0690.3510.0430.5710.03644470933SENSE0.0930.0990.0880.0390.0710.06054070943SENSE0.3440.0740.1730.0310.2630.28374171215SENSE0.2820.1070.1420.0100.1080.59444671319SENSE0.2010.0770.0510.4270.2360.02574271320SENSE0.6010.0150.4370.0410.5470.10744771446SENSE0.3080.1820.2070.0210.4650.10746373338SENSE0.3090.0350.1460.0380.4620.01761173619SENSE0.0700.1510.1030.0160.2660.09374473666SENSE0.3040.0360.1660.0140.4800.01046973733SENSE0.1560.148−0.0870.5110.1560.03661273737SENSE0.3520.0050.2440.0200.0980.09355773750SENSE0.1560.0330.1110.0470.2160.04046573536SENSE0.4610.0070.2610.0060.3300.02455072993SENSE0.2540.0340.1520.2280.2710.07374774211SENSE0.1550.0840.0680.1590.3680.03456974315SENSE0.3280.0070.1490.0200.5130.05574874510SENSE0.2770.0030.1580.0160.4330.02974974527SENSE0.4460.0250.2560.0220.4120.05275074539SENSE0.1030.0240.0630.1960.2490.01357674550SENSE0.2160.0760.2280.0350.4240.04848174576SENSE0.2700.0170.0160.8230.2160.23575274577SENSE0.3330.0320.2240.0230.2850.09474373076SENSE0.1290.4590.0630.5130.2440.05575674908SENSE0.2370.0060.1840.0140.2530.09475174556SENSE0.1470.1110.0760.0250.3770.03548975239SENSE0.1800.1450.0850.0190.2360.16275875254SENSE0.1440.2330.0310.5020.3630.05455373224SENSE0.3840.0350.2570.0220.6160.02974573852SENSE0.0290.7600.0310.6170.0310.01957274511SENSE0.3450.0030.1070.0690.4870.04356574073SENSE−0.0030.969−0.0230.6460.2250.09674674080SENSE0.2520.0310.1020.1520.2960.07475374729SENSE0.0650.082−0.0310.6420.0800.06858174749SENSE0.1600.1610.1040.1200.2430.32275474753SENSE0.1360.1270.0610.2240.4220.00275574754SENSE0.2160.0040.1330.0400.1880.05548474755SENSE0.3970.0710.2430.0840.4990.03775775107SENSE0.3520.0140.2120.0440.3420.08675975741SENSE0.1600.0980.0620.3990.1230.32076075931SENSE0.1620.0370.1340.0120.2250.06179776101SENSE0.1420.4450.1090.3610.3720.05176176207SENSE0.3970.0150.2350.1000.1510.14976276293SENSE0.1600.1720.1780.0420.4910.00049575847SENSE0.2620.0030.0050.9280.2460.15976376517SENSE0.2200.0800.1620.0210.2100.01776476559SENSE0.2830.1300.2110.1230.2660.09376576804SENSE0.2190.0610.1300.0660.4190.00376677018SENSE0.3380.0020.1370.0950.4690.05952277351SENSE0.0520.5660.0600.2900.3430.09151877108SENSE0.0000.9980.0950.0800.0670.08576777115SENSE0.3390.1020.1730.0440.7880.05676877128SENSE0.3600.0750.1740.1700.3040.12576977158SENSE0.2900.0520.0330.4840.3650.01077077159SENSE0.3030.0190.1770.1660.4080.03377177163SENSE0.4970.0070.4340.0021.0470.03860077208SENSE0.2060.0110.1030.0770.3880.009


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


J. Late Plant Growth and Development Screen


This example sets forth a soil based phenotypic platform to identify genes that confer advantages in the processes of leaf development, flowering production and seed maturity to plants.



Arabidopsis plants were grown on a commercial potting mixture (Metro Mix 360, Scotts Co., Marysville, Ohio) consisting of 30-40% medium grade horticultural vermiculite, 35-55% sphagnum peat moss, 10-20% processed bark ash, 1-15% pine bark and a starter nutrient charge. Soil was supplemented with Osmocote time-release fertilizer at a rate of 30 mg/ft3. T2 seeds were imbibed in 1% agarose solution for 3 days at 4° C. and then sown at a density of ˜5 per 2 ½″ pot. Thirty-two pots were ordered in a 4 by 8 grid in standard greenhouse flat. Plants were grown in environmentally controlled rooms under a 16 h day length with an average light intensity of ˜200 μmoles/m2/s. Day and night temperature set points were 22° C. and 20° C., respectively. Humidity was maintained at 65%. Plants were watered by sub-irrigation every two days on average until mid-flowering, at which point the plants were watered daily until flowering was complete.


Application of the herbicide glufosinate was performed to select T2 individuals containing the target transgene. A single application of glufosinate was applied when the first true leaves were visible. Each pot was thinned to leave a single glufosinate-resistant seedling ˜3 days after the selection was applied.


The rosette radius was measured at day 25. The silique length was measured at day 40. The plant parts were harvested at day 49 for dry weight measurements if flowering production was stopped. Otherwise, the dry weights of rosette and silique were carried out at day 53. The seeds were harvested at day 58. All measurements were analyzed as quantitative responses according to example 1M.


Table 12 provides a list of recombinant DNA constructs that improve late plant growth and development.

TABLE 12Rosette dryRosetteSeed net drySilique drySiliquePEPweightradiusweightweightlengthSEQat day 53at day 25at day 62at day 53at day 40IDDeltaP-DeltaP-DeltaP-DeltaP-DeltaP-NOOrientationmeanvaluemeanvaluemeanvaluemeanvaluemeanvalue772ANTI-0.0720.189//0.8460.0020.1430.381−0.0120.803SENSE773SENSE0.0940.592−0.2380.0980.6650.0590.2470.474−0.0290.184775SENSE0.1340.019//0.6750.0610.5400.0100.0930.008774SENSE−0.2620.112−0.4070.032−0.3930.1200.3570.0810.0030.922776SENSE−1.0270.044//2.2160.0040.3510.3550.1350.001780SENSE0.3100.048//−0.2450.6290.1490.1570.1490.043781SENSE0.2190.069//0.7790.001−0.1740.108−0.0150.595777SENSE0.5260.045//−0.3880.4780.1310.281−0.3510.036778SENSE0.4020.004−0.1580.126−2.1590.036−0.4100.016−0.1950.043779SENSE0.4540.006−0.1240.234−0.1120.3280.2280.0700.0650.024782SENSE0.1950.205−0.1530.0190.3520.0050.4880.031−0.0630.473783SENSE−0.3880.381//1.3390.002−1.1470.1260.6010.000786SENSE−0.1070.2800.0910.1480.5630.0580.1660.1650.0420.084787SENSE−0.2900.138−0.7520.0200.5010.044−0.1140.2370.0480.188788SENSE−0.0640.709−0.0600.3020.3460.0270.1980.0820.0620.115789SENSE−0.0600.585−0.2040.0760.9300.004−0.2430.1090.0940.016790SENSE−0.2990.0760.2080.1810.6830.0080.1400.652−0.0470.070792SENSE0.4700.039//0.1130.691−0.1180.511−0.0030.921791SENSE−0.0370.754//0.4840.084−0.1630.3660.0240.328784SENSE−0.1820.070−0.0790.2430.6490.0750.0700.659−0.1500.112785SENSE−0.0570.216−0.0860.0161.6450.0080.2780.0360.0150.364795SENSE0.6200.013//0.6980.010−0.1020.7940.0400.247793SENSE0.4920.045−0.2320.029−0.5050.0300.0250.881794SENSE−0.0510.809//0.8090.003−0.0540.649−0.0200.786797SENSE0.1210.107//0.9150.017−0.2870.0540.0540.120796SENSE0.3240.017//−0.2890.4680.3310.0120.0400.476798SENSE0.4110.038//0.8090.0180.2310.286−0.0480.279799SENSE−0.0140.720//0.3370.064−0.1120.1650.0180.581


If p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference.


K. Limited Nitrogen Tolerance Screen


Under low nitrogen conditions, Arabidopsis seedlings become chlorotic and have less biomass. This example sets forth the limited nitrogen tolerance screen to identify Arabidopsis plants transformed with the gene of interest that are altered in their ability to accumulate biomass and/or retain chlorophyll under low nitrogen condition.


T2 seeds were plated on glufosinate selection plates containing 0.5×N-Free Hoagland's T 0.1 mM NH4NO3 T 0.1% sucrose T 1% phytagel media and grown under standard light and temperature conditions. At 12 days of growth, plants were scored for seedling status (i.e., viable or non-viable) and root length. After 21 days of growth, plants were scored for BASTA resistance, visual color, seedling weight, number of green leaves, number of rosette leaves, root length and formation of flowering buds. A photograph of each plant was also taken at this time point.


The seedling weight and root length were analyzed as quantitative responses according to example 1M. The number green leaves, the number of rosette leaves and the flowerbud formation were analyzed as qualitative responses according to example 1L. The leaf color raw data were collected on each plant as the percentages of five color elements (Green, DarkGreen, LightGreen, RedPurple, YellowChlorotic) using a computer imaging system. A statistical logistic regression model was developed to predict an overall value based on five colors for each plant.


Table 13 provides a list of recombinant DNA constructs that improve low nitrogen availability tolerance in plants.

TABLE 13PEPRootLeafRosetteSEQlengthcolorWeightIDConstructDeltaP-Risk scoreP-DeltaP-NOIDOrientationmeanvaluemeanvaluemeanvalue65010913ANTI-SENSE−0.3220.0720.5590.234−0.0520.20965110925ANTI-SENSE−0.4070.0475.8280.342−0.0640.41665211357SENSE−0.4360.0160.9960.047−0.0930.23365311358SENSE−0.5210.0001.2970.012−0.0030.94565411432ANTI-SENSE−0.2150.0490.6690.0460.0090.89965511927ANTI-SENSE−0.4160.0280.5810.0210.0280.29343011937ANTI-SENSE−0.3020.0760.5640.039−0.1020.06765612050SENSE−0.1880.0031.3230.032−0.0310.24565712358ANTI-SENSE−0.1790.0500.2780.0650.0240.81865813809SENSE−0.1260.0240.7960.303−0.0990.07360314733SENSE−0.1610.1312.2670.085−0.0550.67652515419ANTI-SENSE−0.0680.1191.6070.0510.0440.45165916877SENSE−0.2070.1582.5160.0570.0120.84366018117ANTI-SENSE−0.2210.0040.9060.004−0.1460.06780570819SENSE−0.2580.0572.2750.088−0.1030.23177770639SENSE−0.2760.071−1.1840.0990.1320.05277870668SENSE−0.1260.010−0.5210.1560.0670.52066171651SENSE−0.0370.5990.4850.042−0.0200.68266272449SENSE−0.0190.645−0.7380.0430.3240.02445472533SENSE−0.0110.8661.1310.0170.0680.23766372649SENSE−0.1540.3090.1810.093−0.0460.35061072978SENSE−0.3040.1840.3670.1250.1440.00846473344SENSE−0.0680.002−0.5470.4000.0400.61274473666SENSE0.2390.137−2.1920.0890.1250.06463573747SENSE−0.0310.5680.6710.0100.0020.97466473107SENSE−0.2360.0390.1960.0570.0500.55456774135SENSE0.1730.081−2.9880.0610.1170.09063674634SENSE−1.0490.0190.8050.083−0.0840.03457874644SENSE−0.2240.0270.2470.353−0.0540.25666574875SENSE−0.2970.2011.0840.0890.2010.10666675079SENSE−0.3030.2770.5830.071−0.1220.32948575202SENSE−0.2770.0930.4630.049−0.0200.57066875228SENSE−0.8380.0361.2390.033−0.0910.50366975246SENSE−1.1920.0091.3140.039−0.1360.20580973223SENSE0.0600.764−0.2490.4420.0660.08478473283SENSE0.2000.242−2.8190.0460.1980.05163774769SENSE−0.1350.0900.6420.0240.1020.06466775180SENSE−0.2660.0310.6020.0210.0060.28167075375SENSE−0.6470.0121.6180.001−0.1560.04367175476SENSE−0.5630.0111.1010.003−0.0910.16267375857SENSE−0.2560.0672.4720.006−0.1250.04167275502SENSE0.1010.209−0.1470.4820.1560.06667575970SENSE−0.1250.1910.9630.0360.0460.39167676115SENSE−0.4990.1130.6190.0920.0180.75283376121SENSE−0.7070.0121.6950.007−0.1990.04964376155SENSE−0.0590.4540.7790.022−0.0330.74267776217SENSE−0.3120.1200.7500.001−0.0100.86467876287SENSE−0.5040.0351.3260.003−0.1470.01867976294SENSE−0.4920.0021.0960.006−0.1340.01376276293SENSE0.1020.330−2.1840.0080.3580.04967475873SENSE−0.1910.0713.0700.0930.1090.19768076305SENSE−0.6430.0012.3190.000−0.1100.01864476322SENSE−0.5680.0191.2590.091−0.2050.01983776388SENSE0.0780.155−0.0320.8620.1300.01768176391SENSE−0.2590.1141.0430.0460.0360.80679876431SENSE0.1430.134−0.3830.5540.1960.03468276438SENSE−0.1830.0290.7900.057−0.0620.29168376452SENSE−0.2840.1120.9280.017−0.0020.96968476563SENSE−0.0320.100−0.0470.203−0.0950.15383976567SENSE0.2280.057−0.4030.4180.1800.02464676828SENSE−0.1630.0510.6660.0180.1560.07868577055SENSE−0.2610.1800.9260.0790.0540.51668677059SENSE−0.3800.0051.1880.021−0.1380.01268777062SENSE−0.4940.0201.2840.0330.0100.75968877063SENSE−0.2900.0180.8140.019−0.0820.11669177314SENSE−0.5000.0460.3690.008−0.1850.02969277344SENSE−0.1630.0290.1990.0160.0670.35069377348SENSE−0.2180.0030.3130.0000.0020.96084677112SENSE−0.0250.839−0.0570.9190.1420.05968977218SENSE−0.0720.6870.6790.0080.0540.33652177219SENSE0.1830.0710.1340.2900.3190.03869077256SENSE−0.3390.0150.2160.205−0.0580.302


For leaf color and rosette weight, if p<0.05 and delta or risk score mean>0, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2 and delta or risk score mean>0, the transgenic plants showed a trend of trait improvement as compared to the reference with p<0.2. For root length, if p<0.05, the transgenic plants showed statistically significant trait improvement as compared to the reference. If p<0.2, the transgenic plants showed a trend of trait improvement as compared to the reference.


L. Statistic Analysis for Qualitative Responses


Table 14 provides a list of responses that were analyzed as qualitative responses

TABLE 14responsescreencategories (success vs. failure)wilting response RiskSoil drought tolerance screennon-wilted vs. wiltedScoregrowth stage at day 14heat stress tolerance screen50% of plants reach stage1.03 vs. notgrowth stage at day 14salt stress tolerance screen50% of plants reach stage1.03 vs. notgrowth stage at day 14PEG induced osmotic stress tolerance50% of plants reach stage1.03 vs. notscreengrowth stage at day 7cold germination tolerance screen50% of plants reach stage 0.5 vs. notnumber of rosette leavesShade tolerance screen5 leaves appeared vs. notat day 23flower bud formation atShade tolerance screenflower buds appear vs. notday 23leaf angle at day 23Shade tolerance screen>60 degree vs. <60 degreenumber of green leaves atlimited nitrogen tolerance screen6 or 7 leaves appeared vs. notday 21number of rosette leaveslimited nitrogen tolerance screen6 or 7 leaves appeared vs. notat day 21Flower bud formation atlimited nitrogen tolerance screenflower buds appear vs. notday 21


Plants were grouped into transgenic and reference groups and were scored as success or failure according to Table 14. First, the risk (R) was calculated, which is the proportion of plants that were scored as of failure plants within the group. Then the relative risk (RR) was calculated as the ratio of R (transgenic) to R (reference). Risk score (RS) was calculated as −log2RR. Subsequently the risk scores from multiple events for each transgene of interest were evaluated for statistical significance by t-test using SAS statistical software (SAS 9, SAS/STAT User's Guide, SAS Institute Inc, Cary, N.C., USA). RS with a value greater than 0 indicates that the transgenic plants perform better than the reference. RS with a value less than 0 indicates that the transgenic plants perform worse than the reference. The RS with a value equal to 0 indicates that the performance of the transgenic plants and the reference don't show any difference.


M. Statistic Analysis for Quantitative Responses


Table 15 provides a list of responses that were analyzed as quantitative responses.

TABLE 15responsescreenseed yieldSoil drought stress tolerance screenseedling weight at day 14heat stress tolerance screenroot length at day 14heat stress tolerance screenseedling weight at day 14salt stress tolerance screenroot length at day 14salt stress tolerance screenroot length at day 11salt stress tolerance screenseedling weight at day 14PEG induced osmotic stress tolerance screenroot length at day 11PEG induced osmotic stress tolerance screenroot length at day 14PEG induced osmotic stress tolerance screenrosette area at day 8cold shock tolerance screenrosette area at day28cold shock tolerance screendifference in rosette areacold shock tolerance screenfrom day 8 to day 28root length at day 28cold germination tolerance screenseedling weight at day 23Shade tolerance screenpetiole length at day 23Shade tolerance screenroot length at day 14Early plant growth and development screenSeedling weight at day 14Early plant growth and development screenRosette dry weight at dayLate plant growth and development screen53rosette radius at day 25Late plant growth and development screenseed dry weight at day 58Late plant growth and development screensilique dry weight at day 53Late plant growth and development screensilique length at day 40Late plant growth and development screenSeedling weight at day 21Limited nitrogen tolerance screenRoot length at day 21Limited nitrogen tolerance screen


The measurements (M) of each plant were transformed by log2 calculation. The Delta was calculated as log2M(transgenic)−log2M(reference). Subsequently the mean delta from multiple events of the transgene of interest was evaluated for statistical significance by t-test using SAS statistical software (SAS 9, SAS/STAT User's Guide, SAS Institute Inc, Cary, N.C., USA). The Delta with a value greater than 0 indicates that the transgenic plants perform better than the reference. The Delta with a value less than 0 indicates that the transgenic plants perform worse than the reference. The Delta with a value equal to 0 indicates that the performance of the transgenic plants and the reference don't show any difference.


Example 2
Identification of Homologs

A BLAST searchable “All Protein Database” is constructed of known protein sequences using a proprietary sequence database and the National Center for Biotechnology Information (NCBI) non-redundant amino acid database (nr.aa). For each organism from which a DNA sequence provided herein was obtained, an “Organism Protein Database” is constructed of known protein sequences of the organism; the Organism Protein Database is a subset of the All Protein Database based on the NCBI taxonomy ID for the organism.


The All Protein Database is queried using amino acid sequence of cognate protein for gene DNA used in trait-improving recombinant DNA, i.e., sequences of SEQ ID NO: 426 through SEQ ID NO: 850 using “blastp” with E-value cutoff of 1e-8. Up to 1000 top hits were kept, and separated by organism names. For each organism other than that of the query sequence, a list is kept for hits from the query organism itself with a more significant E-value than the best hit of the organism. The list contains likely duplicated genes, and is referred to as the Core List. Another list was kept for all the hits from each organism, sorted by E-value, and referred to as the Hit List.


The Organism Protein Database is queried using amino acid sequences of SEQ ID NO: 426 through SEQ ID NO: 850 using “blastp” with E-value cutoff of 1e-4. Up to 1000 top hits are kept. A BLAST searchable database is constructed based on these hits, and is referred to as “SubDB”. SubDB was queried with each sequence in the Hit List using “blastp” with E-value cutoff of 1e-8. The hit with the best E-value is compared with the Core List from the corresponding organism. The hit is deemed a likely ortholog if it belongs to the Core List, otherwise it is deemed not a likely ortholog and there is no further search of sequences in the Hit List for the same organism. Likely orthologs from a large number of distinct organisms were identified and are reported by amino acid sequences of SEQ ID NO: 851 to SEQ ID NO: 33634. These orthologs are reported in Tables 2 as homologs to the proteins cognate to genes used in trait-improving recombinant DNA.


Example 3
Consensus Sequence Build

ClustalW program is selected for multiple sequence alignments of an amino acid sequence of SEQ ID NO: 426 and its homologs, through SEQ ID NO: 850 and its homologs. Three major factors affecting the sequence alignments dramatically are (1) protein weight matrices; (2) gap open penalty; (3) gap extension penalty. Protein weight matrices available for ClustalW program include Blosum, Pam and Gonnet series. Those parameters with gap open penalty and gap extension penalty were extensively tested. On the basis of the test results, Blosum weight matrix, gap open penalty of 10 and gap extension penalty of 1 were chosen for multiple sequence alignment. The consensus sequence of SEQ ID NO: 601 and its 13 homologs was derived according to the procedure described above and is displayed in FIG. 1.


Example 4

This example illustrates the identification of amino acid domain by Pfam analysis.


The amino acid sequence of the expressed proteins that were shown to be associated with an enhanced trait were analyzed for Pfam protein family against the current Pfam collection of multiple sequence alignments and hidden Markov models using the HMMER software in the appended computer listing. The Pfam protein families for the proteins of SEQ ID NO: 425 through 850 are shown in Table 16. The Hidden Markov model databases for the identified patent families are also in the appended computer listing allowing identification of other homologous proteins and their cognate encoding DNA to enable the full breadth of the invention for a person of ordinary skill in the art. Certain proteins are identified by a single Pfam domain and others by multiple Pfam domains. For instance, the protein with amino acids of SEQ ID NO: 488 is characterized by two Pfam domains, i.e. “NAF” and “Pyr_redox2”. See also the protein with amino acids of SEQ ID NO: 441 which is characterized by three copies of the Pfam domain “zf-CCCH”. In Table 16 “score” is the gathering score for the Hidden Markov Model of the domain which exceeds the gathering cutoff reported in Table 17.

TABLE 16PEP SEQID NOGENE IDPfam domain namebeginstopscoreE-value426CGPG699Histone2710099.68.80E−27427CGPG567MIF211569.41.00E−17428CGPG267WD40468328.32.40E−05428CGPG267WD4013617336.39.60E−08430CGPG959NPH3209444429.93.20E−126431CGPG2158LSM149275.71.30E−19432CGPG2446NUDIX6321094.92.20E−25433CGPG1862Bromodomain42151097.44.00E−26435CGPG1674efhand30333123.10.00089435CGPG1674Na_Ca_ex44157558.51.90E−14436CGPG2680Linker_histone23931013.30E−27437CGPG3577Glyoxalase1313245.22.00E−10438CGPG4065LRR_215017417.30.051439CGPG3929Lung_7-TM_R134419130.25.20E−36441CGPG3012zf-CCCH30577.70.12441CGPG3012zf-CCCH61857.30.13441CGPG3012zf-CCCH11514019.30.0034442CGPG3162SBF128313239.75.60E−69443CGPG607PurA2827544.45.80E−12444CGPG4084PCl2513551021.60E−27445CGPG3917Pkinase13268341.61.20E−99445CGPG3917NAF307367124.72.30E−34446CGPG4414p45032502364.51.50E−106447CGPG185Cyclin_N62187117.73.00E−32447CGPG185Cyclin_C18931283.37.00E−22448CGPG1679Ferric_reduct183304126.76.00E−35448CGPG1679FAD_binding_8334435163.64.70E−46448CGPG1679FAD_binding_6336435−7.50.0026448CGPG1679NAD_binding_6441709348.59.90E−102449CGPG271PP2C269629561.10E−13450CGPG4434p45030490347.12.70E−101452CGPG5253SBP56204871316.40453CGPG5231Pkinase85343344.41.70E−100453CGPG5231efhand39041835.61.50E−07453CGPG5231efhand42645431.43.00E−06453CGPG5231efhand46249028.32.50E−05453CGPG5231efhand49652439.31.20E−08455CGPG4859PSI_PsaF47221443.92.00E−130456CGPG1589PLAC829239087.44.00E−23458CGPG3899Pkinase132417294.12.40E−85460CGPG5665Aminotran_327362550.22.00E−162460CGPG5665Aminotran_1_242417−49.45.40E−05461CGPG5697Aminotran_325360503.22.70E−148461CGPG5697Aminotran_1_240415−51.67.00E−05462CGPG5695Pyr_redox14824010.40.0019462CGPG5695Pyr_redox_214845071.62.20E−18463CGPG4862Glyco_hydro_1405206119.40E−181465CGPG6541PGK88479732.82.10E−217466CGPG1756NOP5NT267119.58.70E−33466CGPG1756NOSIC160212127.92.70E−35466CGPG1756Nop252400332.47.00E−97467CGPG4927WD4029433132.91.00E−06467CGPG4927WD4033637324.70.0003467CGPG4927WD4037842822.10.0018468CGPG5106C2179691.72.10E−24469CGPG5167p45043501265.69.30E−77470CGPG1924F-box216839.69.80E−09471CGPG5201Methyltransf_1111623186.86.20E−23471CGPG5201Methyltransf_1211622931.33.10E−06472CGPG1884Pkinase_Tyr88365137.72.90E−38472CGPG1884Pkinase88368142.51.00E−39473CGPG5089LRR_114216417.70.037473CGPG5089LRR_116618888.2473CGPG5089LRR_11902129.34.7473CGPG5089LRR_121423310.72.6473CGPG5089LRR_12382579.44.5473CGPG5089Pkinase402658−21.17.60E−07474CGPG5870Pkinase4731477.24.60E−20474CGPG5870Pkinase_Tyr4831467.58.20E−20475CGPG5888Pkinase12270119.67.90E−33476CGPG1461PGAM79265151.42.10E−42477CGPG6743PGK88483573.51.80E−169478CGPG6722Gln-synt_C208468370.22.90E−108479CGPG82Pkinase43329324.81.40E−94480CGPG6761Pyr_redox15524723.40.00017480CGPG6761Pyr_redox_215546891.91.80E−24481CGPG6781Aminotran_328350619.82.20E−183482CGPG4914WD40281318313.80E−06483CGPG5840Pkinase5274146.18.30E−41484CGPG5980Pkinase9278134.23.20E−37485CGPG1743mTERF88391536.82.00E−158486CGPG5434MtN3_slv99876.76.50E−20486CGPG5434MtN3_slv13220656.58.10E−14487CGPG5824Pkinase121407304.32.00E−88488CGPG5879Pkinase743283497.00E−102488CGPG5879NAF38344195.51.50E−25489CGPG5949Pyr_redox_26302154.82.00E−43489CGPG5949Pyr_redox164259975.30E−26490CGPG6096GSHPx79187229.18.70E−66491CGPG6218MFS_12744995.12.00E−25491CGPG6218Sugar_tr30488528.18.40E−156492CGPG6226Sugar_tr101556315.49.20E−92492CGPG6226MFS_110551581.22.90E−21494CGPG7654HABP4_PAI-RBP1159272149.96.00E−42495CGPG6875zf-C3HC420223925.30.00019496CGPG8259TIM4245454.31.40E−133498CGPG8224NIR_SIR_ferr6613349.78.80E−12498CGPG8224NIR_SIR166347199.38.30E−57498CGPG8224NIR_SIR_ferr36243469.78.70E−18498CGPG8224NIR_SIR4435910.40.0014499CGPG1927F-box388538.81.70E−08499CGPG1927Arm418458275.90E−05499CGPG1927Arm45949934.82.70E−07499CGPG1927Arm50054339.21.30E−08499CGPG1927Arm54458537.44.50E−08499CGPG1927Arm58963045.71.40E−10499CGPG1927Arm63167428.52.20E−05499CGPG1927Arm67571545.81.30E−10499CGPG1927Arm716757250.00025500CGPG5962Glyco_hydro_1495523269.85.00E−78501CGPG5375IMPDH18493696.22.10E−206503CGPG8943MGS123238189.95.50E−54503CGPG8943AICARFT_IMPCHas243568635.44.30E−188504CGPG8896Ferric_reduct122279184.52.40E−52505CGPG8960DnaJ7713986.95.60E−23505CGPG8960Fer416218512.60.0035506CGPG5891Pkinase56307−19.66.30E−07507CGPG7260ATP-grasp_25204−43.85.10E−08508CGPG2647Fibrillarin79310599.42.90E−177509CGPG6995F-box237130.26.70E−06510CGPG9046Pro_isomerase3619557.44.40E−14511CGPG9047Pec_lyase_C108274124.52.80E−34513CGPG9076Auxin_inducible710626.35.50E−08515CGPG9109CS199391.22.80E−24516CGPG5933Nodulin-like16263497.51.40E−146517CGPG6335Sterol_desat35246238.81.10E−68518CGPG9174YjeF_N28205146.47.10E−41518CGPG9174Carb_kinase268524267.52.50E−77519CGPG9120Glyoxal_oxid_N114355530.22.10E−156522CGPG8087DUF167743159228.81.10E−65523CGPG385RRM_1137671.42.70E−18523CGPG385zf-CCHC9911634.14.40E−07523CGPG385zf-CCHC121138281.90E−05524CGPG1857FHA3210745.81.30E−10525CGPG1788PB110019288.22.30E−23526CGPG1966PHD6611457.63.70E−14526CGPG1966SET238373991.30E−26527CGPG1908F-box1259313.80E−06527CGPG1908LRR_217119724.50.00035529CGPG3557DnaJ467139.67.80E−39529CGPG3557DnaJ_C21433653.75.70E−13530CGPG3340Dehydrin21186188.81.20E−53531CGPG3431DUF9143330829.12.20E−246532CGPG3530DUF164429181359.35.80E−105533CGPG3119DUF231262434243.63.80E−70534CGPG3594PRA-CH81155109.11.20E−29534CGPG3594PRA-PH176269761.10E−19535CGPG4031Pkinase_Tyr73355134.82.20E−37535CGPG4031Pkinase73355173.84.10E−49536CGPG4036Aa_trans31467546.72.20E−161537CGPG2384Epimerase326618.31.80E−07537CGPG23843Beta_HSD4292−95.65.50E−06538CGPG1598Hist_deacetyl149461399.35.10E−117539CGPG560zf-CCCH436933.85.50E−07539CGPG560zf-CCCH88114421.80E−09539CGPG560zf-CCCH13416043.18.60E−10539CGPG560zf-CCCH24427044.72.90E−10539CGPG560zf-CCCH29031646.29.80E−11540CGPG4002Aldo_ket_red15319258.81.00E−74541CGPG1422ADH_N33148131.61.90E−36541CGPG1422ADH_zinc_N1793141069.80E−29542CGPG4429p450425104433.60E−130544CGPG2251Abhydrolase_121450729.88.80E−06545CGPG6462-Hacid_dh3135764.14.10E−16545CGPG6462-Hacid_dh_C128322225.61.00E−64546CGPG2569zf-CCCH14617127.54.30E−05546CGPG2569WD4017821525.90.00013546CGPG2569WD4030233831.92.00E−06546CGPG2569WD4034337826.49.40E−05547CGPG5507zf-MYND7411147.93.10E−11547CGPG5507UCH539844192.11.20E−54548CGPG5547Hrf166313533.22.60E−157548CGPG5547Yip11082861.70.00014549CGPG5517HEAT17821414.20.44549CGPG5517HEAT49753317.80.036550CGPG5792AA_permease90561483.81.90E−142551CGPG5766PHD28232954.33.70E−13552CGPG5775SEP237311150.24.90E−42552CGPG5775UBX343422105.71.20E−28554CGPG4872WD4031034731.92.00E−06555CGPG6420ADH_N27155128.41.90E−35555CGPG6420ADH_zinc_N186327138.22.10E−38556CGPG6402PALP44349−9.22.50E−07557CGPG5073MatE49209124.52.70E−34557CGPG5073MatE270433104.43.20E−28558CGPG5091Lectin_C6918915.72.10E−05558CGPG5091Pkinase25754264.33.60E−16558CGPG5091Pkinase_Tyr28354269.16.30E−20559CGPG3570TMEM144107204.81.90E−58560CGPG4342Mito_carr114210119.31.00E−32560CGPG4342Mito_carr214301100.54.60E−27560CGPG4342Mito_carr304392100.74.10E−27563CGPG4351ADH_N5914641.23.30E−09563CGPG4351ADH_zinc_N17732366.67.40E−17564CGPG4753FKBP_C43137190.92.80E−54565CGPG4757DPBB_165152140.44.40E−39565CGPG4757Pollen_allerg_1163240133.55.20E−37567CGPG6624YGGT92174111.52.20E−30568CGPG3161ABC_tran92280155.81.00E−43568CGPG3161ABC2_membrane384590180.63.60E−51569CGPG3770ARID2112936.33.70E−08569CGPG3770ELM2372427339.50E−07569CGPG3770Myb_DNA-binding47251819.40.011571CGPG6702Aldedh103564794.55.60E−236572CGPG21MIP30265447.22.00E−131573CGPG6801PEPCK_ATP184921215.70574CGPG154Aa_trans29428516.52.80E−152575CGPG6762Aldedh28511665.15.10E−197576CGPG6763PK1343804.65.00E−239576CGPG6763PK_C355470176.36.80E−50576CGPG6763PEP-utilizers486574116.76.00E−32577CGPG1467Auxin_inducible231121536.90E−43578CGPG6160Miro1112666.48.20E−17578CGPG6160Ras121733181.50E−92579CGPG15p45028483375.38.60E−110580CGPG5825Pkinase138425297.91.80E−86581CGPG5936MES_148421141.91.60E−39582CGPG5974FMO-like10457−195.55.40E−15582CGPG5974DAO12288−14.47.70E−05582CGPG5974Pyr_redox_212308−16.40.0018583CGPG1366Pkinase87363126.56.90E−35583CGPG1366Pkinase_Tyr87364123.26.70E−34584CGPG7390DapB_N5118068.32.20E−17584CGPG7390DapB_C183315100.64.40E−27585CGPG7421Ribul_P_3_epim91291411.79.30E−121585CGPG7421OMPdecase94300−460.0036586CGPG7446TPR_112315626.49.10E−05586CGPG7446TPR_212315624.40.00038586CGPG7446TPR_116019337.15.70E−08586CGPG7446TPR_216019332.99.90E−07586CGPG7446TPR_119424114.40.093587CGPG6295Pkinase67345−14.93.40E−07588CGPG1476RRM_1269763.94.80E−16588CGPG1476RRM_111418498.12.40E−26588CGPG1476RRM_120327388.22.30E−23588CGPG1476RRM_130637694.62.80E−25588CGPG1476PABP50458169.41.10E−17589CGPG1821F-box105737.44.50E−08589CGPG1821LRR_216619016.60.077589CGPG1821LRR_23764016.61.9590CGPG6975Trp_syntA17274441.79.00E−130591CGPG6189GDPD43321186.46.40E−53592CGPG8868Pkinase49318114.42.90E−31593CGPG8909Brix29346270.92.30E−78594CGPG8951Lipase_31022228.99.40E−05595CGPG5892Pkinase50288−29.22.20E−06597CGPG6142FAD_binding_4123258118.81.40E−32599CGPG9034Nicastrin249458357.91.50E−104600CGPG9270Pkinase241513133.94.00E−37600CGPG9270Pkinase_Tyr241513126.75.90E−35602CGPG1284NAPRTase172439196.46.20E−56603CGPG1640DUF163911719080.83.80E−21604CGPG2136HMA1373521.80E−12605CGPG3542Oxidored_FMN11346315.77.60E−92606CGPG1691Mlo55131236.30607CGPG4067Glyco_transf_81694992635.50E−76608CGPG5335U-box25632993.17.90E−25608CGPG5335Arm38342348.91.60E−11608CGPG5335Arm42446422.10.0018608CGPG5335Arm46550540.93.90E−09608CGPG5335Arm50654618.70.019608CGPG5335Arm54758734.14.30E−07609CGPG27Ammonium_transp43467685.53.50E−203611CGPG4375Anti-silence1155392.94.40E−115612CGPG5176Pkinase334602135.41.40E−37612CGPG5176Pkinase_Tyr334606124.92.00E−34614CGPG661Flavodoxin_187230173.83.80E−49614CGPG661FAD_binding_1285509380.42.50E−111614CGPG661NAD_binding_15456571121.60E−30615CGPG869ABC_tran110312147.53.20E−41616CGPG6159Miro1412980.25.80E−21616CGPG6159Ras15176332.95.10E−97618CGPG6282Pkinase_Tyr86365140.54.10E−39618CGPG6282Pkinase86365167.82.60E−47619CGPG7671GATase_22259109.87.20E−30619CGPG7671SIS3564901237.60E−34619CGPG7671SIS52766683.56.10E−22620CGPG1574SPX1293370.22.90E−108620CGPG1574EXS550718320.52.80E−93621CGPG9294Pkinase22281169.67.20E−48622CGPG2435Abhydrolase_3923072504.50E−72623CGPG3336GRP1109144.72.30E−40625CGPG4168zf-A20103429.11.40E−05625CGPG4168zf-AN199139613.50E−15626CGPG6517Aldedh19478827.94.90E−246627CGPG993p4503050086.47.90E−23629CGPG5393PALP20309439.25.10E−129630CGPG1313Di1910218482.64.40E−142631CGPG8150DnaJ9316223.91.90E−05631CGPG8150HSCB_C176250102.11.50E−27632CGPG1661FAE_3-kCoA_syn114340776.21.80E−230633CGPG6427ADH_N27155129.86.70E−36633CGPG6427ADH_zinc_N186332129.49.40E−36634CGPG4906WD40468339.21.30E−08635CGPG5092Lectin_legB25262393.33.30E−115635CGPG5092Pkinase35361145.81.30E−10635CGPG5092Pkinase_Tyr35361165.91.10E−19636CGPG6146DUF24137272374.91.20E−109637CGPG6022RRM_14611328.52.20E−05637CGPG6022RRM_116323344.63.10E−10638CGPG5883Pkinase129411125.51.40E−34638CGPG5883Pkinase_Tyr12941186.86.00E−23640CGPG8248NAD_binding_211632036.40E−58640CGPG82486PGD167301−145.63.60E−09641CGPG8233Sedlin_N377177.82.50E−50645CGPG6337Sterol_desat38246218.11.80E−62646CGPG9005CH1411554.82.60E−13646CGPG9005EB120024776.67.30E−20647CGPG9208CoA_trans5242104.72.50E−28648CGPG4348Xan_ur_permease94532−11.61.00E−07650CGPG618ADK38224317.81.80E−92650CGPG618ADK_lid16019583.46.40E−22651CGPG251p45037459223.15.80E−64652CGPG636Ion_trans_28116375.31.80E−19652CGPG636Ion_trans_220227759.59.80E−15654CGPG287B56714841074.30655CGPG893DUF231210368324.81.30E−94657CGPG657PI-PLC-X10624873.46.70E−19657CGPG657C240549685.11.90E−22658CGPG1489Enolase_N3139232.77.20E−67658CGPG1489Enolase_C147441719.32.40E−213660CGPG1683DUF568892273029.90E−88661CGPG4685MFS_140446143.16.60E−40662CGPG4729Auxin_inducible37146217.23.40E−62663CGPG4880Trehalose_PPase108346314.12.30E−91664CGPG5680Aminotran_1_230384204.71.90E−58665CGPG7317ADH_N271551237.60E−34665CGPG7317ADH_zinc_N186328146.56.30E−41666CGPG4902DUF26015116240.14.30E−69667CGPG6720Aldedh116581794.94.30E−236668CGPG5850Pkinase75349170.92.80E−48668CGPG5850Pkinase_Tyr75349128.32.00E−35669CGPG6010RRM_1130200607.30E−15669CGPG6010RRM_122929962.61.20E−15670CGPG7488OPT28651682.33.40E−202672CGPG7672PTPA86387526.82.10E−155674CGPG6988Sulfotransfer_179340296.64.10E−86675CGPG1357DUF267713185.61.40E−22675CGPG1357DUF2619024181.62.20E−21675CGPG1357Pkinase_Tyr326598144.72.30E−40675CGPG1357Pkinase326598158.31.80E−44676CGPG5918ADH_N35150102.71.00E−27676CGPG5918ADH_zinc_N18131581.32.80E−21677CGPG6020La107166135.11.70E−37677CGPG6020RRM_119527839.79.50E−09678CGPG7032Pkinase18302169.86.20E−48678CGPG7032Pkinase_Tyr18302174.32.80E−49679CGPG7069Na_Ca_ex115249118.51.80E−32679CGPG7069Na_Ca_ex415558118.91.30E−32680CGPG8900K_trans130264.82.30E−23681CGPG8923Pkinase134418303.14.80E−88682CGPG6296Pkinase113376169.57.50E−48682CGPG6296Pkinase_Tyr113376150.73.50E−42685CGPG6951RNA_pol_Rpb89145314.22.10E−91686CGPG6994Ion_trans_215523756.58.00E−14686CGPG6994Ion_trans_2279354444.80E−10687CGPG7019Aldo_ket_red10325−73.85.00E−05689CGPG9255Pkinase97417246.93.90E−71690CGPG9274CS7315066.29.80E−17690CGPG9274SGS15421258.12.70E−14691CGPG6130Miro107222145.61.20E−40691CGPG6130Ras108274−24.61.80E−07692CGPG8074PMEI32193190.92.70E−54694CGPG172WD4018722223.40.00074696CGPG3222zf-C3HC411015130.94.10E−06697CGPG3259Asp86507507.11.90E−149697CGPG3259SapB_231935356.58.10E−14697CGPG3259SapB_1379417575.70E−14698CGPG1900F-box14839.51.00E−08700CGPG20MIP30259448.11.10E−131701CGPG201PI-PLC-X1132571374.80E−38701CGPG201PI-PLC-Y29941797.24.40E−26701CGPG201C243953180.25.70E−21702CGPG3420WD4057361226.96.50E−05703CGPG4308NPH3303092542.90E−73704CGPG5244FH2589985601.38.30E−178705CGPG5538aPHC13313625.34.60E−185706CGPG3738Glycolytic55399842.51.90E−250707CGPG6454PGM_PMM_I103249157.33.70E−44707CGPG6454PGM_PMM_II280394163.54.90E−46707CGPG6454PGM_PMM_III396518150.63.70E−42707CGPG6454PGM_PMM_IV53964796.57.50E−26708CGPG6530Glutaminase121412609.33.10E−180709CGPG5175p4503646755.81.30E−13710CGPG5756Pkinase504838165.31.40E−46711CGPG5361U-box3210687.53.80E−23712CGPG6709Aldedh103562699.52.10E−207713CGPG6755Aminotran_3123449313.43.80E−91714CGPG6765Alpha-amylase16418541.67.40E−160715CGPG6039LEA_22151358.68.90E−105716CGPG6158Miro912478.51.90E−20716CGPG6158Ras10171357.22.40E−104718CGPG7528Agenet368344.80E−07718CGPG7528Agenet14120693.65.50E−25719CGPG7756BRAP251161137.82.70E−38719CGPG7756zf-C3HC416820740.17.20E−09719CGPG7756zf-UBP219290119.49.30E−33721CGPG8249PfkB33132143.20E−61722CGPG2232PCI293397103.94.20E−28723CGPG5494FAE1_CUT1_RppA223135863.30E−173723CGPG5494Chal_sti_synt_C28742600.0016723CGPG5494ACP_syn_III_C33342431.37.50E−09724CGPG5947Glyco_hydro_14109534645.44.40E−191726CGPG6262Pkinase68304−46.32.00E−05727CGPG69092OG-Fell_Oxy205302109.49.80E−30729CGPG8965DnaJ668116.28.30E−32730CGPG5861Pkinase6833798.71.60E−26730CGPG5861Pkinase_Tyr6833778.11.40E−20732CGPG7246MIP75285170.53.80E−48733CGPG5378WWE7714888.42.00E−23734CGPG9168NTP_transferase5286481.11.20E−141734CGPG9168MannoseP_isomer297462449.54.10E−132734CGPG9168Cupin_2377447521.80E−12735CGPG1505PGI525481050.20737CGPG3220Methyltransf_62159181.81.60E−51738CGPG3062Sugar_tr26489672.82.30E−199738CGPG3062MFS_13245089.68.70E−24739CGPG3580DUF58827169202.11.20E−57740CGPG3725DUF13258269487.81.20E−143741CGPG592Spermine_synth50295493.42.40E−145742CGPG4417p45034510291.71.30E−84743CGPG2276IF2_N41947046.96.20E−11743CGPG2276GTP_EFTU4996701574.40E−44743CGPG2276Ras513668−68.70.00036743CGPG2276GTP_EFTU_D2693756591.40E−14744CGPG4975CK_II_beta96270447.91.20E−131745CGPG2790Asp82476−116.59.10E−07746CGPG4972PP2C883492812.00E−81747CGPG5140p4502750287.53.70E−23748CGPG19MIP11232391.41.20E−114749CGPG6769iPGM_N12367828.14.30E−246749CGPG6769Metalloenzyme377493184.13.00E−52750CGPG6770Molybdop_Fe4S45311470.45.30E−18750CGPG6770Molybdopterin11767082.98.90E−22750CGPG6770Molydop_binding902102163.27.60E−16752CGPG6789Isoamylase_N1298134.23.20E−37752CGPG6789Alpha-amylase13857449.21.10E−12753CGPG5374CBS54268344.80E−07753CGPG5374CBS29142680.35.60E−21754CGPG5978Pkinase_Tyr74349229.95.00E−66754CGPG5978Pkinase74349185.31.30E−52755CGPG5979Pkinase108369219.66.40E−63757CGPG948NPH3165407469.92.90E−138758CGPG6052DUF17236111092.89.50E−25759CGPG7661HMA9115450.45.40E−12759CGPG7661DAO188525−20.60.0002759CGPG7661Pyr_redox_2188497237.82.10E−68759CGPG7661Pyr_redox360450891.30E−23759CGPG7661Pyr_redox_dim525634160.93.00E−45760CGPG8261Aminotran_327350484.51.20E−142761CGPG5447Cys_Met_Meta_Pp176561752.13.20E−223761CGPG5447Beta_elim_lyase215461−106.20.0013762CGPG7068DUF1005249432467.91.10E−137763CGPG5938MFS_14542078.22.30E−20764CGPG7100BCNT149228153.64.90E−43765CGPG9003SNARE379966.11.00E−16766CGPG6299Pkinase_Tyr733491369.50E−38766CGPG6299Pkinase73349139.11.10E−38767CGPG9135Acyl-CoA_dh_N5116283.65.80E−22767CGPG9135Acyl-CoA_dh_M16621885.21.80E−22767CGPG9135Acyl-CoA_dh_127241895.21.80E−25767CGPG9135Acyl-CoA_dh_2284407−9.90.0016771CGPG9139TB2_DP1_HVA2229838.66.60E−09772CGPG414Pkinase80338353.14.00E−103774CGPG1986Sulfotransfer_171332273.53.80E−79775CGPG2916PC468158190.24.70E−54776CGPG111PBP20165256.93.80E−74777CGPG4317zf-AN110214270.16.60E−18778CGPG4387zf-A20315540.64.90E−09778CGPG4387zf-AN112416464.72.80E−16779CGPG4492Oleosin2714587.24.70E−23780CGPG597Glyco_hydro_2898437482.93.60E−142781CGPG345TPR_2491524250.00024781CGPG345TPR_149152418.80.018781CGPG345TPR_259062325.40.00018781CGPG345TPR_159062332.31.60E−06781CGPG345TPR_1624657240.00048781CGPG345TPR_265869132.61.30E−06781CGPG345TPR_165869136.68.00E−08783CGPG4998LRRNT_2276757.25.00E−14783CGPG4998LRR_171939.93.6783CGPG4998LRR_19511717.80.035783CGPG4998LRR_111914117.20.056783CGPG4998LRR_114316512.81785CGPG5104C288794.82.30E−25786CGPG6502Pribosyltran95241137.33.90E−38787CGPG6630Lactamase_B11327842.41.40E−09788CGPG5484PP2C100340253.93.10E−73789CGPG5394PDT101278281.41.60E−81789CGPG5394ACT28837126.68.30E−05791CGPG2090DUF7865107220.14.60E−63792CGPG6664FeThRed_A91157166.56.10E−47793CGPG7706Acetyltransf_15814971.23.10E−18794CGPG7884Aldedh72516216.45.70E−62795CGPG6965COX5C263162.41.10E−45797CGPG2297TCTP1165320.92.00E−93798CGPG6246Pkinase24278334.91.30E−97798CGPG6246NAF311373119.58.90E−33799CGPG9037Ribophorin_I404726586.90E−195800CGPG1941TBC271503−52.50.00069801CGPG2055Terpene_synth1105−377.70E−05803CGPG386Ank578946.86.50E−11803CGPG386Ank9112322.60.0013803CGPG386RCC1259307291.50E−05803CGPG386RCC131136125.70.00015804CGPG393CDI15520892.11.50E−24805CGPG609Smr42850281.82.00E−21806CGPG4022Aa_trans41435323.92.60E−94807CGPG942AMP-binding324414059.70E−119808CGPG1800PHD60565348.32.30E−11810CGPG3257LRRNT_2266339.97.80E−09810CGPG3257LRR_19111318.90.016810CGPG3257LRR_111513712.21.4810CGPG3257LRR_11631839.64.1810CGPG3257Pkinase_Tyr34461268.37.20E−20810CPG3257Pkinase34561263.94.90E−16811CGPG1696MtN3_slv693133.55.40E−37811CGPG1696MtN3_slv12821496.28.80E−26813CGPG3780Response_reg7819479.97.10E−21813CGPG3780CCT67571371.52.40E−18814CGPG6602RuBisCO_large_N104229293.34.30E−85814CGPG6602RuBisCO_large237545816.51.30E−242815CGPG6621GIDA128453−2130.0004815CGPG6621Pyr_redox_2128437209.47.80E−60815CGPG6621Pyr_redox296391126.76.00E−35815CGPG6621Pyr_redox_dim470579158.41.70E−44816CGPG5493FAE1_CUT1_RppA113402727.77.00E−216816CGPG5493Chal_sti_synt_C384528−2.90.0028816CGPG5493ACP_syn_III_C44252623.25.20E−08817CGPG5811Pkinase107381198.81.20E−56817CGPG5811Pkinase_Tyr107381253.24.80E−73818CGPG5902ADH_N2710866.29.80E−17818CGPG5902ADH_zinc_N139281165.31.50E−46819CGPG6791PGI25480184.82.00E−52820CGPG6778Alpha-amylase13420442.84.00E−130821CGPG6166Miro712152.91.00E−12821CGPG6166Ras8177266.54.90E−77822CGPG3735LisH477343.85.40E−10823CGPG5854Pkinase227597.24.60E−26825CGPG5024adh_short3021215.23.60E−07826CGPG6054Miro612063.46.90E−16826CGPG6054Ras7178255.31.20E−73827CGPG6207Sugar_tr34479468.38.90E−138827CGPG6207MFS_138438121.32.50E−33828CGPG7620Arginase69349375.57.30E−110831CGPG73Dicty_CAR10314−6.32.80E−06832CGPG2100PLAC817116142.79.00E−40833CGPG6026RRM_186243.47.20E−10834CGPG7269Tim1718146146.85.20E−41836CGPG6993Peptidase_C1212219381.61.10E−111838CGPG6926Yip193240161.12.70E−45839CGPG7172Pkinase3269192.59.30E−55840CGPG7129DUF1070963115.51.40E−31842CGPG9031MFAP1_C1414235111.20E−150843CGPG9105ARD14168329.74.50E−96843CGPG9105Cupin_28816228.32.50E−05844CGPG9082DUF6627169236.36.00E−68845CGPG6212Sugar_tr2049052.41.40E−12845CGPG6212MFS_12945198.91.40E−26846CGPG9206Carboxyl_trans345351022.51.30E−304847CGPG9151Aminotran_3824212698.90E−78848CGPG9129HATPase_c22835683.75.20E−22850CGPG9358TPR_155158410.50.28850CGPG9358TPR_158561815.50.068850CGPG9358TPR_165568816.10.059












TABLE 17








Pfam domain
accession
gathering



name
number
cutoff
domain description


















zf-MYND
PF01753.8
11
MYND finger


UCH
PF00443.18
−8.6
Ubiquitin carboxyl-terminal hydrolase


MIF
PF01187.7
−17.6
Macrophage migration inhibitory factor (MIF)


PurA
PF04845.3
25
PurA ssDNA and RNA-binding protein


Gln-synt_C
PF00120.14
−124
Glutamine synthetase, catalytic domain


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


NAF
PF03822.4
4.5
NAF domain


Sterol_desat
PF01598.7
−13
Sterol desaturase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Asp
PF00026.13
−186.1
Eukaryotic aspartyl protease


Glyco_hydro_1
PF00232.9
−301.8
Glycosyl hydrolase family 1


Oleosin
PF01277.7
−27
Oleosin


Sugar_tr
PF00083.13
−85
Sugar (and other) transporter


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


ATP-grasp_2
PF08442.1
−118.8
ATP-grasp domain


PLAC8
PF04749.6
−1.1
PLAC8 family


Ferric_reduct
PF01794.8
−7
Ferric reductase like transmembrane





component


FAD_binding_8
PF08022.1
−10.4
FAD-binding domain


FAD_binding_6
PF00970.13
−11.4
Oxidoreductase FAD-binding domain


NAD_binding_6
PF08030.1
−23.6
Ferric reductase NAD binding domain


LSM
PF01423.12
13.7
LSM domain


Fibrillarin
PF01269.7
−86.6
Fibrillarin


WD40
PF00400.20
21.5
WD domain, G-beta repeat


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Linker_histone
PF00538.8
−8
linker histone H1 and H5 family


PP2C
PF00481.11
−44
Protein phosphatase 2C


Sugar_tr
PF00083.13
−85
Sugar (and other) transporter


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


SBF
PF01758.6
−27.8
Sodium Bile acid symporter family


Glyoxalase
PF00903.14
12.1
Glyoxalase/Bleomycin resistance





protein/Dioxygenase superfamily


Pkinase
PF00069.14
−70.8
Protein kinase domain


NAF
PF03822.4
4.5
NAF domain


LRR_2
PF07723.2
6
Leucine Rich Repeat


p450
PF00067.11
−105
Cytochrome P450


WD40
PF00400.20
21.5
WD domain, G-beta repeat


WD40
PF00400.20
21.5
WD domain, G-beta repeat


WD40
PF00400.20
21.5
WD domain, G-beta repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


Pkinase
PF00069.14
−70.8
Protein kinase domain


C2
PF00168.18
3.7
C2 domain


SBP56
PF05694.1
25
56 kDa selenium binding protein (SBP56)


IMPDH
PF00478.13
−190.6
IMP dehydrogenase/GMP reductase domain


MtN3_slv
PF03083.5
−0.8
MtN3/saliva family


MtN3_slv
PF03083.5
−0.8
MtN3/saliva family


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


Aminotran_1_2
PF00155.10
−57.5
Aminotransferase class I and II


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


Aminotran_1_2
PF00155.10
−57.5
Aminotransferase class I and II


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


NAF
PF03822.4
4.5
NAF domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Nodulin-like
PF06813.3
−57.8
Nodulin-like


Glyco_hydro_14
PF01373.7
−231.4
Glycosyl hydrolase family 14


GSHPx
PF00255.10
−16
Glutathione peroxidase


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


Sugar_tr
PF00083.13
−85
Sugar (and other) transporter


PGK
PE00162.9
−39.9
Phosphoglycerate kinase


PGK
PF00162.9
−39.9
Phosphoglycerate kinase


Pyr_redox
PF00070.17
5
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disuiphide oxidoreductase


zf-C3HC4
PF00097.13
16.9
Zinc finger, C3HC4 type (RING finger)


MIP
PF00230.9
−62
Major intrinsic protein


Tim17
PF02466.8
2.7
Tim17/Tim22/Tim23 family


HABP4_PAI-RBP1
PF04774.4
17.1
Hyaluronan/mRNA binding family


DUF1677
PF07911.4
25
Protein of unknown function (DUF1677)


NIR_SIR_ferr
PF03460.6
2.4
Nitrite/Sulfite reductase ferredoxin-like half





domain


NIR_SIR
PF01077.11
−25
Nitrite and sulphite reductase 4Fe—4S domain


TIM
PF00121.8
−97
Triosephosphate isomerase


DnaJ
PF00226.19
−8
DnaJ domain


Fer4
PF00037.15
9.3
4Fe—4S binding domain


Pro_isomerase
PF00160.10
−37
Cyclophilin type peptidyl-prolyl cis-trans





isomerase/CLD


Pec_lyase_C
PF00544.8
−45
Pectate lyase


CS
PF04969.5
8.6
CS domain


Glyoxal_oxid_N
PF07250.1
25
Glyoxal oxidase N-terminus


YjeF_N
PF03853.4
25
YjeF-related protein N-terminus


Carb_kinase
PF01256.7
−66.3
Carbohydrate kinase


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


DUF1644
PF07800.2
25
Protein of unknown function (DUF1644)


TMEM14
PF03647.3
−1
Transmembrane proteins 14C


Aldo_ket_red
PF00248.10
−97
Aldo/keto reductase family


Pkinase
PF00069.14
−70.8
Protein kinase domain


BCNT
PF07572.2
25
Bucentaur or craniofacial development


Mito_carr
PF00153.15
0
Mitochondrial carrier protein


Mito_carr
PF00153.15
0
Mitochondrial carrier protein


Mito_carr
PFPF00153.15
0
Mitochondrial carrier protein


NUDIX
PF00293.17
0
NUDIX domain


PRA-CH
PF01502.9
25
Phosphoribosyl-AMP cyclohydrolase


PRA-PH
PF01503.8
6
Phosphoribosyl-ATP pyrophosphohydrolase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


zf-CCCH
PF00642.14
0
Zinc finger C-xB-C-x5-C-x3-H type (and





similar)


WD40
PF00400.20
21.5
WD domain, G-beta repeat


WD40
PF00400.20
21.5
WD domain, G-beta repeat


WD40
PF00400.20
21.5
WD domain, G-beta repeat


CK_II_beta
PF01214.9
−106
Casein kinase II regulatory subunit


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


Yip1
PF04893.6
−6.4
Yip1 domain


ARID
PF01388.11
−8
ARID/BRIGHT DNA binding domain


ELM2
PE01448.12
12
ELM2 domain


Myb_DNA-binding
PF00249.19
2.8
Myb-like DNA-binding domain


PP2C
PF00481.11
−44
Protein phosphatase 2C


MatE
PF01554.8
59.6
MatE


MatE
PF01554.8
59.6
MatE


Methyltransf_11
PF08241.1
17.1
Methyltransferase domain


Methyltransf_12
PF08242.1
21.4
Methyltransferase domain


FMO-like
PF00743.9
−381.6
Flavin-binding monooxygenase-like


DAO
PF01266.12
−35.9
FAD dependent oxidoreductase


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disulphide oxidoreductase


Auxin_inducible
PF02519.4
−15
Auxin responsive protein


Response_reg
PF00072.12
4
Response regulator receiver domain


CCT
PF06203.4
25
CCT motif


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


PABP
PF00658.8
25
Poly-adenylate binding protein, unique





domain


p450
PF00067.11
−105
Cytochrome P450


Aa_trans
PF01490.7
−128.4
Transmembrane amino acid transporter





protein


F-box
PF00646.21
13.6
F-box domain


LRR_2
PF07723.2
6
Leucine Rich Repeat


LRR_2
PF07723.2
6
Leucine Rich Repeat


FHA
PF00498.14
25
FHA domain


Bromodomain
PF00439.14
8.9
Bromodomain


PHD
PF00628.17
25.9
PHD-finger


SET
PF00856.17
23.5
SET domain


Abhydrolase_1
PF00561.10
10.3
alpha/beta hydrolase fold


Epimerase
PF01370.11
−46.3
NAD dependent epimerase/dehydratase





family


3Beta_HSD
PF01073.8
−135.9
3-beta hydroxysteroid





dehydrogenase/isomerase family


DUF231
PF03005.5
−58

Arabidopsis proteins of unknown function



ABC_tran
PF00005.15
9.5
ABC transporter


ABC2_membrane
PF01061.12
−17.9
ABC-2 type transporter


Dehydrin
PF00257.9
−4.4
Dehydrin


DnaJ
PF00226.19
−8
DnaJ domain


DnaJ_C
PF01556.9
−24
DnaJ C terminal region


Ank
PF00023.18
0
Ankyrin repeat


Ank
PF00023.18
0
Ankyrin repeat


RCC1
PF00415.8
19
Regulator of chromosome condensation





(RCC1)


RCC1
PF00415.8
19
Regulator of chromosome condensation





(RCC1)


Pkinase
PF00069.14
−70.8
Protein kinase domain


Lung_7-TM_R
PF06814.3
25
Lung seven transmembrane receptor


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Aa_trans
PF01490.7
−128.4
Transmembrane amino acid transporter





protein


PCl
PF01399.15
25
PCl domain


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


p450
PF00067.11
−105
Cytochrome P450


FKBP_C
PF00254.17
−7.6
FKBP-type peptidyl-prolyl cis-trans isomerase


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Lectin_C
PF00059.10
−10.4
Lectin C-type domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


HEAT
PF02985.10
9.9
HEAT repeat


HEAT
PF02985.10
9.9
HEAT repeat


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


PHD
PF00628.17
25.9
PHD-finger


SEP
PF08059.2
25
SEP domain


UBX
PF00789.10
10
UBX domain


AA_permease
PF00324.10
−120.8
Amino acid permease


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


FAD_binding_4
PF01565.12
−8.1
FAD binding domain


PALP
PF00291.14
−70
Pyridoxal-phosphate dependent enzyme


GIDA
PF01134.11
−226.7
Glucose inhibited division protein A


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox
PF00070.17
5
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox_dim
PF02852.12
−13
Pyridine nucleotide-disulphide





oxidoreductase, dimerisation domain


YGGT
PF02325.7
0
YGGT family


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


PEPCK_ATP
PF01293.10
−327
Phosphoenolpyruvate carboxykinase


Trp_syntA
PF00290.11
−149.8
Tryptophan synthase alpha chain


DapB_N
PF01113.10
−20.7
Dihydrodipicolinate reductase, N-terminus


DapB_C
PF05173.3
0.5
Dihydrodipicolinate reductase, C-terminus


Ribul_P_3_epim
PF00834.8
−97.3
Ribulose-phosphate 3 epimerase family


OMPdecase
PF00215.13
−47.3
Orotidine 5′-phosphate decarboxylase/





HUMPS family


Arginase
PF00491.11
−120
Arginase family


Pkinase
PF00069.14
−70.8
Protein kinase domain


Ferric_reduct
PF01794.8
−7
Ferric reductase like transmembrane





component


Lipase_3
PF01764.14
−8
Lipase (class 3)


Auxin_inducible
PF02519.4
−15
Auxin responsive protein


TPR_2
PF07719.5
20.1
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_2
PF07719.5
20.1
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_2
PF07719.5
20.1
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


Anti-silence
PF04729.4
25
Anti-silencing protein, ASF1-like


p450
PF00067.11
−105
Cytochrome P450


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


TCTP
PF00838.7
−70.7
Translationally controlled tumour protein


NAPRTase
PF04095.5
−88.5
Nicotinate phosphoribosyltransferase





(NAPRTase) family


SPX
PF03105.9
−20
SPX domain


EXS
PF03124.4
20
EXS family


DUF1639
PF07797.3
25
Protein of unknown function (DUF1639)


Mlo
PF03094.5
−263
Mlo family


Sulfotransfer_1
PF00685.16
−53.1
Sulfotransferase domain


PI-PLC-X
PF00388.8
18.8
Phosphatidylinositol-specific phospholipase





C, X domain


PI-PLC-Y
PF00387.8
−11
Phosphatidylinositol-specific phospholipase





C, Y domain


C2
PF00168.18
3.7
C2 domain


HMA
PF00403.14
17.4
Heavy-metal-associated domain


Ammonium_transp
PF00909.10
−144
Ammonium Transporter Family


Asp
PF00026.13
−186.1
Eukaryotic aspartyl protease


SapB_2
PF03489.6
10.7
Saposin-like type B, region 2


SapB_1
PF05184.4
3
Saposin-like type B, region 1


Oxidored_FMN
PF00724.9
−147.7
NADH: flavin oxidoreductase/NADH oxidase





family


Pyr_redox
PF00070.17
5
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disulphide oxidoreductase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Sterol_desat
PF01598.7
−13
Sterol desaturase


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


Sulfotransfer_1
PF00685.16
−53.1
Sulfotransferase domain


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_2
PF07719.5
20.1
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


TPR_2
PF07719.5
20.1
Tetratricopeptide repeat


TPR_1
PF00515.16
7.7
Tetratricopeptide repeat


GATase_2
PF00310.10
−106.2
Glutamine amidotransferases class-II


SIS
PF01380.11
0
SIS domain


SIS
PF01380.11
0
SIS domain


Acetyltransf_1
PF00583.13
18.6
Acetyltransferase (GNAT) family


DUF1005
PF06219.2
25
Protein of unknown function (DUF1005)


DUF231
PF03005.5
−58

Arabidopsis proteins of unknown function



TB2_DP1_HVA22
PF03134.9
−25.1
TB2/DP1, HVA22 family


Di19
PF05605.2
25
Drought induced 19 protein (Di19)


MtN3_slv
PF03083.5
−0.8
MtN3/saliva family


MtN3_slv
PF03083.5
−0.8
MtN3/saliva family


Pkinase
PF00069.14
−70.8
Protein kinase domain


p450
PF00067.11
−105
Cytochrome P450


PGI
PF00342.8
−168.9
Phosphoglucose isomerase


Glyco_hydro_28
PF00295.7
−97
Glycosyl hydrolases family 28


Pkinase
PF00069.14
−70.8
Protein kinase domain


Isoamylase_N
PF02922.7
−6.5
Isoamylase N-terminal domain


Alpha-amylase
PF00128.12
−93
Alpha amylase, catalytic domain


PBP
PF01161.9
−20.6
Phosphatidylethanolamine-binding protein


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


zf-CCCH
PF00642.14
0
Zinc finger C-x8-C-x5-C-x3-H type (and





similar)


Aa_trans
PF01490.7
−128.4
Transmembrane amino acid transporter





protein


Smr
PF01713.11
10
Smr domain


2-Hacid_dh
PF00389.19
11.2
D-isomer specific 2-hydroxyacid





dehydrogenase, catalytic domain


2-Hacid_dh_C
PF02826.7
−82.2
D-isomer specific 2-hydroxyacid





dehydrogenase, NAD binding domain


DUF1070
PF06376.2
25
Protein of unknown function (DUF1070)


AMP-binding
PF00501.16
0
AMP-binding enzyme


Enolase_N
PF03952.6
−3.3
Enolase, N-terminal domain


Enolase_C
PF00113.12
−34
Enolase, C-terminal TIM barrel domain


FAE_3-kCoA_syn1
PF07168.2
25
Fatty acid elongase 3-ketoacyl-CoA synthase 1


efhand
PF00036.20
17.5
EF hand


Na_Ca_ex
PF01699.12
25
Sodium/calcium exchanger protein


NOP5NT
PF08156.2
25
NOP5NT (NUC127) domain


NOSIC
PF08060.2
25
NOSIC (NUC001) domain


Nop
PF01798.6
25
Putative snoRNA binding domain


Terpene_synth
PF01397.10
−86
Terpene synthase, N-terminal domain


PCI
PF01399.15
25
PCI domain


Abhydrolase_3
PF07859.2
25.8
alpha/beta hydrolase fold


GRP
PF07172.1
16.8
Glycine rich protein family


DUF914
PF06027.2
−193
Eukaryotic protein of unknown function





(DUF914)


DUF1325
PF07039.1
25
Protein of unknown function (DUF1325)


zf-A20
PE01754.6
25
A20-like zinc finger


zf-AN1
PF01428.6
0
AN1-like Zinc finger


PALP
PF00291.14
−70
Pyridoxal-phosphate dependent enzyme


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


DUF1723
PF08330.1
17
Protein of unknown function (DUF1723)


GDPD
PF03009.7
−18
Glycerophosphoryl diester phosphodiesterase





family


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


Pribosyltran
PF00156.15
2
Phosphoribosyl transferase domain


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


BRAP2
PF07576.1
25
BRCA1-associated protein 2


zf-C3HC4
PF00097.13
16.9
Zinc finger, C3HC4 type (RING finger)


zf-UBP
PF02148.8
25
Zn-finger in ubiquitin-hydrolases and other





protein


DnaJ
PF00226.19
−8
DnaJ domain


HSCB_C
PF07743.4
−7
HSCB C-terminal oligomerisation domain


ABC_tran
PF00005.15
9.5
ABC transporter


Acyl-CoA_dh_N
PF02771.7
10.9
Acyl-CoA dehydrogenase, N-terminal domain


Acyl-CoA_dh_M
PF02770.9
25
Acyl-CoA dehydrogenase, middle domain


Acyl-CoA_dh_1
PF00441.13
−15.6
Acyl-CoA dehydrogenase, C-terminal domain


Acyl-CoA_dh_2
PF08028.1
−16.9
Acyl-CoA dehydrogenase, C-terminal domain


NTP_transferase
PF00483.12
−90.5
Nucleotidyl transferase


MannoseP_isomer
PF01050.8
−70
Mannose-6-phosphate isomerase


Cupin_2
PF07883.1
16.6
Cupin domain


Lectin_legB
PF00139.10
−110.1
Legume lectin domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


PGAM
PF00300.12
−3
Phosphoglycerate mutase family


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Alpha-amylase
PF00128.12
−93
Alpha amylase, catalytic domain


PTPA
PF03095.4
−106
Phosphotyrosyl phosphate activator (PTPA)





protein


Sedlin_N
PF04628.2
25
Sedlin, N-terminal conserved region


NAD_binding_2
PF03446.4
−63.5
NAD binding domain of 6-phosphogluconate





dehydrogenase


6PGD
PF00393.8
−232.3
6-phosphogluconate dehydrogenase, C-





terminal domain


ADK
PF00406.11
24.2
Adenylate kinase


ADK_lid
PF05191.3
25
Adenylate kinase, active site lid


DUF26
PF01657.7
0
Domain of unknown function DUF26


DUF26
PF01657.7
0
Domain of unknown function DUF26


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


mTERF
PF02536.4
−60
mTERF


B56
PF01603.9
−210
Protein phosphatase 2A regulatory B subunit





(B56 family)


zf-AN1
PE01428.6
0
AN1-like Zinc finger


Xan_ur_permease
PF00860.10
−151.2
Permease family


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


Auxin_inducible
PF02519.4
−15
Auxin responsive protein


Trehalose_PPase
PF02358.6
−49.4
Trehalose-phosphatase


DUF260
PF03195.4
0.8
Protein of unknown function DUF260


Aminotran_1_2
PF00155.10
−57.5
Aminotransferase class I and II


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


La
PF05383.5
25
La domain


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


LEA_2
PF03168.3
25
Late embryogenesis abundant protein


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


DUF241
PF03087.4
−53.6

Arabidopsis protein of unknown function



Sugar_tr
PF00083.13
−85
Sugar (and other) transporter


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


Ion_trans_2
PF07885.4
24.9
Ion channel


Ion_trans_2
PF07885.4
24.9
Ion channel


PI-PLC-X
PF00388.8
18.8
Phosphatidylinositol-specific phospholipase





C, X domain


C2
PF00168.18
3.7
C2 domain


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


RNA_pol_Rpb8
PF03870.5
−31.2
RNA polymerase Rpb8


COX5C
PF05799.1
25
Cytochrome c oxidase subunit Vc (COX5C)


Ion_trans_2
PF07885.4
24.9
Ion channel


Ion_trans_2
PF07885.4
24.9
Ion channel


Aldo_ket_red
PF00248.10
−97
Aldo/keto reductase family


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Na_Ca_ex
PF01699.12
25
Sodium/calcium exchanger protein


Na_Ca_ex
PF01699.12
25
Sodium/calcium exchanger protein


Pkinase
PF00069.14
−70.8
Protein kinase domain


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


OPT
PF03169.6
−238.6
OPT oligopeptide transporter protein


PMEI
PF04043.5
25
Plant invertase/pectin methylesterase





inhibitor


K_trans
PF02705.6
−482
K+ potassium transporter


Pkinase
PF00069.14
−70.8
Protein kinase domain


CH
PF00307.19
22.5
Calponin homology (CH) domain


EB1
PF03271.6
25
EB1-like C-terminal motif


Carboxyl_trans
PF01039.11
−262.3
Carboxyl transferase domain


CoA_trans
PF01144.12
25
Coenzyme A transferase


Pkinase
PF00069.14
−70.8
Protein kinase domain


CS
PF04969.5
8.6
CS domain


SGS
PF05002.5
5.2
SGS domain


NPH3
PF03000.4
25
NPH3 family


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Spermine_synth
PF01564.6
−93.8
Spermine/spermidine synthase


FeThRed_A
PF02941.5
25
Ferredoxin thioredoxin reductase variable





alpha chain


Alpha-amylase
PF00128.12
−93
Alpha amylase, catalytic domain


Cyclin_N
PF00134.13
−14.7
Cyclin, N-terminal domain


Cyclin_C
PF02984.8
−13
Cyclin, C-terminal domain


F-box
PF00646.21
13.6
F-box domain


F-box
PF00646.21
13.6
F-box domain


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


MIP
PF00230.9
−62
Major intrinsic protein


LRRNT_2
PF08263.2
18.6
Leucine rich repeat N-terminal domain


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


WD40
PF00400.20
21.5
WD domain, G-beta repeat


Glycolytic
PF00274.9
−174.5
Fructose-bisphosphate aldolase class-I


PSI_PsaF
PF02507.5
25
Photosystem I reaction centre subunit III


LRRNT_2
PF08263.2
18.6
Leucine rich repeat N-terminal domain


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


LRR_1
PF00560.21
7.7
Leucine Rich Repeat


C2
PF00168.18
3.7
C2 domain


p450
PF00067.11
−105
Cytochrome P450


FH2
PF02181.13
−98.3
Formin Homology 2 Domain


U-box
PF04564.5
10.5
U-box domain


FAE1_CUT1_RppA
PF08392.1
−192.7
FAE1/Type III polyketide synthase-like protein


Chal_sti_synt_C
PF02797.5
−6.1
Chalcone and stilbene synthases, C-terminal





domain


ACP_syn_III_C
PF08541.1
−24.4
3-Oxoacyl-[acyl-carrier-protein (ACP)]





synthase III C terminal


aPHC
PF05875.2
25
Alkaline phytoceramidase (aPHC)


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Glyco_hydro_14
PF01373.7
−231.4
Glycosyl hydrolase family 14


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


PGM_PMM_I
PF02878.5
−37.5
Phosphoglucomutase/phosphomannomutase,





alpha/beta/alpha domain I


PGM_PMM_II
PF02879.5
−20
Phosphoglucomutase/phosphomannomutase,





alpha/beta/alpha domain II


PGM_PMM_III
PF02880.5
−7.8
Phosphoglucomutase/phosphomannomutase,





alpha/beta/alpha domain III


PGM_PMM_IV
PF00408.9
−6
Phosphoglucomutase/phosphomannomutase,





C-terminal domain


Glutaminase
PF04960.5
−143.6
Glutaminase


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


2OG-Fell_Oxy
PF03171.9
11.5
2OG-Fe(II) oxygenase superfamily


Histone
PF00125.13
17.4
Core histone H2A/H2B/H3/H4


F-box
PF00646.21
13.6
F-box domain


Agenet
PF05641.2
6.6
Agenet domain


Agenet
PF05641.2
6.6
Agenet domain


DnaJ
PF00226.19
−8
DnaJ domain


SNARE
PF05739.8
20.8
SNARE domain


adh_short
PF00106.14
−17
short chain dehydrogenase


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disulhide oxidoreductase


Pyr_redox
PF00070.17
5
Pyridine nucleotide-disulhide oxidoreductase


NPH3
PF03000.4
25
NPH3 family


PB1
PF00564.13
12.3
PB1 domain


MIP
PF00230.9
−62
Major intrinsic protein


F-box
PF00646.21
13.6
F-box domain


LRR_2
PF07723.2
6
Leucine Rich Repeat


F-box
PF00646.21
13.6
F-box domain


MIP
PF00230.9
−62
Major intrinsic protein


Methyltransf_6
PF03737.5
25
Demethylmenaquinone methyltransferase


zf-C3HC4
PF00097.13
16.9
Zinc finger, C3HC4 type (RING finger)


RRM_1
PF00076.11
20.7
RNA recognition motif. (a.k.a. RRM, RBD, or





RNP domain)


zf-CCHC
PF00098.12
17.9
Zinc knuckle


zf-CCHC
PF00098.12
17.9
Zinc knuckle


NPH3
PF03000.4
25
NPH3 family


p450
PF00067.11
−105
Cytochrome P450


p450
PF00067.11
−105
Cytochrome P450


DPBB_1
PF03330.7
5.3
Rare lipoprotein A (RlpA)-like double-psi





beta-barrel


Pollen_allerg_1
PF01357.10
17.2
Pollen allergen


p450
PF00067.11
−105
Cytochrome P450


p450
PF00067.11
−105
Cytochrome P450


CBS
PF00571.16
17.5
CBS domain pair


CBS
PF00571.16
17.5
CBS domain pair


Cys_Met_Meta_PP
PF01053.9
−278.4
Cys/Met metabolism PLP-dependent enzyme


Beta_elim_lyase
PF01212.10
−114.4
Beta-eliminating lyase


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


PK
PF00224.10
−244
Pyruvate kinase, barrel domain


PK_C
PF02887.5
−44
Pyruvate kinase, alpha/beta domain


PEP-utilizers
PF00391.12
10
PEP-utilising enzyme, mobile domain


iPGM_N
PF06415.3
−263.4
BPG-independent PGAM N-terminus





(iPGM_N)


Metalloenzyme
PF01676.7
−14.4
Metalloenzyme superfamily


Molybdop_Fe4S4
PF04879.5
13.6
Molybdopterin oxidoreductase Fe4S4 domain


Molybdopterin
PF00384.11
−50
Molybdopterin oxidoreductase


Molydop_binding
PF01568.10
1.1
Molydopterin dinucleotide binding domain


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


HMA
PF00403.14
17.4
Heavy-metal-associated domain


DAO
PF01266.12
−35.9
FAD dependent oxidoreductase


Pyr_redox_2
PF07992.3
−20
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox
PF00070.17
5
Pyridine nucleotide-disulphide oxidoreductase


Pyr_redox_dim
PF02852.12
−13
Pyridine nucleotide-disulphide





oxidoreductase, dimerisation domain


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


Hist_deacetyl
PF00850.9
−71
Histone deacetylase domain


zf-A20
PF01754.6
25
A20-like zinc finger


zf-AN1
PF01428.6
0
AN1-like Zinc finger


U-box
PF04564.5
10.5
U-box domain


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


Arm
PF00514.11
17
Armadillo/beta-catenin-like repeat


PDT
PF00800.8
25
Prephenate dehydratase


ACT
PF01842.13
0
ACT domain


PP2C
PF00481.11
−44
Protein phosphatase 2C


Aminotran_3
PF00202.10
−207.6
Aminotransferase class-III


DUF568
PF04526.3
25
Protein of unknown function (DUF568)


PHD
PF00628.17
25.9
PHD-finger


TBC
PF00566.8
−58
TBC domain


DUF786
PF05646.3
−31.3
Protein of unknown function (DUF786)


PLAC8
PF04749.6
−1.1
PLAC8 family


IF2_N
PF04760.6
25
Translation initiation factor IF-2, N-terminal





region


GTP_EFTU
PF00009.15
8
Elongation factor Tu GTP binding domain


Ras
PF00071.11
−69.9
Ras family


GTP_EFTU_D2
PF03144.14
25
Elongation factor Tu domain 2


PC4
PF02229.5
4
Transcriptional Coactivator p15 (PC4)


DUF588
PF04535.2
25
Domain of unknown function (DUF588)


LisH
PF08513.1
20.7
LisH


CDI
PF02234.8
17
Cyclin-dependent kinase inhibitor


Glyco_transf_8
PF01501.9
−43.2
Glycosyl transferase family 8


Pkinase
PF00069.14
−70.8
Protein kinase domain


efhand
PF00036.20
17.5
EF hand


efhand
PF00036.20
17.5
EF hand


efhand
PF00036.20
17.5
EF hand


efhand
PF00036.20
17.5
EF hand


WWE
PF02825.9
25
WWE domain


FAE1_CUT1_RppA
PF08392.1
−192.7
FAE1/Type III polyketide synthase-like protein


Chal_sti_synt_C
PF02797.5
−6.1
Chalcone and stilbene synthases, C-terminal





domain


ACP_syn_III_C
PF08541.1
−24.4
3-Oxoacyl-[acyl-carrier-protein (ACP)]





synthase III C terminal


Hrf1
PF03878.5
−81.2
Hrf1 family


Yip1
PF04893.6
−6.4
Yip1 domain


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain


ADH_N
PF08240.2
−14.5
Alcohol dehydrogenase GroES-like domain


ADH_zinc_N
PF00107.16
23.8
Zinc-binding dehydrogenase


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


Miro
PF08477.1
28
Miro-like protein


Ras
PF00071.11
−69.9
Ras family


Sugar_tr
PF00083.13
−85
Sugar (and other) transporter


MFS_1
PF07690.5
23.5
Major Facilitator Superfamily


RuBisCO_large_N
PF02788.5
25
Ribulose bisphosphate carboxylase large





chain, N-terminal domain


RuBisCO_large
PF00016.9
−76
Ribulose bisphosphate carboxylase large





chain, catalytic domain


Flavodoxin_1
PF00258.14
6.3
Flavodoxin


FAD_binding_1
PF00667.9
−79
FAD binding domain


NAD_binding_1
PF00175.10
−3.9
Oxidoreductase NAD-binding domain


Lactamase_B
PF00753.16
24.6
Metallo-beta-lactamase superfamily


PGI
PF00342.8
−168.9
Phosphoglucose isomerase


Peptidase_C12
PF01088.10
−91.4
Ubiquitin carboxyl-terminal hydrolase, family 1


Dicty_CAR
PF05462.2
−39.7
Slime mold cyclic AMP receptor


Aldedh
PF00171.11
−209.3
Aldehyde dehydrogenase family


PfkB
PF00294.13
−67.8
pfkB family carbohydrate kinase


Brix
PF04427.7
11.4
Brix domain


MGS
PF02142.11
3
MGS-like domain


AICARFT_IMPCHas
PF01808.8
−98
AICARFT/IMPCHase bienzyme


MFAP1_C
PF06991.1
25
Micro-fibrillar-associated protein 1 C-terminus


Nicastrin
PF05450.4
−85.8
Nicastrin


Ribophorin_I
PF04597.4
−217
Ribophorin I


DUF662
PF04949.2
25
Family of unknown function (DUF662)


ARD
PF03079.4
25
ARD/ARD' family


Cupin_2
PF07883.1
16.6
Cupin domain


HATPase_c
PF02518.14
22.4
Histidine kinase-, DNA gyrase B-, and





HSP90-like ATPase


Pkinase
PF00069.14
−70.8
Protein kinase domain


Pkinase_Tyr
PF07714.5
65
Protein tyrosine kinase


Pkinase
PF00069.14
−70.8
Protein kinase domain









Example 5A

This example illustrates the construction of plasmids for transferring recombinant DNA into the nucleus of a plant cell which can be regenerated into a transgenic crop plant of this invention. Primers for PCR amplification of protein coding nucleotides of recombinant DNA are designed at or near the start and stop codons of the coding sequence, in order to eliminate most of the 5′ and 3′ untranslated regions. DNA of interest, i.e. each DNA identified in Table 1 and the DNA for the identified homologous genes, are cloned and amplified by PCR prior to insertion into the insertion site the base vector.


Elements of an exemplary common expression vector, pMON82060 are illustrated in Table 18. The exemplary base vector which is especially useful for corn transformation is illustrated in FIG. 2 and assembled using technology known in the art. The DNA of interest are inserted in a expression vector at the insertion site between the intron 1 of rice act 1 gene and the termination sequence of PinII gene.

TABLE 18pMON82060Coordinatesof SEQ IDfunctionnameannotationNO: 33636AgroB-AGRtu.right borderAgro right border sequence, essential for5235-5591transformationtransfer of T-DNA.Gene ofP-Os.Act1Promoter from the rice actin gene act1.5609-7009interest plantL-Os.Act1Leader (first exon) from the rice actin 1expressiongene.cassetteI-Os.Act1First intron and flanking UTR exonsequences from the rice actin 1 geneinsertion siteT-St.Pis4The 3′ non-translated region of the7084-8026potato proteinase inhibitor II gene whichfunctions to direct polyadenylation of themRNAPlantP-CaMV.35SCaMV 35S promoter8075-8398selectableL-CaMV.35S5′ UTR from the 35S RNA of CaMVmarkerCR-Ec.nptII-Tn5nptII selectable marker that confers8432-9226expressionresistance to neomycin and kanamycincassetteT-AGRtu.nosA 3′ non-translated region of the9255-9507nopaline synthase gene ofAgrobacterium tumefaciens Ti plasmidwhich functions to directpolyadenylation of the mRNA.AgroB-AGRtu.left borderAgro left border sequence, essential for 39-480transformationtransfer of T-DNA.MaintenanceOR-Ec.oriV-RK2The vegetative origin of replication from567-963in E. coliplasmid RK2.CR-Ec.ropCoding region for repressor of primer2472-2663from the ColE1 plasmid. Expression ofthis gene product interferes with primerbinding at the origin of replication,keeping plasmid copy number low.OR-Ec.ori-ColE1The minimal origin of replication from3091-3679the E. coli plasmid ColE1.P-Ec.aadA-SPC/STRpromoter for Tn7 adenylyltransferase4210-4251(AAD(3″))CR-Ec.aadA-Coding region for Tn74252-5040SPC/STRadenylyltransferase (AAD(3″))conferring spectinomycin andstreptomycin resistance.T-Ec.aadA-SPC/STR3′ UTR from the Tn7 adenylyltransferase5041-5098(AAD(3″)) gene of E. coli.


Plasmids for use in transformation of soybean are also prepared. Elements of an exemplary common expression vector plasmid pMON82053 are shown in Table 19 below. This exemplary soybean transformation base vector illustrated in FIG. 3 was assembled using the technology known in the art. DNA of interest, i.e. each DNA identified in Table 1 and the DNA for the identified homologous genes, are cloned and amplified by PCR prior to insertion into the insertion site the base vector at the insertion site between the enhanced 35S CaMV promoter and the termination sequence of cotton E6 gene.

TABLE 19pMON82053Coordinates of SEQ IDfunctionnameannotationNO: 33637AgroB-AGRtu.left borderAgro left border6144-6585transforamtionsequence, essential fortransfer of T-DNA.PlantP-At.Act7Promoter from the6624-7861selectablearabidopsis actin 7 genemarkerL-At.Act75′UTR of ArabidopsisexpressionAct7 genecassetteI-At.Act7Intron from theArabidopsis actin7 geneTS.At.ShkG-CTP2Transit peptide region of7864-8091Arabidopsis EPSPSCR-AGRtu.aroA-Synthetic CP4 coding8092-9459CP4.nno_Atregion with dicotpreferred codon usage.T-AGRtu.nosA 3′ non-translated region9466-9718of the nopaline synthasegene of Agrobacteriumtumefaciens Ti plasmidwhich functions to directpolyadenylation of themRNA.Gene ofP-CaMV.35S-enhPromoter for 35S RNA 1-613interestfrom CaMV containing aexpressionduplication of the −90 to −350cassetteregion.insertion siteT-Gb.E6-3b3′ untranslated region 688-1002from the fiber protein E6gene of sea-island cotton;AgroB-AGRtu.right borderAgro right border1033-1389transformationsequence, essential fortransfer of T-DNA.MaintenanceOR-Ec.oriV-RK2The vegetative origin of5661-6057in E. colireplication from plasmidRK2.CR-Ec.ropCoding region for3961-4152repressor of primer fromthe ColE1 plasmid.Expression of this geneproduct interferes withprimer binding at theorigin of replication,keeping plasmid copynumber low.OR-Ec.ori-ColE1The minimal origin of2945-3533replication from the E. coliplasmid ColE1.P-Ec.aadA-SPC/STRromoter for Tn72373-2414adenylyltransferase(AAD(3″))CR-Ec.aadA-Coding region for Tn71584-2372SPC/STRadenylyltransferase(AAD(3″)) conferringspectinomycin andstreptomycin resistance.T-Ec.aadA-SPC/STR3′ UTR from the Tn71526-1583adenylyltransferase(AAD(3″)) gene of E. coli.


Example 5B

This example illustrates monocot plant transformation to produce nuclei of this invention in cells of a transgenic plant by transformation of corn callus. Corn plants of a readily transformable line are grown in the greenhouse and ears harvested when the embryos are 1.5 to 2.0 mm in length. Ears are surface sterilized by spraying or soaking the ears in 80% ethanol, followed by air drying. Immature embryos are isolated from individual kernels on surface sterilized ears. Prior to inoculation of maize cells, Agrobacterium cells are grown overnight at room temperature. Immature maize embryos are inoculated with Agrobacterium shortly after excision, and incubated at room temperature with Agrobacterium for 5-20 minutes. Immature embryos are then co-cultured with Agrobacterium for 1 to 3 days at 23° C. in the dark. Co-cultured embryos are transferred to selection media and cultured for approximately two weeks to allow embryogenic callus to develop. Embryogenic callus is transferred to culture medium containing 100 mg/L paromomycin and subcultured at about two week intervals. Transformants are recovered 6 to 8 weeks after initiation of selection.


Plasmid vectors are prepared essentially as described in Example 5 for transforming into corn callus each of DNA identified in Table 1 and the corresponding DNA for the identified homologous genes identified in Table 2, by Agrobacterium-mediated transformation.


For Agrobacterium-mediated transformation of corn callus, immature embryos are cultured for approximately 8-21 days after excision to allow callus to develop. Callus is then incubated for about 30 minutes at room temperature with the Agrobacterium suspension, followed by removal of the liquid by aspiration. The callus and Agrobacterium are co-cultured without selection for 3-6 days followed by selection on paromomycin for approximately 6 weeks, with biweekly transfers to fresh media, and paromomycin resistant callus identified as containing the recombinant DNA in an expression cassette.


Transgenic corn plants are regenerated from transgenic callus resulting from transformation on media to initiate shoot development in plantlets which are transferred to potting soil for initial growth in a growth chamber at 26 degrees C. followed by a mist bench before transplanting to 5 inch pots where plants are grown to maturity. The plants are self fertilized and seed is harvested for screening as seed, seedlings or progeny R2 plants or hybrids, e.g. for yield trials in the screens indicated above. Populations of transgenic plants and seeds produced form transgenic plant cells from each transgenic event are screened as described in Example 7 below to identify the members of the population having the enhanced trait.


Example 6

This example illustrates dicot plant transformation to produce nuclei of this transgenic in cells of transgenic plants by transformation of soybean tissue. For Agrobacterium-mediated transformation, soybean seeds are germinated overnight and the meristem explants excised. The meristems and the explants are placed in a wounding vessel. Soybean explants and induced Agrobacterium cells from a strain containing plasmid DNA with the gene of interest cassette and a plant selectable marker cassette are mixed no later than 14 hours from the time of initiation of seed germination and wounded using sonication. Following wounding, explants are placed in co-culture for 2-5 days at which point they are transferred to selection media for 6-8 weeks to allow selection and growth of transgenic shoots. Trait positive shoots are harvested approximately 6-8 weeks post bombardment and placed into selective rooting media for 2-3 weeks. Shoots producing roots are transferred to the greenhouse and potted in soil. Shoots that remain healthy on selection, but do not produce roots are transferred to non-selective rooting media for an additional two weeks. Roots from any shoots that produce roots off selection are tested for expression of the plant selectable marker before they are transferred to the greenhouse and potted in soil. Populations of transgenic plants and seeds produced form transgenic plant cells from each transgenic event are screened as described in Example 7 below to identify the members of the population having the enhanced trait.


Example 7

This example illustrates identification of nuclei of the invention by screening derived plants and seeds for an enhanced trait identified below.


Many transgenic events which survive to fertile transgenic plants that produce seeds and progeny plants will not exhibit an enhanced agronomic trait. Populations of transgenic seed and plants prepared in Examples 5 and 6 are screened to identify those transgenic events providing transgenic plant cells with a nucleus having recombinant DNA imparting an enhanced trait. Each population is screened for enhanced nitrogen use efficiency, increased yield, enhanced water use efficiency, enhanced tolerance to cold and heat, increased level of oil and protein in seed using assays described below. Plant cell nuclei having recombinant DNA with each of the genes identified in Table 1 and the identified homologs are identified in plants and seeds with at least one of the enhanced traits.


Selection for Enhanced Nitrogen Use Efficiency


Transgenic corn plants with nuclei of the invention are planted in fields with three levels of nitrogen (N) fertilizer being applied, i.e. low level (0 pounds per acre N), medium level (80 pounds per acre N) and high level (180 pounds per acre N). Liquid 28% or 32% UAN (Urea, Ammonium Nitrogen) are used as the N source and apply by broadcast boom and incorporate with a field cultivator with rear rolling basket in the same direction as intended crop rows. Although there is no N applied in the low level treatment, the soil should still be disturbed in the same fashion as the treated area. Transgenic plants and control plants can be grouped by genotype and construct with controls arranged randomly within genotype blocks. For improved statistical analysis each type of transgenic plant can be tested by 3 replications and across 4 locations. Nitrogen levels in the fields are analyzed before planting by collecting sample soil cores from 0-24″ and 24 to 48″ soil layer. Soil samples are analyzed for nitrate-nitrogen, phosphorus (P), potassium (K), organic matter and pH to provide baseline values. P, K and micronutrients are applied based upon soil test recommendations.


Transgenic corn plants prepared in Example 5 and which exhibit a 2 to 5% yield increase as compared to control plants when grown in the high nitrogen field are selected as having nuclei of the invention. Transgenic corn plants which have at least the same or higher yield as compared to control plants when grown in the medium nitrogen field are selected as having nuclei of the invention. Transgenic corn plants having a nucleus with DNA identified in Table 3 as imparting nitrogen use efficiency (LN) and homologous DNA are selected from a nitrogen use efficiency screen as having a nucleus of this invention.


Selection for Increased Yield


Many transgenic plants of this invention exhibit improved yield as compared to a control plant. Improved yield can result from enhanced seed sink potential, i.e. the number and size of endosperm cells or kernels and/or enhanced sink strength, i.e. the rate of starch biosynthesis. Sink potential can be established very early during kernel development, as endosperm cell number and size are determined within the first few days after pollination.


Much of the increase in corn yield of the past several decades has resulted from an increase in planting density. During that period, corn yield has been increasing at a rate of 2.1 bushels/acre/year, but the planting density has increased at a rate of 250 plants/acre/year. A characteristic of modern hybrid corn is the ability of these varieties to be planted at high density. Many studies have shown that a higher than current planting density should result in more biomass production, but current germplasm does not perform. well at these higher densities. One approach to increasing yield is to increase harvest index (HI), the proportion of biomass that is allocated to the kernel compared to total biomass, in high density plantings.


Effective yield selection of enhanced yielding transgenic corn events uses hybrid progeny of the transgenic event over multiple locations with plants grown under optimal production management practices, and maximum pest control. A useful target for improved yield is a 5% to 10% increase in yield as compared to yield produced by plants grown from seed for a control plant. Selection methods may be applied in multiple and diverse geographic locations, for example up to 16 or more locations, over one or more planting seasons, for example at least two planting seasons to statistically distinguish yield improvement from natural environmental effects. It is to plant multiple transgenic plants, positive and negative control plants, and pollinator plants in standard plots, for example 2 row plots, 20 feet long by 5 feet wide with 30 inches distance between rows and a 3 foot alley between ranges. Transgenic events can be grouped by recombinant DNA constructs with groups randomly placed in the field. A pollinator plot of a high quality corn line is planted for every two plots to allow open pollination when using male sterile transgenic events. A useful planting density is about 30,000 plants/acre. High planting density is greater than 30,000 plants/acre, preferably about 40,000 plants/acre, more preferably about 42,000 plants/acre, most preferably about 45,000 plants/acre.


Each of the transgenic corn plants and soybean plants with a nucleus of the invention prepared in Examples 5 and 6 are screened for yield enhancement. At least one event from each of the corn and soybean plants is selected as having at least between 3 and 5% increase in yield as compared to a control plant as having a nucleus of this invention.


Selection for Enhanced Water Use Efficiency (WUE)


The following is a high-throughput method for screening for water use efficiency in a greenhouse to identify the transgenic corn plants with a nucleus of this invention. This selection process imposes 3 drought/re-water cycles on plants over a total period of 15 days after an initial stress free growth period of 11 days. Each cycle consists of 5 days, with no water being applied for the first four days and a water quenching on the 5th day of the cycle. The primary phenotypes analyzed by the selection method are the changes in plant growth rate as determined by height and biomass during a vegetative drought treatment. The hydration status of the shoot tissues following the drought is also measured. The plant height are measured at three time points. The first is taken just prior to the onset drought when the plant is 11 days old, which is the shoot initial height (SIH). The plant height is also measured halfway throughout the drought/re-water regimen, on day 18 after planting, to give rise to the shoot mid-drought height (SMH). Upon the completion of the final drought cycle on day 26 after planting, the shoot portion of the plant is harvested and measured for a final height, which is the shoot wilt height (SWH) and also measured for shoot wilted biomass (SWM). The shoot is placed in water at 40 degree Celsius in the dark. Three days later, the shoot is weighted to give rise to the shoot turgid weight (STM). After drying in an oven for four days, the shoots are weighted for shoot dry biomass (SDM). The shoot average height (SAH) is the mean plant height across the 3 height measurements. The procedure described above may be adjusted for +/−˜one day for each step given the situation.


To correct for slight differences between plants, a size corrected growth value is derived from SIH and SWH. This is the Relative Growth Rate (RGR). Relative Growth Rate (RGR) is calculated for each shoot using the formula [RGR %=(SWH−SIH)/((SWH+SIH)/2)×100]. Relative water content (RWC) is a measurement of how much (%) of the plant was water at harvest. Water Content (RWC) is calculated for each shoot using the formula [RWC %=(SWM−SDM)/(STM−SDM)×100]. Fully watered corn plants of this age run around 98% RWC.


Transgenic corn plants and soybean plants prepared in Examples 5 and 6 are screened for water use efficiency. Transgenic plants having at least a 1% increase in GRG and RWC as compared to control plants are identified as having enhanced water used efficiency and are selected as having a nucleus of this invention. Transgenic corn and soybean plants having in their nucleus DNA identified in Table 3 as imparting drought tolerance improvement (DS) and homologous DNA are identified as showing increased water use efficiency as compared to control plants and are selected as having a nucleus of this invention.


Selection for Growth Under Cold Stress


Cold germination assay-Three sets of seeds are used for the assay. The first set consists of positive transgenic events (F1 hybrid) where the genes of the present invention are expressed in the seed. The second seed set is nontransgenic, wild-type negative control made from the same genotype as the transgenic events. The third set consisted of two cold tolerant and one cold sensitive commercial check lines of corn. All seeds are treated with a fungicide “Captan” (MAESTRO® 80DF Fungicide, Arvesta Corporation, San Francisco, Calif., USA). 0.43 mL Captan is applied per 45 g of corn seeds by mixing it well and drying the fungicide prior to the experiment.


Corn kernels are placed embryo side down on blotter paper within an individual cell (8.9×8.9 cm) of a germination tray (54×36 cm). Ten seeds from an event are placed into one cell of the germination tray. Each tray can hold 21 transgenic events and 3 replicates of wildtype (LH244SDms+LH59), which is randomized in a complete block design. For every event there are five replications (five trays). The trays are placed at 9.7C for 24 days (no light) in a Convrion® growth chamber (Conviron Model PGV36, Controlled Environments, Winnipeg, Canada). Two hundred and fifty milliliters of deionized water are added to each germination tray. Germination counts are taken 10th, 11th, 12th, 13th, 14th, 17th, 19th, 21st, and 24th day after start date of the experiment.


Seeds are considered germinated if the emerged radical size is 1 cm. From the germination counts germination index is calculated.


The germination index is calculated as per:

Germination index=(Σ([T+1−ni]*[Pi−Pi−1]))/T

where T is the total number of days for which the germination assay is performed. The number of days after planting is defined by n. “i” indicated the number of times the germination had been counted, including the current day. P is the percentage of seeds germinated during any given rating. Statistical differences are calculated between transgenic events and wild type control. After statistical analysis, the events that show a statistical significance at the p level of less than 0.1 relative to wild-type controls will advance to a secondary cold selection. The secondary cold screen is conducted in the same manner of the primary selection only increasing the number of repetitions to ten. Statistical analysis of the data from the secondary selection is conducted to identify the events that show a statistical significance at the p level of less than 0.05 relative to wild-type controls.


Transgenic corn plants and soybean plants prepared in Examples 5 and 6 are screened for water use efficiency. Transgenic plants having at least a 5% increase in germination index as compared to control plants are identified as having enhanced cold stress tolerance and are selected as having a nucleus of this invention. Transgenic corn and soybean plants having in their nucleus DNA identified in Table 3 as imparting cold tolerance improvement (CK or CS) and homologous DNA are identified as showing increased cold stress tolerance as compared to control plants and are selected as having a nucleus of this invention.


Screens for Transgenic Plant Seeds with Increased Protein and/or Oil Levels


The following is a high-throughput selection method for identifying plant seeds with improvement in seed composition using the Infratec® 1200 series Grain Analyzer, which is a near-infrared transmittance spectrometer used to determine the composition of a bulk seed sample. Near infrared analysis is a non-destructive, high-throughput method that can analyze multiple traits in a single sample scan. An NIR calibration for the analytes of interest is used to predict the values of an unknown sample. The NIR spectrum is obtained for the sample and compared to the calibration using a complex chemometric software package that provides a predicted values as well as information on how well the sample fits in the calibration.


Infratec® Model 1221, 1225, or 1227 analyzer with transport module by Foss North America is used with cuvette, item #1000-4033, Foss North America or for small samples with small cell cuvette, Foss standard cuvette modified by Leon Girard Co. Corn and soy check samples of varying composition maintained in check cell cuvettes are supplied by Leon Girard Co. NIT collection software is provided by Maximum Consulting Inc. Calculations are performed automatically by the software. Seed samples are received in packets or containers with barcode labels from the customer. The seed is poured into the cuvettes and analyzed as received.

TABLE 21Typical sample(s):Whole grain corn and soybean seedsAnalytical time to runLess than 0.75 min per samplemethod:Total elapsed time per run:1.5 minute per sampleTypical and minimumCorn typical: 50 cc; minimum 30 ccsample size:Soybean typical: 50 cc; minimum 5 ccTypical analytical range:Determined in part by the specificcalibration.Corn - moisture 5-15%, oil 5-20%,protein 5-30%, starch 50-75%, and density1.0-1.3%.Soybean - moisture 5-15%, oil 15-25%,and protein 35-50%.


Transgenic corn plants and soybean plants prepared in Examples 5 and 6 are screened for increased protein and oil in seed. Transgenic inbred corn and soybean plants having an increase of at least 1 percentage point in the total percent seed protein or at least 0.3 percentage point in total seed oil and transgenic hybrid corn plants having an increase of at least 0.4 percentage point in the total percent seed protein as compared to control plants are identified as having enhanced seed protein or enhanced seed oil and are selected as having a nucleus of this invention.


Example 8

This example illustrates monocot and dicot plant transformation to produce nuclei of this invention in cells of a transgenic plant by transformation where the recombinant DNA suppresses the expression of an endogenous protein identified by Pfam, Histone, WD40, NPH3, FHA, PB1, ADH_zinc_N, NAPRTase, ADK_lid, p450, B56, DUF231, C2, DUF568, WD40, F-box, Pkinase, or Terpene-synth. Corn callus and soybean tissue are transformed as describe in Examples 5 and 6 using recombinant DNA in the nucleus with DNA that transcribes to RNA that forms double-stranded RNA targeted to an endogenous gene with DNA encoding the protein. The genes for which the double-stranded RNAs are targeted are the native gene in corn and soybean that are homolog of the genes encoding the protein with an amino acid sequence of SEQ ID NO: 426, 428, 429, 430, 524, 525, 541, 601, 602, 650, 651, 654, 655, 657, 660, 694, 698, 772, 801.


Populations of transgenic corn plants and soybean plants prepared in Examples 5 and 6 with DNA for suppressing a gene identified in Table 3 as providing an enhanced trait by gene suppression are screened to identify an event from those plants with a nucleus of the invention by selecting the trait identified in this specification.

Claims
  • 1. A plant cell nucleus with stably-integrated, recombinant DNA comprising a promoter that is functional in plant cells and that is operably linked to DNA from a plant, bacteria or yeast that encodes a protein having at least one domain of amino acids in a sequence that exceeds the Pfam gathering cutoff of 16.9 for amino acid sequence alignment with a protein domain family identified by Pfam name Cyclin_C; wherein said plant cell nucleus is selected from by screening a population of transgenic plants that have said recombinant DNA in its nuclei and express said protein for an enhanced trait as compared to control plants that do not have said recombinant DNA in their nuclei; and wherein said enhanced trait is selected from group of enhanced traits consisting of enhanced water use efficiency, enhanced cold tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
  • 2. A plant cell nucleus with stably integrated, recombinant DNA comprising a promoter that is functional in plant cells and that is operably linked to DNA from a plant, bacteria or yeast that encodes a protein having at least one domain of amino acids in a sequence that exceeds the Pfam gathering cutoff for amino acid sequence alignment with a protein domain family identified by a Pfam name selected from the group of Pfam names consisting of Mito_carr, 6PGD, UBX, iPGM_N, WD40, Fer4, Enolase_C, DUF1639, PBP, PLAC8, Acyl-CoA_dh—1, Isoamylase_N, Acyl-CoA_dh—2, PC4, Sugar_tr, UCH, Enolase_N, HATPase_c, PRA-PH, Pkinase, SBP56, PEP-utilizers, SIS, PCI, DUF1644, Terpene_synth, Acyl-CoA_dh_M, Acyl-CoA_dh_N, Glutaminase, Lectin_legB, Dehydrin, MatE, Ank, 2-Hacid_dh_C, Chal_sti_synt_C, DUF1070, ATP-grasp—2, Arginase, HABP4_PAI-RBP1, ABC2_membrane, DUF1723, Glyco_hydro—1, MFS—1, ARD, PDT, HMA, Pro_isomerase, Ferric_reduct, PRA-CH, Aa_trans, ACT, LisH, PGM_PMM_II, Spermine_synth, zf-MYND, LRRNT—2, Ribul_P—3_epim, PGM_PMM_IV, NPH3, DapB_C, TPR—1, TPR—2, FAE1_CUT1_RppA, K_trans, F-box, Cyclin_C, ADK, NUDIX, NIR_SIR, PEPCK_ATP, La, DapB_N, MtN3_slv, FMO-like, TIM, FKBP_C, PMEI, Peptidase_C12, Cyclin_N, DUF568, Methyltransf—11, Methyltransf—12, DUF1677, DnaJ_C, BRAP2, IF2_N, Carboxyl_trans, mTERF, Glyoxalase, TMEM14, Mlo, Beta_elim_lyase, Pyr_redox_dim, Glyco_transf—8, Nicastrin, Flavodoxin—1, Epimerase, PTPA, Lipase—3, Pyr_redox—2, GSHPx, ELM2, PGI, Aminotran—1—2, ABC_tran, GRP, PGK, Oleosin, Sulfotransfer—1, EXS, DUF1325, AMP-binding, Arm, NTP_transferase, LSM, Metalloenzyme, Molybdop_Fe4S4, MFAP1_C, Aminotran—3, PHD, B56, DUF588, PSI_PsaF, zf-CCCH, HEAT, PALP, FH2, SapB—1, Ammonium_transp, MannoseP_isomer, NOP5NT, SapB—2, Pyr_redox, Pollen_allerg—1, Asp, DUF662, FHA, YjeF_N, COX5C, GTP_EFTU_D2, Ion_trans—2, PK, DUF231, FAD_binding—1, Hrf1, FAD_binding—4, FAD_binding—6, FAD_binding—8, CBS, Smr, aPHC, DUF241, Brix, Ras, Acetyltransf—1, NAF, SPX, Na_Ca_ex, C2, p450, PP2C, Histone, 2-Hacid_dh, SBF, CCT, BCNT, PK_C, Miro, CH, PfkB, ACP_syn_III_C, Sterol_desat, ADH_zinc_N, CS, Cys_Met_Meta_PP, Lactamase_B, Bromodomain, CDI, Linker_histone, DAO, Dicty_CAR, Aldo_ket_red, zf-AN1, Methyltransf—6, DUF1005, LEA—2, NIR_SIR_ferr, DUF260, Oxidored_FMN, DUF26, Lectin_C, Pec_lyase_C, Nop, TB2_DP1_HVA22, ADH_N, YGGT, NAPRTase, NAD_binding—1, DUF914, PGM_PMM_I, NAD_binding—2, AICARFT_IMPCHas, Auxin_inducible, NAD_binding—6, Anti-silence, RuBisCO_large, Response_reg, FeThRed_A, Di19, SNARE, PGM_PMM_III, Molydop_binding, efhand, zf-CCHC, GTP_EFTU, ARID, adh_short, Fibrillarin, RuBisCO_large_N, WWE, AA_permease, PABP, OMPdecase, RRM—1, U-box, OPT, TBC, MGS, DUF786, 3Beta_HSD, zf-UBP, zf-A20, DPBB—1, GDPD, PI-PLC-X, SEP, PI-PLC-Y, NOSIC, Glycolytic, SET, ADK_lid, Alpha-amylase, EB1, PGAM, Abhydrolase—1, Glyco_hydro—14, Lung—7-TM_R, Abhydrolase—3, TCTP, GATase—2, Gln-synt_C, 20G-FeII_Oxy, Pribosyltran, MIF, CoA_trans, RCC1, Pkinase_Tyr, MIP, DnaJ, HSCB_C, Trehalose_PPase, LRR—1, Cupin—2, LRR—2, Glyco_hydro—28, Yip1, Trp_syntA, Sedlin_N, SGS, Aldedh, CK_II_beta, zf-C3HC4, GIDA, PB 1, IMPDH, Carb_kinase, PurA, Molybdopterin, Nodulin-like, Tim17, Xan_ur_permease, Hist_deacetyl, RNA_pol_Rpb8, Agenet, Myb_DNA-binding, Glyoxal_oxid_N, Ribophorin_I and FAE—3-kCoA_syn1 wherein said Pfam gathering cutoff for said protein domain families are stated in Table 16; wherein said plant cell nucleus is selected by screening a population of transgenic plants that have said recombinant DNA and express said protein for an enhanced trait as compared to control plants that do not have said recombinant DNA in their nuclei; and wherein said enhanced trait is selected from group of enhanced traits consisting of enhanced water use efficiency, enhanced cold tolerance, enhanced heat tolerance, enhanced resistance to salt exposure, enhanced shade tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
  • 3. The plant cell nucleus of claim 2 wherein said protein has an amino acid sequence with at least 90% identity to a consensus amino acid sequence in the group of consensus amino acid sequences consisting of the consensus amino acid sequence constructed for SEQ ID NO: 205 and homologs thereof listed in Table 2 through the consensus amino acid sequence constructed for SEQ ID NO:408 and homologs thereof listed in Table 2.
  • 4. The plant cell nucleus of claim 2 wherein said protein is selected from the group of proteins identified in Table 1.
  • 5. A plant cell nucleus with stably integrated, recombinant DNA to suppress the level of an endogenous protein having at least one domain of amino acids in a sequence that exceeds the Pfam gathering cutoff for amino acid sequence alignment with a protein domain family identified by Pfam name in the group of Pfam names consisting of Histone, WD40, NPH3, FHA, PB 1, ADH_zinc_N, NAPRTase, ADK_lid, p450, B56, DUF231, C2, DUF568, WD40, F-box, Pkinase, and Terpene-synth wherein the Pfam gathering cutoff for said protein domain families is stated in Table 16; wherein said plant cell nucleus is selected by screening a population of transgenic plants with said recombinant DNA and have the level of said endogenous protein suppressed for an enhanced trait as compared to control plants that do not have said recombinant DNA; and wherein said enhanced trait is selected from the group of enhanced traits consisting of enhanced water use efficiency, enhanced cold tolerance, enhanced heat tolerance, enhanced resistance to salt exposure, enhanced shade tolerance, increased yield, enhanced nitrogen use efficiency, enhanced seed protein and enhanced seed oil.
  • 6. The plant cell nucleus of claim 2 or 5 further comprising DNA expressing a protein that provides tolerance from exposure to an herbicide applied at levels that are lethal to a wild type of said plant cell.
  • 7. The plant cell nucleus of claim 6 wherein the agent of said herbicide is a glyphosate, dicamba, or glufosinate compound.
  • 8. A transgenic plant cell or plant comprising a plurality of plant cells with a plant cell nucleus of claim 2 or 5.
  • 9. The transgenic plant cell or plant of claim 8 which is homozygous for said recombinant DNA.
  • 10. A transgenic seed comprising a plurality of plant cells with a plant cell nucleus of claim 2 or 5.
  • 11. The transgenic seed of claim 10 from a corn, soybean, cotton, canola, alfalfa, wheat or rice plant.
  • 12. The transgenic corn seed of claim 11 wherein said seed can produce corn plants that are resistant to disease from the Mal de Rio Cuarto virus or the Puccina sorghi fungus or both.
  • 13. A transgenic pollen grain comprising a haploid derivative of a plant cell nucleus of claim 2 or 5.
  • 14. A method for manufacturing non-natural, transgenic seed that can be used to produce a crop of transgenic plants with an enhanced trait resulting from expression of stably-integrated, recombinant DNA in a nucleus of claim 2 or 5, said method for manufacturing said transgenic seed comprising: (a) screening a population of plants for said enhanced trait and said recombinant DNA, wherein individual plants in said population can exhibit said trait at a level less than, essentially the same as or greater than the level that said trait is exhibited in control plants which do not express the recombinant DNA, (b) selecting from said population one or more plants that exhibit said trait at a level greater than the level that said trait is exhibited in control plants, (c) verifying that said recombinant DNA is stably integrated in said selected plants, (d) analyzing tissue of said selected plant to determine the production or suppression of a protein having the function of a protein encoded by nucleotides having a sequence selected from the group consisting of one of SEQ ID NO:205-408; and (e) collecting seed from said selected plant.
  • 15. The method of claim 14 wherein plants in said population further comprise DNA expressing a protein that provides tolerance to exposure to an herbicide applied at levels that are lethal to wild type plant cells, and wherein said selecting is effected by treating said population with said herbicide.
  • 16. The method of claim 15 wherein said herbicide comprises a glyphosate, dicamba, or glufosinate compound.
  • 17. The method of claim 14 wherein said selecting is effected by identifying plants with said enhanced trait.
  • 18. The method of claim 15 wherein said seed is corn, soybean, cotton, alfalfa, wheat or rice seed.
  • 19. A method of producing hybrid corn seed comprising: (a) acquiring hybrid corn seed from a herbicide tolerant corn plant which also has stably-integrated, recombinant DNA in a nucleus of claim 2 or 5; (b) producing corn plants from said hybrid corn seed, wherein a fraction of the plants produced from said hybrid corn seed is homozygous for said recombinant DNA, a fraction of the plants produced from said hybrid corn seed is hemizygous for said recombinant DNA, and a fraction of the plants produced from said hybrid corn seed has none of said recombinant DNA; (c) selecting corn plants which are homozygous and hemizygous for said recombinant DNA by treating with an herbicide; (d) collecting seed from herbicide-treated-surviving corn plants and planting said seed to produce further progeny corn plants; (e) repeating steps (c) and (d) at least once to produce an inbred corn line; (f) crossing said inbred corn line with a second corn line to produce hybrid seed.
  • 20. A method of selecting a plant comprising cells with a plant cell nucleus of claim 2 or 5 wherein an immunoreactive antibody is used to detect the presence of said protein in seed or plant tissue.
  • 21. Anti-counterfeit milled seed having, as an indication of origin, a plant cell nucleus of claim 2 or 5.
  • 22. A method of growing a corn, cotton, soybean, canola, alfalfa or wheat, crop without irrigation water comprising planting seed having the plant cells with a plant cell nucleus of claim 2 or 5 which are selected for enhanced water use efficiency.
  • 23. The method of claim 22 comprising providing up to 300 millimeters of ground water during the production of said crop.
  • 24. A method of growing a corn, cotton, soybean, canola, alfalfa or wheat crop without added nitrogen fertilizer comprising planting seed having plant cells of claim 2 or 5 which are selected for enhanced nitrogen use efficiency.
CROSS REFERENCE TO RATED APPLICATIONS

This application claims benefit under 35USC § 119(e) of U.S. provisional application Ser. No. 60/679,917, filed May 10, 2005, and U.S. provisional application Ser. No. 60/723,596, filed Oct. 4, 2005, both of which are incorporated herein by reference in their entirety.

Provisional Applications (2)
Number Date Country
60679917 May 2005 US
60723596 Oct 2005 US