TRUNCATED FORMS OF ATYPICAL CYS HIS RICH THIOREDOXIN 4 (ACHT4) CAPABLE OF INHIBITING ACHT4-MEDIATED OXIDATION OF THE SMALL SUBUNIT OF ADP-GLUCOSE PYROPHOSPHORYLASE (APS1)

Information

  • Patent Application
  • 20220170039
  • Publication Number
    20220170039
  • Date Filed
    February 21, 2022
    2 years ago
  • Date Published
    June 02, 2022
    a year ago
Abstract
The present invention provides compositions for attenuating the function of atypical CYS HIS rich thioredoxin 4 (ACHT4), a light-regulated protein expressed in plants and algae that controls starch storage in chloroplast, and methods for increasing plant and algae growth and yield.
Description
SEQUENCE LISTING STATEMENT

The ASCII file, entitled 91421SequenceListing.txt, created on Feb. 21, 2022, comprising 159,405 bytes, submitted concurrently with the filing of this application is incorporated herein by reference.


FIELD OF THE INVENTION

The present invention provides compositions for attenuating the function of atypical CYS HIS rich thioredoxin 4 (ACHT4), a light-regulated protein expressed in plants and algae that controls starch storage in chloroplast, and methods for increasing plant and algae growth and yield.


BACKGROUND OF THE INVENTION

Genetically modified plants with improved agronomic traits such as yield, pest resistance, herbicide tolerance, improved seed compositions, and the like are desired by both farmers and consumers. Although considerable efforts in plant breeding have provided significant gains in desired phenotypes, the ability to introduce specific DNA into plant genomes provides further opportunities for generation of plants with improved and/or unique phenotypes. The ability to develop genetically modified plants with improved traits depends in part on the identification of genes that are useful in recombinant DNA constructs for production of transformed plants with improved properties.


One genetic modification that would be economically desirable would be to increase the growth and yield production of the plant. There is a need to develop a method for increasing growth in plants, regardless of the locale or the environmental conditions.


The Arabidopsis thaliana atypical cysteine histidine-rich Trxs (ACHTs) constitute a small family of plant-specific and chloroplast-localized Trxs. They are light-regulated and are good catalysts of 2-Cys Prx reduction.


The expression profile of the ACHT family members suggests that they have distinct roles. The role of ACHT4, a recently identified paralog of ACHT1 in Arabidopsis, was previously unknown.


SUMMARY OF THE INVENTION

In one embodiment, the present invention provides compositions for attenuating the function of atypical CYS HIS rich thioredoxin 4 (ACHT4), a light-regulated protein expressed in plants and algae that controls starch storage in chloroplast, and methods for increasing plant and algae growth and yield.


In one embodiment, the present invention provides a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a composition comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a composition comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a cell comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a composition comprising a cell comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a seed comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


In another embodiment, the present invention provides a plant, or plant part, comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


In another embodiment, the present invention provides an algae comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


In another embodiment, the present invention provides a polypeptide comprising an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides a composition comprising a polypeptide comprising a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4).


In another embodiment, the present invention provides a method of increasing the yield of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the yield of said plant or algae.


In another embodiment, the present invention provides a method of increasing the productivity of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the productivity of said plant or algae.


In another embodiment, the present invention provides a method of increasing the size of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the size of said plant or algae.


In another embodiment, the present invention provides a method of increasing the biomass of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the biomass of said plant or algae.


In another embodiment, the present invention provides a method of stimulating the growth of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby stimulating the growth of said plant or algae.


In another embodiment, the present invention provides a method of producing a plant or algae having an enhanced phenotype, wherein said method comprises delivering a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein to plant or algae cells, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation, regenerating plants or algae from said cells, and screening said plants or algae to identify a plant having an enhanced phenotype.





BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S)


FIG. 1A: Immunoblot assay showing the oxidized state of ACHT4 active-site Cys residues captured in plants expressing ACHT4 fused with HA-tag at the end of the night (oxidized) and at 1 min (m), 5 min, 30 min, 1 h, and 2 h after beginning of illumination. Analysis of the purified proteins under reducing conditions (reduced), indicated that the changes in the oxidized level of ACHT4 were not the result of altered protein content. Equal loading was verified by ribulose-1,5-bis-phosphate carboxylase/oxygenase (RBCL) levels.



FIG. 1B: Immunoblot assay showing the ACHT4 intermolecular disulfide complexes, 2-Cys Prx heterotrimeric (Prx-t) and heterodimeric (Prx-d), and a unique additional complex (marked with an asterisk) extracted under nonreducing conditions (NR) from plants expressing either ACHT4MT or ACHT1MT. The conversion of the complexes to the monomer (mono) by chemical reduction (R) indicated the disulfide nature of the complexes.



FIGS. 1C and 1D: Reciprocal immunoprecipitation identified 2-Cys Prx (C) and APS1 (D) as the intermolecular disulfide partners of ACHT4. Immunoblot assay of proteins immunoprecipitated with anti-HA (ACHT4 IP), anti-2-Cys Prx (Prx IP), and anti-APS1 (APS IP) affinity matrixes or with a nonspecific matrix (Control IP) from plants expressing ACHT4MT. Purified proteins were run under reducing conditions and blotted with antibodies specific to the HA-tagged ACHT4 (aHA), 2-Cys Prx (αPrx) or APS1 (αAPS1).



FIG. 2A: Immunoblot assay with HA Ab showing the ACHT4 intermolecular disulfide complexes during the transition from night to day.



FIG. 2B: Immunoblot assay showing APS1 redox state during the transition from night to day. The panels of APS1 dimer and the monomer were taken from the same immunoblot exposure.



FIG. 2C: Immunoblot assay with APS1 Ab of gel slice of 50 kDa monomer extracted from plants in the dark (D) or in the light (L) before (−DTT) and after (+DTT) chemical reduction with DTT.



FIG. 2D: Immunoblot assay with APS1 Ab showing the reduced 50 kDa APS1 monomer (red). Equal loading was verified as in FIG. 1. The results shown are representative of three independent experiments.



FIG. 2E: Immunoblot assay with HA Ab showing the ACHT4 intermolecular disulfide complexes during the transition from day to night.



FIG. 2F: Immunoblot assay showing APS1 redox state during the transition from day to night. The panels of APS1 dimer and the monomer were taken from the same immunoblot exposure.



FIG. 3A: Immunoblot assay showing the ACHT4 intermolecular disulfide complexes in plants treated for 2-hrs with 50 μE*m−2*s−1 light intensity (50 μE) and after abrupt decreased (10 μE) followed by abrupt increased light intensity (50 μE). Equal protein loading was verified as in FIG. 1. The results shown are representative of three independent experiments.



FIG. 3B: Immunoblot assay showing the APS1 redox state in plants treated for 2-hrs with 50 μE*m−2*s−1 light intensity (50 μE) and after abrupt decreased (10 μE) followed by abrupt increased light intensity (50 μE). Equal protein loading was verified as in FIG. 1. The results shown are representative of three independent experiments.



FIG. 4A: Schematic representation comparing ACHT1 and ACHT4 protein sequences. ACHT4 is comprised of a chloroplast targeting transit peptide (TP) followed by a conserved thioredoxin (Trx) domain and long C-terminus domain.



FIG. 4B: Immunoblot assay of intermolecular disulfide complexes of ACHT4ΔC as in FIG. 1B.



FIG. 4C: Immunoblot assay with HA Ab showing the ACHT4ΔC intermolecular disulfide complexes during the transition from night to day.



FIG. 4D: Immunoblot assay of thylakoid membranes (T), enriched grana (G), grana margin (GM) and stroma lamellae (SL) and decorated with anti-HA Ab (ACHT1, ACHT4, ACHT4ΔC), or with Ab against the PS II protein PsbA (PsbA) or the PS I protein PsaD Ab (PsaD).



FIG. 5A: Immunoblot assay with APS1 Ab showing APS1 redox state in WT plants (WT), plants expressing increased level of ACHT4ΔC (ACHT4ΔC), or plants expressing increased level of ACHT4 (ACHT4). Equal protein loading was verified as in FIG. 1.



FIG. 5B: Leaves starch content of plants expressing increased level of ACHT4ΔC, or plants expressing increased level of ACHT4 relative to that of WT plants. The results shown are representative of three independent experiments.



FIGS. 6A and 6B: Effect of overexpression of AtACHT4ΔC on biomass accumulation in Arabidopsis plants. Overexpressing (OE) AtACHT4ΔC in Arabidopsis plants stimulated growth (fresh weight, FW, FIG. 6A; dry weight, DW, FIG. 6B) in comparison to wild type plants, indicating that OE of AtACHT4ΔC stimulates the export of photosynthates from the chloroplast which are then directed toward growth. The OE of AtACHT4 decreased growth, confirming that the C-terminus of ACHT4 attenuates growth.



FIG. 7: A phylogenetic tree showing that Arabidopsis has one paralog of ACHT4 where other crop plants, including potato, maize, rice, barley, wheat, sorghum, castor, bean, rapeseed, cotton, soybean, beat, banana, chili, chickpea, tomato, African oilpalm, Foxtail millet, cassava and the algae Chlamydomonas and Chlorella have one to five paralogs. The blastP analysis for Arabidopsis ACHT4 (AtACHT4) was performed against several crop plants, biofuel plants and algae genomes database. Pairwise distance analysis of proteins was performed using MEGA 7 program and the proteins closely related to AtACHT4 were aligned and then phylogenetic tree was constructed. The amino acid sequences of proteins were downloaded from respective databases and have accession numbers or protein ID as follow: Arabidopsis (Arabidopsis thaliana: NP_172333.1; SEQ ID NO: 1); Potato-1 (Solanum tuberosum: XP_006348023.1; SEQ ID NO: 2); Potato-2 (Solanum tuberosum: XP_006351368.1; SEQ ID NO: 3); Maize-1 (Zea mays: NP_001266702.1; SEQ ID NO: 4); Maize-2 (Zea mays: ACR34655.1; SEQ ID NO: 5); Maize-3 (Zea mays: ACN36361.1; SEQ ID NO: 6); Rice-1 (Oryza sativa: XP_015632287.1; SEQ ID NO: 7); Rice-2 (Oryza sativa: XP_015646723.1; SEQ ID NO: 8); Barley-1 (Hordeum vulgare: BAK03063.1; SEQ ID NO: 9); Barley-2 (Hordeum vulgare: BAK07858.1; SEQ ID NO: 10); Wheat (Triticum aestivum: Traslated ORF in 1st frame from mRNA AK335384.1; SEQ ID NO: 11); Cassava-1 (Manihot esculenta: OAY44415.1; SEQ ID NO: 12); Cassava-2 (Manihot esculenta: OAY41970.1; SEQ ID NO: 13); Sorghum-1 (Sorghum bicolor: KXG39469.1; SEQ ID NO: 14); Sorghum-2 (Sorghum bicolor: XP_002465837.1; SEQ ID NO: 15); Sorghum-3 (Sorghum bicolor: KXG36972.1; SEQ ID NO: 16); Rapeseed-1 (Brassica napus: CDY06319.1; SEQ ID NO: 17); Rapeseed-2 (Brassica napus: XP_013711973.1; SEQ ID NO: 18); Rapeseed-3 (Brassica napus: XP_013672630.1; SEQ ID NO: 19); Rapeseed-4 (Brassica napus: XP_013716476.1; SEQ ID NO: 20); Rapeseed-5 (Brassica napus: XP_013641071.1; SEQ ID NO: 21); Castor (Ricinus communis: XP_002525461.1; SEQ ID NO: 22); Bean-1 (Phaseolus vulgaris: XP_007161960.1; SEQ ID NO: 23); Bean-2 (Phaseolus vulgaris: XP_007161924.1; SEQ ID NO: 24); Cotton-1 (Gossypium histrum: NP_001313760.1; SEQ ID NO: 25); Cotton-2 (Gossypium histrum: XP_016753539.1; SEQ ID NO: 26); Cotton-3 (Gossypium histrum: XP_016672835.1; SEQ ID NO: 27); Soybean-1 (Glycine max: XP_003548763.1; SEQ ID NO: 28); Soybean-2 (Glycine max: NP_001276128.1; SEQ ID NO: 29); Beet (Beta vulgaris: XP_010672407.1; SEQ ID NO: 30); Banana-1 (Musa acuminata: XP_009416338.1; SEQ ID NO: 31); Banana-2 (Musa acuminata: XP_009406843.1; SEQ ID NO: 32); Chili (Capsicum annuum: XP_016552829.1; SEQ ID NO: 33); Chick pea (Cicer arietinum: XP_004493141.1; SEQ ID NO: 34); Tomato-1 (Solanum lycopersicum: XP_004252003.1; SEQ ID NO: 35); Tomato-2 (Solanum lycopersicum: XP_004249307.1; SEQ ID NO: 36); African oilpalm-1 (Elaeis guineensis: XP_010938119.1; SEQ ID NO: 37); African oilpalm-2 (Elaeis guineensis: XP_010921294.1; SEQ ID NO: 38); Foxtail millet-1 (Setaria italica: XP_004984516.2; SEQ ID NO: 39); Foxtail millet-2 (Setaria italica: XP_004985651.1; SEQ ID NO: 40); Foxtail millet-3 (Setaria italica: XP_004958724.1; SEQ ID NO: 41); Chlamydomonas (Chlamydomonas reinhardtii: XP_001697443.1; SEQ ID NO: 42); Chlorella (Chlorella variabilis: XP_005851922.1; SEQ ID NO: 43).



FIG. 8: Effect of OE of StACHT4-2ΔC in potato plants on tubers yield. OE StACHT4-2ΔC in potato plants nearly doubled tubers yield (g per plant) in comparison to WT plants (cultivated Desiree), indicating that StACHT4-2ΔC stimulates the export of photosynthates from the chloroplast which are then directed toward growth. OE of StACHT4-2 did not change growth, confirming that the deletion of StACHT4-2 C-terminus relieves growth attenuation and tubers yield in a similar fashion to Arabidopsis AtACHT4ΔC.



FIG. 9: Photographs of potato plants after 60-days growth in the greenhouse showing higher shoots growth and higher tuber yield in StACHT4-2ΔC-OE plants over WT (cultivated Desiree) plants. Scale bar in each panel represents 10 cm.



FIG. 10: Percent change of transitory starch level of leaves of 6-weeks old StACHT4-1ΔC OE and StACHT4-1 OE potato plants. Overexpressing (OE) StACHT4-1ΔC in potato plants stimulated transitory starch accumulation in leaves relative to wild type plants. OE of StACHT4-1 decreased transitory starch accumulation, confirming that the C-terminus of StACHT4-1 attenuates starch synthesis in potato leaves.



FIG. 11: Schematic representation of plant expression vector construct for the over expression of the four StACHT4 construct. Only the construct of StACHT4-1ΔC is shown.





DESCRIPTION OF SPECIFIC EMBODIMENTS OF THE INVENTION

In one embodiment, the present invention provides compositions for attenuating the function and/or expression of atypical CYS HIS rich thioredoxin 4 (ACHT4), a light-regulated protein expressed in plants and algae that controls starch storage in chloroplast, and methods for increasing plant and algae growth and yield.


ACHT4


The Arabidopsis thaliana atypical cysteine histidine-rich Trxs (ACHTs) constitute a small family of plant-specific and chloroplast-localized Trxs. They are light-regulated and are good catalysts of 2-Cys Prx reduction.


A recently discovered paralog, AtACHT4, was found by the inventors to attenuate starch synthesis in Arabidopsis thaliana by oxidizing a regulatory disulfide of the AGPase (ADP-glucose pyrophosphorylase, which catalyzes the first committed step in the starch synthesis pathway; Examples 1-2). The oxidizing reaction of AtACHT4 with AGPase requires the C-terminus of AtACHT4 (Example 2).


Thus, in one embodiment, the present invention provides compositions comprising ACHT4 proteins and nucleic acids and uses thereof. ACHT4 sequences may be from any species comprising such sequences. Table 1 hereinbelow discloses the amino acid sequences of some ACHT4 paralogs in various species, while Table 2 hereinbelow discloses the nucleic acid sequences of some ACHT4 paralogs in various species.









TABLE 1







ACHT4 amino acid sequences













Database






Accession No.






(Genbank

SEQ




unless otherwise

ID


Organism
Paralogs
specified)
Amino Acid Sequence
NO:















Arabidopsis

AtACHT
NP_172333.1
MTEVISKTSLFLGACGNHHRVDDFSFS
1



thaliana 4



PVSFGGFGLKKSFSCLKLKSQKPLRSV






FYGKQIVFGDSQDESFRRSSAITAQTTL






RIGTAQKWWEKGLKDNMREISSAQEL






VDSLTNAGDKLVVVDFFSPGCGGCKA






LHPKICQFAEMNPDVQFLQVNYEEHK






SMCYSLGVHVLPFFRFYRGSQGRVCSF






SCTNATIKKFRDALAKHGPDRCSLGPT






KGLEEKELVALAANKELNFTYTPKPVP






VEKEAATPDSNPSLPVPLPSMSSNDEK






TLVSAGR







Solanum

StACHT4-
XP_006348023.
MMKLMSKGFMFPSSSDCGEIYHHRPL
2



tuberosum

1
1;
NLPGICSFPNKSVNLSCLPSLNLSSSCLP



(Potato)

PGSC0003DMP
RTDFYGRRLVINEGVSKFNRRNSQVV





400032814
DITAQMSIGIRKAQKWWEKGVQPNMK






EVNSAQELVDSLLSAGDKLVVVDFFSP






GCGGCKALHPKLCQLAEMNPDVHFLQ






VNYEEHKSMCYSLNVHVLPFFRFYRG






AEGRVCSFSCTNATIKKFKDALAKYGT






DRCTLGPPKGLEEKELLALAANKDLSF






NYTPKTEEAPVLVTSQKEVQDTTPPNI






ESPLPLPLPLPIASTSSQTAKRDTEKEA






YATSGR







Solanum

StACHT4-
XP_006351368.
MKFNRRNHKSAAATAQMSIGIRKAPK
3



tuberosum

2
1;
WWEKGLQPNMKEVMGAQDLADTLL



(Potato)

PGSC0003DMP
NAGDKLVVVDFLSPGCGGCKALHPKI





400040925
CQLAEMNPDVQFLHVNYEEHKSMCYS






LNVHVLPFFRFYRGAEGRLCSFSCTNA






TIKKFKDALTKYGADCCSLEPVKGLEE






KELLALAANKDLSFAYTPKTEEPMPV






ALQDAKVIKTSRTSSSCPNTFSLLPLPL






PLPLASTSHKAKQDSKSEVF







Zeamays

ZmACHT4-
NP_001266702.
MAAAQAISKGSVVSPCGNRAAPGLLA
4


(Maize)
1
1
RRRGAVAARVAPSAARIGGFWRKNAF






PGGRLTLRTRRSRAASPAQMNMNLAL






GKSMRWWEKGLQPNMREIESAQDLV






DALTNAGDRLVVVDFFSPGCGGCRAF






HPKICQFAEQNPDVLFLQVNYEEHKS






MCHSLHVHVLPLFRFYRGAQGRLCSFS






CTNTTIKKFRDALAKHKPDRCSLGPTR






GLEESELLALAANKDLQFTYAKEEPEL






IPRGDAPGEVVAPEPAKLPAAPKPLVR






LGSEERSLVSSGR







Zeamays

ZmACHT4-
ACR34655.1
MADALCNGVVASPCGRDVAGRARGA
5


(Maize)
2

ARAALAESLQVAGHASKTSFSAGRMS






VKDSKPRPLSRSLEAAAPGQMNLSFPK






AMRWWKKGLHPNMREVESAQDLADS






LLSAGDKLVVVDFFSPGCGGCRALHP






KIAQFAEKNPGVQFLQVNYETHKSMC






YSLRVHVLPFFRFYRGAEGRVSSFSCT






NATINKFKDALAKHGAERCSLGPARG






LDESELMALAENRDLHFTYDKPGGLV






PLAEAIAKEAAAPGGPWLPLPASLLGQ






GSDNSLLPSGR







Zeamays

ZmACHT4-
ACN36361.1
MAAAQVVAKGSVVSPCGNRAVPGLL
6


(Maize)
3

GRRRDAVAAQMTPSAVRIGGSWRKN






AFPGVRLALGTRRSRPASRSFSASPVQ






MNMNLAIGKSMRWWEKGLQPNMREI






ESAQDLVDSLTNAGERLVVVDFFSPGC






GGCRALHPKICQFAERNPDVLFLQVNY






EEHKSMCYSLRVHVLPFFRFYRGAQG






RLCSFSCTNATVRSCPCFFCSYDYWYV






LNNMQHIQNDLY







Oryza

OsACHT4-
XP_015632287.
MAATAAQAVAVKGSVAVPPCGSRGR
7



sativa

1
1
RRGAVASVRMAAAAATSALRIGRRSP



(Rice)


FLGRRLAVGPRRSRPVPRNLVAPVQM






NLAFAKATKWWEKGLQPNMREVESA






QDLVDSLTNAGDNLVIVDFFSPGCGGC






RALHPKICQIAEQNPDVLFLQVNYEEH






KSMCYSLHVHVLPFFRFYRGAQGRLC






SFSCTNATIKKFRDALAKHKPDRCSLG






PTRGLEESELLALAANKDLQFNYTKKP






ELVPSGDAAAAQELDRGSTKLSPPAKP






LVKQGSEERSLVSSGR







Oryza

OsACHT4-
XP_015646723.
MAEALCSGSVASPCGEVGVGFAAGLV
8



sativa

2
1
RGAAAAAALAESVPIGGYSSKSTFPSG



(Rice)


RVALTERKARPLPRNLEAAHGQMNLT






IGKAMRWWEKCLQPNMREIESAQDLA






DSLLNAGDKLVVVDFFSPGCGGCRAL






HPKIAQLAEKNPEVLFLQVNYEKHKS






MCYSLHVHVLPFFRFYRGAQGRVSSFS






CTNATIKKFKDALAKHGPDRCGLGPA






KGLEESELMALAINRDLNFTYTPNQDL






VPIADALLKEAAAPGGPWLPLPATATQ






LFIQGSENSLLSSGR







Hordeum

HvACHT4-
BAK03063.1
MATAQAVAKGTVVSPCGTRAAGFGA
9



vulgare

1

RRRGAVAARMSPCAPAAVRIGRKSPFL



(Barley)


GARLTVGPRRSKLVPRNLVSSPVQMN






LAFAKSTKWWEKGLKPNMREIESAQD






LVDSLANAGDRLVVVDFFSPGCGGCR






ALHPKICQFGEQNPDVLFLQVNYEEHK






SMCYSLHVHVLPFFRFYRGAQGRLCSF






SCTNATIKKFRDALAKHNPDRCSIGPT






RGLEESELLALAANKDLQFTYTKQPEP






VPSGDSEFIAPGSPRLPPPAKPLVRQGS






GERTLVSSGR







Hordeum

HvACHT4-
BAK07858.1
MANALYGGGVAAPCGDLGAAAALAE
10



vulgare

2

SLPMGGGYRARSSFPAGRVALAERPLP



(Barley)


RSLQVAAAAGQMNGNLTIGKAMRW






WEKGTQPNMREVESAQDLADSLLNA






GDKLVVVDFFSPGCGGCRALHPKIAQF






AERNPDVLFLQVNYEKHKSMCYSLHV






HVLPFFRFYRGAQGRVSSFSCTNATIK






KFKDALAKHSPDRCSLGPARGLEKAE






LLALAENRDLEFTYSEKPTLVPIAEAIR






MEAASIGGPWLPLPPAATQPFPLGSEN






GSLIPSGR







Triticum

TaACHT4
Traslated ORF in
MASALCGGGSGSVAAPCGDLGAAAA
11



aestivum


1st frame from
LAESLPMGAGYRAKSSFPAGRVALAD



(Wheat)

mRNA
RPLRRGLQVAAAAGQMNGNLTIGKA





AK335384.1
MRWWEKVTHPNMREVESAQDLADSL






LNAGDKLVVVDFFSPGCGGCRALHPKI






AQFAERNPDVLFLQVNYEKHKSMCYS






LHVHVLPFFRFYRGAQGRVSSFSCTNA






TIKKFKDALAKHSPDRCSLGPARGLEE






AELLALAANRDLEFTYNEKPTLVPIAE






AIQMEAASIGGPWMPLPAAATQPLTLG






SENGSLIPSGR







Manihot

MeACHT4-
OAY44415.1
MADVLSNTNLVSSSFSSSFTGHRNEQK
12



esculenta

1

NSSCRLKGFPRKVNRQTLRLKATSLGS



(Cassava)


DFHGKRVVLQDNQGKPKRGIYLQMSI






KAQHTGLRLKSAPKWWEKGLQPNMR






EVTSAQDFVDSLLNAGDKLVIVDFFSP






GCGGCKALHPKICQFAEMNPDVLFLH






VNYEEHKSMCYSLNIHVLPFFRFYRGA






QGRLCSFSCTNATIKKFRDALAKHSPD






RCSLGPTKGLEEKELIALASNKDLNFK






YAQKPDLPTPIPAKEERVPVVSPSHPNP






ALPLPLPLPTASPKSGQGSEEKTLVGSG






R







Manihot

MeACHT4-
OAY41970.1
MAAVSSNTNLVSSSCSSSFSSSQNRPEY
13



esculenta

2

RSSRLRVFPQELNHQALRLQTTSLGSD



(Cassava)


FHGKRVVLQEKPKCKQGISVQSSIKAQ






TGLRLKNAKNWWEEELQPNMREVISA






QDLVDSLLNAGDKLVIVYFFSPGCGGC






RALHPKICQLAKNNADVQFLKVNYEE






HKSMCYSLNVHVLPFFRFYRGAQGRV






CSFSCTNATIKKFKNALAKHTPDRSSL






EPTKGLEEKELIALAANKDLNLTYAPK






SDKPIPAPTKEEIVPEIPQSLSLALRRSM






ELAQGSAEKTLVASGR







Sorghum

SbACHT4-
KXG39469.1
MAAAQAVAKGSVVAPCGNRAAPGLL
14



bicolor

1

GRRRGAVAARMAPSAVRIGASWRKT






AFTGGRLALGLGTRRSRPASRSSFASP






AQMNMNLAIGKSMRWWEKGLQPNM






REIESAQDLVDSLTNAGDKLVIVDFFSP






GCGGCRALHPKICQFAEQNPDVLFLQV






NYEEHKSMCYSLHVHVLPFFRFYRGA






QGRLCSFSCTNATIKKFKDALAKHKPD






RCSLGPTRGLEESEFLALAANKDLQFT






YTKEPELIPRGDAPGEVIAPEPAKLPAA






TKPLVRLGSEERSLVSSGR







Sorghum

SbACHT4-
XP_002465837.
MAAAQAMAKGSVGQGSLGRRRGAEA
15



bicolor

2
1
ARVGGSWRKSAFLGGRLAVGPRRPRP






VSRILVTSPAVQQTNLSFAKAMKWWQ






KGLQPNMRAIQTAQDLADSLTNAGDG






LVVVDFFSPGCAGCHALHPKICQFAER






NPDVQFLQVNYEEHKSMCHSLHVHVF






PFFRFYRGAQGRLCSFSCTNATIKKFR






DALAKHRADRCSLGPTRGLEESELLAL






AANKDLQFTYTKEAELAPSMEDVAEV






MTADRPGLPTSTMPLARQGSEDRALV






SSGR







Sorghum

SbACHT4-
KXG36972.1
MAEALCNGVVASPYGGGDVGVAGRA
16



bicolor

3

RGAAKAALAESLPVGGYATKSSFSAG






RMSVSDRKPRPLSRNLEAAAAPGQMN






LSFPKAMRWWEKGLHPNMREIESAQD






LADSLLNAGDKLVVVDFFSPGCGGCR






ALHPKIAQFAEKNPDVLFLQVNYETHK






SMCYSLHVHVLPFFRFYRGAEGRVSSF






SCTNATVRIDHLSNFKNQQMNE







Brassica

BnACHT4-
CDY06319.1
MAEAAISRTNLIFRGACVNQHKHVDD
17



napus

1

YSVSSPVSFGLRKSFPSLKVKPFNQFQS



(Rapeseed)


SRSSSSITAQTTLRIGTPQKWWEKGLK






ENMREISSAQELVDSLTNAGDKLVVV






DFFSPGCGGCKALHPKICQLAEQNPDV






QFLQVNYEEHKSMCYSLGVHVLPFFR






FYRGAHGRVCSFSCTNATIKKFRDALA






KHSPDRCSLGPTKGLEEKELVALAAN






KELNFSYTPRAVPVEEEEAPVPASNPG






LPVAHPSMKANDGKTLVSSGR







Brassica

BnACHT4-
XP_013711973.
MAEVISKTSLFFRGACVNHHHHADDF
18



napus

2
1
SVSPVSFGLKKSFSSLKQKPLRSDFSGK



(Rapeseed)


QILQTFNRSFRSSSVTAQSTLRIGTAQK






WWEKGLQENMREISSAQELVDSLADA






GDKLVVVDFFSPGCGGCKALHPKMCQ






LAEQSADVQFLQVNYEEHKSMCYSLG






VHVLPFFRFYRGAQGRVCSFSCTNATI






KKFRDALAKHSPDRCSLGPTKGLEEKE






LVALAANKELNFSYTPKVVPVEKEAAI






PTSNPALPVPHPSMSGSEEKTLVSAGR







Brassica

BnACHT4-
XP_013672630.
MAEAAISRTNLIFRGACVTHHHHADD
19



napus

3
1
YSVSSSPVSFGLRKSFSSLKLKPPRQID



(Rapeseed)


TQFQTFTRSSRASSITAQTTLRIGTPQK






WWEKGLKENMREISSAQELVDSLTNA






GDKLVVVDFFSPGCGGCKALHPKICQL






AEQNPDVQFLQVNYEEHKSMCYSLGV






HVLPFFRFYRGAHGRVCSFSCTNATIK






KFRDALAKHSPDRCSLGPTKGLEEKEL






VALAANKELNFSYTPRAVPVEEEEAPV






PASKPGLAVPHPSMSANDEKTLVSAG






R







Brassica

BnACHT4-
XP_013716476.
MAEAAISRTNLIFRGACVNQHKHVDD
20



napus

4
1
YSVSSPVSFGLRKSFPSLKVKPFNQFQS



(Rapeseed)


SRSSSSITAQTALRIGTPQRWWEKGLK






ENMREISSAQELVDSLTNAGDKLVVV






DFFSPGCGGCKALHPKICQLAEQNPDV






QFLQVNYEEHKSMCYSLGVHVLPFFR






FYRGAHGRVCSFSCTNATIKKFRDALA






KHTPDRCSLGPTKGLEEKELVALAAN






KELNFSYTPKDVPVEEEAAPVPVSNPG






LPVAHPSMKANDGKTLVSSGR







Brassica

BnACHT4-
XP_013641071.
MAEVISKTSLFFGGGACVNHHHHHVD
21



napus

5
1
DLSVSPVSFGFKKSFSSSLKQKPLRSDF



(Rapeseed)


SGKQILETFNRSFRSSSVTAQSTLRIGT






AHKWWEKGSQENMREISSAQDLVDSL






ADAGDKLVVVDFFSPGCGGCKALHPK






MCQLAEQSPDVQFLQVNYEEHKSMCY






SLGVHVLPFFRFYRGAQGRVCSFSCTN






ATIKKFRDALAKHSPDRCSLGPTKGLE






EKELVALAANKELKFSYTPKVVPVEK






EVAIPTSNPGLPVPHPSTMSGSEEKTLV






SAGR







Ricinus

RcACHT4
XP_002525461.
MADVLSKTNLVPSSCCNGYKNQKKD
22



communis


1
GAFVLKRSCSLKVSSRKFNPQAFGSQK



(Castor)


ISLISDFYGKRVIVQEKQLKRGNFHQFS






IKAQTGLRLKNAPKWWEKGLQPNMK






EITSAQDLVDSLMNAGDKLVIVDFFSP






GCGGCKALHPKICQFAEMNPDVQFLQ






VNYEEHKSMCYSLNVHVLPFFRFYRG






AQGRVCSFSCTNATIKKFKDALAKHTP






DRCSLGPTKGLEEKELIALASNKDLNF






TCTPKPVQPTAPAQEEIIPAALTPAHVN






QTLPLPIPLSTTSLMSAQDLGEKTLVTS






GR







Phaseolus

PvACHT4-
XP_007161960.
MAEVFTKASFVSSLLGSSQRHHRRVST
23



vulgaris

1
1
VPDTCTFVSGVGGSPSLKLKSPILRSWS



(Bean)


PSSEFQGKQLLFRVNRGKPNRVSSRLR






ASTAAQMTLRIGKVQKWWEKGLQPN






MKEVTSAQDLVESLLNAGDKLVVVDF






FSPGCGGCKALHPKICQLAEMNPDVQF






LQVNYEEHKSMCYSLNVHVLPFFRFY






RGAHGRLCSFSCTNATIKKFRDALAKH






SPDRCSLGPTKGLEEKELLALAANKDL






SFTLPKPLQPEHANEGLATAPAPVPSSE






SLPLPSLTLNSEVSQERTLTTAGR







Phaseolus

PvACHT4-
XP_007161924.
MAEVLTEASLVSSWHGTTQRHHRRVS
24



vulgaris

2
1
TVPNSSSFVSGVGRFPSLKLKSQILRSL



(Bean)


SSSSEFQGKKLLFHVNRGLANRISSRLG






ASTAAQMTLRIGKGQKWWEKGLQPN






MNEVTSAQDLVESLLNAGDKLVVVDF






FSPGCGGCKALHPKICQLAEMNPDVQF






LQVNYEEHKSMCYSLNVHVLPFFRFY






RGAHGRLCSFSCTNATIKKFKDALAKH






SPDRCSLGPTKGLEEKELLALAANKDL






SFIYAPNPLQPEHENEELATAPAPVPSS






ESLPLCHLISEVSKEKTLITAGR







Gossypium

GhACHT4-
NP_001313760.
MAEVLGKGNLFTTCNYSQTKNLEGGT
25



histrum

1
1
CLVPKKISGFSLERNGFSSLKVKSQALR



(Cotton)


SDFNGQRMVFLEKKSMNRRRFCQVPI






KAQMQSGLIGRIQKWWEKGLQPNMK






EVASAQDLVDSLLNAGDKLVVVDFFS






PGCGGCKALHPKICQFAEMNPDVQFL






QVNYEEHKSMCYSLNVHVLPFFRFYR






GAQGRVCSFSCTNATIKKFRDALAKHT






PDRCSLSTTKGLEEKELLALSANKDLS






FNYTPIPTHGEILIWKQVPSDSTRKLPL






SVPTTSAKQRDSEEKTLVGVGR







Gossypium

GhACHT4-
XP_016753539.
MAEVLGKSNLFTACNYSQKKHQEGG
26



histrum

2
1
VPLFSRRISVFCLRKNSFPSLRLEPQAL



(Cotton)


RSGFNGQRVVFLEKRSLNERRFCRVPI






KAQMQTGLIGKTQKWWEKGNQPNM






KEVTSAQDLVDSLLNAGDKLVIVDFFS






PGCGGCKALHPKICQLAEMNPDVQFL






KVNYEEHKSMCYSLNVHVLPFFRFYR






GAQGRLCSFSCTNATIKKFKDALAKHS






PDRCSLGPTKGLEEKELLALAANKDLS






FNYTPKPVHPAPEEIPVLKEVPSGSSFK






LKESEEKTLIGVGR







Gossypium

GhACHT4-
XP_016672835.
MAEVLGKSNLFTACNCSQKKNQEGGV
27



histrum

3
1
PLFSRRISAFCLRKNSFPSLKLEPQALRS



(Cotton)


GFNGQRVVVLEKRSLNERRFCRVPIKA






QMQTGLIGKTQKWWEKGNQPNMKEV






TSAQDLVDSLLNAGDKLVIVDFFSPGC






GGCKALHPKICQLAEMNPDVQFLKLN






YEEHKSMCYSLNVHVLPFFRFYRGAQ






GRLCSFSCTNATIKKFKDALAKHSPDR






CSLGPTKGLEEKELLALAANKDLSFNY






TPKPVHPAPEEMPVLEEVPSGSSFRPKE






SEEKTLVGVGR







Glycine

GmACHT4-
XP_003548763.
MAEVLTKASLVSSSWHGVSQRHHHRR
28



max

1
1
VSTVLSNNTCSFRSGVGKFSSLKMNSQ



(Soybean)


VLRSWSSSSEFQGKKLVFHVNRGLPNR






VNSRLRASTGTQMNLRLGKVQKWWE






KGLQPNMKEVTSAQDFVDSLLNAGDK






LVVVDFFSPGCGGCKALHPKICQFAEM






NPDVQFLQVNYEEHKSMCYSLNVHVL






PFFRFYRGAHGRLCSFSCTNATIKKFK






DALAKHTPDRCSLGPTIGLEEKELEAL






AANKDLSFTYSPKPLQPSHENEELATE






TASAPALGSGSLPSPSMTLNAVASNER






TLTTSGR







Glycine

GmACHT4-
NP_001276128.
MKSQVLRSWSSSSEFQGIKLVFHVNRG
29



max

2
1
LPNRVNSRLRASTGAQMSFRLGKVQK



(Soybean)


WWEKGLQPNMKEVTSAQDFVDSLLS






AGDKLVVVDFFSPGCGGCKALHPKIC






QFAEMNPDVQFLQVNYEEHKSMCYSL






NVHVLPFFRFYRGAHGRLCSFSCTNAT






IKKFKDALAKHTPDRCSLGPTKGLEEK






ELLALAANKDLSFTNSPEPLQPAHADE






ELGTEPAPAPGSKSLPSPSMILNSEVSK






KRTLTTSGR







Beta

BvACHT4
XP_010672407.
MADVLTKSSVFSPTISHHHSGSKNFPIK
30



vulgaris


1
CSVAVSNRGRLVGISSLRSSFGGVRIAI



(Beet)


DKNTSFGSKRRNYQSIDAKMGLSIGKA






QKWWEKGLQPNMREITSAEDLVDSLL






TAGDTLVVVDFFSPGCGGCRALHPKL






CQLAEMNPDVQFLQINYEEHKSMCYS






LNVHVLPFFRFYRGAEGRVSSFSCTNA






TIKKFKDALAKHNPARCSLGPTKGLEE






KELLALAANKDLSFTYTPKPVEAEPVP






APALEEVSVKADEQVLAQESLPSFNRK






PLSSQPSTVSEEKTLATAAR







Musa

MaACHT4-
XP_009416338.
MAETLAQRTLLLPGGHLSLPPFCGMRS
31



acuminate

1
1
RPSLAAFTLFSRTKVEPLRSSSCDSKFH



(Banana)


GRRLVVGARRGRPSRARLGSGSEQMV






LSFKKAIKWWQKGLQPNMVEIESAEH






LVDSLLNAGDKLVIVDFFSPGCGGCRA






LHPKICQFAESNQNVLFLQINYEQHKS






MCYSLGVHVLPFFRFYRGAHGRLCSFS






CTNATIKKFKDALAKHITDRCSLGPAR






GLEESELLALAANKDLSFNYTSKPVPV






PEEIPERIPTSPKLPLHAVRRPAQESEDK






ALAAAGR







Musa

MaACHT4-
XP_009406843.
MADALAQMTLLSPHGHRSLSRSSDRR
32



acuminate

2
1
NRLVCASKDDLLRSSSSCNSQFHGRRL



(Banana)


VIGAQRERPLRGNRGSSSVQMTLSFKK






ASKWWEKGLHPNMKDIKSAEDLVDSL






SNAGDKLVIVDFFSPGCAGCRALHPKI






CQFAELNPDVQFLQLNHEEHKSMCYS






LNVHVLPFFRFYRGAHGRLCSFSCTNA






TIKKFKDALAKHITERCSLGPAKGLEE






TELLALAANKDLSFTYTRTPVPVPDEL






AEKAPFNPNLPVHAAARLTLESEDKAF






AAAGR







Capsicum

CaACHT4
XP_016552829.
MAKLMNKGFVFPSSSDCGHHRPHGISS
33



annuum


1
FPNKSVNLSCLPSTCLLRSYFYGRRLVI



(Sweet


NEALPKRNAHVAITVQMSMGIRKVQK



and Chili


WWEKGVQPNMKEVNSAQGLVDSLLS



Peppers)


AGDKLVVVDFFSPGCGGCKALHPKLC






QLAEMNPDVQFLQVNYEEHKSMCYSL






NVHLLPFFRFYRGAEGRVCSFSCTNAT






IKKFKDALAKYGTDRCTFGPPKGLEEK






ELLALAANKELSFNYIPKTEEEPVLVAS






QEEVEDRTPNKESPLPLPLPLPISSTSSL






KPKQDTEKEAYATSGR







Cicer

CaACHT4
XP_004493141.
MAEILTKTSLVSSWHGNRKQQHRRLS
34



arietinum


1
MVPNKTCSFNTCVGSFPSLKLKSQFLR



(Chick


SSSFSSEFYGKNTIFRVNRSIPNRINSQF



pea)


SVSAAPKMTLRIGKIQKWWEKGLQPN






MREVTSAQDLVDSLLNAGDKLVIVDF






FSPGCGGCRALHPKICQMAEMNPDVE






FLQVNYEEHKSMCYSLNVHVLPFFRF






YRGAHGRLCSFSCTNATIKKFKDALAK






HTPDRCSLEPTKGLEEKELIALSENKDL






NFTYTPKPLQPVHTPANEELATTKASP






VCSEPLPLPSLTSNSDEVLKERTLTRAG






R







Solanum

SlACHT4-
XP_004252003.
MTKLMSKGFIFPSSSSDCGEIYDRLRLN
35



lycopersicum

1
1;
LHGICSFPNKSVNLSCLPSLKLSSSCLP



(Tomato)

Solycl2g019740.
RTDFYGRRLVINEGLSNFNRRVADITA





1
QMSVGIKKAQKWWEKGVQPNMKEV






NSAQELVDSLLSAGDKLVVVDFFSPGC






GGCKALHPKLCQLAEMNPDVQFLQVN






YEEHKSMCYSLNVHVLPFFRFYRGAE






GRVCSFSCTNATIKKFRDALAKYGTDR






CTIGSPKGLEEKELLALAANKDLSFNY






TPKTEEEPILVTSQKEVRDRTTPNIESPL






PLPLPLPITSTSSQTAKRDTEKEAYATS






GR







Solanum

SlACHT4-
XP_004249307.
MEKLLNKAVFLPSILNSSGIYHSNQHAI
36


lycopersicum
2
1;
CVFPVKFNRRYHKSAVATAQMSIGIKR



(Tomato)

SolyclOg080730.
APKWWEKGLQPNMKEVTGAQDLVDT





1
LLNGGDKLVVVDFLSPGCGGCKALHP






KICQLAEMNPDVQFLHVNYEEHKSMC






YSLNVHVLPFFRFYRGAEGRLCSFSCT






NATIKKFKDALTKYGADCCSLGPVKG






LEEKELLALAANKDLSFAYTPKTEEPV






PLALEEVKVIKTSRQSSSHPNTFSPLPLP






LPLASTLHTAKQDSKS







Elaeis

EgACHT4-
XP_010938119.
MMEVLSQSGVMSPCGHRWVVRSCKE
37



guineensis

1
1
RSPSFVGFPRSSSRTIESLMSSSRNSGFH



(African


GRRLSIGAWRVNAVKGNFSSTPVQMS



oilpalm)


LCVGKALKWWEKELQPNMKEIESAQ






DLVDSLLNAGDKLVIVDFFSPGCGGCK






ALHPKICQFAKLNPDVLFLQVNYEKH






KSMCYSLNVHVLPFFRFYRGAHGRLC






SFSCTNATIKKFKDALAKHTTDRCSLG






PTKGLEESELMALAANKDLSFSYTRKP






VPVPSPDEAAEEVVLSPKLPVSSTPRVI






QDSEEKALVAAGR







Elaeis

EgACHT4-
XP_010921294.
MAEVLGRSGVFSLRGHRSVAPSCQKR
38



guineensis

2
1
SPSFLGFPLSSSRPIGPPRSSSRRFVIGTR



(African


RGRSIKGNSSSSRVQMSLGVGKSLKW



oilpalm)


WEKGVQPNMKEIGSAQDLVDSLLNEG






DKLVIVDFFSPGCGGCKALHPKICRIAE






MNPHVLFLQINYEKHKSMCYSLHVHV






LPFFRFYRGAHGRLCSFSCTNATIKKFK






DALAKHTTDRCSLGPTKGLEESELVAL






AANKDLSFNYTRKPVPVLTPDEAAEK






VPLSPKLPVSSAPRVIKDSEDKALVAA






GT







Setaria

SiACHT4-
XP_004984516.
MAAAQAVAKGSVVSPCGSRAAPGLLS
39



italic

1
2
RRRGAVATRMAPSAVRIGGSWRKTAF



(Foxtail


LGGRLAVGPRRSRSASRTLVASPVQM



millet)


NMNLAIGKSMRWWEKGLQPNMREIES






AQDLVDSLTNAGDRLVIVDFFSPGCGG






CRALHPKICQFAEQNPDVLFLQVNHEE






HKSMCYSLHVHVLPFFRFYRGAQGRL






CSFSCTNATIKKFKDALAKHKPDRCSI






GPTRGLEESELLALAANKDLQFTYTKK






PELIPSGDAAAEVIAPEPTKLPAATKPS






VKIGSEERSXWSHQEDEMNDL







Setaria

SiACHT4-
XP_004985651.
MAAAQAMAKMSVGSPACNRAAGSLC
40



italic

2
1
RWRGAVAVRLGGSWSWRKSPFLGGR



(Foxtail


MAVGPRRSRPVSRNPVASPVQMNLSF



millet)


GKTMKWWEKGLQPNMRAIHTAQELV






DSLINAGDGLVIVDFFSPGCAGCHALH






PKICQFAERNPDVQFLQVNFEEHKSMC






HSLHVHVFPFFRFYRGAQGRLCSFSCT






NATIKKFKDALAKHKPDRCSLGPIKGL






EESELLALAANRDLQFTYTKEQDLAPS






MEDGAEVITHDHPRLPAAAKPLVRQG






SEDRAVVSSGR







Setaria

SiACHT4-
XP_004958724.
MAEALCNGVVPSPCGGDVGVAGRVS
41



italic

3
1
GAAAALAESVPIGGYRTKSSFSAGRM



(Foxtail


AMTDRKMRPLPRSIEAAPGQMNLSFP



millet)


KAMRWWEKGLQPNMREIESAQDLAD






SLLNAGDKLVVVDFFSPGCGGCRALH






AKIAQFAEKNPDVMFLQVNYETHKSM






CYSLHVHVLPFFRFYRGAEGRVSSFSC






TNATIKKFKDALAKHGPDRCSLGPAR






GLEESELMALAANKDLQFTYEKPGLV






PLAEAIAKEAAAPGGPWFPLPASATQF






LTQGSENSLLSSGR






Chlamydomonas
CrACHT4
XP_001697443.
MASILNRAGSRSLVFETKQSLRSIPGSL
42



reinhardtii


1
LSLRSVALKPFRTTICAAGALLTARRST



(Single-


SGLGRANGVVCQAGRSTGEWWKKDN



cell green


PPNMRDINSIQELVDALSDAGDRLVIV



alga)


EFYAQWCNACRALFPKICKIMAENPD






VLFLKVNFDDNRDACRTLSVKVLPYF






HFYRGAEGRVAAFSATISKLQLFKDAV






ETYSAAFCSLEPAPGLAEFPDLIAHPEL






HPEEAAEAARRARLASTESEEELHPLA






DTPTVVG







Chlorella

CvACHT4
XP_005851922.
WWTKSAPPNVVHIKSVQHLVDEMVR
43


(Single-

1, partial
AERLAGAGERLVIMDVFAPWCAACKA



cell green


LYPKLMKLMEERPDVLLLTVNFDENK



alga)


TVVKAMGVKVLPYFMFYRGKEGKLQ






EFSASNKRFHLIQEAIERHSTDRCFLDS






TDEEPVLAEFPTVVPAKGISGSLDEPAG






RAAGKAVGQPQPVA









In another embodiment, the amino acid sequence of ACHT4 is a homolog of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 is a paralog of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 is a fragment of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 is a variant of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 comprises any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 consists essentially of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 consists of any one of the sequences listed in Table 1. In another embodiment, the amino acid sequence of ACHT4 corresponds to any one of the sequences listed in Table 1.


In another embodiment, the amino acid sequence of ACHT4 is a homolog of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 is a paralog of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 is a fragment of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 is a variant of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 comprises any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 consists essentially of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 consists of any one of SEQ ID NOs: 1-43. In another embodiment, the amino acid sequence of ACHT4 corresponds to any one of SEQ ID NOs: 1-43.


In one embodiment, there is one paralog of ACHT4 in a species. In another embodiment, there are two paralogs of ACHT4 in a species. In another embodiment, there are three paralogs of ACHT4 in a species. In another embodiment, there are four paralogs of ACHT4 in a species. In another embodiment, there are five paralogs of ACHT4 in a species. In another embodiment, there are six paralogs of ACHT4 in a species. In another embodiment, there are seven or more paralogs of ACHT4 in a species.


In one embodiment, a “corresponding sequence” is an amino acid (or nucleic acid) sequence from a first species for which there is a similar or equivalent sequence in a second species, which may be inferred by sequence alignment, as is well known in the art.









TABLE 2







ACHT4 nucleic acid sequences













Database

SEQ



Para-
Accession

ID


Organism
logs
No.
Nucleic Acid Sequence
NO:






Arabidopsis

AtACHT4
NM_100730
ATGACGGAAGTGATTAGCAAAACGAGTTTGTT
44



thaliana



CTTAGGAGCTTGTGGTAATCATCACCGTGTTGA






TGATTTCTCTTTCTCTCCGGTGAGTTTTGGTGG






GTTTGGTTTGAAAAAGAGTTTCTCTTGTCTGAA






GCTTAAGAGTCAGAAGCCTCTTAGAAGTGTAT






TTTACGGAAAACAGATCGTTTTCGGAGATTCTC






AAGACGAGAGCTTCAGAAGATCATCAGCTATC






ACAGCTCAGACAACTTTGAGGATTGGGACAGC






TCAGAAGTGGTGGGAGAAAGGTCTGAAAGAT






AACATGAGAGAGATCTCTTCAGCTCAAGAGCT






CGTTGATTCTCTTACTAACGCTGGTGATAAGCT






TGTTGTTGTTGATTTCTTCTCACCTGGCTGTGG






TGGCTGCAAGGCTCTCCATCCTAAGATATGTC






AGTTTGCAGAGATGAACCCGGATGTGCAGTTT






CTTCAGGTGAATTACGAGGAGCATAAGTCCAT






GTGTTATAGTCTTGGTGTCCATGTTCTCCCTTTT






TTCCGATTCTACCGTGGCTCTCAGGGTCGTGTT






TGCAGCTTTAGCTGTACCAATGCCACGATCAA






GAAATTCAGAGATGCCTTGGCAAAGCATGGTC






CAGATAGGTGCAGCCTCGGACCGACCAAAGGC






CTTGAAGAGAAAGAGCTTGTGGCACTTGCAGC






CAACAAAGAACTCAACTTTACTTACACACCAA






AGCCTGTACCAGTTGAGAAAGAAGCAGCCACT






CCTGATTCAAACCCAAGTCTCCCTGTTCCTCTT






CCTTCGATGAGCTCCAATGACGAAAAAACATT






GGTCTCCGCAGGGAGATGA







Solanum

StACHT4-
XM_006347961.2;
ATGATGAAATTGATGAGCAAAGGTTTTATGTT
45



tuberosum

1
PGSC0003DMT400048452
TCCTTCGTCTTCTGATTGTGGTGAAATTTATCA



(Potato)


TCATCGTCCTCTTAATCTACCTGGGATCTGTTC






TTTTCCCAATAAATCGGTCAATCTTTCTTGTCT






TCCTTCGTTGAACCTTTCATCTTCTTGTTTGCCA






AGAACCGATTTTTATGGTCGTAGATTGGTTATA






AATGAAGGCGTATCCAAGTTCAACCGAAGAAA






TTCCCAAGTTGTTGATATCACTGCTCAGATGAG






TATTGGAATCAGGAAAGCACAGAAATGGTGGG






AGAAAGGGGTTCAACCTAACATGAAAGAGGT






GAACAGTGCACAAGAACTTGTTGACTCTCTTTT






GAGTGCAGGGGACAAATTAGTTGTTGTTGATT






TCTTTTCCCCTGGCTGTGGAGGTTGTAAAGCTC






TTCACCCCAAGTTGTGTCAGCTGGCAGAGATG






AATCCAGATGTGCATTTTTTACAGGTGAACTAT






GAGGAACACAAGTCGATGTGTTACTCTCTTAA






TGTACATGTTCTCCCATTTTTCCGTTTCTATAG






AGGAGCTGAAGGCCGTGTTTGCAGCTTTAGCT






GTACCAATGCCACGATCAAAAAATTCAAAGAT






GCACTGGCGAAGTATGGTACAGATCGTTGCAC






CCTTGGGCCGCCAAAAGGGCTGGAGGAGAAA






GAGCTACTTGCACTGGCAGCTAACAAGGATCT






CTCCTTTAATTACACTCCAAAAACAGAAGAAG






CACCCGTCCTTGTTACCTCACAAAAGGAAGTT






CAGGATACAACTCCTCCAAATATAGAGTCCCC






TCTACCACTTCCTCTTCCTCTCCCCATTGCGTC






AACTAGCTCACAGACGGCCAAACGGGATACAG






AGAAAGAAGCATATGCTACTTCTGGTAGATGA







Solanum

StACHT4-
XM_006351306.2;
ATGAAATTCAATAGAAGAAATCACAAATCA
46



tuberosum

2
PGSC0003DMT400060823
GCAGCTGCAACTGCTCAGATGAGCATAGGT



(Potato)


ATCAGGAAAGCTCCTAAATGGTGGGAGAAA






GGACTTCAACCGAATATGAAAGAGGTGATG






GGTGCTCAAGACCTCGCTGACACCCTTCTA






AACGCTGGGGATAAACTAGTCGTTGTCGAT






TTCCTTTCCCCTGGCTGTGGAGGCTGCAAA






GCCCTTCATCCAAAGATATGTCAGTTAGCA






GAGATGAATCCGGATGTGCAGTTTTTACAT






GTGAACTATGAGGAACACAAGTCAATGTGT






TACTCGCTGAACGTACATGTTCTCCCATTT






TTTCGTTTCTATAGAGGTGCTGAAGGTCGT






CTTTGTAGCTTTAGTTGCACCAATGCCACG






ATAAAAAAATTCAAAGATGCATTGACAAAG






TATGGTGCAGATTGTTGCAGCCTCGAACCA






GTTAAAGGGCTCGAGGAGAAAGAGCTACTT






GCCCTAGCAGCTAATAAGGACCTCTCTTTT






GCTTACACACCAAAAACAGAAGAACCAATG






CCTGTTGCCTTACAAGATGCTAAGGTGATA






AAAACAAGCAGAACATCTTCATCTTGTCCA






AATACATTCTCCCTGTTACCACTTCCCCTT






CCTCTTCCTCTAGCATCAACTTCACATAAG






GCCAAACAGGACTCGAAGAGTGAAGTTTTT






TAA







Zeamays

ZmACHT4-
NM_001279773.1
ATGGCGGCAGCGCAGGCGATCTCGAAGGGGA
47


(Maize)
1

GCGTGGTGTCTCCGTGCGGCAATCGAGCGGCG






CCGGGCCTCCTTGCCAGGCGGAGGGGTGCCGT






GGCGGCGCGGGTGGCGCCGTCAGCGGCGCGG






ATCGGGGGCTTCTGGAGGAAGAACGCGTTTCC






TGGCGGGAGGCTAACCCTGAGGACGAGGAGA






TCCAGGGCCGCGTCACCGGCGCAGATGAACAT






GAACCTTGCGCTTGGGAAATCGATGAGGTGGT






GGGAGAAGGGGTTGCAGCCCAACATGCGTGA






GATCGAGTCCGCCCAAGACCTTGTCGATGCTTT






GACCAACGCCGGCGACAGGCTCGTCGTCGTCG






ACTTCTTCTCTCCTGGCTGCGGCGGCTGCCGTG






CTTTTCACCCCAAGATTTGTCAATTTGCGGAGC






AGAATCCAGATGTGCTGTTCTTGCAAGTGAAC






TACGAGGAGCACAAGTCTATGTGCCACAGCCT






TCATGTCCATGTCCTACCCTTGTTCAGATTCTA






CAGGGGAGCACAGGGACGACTCTGTAGCTTCA






GTTGTACAAACACAACTATTAAGAAGTTCAGG






GATGCACTCGCGAAGCACAAGCCAGATAGATG






TAGCCTTGGCCCAACCAGGGGGCTAGAGGAAT






CTGAGTTATTAGCCTTGGCGGCAAACAAGGAC






CTGCAGTTCACCTACGCGAAGGAGGAACCAGA






ACTGATCCCCAGGGGAGATGCTCCTGGGGAGG






TCGTTGCTCCTGAGCCTGCAAAGCTTCCTGCGG






CTCCAAAGCCTTTGGTCAGGCTGGGGTCCGAG






GAGAGGTCACTGGTCTCGTCAGGAAGATGA







Zeamays

ZmACHT4-
BT084302.
ATGGCTGACGCGTTGTGCAACGGCGTCGTGGC
48


(Maize)
2
1
GTCCCCGTGCGGCCGGGACGTCGCCGGCCGGG






CCAGGGGCGCCGCCAGGGCCGCGCTCGCGGAG






TCCCTGCAGGTCGCCGGGCACGCCAGCAAGAC






CTCCTTCTCCGCCGGGAGGATGTCGGTCAAGG






ACAGCAAGCCGAGGCCCCTGTCGCGTAGCCTC






GAGGCCGCCGCGCCAGGACAGATGAACCTGTC






GTTCCCCAAAGCCATGCGGTGGTGGAAGAAGG






GGCTGCACCCCAACATGCGCGAGGTCGAGTCC






GCGCAGGACCTGGCCGACTCGCTGCTCAGCGC






CGGCGACAAGCTCGTGGTCGTCGACTTCTTCTC






CCCAGGCTGCGGCGGCTGCCGCGCCCTCCACC






CCAAGATCGCCCAGTTCGCCGAGAAGAACCCG






GGCGTGCAGTTCTTGCAGGTGAACTACGAGAC






GCACAAGTCCATGTGCTACAGCCTCCGCGTCC






ACGTCCTCCCTTTCTTCAGGTTCTACCGGGGAG






CCGAGGGCCGGGTCAGCAGCTTCAGCTGCACC






AACGCAACGATCAACAAGTTCAAGGACGCGCT






CGCCAAGCACGGGGCTGAGAGGTGTAGCCTCG






GGCCTGCGCGGGGGCTGGACGAGTCGGAGCTC






ATGGCCTTGGCTGAGAACAGGGACCTGCACTT






CACCTACGACAAGCCGGGCGGCCTCGTCCCCC






TCGCCGAAGCTATTGCCAAGGAGGCTGCCGCA






CCGGGAGGCCCGTGGCTTCCTCTGCCTGCGTCC






CTGCTCGGCCAGGGATCCGACAACTCATTGCT






GCCCTCTGGAAGATAG







Zeamays

ZmACHT4-
BT069464.
ATGGCGGCGGCGCAGGTGGTCGCGAAGGGGAGCGTGG
49


(Maize)
3
1
TGTCGCCGTGCGGCAATCGAGCGGTGCCGGGCCTCTT






GGGCAGGCGGAGGGATGCCGTGGCGGCGCAGATGACG






CCGTCGGCGGTGCGGATCGGGGGCTCCTGGAGGAAGA






ACGCGTTTCCTGGCGTGAGGCTAGCCTTGGGGACGAG






GAGATCCAGGCCCGCGTCCCGGAGTTTCTCCGCCTCG






CCGGTGCAGATGAACATGAACCTTGCGATTGGGAAAT






CAATGAGGTGGTGGGAGAAGGGGTTGCAGCCCAACAT






GCGTGAGATCGAGTCCGCCCAAGACCTTGTAGATTCC






TTAACCAACGCCGGCGAGAGGCTCGTCGTCGTCGACT






TCTTCTCCCCTGGCTGCGGCGGCTGCCGTGCTCTTCA






CCCGAAGATTTGCCAATTTGCGGAGCGGAACCCTGAT






GTGCTGTTCTTGCAAGTGAACTACGAGGAGCACAAGT






CTATGTGCTACAGCCTTCGTGTCCATGTGCTACCCTT






CTTCAGATTCTACAGAGGAGCACAGGGACGACTCTGC






AGCTTCAGCTGTACAAACGCAACTGTAAGATCATGTC






CATGTTTCTTCTGTTCGTATGATTATTGGTATGTCCT






CAATAACATGCAACATATCCAAAATGACCTTTATTGA







Oryza

OsACHT4-
XM_015776801.1
ATGGCGGCGACGGCGGCGCAGGCGGTGGCGG
50



sativa

1

TGAAGGGGAGCGTGGCGGTGCCGCCGTGCGGG



(Rice)


AGCCGCGGCCGGCGGAGGGGCGCCGTGGCGTC






GGTGCGCATGGCGGCGGCGGCGGCGACGTCGG






CGTTGCGGATCGGCAGGAGGAGCCCCTTCCTC






GGCCGGAGGCTGGCGGTTGGGCCGAGGAGATC






CAGGCCCGTGCCCCGGAATCTCGTCGCGCCGG






TGCAGATGAATCTCGCGTTTGCGAAAGCCACG






AAGTGGTGGGAGAAGGGATTGCAGCCCAACAT






GCGGGAGGTCGAGTCCGCGCAAGACCTCGTCG






ACTCCTTGACCAACGCCGGCGACAATCTCGTC






ATCGTCGACTTCTTCTCCCCTGGCTGCGGCGGC






TGCCGTGCCCTCCACCCCAAGATTTGCCAGATT






GCAGAGCAGAATCCGGACGTGCTGTTCTTGCA






GGTGAACTATGAGGAGCACAAGTCTATGTGCT






ACAGCCTCCATGTTCATGTTCTTCCTTTCTTCA






GGTTCTACAGGGGAGCTCAGGGCCGGCTCTGC






AGCTTCAGCTGTACTAACGCAACTATTAAGAA






GTTCAGGGATGCGCTTGCTAAGCATAAACCAG






ATAGATGCAGCCTTGGCCCAACTAGGGGGCTC






GAGGAGTCGGAGCTATTGGCGCTGGCTGCGAA






CAAGGATCTGCAGTTCAACTACACCAAGAAAC






CAGAACTGGTTCCTAGCGGAGATGCCGCAGCT






GCCCAGGAATTGGATCGTGGAAGCACAAAGCT






TTCTCCACCCGCAAAACCATTGGTCAAGCAGG






GCTCTGAAGAGAGGTCCTTGGTCTCATCAGGC






AGATGA







Oryza

OsACHT4-
XM_015791237.1
ATGGCTGAGGCACTGTGCAGCGGCAGCGTCGC
51



sativa

2

GTCCCCGTGCGGGGAGGTGGGTGTGGGGTTCG



(Rice)


CCGCCGGCCTTGTGAGGGGCGCCGCGGCGGCG






GCGGCGCTCGCGGAGTCTGTGCCGATTGGTGG






GTACAGCAGCAAGAGCACGTTCCCGAGTGGGA






GGGTGGCGCTCACGGAGAGGAAGGCGAGGCC






CCTGCCACGGAATCTCGAAGCGGCGCATGGGC






AGATGAACCTGACGATTGGGAAGGCCATGAGG






TGGTGGGAGAAGTGCCTGCAGCCCAACATGAG






GGAGATCGAGTCGGCGCAAGACCTCGCCGACT






CCCTCCTCAACGCCGGCGACAAGCTCGTCGTC






GTCGACTTCTTCTCCCCGGGCTGCGGTGGCTGC






CGCGCCCTACACCCCAAGATTGCTCAACTAGC






CGAGAAGAACCCGGAGGTGCTGTTCTTGCAAG






TGAACTACGAGAAGCACAAGTCAATGTGCTAC






AGCCTCCATGTTCATGTTCTGCCATTCTTCAGG






TTCTACAGGGGAGCTCAGGGCCGTGTCAGCAG






CTTCAGCTGCACAAACGCAACTATCAAGAAGT






TCAAGGATGCACTTGCCAAGCATGGTCCGGAC






AGGTGTGGCCTCGGCCCGGCGAAGGGGCTCGA






GGAGTCGGAGCTCATGGCGTTGGCCATAAACA






GGGACCTGAACTTCACCTACACACCAAACCAA






GACCTTGTCCCAATTGCAGACGCCCTCCTGAA






GGAAGCTGCTGCACCTGGAGGTCCATGGCTGC






CATTGCCCGCAACGGCGACGCAGCTGTTCATT






CAGGGATCTGAGAATTCGCTGTTGTCATCTGG






AAGATAG







Hordeum

HvACHT4-
AK371865.1
ATGGCGACGGCGCAGGCGGTGGCCAAGGGGA
52



vulgare

1

CCGTGGTCTCTCCGTGCGGCACCCGGGCCGCA



(Barley)


GGATTTGGAGCCCGGCGGCGGGGCGCCGTGGC






GGCCCGCATGTCGCCCTGCGCGCCGGCGGCGG






TGCGGATCGGCAGGAAAAGCCCGTTTCTTGGC






GCTAGGCTCACGGTCGGTCCCAGGAGATCCAA






GCTCGTTCCCCGGAATCTTGTCTCCTCACCGGT






GCAGATGAACCTTGCGTTTGCGAAATCCACCA






AGTGGTGGGAAAAGGGTCTGAAGCCCAACATG






AGGGAGATCGAGTCCGCCCAGGACCTCGTCGA






CTCGTTGGCTAACGCCGGCGACAGGCTCGTCG






TTGTTGACTTCTTCTCCCCTGGCTGCGGCGGCT






GCCGTGCCCTCCACCCAAAGATTTGCCAGTTTG






GGGAGCAGAACCCAGATGTGCTGTTCTTGCAA






GTGAACTACGAGGAACACAAGTCCATGTGCTA






CAGCCTCCATGTCCATGTGCTGCCCTTCTTCAG






GTTCTACAGGGGAGCCCAGGGCCGCCTCTGCA






GCTTCAGCTGTACTAACGCAACCATAAAGAAG






TTCAGGGATGCGCTTGCCAAGCATAATCCTGA






TAGGTGTAGCATTGGTCCAACCAGGGGCCTCG






AGGAGTCTGAGCTGCTGGCTTTGGCTGCGAAC






AAGGACCTGCAGTTCACATACACGAAGCAGCC






AGAACCAGTTCCGAGTGGTGATTCCGAGTTCA






TTGCTCCTGGGAGCCCAAGGCTTCCTCCACCTG






CAAAACCATTGGTTCGGCAGGGTTCCGGAGAG






AGGACCTTGGTCTCATCAGGAAGATGA







Hordeum

HvACHT4-
AK376663.1
ATGGCCAACGCGCTTTACGGCGGCGGCGTGGC
53



vulgare

2

GGCGCCGTGCGGTGACTTGGGCGCCGCGGCCG



(Barley)


CGCTCGCGGAGTCTTTGCCGATGGGCGGCGGG






TACCGCGCGAGGAGCTCCTTCCCCGCCGGGAG






GGTGGCGCTGGCGGAGAGGCCCCTGCCCCGGA






GCCTCCAGGTGGCGGCCGCTGCTGGACAGATG






AACGGGAACCTGACGATTGGCAAGGCCATGAG






GTGGTGGGAGAAGGGGACGCAGCCCAACATG






AGGGAGGTCGAGTCCGCGCAAGACCTCGCCGA






CTCCCTGCTCAACGCCGGCGACAAGCTCGTCG






TCGTCGACTTCTTCTCCCCCGGCTGCGGTGGCT






GCCGCGCGCTCCACCCCAAGATTGCGCAGTTC






GCCGAGCGTAATCCGGACGTGCTGTTCCTGCA






AGTCAACTACGAGAAGCACAAGTCCATGTGCT






ACAGCCTCCATGTCCATGTCCTCCCTTTCTTCA






GGTTCTACAGGGGAGCTCAGGGCAGGGTCAGC






AGCTTCAGCTGCACCAACGCAACCATAAAGAA






GTTCAAGGACGCCCTCGCAAAGCACTCGCCGG






ACAGGTGCAGCCTCGGCCCGGCGCGGGGGCTT






GAGAAGGCGGAGCTCTTGGCTCTGGCTGAGAA






CAGGGACCTGGAATTCACCTACAGCGAGAAGC






CGACACTTGTGCCGATCGCAGAGGCCATCAGG






ATGGAAGCTGCCTCAATCGGAGGCCCATGGCT






GCCATTGCCTCCGGCCGCGACGCAGCCGTTTC






CTCTGGGATCCGAGAATGGCTCGCTCATCCCCT






CTGGAAGATAG







Triticum

TaACHT4
AK335384.1
ATGGCCAGCGCGCTATGCGGCGGCGGCAGCGG
54



aestivum



CAGCGTGGCGGCGCCGTGCGGGGACTTGGGCG



(Wheat)


CCGCGGCGGCGCTCGCGGAGTCTTTGCCGATG






GGCGCCGGGTACCGCGCCAAGAGCTCCTTCCC






CGCCGGGAGGGTGGCGCTGGCGGACAGGCCCC






TGCGCCGGGGCCTCCAAGTGGCGGCGGCTGCT






GGACAGATGAACGGGAACCTGACGATTGGCA






AGGCCATGAGGTGGTGGGAGAAGGTGACGCA






CCCCAATATGAGGGAGGTCGAGTCCGCGCAAG






ACCTCGCCGACTCCCTGCTCAACGCCGGCGAC






AAGCTCGTCGTCGTCGACTTCTTCTCCCCCGGC






TGCGGTGGCTGCCGCGCTCTCCACCCCAAGAT






TGCGCAGTTCGCTGAGCGGAATCCGGACGTGC






TGTTCCTGCAAGTCAACTACGAGAAGCACAAG






TCCATGTGCTACAGCCTCCATGTCCATGTCCTC






CCTTTCTTCAGGTTCTACAGGGGAGCCCAGGG






CAGGGTCAGCAGCTTCAGCTGCACAAATGCAA






CCATCAAGAAGTTCAAGGACGCCCTCGCAAAG






CACTCGCCGGACAGGTGCAGCCTCGGCCCGGC






GCGGGGGCTCGAGGAGGCGGAGCTCTTGGCTC






TGGCGGCAAACAGGGACCTGGAATTCACCTAC






AACGAGAAGCCGACGCTGGTGCCGATCGCCGA






GGCTATCCAGATGGAAGCTGCCTCCATTGGCG






GCCCATGGATGCCATTGCCCGCGGCCGCGACG






CAGCCGCTCACTCTGGGATCTGAGAATGGCTC






GCTGATCCCCTCCGGAAGATAG







Manihot

MeACHT4-
Manes.
ATGGCTGATGTTTTGAGCAATACCAATCTGGTT
55



esculenta

1
08G148200.
CTTCTTCCTTCTCTTCATCTTTTACTGGTCACC



(Cassava)

1
GAAACGAGCAGAAAAATAGCTCTTGCAGGCTA





Downloaded
AAAGGGTTCCCCCGAAAAGTGAATCGTCAGAC





From
TTTGAGATTGAAAGCGACATCGCTTGGCAGTG





Phytozome
ATTTTCATGGAAAGAGGGTTGTTCTTCAAGAC






AATCAAGGCAAACCCAAGAGAGGGATTTATCT






TCAAATGTCAATTAAGGCTCAGCATACTGGCC






TTAGACTCAAGAGTGCTCCAAAATGGTGGGAA






AAAGGATTGCAACCCAACATGAGGGAGGTGA






CCTCTGCTCAAGACTTTGTGGACTCCCTCTTGA






ACGCTGGAGATAAACTTGTCATTGTTGATTTCT






TCTCCCCTGGTTGTGGTGGCTGCAAGGCTCTCC






ATCCCAAGATATGTCAGTTTGCAGAGATGAAC






CCAGATGTGCTGTTCCTTCATGTGAATTATGAG






GAACATAAATCCATGTGTTATAGCCTCAATAT






CCATGTGCTTCCCTTCTTCAGGTTTTATCGAGG






GGCGCAAGGCCGGTTATGCAGCTTTAGCTGCA






CTAATGCTACGATAAAAAAATTCAGAGATGCA






CTGGCCAAGCACTCTCCAGACCGGTGCAGTCT






CGGGCCAACAAAAGGGCTGGAGGAGAAAGAG






CTTATTGCATTGGCTTCCAACAAAGATCTCAAC






TTCAAATATGCACAGAAACCAGATCTGCCAAC






GCCAATTCCTGCCAAGGAAGAGAGAGTGCCAG






TAGTATCCCCATCTCATCCAAATCCAGCTCTAC






CTCTACCTCTTCCTCTTCCCACAGCAAGTCCAA






AATCTGGACAAGGCTCAGAGGAGAAAACGTTG






GTCGGATCAGGGAGATGA







Manihot

MeACHT4-
Manes.
ATGGCCGCTGTTTCTAGCAACACCAATCTTGTT
56



esculenta

2
09G143500.
TCTTCTTCTTGTTCTTCATCCTTTAGTTCTTCGC



(Cassava)

1
AAAACCGCCCCGAATACCGCTCTTCCAGGCTC





Downloaded
AGAGTGTTCCCTCAGGAATTGAATCATCAGGC





From
TTTGAGATTACAAACTACGTCGCTTGGCAGTG





Phytozome
ATTTTCATGGAAAGAGGGTTGTTCTTCAAGAA






AAACCAAAATGCAAACAAGGGATTTCCGTTCA






AAGCTCAATTAAGGCTCAGCAGACTGGCCTTA






GACTCAAGAATGCTAAAAATTGGTGGGAGGAG






GAGTTGCAACCCAACATGAGGGAGGTGATCTC






TGCTCAAGATCTTGTGGACTCCCTCCTTAATGC






TGGCGATAAGCTTGTCATTGTTTATTTCTTCTC






CCCTGGCTGTGGTGGCTGTAGGGCTCTCCATCC






CAAGATATGTCAATTGGCAAAGAACAATGCAG






ATGTGCAGTTTCTTAAAGTGAACTATGAGGAG






CACAAATCCATGTGTTATAGCCTCAATGTTCAT






GTCCTTCCATTCTTCAGGTTTTACAGAGGGGCT






CAAGGCCGAGTCTGCAGCTTTAGCTGCACCAA






CGCCACGATCAAGAAATTTAAAAATGCATTGG






CCAAGCACACCCCAGACAGATCCAGCCTCGAG






CCAACAAAAGGGCTGGAGGAGAAAGAGCTCA






TTGCATTGGCTGCCAATAAAGATCTCAACTTA






ACATATGCACCAAAATCAGATAAGCCAATCCC






AGCTCCAACTAAGGAAGAGATAGTACCCGAAA






TTCCCCAATCTCTTTCTCTTGCTCTTCGTAGGA






GTATGGAGCTTGCTCAAGGCTCAGCCGAAAAG






ACCTTGGTCGCTTCAGGGAGATGA







Sorghum

SbACHT4-
gnl|SbGDB|
ATGGCGGCAGCGCAGGCGGTCGCGAAGGGGA
57



bicolor

1
Sb01g036620.1
GCGTGGTTGCGCCGTGCGGCAATCGAGCGGCG





Downloaded
CCGGGCCTCCTTGGCAGGCGGAGGGGTGCCGT





From
GGCGGCGCGGATGGCGCCGTCGGCGGTGCGGA





Phytozome
TCGGGGCCTCATGGAGGAAGACCGCGTTTACA






GGCGGGAGGCTAGCCTTGGGGTTGGGGACGAG






GAGATCCAGGCCCGCGTCCCGGAGTTCTTTCG






CGTCGCCGGCGCAGATGAACATGAACCTTGCG






ATTGGGAAATCGATGAGGTGGTGGGAGAAGG






GGTTGCAGCCCAACATGCGTGAGATCGAGTCC






GCCCAAGACCTTGTCGATTCCTTGACCAACGC






CGGCGACAAGCTCGTCATCGTCGACTTCTTCTC






CCCTGGCTGCGGCGGCTGCCGTGCTCTTCACCC






GAAGATTTGTCAATTTGCGGAGCAGAACCCAG






ATGTGCTGTTCTTGCAAGTGAACTACGAGGAG






CACAAGTCTATGTGCTACAGTCTTCATGTCCAT






GTCCTACCCTTCTTCAGATTCTACAGGGGAGCA






CAGGGACGGCTCTGCAGCTTCAGTTGTACAAA






CGCAACCATTAAGAAGTTCAAGGATGCACTTG






CGAAGCACAAGCCAGATAGATGTAGCCTTGGC






CCAACCAGGGGGCTAGAGGAATCGGAGTTTTT






AGCCTTGGCAGCAAACAAGGACCTGCAGTTCA






CCTACACCAAGGAGCCAGAACTGATTCCCAGG






GGAGATGCTCCTGGGGAGGTCATTGCTCCCGA






GCCTGCAAAGCTTCCTGCGGCCACAAAGCCTT






TGGTCAGGCTGGGGTCCGAAGAAAGGTCCTTG






GTCTCATCAGGAAGATGA







Sorghum

SbACHT4-
XM_002465792.1
ATGGCGGCGGCGCAGGCGATGGCGAAAGGGA
58



bicolor

2

GCGTGGGGCAGGGGTCTCTTGGTCGGCGGAGG






GGCGCCGAGGCGGCGCGGGTCGGAGGATCAT






GGAGGAAGAGCGCGTTCCTCGGCGGGAGGCTG






GCGGTTGGGCCCAGGAGACCGAGACCCGTGTC






CCGGATTCTAGTTACGTCGCCGGCGGTGCAGC






AGACGAACCTTTCATTTGCGAAAGCCATGAAG






TGGTGGCAGAAGGGATTGCAGCCCAACATGCG






GGCGATCCAGACCGCCCAAGACCTCGCCGATT






CCTTGACCAACGCCGGCGACGGGCTCGTCGTC






GTCGACTTCTTCTCACCCGGCTGCGCTGGCTGC






CATGCTCTCCACCCCAAGATTTGTCAGTTCGCG






GAGAGGAACCCGGATGTGCAGTTCCTGCAGGT






GAACTATGAGGAGCACAAGTCTATGTGCCACA






GCCTTCACGTTCATGTGTTCCCTTTCTTCAGGT






TCTACAGGGGAGCTCAGGGTCGGCTCTGCAGC






TTCAGCTGTACCAATGCAACTATTAAGAAGTT






CAGGGATGCACTTGCAAAGCACAGAGCTGATA






GATGCAGCCTTGGCCCTACTCGGGGACTAGAA






GAATCAGAATTGTTGGCCCTGGCTGCAAACAA






GGACCTGCAGTTCACCTACACCAAGGAGGCAG






AACTGGCTCCAAGCATGGAAGATGTCGCAGAG






GTTATGACTGCTGACCGTCCAGGGCTTCCGAC






ATCAACAATGCCATTGGCAAGGCAGGGATCTG






AGGACAGGGCCTTGGTCTCGTCAGGAAGATGA







Sorghum

SbACHT4-
gnl|SbGDB|
ATGGCTGAGGCGTTGTGCAACGGCGTCGTGGC
59



bicolor

3
Sb02g043280.2
GTCGCCGTACGGCGGCGGGGACGTGGGCGTCG





Downloaded
CCGGCCGGGCCAGGGGCGCCGCCAAGGCCGC





From
GCTCGCGGAGTCCCTGCCGGTCGGCGGGTACG





Phytozome
CCACCAAGAGCTCCTTCTCCGCCGGGAGGATG






TCGGTGTCGGACAGGAAGCCGAGGCCCCTGTC






TCGGAACCTCGAGGCCGCCGCCGCGCCTGGAC






AGATGAACCTGTCGTTTCCCAAGGCCATGCGG






TGGTGGGAGAAGGGGCTGCACCCCAACATGCG






GGAGATCGAGTCCGCGCAGGACCTCGCCGACT






CCCTCCTCAACGCCGGCGACAAGCTCGTCGTC






GTCGATTTTTTCTCCCCAGGCTGCGGCGGCTGC






CGCGCTCTCCACCCCAAGATTGCCCAGTTCGCC






GAGAAGAACCCGGACGTGCTGTTCCTGCAAGT






GAACTACGAGACGCACAAGTCCATGTGCTACA






GCCTCCACGTCCATGTCCTCCCGTTCTTCAGGT






TCTACAGGGGAGCCGAGGGACGGGTCAGCAG






CTTCAGCTGCACCAATGCAACGGTAAGAATCG






ACCACCTCTCCAACTTCAAGAACCAGCAGATG






AATGAATGA







Brassica

BnACHT4-
BnaA09g48840D
ATGGCGGAAGCAGCAATCAGCAGAACGAAT
60



napus

1

CTGATCTTCCGAGGAGCTTGCGTGAATCAAC



(Rapeseed)


ACAAGCATGTAGATGATTACTCTGTCTCATC






ACCTGTGAGTTTCGGTTTGAGAAAGAGCTTC






CCTTCTCTGAAGGTGAAGCCTTTTAATCAATT






CCAGAGCTCCCGATCATCATCATCCATCACA






GCTCAGACAACGTTGAGGATTGGGACGCCTC






AGAAATGGTGGGAGAAGGGTCTGAAAGAGA






ACATGAGAGAGATCTCTTCAGCTCAGGAGCT






TGTTGACTCTTTAACCAACGCTGGTGATAAG






CTCGTTGTGGTTGACTTCTTCTCTCCTGGCTG






TGGTGGATGCAAGGCTCTTCATCCTAAGATA






TGTCAGTTGGCAGAGCAGAACCCTGATGTGC






AGTTTCTTCAGGTGAACTACGAGGAGCACAA






GTCCATGTGTTACAGTCTCGGTGTCCACGTCC






TCCCGTTTTTCAGATTCTACCGTGGCGCTCAT






GGTCGTGTCTGCAGCTTCAGCTGCACCAATG






CTACGATCAAGAAGTTCAGAGATGCATTGGC






GAAGCATAGTCCGGATAGGTGCAGCCTTGGA






CCGACCAAAGGGCTTGAAGAGAAGGAGCTT






GTGGCACTTGCAGCCAACAAAGAACTCAACT






TTAGTTACACACCGAGGGCTGTACCAGTTGA






GGAAGAAGAAGCTCCCGTCCCCGCTTCAAAC






CCTGGTCTCCCTGTTGCTCATCCATCGATGAA






GGCCAATGATGGAAAGACATTGGTCTCCTCA






GGGAGATGA







Brassica

BnACHT4-
XM_013856519.1;
ATGGCGGAGGTAATCAGCAAAACGAGTTTGTT
61



napus

2
LOC106415750
CTTCCGAGGAGCTTGCGTGAATCACCACCACC



(Rapeseed)


ACGCAGATGACTTCTCCGTCTCGCCGGTGAGTT






TCGGTCTCAAAAAGAGTTTCTCTTCTCTCAAGC






AGAAGCCTCTTAGAAGCGACTTCTCTGGAAAA






CAGATCCTACAGACCTTCAACAGGAGCTTCCG






ATCATCATCCGTTACCGCTCAGTCAACGCTGA






GGATTGGGACAGCTCAGAAGTGGTGGGAGAA






AGGTCTGCAAGAGAACATGAGAGAGATCTCTT






CGGCGCAAGAGCTCGTCGACTCTCTCGCCGAC






GCTGGCGATAAGCTCGTCGTGGTTGACTTCTTC






TCTCCTGGCTGCGGCGGATGCAAGGCTCTGCA






TCCTAAGATGTGCCAGCTGGCGGAGCAGAGCG






CTGATGTGCAGTTTCTTCAGGTGAACTACGAG






GAGCACAAGTCCATGTGTTATAGCCTCGGTGT






CCACGTCCTCCCGTTTTTTCGGTTCTACCGTGG






CGCTCAGGGTCGCGTCTGTAGCTTTAGCTGTAC






TAATGCTACGATAAAGAAATTTAGAGACGCGT






TGGCGAAGCATAGTCCGGATAGGTGCAGCCTT






GGACCAACCAAGGGGCTTGAAGAGAAAGAGC






TTGTGGCACTTGCAGCCAATAAAGAACTCAAC






TTTAGTTACACGCCGAAGGTTGTACCTGTTGAG






AAAGAAGCAGCTATTCCCACTTCCAACCCGGC






ACTCCCTGTTCCTCATCCATCGATGAGTGGCAG






TGAGGAGAAGACATTGGTCTCTGCAGGGAGGT






GA







Brassica

BnACHT4-
XM_013817176.1;
ATGGCGGAAGCAGCAATTAGCAGAACGAATCT
62



napus

3
LOC106377028
GATCTTCAGAGGAGCTTGCGTGACTCACCACC



(Rapeseed)


ACCATGCAGATGATTACTCTGTCTCATCATCAC






CTGTGAGTTTCGGTCTGAGAAAGAGCTTCTCTT






CTCTCAAGCTGAAGCCTCCGAGACAGATCGAT






ACTCAATTCCAGACCTTCACAAGGAGCTCCCG






AGCATCATCCATCACAGCTCAGACGACGCTGA






GGATCGGGACGCCTCAGAAATGGTGGGAGAA






GGGTCTGAAAGAGAACATGAGAGAGATCTCTT






CAGCTCAGGAGCTTGTTGACTCTCTAACCAAC






GCTGGTGATAAGCTCGTTGTGGTTGACTTCTTC






TCTCCTGGCTGCGGTGGATGCAAGGCTCTTCAT






CCTAAGATATGTCAGTTGGCAGAGCAGAACCC






TGATGTGCAGTTTCTTCAGGTGAACTACGAGG






AGCACAAGTCCATGTGTTACAGTCTCGGTGTC






CACGTCCTCCCTTTCTTTCGATTCTACCGTGGC






GCTCACGGTCGTGTCTGCAGCTTCAGCTGCAC






AAATGCTACGATCAAGAAGTTCAGAGATGCAT






TGGCGAAGCATAGTCCAGATAGGTGCAGCCTC






GGACCGACCAAAGGGCTTGAAGAGAAGGAGC






TTGTGGCGCTTGCGGCCAACAAAGAACTCAAC






TTTAGTTACACACCGAGGGCTGTACCAGTTGA






GGAAGAAGAAGCTCCCGTCCCCGCTTCAAAAC






CAGGTCTTGCTGTTCCTCATCCATCGATGAGCG






CCAATGATGAGAAGACATTGGTCTCCGCAGGG






AGATGA







Brassica

BnACHT4-
XM_013861022.1;
ATGGCGGAAGCAGCAATCAGCAGAACGAATCT
63



napus

4
LOC106420177
GATCTTCCGAGGAGCTTGCGTGAATCAACACA



(Rapeseed)


AGCATGTAGATGATTACTCTGTCTCATCACCTG






TGAGTTTCGGTCTGAGAAAGAGCTTCCCTTCTC






TGAAGGTGAAGCCTTTTAATCAATTCCAGAGC






TCCCGATCATCATCATCCATCACAGCTCAGAC






AGCGTTGAGGATTGGGACGCCTCAGAGATGGT






GGGAGAAGGGTTTGAAAGAGAACATGAGAGA






GATCTCTTCAGCTCAGGAGCTCGTTGACTCTCT






AACCAACGCTGGTGATAAGCTCGTTGTGGTTG






ACTTCTTTTCTCCTGGCTGTGGTGGATGCAAGG






CTCTTCATCCTAAGATATGTCAGTTGGCAGAGC






AGAACCCTGATGTGCAGTTTCTTCAGGTGAAC






TACGAGGAGCACAAGTCCATGTGTTACAGTCT






CGGTGTCCACGTCCTTCCGTTTTTCAGATTCTA






CCGTGGCGCTCATGGTCGTGTCTGCAGCTTCAG






CTGCACCAATGCTACGATAAAGAAGTTCAGAG






ATGCATTGGCGAAGCATACTCCGGATAGGTGC






AGCCTTGGACCGACCAAAGGGCTTGAAGAGAA






GGAGCTTGTGGCACTTGCAGCCAACAAAGAAC






TCAACTTTAGTTACACACCGAAGGATGTACCA






GTTGAGGAAGAGGCAGCTCCCGTCCCCGTTTC






AAACCCTGGTCTCCCTGTTGCTCATCCATCGAT






GAAGGCCAATGATGGAAAGACATTGGTCTCCT






CAGGGAGATGA







Brassica

BnACHT4-
XM_013785617.1;
ATGGCGGAGGTAATCAGCAAAACGAGTTTGTT
64



napus

5
LOC106346322
CTTCGGAGGAGGAGCTTGCGTGAATCACCACC



(Rapeseed)


ACCACCACGTAGATGACTTGTCTGTCTCACCG






GTGAGTTTCGGTTTCAAAAAGAGTTTCTCTTCT






TCTCTCAAGCAGAAGCCTCTTAGAAGCGACTT






CTCTGGAAAACAGATCCTAGAGACCTTCAACA






GGAGCTTCCGATCATCATCCGTCACCGCTCAGT






CGACGCTGAGGATTGGGACAGCTCACAAGTGG






TGGGAGAAAGGCTCTCAAGAGAACATGAGAG






AGATCTCTTCGGCGCAAGACCTCGTCGACTCTC






TCGCCGACGCTGGCGATAAGCTCGTCGTGGTT






GACTTCTTCTCCCCTGGCTGCGGGGGATGCAA






GGCTCTGCATCCTAAGATGTGCCAGCTGGCGG






AGCAGAGCCCTGATGTGCAGTTTCTTCAGGTG






AATTACGAGGAGCACAAGTCCATGTGTTACAG






TCTCGGTGTCCATGTCCTTCCCTTTTTTCGATTT






TATCGAGGCGCTCAGGGTCGTGTCTGTAGCTTT






AGCTGTACCAATGCTACGATAAAGAAATTTAG






AGACGCGTTGGCGAAGCATAGTCCGGATAGGT






GCAGCCTTGGACCAACCAAGGGGCTTGAAGAG






AAAGAGCTTGTGGCGCTTGCAGCTAATAAAGA






ACTAAAGTTTAGTTACACGCCGAAGGTTGTAC






CTGTTGAGAAAGAGGTTGCCATCCCCACTTCA






AACCCTGGTCTCCCTGTTCCTCATCCATCGACG






ATGAGCGGCAGTGAGGAGAAGACGTTGGTCTC






TGCAGGGAGGTGA







Ricinus

RcACHT4
XM_002525415.2;
ATGGCTGATGTTTTGAGCAAGACCAATCTTGTT
65



communis


LOC8276541
CCTTCGTCTTGTTGTAATGGTTACAAGAACCAG



(Castor)


AAGAAAGATGGTGCCTTCGTTCTAAAAAGAAG






TTGCAGTCTTAAGGTGTCATCTAGGAAATTCA






ATCCTCAGGCTTTCGGATCACAGAAGATATCA






CTTATTTCTGATTTTTATGGCAAGAGGGTTATT






GTTCAAGAAAAACAACTCAAGAGAGGGAATTT






TCATCAATTTTCAATTAAGGCTCAGACTGGACT






GAGACTCAAGAATGCTCCAAAATGGTGGGAAA






AGGGGTTGCAACCAAACATGAAGGAGATCACC






TCTGCACAAGACCTTGTGGACTCCCTTATGAAT






GCTGGGGACAAACTTGTAATTGTTGATTTCTTC






TCCCCTGGCTGTGGTGGCTGCAAAGCTCTCCAT






CCAAAGATATGTCAATTTGCGGAGATGAACCC






TGATGTCCAGTTTCTTCAGGTGAATTATGAGGA






ACATAAATCCATGTGTTATAGCCTCAATGTAC






ACGTACTGCCATTCTTTAGATTTTACCGAGGGG






CTCAAGGCCGAGTATGCAGCTTTAGCTGTACT






AATGCCACGATTAAGAAATTTAAAGATGCATT






AGCCAAGCACACCCCAGACCGATGCAGCCTCG






GGCCAACCAAAGGGCTGGAGGAGAAAGAGCT






TATTGCGTTGGCTTCTAACAAAGATCTCAACTT






TACATGCACACCAAAACCAGTTCAACCAACTG






CTCCTGCTCAGGAAGAGATAATACCAGCAGCA






CTCACCCCAGCTCATGTGAATCAAACCCTACCT






CTTCCTATTCCTCTCTCTACAACAAGCCTGATG






TCTGCCCAAGACTTGGGGGAGAAAACCTTGGT






TACTTCTGGGAGATAG







Phaseolus

PvACHT4-
XM_007161898.1;
ATGGCTGAAGTTTTTACCAAGGCGAGTTTCGTT
66



vulgaris

1
PHAVU_
TCTTCTTTGCTTGGTAGTAGTCAACGCCACCAT



(Bean)

001G112200g
CGAAGGGTGTCGACGGTTCCTGATACTTGTAC






CTTTGTTTCTGGCGTCGGAGGGTCTCCTTCTCT






CAAGTTAAAGTCTCCGATTCTCAGATCTTGGTC






CCCTTCTTCTGAGTTTCAGGGTAAACAGCTTCT






CTTTCGTGTAAATAGAGGAAAGCCCAACAGGG






TCAGTTCGCGGTTGAGAGCGTCAACTGCTGCT






CAGATGACCCTTAGAATAGGGAAAGTTCAAAA






ATGGTGGGAAAAGGGGCTTCAACCCAACATGA






AAGAGGTGACTTCGGCCCAAGACCTTGTGGAA






TCACTGTTAAACGCAGGGGACAAGTTGGTGGT






GGTTGATTTCTTCTCTCCTGGTTGTGGTGGCTG






CAAAGCCCTTCACCCTAAGATATGTCAACTGG






CAGAGATGAATCCTGATGTTCAATTCCTTCAG






GTGAACTATGAGGAGCATAAGTCCATGTGTTA






TAGCCTCAATGTCCATGTTCTACCCTTCTTCCG






CTTCTATAGAGGTGCTCATGGTCGATTATGTAG






CTTTAGCTGCACCAATGCCACGATCAAGAAGT






TTAGAGACGCATTGGCCAAACACTCCCCAGAT






AGATGCAGCTTGGGCCCAACCAAAGGGTTAGA






GGAGAAAGAGCTCCTAGCTCTTGCTGCCAACA






AAGATCTTTCCTTTACCTTGCCAAAACCTTTAC






AACCTGAACACGCAAATGAAGGGTTGGCAACT






GCTCCTGCTCCTGTTCCTAGTTCAGAATCTCTT






CCTTTACCTTCACTGACCCTCAATTCTGAGGTC






TCCCAAGAGAGAACCTTGACCACTGCTGGGAG






ATGA







Phaseolus

PvACH
XM_007161862.1;
ATGGCTGAGGTTTTGACCGAGGCAAGTTTGGT
67



vulgaris

T4-2
PHAVU_
TTCTTCGTGGCATGGTACTACTCAACGCCACCA



(Bean)

001G109200g
TCGAAGAGTATCGACAGTTCCCAATTCTTCTAG






CTTCGTTTCTGGCGTTGGAAGGTTCCCTTCTCT






CAAGTTAAAGTCTCAGATTCTCAGATCCCTCTC






CTCTTCTTCTGAGTTTCAGGGTAAAAAGCTTCT






CTTTCATGTAAATAGAGGACTAGCCAACAGAA






TCAGTTCGCGGTTGGGAGCTTCAACTGCAGCG






CAGATGACCCTTAGAATAGGGAAAGGTCAGAA






ATGGTGGGAAAAGGGGCTTCAACCCAACATGA






ATGAGGTGACTTCCGCCCAAGATCTTGTAGAA






TCACTGTTAAACGCAGGGGACAAGTTAGTGGT






GGTTGATTTCTTCTCTCCTGGTTGTGGTGGCTG






CAAAGCCCTTCACCCTAAGATATGTCAACTGG






CAGAGATGAATCCTGATGTTCAATTCCTTCAG






GTGAACTATGAGGAACATAAGTCCATGTGTTA






TAGCCTCAATGTCCATGTTCTTCCCTTCTTCCG






CTTCTATAGAGGTGCTCATGGTCGATTATGTAG






CTTTAGCTGCACCAATGCCACGATCAAGAAGT






TTAAAGATGCATTGGCCAAACACTCCCCAGAT






AGATGCAGCTTGGGCCCAACCAAAGGGTTAGA






GGAAAAAGAGCTCCTAGCTCTTGCTGCCAACA






AAGATCTTTCGTTCATCTACGCACCAAATCCCT






TACAACCTGAACATGAAAATGAAGAGTTGGCT






ACTGCTCCCGCTCCTGTTCCTAGTTCAGAGTCT






CTTCCTTTGTGTCACCTCATTTCTGAGGTCTCC






AAAGAGAAAACCTTGATCACTGCTGGGAGATG






A







Gossypium

GhACHT4-
NM_001326831.1;
ATGGCTGAAGTTTTGGGGAAGGGAAATCTGTT
68



histrum

1
LOC107894997
TACGACTTGTAACTATAGTCAGACGAAGAATC



(Cotton)


TAGAAGGTGGAACTTGTTTGGTTCCTAAGAAA






ATTTCTGGGTTTTCTTTAGAAAGGAACGGTTTT






TCTTCTTTAAAGGTTAAATCTCAGGCTTTAAGA






AGTGATTTTAATGGGCAAAGAATGGTTTTTTTG






GAGAAGAAAAGTATGAACAGGCGAAGGTTTT






GTCAAGTTCCCATCAAAGCACAGATGCAAAGT






GGTCTTATTGGTCGAATTCAGAAATGGTGGGA






GAAAGGGCTTCAACCAAATATGAAAGAAGTTG






CATCTGCACAAGACCTAGTAGACTCTCTTCTGA






ATGCTGGTGATAAGCTTGTTGTGGTAGATTTCT






TCTCCCCTGGTTGTGGTGGTTGCAAGGCTCTTC






ATCCCAAGATTTGCCAATTTGCAGAGATGAAT






CCAGATGTGCAGTTTCTTCAGGTTAATTACGAG






GAGCACAAGTCAATGTGCTATAGCCTTAATGT






CCATGTGCTGCCTTTCTTCCGGTTCTATCGAGG






TGCGCAGGGGCGTGTATGCAGCTTTAGTTGTA






CCAATGCCACGATCAAAAAATTCAGAGATGCA






TTAGCCAAACACACACCTGATCGGTGTAGCCT






CAGCACGACAAAAGGGCTCGAGGAGAAGGAG






CTTTTGGCATTATCTGCGAACAAAGACCTTTCC






TTCAACTACACACCAATTCCCACACATGGAGA






GATTCTTATATGGAAACAAGTTCCATCTGATTC






AACGAGAAAGCTCCCGCTTTCAGTCCCGACAA






CATCCGCAAAACAAAGGGACAGTGAGGAGAA






AACCTTGGTTGGTGTCGGAAGATGA







Gossypium

GhACHT4-
XM_016898050.1;
ATGGCTGAAGTTTTGGGGAAGTCAAATCTGTT
69



histrum

2
LOC107961887
TACAGCTTGTAACTATAGTCAGAAGAAGCATC



(Cotton)


AAGAAGGTGGCGTTCCTTTGTTTTCCAGGAGA






ATCTCTGTGTTTTGTTTGAGAAAGAATAGTTTT






CCTTCTTTGAGGTTGGAACCTCAAGCTTTGAGG






AGTGGTTTTAATGGTCAAAGAGTGGTTTTTTTA






GAGAAAAGAAGTCTAAATGAGAGAAGGTTCT






GTCGAGTTCCCATTAAAGCACAGATGCAAACT






GGGCTTATTGGTAAAACTCAAAAGTGGTGGGA






GAAGGGGAATCAACCAAATATGAAAGAAGTG






ACATCTGCACAAGACCTGGTGGACTCACTTTT






GAATGCTGGGGATAAACTTGTTATAGTGGATT






TCTTCTCTCCTGGTTGTGGTGGCTGCAAGGCTC






TTCATCCCAAGATTTGCCAATTGGCAGAGATG






AATCCGGATGTGCAGTTCCTTAAGGTGAACTA






TGAGGAGCATAAATCCATGTGTTATAGCCTTA






ATGTACATGTGTTGCCTTTCTTTAGGTTCTATA






GAGGAGCTCAGGGTCGTCTATGCAGCTTTAGC






TGCACCAATGCCACGATCAAAAAATTCAAAGA






TGCATTGGCCAAGCACTCACCAGACCGATGCA






GCCTTGGGCCGACAAAAGGTCTCGAGGAGAAG






GAGCTTTTGGCATTAGCTGCCAACAAAGACCT






TTCCTTCAACTACACACCGAAACCAGTTCATCC






TGCACCGGAAGAAATTCCGGTGCTGAAAGAAG






TTCCATCCGGTTCATCCTTCAAGCTAAAAGAA






AGCGAGGAGAAGACCTTGATTGGTGTGGGGAG






ATGA







Gossypium

GhACHT4-
XM_016817346.1;
ATGGCTGAAGTTTTGGGGAAGTCAAATCTGTT
70



histrum

3
LOC107892305
TACAGCTTGTAACTGTAGTCAGAAGAAGAATC



(Cotton)


AAGAAGGTGGCGTTCCTTTGTTTTCTAGGAGA






ATCTCTGCGTTTTGTTTGAGAAAGAATAGTTTT






CCTTCTTTGAAGTTGGAACCTCAAGCTTTGAGG






AGTGGTTTTAATGGTCAAAGAGTGGTTGTTTTA






GAGAAAAGAAGTCTAAATGAGAGAAGGTTCT






GTCGAGTTCCCATTAAAGCACAGATGCAAACA






GGGCTTATTGGTAAAACCCAAAAGTGGTGGGA






GAAGGGGAATCAACCAAATATGAAAGAAGTG






ACATCTGCACAAGACCTGGTGGACTCACTTTT






GAATGCTGGGGATAAACTTGTTATAGTGGATT






TTTTCTCTCCTGGTTGTGGTGGCTGCAAGGCTC






TTCATCCCAAGATTTGCCAATTGGCAGAGATG






AATCCGGATGTGCAGTTCCTTAAGCTGAACTA






TGAGGAGCATAAATCCATGTGTTATAGCCTTA






ATGTACATGTGTTGCCTTTCTTTAGGTTCTATA






GAGGAGCTCAGGGTCGTTTATGCAGCTTTAGC






TGCACCAATGCCACGATCAAAAAATTCAAAGA






TGCATTGGCCAAGCACTCACCAGACCGATGCA






GCCTTGGGCCGACAAAAGGTCTCGAGGAGAAG






GAGCTTTTGGCATTAGCTGCCAACAAAGACCT






TTCCTTCAACTACACACCGAAACCAGTTCATCC






TGCACCAGAAGAAATGCCGGTGCTGGAAGAA






GTTCCATCCGGTTCATCCTTCAGGCCAAAAGA






AAGCGAGGAGAAGACCTTGGTTGGTGTGGGGA






GATGA







Glycine

GmACHT4-
XM_003548715.3;
ATGGCGGAGGTTTTAACCAAGGCGAGTTTGGT
71



max

1
LOC100816892
TTCATCTTCTTGGCATGGGGTTAGTCAACGGCA



(Soybean)


TCATCATCGAAGGGTTTCAACGGTTCTTTCAAA






TAATACATGTAGCTTCCGTTCCGGCGTGGGAA






AGTTCTCTTCTTTGAAGATGAATTCTCAGGTTC






TCAGATCTTGGTCCTCTTCTTCTGAGTTTCAGG






GTAAAAAGCTTGTCTTTCATGTAAATAGAGGA






TTACCCAATAGGGTCAATTCGCGGTTGAGAGC






TTCTACTGGGACTCAGATGAACCTTAGACTAG






GGAAAGTTCAGAAATGGTGGGAAAAGGGGCT






TCAACCCAACATGAAAGAGGTGACTTCAGCAC






AAGACTTTGTGGATTCACTGTTAAACGCAGGG






GACAAGTTGGTGGTGGTTGATTTCTTCTCTCCT






GGTTGTGGTGGCTGCAAAGCCCTTCATCCTAA






GATATGCCAATTTGCAGAGATGAATCCTGATG






TTCAGTTCCTTCAGGTGAACTATGAGGAGCAT






AAGTCCATGTGTTATAGCCTTAATGTCCATGTT






CTTCCCTTCTTCCGATTCTATAGAGGCGCTCAC






GGTCGATTATGTAGCTTTAGCTGCACCAATGCC






ACGATCAAGAAGTTCAAAGATGCATTAGCCAA






ACACACCCCAGACAGATGCAGCTTAGGCCCAA






CCATAGGGTTAGAGGAGAAAGAACTCGAAGCT






CTTGCTGCCAACAAAGATCTTTCCTTCACCTAC






TCACCAAAACCATTACAACCTTCACATGAAAA






CGAAGAGTTGGCAACCGAAACTGCTTCTGCTC






CGGCTCTTGGTTCAGGATCTCTTCCTTCACCTT






CAATGACCCTCAATGCTGTGGCCTCTAATGAG






AGAACCTTGACCACTTCTGGGAGATGA







Glycine

GmACHT4-
NM_001289199.1;
ATGAAGTCTCAGGTTCTCAGATCTTGGTCCTCT
72



max

2
LOC100784901
TCTTCTGAGTTTCAGGGTATAAAGCTTGTCTTT



(Soybean)


CATGTAAATAGAGGATTACCCAATAGGGTCAA






TTCGCGCTTGAGAGCTTCAACTGGGGCTCAGA






TGAGCTTTAGACTAGGGAAAGTTCAGAAATGG






TGGGAAAAGGGGCTTCAACCCAACATGAAGG






AGGTGACTTCGGCACAAGACTTTGTGGATTCA






CTGTTAAGCGCAGGGGACAAGTTGGTGGTGGT






TGATTTCTTCTCTCCCGGTTGTGGTGGCTGCAA






AGCCCTTCATCCTAAGATATGTCAATTTGCAGA






GATGAATCCTGATGTTCAGTTCCTTCAGGTGAA






CTATGAGGAGCATAAGTCCATGTGTTATAGCC






TTAATGTCCATGTTCTTCCCTTCTTCCGATTCTA






TAGAGGTGCTCATGGTCGATTATGTAGCTTTAG






CTGCACCAATGCCACGATCAAGAAGTTTAAAG






ATGCATTGGCCAAACACACCCCAGATAGATGC






AGCTTGGGCCCAACCAAAGGGTTAGAAGAGA






AAGAGCTTCTAGCTCTTGCTGCCAACAAAGAT






CTTTCCTTCACCAACTCACCAGAACCTTTACAA






CCTGCACATGCAGATGAAGAGTTGGGAACCGA






ACCTGCTCCTGCTCCTGGTTCAAAATCTCTTCC






TTCACCTTCAATGATTCTCAATTCTGAGGTCTC






TAAAAAGAGAACCTTAACCACTTCAGGGAGAT






GA







Beta

BvACHT4
XM_010674105.1;
ATGGCGGATGTTCTTACCAAATCCAGTGTTTTT
73



vulgaris


LOC104888985
TCTCCAACAATTTCTCATCATCATAGTGGAAGT



(Beet)


AAAAATTTTCCAATTAAATGTTCAGTTGCAGTG






AGTAATCGAGGGAGATTAGTTGGAATTTCTTC






GTTGAGGAGTAGTTTTGGTGGTGTAAGAATTG






CGATCGATAAAAATACCAGTTTTGGGTCAAAA






AGGAGGAATTACCAATCAATTGATGCTAAGAT






GGGTCTGAGCATCGGCAAAGCACAGAAATGGT






GGGAGAAAGGACTCCAGCCAAATATGAGAGA






GATAACTTCTGCGGAAGACCTAGTCGATTCTTT






ACTAACAGCAGGAGATACATTAGTTGTCGTTG






ATTTTTTCTCTCCTGGATGTGGAGGCTGCAGAG






CTCTTCATCCTAAGTTGTGTCAATTGGCAGAGA






TGAACCCTGATGTCCAGTTTCTTCAGATTAACT






ACGAAGAACATAAATCAATGTGTTACAGTCTT






AATGTTCATGTTCTTCCCTTCTTTCGGTTTTACA






GAGGGGCTGAAGGCCGGGTTTCCAGCTTCAGC






TGTACAAATGCAACGATTAAGAAATTCAAGGA






TGCTTTGGCGAAGCATAACCCAGCAAGGTGTA






GCCTTGGGCCAACAAAGGGCCTAGAAGAGAA






GGAGCTTCTTGCTCTTGCTGCCAACAAAGACCT






TTCATTTACCTATACACCAAAGCCTGTGGAAG






CGGAACCCGTACCCGCACCTGCACTTGAAGAA






GTCTCTGTTAAGGCTGACGAACAAGTCTTAGC






ACAAGAATCTCTCCCTTCTTTCAACAGGAAGC






CTCTTAGCTCACAACCATCAACCGTGAGTGAA






GAGAAAACTCTAGCTACTGCTGCGAGATGA







Musa

MaACHT4-
XM_009418063.1;
ATGGCGGAAACTTTGGCTCAGAGGACCCTCCT
74



acuminate

1
LOC103996979
TTTGCCTGGCGGGCATCTTTCTTTGCCGCCGTT



(Banana)


TTGCGGGATGCGGAGCCGCCCTTCTCTTGCGG






CGTTCACTCTCTTTTCACGTACCAAGGTTGAGC






CCTTGAGGTCTTCTTCTTGTGATAGCAAGTTCC






ATGGGAGGAGACTGGTCGTTGGGGCGCGGAG






AGGGAGGCCCTCGAGGGCACGCCTCGGTTCTG






GCTCTGAACAGATGGTTCTGTCGTTCAAGAAG






GCTATAAAATGGTGGCAGAAGGGGCTTCAACC






CAATATGGTGGAGATCGAGTCGGCTGAGCATC






TCGTCGACTCCTTATTGAACGCCGGCGACAAG






CTTGTTATTGTGGATTTCTTCTCCCCAGGGTGT






GGAGGCTGCAGAGCGCTTCATCCAAAGATTTG






CCAGTTCGCCGAATCGAATCAAAATGTTTTGTT






TCTCCAAATAAATTATGAGCAACATAAGTCGA






TGTGCTACAGCTTGGGTGTCCATGTTCTCCCCT






TCTTTAGGTTCTATCGCGGAGCACACGGGCGC






CTGTGCAGCTTCAGCTGCACCAATGCAACTATT






AAGAAATTTAAAGATGCTTTGGCCAAGCACAT






CACTGACAGATGCAGCCTTGGGCCAGCTAGGG






GGCTGGAGGAGTCAGAGCTCTTGGCTTTGGCC






GCAAACAAAGATCTCTCATTTAACTACACAAG






CAAGCCAGTTCCTGTGCCTGAAGAGATTCCAG






AGAGAATTCCAACAAGCCCGAAACTCCCTCTT






CATGCTGTCCGTAGACCTGCCCAGGAATCCGA






GGACAAGGCCCTCGCCGCAGCTGGGAGATGA







Musa

MaACHT4-
XM_009408568.1;
ATGGCGGATGCTTTGGCTCAAATGACGCTCCTT
75



acuminate

2
LOC103989652
TCGCCCCATGGCCACCGTTCTTTGTCGCGCTCT



(Banana)


TCCGACCGGAGAAACCGCCTTGTTTGTGCGTC






AAAGGATGATCTCTTGAGGTCTTCGTCTTCTTG






TAATAGCCAGTTCCATGGGAGAAGGCTGGTTA






TTGGCGCACAGAGAGAGAGGCCGTTGAGAGG






CAACCGAGGTTCTAGCTCTGTGCAGATGACTC






TGTCCTTTAAGAAGGCTTCGAAATGGTGGGAG






AAGGGGCTTCATCCCAATATGAAGGACATCAA






GTCGGCTGAGGATCTCGTCGACTCCTTGTCGA






ACGCGGGCGACAAGCTCGTCATCGTGGATTTC






TTCTCCCCAGGATGTGCAGGCTGCAGAGCCCT






CCACCCAAAGATCTGCCAATTCGCAGAGTTGA






ATCCAGACGTTCAATTTCTCCAACTAAACCAC






GAGGAACACAAGTCCATGTGCTACAGCTTGAA






TGTCCATGTTCTCCCCTTCTTTAGGTTCTATCGC






GGAGCGCACGGTCGCCTGTGCAGCTTCAGCTG






CACCAATGCAACCATCAAGAAATTTAAGGATG






CTTTGGCGAAGCACATCACCGAAAGATGCAGT






CTTGGGCCAGCCAAGGGGCTGGAGGAGACGG






AGCTCCTTGCCTTGGCTGCAAACAAGGATCTCT






CCTTCACCTACACAAGAACGCCTGTTCCCGTAC






CTGATGAGCTTGCAGAGAAAGCTCCATTTAAC






CCAAACCTACCTGTGCATGCTGCTGCTAGACTC






ACCCTGGAATCTGAGGACAAGGCTTTTGCCGC






AGCCGGTAGATGA







Capsicum

CaACHT4
XM_016697343.1;
ATGGCAAAATTGATGAACAAAGGTTTTGTGTT
76



annum


LOC107852277
TCCTTCATCTTCTGATTGTGGTCATCATCGCCC



(Sweet


TCATGGGATTTCTTCTTTCCCCAATAAATCGGT



and Chili


CAATCTTTCTTGTCTTCCATCTACTTGTCTGCTA



Peppers)


AGAAGCTATTTTTATGGTCGTAGATTGGTCATA






AATGAAGCCCTACCCAAAAGAAATGCCCACGT






TGCAATCACTGTCCAGATGAGTATGGGAATCA






GGAAAGTACAGAAATGGTGGGAGAAAGGGGT






TCAACCTAACATGAAAGAAGTGAACAGTGCTC






AAGGCCTTGTTGACTCTCTTTTGAGTGCAGGAG






ATAAATTAGTAGTTGTTGATTTCTTTTCCCCTG






GCTGCGGTGGCTGCAAAGCCCTTCACCCTAAG






TTGTGTCAGCTGGCAGAGATGAATCCAGATGT






GCAGTTTTTACAGGTGAACTATGAGGAACACA






AGTCCATGTGTTACTCTCTTAACGTGCACCTTC






TCCCATTTTTCCGTTTCTATAGAGGAGCTGAAG






GTCGTGTTTGCAGCTTTAGCTGTACCAATGCCA






CGATAAAAAAATTTAAAGATGCATTGGCAAAG






TATGGTACAGATCGTTGCACCTTTGGACCACC






GAAAGGGCTTGAGGAGAAAGAGCTACTTGCAT






TGGCAGCTAACAAGGAACTCTCGTTTAATTAC






ATTCCAAAAACAGAAGAAGAACCTGTCCTTGT






TGCCTCACAAGAGGAAGTTGAGGACAGAACTC






CAAATAAAGAGTCCCCTCTACCACTTCCTCTTC






CTCTACCCATTAGCTCAACTAGCTCACTGAAGC






CCAAACAGGATACAGAGAAAGAAGCGTATGC






TACTTCTGGTAGATAG







Cicer

CaACHT4
XM_004493084.2;
ATGGCTGAAATTTTGACCAAGACAAGTTTGGT
77



arietinum


LOC101501672
TTCATCTTGGCATGGGAACAGAAAACAGCAAC



(Chick


ATCGAAGGTTGTCCATGGTTCCCAATAAGACT



pea)


TGTAGCTTCAACACTTGCGTGGGAAGTTTCCCA






TCTTTGAAGCTAAAATCTCAGTTTCTTAGATCT






TCCTCTTTTTCATCTGAGTTTTATGGGAAAAAT






ACTATCTTTCGTGTAAATAGATCAATACCCAAC






AGGATTAATTCACAATTTTCAGTTTCAGCTGCG






CCTAAGATGACACTTAGAATAGGAAAAATTCA






GAAATGGTGGGAAAAGGGGCTTCAACCTAACA






TGAGAGAAGTGACTTCAGCTCAAGATCTTGTA






GATTCACTTTTAAACGCAGGGGACAAACTTGT






CATTGTTGACTTTTTCTCTCCTGGTTGTGGTGG






CTGCAGAGCCCTTCACCCTAAGATATGTCAAA






TGGCAGAGATGAATCCTGATGTTGAGTTCCTTC






AAGTGAACTATGAAGAGCATAAATCCATGTGT






TATAGCCTTAATGTTCATGTCCTTCCTTTCTTCC






GCTTCTATAGAGGCGCTCATGGTCGCTTATGCA






GCTTTAGCTGCACCAATGCCACGATCAAGAAG






TTTAAAGATGCATTGGCCAAACACACTCCAGA






TAGATGCAGCTTGGAACCAACCAAAGGGTTAG






AGGAGAAAGAGCTCATAGCTTTATCTGAAAAC






AAAGATCTTAACTTCACATACACACCAAAACC






TCTTCAACCTGTGCATACACCTGCAAATGAAG






AGTTGGCGACAACCAAAGCCTCTCCTGTTTGTT






CGGAGCCTCTTCCTTTACCTTCATTGACCTCGA






ATTCTGATGAAGTCTTGAAGGAGAGAACCCTG






ACAAGGGCTGGAAGATGA







Solanum

SlACHT4-
XM_004251955.2;
ATGACGAAATTGATGAGCAAAGGTTTTATCTT
78



lycopersicum

1
LOC101244843
TCCTTCTTCTTCTTCTGATTGTGGTGAAATTTAT



(Tomato)


GATCGTCTTCGTCTTAATCTACATGGGATCTGT






TCTTTTCCCAATAAATCGGTCAATCTTTCTTGT






CTTCCTTCGTTGAAGCTTTCTTCTTCTTGTTTGC






CAAGAACCGATTTTTATGGTCGTAGATTGGTTA






TAAATGAAGGCTTATCCAATTTCAACCGAAGA






GTTGCTGATATCACTGCTCAGATGAGTGTTGG






AATCAAGAAAGCACAGAAATGGTGGGAGAAA






GGGGTTCAACCTAACATGAAAGAAGTGAACAG






TGCACAAGAACTTGTTGACTCTCTATTGAGTGC






AGGGGATAAATTAGTTGTTGTTGATTTCTTTTC






TCCTGGCTGTGGAGGTTGTAAAGCTCTTCACCC






CAAGTTGTGTCAGCTGGCAGAGATGAATCCAG






ATGTGCAGTTTTTACAGGTGAACTATGAGGAA






CACAAGTCGATGTGTTACTCTCTTAATGTACAT






GTTCTCCCGTTTTTCCGTTTCTATAGAGGAGCT






GAAGGCCGTGTTTGCAGCTTTAGCTGTACCAA






TGCCACGATCAAAAAATTCAGAGATGCATTGG






CGAAGTATGGTACAGATCGTTGCACCATTGGG






TCACCCAAAGGGCTTGAGGAGAAAGAGCTACT






TGCATTGGCAGCTAACAAGGATCTTTCCTTTAA






TTACACTCCAAAAACAGAAGAAGAACCCATCC






TCGTTACCTCACAAAAGGAAGTTCGGGATAGA






ACTACTCCAAATATAGAGTCCCCTCTACCACTT






CCTCTTCCTCTCCCCATTACGTCAACTAGCTCA






CAGACGGCCAAACGGGATACAGAGAAAGAAG






CATATGCTACTTCTGGTAGATGA







Solanum

SlACHT4-
XM_004249259.2;
ATGGAGAAATTGTTGAATAAGGCAGTATTTCT
79



lycopersicum

2
LOC101244047
TCCATCAATTTTGAATTCTAGTGGTATTTATCA



(Tomato)


TTCTAATCAACATGCGATTTGTGTTTTTCCAGT






GAAATTCAATAGAAGATATCACAAATCAGCAG






TTGCTACTGCTCAGATGAGCATAGGTATCAAG






AGAGCTCCTAAATGGTGGGAGAAAGGACTTCA






ACCGAATATGAAAGAGGTGACGGGTGCTCAAG






ACCTCGTTGACACCCTTCTAAACGGTGGGGAT






AAACTAGTCGTTGTTGATTTCCTTTCCCCTGGC






TGTGGAGGCTGCAAAGCCCTTCATCCAAAGAT






ATGTCAGTTAGCAGAGATGAATCCGGATGTGC






AGTTTTTGCATGTGAACTATGAGGAACACAAG






TCAATGTGTTACTCGCTGAACGTACATGTTCTC






CCATTTTTCCGTTTCTATAGAGGTGCTGAAGGT






CGTCTTTGTAGCTTTAGTTGCACCAATGCCACG






ATAAAAAAATTCAAAGATGCATTGACAAAGTA






TGGTGCAGATTGTTGCAGCCTCGGACCAGTTA






AAGGGCTCGAGGAGAAAGAGCTACTTGCCCTA






GCGGCTAATAAGGACCTATCTTTTGCTTACACA






CCAAAAACAGAAGAACCAGTGCCTCTTGCCTT






AGAAGAAGTTAAGGTGATAAAAACAAGTAGA






CAATCTTCATCTCATCCCAATACATTCTCCCCA






TTACCACTTCCTCTTCCTCTAGCATCAACTTTG






CATACGGCCAAACAGGACTCAAAGAGTTAA







Elaeis

EgACHT4-
XM_010939817.1;
ATGATGGAGGTTTTGAGTCAGAGCGGTGTTAT
80



guineensis

1
LOC105057263
GTCGCCGTGCGGGCATCGTTGGGTGGTCCGTT



(African


CTTGCAAGGAGAGGAGCCCTTCTTTTGTTGGGT



oilpalm)


TTCCTCGCTCTTCCTCTCGGACGATCGAGTCTC






TGATGTCTTCTTCTCGGAATAGCGGTTTCCATG






GGAGGAGATTGAGCATTGGGGCTTGGAGAGTG






AATGCCGTGAAGGGGAATTTTAGTTCTACCCC






CGTACAGATGAGCCTCTGCGTTGGAAAGGCTT






TGAAATGGTGGGAGAAGGAGCTCCAGCCCAAC






ATGAAGGAGATCGAGTCGGCCCAGGATCTCGT






CGACTCTTTATTGAACGCAGGAGACAAGCTTG






TCATAGTAGATTTCTTCTCCCCTGGTTGTGGAG






GCTGCAAAGCCCTCCATCCAAAGATTTGCCAG






TTTGCAAAGCTGAACCCAGATGTTCTCTTCCTC






CAAGTAAACTATGAAAAGCACAAATCCATGTG






TTATAGCTTAAATGTCCATGTTCTTCCCTTTTTT






AGGTTTTACAGGGGAGCACACGGTCGTCTTTG






TAGCTTCAGCTGCACCAATGCAACTATTAAGA






AATTTAAAGATGCTTTGGCCAAGCACACCACA






GACAGATGCAGCCTGGGCCCAACAAAGGGGCT






GGAGGAATCAGAGCTCATGGCTCTGGCTGCAA






ACAAGGATCTCTCTTTCAGTTACACAAGAAAG






CCAGTCCCTGTTCCCTCGCCAGATGAGGCTGC






AGAGGAAGTTGTGCTCAGCCCAAAACTTCCGG






TTTCTTCAACTCCAAGAGTCATCCAAGATTCGG






AGGAGAAGGCTCTGGTGGCAGCTGGGAGATG






A







Elaeis

EgACHT4-
XM_010922992.1;
ATGGCGGAGGTTTTGGGCAGGAGCGGCGTGTT
81



guineensis

2
LOC105044909
CTCGCTGCGCGGGCACCGTTCCGTGGCCCCTTC



(African


TTGCCAGAAGAGGAGCCCTTCTTTTCTTGGGTT



oilpalm)


TCCTCTCTCATCCTCTCGGCCGATCGGGCCTCC






TAGGTCGTCTTCTCGGAGATTTGTTATCGGGAC






TCGGAGAGGGAGGTCCATCAAGGGAAATTCTA






GCTCTTCCCGTGTACAGATGAGCCTCGGCGTTG






GAAAGTCATTGAAGTGGTGGGAGAAGGGTGTG






CAGCCCAACATGAAGGAGATTGGATCGGCCCA






GGATCTTGTTGACTCCTTATTGAATGAAGGAG






ACAAGCTTGTTATCGTAGATTTCTTCTCCCCTG






GTTGTGGAGGCTGCAAAGCCCTCCATCCAAAG






ATTTGCCGGATTGCGGAGATGAACCCACATGT






TCTCTTCCTCCAAATAAACTATGAGAAGCACA






AGTCCATGTGTTATAGCTTGCATGTTCACGTTC






TCCCCTTTTTTAGGTTTTACCGGGGAGCTCACG






GTCGCCTTTGTAGCTTCAGCTGCACCAATGCAA






CTATTAAGAAATTTAAAGATGCATTGGCCAAA






CACACCACAGACAGATGCAGCCTTGGGCCAAC






AAAGGGGCTGGAAGAATCAGAGCTTGTGGCTC






TGGCTGCAAACAAGGATCTCTCCTTCAATTAC






ACAAGAAAACCGGTTCCTGTTCTCACACCAGA






CGAGGCTGCAGAGAAAGTTCCTCTTAGCCCAA






AACTTCCAGTGTCTTCAGCCCCAAGAGTCATC






AAAGATTCTGAGGACAAGGCCCTCGTTGCAGC






TGGGACATGA







Setaria

SiACHT4-
XM_004984459.3;
ATGGCGGCGGCGCAGGCGGTCGCGAAGGGCA
82



italic

1
LOC101779469
GCGTGGTGTCGCCGTGCGGCAGCAGGGCCGCG



(Foxtail


CCGGGGCTCCTGAGTCGGCGGAGGGGCGCCGT



millet)


GGCGACGCGGATGGCGCCGTCGGCGGTGCGGA






TCGGGGGCTCCTGGAGGAAGACCGCGTTCCTC






GGCGGTAGGCTGGCGGTCGGGCCGAGGAGATC






CAGGTCCGCGTCCCGGACCCTCGTCGCGTCGC






CGGTGCAGATGAACATGAACCTTGCGATTGGG






AAATCCATGAGGTGGTGGGAGAAGGGGCTGC






AGCCCAACATGCGGGAGATCGAGTCCGCCCAG






GATCTCGTCGATTCCTTGACCAACGCCGGCGA






CAGACTCGTCATCGTGGACTTCTTCTCCCCCGG






CTGCGGCGGTTGCCGTGCTCTTCACCCCAAGAT






TTGCCAGTTTGCGGAGCAGAACCCGGATGTGC






TGTTCTTGCAAGTGAACCATGAGGAGCACAAG






TCTATGTGCTACAGCCTCCATGTCCACGTCCTC






CCATTCTTCAGGTTCTACAGGGGAGCTCAGGG






ACGGCTCTGCAGCTTCAGTTGTACCAACGCAA






CTATCAAGAAGTTCAAGGATGCACTTGCAAAG






CACAAACCGGATAGATGTAGCATTGGCCCAAC






TAGAGGGCTGGAGGAATCAGAGTTATTAGCAT






TGGCTGCAAACAAGGACTTGCAGTTCACCTAC






ACCAAGAAGCCAGAACTGATCCCCAGCGGAG






ATGCTGCTGCTGAGGTCATTGCTCCCGAGCCTA






CAAAGCTTCCTGCGGCAACAAAGCCGTCGGTC






AAGATAGGGTCCGAGGAGAGGTCCNNTTGGTC






TCATCAGGAAGATGAGATGAATGACCTCTAG







Setaria

SiACHT4-
XM_004985594.1;
ATGGCGGCAGCGCAGGCGATGGCGAAGATGA
83



italic

2
LOC101775678
GCGTGGGGTCGCCGGCCTGCAATCGGGCTGCG



(Foxtail


GGATCCCTCTGCCGGTGGAGGGGAGCCGTGGC



millet)


GGTGCGGCTCGGAGGATCCTGGTCCTGGAGGA






AGAGCCCGTTCCTCGGCGGGAGGATGGCGGTT






GGGCCCAGGAGATCGAGGCCCGTGTCCCGGAA






TCCTGTTGCGTCGCCGGTGCAGATGAACCTTTC






ATTTGGGAAAACCATGAAGTGGTGGGAGAAG






GGATTGCAGCCCAACATGCGGGCGATCCACAC






CGCCCAAGAACTCGTCGATTCCTTGATCAACG






CCGGCGACGGGCTCGTCATAGTCGACTTCTTCT






CACCTGGCTGCGCCGGCTGCCATGCCCTCCATC






CCAAGATTTGCCAGTTTGCGGAGCGGAACCCA






GATGTGCAGTTCCTGCAAGTGAACTTTGAGGA






GCACAAGTCTATGTGCCACAGCCTTCATGTTCA






TGTGTTCCCTTTCTTCAGATTCTACAGGGGAGC






TCAGGGCCGGCTCTGCAGCTTCAGCTGTACCA






ATGCAACTATCAAGAAGTTCAAGGATGCGCTT






GCAAAGCACAAACCAGATAGATGTAGCCTTGG






CCCAATTAAGGGGCTAGAGGAATCAGAGCTAC






TGGCTTTGGCTGCAAACAGGGACCTGCAGTTC






ACCTACACCAAGGAGCAAGATCTGGCTCCGAG






CATGGAAGATGGCGCAGAGGTCATCACTCATG






ACCATCCAAGGCTTCCTGCAGCAGCAAAGCCG






CTGGTCAGGCAGGGGTCTGAGGACAGGGCTGT






GGTCTCATCGGGAAGATAA







Setaria

SiACHT4-
XM_004958667.1;
ATGGCTGAGGCTTTGTGCAACGGCGTCGTGCC
84



italic

3
LOC101759010
GTCGCCGTGCGGCGGGGACGTGGGCGTGGCCG



(Foxtail


GCCGGGTCAGTGGCGCCGCGGCGGCGCTAGCG



millet)


GAGTCCGTGCCGATCGGCGGCTACCGCACCAA






GAGCTCCTTCTCCGCAGGGAGGATGGCCATGA






CCGACAGGAAGATGAGGCCCCTGCCTCGGAGC






ATCGAGGCCGCGCCTGGACAGATGAACCTGTC






GTTTCCTAAGGCCATGCGGTGGTGGGAGAAGG






GGCTGCAGCCCAACATGCGGGAGATCGAGTCC






GCGCAAGACCTCGCCGACTCCCTGCTCAACGC






CGGCGACAAGCTCGTCGTCGTCGACTTCTTCTC






CCCTGGCTGCGGCGGCTGCCGCGCCCTGCATG






CCAAGATTGCCCAGTTTGCCGAGAAGAACCCA






GATGTGATGTTCCTGCAAGTGAACTATGAGAC






GCACAAGTCCATGTGCTACAGCCTCCATGTCC






ATGTCCTCCCTTTCTTCAGGTTCTACAGGGGAG






CCGAGGGACGGGTCAGCAGCTTCAGCTGCACA






AATGCAACTATCAAGAAGTTCAAGGACGCGCT






CGCAAAGCACGGACCTGACAGGTGCAGCCTCG






GCCCTGCACGGGGGCTGGAGGAGTCGGAGCTC






ATGGCCTTGGCTGCAAACAAGGACCTGCAATT






CACCTACGAGAAGCCGGGCCTTGTCCCACTTG






CAGAAGCCATTGCCAAGGAGGCTGCTGCACCA






GGAGGCCCGTGGTTCCCTTTGCCTGCGTCCGCG






ACGCAGTTCCTCACTCAGGGATCAGAGAATTC






ATTGCTGTCATCCGGAAGATAG







Chlamydonionas

CrACHT4
XM_001697391.1
ATGGCCAGCATACTAAATCGTGCCGGTTCAAG
85



reinhardtii



GTCGTTAGTTTTTGAGACTAAGCAGTCATTGCG



(Single-


TTCTATTCCTGGCAGCCTTTTATCGCTGCGGTC



cell


AGTGGCGCTGAAGCCATTCCGGACAACCATCT



green


GCGCGGCGGGAGCGCTGCTGACTGCACGGCGC



alga)


TCGACATCAGGCCTCGGGCGCGCCAACGGGGT






CGTTTGCCAAGCAGGGCGTAGCACTGGGGAAT






GGTGGAAGAAGGACAACCCCCCAAACATGCG






GGACATCAACTCAATTCAGGAGCTGGTTGACG






CTCTGTCGGATGCCGGAGACCGCCTCGTCATT






GTGGAGTTCTACGCCCAGTGGTGCAACGCCTG






CCGCGCGCTATTCCCCAAGATCTGCAAAATCA






TGGCTGAGAACCCGGACGTGCTCTTCCTCAAA






GTGAACTTTGACGACAACCGTGACGCGTGCCG






CACCCTGAGCGTCAAGGTGCTGCCGTACTTCC






ACTTCTACCGCGGTGCGGAGGGCCGTGTGGCG






GCCTTCAGCGCCACCATCAGCAAGTTGCAGCT






GTTCAAGGATGCCGTGGAGACCTACAGCGCCG






CCTTCTGCAGCCTGGAGCCCGCGCCGGGGCTG






GCGGAGTTCCCCGACCTCATCGCGCACCCGGA






GCTGCACCCGGAGGAGGCCGCAGAGGCGGCG






CGGCGCGCGCGGCTGGCGTCCACCGAGTCGGA






GGAGGAGTTGCATCCGCTGGCCGACACGCCGA






CTGTGGTGGGATAG







Chlorella

CvACHT4
XM_005851860.1
TGGTGGACCAAGTCTGCGCCGCCCAATGTAGT
86


(Single-

partial cds
GCACATCAAGTCTGTGCAGCACTTGGTGGACG



cell


AAATGGTGAGGGCTGAGAGGCTGGCGGGCGCT



green


GGCGAGCGGCTGGTGATCATGGATGTGTTTGC



alga)


GCCCTGGTGCGCCGCCTGCAAGGCGCTGTACC






CCAAGCTGATGAAGCTGATGGAGGAGCGCCCC






GATGTGCTGCTGCTGACGGTAAACTTTGATGA






GAACAAGACGGTGGTGAAGGCCATGGGGGTC






AAGGTCCTGCCGTACTTCATGTTCTATCGCGGC






AAGGAGGGCAAGCTGCAGGAGTTCTCGGCCAG






CAACAAGCGATTCCACCTCATCCAGGAAGCCA






TTGAGCGGCACAGCACCGATCGCTGCTTCCTG






GATAGCACCGACGAGGAGCCTGTGCTTGCAGA






GTTCCCCACTGTCGTCCCCGCCAAGGGCATCA






GCGGCAGCTTGGATGAGCCGGCCGGCCGTGCG






GCCGGCAAGGCGGTGGGCCAGCCGCAGCCCGT






GGCCTGA









In another embodiment, the nucleic acid sequence of ACHT4 is a homolog of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 is a paralog of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 is a fragment of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 is a variant of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 comprises any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 consists essentially of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 consists of any one of the sequences listed in Table 2. In another embodiment, the nucleic acid sequence of ACHT4 corresponds to any one of the sequences listed in Table 2.


In another embodiment, the nucleic acid sequence of ACHT4 is a homolog of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 is a paralog of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 is a fragment of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 is a variant of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 comprises any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 consists essentially of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 consists of any one of SEQ ID NOs: 44-86. In another embodiment, the nucleic acid sequence of ACHT4 corresponds to any one of SEQ ID NOs: 44-86.


ACHT4ΔC


In one embodiment, the present invention provides a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


The inventors have demonstrated herein that over-expression (OE) of a C-terminal truncated form of AtACHT4 (AtACHT4ΔC, in which there is a 47 amino acid deletion of the C-terminus) has a dominant negative effect in Arabidopsis plants. AtACHT4ΔC overexpression relieves the oxidation of AGPase and boosts starch synthesis. In contrast, the OE of the full length ACHT4 does not, confirming that the C-terminus of AtACHT4 is indeed required for the attenuation of starch synthesis and the authenticity of the stimulating effect of AtACHT4-C-Del.


In addition, as demonstrated in Example 2, the stimulation of starch synthesis by AtACHT4-C-Del results in a significant increase in transitory starch content in Arabidopsis leaves and stimulates growth, indicating that AtACHT4ΔC stimulates the export of photosynthates from the chloroplast which are then directed toward plant metabolism and growth.


There are homologs of the ACHT4 found in Arabidopsis (AtACHT4), and in other species including potato, cassava, maize, rice, barley, wheat, sorghum, rapeseed, castor, bean, cotton, soybean, beet, banana, peppers, chickpea, tomato, oil palm, millet, several species of algae, and other plants and algae (Tables 1-2). As described in Example 3, ACHT4 in potato (paralogs StACHT4-2 and StACHT4-1) is similarly involved in attenuating starch synthesis and growth, and overexpression of the C-terminally deleted forms of StACHT4-1 and StACHT4-2, respectively, disinhibits (i.e. promotes) starch synthesis and nearly doubles tuber yield and plant shoot growth, respectively.


Since ACHT4 is expressed in all major crop and biofuel species, including rice and corn, attenuation of ACHT4 through dominant negative C-terminal deletions or other means, represents a promising new target for increasing plant growth and yield in all major crop and biofuel species.


In one embodiment, the inactivating mutation in the ACHT4 gene as described in the methods and compositions of the present invention is a deletion mutation. In another embodiment, the inactivating mutation is an insertion mutation. In another embodiment, the inactivating mutation is a substitution mutation. In another embodiment, the inactivating mutation is a null mutation. In another embodiment, the inactivating mutation is another type of mutation known in the art. In one embodiment, the insertion, deletion or substitution mutation comprises an insertion, deletion or substitution of a single nucleic acid, while in another embodiment, it comprises an insertion, deletion or substitution of 1-5 nucleic acids, 1-10 nucleic acids, 5-20 nucleic acids, 10-50 nucleic acids, 25-100 nucleic acids, 100-500 nucleic acids, 300-400 nucleic acids, 200-500 nucleic acids, or 500 or more nucleic acids.


In one embodiment, the mutation is a dominant negative mutation. In one embodiment, a dominant negative mutation (also called an antimorphic mutation) comprises an altered gene product that acts antagonistically to, attenuates, or inhibits the function(s) of the wild-type allele.


In one embodiment, the inactivating mutation in the C-terminal portion of ACHT4 is a deletion of the entire C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of a portion of the C-terminal portion of ACHT4.


In one embodiment, a “corresponding sequence” is a nucleic acid (or amino acid) sequence from a first species for which there is a similar or equivalent sequence in a second species, which may be inferred by sequence alignment, as is well known in the art.


In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 141-207 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 150-225 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 150-300 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 75-150 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 3-75 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 3-30 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 75-225 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 3-60 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 60-120 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 120-180 nucleic acids of the ACHT4 gene. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 150-225 nucleic acids of the ACHT4 gene.


In one embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 687-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 651-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 600-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 801-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 699-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 750-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 774-825 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 687-750 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 699-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 600-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 651-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 750-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 801-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 849-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 876-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 825-903 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 699-801 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 699-849 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 486-690 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 501-690 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 549-690 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 600-690 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 651-690 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 486-651 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 486-600 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 486-549 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of nucleic acids 486-501 of any one of the sequences listed in Table 2 or a corresponding nucleic acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises insertion of a non-native sequence into a portion of the C-terminal of ACHT4 encoding the C-terminal of ACHT4, wherein said the C-terminal of ACHT4 is inactivated as a result.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises substitution of nucleic acid residues, as is known to one of skill in the art. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 consists essentially of any of the mutations listed hereinabove. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 consists of any of the mutations listed hereinabove.


In another embodiment, the present invention provides a composition comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In another embodiment, the present invention provides an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In one embodiment, “expression” as used herein refers to transcription of DNA to produce RNA. The resulting RNA may be without limitation mRNA encoding a protein, antisense RNA that is complementary to an mRNA encoding a protein, or an RNA transcript comprising a combination of sense and antisense gene regions, such as for use in RNAi technology. Expression as used herein may also refer to production of encoded protein from mRNA.


In another embodiment, the present invention provides a composition comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


Recombinant Polynucleotides


In one embodiment, “recombinant polynucleotide” refers to a polynucleotide having a genetically engineered modification introduced through manipulation via mutagenesis, restriction enzymes, and the like. Recombinant polynucleotides may comprise DNA segments obtained from different sources, or DNA segments obtained from the same source, but which have been manipulated to join DNA segments which do not naturally exist in the joined form. A recombinant polynucleotide may exist outside of the cell, for example as a PCR fragment, or integrated into a genome, such as a plant genome.


The present invention contemplates the use of polynucleotides effective for imparting an enhanced phenotype to genetically modified plants or algae expressing said polynucleotides. Exemplary polynucleotides for use in the present invention are provided herein in Table 2 (SEQ ID NO: 44 through SEQ ID NO: 86). A subset of the nucleic molecules of this invention includes fragments of the disclosed polynucleotides consisting of oligonucleotides of at least 15, preferably at least 16 or 17, more preferably at least 18 or 19, and even more preferably at least 20 or more, consecutive nucleotides. Such oligonucleotides are fragments of the larger molecules having a sequence provided herein in Table 2 (SEQ ID NO: 44 through SEQ ID NO: 86), and find use, for example as probes and primers for detection of the polynucleotides of the present invention.


Also of interest in the present invention are variants of the polynucleotides provided herein. Such variants may be naturally occurring, including homologous polynucleotides from the same or a different species, or may be non-natural variants, for example polynucleotides synthesized using chemical synthesis methods, or generated using recombinant DNA techniques. Degeneracy of the genetic code provides the possibility to substitute at least one base of the protein encoding sequence of a gene with a different base without causing the amino acid sequence of the polypeptide produced from the gene to be changed. Hence, a polynucleotide useful in the present invention may have any base sequence that has been changed from the sequences in Table 2 (SEQ ID NO: 44 to SEQ ID NO: 86) by substitution in accordance with degeneracy of the genetic code.


Homologs of the polynucleotides provided herein will generally demonstrate significant identity with the polynucleotides provided herein. A polynucleotide of the present invention is substantially identical to a reference polynucleotide if, when the sequences of the polynucleotides are optimally aligned there is about 60% nucleotide equivalence; more preferably 70%; more preferably 80% equivalence; more preferably 85% equivalence; more preferably 90%; more preferably 95%; and/or more preferably 98% or 99% equivalence over a comparison window. A comparison window is preferably at least 50-100 nucleotides, and more preferably is the entire length of the polynucleotide provided herein. Optimal alignment of sequences for aligning a comparison window may be conducted by algorithms; preferably by computerized implementations of these algorithms (such as the Wisconsin Genetics Software Package). The reference polynucleotide may be a full-length molecule or a portion of a longer molecule. Preferentially, the window of comparison for determining polynucleotide identity of protein encoding sequences is the entire coding region.


Promoters


In one embodiment, a polynucleotide of the present invention is operatively linked in a recombinant polynucleotide to a promoter functional in a plant or alga to provide for expression of the polynucleotide in the sense orientation such that a desired polypeptide is produced. Also considered are embodiments wherein a polynucleotide is operatively linked to a promoter functional in a plant to provide for expression of the polynucleotide in the antisense orientation such that a complementary copy of at least a portion of an mRNA native to the target plant host is produced. Such a transcript may contain both sense and antisense regions of a polynucleotide, for example where RNAi methods are used for gene suppression.


In one embodiment, the promoter of the expression vector of the present invention is operably linked to the polynucleotide. In one embodiment, the promoter is a constitutive promoter. In another embodiment, the promoter is an inducible promoter. In another embodiment, the promoter is a tissue-specific promoter.


In one embodiment, a promoter used in the compositions and methods of the present invention is cisgenic, i.e. is a promoter that is native to the plant.


Recombinant polynucleotides of the present invention are assembled in recombinant DNA constructs using methods known to those of ordinary skill in the art. Thus, DNA constructs used for transforming plant cells will comprise a polynucleotide one desires to introduce into a target plant. Such constructs will also typically comprise a promoter operatively linked to said polynucleotide to provide for expression in the target plant. Other construct components may include additional regulatory elements, such as 5′ or 3′ untranslated regions (such as polyadenylation sites), intron regions, and transit or signal peptides.


Numerous promoters that are active in plant cells have been described in the literature. These include promoters present in plant genomes as well as promoters from other sources, including nopaline synthase (NOS) promoter and octopine synthase (OCS) promoters carried on tumor-inducing plasmids of Agrobacterium tumefaciens, caulimovirus promoters such as the cauliflower mosaic virus or figwort mosaic virus promoters.


These and numerous other promoters that function in plant cells are known to those skilled in the art and available for use in recombinant polynucleotides of the present invention to provide for expression of desired genes in genetically modified plant cells.


Furthermore, the promoters may be altered to contain multiple “enhancer sequences” to assist in elevating gene expression. Such enhancers are known in the art. By including an enhancer sequence with such constructs, the expression of the selected protein may be enhanced. These enhancers often are found 5′ to the start of transcription in a promoter that functions in eukaryotic cells, but can often be inserted in the forward or reverse orientation 5′ or 3′ to the coding sequence. In some instances, these 5′ enhancing elements are introns. Deemed to be particularly useful as enhancers are the 5′ introns of the rice actin 1 and rice actin 2 genes. Examples of other enhancers that can be used in accordance with the invention include elements from the CaMV 35S promoter, octopine synthase genes, the maize alcohol dehydrogenase gene, the maize shrunken 1 gene and promoters from non-plant eukaryotes.


Organ-Specific Promoters


In other aspects of the invention, expression in plant seed tissues is desired to effect improvements in seed composition. In one embodiment, promoters for use for seed composition modification include promoters from seed genes such as napin, globulin 1, glutelin 1, and peroxiredoxin antioxidant.


In still other aspects of the invention, preferential expression in plant green tissues is desired. In one embodiment, promoters for expression in plant green tissues include those from genes such as SSU, aldolase and pyruvate orthophosphate dikinase (PPDK).


Recombinant constructs prepared in accordance with the invention will also in one embodiment, include a 3′ untranslated DNA region that typically contains a polyadenylation sequence following the polynucleotide coding region. Examples of useful 3′ UTRs include those from the nopaline synthase gene of Agrobacterium tumefaciens (nos), a gene encoding the small subunit of a ribulose-1,5-bisphosphate carboxylase-oxygenase (rbcS), and the T7 transcript of Agrobacterium tumefaciens.


Constructs and vectors may also include a transit peptide for targeting of a gene target to a plant organelle, particularly to a chloroplast, leucoplast or other plastid organelle.


Host Cells


In another embodiment, the present invention provides a cell comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


In one embodiment, the cell is from an Arabidopsis. In one embodiment, the cell is from Arabidopsis thaliana. In one embodiment, the cell is from a crop plant. In one embodiment, the crop plant is Solanum tuberosum (Potato). In another embodiment, the crop plant is Zea mays (Maize). In another embodiment, the crop plant is Oryza sativa (Rice). In another embodiment, the crop plant is Manihot esculenta (Cassava). In another embodiment, the crop plant is Hordeum vulgare (Barley). In another embodiment, the crop plant is Triticum aestivum (Wheat). In another embodiment, the crop plant is Sorghum bicolor. In another embodiment, the crop plant is Brassica napus (Rapeseed). In another embodiment, the crop plant is Ricinus communis (Castor). In another embodiment, the crop plant is Phaseolus vulgaris (Bean). In another embodiment, the crop plant is Gossypium histrum (Cotton). In another embodiment, the crop plant is Glycine max (Soybean). In another embodiment, the crop plant is Beta vulgaris (Beet). In another embodiment, the crop plant is Musa acuminate (Banana). In another embodiment, the crop plant is Capsicum annuum (Sweet and Chili Peppers). In another embodiment, the crop plant is Cicer arietinum (Chick pea). In another embodiment, the crop plant is Solanum lycopersicum (Tomato). In another embodiment, the crop plant is Elaeis guineensis (African oilpalm). In another embodiment, the crop plant is Setaria italic (Foxtail millet).


In another embodiment, the crop plant is a food crop. In another embodiment, the crop plant is a nutritionally enhanced food crop.


In another embodiment, the cell is a moss cell. In one embodiment, the moss is Physcomitrella patens. In another embodiment, the cell is an algae cell. In one embodiment, the algae cell is Chlamydomonas reinhardtii. In another embodiment, the algae cell is Ostreococcus tauri. In another embodiment, the algae cell is a Chlorella.


Provided herein are host cells that contain a vector, e.g., a DNA plasmid and support the replication and/or expression of the vector. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, plant, insect, amphibian, or mammalian cells. In some embodiments, host cells are monocotyledonous or dicotyledonous plant cells. In other embodiments monocotyledonous host cell is a maize host cell. In certain embodiments, the host cell utilized in the methods of the present invention is transiently transfected with the nucleic acid molecules of the invention.


In some embodiments, the host cell utilized in the methods of the present invention is a plant protoplast. Plant protoplasts are plant cells that had their entire plant cell wall enzymatically removed prior to the introduction of the molecule of interest. The complete removal of the cell wall disrupts the connection between cells producing a homogenous suspension of individualized cells which allows more uniform and large scale transfection experiments. This comprises, but is not restricted to protoplast fusion, electroporation, liposome-mediated transfection, and polyethylene glycol-mediated transfection. Protoplast preparation is therefore a very reliable and inexpensive method to produce millions of cells.


In particular embodiments, the plant protoplast is derived from one of the following genuses: Acorus, Aegilops, Allium, Amborella, Antirrhinum, Apium, Arabidopsis, Arachis, Beta, Betula, Brassica, Capsicum, Ceratopteris, Citrus, Cryptomeria, Cycas, Descurainia, Eschscholzia, Eucalyptus, Glycine, Gossypium, Hedyotis, Helianthus, Hordeum, Ipomoea, Lactuca, Linum, Liriodendron, Lotus, Lupinus, Lycopersicon, Medicago, Mesembryanthemum, Nicotiana, Nuphar, Pennisetum, Persea, Phaseolus, Physcomitrella, Picea, Pinus, Poncirus, Populus, Prunus, Robinia, Rosa, Saccharum, Schedonorus, Secale, Sesamum, Solanum, Sorghum, Stevia, Thellungiella, Theobroma, Triphysaria, Triticum, Vitis, Zea, or Zinnia. In some embodiments, the host cell is derived from a genus that is different from the genus from which the ACTH4 gene is derived.


Also provided herein are plant cells having the nucleotide sequence constructs of the invention. A further aspect of the present invention provides a method of making such a plant cell involving introduction of a vector including the construct into a plant cell. For integration of the construct into the plant genome, such introduction will be followed by recombination between the vector and the plant cell genome to introduce the sequence of nucleotides into the genome. RNA encoded by the introduced nucleic acid construct may then be transcribed in the cell and descendants thereof, including cells in plants regenerated from transformed material. A gene stably incorporated into the genome of a plant is passed from generation to generation to descendants of the plant, so such descendants should show the desired phenotype.


Optionally, germ line cells may be used in the methods described herein rather than, or in addition to, somatic cells. The term “germ line cells” refers to cells in the plant organism which can trace their eventual cell lineage to either the male or female reproductive cell of the plant. Other cells, referred to as “somatic cells” are cells which give rise to leaves, roots and vascular elements which, although important to the plant, do not directly give rise to gamete cells. Somatic cells, however, also may be used. With regard to callus and suspension cells which have somatic embryogenesis, many or most of the cells in the culture have the potential capacity to give rise to an adult plant. If the plant originates from single cells or a small number of cells from the embryogenic callus or suspension culture, the cells in the callus and suspension can therefore be referred to as germ cells. In the case of immature embryos which are prepared for treatment by the methods described herein, certain cells in the apical meristem region of the plant have been shown to produce a cell lineage which eventually gives rise to the female and male reproductive organs. With many or most species, the apical meristem is generally regarded as giving rise to the lineage that eventually will give rise to the gamete cells. An example of a non-gamete cell in an embryo would be the first leaf primordia in corn which is destined to give rise only to the first leaf and none of the reproductive structures.


In another embodiment, the present invention provides a composition comprising a cell comprising an expression vector comprising a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.


Seeds


In another embodiment, the present invention provides a seed comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


The present invention is directed to seed from a genetically modified plant, wherein the genome of said seed comprises an exogenous polynucleotide comprising a functional portion of an encoding region for a polypeptide provided herein, and wherein plants grown from said seed exhibit an enhanced phenotype as compared to the phenotype of a control plant. In one embodiment, the enhanced phenotype is increased yield. Exogenous polynucleotides of the present invention include recombinant polynucleotides providing for expression of mRNA encoding a polypeptide.


Plants and Plant Parts


In another embodiment, the present invention provides a plant, or plant part, comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


The present invention provides a plant comprising a C-terminal deleted form of an ACHT4 gene. Transformed seeds and plant parts are also encompassed. In one embodiment, the plant part is a seed. In another embodiment, the plant part is a leaf. In another embodiment, the plant part is a stem. In another embodiment, the plant part is a root. In another embodiment, the plant part is a flower. In another embodiment, the plant part is a tuber. In another embodiment, the plant part is a fruit.


In one embodiment, a plant of the present invention is any plant that comprises an ACHT4 gene or homolog.


In one embodiment, a plant of the present invention is a crop plant. In one embodiment, the crop plant is Solanum tuberosum (Potato). In another embodiment, the crop plant is Zea mays (Maize). In another embodiment, the crop plant is Oryza sativa (Rice). In another embodiment, the crop plant is Manihot esculenta (Cassava). In another embodiment, the crop plant is Hordeum vulgare (Barley). In another embodiment, the crop plant is Triticum aestivum (Wheat). In another embodiment, the crop plant is Sorghum bicolor. In another embodiment, the crop plant is Brassica napus (Rapeseed). In another embodiment, the crop plant is Ricinus communis (Castor). In another embodiment, the crop plant is Phaseolus vulgaris (Bean). In another embodiment, the crop plant is Gossypium histrum (Cotton). In another embodiment, the crop plant is Glycine max (Soybean). In another embodiment, the crop plant is Beta vulgaris (Beet). In another embodiment, the crop plant is Musa acuminate (Banana). In another embodiment, the crop plant is Capsicum annuum (Sweet and Chili Peppers). In another embodiment, the crop plant is Cicer arietinum (Chick pea). In another embodiment, the crop plant is Solanum lycopersicum (Tomato). In another embodiment, the crop plant is Elaeis guineensis (African oilpalm). In another embodiment, the crop plant is Setaria italic (Foxtail millet).


In another embodiment, the crop plant is a food crop. In another embodiment, the crop plant is a nutritionally enhanced food crop.


In another embodiment, a plant of the present invention is an Arabidopsis. In one embodiment, the Arabidopsis plant is Arabidopsis thaliana. In another embodiment, the Arabidopsis plant is Arabidopsis arenicola, Arabidopsis arenosa, Arabidopsis cebennensis, Arabidopsis croatica, Arabidopsis halleri, Arabidopsis lyrata, Arabidopsis neglecta, Arabidopsis pedemontana, or Arabidopsis suecica.


In another embodiment, a plant of the present invention is a moss. In one embodiment, the moss is a Sphagnum. In one embodiment, the Sphagnum species is cristatum or subnitens. In one embodiment, the moss is used for peat. In one embodiment, peat is used for fuel, as a horticultural soil additive, and in smoking malt in the production of Scotch whisky. In another embodiment, the moss is used for decorative purposes, such as in gardens and in the florist trade. In another embodiment, the moss is used as insulation. In another embodiment, the moss is used as an absorber of liquids. In another embodiment, moss is used for first-aid dressings, for diapers or napkins. In another embodiment, the moss is a Physcomitrella patens. In another embodiment, the moss is a Fontinalis antipyretica which, in one embodiment, is used to subdue fires.


In one embodiment, the plant is an ornamental plant.


Plants included in the invention are any plants amenable to transformation techniques, including gymnosperms and angiosperms, both monocotyledons and dicotyledons.


In addition to a plant, the present invention provides any clone of such a plant, seed, selfed or hybrid progeny and descendants, and any part of any of these, such as cuttings, seed. The invention provides any plant propagule, that is any part which may be used in reproduction or propagation, sexual or asexual, including cuttings, seed and so on. Also encompassed by the invention is a plant which is a sexually or asexually propagated off-spring, clone or descendant of such a plant, or any part or propagule of said plant, off-spring, clone or descendant. Plant extracts and derivatives are also provided.


Algae


In another embodiment, the present invention provides an alga comprising a C-terminal deleted form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.


In one embodiment, the alga is a microalga. In one embodiment, the species of the alga is selected from the following species: Ankistrodesmus, Botryococcus braunii, Chlorella, Chlorella protothecoides (autotrophic/heterothrophic), Crypthecodinium cohnii, Cyclotella, Dunaliella tertiolecta, Gracilaria, Hantzschia, Nannochloris, Nannochloropsis, Neochloris oleoabundans, Nitzschia, Phaeodactylum tricornutum, Pleurochrysis carterae (also called CCMP647), Sargassum, Scenedesmus, Schizochytrium, Stichococcus, Tetraselmis suecica, and Thalassiosira pseudonana. In another embodiment, the alga is a Chlamydomonas reinhardtii. In another embodiment, the alga is a Ostreococcus tauri.


In another embodiment, the present invention provides a polypeptide comprising a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4).


Proteins and Polypeptides


Polypeptides considered in the present invention are entire proteins or at least a sufficient portion of the entire protein to impart the relevant biological activity of the protein, e.g. enhanced plant phenotype. The term “protein” also includes molecules consisting of one or more polypeptide chains. Thus, a polypeptide useful in the present invention may constitute an entire protein having the desired biological activity, or may constitute a portion of an oligomeric protein having multiple polypeptide chains. Polypeptides useful for generation of genetically modified plants having enhanced properties include the polypeptide provided herein as SEQ ID NOs: 1-43, as well as homologs of such polypeptides.


In one embodiment, the inactivating mutation in the C-terminal portion of ACHT4 is a deletion of the entire C-terminal domain of ACHT4. In one embodiment, the C-terminal domain of ACHT4 is the sequence that is downstream of the conserved thioredoxin (Trx) domain (as depicted in FIG. 4A). In another embodiment, the inactivating mutation in the C-terminal is a deletion of a portion of the C-terminal domain of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the 47 amino acid C-terminal portion of ACHT4.


In one embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises:









(SEQ ID NO: 87)









KELNFTYTPKPVPVEKEAATPDSNPSLPVPLPSMSSNDEKTLVSAGR.






In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a homolog of SEQ ID NO: 87. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a paralog of SEQ ID NO: 87. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a fragment of SEQ ID NO: 87. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a variant of SEQ ID NO: 87. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises SEQ ID NO: 87. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 corresponds to SEQ ID NO: 87.


In another embodiment, the inactivating mutation in the C-terminal is a deletion of the 68-69 amino acid C-terminal portion of ACHT4.


In one embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises:









(SEQ ID NO: 88)


AANKDLSFNYTPKTEEAPVLVTSQKEVQDTTPPNIESPLPLPLPLPIAST





SSQTAKRDTEKEAYATSGR.






In another embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises:









(SEQ ID NO: 89)


AANKDLSFAYTPKTEEPMPVALQDAKVIKTSRTSSSCPNTFSLLPLPLPL





PLASTSHKAKQDSKSEVF.






In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a homolog any one of SEQ ID NOs: 88-89. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a paralog any one of SEQ ID NOs: 88-89. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a fragment any one of SEQ ID NOs: 88-89. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a variant any one of SEQ ID NOs: 88-89. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises any one of SEQ ID NOs: 88-89. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 corresponds to any one of SEQ ID NOs: 88-89.


In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 47-69 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 50-75 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 50-100 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 25-50 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 1-25 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 1-10 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 25-75 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 1-20 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 20-40 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 40-60 amino acids of the C-terminal portion of ACHT4. In another embodiment, the inactivating mutation in the C-terminal is a deletion of the final 50-70 amino acids of the C-terminal portion of ACHT4.


In one embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 229-275 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 250-275 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 260-275 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 240-275 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 229-240 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 229-250 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 229-260 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 229-270 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 233-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 240-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 250-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 260-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 275-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 290-301 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 233-250 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 233-275 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 233-290 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 162-230 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 175-230 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 200-230 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 220-230 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 162-175 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 162-200 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises a deletion of amino acids 162-220 of any one of the sequences listed in Table 1 or a corresponding amino acid sequence from another species.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises insertion of a non-native sequence into a portion of the C-terminal of ACHT4 encoding the C-terminal of ACHT4, wherein said the C-terminal of ACHT4 is inactivated as a result.


In another embodiment, an inactivating mutation in the C-terminal of ACHT4 comprises substitution of amino acid residues, such as a substitution of polar for non-polar residues, non-polar for polar residues, charged for uncharged residues, positively charged for negatively charged residues, or vice versa, or a combination thereof, as is known to one of skill in the art. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 consists essentially of any of the mutations listed hereinabove. In another embodiment, an inactivating mutation in the C-terminal of ACHT4 consists of any of the mutations listed hereinabove.


In one embodiment, an ACHT4 protein having an inactivating mutation in the C-terminal portion is a truncated ACHT4. In another embodiment, an ACHT4 protein having an inactivating mutation in the C-terminal portion is represented as ACHT4ΔC. In one embodiment, the truncated form of ACHT4 comprises:









(SEQ ID NO: 90)


MTEVISKTSLFLGACGNHHRVDDFSFSPVSFGGFGLKKSFSCLKLKSQKP





LRSVFYGKQIVFGDSQDESFRRSSAITAQTTLRIGTAQKWWEKGLKDNMR





EISSAQELVDSLTNAGDKLVVVDFFSPGCGGCKALHPKICQFAEMNPDVQ





FLQVNYEEHKSMCYSLGVHVLPFFRFYRGSQGRVCSFSCTNATIKKFRDA





LAKHGPDRCSLGPTKGLEEKELVALAAN.






In another embodiment, the truncated form of ACHT4 comprises:









(SEQ ID NO: 91)


MMKLMSKGFMFPSSSDCGEIYHHRPLNLPGICSFPNKSVNLSCLPSLNLS





SSCLPRTDFYGRRLVINEGVSKFNRRNSQVVDITAQMSIGIRKAQKWWEK





GVQPNMKEVNSAQELVDSLLSAGDKLVVVDFFSPGCGGCKALHPKLCQLA





EMNPDVHFLQVNYEEHKSMCYSLNVHVLPFFRFYRGAEGRVCSFSCTNAT





IKKFKDALAKYGTDRCTLGPPKGLEEKELLAL.






In another embodiment, the truncated form of ACHT4 comprises:









(SEQ ID NO: 92)


MKFNRRNHKSAAATAQMSIGIRKAPKWWEKGLQPNMKEVMGAQDLADTLL





NAGDKLVVVDFLSPGCGGCKALHPKICQLAEMNPDVQFLHVNYEEHKSMC





YSLNVHVLPFFRFYRGAEGRLCSFSCTNATIKKFKDALTKYGADCCSLEP





VKGLEEKELLAL.






In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a homolog any one of SEQ ID NOs: 90-92. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a paralog any one of SEQ ID NOs: 90-92. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a fragment any one of SEQ ID NOs: 90-92. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 is a variant any one of SEQ ID NOs: 90-92. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 comprises any one of SEQ ID NOs: 90-92. In another embodiment, the amino acid sequence of the C-terminal of ACHT4 corresponds to any one of SEQ ID NOs: 90-92.


Homologs of the polypeptides of the present invention may be identified by comparison of the amino acid sequence of the polypeptide to amino acid sequences of polypeptides from the same or different plant sources, e.g. manually or by using known homology-based search algorithms such as those commonly known and referred to as BLAST, FASTA, and Smith-Waterman. As used herein, a homolog is a peptide from the same or a different organism that performs the same biological function as the polypeptide to which it is compared. An orthologous relation between two organisms is not necessarily manifest as a one-to-one correspondence between two genes, because a gene can be duplicated or deleted after organism phylogenetic separation, such as speciation. For a given polypeptide, there may be no ortholog or more than one ortholog. Other complicating factors include alternatively spliced transcripts from the same gene, limited gene identification, redundant copies of the same gene with different sequence lengths or corrected sequence. A local sequence alignment program, e.g. BLAST, can be used to search a database of sequences to find similar sequences, and the summary Expectation value (E-value) used to measure the sequence base similarity.


A further aspect of the invention comprises functional homolog proteins which differ in one or more amino acids from those of a polypeptide provided herein as the result of one or more of the well-known conservative amino acid substitutions, e.g. valine is a conservative substitute for alanine and threonine is a conservative substitute for serine. Conservative substitutions for an amino acid within the native polypeptide sequence can be selected from other members of a class to which the naturally occurring amino acid belongs. Representative amino acids within these various classes include, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar amino acids such as glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; and (4) neutral nonpolar (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. Conserved substitutes for an amino acid within a native amino acid sequence can be selected from other members of the group to which the naturally occurring amino acid belongs. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Naturally conservative amino acids substitution groups are: valine-leucine, valine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, aspartic acid-glutamic acid, and asparagine-glutamine. A further aspect of the invention comprises polypeptides that differ in one or more amino acids from those of a described protein sequence as the result of deletion or insertion of one or more amino acids in a native sequence.


Homologs of the polypeptides provided herein will generally demonstrate significant identity with the polypeptides provided herein. In one embodiment, the present invention provides polypeptides with at least about 50% sequence identity. In one embodiment, the present invention provides polypeptides with at least about 70% sequence identity. In one embodiment, the present invention provides polypeptides with at least about 80% sequence identity with an amino acid sequence of any of the amino acid sequences listed in Table 1. In one embodiment, the present invention provides polypeptides with higher identity to such a polypeptide sequence, e.g. 90% to 99% identity. Identity of protein homologs is determined by optimally aligning the amino acid sequence of a putative protein homolog with a defined amino acid sequence and by calculating the percentage of identical and conservatively substituted amino acids over the window of comparison. Preferentially, the window of comparison for determining identity is the entire polypeptide sequence disclosed herein, e.g. the full sequence of any one of the sequences listed in Table 1.


In another embodiment, the present invention provides a composition comprising a polypeptide comprising a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4).


Methods of Use for Plants Expressing ACHT4ΔC


In another embodiment, the present invention provides a method of increasing the yield of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the yield of said plant or algae.


In another embodiment, the present invention provides a method of increasing the productivity of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the productivity of said plant or algae.


In another embodiment, the present invention provides a method of increasing the size of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the size of said plant or algae.


In another embodiment, the present invention provides a method of increasing the biomass of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby increasing the biomass of said plant or algae.


In another embodiment, the present invention provides a method of stimulating the growth of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby stimulating the growth of said plant or algae.


In another embodiment, the present invention provides a method of enhancing the starch content of a plant or algae comprising contacting a cell from said plant or algae with a polynucleotide encoding a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4), thereby enhancing the starch content of said plant or algae.


In one embodiment, the plant or algae has enhanced starch in transitory starch stores. In another embodiment, the plant or algae has enhanced starch in non-transitory starch stores. In one embodiment, the plant has enhanced starch content in one or more leaves. In another embodiment, the plant has enhanced starch content in the roots. In another embodiment, the plant has enhanced starch content in the stem. In another embodiment, the plant has enhanced starch content in one or more seeds. In another embodiment, the plant has enhanced starch content in its tubers. In another embodiment, the plant has enhanced starch content in its fruit. In another embodiment, the plant has enhanced starch content in one or more of its flowers.


In one embodiment, the present invention provides methods comprising the step of “contacting” a cell with a polynucleotide or expression vector as described herein. In one embodiment, plants are genetically modified using a microbial vector comprising ACHT4ΔC. In one embodiment, the microbial vector is Agrobacterium tumefaciens. In another embodiment, plants are genetically modified using microprojectile bombardment. In one embodiment, corn, rice, and other cereal grains are genetically modified using microprojectile bombardment. In another embodiment, plants are genetically modified using electroporation. In another embodiment, plants are genetically modified using microinjection, which in one embodiment, is direct microinjection of genetically modified DNA into anchored cells. In another embodiment, plants are genetically modified using transposons or transposable elements.


In one embodiment, the step of contacting is performed in vitro. In another embodiment, the step of contacting is performed in vivo.


In one embodiment, the ACHT4ΔC is integrated into the plant or algae chromosome. In another embodiment, the ACHT4ΔC is expressed from a vector.


Methods of Producing Genetically Modified Plants


In another embodiment, the present invention provides a method of producing a plant having an enhanced phenotype, wherein said method comprises transforming plant cells with a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation, regenerating plants from said cells, and screening said plants to identify a plant having an enhanced phenotype.


In another embodiment, the present invention provides a method of producing an algae having an enhanced phenotype, wherein said method comprises delivering a recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein to algae cells, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation, growing algae from said cells, and screening said algae to identify a plant having an enhanced phenotype.


Genetically modified plant seed provided by this invention may be grown to generate genetically modified plants having an enhanced phenotype as compared to an appropriate control line. Such seed is obtained by screening transformed plants for enhanced phenotypes resulting from the introduction of a recombinant polynucleotide into the genomic DNA of tissue from a parental line. The recombinant polynucleotide is introduced into the genome to produce genetically modified cells that can be cultured into genetically modified plants having an enhanced phenotype as compared to the parental line or other appropriate control. Such genetically modified cells are cultured into genetically modified plants that produce progeny genetically modified seed. Preferably, multiple genetically modified plants (events) comprising the recombinant polynucleotides are evaluated, e.g. from 2 to 20 or more genetically modified events, to identify a desired enhanced phenotype. Although the design of a recombinant polynucleotide is based on a rational expectation of a phenotypic modification, the present invention also contemplates that unexpected, yet desired enhanced phenotypes may be obtained.


Genetically modified plants grown from genetically modified seed as described herein will have improved phenotypes that contribute to increased yield or other increased plant value, including, for example, improved seed quality. Of particular interest are plants having altered cell division, enhanced plant growth and development, stress tolerance, including tolerance to abiotic and biotic stress, altered seed or flower development, improved light response, and enhanced carbon and/or nitrogen metabolism, transport or utilization properties.


Genetic Modification


In one embodiment, the present invention provides a cisgenic plant. In one embodiment, a cisgenic plant of the present invention is genetically modified, in one embodiment, comprises a second copy of a gene in which a portion of said gene is deleted, but contains no foreign or heterologous genes. In one embodiment, the promoters used in the expression of ACHT4ΔC are cisgenic. In one embodiment, food crops of the present invention are cisgenic.


In another embodiment, the present invention provides a transgenic plant. In one embodiment, a transgenic plant of the present invention is genetically modified with foreign or heterologous genes. In one embodiment, transgenic plants of the present invention are used for biofuel. In another embodiment, transgenic plants of the present invention are food crop plants.


Any method or delivery system may be used for the delivery and/or transformation (plant cells)/transfection (algae cells) of the nucleic acid vectors encoding ACHT4 and homologs, paralogs, etc. in the host cell, e.g., plant protoplast. The vectors may be delivered to the host cell either alone, or in combination with other agents. Transient expression systems may also be used. Homologous recombination may also be used.


Transformation may be accomplished by a wide variety of means, as is known to those of ordinary skill in the art. Such methods include, but are not limited to, Agrobacterium-mediated transformation (e.g., Komari et al., 1998, Curr. Opin. Plant Biol., 1:161), including floral dip transformation, particle bombardment mediated transformation (e.g., Finer et al., 1999, Curr. Top. Microbiol. Immunol., 240:59), protoplast electroporation (e.g., Bates, 1999, Methods Mol. Biol., 111:359), viral infection (e.g., Porta and Lomonossoff, 1996, Mol. Biotechnol. 5:209), microinjection, and liposome injection. Other exemplary delivery systems that can be used to facilitate uptake by a cell of the nucleic acid include calcium phosphate and other chemical mediators of intracellular transport, microinjection compositions, and homologous recombination compositions (e.g., for integrating a gene into a preselected location within the chromosome of the cell). Alternative methods may involve, for example, the use of liposomes, electroporation, or chemicals that increase free (or “naked”) DNA uptake, transformation using viruses or pollen and the use of microprojection. Standard molecular biology techniques are common in the art (e.g., Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York).


For the Agrobacterium tumefaciens-based plant transformation system, additional elements present on transformation constructs will, in one embodiment, include T-DNA left and right border sequences to facilitate incorporation of the recombinant polynucleotide into the plant genome.


In one embodiment, DNA is inserted randomly, i.e. at a non-specific location, in the genome of a target plant line. In another embodiment, DNA insertion is targeted in order to achieve site-specific integration, e.g. to replace an existing gene in the genome, to use an existing promoter in the plant genome, or to insert a recombinant polynucleotide at a predetermined site known to be active for gene expression. Several site specific recombination systems exist which are known to function in plants including cre-lox and FLP-FRT.


Transformation methods of this invention are preferably practiced in tissue culture on media and in a controlled environment. “Media” refers to the numerous nutrient mixtures that are used to grow cells in vitro, that is, outside of the intact living organism. Recipient cell targets include, but are not limited to, meristem cells, callus, immature embryos and gametic cells such as microspores, pollen, sperm and egg cells. It is contemplated that any cell from which a fertile plant may be regenerated is useful as a recipient cell. Callus may be initiated from tissue sources including, but not limited to, immature embryos, seedling apical meristems, microspores and the like. Cells capable of proliferating as callus are also recipient cells for genetic transformation. Practical transformation methods and materials for making genetically modified plants of this invention, e.g. various media and recipient target cells, transformation of immature embryos and subsequent regeneration of fertile genetically modified plants are known in the art.


In one embodiment, the method of transformation of algae comprises any of the methods as described hereinabove. In one embodiment, transformation of algae is accomplished using glass bead-assisted transformation, particle gun-mediated (biolistic) transformation, treatment with cellulolytic enzymes to weaken their cell walls, or homologous recombination.


Markers of Genetic Transformation


In one embodiment, DNA is introduced into only a small percentage of target cells in any one experiment. Marker genes are used to provide an efficient system for identification of those cells that are stably transformed by receiving and integrating a genetically modified DNA construct into their genomes. Preferred marker genes provide selective markers that confer resistance to a selective agent, such as an antibiotic or herbicide. Potentially transformed cells are exposed to the selective agent. In the population of surviving cells will be those cells where, generally, the resistance-conferring gene has been integrated and expressed at sufficient levels to permit cell survival. Cells may be tested further to confirm stable integration of the exogenous DNA. Useful selective marker genes include those conferring resistance to antibiotics such as kanamycin (nptII), hygromycin B (aph IV) and gentamycin (aac3 and aacC4) or resistance to herbicides such as glufosinate (bar or pat) and glyphosate (EPSPS).


Screenable markers which provide an ability to visually identify transformants can also be employed, e.g., a gene expressing a colored or fluorescent protein such as a luciferase or green fluorescent protein (GFP) or a gene expressing a beta-glucuronidase or uidA gene (GUS) for which various chromogenic substrates are known. It is also contemplated that combinations of screenable and selectable markers will be useful for identification of transformed cells.


Cells that survive exposure to the selective agent, or cells that have been scored positive in a screening assay, may be cultured in regeneration media and allowed to mature into plants. Developing plantlets can be transferred to soil less plant growth mix, and hardened off, e.g., in an environmentally controlled chamber at about 85% relative humidity, 600 ppm CO2, and 25-250 microeinsteins m−2s−1 of light, prior to transfer to a greenhouse or growth chamber for maturation. Plants are preferably matured either in a growth chamber or greenhouse. Plants are regenerated from about 6 weeks to 10 months after a transformant is identified, depending on the initial tissue. During regeneration, cells are grown to plants on solid media at about 19 to 28° C. After regenerating plants have reached the stage of shoot and root development, they may be transferred to a greenhouse for further growth and testing. Plants may be pollinated using conventional plant breeding methods known to those of skill in the art and seed produced.


Progeny may be recovered from transformed plants and tested for expression of the exogenous recombinant polynucleotide. Useful assays include, for example, “molecular biological” assays, such as Southern and Northern blotting and PCR; “biochemical” assays, such as detecting the presence of RNA, e.g. double stranded RNA, or a protein product, e.g., by immunological means (ELISAs and Western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole regenerated plant.


One of skill in the art will be able to select an appropriate vector for introducing the encoding nucleic acid sequence in a relatively intact state. Thus, any vector which will produce a host cell, e.g., plant protoplast, carrying the introduced encoding nucleic acid should be sufficient. The selection of the vector, or whether to use a vector, is typically guided by the method of transformation selected.


Plant Regeneration


Following transformation, plant cells transformed with a plant expression vector may be regenerated, e.g., from single cells, callus tissue or leaf discs according to standard plant tissue culture techniques. Almost any plant can be entirely regenerated from cells, tissues, and organs of the plant using methods that are known in the art.


The transformed plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting hybrid having expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure expression of the desired phenotypic characteristic has been achieved.


Normally, a plant cell is regenerated to obtain a whole plant from the transformation process. The term “growing” or “regeneration” as used herein means growing a whole plant from a plant cell, a group of plant cells, a plant part (including seeds), or a plant piece (e.g., from a protoplast, callus, or tissue part).


Regeneration from protoplasts varies from species to species of plants, but generally a suspension of protoplasts is first made. In certain species, embryo formation can then be induced from the protoplast suspension. The culture media will generally contain various amino acids and hormones, necessary for growth and regeneration. Examples of hormones utilized include auxins and cytokinins. Efficient regeneration will depend on the medium, on the genotype, and on the history of the culture. If these variables are controlled, regeneration is reproducible. Regeneration also occurs from plant callus, explants, organs or parts.


In vegetatively propagated crops, the mature genetically modified plants are propagated by utilizing cuttings or tissue culture techniques to produce multiple identical plants. Selection of desirable genetically modified plants is made and new varieties are obtained and propagated vegetatively for commercial use.


In seed propagated crops, mature genetically modified plants can be self-crossed to produce a homozygous inbred plant. The resulting inbred plant produces seed containing the genetic mutation. These seeds can be grown to produce plants that would produce the selected phenotype, e.g., increased lateral root growth, uptake of nutrients, overall plant growth and/or vegetative or reproductive yields.


Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, fruit, and the like are included in the invention, provided that these parts comprise cells comprising the isolated nucleic acid of the present invention. Progeny and variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced nucleic acid sequences. Genetically modified plants expressing a selectable marker can be screened for transmission of the nucleic acid of the present invention by, for example, standard immunoblot and DNA detection techniques. Genetically modified plant cells are also typically evaluated on levels of expression of the genetically modified nucleic acid. Expression at the RNA level can be determined initially to identify and quantitate expression-positive plants. Standard techniques for RNA analysis can be employed and include PCR amplification assays using oligonucleotide primers designed to amplify only the genetically modified RNA templates and solution hybridization assays using genetically modified nucleic acid-specific probes. The RNA-positive plants can then analyzed for protein expression by Western immunoblot analysis using the specifically reactive antibodies of the present invention. In addition, in situ hybridization and immunocytochemistry according to standard protocols can be done using genetically modified nucleic acid specific polynucleotide probes and antibodies, respectively, to localize sites of expression within genetically modified tissue. Generally, a number of genetically modified lines are usually screened for the incorporated nucleic acid to identify and select plants with the most appropriate expression profiles.


In one embodiment, the present invention provides a genetically modified plant that is homozygous for the introduced genetically modified nucleic acid; i.e., a genetically modified plant that contains two added nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome pair. A homozygous genetically modified plant can be obtained by sexually mating (selfing) a heterozygous genetically modified plant that contains a single added genetically modified nucleic acid, germinating some of the seed produced and analyzing the resulting plants produced for altered expression of a polynucleotide of the present invention relative to a control plant (i.e., native, non-genetically modified). Back-crossing to a parental plant and out-crossing with a non-genetically modified plant are also contemplated.


Transformed plant cells which are derived by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed genotype. Such regeneration techniques often rely on manipulation of certain phytohormones in a tissue culture growth medium.


Products of Genetically Modified Plants and Algae


Engineered plants exhibiting the desired physiological and/or agronomic changes can be used directly in agricultural production.


Thus, provided herein are products derived from the genetically modified plants or methods of producing genetically modified plants provided herein. In certain embodiments, the products are commercial products. Some non-limiting example include genetically engineered trees for e.g., the production of pulp, paper, paper products or lumber; tobacco, e.g., for the production of cigarettes, cigars, or chewing tobacco; crops, e.g., for the production of fruits, vegetables and other food, including grains, e.g., for the production of wheat, bread, flour, rice, corn; and canola, sunflower, e.g., for the production of oils or biofuels.


Biofuels


In one embodiment, biofuels are derived from a genetically engineered plant or algae of the present invention.


In one embodiment, a biofuel is a fuel that is produced through contemporary biological processes, such as agriculture and anaerobic digestion, as opposed to a fuel produced by geological processes such as those involved in the formation of fossil fuels, such as coal and petroleum, from prehistoric biological matter. Biofuels can be derived directly from plants, or indirectly from agricultural wastes.


In one embodiment, plants or plant parts as described herein are used as biofuel.


In one embodiment, algae are used as a biofuel. In one embodiment, the biofuel is selected from the group consisting of: biodiesel, ethanol, biojet fuel, and green gasoline.


In one embodiment, the biofuel is an alcohol fuel, such as bioethanol. In one embodiment, the bioethanol is produced by fermentation of sugars derived from wheat, corn, sugar beets, sugar cane, molasses, or a combination thereof. In another embodiment, the bioethanol is produced by fermentation of any sugar or starch from which alcoholic beverages such as whiskey, can be made (such as barley, potato and fruit waste, etc).


Thus, in one embodiment, the present invention provides a biofuel comprising genetically modified plants or plant parts of the present invention. In one embodiment, the present invention provides a process of producing a biofuel comprising: delivering a recombinant polynucleotide encoding a C-terminal-inactivated ACHT4 to plant or algae cells, regenerating plants or algae from said cells, screening said plants or algae to identify a plant having enhanced yield, extracting sugar or starch from some or all portions of said plant or algae or progeny thereof, fermenting said sugars to produce an alcoholic mixture, and distilling said alcohol from said mixture. Methods for producing biofuels from plants and plant parts are known in the art.


In one embodiment, algae are a sustainable source for essential omega-3 fatty acids.


In another embodiment, algae of the present invention are used as commodity animal feeds. In another embodiment, algae of the present invention are used as a source for foods. In one embodiment, essential omega-3 fatty acids from algae are used in infant formula and other food products and vitamins. In another embodiment, carbohydrates and emulsifiers produced from seaweeds are used in food products. In another embodiment, Spirulina is used in food products.


In another embodiment, algae of the present invention are used as a source for specialty feeds.


In one embodiment, algae contain carbohydrates, proteins, vegetable oils, micronutrients, vitamins, as well as valuable pigments used in animal feeds, such as beta carotene, lutein and astaxanthin. In another embodiment, algae is used as a source of feed in aquaculture operations, including as feed for fish and shellfish like clams, oysters, mussels and scallops.


In another embodiment, algae of the present invention are used as a source for chemicals. In one embodiment, the chemical is a plastic. In one embodiment, the chemical is a fertilizer. In one embodiment, the alga is seaweed. In one embodiment, some microalgae fix atmospheric nitrogen and could be a source of organic fertilizers (“green manure”).


In another embodiment, algae of the present invention are used as a source for cosmetics. In one embodiment, the cosmetic is a skin-care product. In one embodiment, the cosmetic provide UV protection. In another embodiment, algae of the present invention are used as a source for pharmaceuticals.


In one embodiment, multiple products from the same algal biomass are possible.


In certain embodiments, commercial products are derived from a genetically engineered species of woody, ornamental or decorative, crop or cereal, fruit or vegetable plant, and algae (e.g., Chlamydomonas reinhardtii), which may be used in the compositions and methods provided herein. Non-limiting examples of plants include plants from the genus Arabidopsis or the genus Oryza. Other examples include plants from the genuses Acorus, Aegilops, Allium, Amborella, Antirrhinum, Apium, Arachis, Beta, Betula, Brassica, Capsicum, Ceratopteris, Citrus, Cryptomeria, Cycas, Descurainia, Eschscholzia, Eucalyptus, Glycine, Gossypium, Hedyotis, Helianthus, Hordeum, Ipomoea, Lactuca, Linum, Liriodendron, Lotus, Lupinus, Lycopersicon, Medicago, Mesembryanthemum, Nicotiana, Nuphar, Pennisetum, Persea, Phaseolus, Physcomitrella, Picea, Pinus, Poncirus, Populus, Prunus, Robinia, Rosa, Saccharum, Schedonorus, Secale, Sesamum, Solanum, Sorghum, Stevia, Thellungiella, Theobroma, Triphysaria, Triticum, Vitis, Zea, or Zinnia.


In some embodiments, commercial products are derived from a genetically engineered gymnosperms and angiosperms, both monocotyledons and dicotyledons.


Examples of monocotyledonous angiosperms include, but are not limited to, asparagus, field and sweet corn, barley, wheat, rice, sorghum, onion, pearl millet, rye and oats and other cereal grains. Examples of dicotyledonous angiosperms include, but are not limited to tomato, tobacco, cotton, rapeseed, field beans, soybeans, peppers, lettuce, peas, alfalfa, clover, cole crops or Brassica oleracea (e.g., cabbage, broccoli, cauliflower, brussel sprouts), radish, carrot, beets, eggplant, spinach, cucumber, squash, melons, cantaloupe, sunflowers and various ornamentals.


In certain embodiments, commercial products are derived from a genetically engineered woody species, such as poplar, pine, sequoia, cedar, oak, etc.


In other embodiments, commercial products are derived from a genetically engineered plant including, but are not limited to, wheat, cauliflower, tomato, tobacco, corn, petunia, trees, etc.


In certain embodiments, commercial products are derived from genetically engineered crop plants, for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea, and other root, tuber, or seed crops. In one embodiment, commercial products are derived from a genetically engineered cereal crops, including, but are not limited to, any species of grass, or grain plant (e.g., barley, corn, oats, rice, wild rice, rye, wheat, millet, sorghum, triticale, etc.), non-grass plants (e.g., buckwheat flax, legumes or soybeans, etc.). In another embodiment, commercial products are derived from a genetically engineered grain plants that provide seeds of interest, oil-seed plants and leguminous plants. In other embodiments, commercial products are derived from a genetically engineered grain seed, such as corn, wheat, barley, rice, sorghum, rye, etc. In yet other embodiments, commercial products are derived from a genetically engineered oil seed plants, such as cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. In certain embodiments, commercial products are derived from a genetically engineered oil-seed rape, sugar beet, maize, sunflower, soybean, or sorghum. In some embodiments, commercial products are derived from genetically engineered leguminous plants, such as beans and peas (e.g., guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc.)


In certain embodiments, commercial products are derived from a genetically engineered horticultural plant of the present invention, such as lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower, and carnations and geraniums; tomato, tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper, chrysanthemum, poplar, eucalyptus, and pine.


In still other embodiments, commercial products are derived from a genetically engineered corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum, Nicotiana benthamiana), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ipomoea batatus), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Peryea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley, Arabidopsis spp., vegetables, ornamentals, and conifers.


Enhanced Phenotype


In one embodiment, “enhanced phenotype” as used herein refers to a measurable improvement in a crop trait including, but not limited to, yield increase, including increased yield under non-stress conditions and increased yield under environmental stress conditions. Stress conditions may include, for example, drought, shade, fungal disease, viral disease, bacterial disease, insect infestation, nematode infestation, cold temperature exposure, heat exposure, osmotic stress, reduced nitrogen nutrient availability, reduced phosphorus nutrient availability and high plant density. Many agronomic traits can affect “yield”, including without limitation, plant height, pod number, pod position on the plant, number of internodes, incidence of pod shatter, grain size, efficiency of nodulation and nitrogen fixation, efficiency of nutrient assimilation, resistance to biotic and abiotic stress, carbon assimilation, plant architecture, resistance to lodging, percent seed germination, seedling vigor, and juvenile traits. Other traits that can affect yield include, efficiency of germination (including germination in stressed conditions), growth rate (including growth rate in stressed conditions), seed size, composition of seed (starch, oil, protein) and characteristics of seed fill.


In another embodiment, the present invention also provides genetically modified plants that demonstrate enhanced phenotypic properties that may or may not confer an increase in overall plant yield. Such properties include enhanced plant morphology, plant physiology or enhanced components of the mature seed harvested from the genetically modified plant. In another embodiment, the present invention also provides genetically modified plants with enhancements in seed oil, tocopherol, protein and starch components, including increases in the quantity of any of these components, alterations in the ratios of any of these components, or production of new types of these components that do not exist in the seed from control plants. By way of example, increases in total tocopherol content are desirable, as are increases in the relative percentage of alpha-tocopherol produced by plants.


In one embodiment, “increased yield” of a genetically modified plant of the present invention may be evidenced and measured in a number of ways, including test weight, seed number per plant, seed weight, seed number per unit area (i.e. seeds, or weight of seeds, per acre), bushels per acre, tonnes per acre, tons per acre, kilo per hectare. Increased yield may result from improved utilization of key biochemical compounds, such as nitrogen, phosphorous and carbohydrate, or from improved responses to environmental stresses, such as cold, heat, drought, salt, and attack by pests or pathogens. Polynucleotides of the present invention may also be used to provide plants having improved growth and development, and ultimately increased yield, as the result of modified expression of plant growth regulators or modification of cell cycle or photosynthesis pathways.


Use for Gene Suppression


In one embodiment, polynucleotides of the present invention include recombinant polynucleotides providing for expression of mRNA encoding a polypeptide. In another embodiment, polynucleotides of the present invention include recombinant polynucleotides providing for expression of mRNA complementary to at least a portion of an mRNA native to the target plant for use in suppression of the ACTH4 gene.


In one embodiment, “gene suppression” is used herein to refer to reduction or suppression of expression of a target protein in a host cell as the result of transcription of a recombinant polynucleotide provided herein, wherein the polynucleotide is oriented with respect to a promoter to provide for production of RNA having a gene silencing effect, such as antisense RNA or interfering RNA (RNAi).


Other Methods of Down-Regulating ACHT4 Protein Expression in Plants and Algae


In another embodiment, the present invention provides an antibody against a polypeptide comprising a C-terminal deleted form of atypical CYS HIS rich thioredoxin 4 (ACHT4).


In one embodiment, down-regulation of ACHT4 protein is partial. In another embodiment, the down-regulation completely eliminates protein activity by decreasing overall steady state levels of the protein associated therewith.


In one embodiment, down-regulation of ACHT4 protein comprises decreasing the levels of ACHT4 protein. In another embodiment, down-regulation of ACHT4 protein comprises decreasing the activity of ACHT4 protein.


In one embodiment, the down-regulation is achieved by antisense RNA. In another embodiment, the down-regulation is achieved by ribozyme technology, which, in one embodiment, works at the RNA translational level and involves making catalytic RNA molecules which bind to and cleave the mRNA of interest. Both of these were found effective in regulating protein levels in plants. In another embodiment, the down-regulation is achieved by co-suppression.


In another embodiment, the down-regulation is achieved using antibodies. In one embodiment, the antibody is a monoclonal antibody. In another embodiment, the down-regulation is achieved using functional fragments of antibodies, which in one embodiment, is a single chain antibody (SCAb).


In one embodiment, the antibody binds to ACHT4ΔC. In another embodiment, the antibody binds to ACHT4.


In another embodiment, the present invention provides methods for using antibodies to ACHT4 and/or ACHT4ΔC as described herein and functional fragments thereof (e.g., Fv or Fab fragments), for increasing plant or algae yield and/or growth comprising administering said antibody to a plant or alga. Methods for producing antibodies and functional fragments of antibodies are known in the art.


In another embodiment, the present invention provides methods for using anti-sense RNA, ribozymes, etc for increasing plant or algae yield and/or growth comprising transforming said plant or alga with said RNA or ribozyme.


Combinations of Modified Genetic Traits


The present invention also encompasses genetically modified plants with stacked engineered traits, e.g. a crop having an enhanced phenotype resulting from expression of a recombinant polynucleotide provided herein, in combination with herbicide and/or pest resistance traits. For example, genes of the current invention can be stacked with other traits of agronomic interest, such as a trait providing herbicide resistance, for example a RoundUp Ready trait, or insect resistance, such as using a gene from Bacillus thuringensis to provide resistance against lepidopteran, coliopteran, homopteran, hemiopteran, and other insects. Herbicides for which resistance is useful in a plantinclude glyphosate herbicides, phosphinothricin herbicides, oxynil herbicides, imidazolinone herbicides, dinitroaniline herbicides, pyridine herbicides, sulfonylurea herbicides, bialaphos herbicides, sulfonamide herbicides and gluphosinate herbicides.


All patents, patent applications, and scientific publications cited herein are hereby incorporated by reference in their entirety.


The following examples are presented in order to more fully illustrate the preferred embodiments of the invention. They should in no way be construed, however, as limiting the broad scope of the invention.


EXAMPLES
Example 1

ACHT4-Driven Oxidation of APS1 Attenuates Starch Synthesis Under Low Light Intensity in Arabidopsis Plants


Materials and Methods


Plant Material and Growth Conditions



Arabidopsis thaliana var Columbia were grown under a 8/16 h light/dark cycle at 20° C./18° C., respectively, at 80 μE*m−2*s−1 (unless otherwise stated) for 3-4 weeks. Thylakoid membranes were isolated as previously described.


Generation of Genetically Modified Plants


ACHT4, ACHT4MT (in which the non-nucleophilic cysteine of the active site was replaced with a serine), ACHT4ΔC and ACHT1 open reading frames were ligated upstream and in frame of the HA3 affinity tag and under the control of the 35S promotor into pART7 vector. All four constructs were used to transform Arabidopsis leaves using a standard floral dip transformation protocol (Clough and Bent Plant J. 1998 December; 16(6):735-43, incorporated herein by reference in its entirety). In short, the floral dip transformation method involves simple dipping of developing floral tissues into a solution containing Agrobacterium tumefaciens, 5% sucrose and 500 microliters per litre of surfactant Silwet L-77.


Protein Redox Assays, Immunoblot and Affinity Purification Analyses


The disulfide state of plant-extracted proteins, the identification of intermolecular disulfide complexes, and their isolation by immunoprecipitation were assayed in planta as previously described. The mass-spectrometry (MS) analysis is detailed in Table 3.









TABLE 3







Identification of 2-Cys Prx and APS1 as ACHT4


targets by mass spectrometry














Protein






sequence
Protein


Protein


coverage
length


name
Queries matched
Score
(%)
(aa)





2-CYS-
APDFEAEAVFDQEFIK (3)
478
44
266


PRX
LNTEVLGVSVDSVFSHLAWVQ






TDR (2)






SGGLGDLNYPLISDVTK (2)






SFGVLIHDQGIALR (2)






GLFIIDK (1)






EGVIQHSTINNLGIGR (1)






TLQALQYIQENPDEVCPAGWK






PGEK (2)








APS1
LIDIPVSNCLNSNISK (1)
572
37
520



IYVLTQFNSASLNR (2)






NEGFVEVLAAQQSPENPNWFQ






GTADAVR (4)






ETDADITVAALPMDEQR (1)






VDTTILGLDDQR (1)






EMPFIASMGIYVVSR (1)






NQFPGANDFGSEVIPGATSLG






LR (3)






VQAYLYDGYWEDIGTIEAFYN






ANLGITK (1)






KPVPDFSFYDR (1)






MLDADVTDSVIGEGCVIK






(1)






IINSDNVQEAAR (1)









The Mass-spectrometry analysis was performed by the Biological Mass Spectrometry Unit at Weizmann Institute of Science by online reversed-phase nano-liquid chromatography, electrospray ionization tandem mass spectrometric analyses. Survey scans were recorded in the FT-mode followed by data-dependent collision-induced dissociation of the 7 most intense ions in the linear ion trap. Raw spectra were processed using open-source software DTA SuperCharge. The data were searched with MASCOT (Matrix Science, London, UK) against a Swissprot or NCBI database. Control samples treated with DTT or derived from wild-type plants, allowed for the subtraction of nonspecific background proteins.


Trapped intermolecular disulfide protein complexes were incubated overnight at 4° C. in RIPA buffer (1% sodium deoxycholate, 0.1% SDS, 1% Triton X-100, 10 mM Tris-HCl, pH 8, and 140 mM NaCl) with either anti-HA (Sigma A2095) resin or anti-2-Cys Prx- or anti-APS1-coated protein G beads (Amersham). The proteins were eluted with either reducing or nonreducing 2×sample buffer and separated on SDS-PAGE gels for immunoblots or for MS analysis. Anti-APS1 polyclonal antibodies were raised in rabbits at GenScript HK Ltd., using a purified peptide (CILGLDDQRAKEMPF). 2-Cys Prx-specific polyclonal antibodies were as in Dangoor, 2009 Plant Physiol. 149:1240-1250, which is incorporated herein by reference in its entirety. Mouse monoclonal anti-HA antibodies (SIGMA H9658) were used in protein blot assays.


Starch Analysis


Starch content of 0.8 g samples of two month-old rosettes was analyzed using the SIGMA starch assay kit (SA20-1KT). Every replicate included ten rosettes.


Accession Numbers


Sequence data can be found in the Arabidopsis Genome Initiative or GenBank/EMBL databases under the following accession numbers: ACHT4 (AT1G08570), 2-Cys PrxA (At3g11630), and APS1 (AT5G48300).


Results


Reoxidation of ACHT4 by 2-Cys Prx Shortly after Illumination


In order to examine whether other thylakoid-associated members of the ACHT family have similar or unique roles to that of ACHT1, we analyzed the redox state changes of the ACHT4 catalytic site following the onset of growth light (80 to 100 μE*m−2*s−1) after a typical 16 h night period in plants expressing ACHT4 (Dangoor, 2009). The catalytic site of ACHT4 was found to be mostly disulfide-bonded at the end of the night and to undergo rapid reduction within 1 min of exposure to the light (FIG. 1A). As reported for ACHT1, significant oxidation of the ACHT4 catalytic site was observed within 30 min of illumination and counteracted its reductive state. Thereafter, ACHT4 redox state appeared to be in a dynamic quasi-oxidized state. A control experiment demonstrated that the total amount of ACHT4 did not change significantly (FIG. 1A, reduced panel) under the experimental conditions.


To investigate the identity of the proteins that ACHT4 exchanges disulfides with in planta, we captured as in (Dangoor, 2012, The Plant Cell 24(5):1894-1906, which is incorporated herein by reference in its entirety) its intermolecular disulfide reaction intermediates. Protein blot analysis of denatured, but not reduced, plant extracts identified three intermolecular disulfide-linked ACHT4-containing protein complexes, verified by their susceptibility to chemical reduction by dithiothreitol (DTT) (FIG. 1B, ACHT4 panel). A comparison to the ACHT1 intermolecular disulfide-linked protein complexes (FIG. 1B, ACHT1 panel) showed that two of the main ACHT4 complexes, with estimated sizes of ˜55 kDa and ˜70 kDa, corresponded to the combined molecular weight of the previously characterized heterodimeric and the heterotrimeric ACHT1 and 2-Cys Prx complexes, respectively (Dangoor, 2012). Interestingly, an additional third intermolecular disulfide ACHT4 complex (marked with asterisk), with a higher molecular weight than that of the ACHT4 2-Cys Prx heterotrimer, and which did not form in ACHT1 extracts, was identified as well.


To verify the authenticity of ACHT4-2-Cys Prx intermolecular disulfide complexes, they were pulled down in a reciprocal analysis performed under non-reducing denaturing conditions, with either anti-HA (for ACHT4) or anti-2-Cys Prx sera. Protein blot analysis of denatured and reduced samples identified the 2-Cys Prx in the anti-HA pulled down complexes and ACHT4 in the anti-2-Cys Prx pulled down complexes (FIG. 1C). As expected, mass-spectrometry analysis identified both ACHT4 and 2-Cys Prx in the gel slice containing the heterotrimer complex but not in a corresponding gel slice containing extracts that were separated under reducing conditions (Table 3). We concluded that ACHT4 exchanged disulfides with the 2-Cys Prx in plants, in a similar manner as that reported for ACHT1.


APS1 is a Unique Target of ACHT4


The comparison of the intermolecular disulfide complexes formed in planta by ACHT4 and ACHT1 uncovered a major disulfide linked complex unique to ACHT4 (FIG. 1B). A mass spectrometry analysis identified APS1 in the gel slice containing the unique complex of ACHT4 but not in a corresponding reduced gel slice (Table 3). To verify these findings, the protein intermolecular disulfide complexes were reciprocally pulled-down under non-reducing denaturing conditions with either anti-HA, for ACHT4, or anti-APS1. Protein blot analyses identified the APS1 in the anti-HA pulled down complexes and ACHT4 in the anti-APS1 pulled down complexes (FIG. 1D), suggesting that APS1 is a target protein of ACHT4. These findings also implied that although ACHT1 and ACHT4 share a similar mode of oxidation by 2-Cys Prx, they differ in at least one major target, suggesting that they may serve to regulate distinctive processes.


ACHT4 Participates in the Diurnal Redox Regulation of AGPase


The trapping in vivo of APS1-ACHT4 and 2-Cys Prx-ACHT4 disulfide exchange reaction intermediates (RIs) opened the possibility of studying the environmental stimuli that influence ACHT4-driven AGPase redox control. First, we analyzed the changes in the 2-Cys Prx-ACHT4 and APS1-ACHT4 RIs, and the corresponding changes in the APS1 redox state upon light onset of plants grown under a 8/16 h light/dark regime. Intriguingly, the low APS1-ACHT4 RI level and the high 2-Cys Prx-ACHT4 level in the dark contrasted each other (FIG. 2A). Since the 2-Cys Prx RIs were high in the dark also with ACHT1 (Dangoor, 2012), it suggested that the ACHT4 disulfide transfer reaction with APS1 differs from the reducing reaction of either ACHT1 or ACHT4 with 2-Cys Prx (Dangoor, 2009). As the dark is a relatively stable condition, i.e. changes in redox state of reacting proteins are not expected, these finding could conceivably reflect an opposite directionality of the disulfide transfer reaction, ACHT4 to APS1 versus 2-Cys Prx to ACHT4, which could possibly be derived from dissimilar redox midpoint potentials of the proteins. Consequently, the contrasting levels of ACHT4 RIs in the dark would be a result of the levels of its two substrates, high level of oxidized 2-Cys Prx and low level of reduced APS1. Notably, the increase in the level of APS1-ACHT4 RIs upon illumination (FIG. 2A) matched the increase in reduction of APS1 at this time period, further supporting this notion. The APS1-ACHT4 RI level approached steady state level after a 30 min transition period in the light. The 2-Cys Prx-ACHT4 RI levels (FIG. 2A) showed a similar pattern to those of 2-Cys Prx-ACHT1 (Dangoor, 2012) also during illumination, a transient decrease, then an increased level, which reached a steady state after the 30 min transition period in the light.


The analysis of the concomitant changes in the redox state of APS1 showed that APS1 was resting in the inactive intermolecular disulfide form in the dark (FIG. 2B). In addition, in dark conditions, a monomeric form of APS1 with slightly lower molecular weight (MW) that was converted to the monomeric form with the expected MW during the first 5 min of illumination was observed. The conversion of the faster migrating form to the slower form upon illumination, suggested that the lower MW monomer could be a compacted APS1 form bearing an intramolecular disulfide, the reduction of which creates a stretched molecule in the light. To verify that, we compared the migration of the APS1 monomer purified from dark protein extracts and chemically reduced, with DTT, in vitro, to the monomer purified from light extracts (FIG. 2C). The DTT-reduced monomer derived from dark extracts migrated parallel to the monomer from light extract, whereas the DTT-reduced monomer isolated from light extracts did not alter its migration, indicating that an APS1 intramolecular disulfide indeed participated in the redox control of APS1. Thus, we analyzed the monomeric APS1 via a methodology that exclusively measures the levels of reduced monomer. In this method, the reduced cysteines are first blocked with N-ethyl maleimide (NEM), and the disulfides are then chemically reduced and reacted with methoxypolyethylene glycol-maleimide (mPEG). We found that the level of reduced monomer was barely detectable in the dark, gradually increased during the 30 min transition period and reached a steady state level thereafter (FIG. 2D). In parallel, 2-Cys Prx-ACHT4 RI levels, indicative of ACHT4 oxidative signal, were low during the transition period and also reached relatively stable values only thereafter (FIG. 2A), suggesting that the oxidative signal of ACHT4 dynamically counteracted APS1 reduction by Trx-f1, and approached a dynamic equilibrium after 30 min in the light.


Both 2-Cys Prx ACHT4 and APS1 RI levels and APS1 redox state after the light was switched off, i.e. the period in which oxidation is expected to turn off AGPase, were inverse to those observed after the onset of light. The 2-Cys Prx-ACHT4 RI remained stable during the first 5 min and then increased, suggesting increased oxidation by 2-Cys Prx during that time period (FIG. 2E). Levels of the reduced monomeric APS1 form gradually decreased alongside a concomitant increase in the oxidized, intramolecular disulfide-bearing monomer and intermolecular disulfide-linked dimer (FIG. 2F). At the same time, the level of the APS1 reaction intermediate, which was high at the end of the day, gradually decreased and reached levels similar to those observed before the beginning of the day (FIG. 2E). These results are consistent with an increased oxidation rate of APS1 by ACHT4 during the transition from day to night, resulting in diminishing APS1 activity.


ACHT4 Participates in the Regulation of APS1 During Fluctuations in Light Intensity


We found that ACHT4 participated in the diurnal regulation of APS1, that has been proposed to influence the day and night cycles of starch synthesis and degradation. The reactions of ACHT4 with APS1 and 2-Cys Prx, as judged by the levels of their disulfide exchange reaction intermediates, seemed to reach balanced levels after a transition period in the light (FIG. 2A), as manifested by APS1 activation levels (FIG. 2B), suggesting that ACHT4 oxidation of APS1 is active and dynamically counteracting its reduction by Trx-f1 during the day. Such activity might be important for the rationing of photosynthates to starch and sugars exported from the chloroplast during the day. Alternatively, as shown in the unicellular green alga Chlamydomonas reinhardtii, such activity may regulate the levels of starch to be used as a transient pool for reduced carbon in a fluctuating light environment. We tested this hypothesis by subjecting the plants to small fluctuations of light intensity during the day. Interestingly, when light intensity was reduced to 10 μE*m−2*s−1 after a 2-hour 50 μE*m−2*s−1 light regime, APS1-ACHT4 RI levels declined within 5 min, and rose again when the light intensity was switched back to 50 μE*m−2*s−1 (FIG. 3A). While 2-Cys Prx-ACHT4 RI levels remained steady upon decreased light intensity, they rapidly declined following the transfer back to the higher light intensity (FIG. 3A). In parallel, a lower ratio of reduced to oxidized APS1 was obtained following the transfer to the lower light and a higher ratio was observed upon increased light intensity (FIG. 3B). These results suggest that, in addition to its diurnal role of ACHT4-driven oxidation of APS1 in the beginning of the night, it may also participate in the dynamic regulation of AGPase activity in response to natural fluctuations in light intensity. Furthermore, the reduction of APS1 upon increased light intensity occurred rapidly in comparison to its oxidation upon decreased light intensity (FIG. 3B), suggesting that AGPase redox regulation might also play a role in stimulating a transient burst of starch synthesis to accommodate an abruptly increased light intensity. The concomitant gradual decrease of both APS1-ACHT4 RIs levels and reduced APS1 upon lowering the light intensity and the abrupt increase in both upon increased intensity further supported the notion that ACHT4 reacted with the reduced APS1.


Example 2
Expression of C-Terminal Truncated Form of AtACHT4 Increases Transitory Starch Content in Arabidopsis Leaves and Increases Plant Biomass

The C-terminus of ACHT4 is important for its reaction with APS1


We then assessed whether the distinct 47-amino acid-long ACHT4 C-terminus (FIG. 4A) is responsible for its in planta differences from ACHT1, by analyzing the 2-Cys Prx and APS1 RIs in plants expressing ACHT4 lacking the C-terminus (ACHT4ΔC). Notably, the ACHT4ΔC form reacted with 2-Cys Prx, and formed the two intermolecular disulfide-linked 2-Cys Prx ACHT4 RIs, but failed altogether to interact with APS1 (FIG. 2B). The preferential reaction of ACHT4ΔC with the 2-Cys Prx and not with APS1 was maintained throughout the transition from night to day (FIG. 4C). Moreover, the profile of the reaction of ACHT4ΔC with 2-Cys Prx during the first 2 hr of illumination was similar to that of ACHT4, indicating that the oxidation of ACHT4ΔC by 2-Cys Prx does not involve the protein's C-terminus and that the C-terminus deletion only affected ACHT4-driven redox control of APS1.


Both ACHT4 and ACHT1 are thylakoid-associated proteins. The distinct reaction of ACHT4, and not of ACHT1, with APS1, prompted us to investigate whether the disparity might be due to different thylakoid localization, either the grana, the grana margins, or the stroma lamella. Protein blot analysis showed that ACHT1 was primarily found in both the grana and the stroma lamella domains, whereas ACHT4 was mainly present in the stroma lamella domain and was undetectable in the grana (FIG. 4D), further promoting distinct roles of ACHT4 and ACHT1. The membrane association of ACHT4ΔC was then analyzed to determine whether ACHT4 localization is influenced by its unique C-terminus. Only a small increase in the partitioning of ACHT4ΔC to the grana margins was observed, suggesting that the ACHT4 C-terminus deletion effect on its disulfide exchange reaction with APS1 was direct and not mediated via its thylakoid domain localization.


The finding that the deletion of the ACHT4 C-terminus diminished its disulfide exchange reaction with APS1 (FIG. 2B) facilitated studying the effect of ACHT4 on APS1 redox state and on transitory starch content. We first compared the APS1 redox state after two hours in the light, representing steady state conditions in which transitory starch is synthesized, and when light intensity was reduced, conditions in which APS1 is being oxidized (FIG. 3). Consistently with the expected oxidative role of ACHT4 and the dominant negative effect of the C-terminus deletion, plants expressing increased levels of ACHT4 exhibited lower levels of reduced APS1 monomer, and plants expressing ACHT4ΔC contained higher levels of the reduced APS1 monomer, than WT plants (FIG. 5A). Next, in order to determine whether the influence of ACHT4 on the redox state of APS1 impacts starch accumulation, we assayed the starch content of WT ACHT4 and ACHT4ΔC plants at the end of the day (FIG. 5B). In agreement with the changes in the APS1 redox state, plants expressing ACHT4 showed reduced starch levels, whereas plants expressing ACHT4ΔC contained increased starch levels. These results corroborate the notion that disulfides transferred by ACHT4 to APS1 are used in planta to fine-tune APS1 activity by counterbalancing the reduction of APS1 by the reductive type Trxs.


Expression of AtACHT4ΔC Also Increases Arabidopsis Biomass by 10%


Stimulation of starch synthesis results not only in increased accumulation of transitory starch in Arabidopsis leaves, it also stimulates growth, indicating that OE of AtACHT4ΔC stimulates the export of photosynthates from the chloroplast which are then directed toward growth and biomass accumulation.


AtACHT4ΔC-OE, AtACHT4-OE and WT lines were grown under long day (18 h/6 h of light/dark cycle at 21° C./18° C.) for 4 weeks. The plants shoot was excised and fresh weight (FW) were recorded. The tissues were dried at 60° C. for 4 days and dry weight (DW) were recorded. The FW of AtACHT4ΔC-OE plants was increased by 10.6% (FIG. 6A) and the DW was increased by 9.1% over those of WT (FIG. 6B), indicating that AtACHT4ΔC stimulates the export of photosynthates from the chloroplast which are then directed toward growth. The FW of AtACHT4 OE plants was decreased by 10.9% (FIG. 6A) and the DW was increased by 11.5% over those of WT (FIG. 6B), confirming that the C-terminus of ACHT4 attenuates growth.


Example 3
Expression of C-Terminal Truncated Form of Potato StACHT4-2 and StACHT4-1 Paralogs Increases Correspondingly Potato Tuber Yield and Transitory Starch Content in Leaves

Potato Plants



Arabidopsis has one paralog of ACHT4 where other crop plants, including potato, maize, rice, barley, wheat, sorghum, castor, bean, rapeseed, cotton, soybean, beat, banana, chili, chickpea, tomato, African oilpalm, Foxtail millet, cassava and the algae Chlamydomonas and Chlorella have one to five paralogs (FIG. 7).


Materials and Methods


Identification and Over Expression of AtACHT Homologs in Potato


We analyzed the potato genome for AtACHT4 homologs. Protein blast (blastP) analysis of AtACHT against the genome database of potato (solgenomics.net) identified two paralogs, StACHT4-1 (XP_006348023.1) and StACHT4-2 (XP_006351368.1). A 69 and 68 amino acid long tail region at the C-terminus of StACHT4-1 and StACHT4-2, correspondingly, were identified. Four constructs of StACHT4-1ΔC and StACHT4-2ΔC, StACHT4-1, and StACHT4-2 were custom synthesized by Hy-Laboratories Ltd., Israel, were subcloned into pART7 vector under control of CaMV35S promoter and OCS terminator. HA (Human influenza hemagglutinin) tag was fused in frame at the C-terminus of each of the protein coding sequences. All four StACHT4::pART7 and StACHT4-2ΔC:: pART7 constructs were digested with NotI and subcloned into pART27 plant expression vector (FIG. 11) and used to transform potato leaf discs (Solanum tuberosum cv. Desiree) using a standard agrobacterium-based transformation protocol. Positive transformants were identified by PCR analysis and protein expression was verified by immunoblot analysis using anti-HA antibody.


Results


The Over Expression of StACHT4-2ΔC in Potato Plants Nearly Doubles the Tubers Yield in Comparison to WT Plants


The StACHT4-2ΔC-OE and WT plants (Solanum tuberosum cv. Desiree) were planted on May 22, 2016 and were grown for 60 days in the green-house. Plants were harvested, tubers were collected, counted and their fresh weight was recorded (FIG. 8). The yield analysis of the StACHT4-2ΔC-OE lines showed 91% increase in the total tuber yield per plant as compared to WT plants. Photographs of representative plants show the increase in biomass accumulation of both shoots and tubers (FIG. 9).


StACHT4-1ΔC OE Lines Accumulated Increased Level of Transitory Starch in Leaves


To test whether StACHT4-1ΔC OE proteins regulate transitory starch content, young green leaves from 6-weeks old green-house grown plant were collected and analyzed for the starch content by Sigma starch assay kit (Catalog Number SA20) with some modifications. Approximately, 0.25-0.5 g fresh leaves were ground to fine powder using liquid nitrogen and then suspended into 20 ml DMSO and 5 ml of 8M HCl solution. The suspensions were incubated for 30 min at 60° C. and then 50 ml water was added to it. The pH was adjusted between 4 and 5, allowed to cool and volume adjusted till 100 ml. From these samples 400 μl were mixed with same volume of starch assay reagent, incubated for 15 min at 60° C. and then allowed to cool at room temperature. Then 200 μl of starch assay mixture were mixed with same volume of glucose assay reagent, incubated at room temperature for 15 min and then absorbance was recorded at 340 nm. Simultaneously, standard starch powder (provided in kit) at concentration of 0, 2, 4, 6, 8 and 10 mg were also used for starch assay and a standard curve was plotted using the absorbance data. The absorbance recorded from the leaves sample of potato transgenic lines were then used to calculate the starch level in leaves using formula obtained from the standard curve. Notably, the StACHT4-1ΔC OE lines accumulated 19% higher starch content relative to WT plants (FIG. 10).


OE of the potato paralogs StACHT4-2ΔC and StACHT4-1ΔC relieves growth and starch synthesis attenuation, as was found for OE of Arabidopsis AtACHT4ΔC. However, in potato, each of the two potato paralogs has a unique function. OE expression of StACHT4-2ΔC stimulated the allocation of photosynthates towards growth and near doubles tuber yield and plants shoot growth (FIGS. 8-9), whereas OE expression of StACHT4-1ΔC stimulated transitory starch (FIG. 10). Importantly, OE of the full length protein of either StACHT4-2 (FIG. 8) or StACHT4-1 (FIG. 10) did not result in the stimulating effect, again demonstrating that the expressed C-terminus truncated form of ACHT4 has a negative dominant effect, in StACHT4-2ΔC or StACHT4-1-ΔC, as well as in AtACHT4ΔC.


It will be appreciated by those skilled in the art that changes could be made to the embodiments described above without departing from the broad inventive concept thereof. It is understood, therefore, that this invention is not limited to the particular embodiments disclosed, but it is intended to cover modifications that are within the spirit and scope of the invention, as defined by the appended claims.


It is the intent of the Applicant(s) that all publications, patents and patent applications referred to in this specification are to be incorporated in their entirety by reference into the specification, as if each individual publication, patent or patent application was specifically and individually noted when referenced that it is to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting. In addition, any priority document(s) of this application is/are hereby incorporated herein by reference in its/their entirety.

Claims
  • 1. A recombinant polynucleotide encoding an atypical CYS HIS rich thioredoxin 4 (ACHT4) protein, wherein the C-terminal portion of said ACHT4 protein comprises an inactivating mutation.
  • 2. A composition comprising the polynucleotide of claim 1.
  • 3. An expression vector comprising the polynucleotide of claim 1.
  • 4. The expression vector of claim 3 further comprising a constitutive, inducible, or tissue-specific promoter operably linked to the polynucleotide.
  • 5. A composition comprising the expression vector of claim 4.
  • 6. A cell comprising the expression vector of claim 4.
  • 7. A composition comprising the cell of claim 6.
  • 8. A seed comprising the recombinant polynucleotide of claim 1.
  • 9. A plant, or plant part, comprising the recombinant polynucleotide of claim 1.
  • 10. The plant or plant part of claim 9, wherein said plant is an Arabidopsis plant.
  • 11. The plant or plant part of claim 9, wherein said plant is a moss.
  • 12. A biofuel comprising the plant or plant part of claim 9.
  • 13. An alga comprising a C-terminal inactivated form of an atypical CYS HIS rich thioredoxin 4 (ACHT4) gene.
  • 14. A biofuel comprising the algae of claim 13.
  • 15. A polypeptide encoded by the recombinant polynucleotide of claim 1.
  • 16. A method of increasing the yield of a plant or algae comprising contacting a cell from said plant or algae with the recombinant polynucleotide of claim 1, thereby increasing the yield of said plant or algae.
  • 17. A method of increasing the productivity of a plant or algae comprising contacting a cell from said plant or algae with the recombinant polynucleotide of claim 1, thereby increasing the productivity of said plant or algae.
  • 18. A method of increasing the biomass of a plant or algae comprising contacting a cell from said plant or algae with the recombinant polynucleotide of claim 1, thereby increasing the biomass of said plant or algae.
  • 19. A method of stimulating the growth of a plant or algae comprising contacting a cell from said plant or algae with the recombinant polynucleotide of claim 1, thereby stimulating the growth of said plant or algae.
  • 20. A method of enhancing the starch content of a plant or algae comprising contacting a cell from said plant or algae with the recombinant polynucleotide of claim 1, thereby enhancing the starch content of said plant or algae.
RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 15/753,024 filed on Feb. 15, 2018, which is a National Phase Application of PCT Patent Application No. PCT/IL2016/050891, having International Filing Date of Aug. 16, 2016, claiming priority from U.S. Provisional Patent Application Ser. No. 62/205,768 filed on Aug. 17, 2015. The contents of the above applications are all incorporated by reference as if fully set forth herein in their entirety.

Provisional Applications (1)
Number Date Country
62205768 Aug 2015 US
Continuations (1)
Number Date Country
Parent 15753024 Feb 2018 US
Child 17676315 US