The present disclosure relates to the fields of agriculture, plant biotechnology, and plant molecular biology. More specifically, the present disclosure relates to recombinant DNA molecules encoding engineered proteins that are useful for conferring tolerance to β-triketone herbicides such as mesotrione, that are phytotoxic inhibitors of plant hydroxyphenylpyruvate dioxygenases (HPPD). The present disclosure also provides herbicide tolerant transgenic plants, plant parts, cells, and seeds comprising the recombinant DNA molecules, and methods of using the same.
A computer readable form of a sequence listing is filed with this application by electronic submission and is incorporated into this application by reference in its entirety. The sequence listing is contained in the file named MONS_523WO_ST26, which is 89.7 kilobytes in size (measured in operating system MS Windows) and was created on Feb. 1, 2023.
Agricultural crop production often utilizes transgenic traits created using the methods of biotechnology. A heterologous gene, also known as a transgene, can be introduced into a plant to produce a transgenic trait. Expression of the transgene in the plant confers a trait, such as herbicide tolerance, to the plant. Examples of transgenic herbicide tolerance traits include glyphosate tolerance, glufosinate tolerance, and dicamba tolerance. With the increase of weed species resistant to the commonly used herbicides, new herbicide tolerance traits are needed in the field. Herbicides of particular interest include herbicides that inhibit hydroxyphenylpyruvate dioxygenases (HPPD), referred to as HPPD inhibitor herbicides. HPPD inhibitor herbicides provide control of a spectrum of herbicide-resistant weeds, thus making a trait conferring tolerance to these herbicides particularly useful in a cropping system combined with one or more other herbicide-tolerance trait(s). Provided herein are novel, engineered enzymes useful for providing HPPD herbicide tolerance in plants.
Provided herein is a recombinant DNA molecule comprising a nucleic acid sequence encoding a protein having triketone dioxygenase (TDO) activity, wherein the protein has at least 70% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16, wherein the protein comprises at least one amino acid substitution as compared to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, and wherein the protein comprises: a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), serine(S), or glycine (G) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2; a tyrosine (Y), phenylalanine (F), or histidine (H) at the position corresponding to position 141 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), serine(S), threonine (T), valine (V), or glycine (G) at the position corresponding to position 229 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), or valine (V), at the position corresponding to position 299 of SEQ ID NO: 2; or a combination of any thereof; provided that: (i) when a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), or serine(S) is present at the position corresponding to position 230 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 229 of SEQ ID NO: 2; and (ii) when an alanine (A), leucine (L), valine (V), isoleucine (I), threonine (T), or serine(S) is present at the position corresponding to position 229 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 230 of SEQ ID NO: 2.
In some embodiments, a protein as described herein has at least 70% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, and 16 and wherein the protein comprises: a threonine (T) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2; or a combination of any thereof. In other embodiments, wherein the protein comprises: a threonine (T) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; and an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2. In certain embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 2 and comprises one or more amino acid substitutions selected from the group consisting of H141F, H141Y, G229A, G229I, G229L, G229S, G229T, G229V, T230A, T230G, T230I, T230L, T230S, T230V, L299A, L299I, L299V, and combinations of any thereof. In some embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 4 and comprises one or more amino acid substitutions selected from the group consisting of H141F, H141Y, A230G, A230I, A230L, A230S, A230T, A230V, G231A, G231I, G231L, G231S, G231T, G231V, F300A, F300I, F300L, F300V, G302S, I333R, and combinations of any thereof. In other embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G231T, G302S, I333R, and combinations of any thereof. In further embodiments, the protein further comprises an isoleucine (I) at position 336 of SEQ ID NO: 4.
In additional embodiments, a protein as described herein has at least 70% sequence identity to SEQ ID NO: 6 and comprises one or more amino acid substitutions selected from the group consisting of Q140F, Q140H, Q140Y, G229A, G229I, G229L, G229S, G229T, G229V, P230A, P230G, P230I, P230L, P230S, P230T, P230V, L299A, L299I, L299V, A301S, L332R, V335I, and combinations of any thereof. In further embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of P230T, A301S, L332R, V335I, and combinations of any thereof. In some embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 8 and comprises one or more amino acid substitutions selected from the group consisting of H144F, H144Y, G233A, G233I, G233L, G233S, G233T, G233V, G234A, G234I, G234L, G234S, G234T, G234V, F303A, F303I, F303L, F303V, G305S, L336R, F339I, and combinations of any thereof. In certain embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G234T, G305S, L336R, F339I, and combinations of any thereof. In other embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 10 and comprises one or more amino acid substitutions selected from the group consisting of H145F, H145Y, G234A, G234I, G234L, G234S, G234T, G234V, G235A, G235I, G235L, G235S, G235T, G235V, F304A, F304I, F304L, F304V, A306S, V337R, L340I, and combinations of any thereof. In certain embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G235T, A306S, V337R, L340I, and combinations of any thereof. In some embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 12 and comprises one or more amino acid substitutions selected from the group consisting of Q140F, Q140H, Q140Y, G229A, G229I, G229L, G229S, G229T, G229V, F299A, F299I, F299L, F299V, G230A, G230I, G230L, G230S, G230T, G230V, G301S, I332R, L335I, and combinations of any thereof. In certain embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G230T, G301S, 1332R, L335I, and combinations of any thereof. In other embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 14 and comprises one or more amino acid substitutions selected from the group consisting of H141F, H141Y, A212G, A212I, A212L, A212S, A212T, A212V, G213A, G213I, G213L, G213S, G213T, G213V, F282A, F282I, F282L, F282V, G284S, V315R, V318I, and combinations of any thereof. In further embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G213T, G284S, V315R, V318I, and combinations of any thereof. In some embodiments, the protein has at least 70% sequence identity to SEQ ID NO: 16 and comprises one or more amino acid substitutions selected from the group consisting of Q140F, Q140H, Q140Y, G229A, G229I, G229L, G229S, G229T, G229V, F299A, F299I, F299L, F299V, G230A, G230I, G230L, G230S, G230T, G230V, G301S, I332R, L335R, and combinations of any thereof. In certain embodiments, the protein comprises one or more amino acid substitutions selected from the group consisting of G230T, G310S, 1332R, L335R, and combinations of any thereof. In some embodiments, the protein has at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16. In other embodiments, the protein has at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16. In certain embodiments, the protein comprises SEQ ID NO: 48. In further embodiments, the protein further comprises: a leucine (L) at the position corresponding to position 11 of SEQ ID NO: 2; a lysine (K) at the position corresponding to position 18 of SEQ ID NO: 2; a glutamine (Q) at the position corresponding to position 32 of SEQ ID NO: 2; a proline (P) at the position corresponding to position 35 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 48 of SEQ ID NO: 2; a glycine (G) at the position corresponding to position 54 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 82 of SEQ ID NO: 2; an aspartic acid (D) at the position corresponding to position 89 of SEQ ID NO: 2; a histidine (H) at the position corresponding to position 116 of SEQ ID NO: 2; a glutamic acid (E) at the position corresponding to position 120 of SEQ ID NO: 2; a valine (V) at the position corresponding to position 133 of SEQ ID NO: 2; a histidine (H) at the position corresponding to position 167 of SEQ ID NO: 2; a cysteine (C) at the position corresponding to position 173 of SEQ ID NO: 2; a phenylalanine (F) at the position corresponding to position 199 of SEQ ID NO: 2; a threonine (T) at the position corresponding to position 204 of SEQ ID NO: 2; a glycine (G) at the position corresponding to position 230 of SEQ ID NO: 2; a glycine (G) at the position corresponding to position 242 of SEQ ID NO: 2; a proline (P) at the position corresponding to position 256 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 274 of SEQ ID NO: 2; a lysine (K) at the position corresponding to position 279 of SEQ ID NO: 2; or a combination of any thereof.
Also provided herein is a recombinant DNA molecule comprising a nucleic acid sequence encoding a protein having triketone dioxygenase (TDO) activity, wherein the protein has at least 50% identity to SEQ ID NO: 2 and comprises one or more amino acid substitutions selected from the group consisting of I11L, A18K, K32Q, S35P, L48I, S54G, V82I, S89D, N116H, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, S242G, E256P, C274S, R279K, and combinations of any thereof. In some embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V82I, S89D, N116H, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, S242G, E256P, C274S, and R279K amino acid substitutions. In other embodiments, the protein comprises I11L, A18K, K32Q, L48I, S54G, V821, S89D, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, S242G, E256P, C274S, and R279K amino acid substitutions. In certain embodiments, the protein comprises I11L, A18K, K32Q, L48I, S54G, V82I, S89D, N116H, I133V, N167H, T173C, L199F, A204T, T230G, S242G, E256P, C274S, and R279K amino acid substitutions. In some embodiments, the protein comprises I11L, A18K, K32Q, L48I, S54G, V82I, S89D, N116H, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, E256P, C274S, and in R279K amino acid substitutions. In other embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V82I, S89D, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, E256P, C274S, and R279K amino acid substitutions. In some embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V821, S89D, N116H, I133V, N167H, T173C, L199F, A204T, T230G, E256P, C274S, and R279K amino acid substitutions. In other embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V821, S89D, N116H, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, C274S, and R279K amino acid substitutions. In further embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V82I, S89D, N116H, Q120E, I133V, N167H, T173C, L199F, A204T, T230G, E256P, and R279K amino acid substitutions. In certain embodiments, the protein comprises I11L, A18K, S35P, S54G, V82I, S89D, N116H, Q120E, N167H, T173C, L199F, A204T, T230G, C274S, and R279K amino acid substitutions. In other embodiments, the protein comprises I11L, A18K, K32Q, S35P, L48I, S54G, V82I, S89D, N116H, Q120E, I133V, N167H, T173C, A204T, T230G, S242G, E256P, and C274S amino acid substitutions. In some embodiments, the protein further comprises: a threonine (T) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; and an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2. In other embodiments, the protein has at least 80% identity to SEQ ID NO: 2. In further embodiments, the protein has at least 90% identity to SEQ ID NO: 2. In certain embodiments, the protein comprises SEQ ID NO: 18, 28, 30, 32, 34, 36, 38, 40, 42, or 44.
Provided herein is a recombinant DNA molecule comprising a nucleic acid sequence encoding a protein having triketone dioxygenase (TDO) activity, wherein the nucleic acid sequence comprises SEQ ID NO: 1. In some embodiments, the DNA molecule further comprises a heterologous promoter operably linked to the nucleic acid sequence encoding a protein having triketone dioxygenase (TDO) activity. In other embodiments, the promoter comprises a constitutive promoter. In further embodiments, the heterologous promoter is functional in a plant cell. In certain embodiments, the recombinant DNA molecule is comprised within a genome of a plant cell. DNA constructs comprising the recombinant DNA molecules and engineered proteins encoded by the recombinant DNA molecules described herein are also provided.
Also provided are transgenic plants, plant seeds, plant cells, or plant parts comprising the recombinant DNA molecule described herein. In some embodiments, the transgenic plant, plant seed, plant cell, or plant part is tolerant to at least one 4-hydroxyphenylpyruvate dioxygenase (HPPD) inhibitor herbicide. In other embodiments, the HPPD inhibitor herbicide is selected from the group consisting of mesotrione, benzobicyclon (BBC), tembotrione, sulcotrione, tefuryltrione, 2-(2-nitro-4-trifluoromethylbenzoyl)cyclohexane-1,3-dione (NTBC), pyrazolate, benzofenap, pyrazoxyfen, isoxaflutole, pyrazolynate, topramezone, and combinations of any thereof. In some embodiments, the transgenic plant, plant seed, plant cell, or plant part is tolerant to at least a second herbicide. In certain embodiments, the second herbicide is selected from the group consisting of an ACCase inhibitor, an ALS inhibitor, an EPSPS inhibitor, a synthetic auxin, a photosynthesis inhibitor, a glutamine synthesis inhibitor, a HPPD inhibitor, a PPO inhibitor, and a long-chain fatty acid inhibitor. In further embodiments, the ACCase inhibitor is an aryloxyphenoxy propionate or a cyclohexanedione; the ALS inhibitor is a sulfonylurea, imidazolinone, triazoloyrimidine, or a triazolinone; the EPSPS inhibitor is glyphosate; the synthetic auxin is a phenoxy herbicide, a benzoic acid, a carboxylic acid, or a semicarbazone; the photosynthesis inhibitor is a triazine, a triazinone, a nitrile, a benzothiadiazole, or a urea; the glutamine synthesis inhibitor is glufosinate; the HPPD inhibitor is an isoxazole, a pyrazolone, or a triketone; the PPO inhibitor is a diphenylether, a N-phenylphthalimide, an aryl triazinone, or a pyrimidinedione; or the long-chain fatty acid inhibitor is a chloroacetamide, an oxyacetamide, or a pyrazole. A commodity product comprising the recombinant DNA molecules described herein are also provided.
Further disclosed herein is a method for conferring HPPD inhibitor herbicide tolerance to a plant, plant seed, plant cell, or plant part, comprising heterologously expressing the engineered protein encoded by a recombinant DNA molecule comprising a nucleic acid sequence encoding a protein having triketone dioxygenase (TDO) activity, wherein the protein has at least 70% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16, wherein the protein comprises at least one amino acid substitution as compared to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, and wherein the protein comprises: a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), serine(S), or glycine (G) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2; a tyrosine (Y), phenylalanine (F), or histidine (H) at the position corresponding to position 141 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), serine(S), threonine (T), valine (V), or glycine (G) at the position corresponding to position 229 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), or valine (V), at the position corresponding to position 299 of SEQ ID NO: 2; or a combination of any thereof; provided that: (i) when a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), or serine(S) is present at the position corresponding to position 230 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 229 of SEQ ID NO: 2; and (ii) when an alanine (A), leucine (L), valine (V), isoleucine (I), threonine (T), or serine(S) is present at the position corresponding to position 229 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 230 of SEQ ID NO: 2 in the plant, plant seed, plant cell, or plant part.
Provided herein is a method for producing an herbicide tolerant plant, comprising: (a) transforming a plant cell with the recombinant DNA molecule described herein; and (b) regenerating a plant from the plant cell that comprises the recombinant DNA molecule. In some embodiments, the method further comprises selecting the plant or a progeny thereof for HPPD inhibitor tolerance. In other embodiments, the method comprises the step of crossing the regenerated plant with itself or with a second plant to produce progeny. Also provided herein is a method of producing a plant tolerant to an HPPD inhibitor herbicide and at least one other herbicide comprising: (a) crossing a plant provided herein with a second plant comprising tolerance to the at least one other herbicide, and (b) selecting a progeny plant resulting from said crossing that comprises tolerance to an HPPD inhibitor herbicide and the at least one other herbicide.
Also provided herein is a method for controlling or preventing weed growth in a plant growth area, comprising applying an effective amount of at least one HPPD inhibitor herbicide to a plant growth area that comprises the transgenic plants or seed provided herein, wherein the transgenic plant or seed is tolerant to the HPPD inhibitor herbicide. Also provided herein is a method of screening for a herbicide tolerance gene comprising: (a) expressing the recombinant DNA molecules of the present disclosure in a plant cell; and (b) identifying a plant cell that displays tolerance to an HPPD inhibitor herbicide. Also provided herein is a method for reducing the development of herbicide-tolerant weeds comprising: (a) cultivating in a crop growing environment a plant according to the present disclosure; and (b) applying an HPPD inhibitor herbicide and at least one other herbicide to the crop growing environment, wherein the crop plant is tolerant to the HPPD inhibitor herbicide and the at least one other herbicide.
SEQ ID NO:1 and SEQ ID NO:2 are the polynucleotide and amino acid sequences for Os.TDO, TDO from Oryza sativa. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:3 and SEQ ID NO:4 are the polynucleotide and amino acid sequences for Zm.TDO, the TDO homolog from Zea mays. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:5 and SEQ ID NO:6 are the polynucleotide and amino acid sequences for Sb.TDO, the TDO homolog from Sorghum bicolor. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:7 and SEQ ID NO:8 are the polynucleotide and amino acid sequences for Si.TDO, the TDO homolog from Setaria italica. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:9 and SEQ ID NO:10 are the polynucleotide and amino acid sequences for Ta.TDO, the TDO homolog from Triticum aestivum. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:11 and SEQ ID NO:12 are the polynucleotide and amino acid sequences for ANDge.TDO, the TDO homolog from Andropogon gerardii. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:13 and SEQ ID NO:14 are the polynucleotide and amino acid sequences for Cl.Col.TDO1, the TDO homolog from Coix lacryma-jobi. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:15 and SEQ ID NO:16 are the polynucleotide and amino acid sequences for Cl.Col.TDO2, the TDO homolog from Coix lacryma-jobi. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:17 and SEQ ID NO:18 are the polynucleotide and amino acid sequences for the Os.TDO variant D08. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:19 and SEQ ID NO:20 are the polynucleotide and amino acid sequences of the Os.TDO variant B03. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:21 and SEQ ID NO:22 are the polynucleotide and amino acid sequences for the Os.TDO variant F09. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:23 and SEQ ID NO:24 are the polynucleotide and amino acid sequences for the Os.TDO variant A04. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:25 and SEQ ID NO:26 are the polynucleotide and amino acid sequences for the Os.TDO variant B09. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:27 and SEQ ID NO:28 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_1. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:29 and SEQ ID NO:30 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_2. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:31 and SEQ ID NO:32 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_3. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:33 and SEQ ID NO:34 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_4. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:35 and SEQ ID NO:36 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_5. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:37 and SEQ ID NO:38 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_6. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:39 and SEQ ID NO:40 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_7. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:41 and SEQ ID NO:42 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_8. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:43 and SEQ ID NO:44 are the polynucleotide and amino acid sequences for the Os.TDO variant TDO_9. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:45 is a different codon-optimized polynucleotide sequence for the Os.TDO variant TDO_6. The corresponding amino acid sequence is SEQ ID NO:38.
SEQ ID NO:46 is a different codon-optimized polynucleotide sequence for the Os.TDO variant TDO_8. The corresponding amino acid sequence is SEQ ID NO:42.
SEQ ID NO:47 and SEQ ID NO:48 are the polynucleotide and amino acid sequences for the Sb.TDO variant SwM. The polynucleotide sequence was codon optimized for dicot expression.
SEQ ID NO:49 is the polynucleotide sequence for HIS1, the wild-type TDO from Oryza sativa. The corresponding amino acid sequence is SEQ ID NO:2.
SEQ ID NO:50 is the amino acid sequence for the leucoanthocyanidin dioxygenase (At.dioxy) from Arabidopsis thaliana. Leucoanthocyanidin dioxygenase is also known as anthocyanidin synthase.
SEQ ID NO:51 is the amino acid sequence for Gm.TDO, the TDO homolog from Glycine max.
SEQ ID NO:52 is the amino acid sequence for At.TDO, the TDO homolog from Arabidopsis thaliana.
SEQ ID NO:53 is the amino acid sequence for Nt.TDO, the TDO homolog from Nicotiana tabacum.
SEQ ID NO:54 is the amino acid sequence for the anthocyanidin synthase from Arabidopsis thaliana. It is identical to SEQ ID NO:50 with the exception of lacking the first methionine. SEQ ID NO: 54 is also referred to herein as 2BRT.
In plants, hydroxyphenylpyruvate dioxygenases (HPPD) are enzymes that are significant in catalyzing the biosynthesis of plastoquinone and tocopherols. Plastoquinone is a cofactor for phytoene desaturase, a key enzyme in the production of carotenoids. Carotenoids protect chlorophyll from degradation and are required for photosynthesis. Carotenoids and tocopherols are responsible for the detoxification of reactive oxygen species and scavenging of free radicals in plant tissues. Inhibition of HPPD leads to uncoupling of photosynthesis, deficiency in accessory light-harvesting pigments and, most importantly, to destruction of chlorophyll by UV-radiation and reactive oxygen species (bleaching) due to the lack of photoprotection normally provided by carotenoids (Norris et al., 1995). Bleaching of photosynthetically active tissues leads to growth inhibition and plant death. Some molecules that inhibit HPPD have proven to be very effective herbicides. The triketone family is one of the most widely used HPPD inhibitor herbicides.
The present disclosure overcomes the limitations of the prior art by providing novel, engineered proteins, referred to herein as TDO proteins, and the recombinant DNA molecules that encode them as well as compositions and methods using these. The TDO proteins are proteins containing a Fe2OG dioxygenase domain that have the ability to inactivate triketone herbicides. As used herein, inactivating an herbicide means making the herbicide no longer have its herbicidal activity against a plant. The TDO proteins exhibit novel substrate selectivity, useful enzyme kinetics, and increased enzyme stability at elevated temperature. Transgenic plants expressing a TDO protein demonstrates improved tolerance to application of triketone herbicides.
Provided herein are novel, engineered proteins and the recombinant DNA molecules that encode them. As used herein, a TDO, “triketone dioxygenase,” or a protein having “triketone dioxygenase activity” refers to a protein, specifically a Fe(II)/2-oxoglutarate-dependent dioxygenase, or a protein containing a Fe2OG dioxygenase domain that has the ability to hydroxylate a triketone herbicide. As used herein, the term “engineered” refers to a non-natural DNA, protein, cell, or organism that would not normally be found in nature and was created by human intervention. An “engineered protein,” “engineered enzyme,” or “engineered triketone dioxygenase,” refers to a protein, enzyme, or triketone dioxygenase whose amino acid sequence was conceived of and created in the laboratory using one or more of the techniques of biotechnology, protein design, or protein engineering, such as molecular biology, protein biochemistry, bacterial transformation, plant transformation, site-directed mutagenesis, directed evolution using random mutagenesis, genome editing, gene editing, gene cloning, DNA ligation, DNA synthesis, protein synthesis, and DNA shuffling. For example, an engineered protein may have one or more deletions, insertions, or substitutions relative to the coding sequence of the wild-type protein and each deletion, insertion, or substitution may consist of one or more amino acids. Genetic engineering can be used to create a DNA molecule encoding an engineered protein, such as an engineered TDO protein and comprises at least a first amino acid substitution relative to a wild-type TDO protein as described herein.
Examples of engineered proteins provided herein are Fe2OG dioxygenase domain containing proteins having triketone dioxygenase activity (referred herein as “TDO proteins” or “TDO enzymes”) comprising at least 70% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16, wherein the protein comprises at least one amino acid substitution as compared to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, and wherein the protein comprises: a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), serine(S), or glycine (G) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2; a tyrosine (Y), phenylalanine (F), or histidine (H) at the position corresponding to position 141 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), serine(S), threonine (T), valine (V), or glycine (G) at the position corresponding to position 229 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), or valine (V), at the position corresponding to position 299 of SEQ ID NO: 2; or a combination of any thereof; provided that: (i) when a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), or serine(S) is present at the position corresponding to position 230 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 229 of SEQ ID NO: 2; and (ii) when an alanine (A), leucine (L), valine (V), isoleucine (I), threonine (T), or serine(S) is present at the position corresponding to position 229 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 230 of SEQ ID NO: 2. In specific embodiments, an engineered protein provided herein comprises one, two, three, four, five, six, seven, eight, nine, ten, or more of any combination of such substitutions.
Engineered proteins are enzymes that have triketone dioxygenase activity. As used herein, “triketone dioxygenase activity” means the ability of a protein to inactivate one or more HPPD inhibitor herbicide(s). The term “triketone dioxygenase activity” means the ability to catalyze the hydroxylation of a triketone herbicide, such as mesotrione, thereby inactivating the herbicide. Enzymatic activity of a triketone dioxygenase can be measured by any means known in the art, for example, by an enzymatic assay in which the second hydroxylation of a triketone, such as production of oxy-mesotrione from mesotrione in the presence of a protein having triketone dioxygenase activity is measured via fluorescence, high performance liquid chromatography (HPLC), or mass spectrometry (MS).
As used herein, “wild-type” means naturally-occurring. As used herein, a “wild-type DNA molecule”, “wild-type polypeptide”, or a “wild-type protein” is a naturally-occurring DNA molecule, polypeptide, or protein, that is, a DNA molecule, polypeptide, or protein pre-existing in nature. A wild-type version of a polypeptide, protein, or DNA molecule may be useful for comparison with an engineered protein or gene. An example of a wild-type protein useful for comparison with the engineered proteins provided by the present disclosure is the HIS1 enzyme from Oryza sativa. An example of a wild-type DNA molecule useful for comparison with the recombinant DNA molecules provided by the present disclosure is the HIS1 gene from Oryza sativa. A wild-type version of a protein or DNA molecule may be useful as a control in an experiment.
A “wild-type plant” is a naturally occurring plant. Such wild-type plants may also be useful for comparison with a plant comprising a recombinant or engineered DNA molecule or protein. An example of a wild-type plant useful for comparison with plants comprising a recombinant or engineered DNA molecule or protein may be a plant of the same type as the plant comprising the engineered DNA molecule or protein, such as a protein conferring an herbicide tolerance trait, and as such is genetically distinct from the plant comprising the herbicide tolerance trait.
As used herein, “control” means an experimental control designed for comparison purposes. For example, a control plant in a transgenic plant analysis is a plant of the same type as the experimental plant (that is, the plant to be tested) but does not contain the transgenic insert, recombinant DNA molecule, DNA construct, or variant protein or gene of the experimental plant. Examples of control plants useful for comparison with transgenic plants include: for maize plants, non-transgenic LH244 maize (ATCC deposit number PTA-1173); for comparison with soybean plants: non-transgenic A3555 soybean (ATCC deposit number PTA-10207); for comparison with cotton plants: non-transgenic Coker 130 (Plant Variety Protection (PVP) Number 8900252); for comparison with canola or Brassica napus plants: non-transgenic Brassica napus variety 65037 Restorer line (Canada Plant Breeders' Rights Application 06-5517); for comparison with wheat plants: non-transgenic wheat variety Samson germplasm (PVP 1994). As used herein, the term “recombinant” refers to a non-naturally occurring DNA, protein, cell, seed, or organism that is the result of genetic engineering and was created by human intervention. A “recombinant DNA molecule” is a DNA molecule comprising a DNA sequence that does not naturally occur and as such is the result of human intervention, such as a DNA molecule comprising at least two DNA molecules heterologous to each other. An example of a recombinant DNA molecule is a DNA molecule provided herein encoding a protein having triketone dioxygenase activity operably linked to a heterologous promoter. A “recombinant protein” is a protein comprising an amino acid sequence that does not naturally occur and as such is the result of human intervention, such as an engineered protein. A recombinant cell, seed, or organism is a cell, seed, or organism comprising transgenic or heterologous DNA or protein, for example a transgenic plant cell, seed, or plant comprising a DNA construct or engineered protein of the present disclosure.
As used herein, the term “DNA” or “DNA molecule” refers to a double-stranded DNA molecule of genomic or synthetic origin (that is, a polymer of deoxyribonucleotide bases or a polynucleotide molecule) read from the 5′ (upstream) end to the 3′ (downstream) end. As used herein, the term “DNA sequence” refers to the nucleotide sequence of a DNA molecule. The nomenclature used herein corresponds to that of by Title 37 of the United States Code of Federal Regulations § 1.822, and set forth in the tables in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3.
The present disclosure provides a nucleic acid molecule encoding a protein having triketone dioxygenase activity comprising at least 70% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, and 16, wherein the protein comprises at least one amino acid substitution as compared to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, and wherein the protein comprises: a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), serine(S), or glycine (G) at the position corresponding to position 230 of SEQ ID NO: 2; a serine(S) at the position corresponding to position 301 of SEQ ID NO: 2; an arginine (R) at the position corresponding to position 332 of SEQ ID NO: 2; an isoleucine (I) at the position corresponding to position 335 of SEQ ID NO: 2; a tyrosine (Y), phenylalanine (F), or histidine (H) at the position corresponding to position 141 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), serine(S), threonine (T), valine (V), or glycine (G) at the position corresponding to position 229 of SEQ ID NO: 2; an alanine (A), isoleucine (I), leucine (L), or valine (V), at the position corresponding to position 299 of SEQ ID NO: 2; or a combination of any thereof; provided that: (i) when a threonine (T), alanine (A), leucine (L), valine (V), isoleucine (I), or serine(S) is present at the position corresponding to position 230 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 229 of SEQ ID NO: 2; and (ii) when an alanine (A), leucine (L), valine (V), isoleucine (I), threonine (T), or serine(S) is present at the position corresponding to position 229 of SEQ ID NO: 2, a glycine (G) is present at the position corresponding to position 230 of SEQ ID NO: 2.
As used herein, the term “protein-coding DNA molecule” refers to a DNA molecule comprising a DNA sequence that encodes a protein. As used herein, the term “protein” refers to a chain of amino acids linked by peptide (amide) bonds and includes both polypeptide chains that are folded or arranged in a biologically functional way and polypeptide chains that are not. As used herein, a “protein-coding sequence” means a DNA sequence that encodes a protein. As used herein, a “sequence” means a sequential arrangement of nucleotides or amino acids. A “DNA sequence” may refer to a sequence of nucleotides or to the DNA molecule comprising of a sequence of nucleotides; a “protein sequence” may refer to a sequence of amino acids or to the protein comprising a sequence of amino acids. The boundaries of a protein-coding sequence are usually determined by a translation start codon at the 5′-terminus and a translation stop codon at the 3′-terminus.
Engineered proteins may be produced by changing or modifying a wild-type protein sequence to produce a new protein with modified characteristic(s) or a novel combination of useful protein characteristics, such as altered Vmax, Km, Ki, IC50, substrate specificity, inhibitor/herbicide specificity, substrate selectivity, ability to interact with other components in the cell such as partner proteins or membranes, and protein stability, among others. Modifications may be made at specific amino acid positions in a protein and may be made by substituting an alternate amino acid for the typical amino acid found at that same position in nature (that is, in the wild-type protein). Amino acid modifications may be made as a single amino acid substitution in the protein sequence or in combination with one or more other modifications, such as one or more other amino acid substitution(s), deletions, or additions. In some embodiments, an engineered protein has altered protein characteristics, such as those that result in decreased sensitivity to one or more herbicides as compared to the wild-type protein or ability to confer tolerance to one or more herbicides on a transgenic plant expressing the engineered protein. In other embodiments, the present disclosure therefore provides an engineered protein such as a TDO protein, and the recombinant DNA molecule encoding it, having one or more amino acid substitution(s) selected from the group consisting of H141F, H141Y, G229A, G229I, G229L, G229S, G229T, G229V, T230A, T230G, T230I, T230L, T230S, T230V, L299A, L2991, L299V, and all combinations thereof, wherein the position of the amino acid substitution(s) is relative to the amino acid position set forth in SEQ ID NO: 2. In specific embodiments, an engineered protein provided herein comprises one, two, three, four, five, six, seven, eight, nine, ten, or more of any combination of such substitutions, wherein the modification is made at a position relative to a position comparable in function to that in the amino acid sequence provided as SEQ ID NO:2.
Similar modifications can be made in analogous positions of any protein containing a Fe2OG dioxygenase domain by alignment of the amino acid sequence of the Fe2OG dioxygenase domain containing protein to be mutated with the amino acid sequence of a Fe2OG dioxygenase domain containing protein that has triketone dioxygenase activity. One example of a sequence coding a Fe(II)/2-oxoglutarate-dependent dioxygenase that has triketone dioxygenase activity that can be used for alignment is SEQ ID NO:2.
As used herein, the term “isolated” refers to at least partially separating a molecule from other molecules typically associated with it in its natural state. As used herein, the term “isolated” refers to a DNA molecule that is separated from the nucleic acids that normally flank the DNA molecule in its natural state. For example, a DNA molecule encoding a protein that is naturally present in a bacterium would be an isolated DNA molecule if it was not within the DNA of the bacterium from which the DNA molecule encoding the protein is naturally found. Thus, a DNA molecule fused to or operably linked to one or more other DNA molecule(s) with which it would not be associated in nature, for example as the result of recombinant DNA or plant transformation techniques, is considered isolated herein. Such molecules are considered isolated even when integrated into the chromosome of a host cell or present in a nucleic acid solution with other DNA molecules.
Any number of methods well known to those skilled in the art can be used to isolate and manipulate a DNA molecule, or fragment thereof, as disclosed herein. For example, polymerase chain reaction (PCR) technology can be used to amplify a particular starting DNA molecule or to produce variants of the original molecule. DNA molecules, or fragment thereof, can also be obtained by other techniques, such as by directly synthesizing the fragment by chemical means, as is commonly practiced by using an automated oligonucleotide synthesizer.
Because of the degeneracy of the genetic code, a variety of different DNA sequences can encode proteins, such as the altered or engineered proteins disclosed herein. For example,
Recombinant DNA molecules provided herein may be synthesized and modified by methods known in the art, either completely or in part, where it is desirable to provide sequences useful for DNA manipulation (such as restriction enzyme recognition sites or recombination-based cloning sites), plant-preferred sequences (such as plant-codon usage or Kozak consensus sequences), or sequences useful for DNA construct design (such as spacer or linker sequences). The present disclosure includes recombinant DNA molecules and engineered proteins having at least 50% sequence identity, at least 60% sequence identity, at least 70% sequence identity, at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, and at least 99% sequence identity to any of the recombinant DNA molecule or amino acid sequences provided herein, and having triketone dioxygenase activity. As used herein, the term “percent sequence identity” or “% sequence identity” refers to the percentage of identical nucleotides or amino acids in a linear polynucleotide or amino acid sequence of a reference (“query”) sequence (or its complementary strand) as compared to a test (“subject”) sequence (or its complementary strand) when the two sequences are optimally aligned (with appropriate nucleotide or amino acid insertions, deletions, or gaps totaling less than 20 percent of the reference sequence over the window of comparison). Optimal alignment of sequences for aligning a comparison window are well known to those skilled in the art and may be conducted by tools such as the local homology algorithm of Smith and Waterman, the homology alignment algorithm of Needleman and Wunsch, the search for similarity method of Pearson and Lipman, and by computerized implementations of these algorithms such as GAP, BESTFIT, FASTA, and TFASTA available as part of the Sequence Analysis software package of the GCG® Wisconsin Package® (Accelrys Inc., San Diego, CA), MEGAlign (DNAStar Inc., 1228 S. Park St., Madison, WI 53715), and MUSCLE (version 3.6) (RC Edgar, “MUSCLE: multiple sequence alignment with high accuracy and high throughput” Nucleic Acids Research 32 (5): 1792-7 (2004)) for instance with default parameters. An “identity fraction” for aligned segments of a test sequence and a reference sequence is the number of identical components that are shared by the two aligned sequences divided by the total number of components in the portion of the reference sequence segment being aligned, that is, the entire reference sequence or a smaller defined part of the reference sequence. Percent sequence identity is represented as the identity fraction multiplied by 100. The comparison of one or more sequences may be to a full-length sequence or a portion thereof, or to a longer sequence.
As used herein, a “DNA construct” is a recombinant DNA molecule comprising two or more heterologous DNA sequences. DNA constructs are useful for transgene expression and may be comprised in vectors and plasmids. DNA constructs may be used in vectors for transformation (that is, the introduction of heterologous DNA into a host cell) to produce recombinant bacteria or transgenic plants and cells (and as such may also be contained in the plastid DNA or genomic DNA of a transgenic plant, seed, cell, or plant part). As used herein, a “vector” means any recombinant DNA molecule that may be used for bacterial or plant transformation. DNA molecules provided herein can, for example, be inserted into a vector as part of a DNA construct having the DNA molecule operably linked to a heterologous gene expression element that functions in a plant to affect expression of the engineered protein encoded by the DNA molecule. Methods for making and using DNA constructs and vectors are well known in the art and described in detail in, for example, handbooks and laboratory manuals including Michael R. Green and Joseph Sambrook, “Molecular Cloning: A Laboratory Manual” (Fourth Edition) ISBN: 978-1-936113-42-2, Cold Spring Harbor Laboratory Press, NY (2012). The components for a DNA construct, or a vector comprising a DNA construct, include one or more gene expression elements operably linked to a transcribable nucleic acid sequence, such as the following: a promoter for the expression of an operably linked DNA, an operably linked protein-coding DNA molecule, and an operably linked 3′ untranslated region (UTR). Gene expression elements that are useful include, but are not limited to, one or more of the following type of elements: promoter, 5′ UTR, enhancer, leader, cis-acting element, intron, transit sequence, 3′ UTR, and one or more selectable marker transgenes.
The term “transgene” refers to a DNA molecule artificially incorporated into the genome of an organism as a result of human intervention, such as by plant transformation methods. As used herein, the term “transgenic” means comprising a transgene, for example a “transgenic plant” refers to a plant comprising a transgene in its genome and a “transgenic trait” refers to a characteristic or phenotype conveyed or conferred by the presence of a transgene incorporated into the plant genome. As a result of such genomic alteration, the transgenic plant is something distinctly different from the related wild-type plant and the transgenic trait is a trait not naturally found in the wild-type plant. Transgenic plants provided herein comprise the recombinant DNA molecules and engineered proteins of the present disclosure.
As used herein, the term “heterologous” refers to the relationship between two or more things not normally associated in nature, for instance that are derived from different sources or not normally found in nature together in any other manner. For example, a DNA molecule or protein may be heterologous with respect to another DNA molecule, protein, cell, plant, seed, or organism if not normally found in nature together or in the same context. In certain embodiments, a first DNA molecule is heterologous to a second DNA molecule if the two DNA molecules are not normally found in nature together in the same context. For instance, a protein-coding recombinant DNA molecule is heterologous with respect to an operably linked promoter if such a combination is not normally found in nature. Similarly, a protein is heterologous with respect to a second operably linked protein, such as a transit peptide, if such combination is not normally found in nature. In another embodiment, a recombinant DNA molecule encoding a TDO protein is heterologous with respect to an operably linked promoter that is functional in a plant cell if such combination is not normally found in nature. A recombinant DNA molecule also may be heterologous with respect to a cell, seed, or organism into which it is inserted when it would not naturally occur in that cell, seed, or organism.
A “heterologous protein” is a protein present in a plant, seed, cell, tissue, or organism in which it does not naturally occur or operably linked to a protein with which it is not naturally linked. An example of a heterologous protein is an engineered TDO protein comprising at least a first amino acid substitution described herein that is expressed in any plant, seed, cell, tissue, or organism. Another example is a protein operably linked to a second protein, such as a transit peptide, with which it is not naturally linked, or a protein introduced into a plant cell in which it does not naturally occur using the techniques of genetic engineering.
As used herein, “operably linked” means two or more DNA molecules or two or more proteins linked in manner so that one may affect the function of the other. Operably linked DNA molecules or operably linked proteins may be part of a single contiguous molecule and may or may not be adjacent. For example, a promoter is operably linked with a protein-coding DNA molecule in a DNA construct where the two DNA molecules are so arranged that the promoter may affect the expression of the transgene.
The DNA constructs of the present disclosure may include a promoter operably linked to a protein-coding DNA molecule provided herein, whereby the promoter drives expression of the engineered protein. Promoters useful in practicing the contemplated embodiments include those that function in a cell for expression of an operably linked DNA molecule, such as a bacterial or plant promoter. Plant promoters are varied and well known in the art and include, for instance, those that are inducible, viral, synthetic, constitutive, temporally regulated, spatially regulated, or spatio-temporally regulated.
In some embodiments, a DNA construct provided herein includes a DNA sequence encoding a transit sequence that is operably linked to a heterologous DNA sequence encoding a TDO protein, whereby the transit sequence facilitates localizing the protein molecule within the cell. Transit sequences are known in the art as signal sequences, targeting peptides, targeting sequences, localization sequences, and transit peptides. An example of a transit sequence is a chloroplast transit peptide (CTP), a mitochondrial transit sequence (MTS), or a dual chloroplast and mitochondrial transit peptide. By facilitating protein localization within the cell, the transit sequence may increase the accumulation of recombinant protein, protect the protein from proteolytic degradation, or enhance the level of herbicide tolerance, and thereby reduce levels of injury in the cell, seed, or organism after herbicide application. CTPs and other targeting molecules that may be used in connection with the present disclosure are well known in the art.
As used herein, “transgene expression”, “expressing a transgene”, “protein expression”, and “expressing a protein” mean the production of a protein through the process of transcribing a DNA molecule into messenger RNA (mRNA) and translating the mRNA into polypeptide chains, which are ultimately folded into proteins. A protein-coding DNA molecule may be operably linked to a heterologous promoter in a DNA construct for use in expressing the protein in a cell transformed with the recombinant DNA molecule.
In one aspect, the present disclosure provides cells, tissues, plants, and seeds that comprising the recombinant DNA molecules or engineered proteins described herein. These cells, tissues, plants, and seeds comprising the recombinant DNA molecules or engineered proteins exhibit tolerance to one or more HPPD inhibitor herbicide(s).
One method of producing such cells, tissues, plants, and seeds is through plant transformation. Suitable methods for transformation of host plant cells for use with the present disclosure include any method by which DNA can be introduced into a cell (for example, where a recombinant DNA construct is stably integrated into a plant chromosome) and are well known in the art. Two effective, and widely utilized, methods for cell transformation are Agrobacterium-mediated transformation and microprojectile bombardment-mediated transformation. Microprojectile bombardment methods are illustrated, for example, in US Patent Nos. U.S. Pat. Nos. 5,550,318; 5,538,880; 6,160,208; and 6,399,861. Agrobacterium-mediated transformation methods are described, for example in US Patent No. U.S. Pat. No. 5,591,616. A cell with a recombinant DNA molecule or engineered protein of the present disclosure may be selected for the presence of the recombinant DNA molecule or engineered protein, for instance through its encoded enzymatic activity, before or after regenerating such a cell into a plant.
According to some embodiments, another method of producing the cells, plants, and seeds is through genome modification using site-specific integration or genome editing. Targeted modification of plant genomes through the use of genome editing methods can be used to create improved plant lines through modification of plant genomic DNA. As used herein “site-directed integration” refers to genome editing methods that enable targeted insertion of one or more nucleic acids of interest into a plant genome. Suitable methods for altering a wild-type DNA sequence or a preexisting transgenic sequence or for inserting DNA into a plant genome at a pre-determined chromosomal site include any method known in the art. Exemplary methods include the use of sequence specific nucleases, such as zinc-finger nucleases, engineered or native meganucleases, TALE-endonucleases, or an RNA-guided endonucleases (for example, a Clustered Regularly Interspersed Short Palindromic Repeat (CRISPR)/Cas9 system, a CRISPR/Cpf1 system, a CRISPR/CasX system, a CRISPR/CasY system, a CRISPR/Cascade system). Several embodiments relate to methods of genome editing by using single-stranded oligonucleotides to introduce precise base pair modifications in a plant genome, as described by Sauer et al., Plant Physiology 170 (4): 1917-1928 (2016). Methods of genome editing to modify, delete, or insert nucleic acid sequences into genomic DNA are known in the art.
In some embodiments, the modification or replacement of an existing coding sequence, such as a TDO coding sequence or another existing transgenic insert, within a plant genome with a sequence encoding an engineered protein, such as an engineered TDO coding sequence provided herein, or an expression cassette encoding such an engineered protein is provided. Several embodiments relate to the use of a known genome editing methods, such as zinc-finger nucleases, engineered or native meganucleases, TALE-endonucleases, or an RNA-guided endonucleases (for example, a Clustered Regularly Interspersed Short Palindromic Repeat (CRISPR)/Cas9 system, a CRISPR/Cpf1 system, a CRISPR/CasX system, a CRISPR/CasY system, a CRISPR/Cascade system).
Several embodiments may therefore relate to a recombinant DNA construct comprising an expression cassette(s) encoding a site-specific nuclease and, optionally, any associated protein(s) to carry out genome modification. These nuclease-expressing cassette(s) may be present in the same molecule or vector as a donor template for templated editing or an expression cassette comprising nucleic acid sequence encoding a TDO protein as described herein (in cis) or on a separate molecule or vector (in trans). Several methods for site-directed integration are known in the art involving different sequence-specific nucleases (or complexes of proteins or guide RNA or both) that cut the genomic DNA to produce a double strand break (DSB) or nick at a desired genomic site or locus. As understood in the art, during the process of repairing the DSB or nick introduced by the nuclease enzyme, the donor template DNA, transgene, or expression cassette may become integrated into the genome at the site of the DSB or nick. The presence of the homology arm(s) in the DNA to be integrated may promote the adoption and targeting of the insertion sequence into the plant genome during the repair process through homologous recombination, although an insertion event may occur through non-homologous end joining (NHEJ).
As used herein, the term “double-strand break inducing agent” refers to any agent that can induce a double-strand break (DSB) in a DNA molecule. In some embodiments, the double-strand break inducing agent is a site-specific genome modification enzyme.
As used herein, the term “site-specific genome modification enzyme” refers to any enzyme that can modify a nucleotide sequence in a sequence-specific manner. In some embodiments, a site-specific genome modification enzyme modifies the genome by inducing a single-strand break. In some embodiments, a site-specific genome modification enzyme modifies the genome by inducing a double-strand break. In some embodiments, a site-specific genome modification enzyme comprises a cytidine deaminase. In some embodiments, a site-specific genome modification enzyme comprises an adenine deaminase. In the present disclosure, site-specific genome modification enzymes include endonucleases, recombinases, transposases, deaminases, helicases and any combination thereof. In some embodiments, the site-specific genome modification enzyme is a sequence-specific nuclease.
In one aspect, the endonuclease is selected from a meganuclease, a zinc-finger nuclease (ZFN), a transcription activator-like effector nucleases (TALEN), an Argonaute (non-limiting examples of Argonaute proteins include Thermus thermophilus Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi Argonaute (NgAgo), an RNA-guided nuclease, such as a CRISPR associated nuclease (non-limiting examples of CRISPR associated nucleases include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, Cpf1, CasX, CasY, homologs thereof, or modified versions thereof).
In some embodiments, the site-specific genome modification enzyme is a recombinase. Non-limiting examples of recombinases include a tyrosine recombinase attached to a DNA recognition motif provided herein is selected from the group consisting of a Cre recombinase, a Gin recombinase, a Flp recombinase, and a Tnp1 recombinase. In an aspect, a Cre recombinase or a Gin recombinase provided herein is tethered to a zinc-finger DNA-binding domain, or a TALE DNA-binding domain, or a Cas9 nuclease. In another aspect, a serine recombinase attached to a DNA recognition motif provided herein is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase. In another aspect, a DNA transposase attached to a DNA binding domain provided herein is selected from the group consisting of a TALE-piggyBac and TALE-Mutator.
Any of the DNA of interest provided herein can be integrated into a target site of a chromosome sequence by introducing the DNA of interest and the provided site-specific genome modification enzymes. Any method provided herein can utilize any site-specific genome modification enzyme provided herein.
Provided herein are cells, plants, and seeds that are tolerant to HPPD inhibitor herbicides. Such cells, plants, and seeds are useful in the methods of agriculture, such as weed control and crop production.
As used herein, “herbicide” is any molecule that is used to control, prevent, or interfere with the growth of one or more plants. Exemplary herbicides include acetyl-CoA carboxylase (ACCase) inhibitors (for example aryloxyphenoxy propionates and cyclohexanediones); acetolactate synthase (ALS) inhibitors (for example sulfonylureas, imidazolinones, triazolopyrimidines, and triazolinones); 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) inhibitors (for example glyphosate), synthetic auxins (for example phenoxys, benzoic acids, carboxylic acids, semicarbazones), photosynthesis (photosystem II) inhibitors (for example triazines, triazinones, nitriles, benzothiadiazoles, and ureas), glutamine synthetase (GS) inhibitors (for example glufosinate and bialaphos), 4-hydroxyphenylpyruvate dioxygenase (HPPD) inhibitors (for example isoxazoles, pyrazolones, and triketones), protoporphyrinogen oxidase (PPO) inhibitors (for example diphenylethers, N-phenylphthalimide, aryl triazinones, and pyrimidinediones), very long-chain fatty acid inhibitors (for example chloroacetamides, oxyacetamides, and pyrazoles), cellulose biosynthesis inhibitors (for example indaziflam), photosystem I inhibitors (for example paraquat), microtubule assembly inhibitors (for example pendimethalin), and phytoene desaturase (PDS) inhibitors (for example norflurazone), among others.
As used herein, a “HPPD herbicide” is a chemical that targets and inhibits the enzymatic activity of 4-hydroxyphenyl-pyruvate-dioxygenase (HPPD), which is an Fe-containing enzyme that catalyzes the second reaction in the catabolism of tyrosine, the conversion of 4-hydroxyphenylpyruvate to homogentisate. Inhibition of 4-hydroxyphenyl-pyruvate-dioxygenase prevents plant carotenoid pigment formation, which in turn leads to chlorophyll degradation and eventually leads to lipid peroxidation of cell membranes. HPPD inhibitor herbicides are well-known in the art and commercially available. Examples of HPPD inhibitor herbicides include, but are not limited to, aclonifen, amitrole, beflubutamid, benzofenap, clomazone, diflufenican, fluridone, flurochloridone, flurtamone, isoxachlortole, isoxaflutole, mesotrione, norflurazon, picolinafen, pyrazolynate, pyrazoxyfen, sulcotrione, tembotrione, tolpyralate, topramezone and tefuryltrione salts and esters thereof, and mixtures thereof. Proteins having triketone dioxygenase activity and cells, seeds, plants, and plant parts provided by the present disclosure exhibit herbicide-tolerance to one or more HPPD inhibitor herbicide(s).
As used herein, “herbicide-tolerant” or “herbicide-tolerance” means the ability to be wholly or partially unaffected by the presence or application of one of more herbicide(s), for example to resist the toxic effects of an herbicide when applied. A cell or organism is “herbicide-tolerant” if it is able to maintain at least some normal growth or phenotype in the presence of one or more herbicide(s). A trait is an herbicide-tolerance trait if its presence can confer improved tolerance to an herbicide upon a cell, plant, or seed as compared to the wild-type or control cell, plant, or seed. Crops comprising a herbicide-tolerance trait can continue to grow and are minimally affected by the presence of the herbicide. A target enzyme is “herbicide-tolerant” if it exhibits improved enzyme activity relative to a wild-type or control enzyme in the presence of the herbicide. Herbicide-tolerance may be complete or partial insensitivity to a particular herbicide, and may be expressed as a percent (%) tolerance or insensitivity to a particular herbicide.
Contemplated plants which might be produced with an herbicide tolerance trait provided herein could include, for instance, any plant including crop plants such as soybean (Glycine max), maize (Zea mays), cotton (Gossypium sp.), and Brassica plants, among others.
Herbicides may be applied to a plant growth area comprising the plants and seeds provided by the present disclosure as a method for controlling weeds. Provided herein are plants and seeds that comprise an herbicide tolerance trait and as such are tolerant to the application of one or more HPPD inhibitor herbicides. The herbicide application may be the recommended commercial rate (1X) or any fraction or multiple thereof, such as twice the recommended commercial rate (2X). Herbicide rates may be expressed as acid equivalent per pound per acre (lb ae/acre) or acid equivalent per gram per hectare (g ae/ha) or as pounds active ingredient per acre (lb ai/acre) or grams active ingredient per hectare (g ai/ha), depending on the herbicide and the formulation. The herbicide application comprises at least one HPPD inhibitor herbicide. The plant growth area may or may not comprise weed plants at the time of herbicide application. A herbicidally-effective dose of HPPD inhibitor herbicide(s) for use in an area for controlling weeds may consist of a range from about 0.1X to about 30X label rate(s) over a growing season. One (1) acre is equivalent to 2.47105 hectares and one (1) pound is equivalent to 453.592 grams. Herbicide rates can be converted between English and metric as: (lb ai/ac) multiplied by 1.12=(kg ai/ha) and (kg ai/ha) multiplied by 0.89=(lb ai/ac).
Herbicide applications may be sequential or tank mixed with one, two, or a combination of several HPPD inhibitor herbicides or any other compatible herbicide. Multiple applications of one herbicide or of two or more herbicides, in combination or alone, may be used over a growing season to areas comprising transgenic plants of the present disclosure for the control of a broad spectrum of dicot weeds, monocot weeds, or both, for example, two applications (such as a pre-planting application and a post-emergence application or a pre-emergence application and a post-emergence application) or three applications (such as a pre-planting application, a pre-emergence application, and a post-emergence application or a pre-emergence application and two post-emergence applications).
As used herein, a “weed” is any undesired plant. A plant may be considered generally undesirable for agriculture or horticulture purposes (for example, Amaranthus species) or may be considered undesirable in a particular situation (for example, a crop plant of one species in a field of a different species, also known as a volunteer plant).
The transgenic plants, progeny, seeds, plant cells, and plant parts of the present disclosure may also contain one or more additional traits. Additional traits may be introduced by crossing a plant containing a transgene comprising the recombinant DNA molecules provided by the present disclosure with another plant containing one or more additional trait(s). As used herein, “crossing” means breeding two individual plants to produce a progeny plant. Two plants may thus be crossed to produce progeny that contain the desirable traits from each parent. As used herein “progeny” means the offspring of any generation of a parent plant, and transgenic progeny comprise a DNA construct provided by the present disclosure and inherited from at least one parent plant.
Additional trait(s) also may be introduced by co-transforming a DNA construct for that additional transgenic trait(s) with a DNA construct comprising the recombinant DNA molecules provided by the present disclosure (for example, with all the DNA constructs present as part of the same vector used for plant transformation) or by inserting the additional trait(s) into a transgenic plant comprising a DNA construct provided herein or vice versa (for example, by using any of the methods of plant transformation or genome editing on a transgenic plant or plant cell). Such additional traits include, but are not limited to, increased insect resistance, increased water use efficiency, increased yield performance, increased drought resistance, increased seed quality, improved nutritional quality, hybrid seed production, and herbicide-tolerance, in which the trait is measured with respect to a wild-type plant. Exemplary additional herbicide-tolerance traits may include transgenic or non-transgenic tolerance to one or more herbicides such as ACCase inhibitors (for example aryloxyphenoxy propionates and cyclohexanediones), ALS inhibitors (for example sulfonylureas, imidazolinones, triazolopyrimidines, and triazolinones) EPSPS inhibitors (for example glyphosate), synthetic auxins (for example phenoxys, benzoic acids, carboxylic acids, semicarbazones), photosynthesis inhibitors (for example triazines, triazinones, nitriles, benzothiadiazoles, and ureas), glutamine synthesis inhibitors (for example glufosinate), HPPD inhibitors (for example isoxazoles, pyrazolones, and triketones), PPO inhibitors (for example diphenylethers, N-phenylphthalimide, aryl triazinones, and pyrimidinediones), and long-chain fatty acid inhibitors (for example chloroacetamindes, oxyacetamides, and pyrazoles), among others. Examples of herbicide-tolerance proteins useful for producing additional herbicide-tolerance traits are well known in the art and include, but are not limited to, glyphosate-tolerant 5-enolypyruvyl shikimate 3-phosphate synthases (e.g., CP4 EPSPS, 2mEPSPS), glyphosate oxidoreductases (GOX), glyphosate N-acetyltransferases (GAT), herbicide-tolerant acetolactate synthases (ALS) acetohydroxyacid synthases (AHAS), herbicide-tolerant 4-hydroxyphenylpyruvate dioxygenases (HPPD), dicamba monooxygenases (DMO), phosphinothricin acetyl transferases (PAT), herbicide-tolerant glutamine synthetases (GS), 2,4-dichlorophenoxyproprionate dioxygenases (TfdA), R-2,4-dichlorophenoxypropionate dioxygenases (RdpA), S-2,4-dichlorophenoxypropionate dioxygenases (SdpA), herbicide-tolerant protoporphyrinogen oxidases (PPO), and cytochrome P450 monooxygenases. Exemplary insect resistance traits may include resistance to one or more insect members within one or more of the orders of Lepidoptera, Coleoptera, Hemiptera, Thysanoptera, Diptera, Hymenoptera, and Orthoptera, among others. Such additional traits are well known to one of skill in the art; for example, and a list of such transgenic traits is provided by the United States Department of Agriculture's (USDA) Animal and Plant Health Inspection Service (APHIS).
Transgenic plants and progeny that are tolerant to HPPD inhibitor herbicides may be used with any breeding methods that are known in the art. In plant lines comprising two or more traits, the traits may be independently segregating, linked, or a combination of both in plant lines comprising three or more transgenic traits. Backcrossing to a parental plant and out-crossing with a non-transgenic plant are also contemplated, as is vegetative propagation. Descriptions of breeding methods that are commonly used for different traits and crops are well known to those of skill in the art. To confirm the presence of the transgene(s) in a particular plant or seed, a variety of assays may be performed. Such assays include, for example, molecular biology assays, such as Southern and northern blotting, PCR, and DNA sequencing; biochemical assays, such as detecting the presence of a protein product, for example, by immunological means (ELISAs and western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole plant.
Introgression of a transgenic trait into a plant genotype is achieved as the result of the process of backcross conversion. A plant genotype into which a transgenic trait has been introgressed may be referred to as a backcross converted genotype, line, inbred, or hybrid. Similarly a plant genotype lacking the desired transgenic trait may be referred to as an unconverted genotype, line, inbred, or hybrid.
As used herein, the term “comprising” means “including but not limited to”.
Having described the present disclosure in detail, it will be apparent that modifications, variations, and equivalent embodiments are possible without departing from the spirit and scope of the present disclosure as further defined in the appended claims. Furthermore, it should be appreciated that all examples in the present disclosure including the following are provided as non-limiting examples.
Mesotrione, a β-triketone herbicide, is an extremely potent competitive inhibitor of 4-Hydroxyphenylpyruvate dioxygenase (HPPD), and has been developed as a selective herbicide for many crops, including corn. Mesotrione provides pre-emergence and post-emergence control of many important broad-leaf weeds and some annual grass weeds in corn fields (Mitchell et al., 2001). However, it cannot be used to manage weeds in soybean fields due to soybean's sensitivity to mesotrione (Mitchell et al., 2001). Development of soybean plants that can tolerate two times (2X) the herbicide application rates recommended for commercial use of mesotrione and resulting in a herbicide injury rate lower than 10% will offer soybean farmers an additional tool to manage weeds.
The rice HPPD Inhibitor Sensitive 1 (HIS1) gene was found to confer tolerance to β-triketone herbicides, including mesotrione, although the level of tolerance was not evaluated to determine if it is sufficient to serve for commercial use (Maeda et al., 2019). HIS1 encodes a Fe(II)/2-oxoglutarate-dependent dioxygenase that catalyzes the hydroxylation of a number of β-triketone herbicides such as benzobicyclon, tefuryltrione, sulcotrione, mesotrione, and tembotrione, leading to their inactivation. To test whether ectopic expression of the rice HIS1 gene in soybean can confer sufficient tolerance to mesotrione for trait development, the rice HIS1 gene sequence was codon-optimized for expression in soybean and designated as Os.TDO (SEQ ID NO: 2).
The Os.TDO sequence was cloned into a set of binary vectors using standard cloning methods known in the art with its expression driven by different constitutive promoters and different 3′UTRs. Transgenic soybean plants comprising these constructs were developed using an Agrobacterium-mediated transformation system as described in Larue et al. (2020). The copy number of the transgene was determined via a Taqman assay and events with a single copy of the T-DNA insertion were selected for evaluation of mesotrione tolerance. Briefly, R2 and later generation homozygous lines of three soybean transgenic events derived from Construct 32 were selected for a mesotrione tolerance greenhouse assay. Construct 32 contained two expression cassettes between the T-DNA borders: one for expression of Os.TDO and the other for expression of a selectable marker. The Os.TDO cassette utilized the cauliflower mosaic virus (CaMV) 35S promotor and the 5′ UTR of HSP70 from petunia to drive expression of Os.TDO, followed by the 3′ UTR of the nopaline synthase (nos) gene from the T-DNA region of the Ti plasmid of Agrobacterium tumefaciens, which functions to direct polyadenylation of the mRNA. The selectable marker cassette conferring spectinomycin and streptomycin resistance comprises the promoter, intron, and leader sequences from the Arabidopsis thaliana actin 7 gene, the transit peptide region of Arabidopsis thaliana EPSPS, the coding region of the Tn7 adenylyltransferase (AAD (3″)) gene from E. coli, and the 3′ non-translated region from the Agrobacterium tumefaciens nopaline synthase gene.
Each plant was grown in a 3.5″ pot with soybean potting media in a growth chamber under conditions of 28° C./24° C. (light/dark) and a photoperiod of 16 hr light at 700 μE/8 hr dark, and fertilized with Peters general purpose fertilizer. Eight plants at V1 growth stage were sprayed with mesotrione (in Callisto®, Syngenta in 1% crop oil) at 1X, 2X, 4X, 8X, and 16X of commercial rate (equivalent to 105, 210, 420, 840, and 1680 g active ingredient/ha, respectively). Crop injury ratings were taken 13 days after mesotrione treatment.
As shown in
To investigate if HIS1 from other species could also provide tolerance to HPPD-inhibiting herbicides, homologs from different plant species were identified using bioinformatic methods known in the art. Plant expression vectors containing TDO homolog sequences from Zea mays, Sorghum bicolor, Setatria italica, Triticum aestivum, Andropogon gerardii, Coix lacryma-jobi, and Oryza sativa were constructed and verified using standard molecular cloning techniques. These homolog sequences were optimized for enhanced expression in dicots using an in-house sequence optimization program. The nucleotides downstream of the ATG (nucleotide positions +4 to +11) of the wheat TDO sequence were codon-optimized as ATGGCttcctc for enhanced protein expression in plants (Sawant et al., 2001). This resulted in the amino acid substitution of alanine to serine at the 4th amino acid position from the N-terminus (A4S). Table 1 provides information regarding the sequences identified in other plant species.
Oryza sativa
Zea mays
Sorghum bicolor
Setaria italica
Triticum aestivum
Andropogon
gerardii
Coix lacryma-jobi
Coix lacryma-jobi
Plant expression vectors comprising the selected TDO homologs under the control of one of two constitutive promoters were transformed into soybean via an Agrobacterium tumefaciens-mediated method. Following Agrobacterium transformation, R0 transgenic soybean plants containing a single copy of the transgene were identified through molecular characterization and segregation analysis. Transgenic and wild-type (control) plants were grown in 6-inch pots filled with BM7 soilless mix in a growth chamber under the conditions of 28° C./23° C. (light/dark) and a 16 hr light/8 hr dark photoperiod. Plants were fertilized with 15-5-15 liquid fertilizer at the V3-V4 growth stage and treated with 210 g a.i./ha mesotrione (1X=105 g a.i./ha; Callisto®, Syngenta in 1% crop oil). Plants were visually rated 7-14 days after treatment on a scale of 0% to 100%, with 0%=no injury and 100%=plant death, relative to the wild-type controls. The results summarized in Table 2 represent the mean of all tested events (10-15) for each construct.
Oryza sativa
Zea mays
Sorghum bicolor
Setatria italica
Triticum aestivum
Andropogon gerardii
Coix lacryma-jobi
Coix lacryma-jobi
Oryza sativa
The results in Table 2 show that transgenic plants expressing Os.TDO had minimal mesotrione injury. While homologs from corn, sorghum, and Setatria italica showed reduced but still significant injury, those from wheat, Andropogon gerardii, and Coix lacryma-jobi did not seem to provide any protection against mesotrione.
Crop plants are often grown in diverse locations throughout the world, which can expose the plants to a wide range of climatic conditions. One of these climatic variables includes a wide range of temperatures. Previous work on herbicide tolerance trait development has shown that is often necessary that the enzymes conferring these traits are stable at higher temperatures (such as field situations) in order to build a robust tolerance trait.
As shown in the previous examples, expression of Os.TDO in transgenic soybean plants provides tolerance to the mesotrione. However, the trait performance declined when the plants were grown in growth chamber experiments in which the temperature was increased to mimic a hot field environment. This decline in performance was linked to a decline in TDO protein expression (
To address TDO protein instability at higher temperatures, one of the approaches taken was to improve expression of TDO protein through selection of expression elements in the plant transformation constructs, especially promoters. Experiments were conducted in which transgenic soybean plants expressing Os.TDO under the control of a strong constitutive promoter (Construct 32), a medium constitutive promoter A (Construct 9), and a different medium constitutive promoter B(Construct 1) were grown in growth chambers at 85° F. or 99° F. At V3-V4 growth stage, these plants were treated with 4X mesotrione (420 g a.i./ha; 1X=105 g a.i./ha; in Callisto®, Syngenta in 1% crop oil). Plants were visually rated 7-14 days after treatment on a scale of 0% to 100%, with 0%=no injury and 100%=plant death, relative to the untreated transgenic and wild-type controls.
As shown in
A rational design approach based on the ortholog analysis described above was taken to design a set of TDO variants that possess improved temperature stability along with the desired enzymatic activity. In the first round of variants screened, five variants passed high-throughput screens and were further characterized.
As shown in Table 3, all five variants showed heat stability that was better compared to Os.TDO, defined as the temperature at which 50% of the enzyme denatures. However, all five also showed reduced enzyme activity in vitro. The enzyme activity was measured at room temperature for the hydroxylation of mesotrione, and was defined as Kcat/Km, M-1 sec-1 (specificity constant). Of these variants, four were tested in soybean plants. All of the variants tested in transgenic soybean plants either trended or showed significantly improved enzyme stability from leaf samples collected from plants grown at normal growth temperature (85° F.) and elevated temperature (99° F.) as shown by ELISA (
Additional testing of the best variant, D08, in a field setting confirmed that this variant provided good tolerance to mesotrione spray treatment, but was not significantly different from the control plants expressing Os.TDO driven by the strong constitutive promoter. Since this testing was conducted in the field, there was no control on the temperature ranges the plants were exposed to.
Based on these observations, an additional round of optimization focused on improving the enzyme activity while maintaining the improved temperature stability in order to further reduce herbicide injury in plants at both normal and higher temperatures to mesotrione applications. The variant D08 was used as the base variant for all Round 2 optimization. Rational design, which took into account the results from the Round 1 screens, was used in engineering Round 2 variants. Much of the rational design work focused on selected point amino acid changes in D08 (relative to wild-type) that were made in Round 1 that could be restored to wild-type amino acids (or similar) with the general approach being that the amino acid positions could be uncovered that were important for activity and those important for stability. The goal was to find amino acid positions that were independent of each other so that improvements could be made to both stability and activity without a negative impact on the opposing improvement target to build variants with improved characteristics for both objectives. As a general rule, improving an enzyme for either stability or activity often has an opposing effect on the other characteristic. For example, improving stability may result in an enzyme that is very resistant to denaturing at higher temperatures, but at the expense of reducing overall enzymatic activity. Effort was focused on improvements in both characteristics at the same time, which is often a much larger engineering challenge.
For the Round 2 optimization, approximately 300 variants were designed based on the D08 template, largely exploring various amino acid positions that could be returned to the wild-type amino acid. Of these, approximately 250 were successfully cloned and screened in a high-throughput screen, with a subset selected for more detailed characterization. Several rounds of enzymatic characterization were completed to select a final eight candidates that underwent detailed in vitro enzymatic characterization and were tested in plant assays as well. Of these eight candidates, all showed improved enzymatic stability relative to the Os.TDO starting point (data not shown). However, only two of the candidates (TDO_6 and TDO_8) had enzymatic activity that was significantly improved over the D08 variant, and none of the enzymes had enzymatic activity better than Os.TDO at room temperature (
The next set of experiments was focused on determining if the balance between improved enzyme stability could outweigh a reduction in activity observed at optimal temperatures in soybean plants. Potentially the improved stability could enable more active enzyme to be available in the plant cell throughout a reasonable range of temperatures and thus tolerance is improved even if the activity is somewhat reduced for each enzyme. To test this hypothesis, the eight variants from Round 2 biochemical analysis described in the preceding paragraph were transformed into soybean through Agrobacterium-mediated transformation, along with three controls. One of the controls (TDO_9) was a variant from the high-throughput screens that had poor activity. Two other controls were the D08 variant and the Os.TDO. R0 transgenic soybean plants were sprayed with the mesotrione herbicide to screen for herbicide tolerance. Due to some technical issues in the plant transformation process unrelated to the transgenes themselves, three of the variants did not produce any transgenic plants for testing. These included the Os.TDO control, the D08 variant and one Round 2 variant. The plants were scored for percent injury as described in the previous examples. All the variants showed low to mid 30% injury ratings, demonstrating tolerance in these plants relative to the wild-type transformation germplasm control, which averaged 45% injury (Table 4).
As shown in Table 4, the wild-type (WT) control had the highest injury of the variants tested. While the Os.TDO control failed transformation in this experiment, a typical injury rating for R0 soybean (using the same expression elements) would have been high 20% to low 30%. This suggests that the variants had similar injury to what would be expected for Os.TDO at optimal growth conditions. The results from the Round 2 variants provided insights on sites/amino acid changes that could help stabilize the TDO protein, especially for crops that may be exposed to hotter temperatures. In addition, these amino acid changes are candidates for gene editing of the native trait in rice or similar crops to improve the native trait, or to edit the transgene itself in crops such as soybean.
Cotton is a crop of interest that is often exposed to high temperature growing conditions and thus, selected TDO variants were tested in cotton. Three TDO variants, D08, TDO_6, and TDO_8 were transformed into cotton (along with the Os.TDO control) using similar expression elements in single-cassette vectors or as part of multi-cassette vectors. R2 transgenic cotton plants were then screened for tolerance to mesotrione. Sixteen plants from two events each per construct were sprayed with two rates (2X and 8X) of mesotrione at V3 stage under normal (80° F.) and hot (100° F.) growing conditions. In addition, the effect of heat treatment on TDO accumulation in cotton plants were measured by ELISA. Cotton plants were planted and grown under normal greenhouse conditions (80° F. day/70° F. night) until V3 growth stage. Half of the plants were then moved to a higher temperature growth chamber (100° F. day/82° F. night) for the heat treatment, whereas the remaining half of the plants stayed in the same greenhouse (80° F. day/70° F. night). At day 3 after the plants were placed under different temperature conditions, leaf samples were collected for TDO protein measurement. The plants in 100° F. growth chamber and in 80° F. greenhouse were both sprayed with 2X mesotrione (210 g a.i./ha). Leaf samples were collected again 7 days after mesotrione treatment (DAT: Days After Treatment) for protein assay. Table 5 compares protein levels of the TDO variants in transgenic cotton plants grown under the two tested conditions. The protein levels for the TDO variants were measured at three time points for plants: 1) just before entering the high temperature treatment, 2) three days following placement into control (normal growth temperature) or high temperature (100° F.), and 3) 7 days after a 2X mesotrione spray.
In this experiment, the Os.TDO plants showed a reduction in TDO protein levels in the high temperature treatment relative to normal conditions at both the 3 and 7 days after treatment timepoints. However, all three optimized TDO variants showed no significant changes in TDO protein levels between the normal and hot growth conditions confirming excellent in planta heat stability of the enzyme. Table 6 shows the results of mesotrione treatment for cotton plants expressing these selected TDO variants. While all TDO expressing cotton events showed excellent tolerance, two of the variants, D08 and TDO_6, showed injury rates that were trending even lower than the Os.TDO controls at the higher temperature. This suggests that the enhanced heat stability of the enzyme translates to improved herbicide tolerance in cotton. In contrast, the wild-type control cotton plants showed high injury (poor tolerance) to all mesotrione applications.
Of the TDO genes tested from different species, the rice gene (Os.TDO) was unique in its ability to provide strong protection to plants to mesotrione. The only other TDO which provided protection, although at a much lower efficacy, was the maize TDO (Zm.TDO), as shown in Example 2. The amino acid sequences of various proteins having TDO activity and two closely related proteins, BRT and At.dioxy, were aligned to amino acid sequence of Os.TDO. 2BRT and At.dioxy are anthocyanidin synthases from Arabidopsis thaliana. Both are involved in anthocyanin and protoanthocyanidin biosynthesis by catalyzing the oxidation of leucoanthocyanidins into anthocyanidins.
Based on this sequence alignment, the amino acids were divided into several categories, including those amino acids which were: 1) conserved throughout all of the proteins (indicated with open triangles in
To test the hypothesis, the four amino acids of sorghum TDO corresponding to amino acid positions 230, 301, 332, and 335 in rice TDO were replaced with those of rice TDO. The corn, rice, sorghum, and the modified sorghum TDO genes were cloned into a pET28 vector for E. coli expression with His6 tag in the N-terminus. Subsequently, the proteins were expressed and purified by affinity chromatography. BL21-CodonPlus (DE3)-RIPL E. coli cells (Stratagene) were used for expression. Cells transformed with expression vector were grown in autoinduction media (AIM) supplemented with 100 μg mL-1 kanamycin and 34 μg mL-1 chloramphenicol.
TDO proteins used in this study were expressed in AIM cells using an auto-induction system as described by Studier (2014). Briefly, the recombinant cultures were grown at 37° C. for 3 hours and then 20° C. for one or two days. Cells were harvested by centrifugation and frozen at −80° C. The pellet was resuspended in lysis buffer (Bper and Yper [Pierce] with ratio 3 to 1, 25 mM Tris-HCl, 300 mM NaCl, 10 mM imidazole, pH 8.0). 150 mM ferric ammonium citrate (1000X) was added to the lysate to a final concentration of 150 UM and incubated on ice for 20 minutes. The lysate was centrifuged to clarify the supernatant. Clear supernatant was loaded onto a 5 mL Ni column connected to AKTA Express (GE Healthcare). The column was washed with washing buffer (25 mM Tris-HCl, 300 mM NaCl, 30 mM imidazole, pH 8.0). The protein was eluted with 25 mM Tris-HCl, 300 mM NaCl, 150 mM imidazole, pH 8.0. The protein from the Ni column was concentrated and subjected to size-exclusion chromatography using a Superdex 200 column connected to AKTA Explorer (GE Healthcare) equilibrated with 25 mM Tris, 150 mM NaCl and 1 mM DTT, pH 8.0. Fractions containing pure TDO were pooled. All purification steps were carried out at 4° C. Protein concentrations were determined spectrophotometrically based on their calculated molar absorption extinction coefficients at 280 nm.
TDO activity was measured by the second hydroxylation of mesotrione. For routine activity measurements of TDO and determining the kinetic properties of the second hydroxylation, the continuous change in A330, which is specific to the second hydroxylation and subsequent non-enzymatic water removal (i.e. the conversion of hydroxy-mesotrione to oxy-mesotrione), was monitored. The reaction mixture (100 μL) contained 50 mM MOPS (pH 7.2), 2.5 mM oxoglutarate, 50 μM iron (II) sulfate heptahydrate, 150 mM KCl, 150 UM sodium ascorbate, 100 to 600 M mesotrione, and 50-250 nM TDO protein. The reaction proceeded until the maximum A330 was obtained (usually 20-120 minutes depending upon the concentration of mesotrione) and the linear portion of the curve was determined by visual inspection. Because the starting concentration of mesotrione was used as an estimate for the concentration of hydroxy-mesotrione, and less than 100% of mesotrione would have been converted by the beginning of the linear phase of the second reaction, the starting concentrations of hydroxy-mesotrione and therefore also the Km was always slightly underestimated.
For the activity calculation, aliquots of the reaction were removed at various time points during the reaction and heated at 95° C. to halt the reaction. The reaction mixtures were analyzed by HPLC and mass spectrophotometry. By product conversation followed by peak integration on HPLC, a linear plot was obtained and the extinction co-efficient delta between the substrate (mesotrione) and the second product (oxy-mesotrione) was obtained according to the equation ΔA330=(εp-εs)*bΔp where ΔA330 is the change of absorbance at 330 nm and b is the light path.
The specific activity of the TDO variants of the second hydroxylation is shown in
In addition to the full-length alignment of TDO proteins, the amino acid sequences adjacent to the active sites from TDO homologues were aligned (
On the left side of the binding pocket (Ser236 1GP5), dicots have more rigidity in this area. Proteins that contain two glycine residues in this region may allow enough flexibility in the binding pocket for the reaction to occur whereas proteins that have no glycine may not have enough flexibility.
Both active monocot TDO proteins have one glycine and a residue with few degrees of backbone freedom. Therefore, glycine could be substituted for serine at the corresponding to position 236 in 1GP5 (see
Furthermore, a leucine, alanine, valine, or isoleucine at position 299 in Os.TDO (corresponding to position F304 in 1GP5) may allow more space for the non-aromatic ring of mesotrione to bind. In the case of DQH, there is pi stacking between the Phe and DQH. This is present among dicots such as soy (Gm.TDO F299). It also may explain reduced activity in corn TDO (Phe300 in Zm.TDO) compared to rice TDO. However, surveying other TDOs based on their similarity to the active site of rice TDO, rather than the similarity of the entire protein, may provide a valid strategy to identify naturally active or other variants.
This application claims the priority of U.S. Provisional Appl. Ser. No. 63/307,525, filed Feb. 7, 2022, the entire disclosure of which is incorporated herein by reference.
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/US2023/062054 | 2/6/2023 | WO |
| Number | Date | Country | |
|---|---|---|---|
| 63307525 | Feb 2022 | US |