The sequence listing that is contained in the file named “AGOE003US_ST25.txt,” which is 27 KB (measured in MS-Windows) and created on Jun. 24, 2022, is filed herewith by electronic submission and incorporated herein by reference.
The disclosure relates to methods, polynucleotide constructs, and systems for controlling and enhancing gene expression in Rhizobia such as Agrobacterium tumefaciens, and improving the ability of A. tumefaciens and other Rhizobia to genetically transform cells of plants and other organisms.
Agrobacterium tumefaciens is a soil-borne, Gram-negative bacterium that is widely studied for its ability to transfer DNA into plants. Agrobacterium-mediated transformation (AMT) is perhaps the most versatile technology for production of genetically modified plants. AMT is also used for the transformation of filamentous fungi, green algae and human cells. Agrobacterium spp., such as A. rhizogenes, and other Rhizobia, i.e. members of the Rhizobiales, such as Rhizobium spp., Mesorhizobium spp., Sinorhizobium spp., Bradyrhizobium spp. and related species and genera beyond Agrobacterium tumefaciens have also been found to be able to genetically transform plants.
In addition to its role in genetic transformation of plant cells, A. tumefaciens has been utilized in a variety of studies. For example, it has been adopted as a well-characterized model organism for the study of plant-microbe signaling (Barton, et al., Environmental Microbiology, 20:16-29, 2018; Venturi & Fuqua, Ann Rev Phytopathol, 51:17-37, 2013), bacterial cell-to-cell communication (Faure & Lang, Agrobacterium tumefaciens. Frontiers in Plant Science, 5, 14. doi:10.3389/fpls.2014.000142014), and virulence mechanisms (Jakubowski, et al, J Bacteriol, 187:3486-3495, 2005).
However, although AMT is a valuable technology for the production of genetically modified crop plants, Agrobacterium (and related Rhizobia) is not able to genetically transform certain plant (crop) species, or does so inefficiently or in a genotype-dependent manner. Improved methods of transforming plants through AMT that address these shortcomings would therefore be a significant advance in the art.
SEQ ID Nos:1-3 Artificial cumic acid-inducible promoters.
SEQ ID Nos:4-18 VirG-controlled promoters from pTiB0542.
SEQ ID Nos:19-26 Artificial VirB promoters with various AT repeats inserted in the spacer region between the promoter and the reporter gene.
SEQ ID Nos:27-46 Artificial VirB promoters with engineered VirG binding site.
SEQ ID NO:47 Consensus VirG-binding motif.
SEQ ID Nos:48-62 A. tumefaciens VirG binding sites of
SEQ ID NO:63 PVirB core region, as shown in
SEQ ID NO:64 PVIrB core region with consensus engineered sites.
SEQ ID Nos:65-85 Engineered PVirB promoter fragments of
SEQ ID NO:86 A native WT PVirB promoter fragment of
In one aspect the invention comprises a recombinant polynucleotide construct comprising a DNA molecule encoding: (a) at least one gene of interest operably linked to a heterologous inducible promoter for expression of the gene of interest in a bacterial cell, wherein the ratio of expression of the gene of interest in the presence of an added inducer relative to expression in the absence of the added inducer is at least 100; and (b) a broad host range origin of replication functional in Enterobacteriaceae and Rhizobiaceae. In certain embodiments the inducer is cumic acid or vanillic acid. The invention also comprises embodiments wherein the origin of replication of the construct comprises an oriT functional with IncQ, IncP, IncW, or colE1. In certain embodiments the ratio of expression of the gene of interest in the presence of an added inducer relative to expression in the absence of the added inducer is at least 200, 300, 400 or 500 relative to expression in the absence of the added inducer.
In another aspect the invention comprises a transgenic bacterium comprising the recombinant polynucleotide construct comprising a DNA molecule encoding: (a) at least one gene of interest operably linked to a heterologous inducible promoter for expression of the gene of interest in a bacterial cell, wherein the ratio of expression of the gene of interest in the presence of an added inducer relative to expression in the absence of the added inducer is at least 100; and (b) a broad host range origin of replication functional in Enterobacteriaceae and Rhizobiaceae. In certain embodiments the bacterium is from a species within a genus selected from the group consisting of: Escherichia, Agrobacterium , and Rhizobium. In particular embodiments the bacterium is an Agrobacterium tumefaciens bacterium or an Agrobacterium rhizogenes bacterium. The invention may also comprise an in vitro culture of the bacterium, growing in the presence of an inducer. In some embodiments a culture of the bacterium growing in the presence both of a plant cell and of the inducer is contemplated.
In some embodiments of the invention the bacterium further comprises a VirGN54D protein. In further embodiments the heterologous inducible promoter may comprise a nucleotide sequence selected from the group consisting of: SEQ ID Nos:1-3, SEQ ID Nos:19-26, and SEQ ID Nos:27-47, or the heterologous inducible promoter comprises a nucleotide sequence selected from the group consisting of: SEQ ID NO:64 and SEQ ID Nos:65-85.
In another aspect the invention provides a method for expressing a gene of interest comprising: (a) obtaining a transgenic bacterium comprising a recombinant polynucleotide construct comprising a DNA molecule encoding: (i) at least one gene of interest operably linked to a heterologous inducible promoter for expression of the gene of interest in a bacterial cell, wherein the ratio of expression of the gene of interest in the presence of an added inducer relative to expression in the absence of the added inducer is at least 100; and (ii) a broad host range origin of replication functional in Enterobacteriaceae and Rhizobiaceae wherein the heterologous inducible promoter comprises a nucleotide sequence selected from the group consisting of: SEQ ID NOs:1-3, SEQ ID Nos:19-26, SEQ ID Nos:27-47, and SEQ ID Nos:64-85; (b) growing a culture of cells of the bacterium in the presence of an inducer of the heterologous promoter; and (c) assaying the culture, or a portion or an extract thereof, for expression of the gene of interest. In such a method the culture of the bacterium may further comprise plant cells. In certain embodiments of such a method, the assaying may comprise measuring the transformation frequency (“TF”) of a plant cell by the bacterium.
The invention further provides, in another aspect, a polynucleotide construct comprising a gene of interest operably linked to a heterologous inducible promoter sequence for expression of the gene of interest in a bacterial cell, wherein the promoter sequence comprises a nucleotide sequence selected from the group consisting of: SEQ ID Nos:64-85.
A kit comprising the bacterium comprising a polynucleotide construct comprising a gene of interest operably linked to a heterologous inducible promoter sequence for expression of the gene of interest in a bacterial cell, wherein the promoter sequence comprises a nucleotide sequence selected from the group consisting of: SEQ ID Nos:1-3, 19-26, 27-46, 47, 64, and 65-85, and an inducer of the heterologous promoter is also contemplated.
Agrobacterium (and related Rhizobia) is not able to genetically transform certain plant (crop) species and certain varieties (genotypes) of other species, or does so inefficiently. Thus there is a need for enhancing the ability of A. tumefaciens and related Rhizobia to transform plant cells and other cells, especially of plant species not efficiently transformed by A. tumefaciens or other Rhizobia. The invention overcomes such limitations of the prior art by providing nucleic acid constructs, methods, and systems for enhancing and controlling gene expression and transformation by A. tumefaciens as well as other Rhizobia. In addition, the present disclosure provides methods for rational and systematic genetic engineering of bacteria to enhance plant cell transformation.
Methods and compositions for enhancement of transformation ability provided herein may include, for example, controlling and optimizing vir gene expression as well as expression of other bacteria loci, including chromosomal loci such as chv genes to achieve more efficient cell transformation including an increase in transformation frequency and an improved broader range of plant species for which efficient cell transformation is available. The approaches described herein can also allow for improved transformation efficiency of non-plant cells by Agrobacterium and other Rhizobia. The described constructs, polynucleotide sequences, and methods also provide for rationally-controlled inducible gene expression systems for expression of one or more gene(s) of interest in Rhizobia including A. tumefaciens. Expression of entire bacterial vir gene clusters and/or chromosomal chv gene clusters (or other bacterial operons) may be altered, allowing for efficient cell transformation of an expanded set of target plant (crop) or other species targeted for bacterial-mediated transformation.
Additionally, there is a need to further develop or enhance methods and polynucleotide constructs for gene expression in A. tumefactions and other Rhizobia for expression of genes of interest in a controlled manner. This may include development of effective inducible expression systems to control gene expression in Agrobacterium and other Rhizobia. As part of the described “toolkit” for gene expression described herein, the engineered inducible promoter sequences of the disclosure further allow for predictable levels of gene expression in Agrobacterium and other Rhizobia over a useful range important for fine-tuning such expression of genes (e.g. vir genes), or groups of genes such as operons, of interest. Such efficient inducible expression systems may reduce or eliminate the need for traditional phenolic inducers of Agrobacterium vir gene expression, such as acetosyringone. Reliable induction systems for gene expression may also be useful to achieve precise control of gene expression. Simple sequence repeats in the spacer region between the ribosome-binding site and the start codon (ATG) were found, for example, to effectively modulate translation in A. tumefaciens, with various lengths of AT sequence repeats ((AT)0-(AT)10) inserted in the spacer region between the promoter and the reporter gene (SEQ ID Nos:19-26) showing that altering the number of AT repeats can robustly and predictably tune gene expression levels over a 100-fold range. Since complex biosynthetic pathways often require a coordinated, fine balance of expression of individual genes in order to achieve optimal performance, the present invention allows for gene expression fine-tuning in A. tumefaciens.
The disclosure thus provides, in one embodiment, for an inducible bacterial gene expression system comprising a recombinant construct comprising a DNA molecule encoding at least one gene of interest for expression in a bacterial cell, operably linked to a heterologous inducible promoter for expression of the gene of interest in the bacterial cell, wherein the ratio of expression of the gene of interest in the presence of an added inducer relative to expression in the absence of the added inducer is at least 100×; or 500× or more relative to expression in the absence of an inducer. The efficiency of the inducible expression system may, in specific embodiments, also be measured by ascertaining the absolute level of gene expression in the presence of an inducer, relative to the expression seen in the absence of an inducer. A polynucleotide of the present invention may also comprise a broad host range origin of replication functional in both Enterobacteriaceae and Rhizobiaceae. In certain embodiments the inducer of bacterial gene expression may be cumic acid or vanillic acid. The broad host-range origin of replication may for instance comprise an oriT functional with IncQ, IncP, IncW, or colE1.
In certain embodiments, a transgenic bacterial strain comprising the recombinant construct is contemplated. The bacterial strain may be from a species of Rhizobia, or for instance from within a genus selected from the group consisting of: Escherichia, Agrobacterium, and Rhizobium. The bacterial strain may be comprised within a bacterial culture that may be growing in the presence of the inducer, or in the presence of the inducer and of a plant cell. In further embodiments the bacterial strain also comprises an “amplifier” module, such as comprising a VirGN54D protein.
Also contemplated as an aspect of the invention is a polynucleotide construct comprising a gene of interest operably linked to a heterologous promoter sequence for expression of the gene of interest in a bacterial cell, wherein the promoter sequence comprises a nucleotide sequence selected from the group consisting of the polynucleotide sequences as disclosed herein (e.g. SEQ ID Nos:1-3, SEQ ID Nos:19-47, SEQ ID NO:64, or as shown in
In another aspect, the invention provides methods for expressing a gene of interest in a bacterial cell comprising: (a) obtaining a bacterial strain comprising a gene of interest operably linked to a heterologous promoter sequence for expression of the gene of interest in a bacterial cell, wherein the promoter sequence comprises a nucleotide sequence selected from the group consisting of the polynucleotide sequences as shown in
The mechanism of T-DNA transfer to plant cells by Agrobacterium has been well documented (e.g. Gelvin, Microbiology and Molecular Biology Reviews, 67:16-37, 2003). Briefly, the T-DNA is delimited by two border regions, referred to as right border (RB) and left border (LB). The borders are nicked by virulence protein VirD2 which produces single stranded transferred DNA (the “T-strand”) with covalent attachment of the VirD2 on its 5′ end. The protein-DNA complex, also including Agrobacterium VirE2 protein, exits Agrobacterium cells through the so-called Type 4 secretion system (T4SS, both virulence protein and ssDNA transporter), and is transferred into plant cells and integrated in the plant genome with the help of both Agrobacterium virulence proteins and plant factors.
The following descriptions and definitions are provided to better define the invention and to guide those of ordinary skill in the art in the practice of the invention. Unless otherwise noted, terms are to be understood according to conventional usage by those of ordinary skill in the relevant art.
“Amplifiers” are widely used devices to enhance signals in electrical and electronic systems. In synthetic biology, bio-amplifiers such as T7 RNAP and cooperative activator proteins (e.g., HrpR and HrpS) have also been used to increase the sensitivity and output dynamic range of transcription based biosensors (e.g. Tang et al., ACS Synthetic Biology 7:1753-1762, 2018 Wang et al., Nucleic Acids Research, 42:9484-9492, 2014). The transcriptional factor VirG of A. tumefaciens activates the expression of virulence genes by binding to “Vir boxes”, nucleotide sequences of the Vir operons in the presence of signals and the sensor protein VirA (Krishnamohan et al., J Bacteriol 183:4079-4089, 2001). In contrast, the VirG mutant VirGN54D alone can activate the vir genes without signals and VirA (Jin et al., Molecular Microbiology, 7:555-562, 1993; Jung et al., Current Microbiology, 49:334-340, 2004). Thus an “amplifier” module may be constructed and utilized for enhanced or more precise control of gene expression. Such an amplifier module may comprise a promoter that functions in expression of a polynucleotide sequence of interest in a bacterial cell. Such a promoter may be a native (“wild-type”) promoter, or it may be modified or engineered to improve (increase or decrease) or otherwise control the resulting level of gene expression under certain growth conditions of interest. As used herein, “inducible promoter” refers to a promoter that exhibits an increased level of expression of an operably linked gene of interest, when cells comprising the promoter and the gene of interest are grown under inducing conditions, such as in the presence of a chemical or other inducer. Such “inducibility” may be due to direct or indirect effects as the inducer promotes gene expression.
As used herein, the term “recombinant” refers to a non-naturally occurring DNA, protein, cell, seed, or organism that is the result of genetic engineering and as such would not normally be found in nature. A “recombinant DNA molecule” is a DNA molecule comprising a DNA sequence that does not naturally occur in nature and as such is the result of human intervention, such as a DNA molecule comprised of at least two DNA molecules heterologous to each other. An example of a recombinant DNA molecule is a DNA molecule operably linked to a heterologous regulatory or other element, such as a heterologous promoter for expression in a plant cell, or other cell. A “recombinant protein” is a protein comprising an amino acid sequence that does not naturally occur and as such is the result of human intervention, such as an engineered protein or a chimeric protein. A recombinant cell, seed, or organism is a cell, seed, or organism comprising transgenic DNA, for example a transgenic cell, seed, plant, or plant part comprising a recombinant DNA molecule and therefore produced as a result of plant transformation.
As used herein, the term “genetic engineering” refers to the creation of a non-natural DNA, protein, or organism that would not normally be found in nature and therefore entails applying human intervention. Genetic engineering can be used to produce an engineered DNA, protein, or organism that was conceived of and created in the laboratory using one or more of the techniques of biotechnology such as molecular biology, protein biochemistry, bacterial transformation, and plant transformation. For example, genetic engineering can be used to express a gene of interest in a bacterial, fungal, plant, or animal cell.
The term “transformation frequency (“TF”) refers to the ability of a bacterial cell to transfer DNA via AMT, or other bacterial-mediated transformation. This may be measured, for instance, by the number of transformed cells or plants obtained from a given treated sample. Such transformation may be the result of transient or stable transformation.
The term “transgene” refers to a DNA molecule artificially incorporated into an organism's genome as a result of human intervention, such as a plant transformation method. As used herein, the term “transgenic” means comprising a transgene, for example a “transgenic plant” refers to a plant comprising a transgene in its genome and a “transgenic trait” refers to a characteristic or phenotype conveyed or conferred by the presence of a transgene incorporated into the plant genome. As a result of such genomic alteration, the transgenic plant or other organism is something distinctly different from the related wild-type plant or other organism and the transgenic trait is a trait not naturally found in the wild-type plant or other organism. Transgenic plants and organisms of the invention comprise the recombinant DNA molecules and engineered proteins provided by the invention.
As used herein, the term “heterologous” refers to the relationship between two or more things derived from different sources and thus not normally associated in nature. For example, a protein-coding recombinant DNA molecule is heterologous with respect to an operably linked promoter if such a combination is not normally found in nature. In addition, a particular recombinant DNA molecule may be heterologous with respect to a cell, seed, or organism into which it is inserted when it would not naturally occur in that particular cell, seed, or organism.
As used herein, the term “protein-coding DNA molecule” refers to a DNA molecule comprising a nucleotide sequence that encodes a protein. A “protein-coding sequence” means a DNA sequence that encodes a protein. A “sequence” means a sequential arrangement of nucleotides or amino acids. The boundaries of a protein-coding sequence are usually determined by a translation start codon at the 5′-terminus and a translation stop codon at the 3′-terminus. A protein-coding molecule may comprise a DNA sequence encoding a protein sequence. As used herein, “transgene expression”, “expressing a transgene”, “protein expression”, and “expressing a protein” mean the production of a protein through the process of transcribing a DNA molecule into messenger RNA (mRNA) and translating the mRNA into polypeptide chains, which are ultimately folded into proteins. A protein-coding DNA molecule may be operably linked to a heterologous promoter in a DNA construct for use in expressing the protein in a cell transformed with the recombinant DNA molecule. As used herein, “operably linked” means two DNA molecules linked in manner so that one may affect the function of the other. Operably-linked DNA molecules may be part of a single contiguous molecule and may or may not be adjacent. For example, a promoter is operably linked with a protein-coding DNA molecule in a DNA construct where the two DNA molecules are so arranged that the promoter may affect the expression of the transgene.
As used herein, a “DNA construct” is a recombinant DNA molecule comprising two or more heterologous DNA sequences. DNA constructs are useful for transgene expression and may be comprised in vectors and plasmids. DNA constructs may be used in vectors for the purpose of transformation, that is the introduction of heterologous DNA into a host cell, in order to produce transgenic plants and cells, and as such may also be contained in the plastid DNA or genomic DNA of a transgenic plant, seed, cell, or plant part. As used herein, a “vector” means any recombinant DNA molecule that may be used for the purpose of bacterial or plant transformation. Recombinant DNA molecules as set forth in the sequence listing, can, for example, be inserted into a vector as part of a construct having the recombinant DNA molecule operably linked to a gene expression element that functions in a plant to affect expression of the engineered protein encoded by the recombinant DNA molecule. Methods for constructing DNA constructs and vectors are well known in the art.
The components for a DNA construct, or a vector comprising a DNA construct or expression cassette, generally include one or more gene expression elements operably linked to a transcribable DNA sequence, such as the following: a promoter for the expression of an operably linked DNA, an operably linked protein-coding DNA molecule, and a 3′ untranslated region (UTR). A promoter drives expression of the recombinant protein molecule. Gene expression elements useful in practicing the present invention also include, but are not limited to, one or more of the following type of elements: 5′ UTR, enhancer, leader, cis-acting element, intron, targeting sequence, 3′ UTR, and one or more selectable or screenable marker transgenes.
Promoters useful in practicing the present invention include those that function in a cell for expression of an operably linked polynucleotide, such as a bacterial promoter. The microbial genome is a useful source for identifying DNA segments such as promoters for synthetic biology applications (Jin et al., Applied Microbiology and Biotechnology, 103:8725-8736, 2019). Many endogenous promoters have been identified such as PsacB promoter from B. subtilis THY-7 (Jin et al., 2019) and the Pvgb promoter from Vitreoscilla stercoraria (Lara et al., ACS Synthetic Biology, 6:344-356, 2017), in addition to, for instance, Pvir promoters found on the Agrobacterium Ti plasmid. Bacterial and plant promoters are varied and well known in the art and include those that are inducible, viral, synthetic, constitutive, temporally regulated, spatially regulated, and/or spatio-temporally regulated. The present invention further provides a panel of engineered (modified) bacterial promoter sequences for versatile application of an inducible bacterial gene expression system, such as the disclosed promoters that are controlled by VirGN54D.
Recombinant DNA molecules of the present invention may be synthesized and modified by methods known in the art, either completely or in part, especially where it is desirable to provide sequences useful for DNA manipulation (such as restriction enzyme recognition sites or recombination-based cloning sites), or sequences useful for DNA construct design (such as spacer or linker sequences).
As used herein, the term “percent sequence identity” or “% sequence identity” refers to the percentage of identical nucleotides or amino acids in a linear polynucleotide or polypeptide sequence of a reference (“query”) sequence (or its complementary strand) as compared to a test (“subject”) sequence (or its complementary strand) when the two sequences are optimally aligned (with appropriate nucleotide or amino acid insertions, deletions, or gaps totaling less than 20 percent of the reference sequence over the window of comparison). Optimal alignment of sequences for aligning a comparison window are well known to those skilled in the art and may be conducted by tools such as the local homology algorithm of Smith and Waterman, the homology alignment algorithm of Needleman and Wunsch, the search for similarity method of Pearson and Lipman, and by computerized implementations of these algorithms such as GAP, BESTFIT, FASTA, and TFASTA available as part of the Sequence Analysis software package of the GCG® Wisconsin Package® (Accelrys Inc., San Diego, Calif.), MEGAlign (DNAStar, Inc., 1228 S. Park St., Madison, Wis. 53715), and MUSCLE (version 3.6) (Edgar, Nucleic Acids Research 32(5):1792-7, 2004) with default parameters. An “identity fraction” for aligned segments of a test sequence and a reference sequence is the number of identical components which are shared by the two aligned sequences divided by the total number of components in the reference sequence segment, that is, the entire reference sequence or a smaller defined part of the reference sequence. Percent sequence identity is represented as the identity fraction multiplied by 100. The comparison of one or more sequences may be to a full-length sequence or a portion thereof, or to a longer sequence.
The present invention includes recombinant DNA molecules and engineered proteins having at least 70% sequence identity, at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, and at least 99% sequence identity to any of the recombinant DNA molecule or polypeptide sequences provided herein. Such identity may be calculated over the full length of the protein or nucleotide sequence, or over a portion of the length of the sequences of interest, such as 1%, 5%, 10%, 25%, or 50% of the sequence length. Alternatively, identity may be calculated over a portion (“window”) of a sequence of interest based on nucleotide length such as 50 nucleotide base-pairs or amino acid residues, 100, 200, 500, 1000, 5000 etc., including intervening lengths. Variants having a percent identity to a sequence disclosed herein may have the same activity as the base sequence.
In one embodiment, fragments of a promoter sequence disclosed herein are provided. Promoter fragments may comprise promoter activity and may be useful alone or in combination with other promoters and promoter fragments, such as in constructing chimeric promoters, or in combination with other expression elements and expression element fragments. In specific embodiments, fragments of a promoter are provided comprising at least about 50, at least about 75, at least about 95, at least about 100, at least about 125, at least about 150, at least about 175, at least about 200, at least about 225, at least about 250, at least about 275, at least about 300, at least about 500, at least about 600, at least about 700, at least about 750, at least about 800, at least about 900, or at least about 1000 contiguous nucleotides, or longer, of a recombinant DNA molecule disclosed herein. Fragments of a sequence disclosed herein may have the same activity as the base sequence. Methods for producing such fragments from a starting promoter molecule are well known in the art.
The term “plant” includes whole plants, shoot vegetative organs/structures (for example, leaves, stems and tubers), roots, flowers and floral organs/structures (for example, bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit (the mature ovary), plant tissue (for example, vascular tissue, ground tissue, and the like) and cells (for example, guard cells, egg cells, and the like), and progeny of same. The class of plants that can be used in the method of the invention is generally as broad as the class of higher and lower plants amenable to transformation techniques, including for instance angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms, and ferns. Cells of other organisms may be of interest for instance to achieve improved transformation frequency (“TF”) by bacterial-mediated cell transformation, and may include for instance cells of fungi, algae, cyanobacteria, and animals such as nematodes, insects, fish, and mammals.
Exemplary plants contemplated herein may include monocotyledonous or dicotyledonous crop plants including, for instance, cassava, maize (corn; Zea mays), soybean (Glycine max), cotton (Gossypium hirsutum; Gossypium sp.), peanut (Arachis hypogaea), barley (Hordeum vulgare); oats (Avena sativa); orchard grass (Dactylis glomerata); rice (Oryza sativa, including indica and japonica varieties); sorghum (Sorghum bicolor); sugar cane (Saccharum sp.); tall fescue (Festuca arundinacea); turfgrass species (e.g. species: Agrostis stolonifera, Poa pratensis, Stenotaphrum secundatum); wheat (Triticum aestivum); teff (Eragrostis); millet, alfalfa (Medicago sativa); members of the genus Brassica, including broccoli, cabbage, cauliflower, carrot, cucumber, dry bean and other leguminous plants, eggplant, tobacco (Nicotiana sp.), leek, lettuce, melon, okra, onion, pea, pepper, pumpkin, radish, spinach, squash, sweet corn, tomato, potato, watermelon, ornamental plants, and other fruit, vegetable, tuber, oilseed, and root crops, wherein oilseed crops may include soybean, canola, oil seed rape, oil palm, sunflower, olive, coffee, citrus, flaxseed, safflower, and coconut, among others. Host cells, such as Escherichia coli, and Agrobacterium sp. or other Rhizobia, comprising the disclosed constructs are also contemplated as part of the invention.
The resulting transgenic organisms such as plants, progeny, seeds, plant cells, plant parts, and/or cells of other contemplated organisms of the invention may contain or display one or more transgenic traits as a result of their genetic transformation. Other transgenic trait(s) may be introduced by co-transforming a DNA construct for that additional transgenic trait(s) with a DNA construct comprising the recombinant DNA molecules provided by the invention (for example, with all the DNA constructs present as part of the same vector used for plant transformation) or by inserting the additional trait(s) into a transgenic plant comprising a DNA construct provided by the invention or vice versa (for example, by using any of the methods of plant transformation on a transgenic plant or plant cell).
Transgenic traits include, but are not limited to, expression of a gene product of interest, increased insect resistance, increased water use efficiency, increased yield performance, increased drought resistance, increased seed quality, improved nutritional quality, hybrid or inbred seed production, male sterility, grain nutritional or energy value and herbicide tolerance, in which the trait is measured with respect to a wild-type plant. Such transgenic traits are well known to one of skill in the art.
Transgenic cells and progeny that contain a transgenic trait provided by the invention may be used with any breeding methods that are commonly known in the art. In plant lines comprising two or more transgenic traits, the transgenic traits may be independently segregating, linked, or a combination of both in plant lines comprising three or more transgenic traits. Back-crossing to a parental plant and out-crossing with a non-transgenic plant are also contemplated, as is vegetative propagation. Descriptions of breeding methods that are commonly used for different traits and crops are well known to those of skill in the art. To confirm the presence of the transgene(s) in cells of a particular organism such as a bacterial cell, plant cell, or seed, a variety of assays may be performed. Such assays include, for example, molecular biology assays, such as Southern and northern blotting, PCR, and DNA sequencing; biochemical assays, such as detecting the presence of a protein product, for example, by immunological means (ELISAs and western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole plant, when identifying transformed cells.
Sequences for transgene expression may be codon optimized for expression in bacteria, algae, cyanobacteria, fungi, animals, or plants, including monocotyledonous and dicotyledonous plants. The genes of interest for expression may be located on the same construct, or on separate constructs, and may be co-transformed, transformed separately, or may be introduced together into a plant cell via a step of plant breeding. Marker-assisted selection may be utilized to confirm the presence of one or more gene(s) of interest via a plant breeding approach.
Stable or transient expression of constructs comprising a gene of interest is contemplated. The disclosure contemplates preparation of an expression vector that can be transported across a cell membrane, or a plant cell wall and membrane, resulting in transformation of a cell, for expression therein. Transformation of cells of organisms other than plants is also contemplated. In one embodiment, a vector may replicate in a bacterial host such that the vector can be produced and purified in sufficient quantities for transient expression or other use. In another embodiment, a vector can encode a marker gene to allow for selection or screening for the presence of the vector in a host cell such as a bacterial cell, an animal cell, an algal cell, a fungal cell, an insect cell, or a plant cell, or the vector can also comprise an expression cassette to provide for the expression of a gene of interest such as in a plant. The selection or marker gene may be expressed in a cell, or in a cell nucleus or in an organelle such as a chloroplast or a mitochondrion. In some embodiments an expression cassette contains a promoter region, a 5′ untranslated region, an optional intron to aid expression, and optionally a multiple cloning site to allow facile introduction of sequences of interest, and a 3′ UTR.
The method may further comprise assaying for the presence of an introduced gene in the genome of a cell, and/or the presence of a resulting protein product in the cell. Thus, well known methods such as Southern blotting and western blotting may be used. Methods may further comprise assaying for protein or enzyme activity. The presence of an introduced gene may be transient, or the gene may be stably integrated into a cell genome. Activity may thus be expressed in a transient or stable manner, and may occur in a cell, or in a cell nucleus, cytoplasm, mitochondria, or chloroplast.
The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples that follow represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents that are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. Cited references are incorporated herein by reference. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
Strains and cultivation conditions. E. coli DH10B was the host for plasmid construction. The plasmid-free strain A. tumefaciens NTL4 (Luo, Z.-Q., et al.,. Mol. Plant-Microbe Interact. 14:98-103, 2001) was used as the host for promoter library construction and phenotypic validation. All strains were cultured in the Luria-Bertani (LB) with 200 rpm shaking at 37° C. (E. coli) or at 30° C. (A. tumefaciens). Appropriate antibiotics were added at the following concentrations (μg/mL): for A. tumefaciens, 100 of carbenicillin, 50 of kanamycin; for E. coli, 100 of ampicillin, 50 of kanamycin. Inducers of gene expression (cumic acid, L-arabinose, IPTG, vanillic acid, sodium salicylate, and naringenin) were added to the medium at the concentrations of 10-3-103 μmol/mL when necessary.
Plasmids used in this study are listed in Table 1. Plasmids were constructed using the Gibson assembly method (Gibson et al., Nature Methods, 343-345. doi:10.1038/NMETH.13182009). All plasmids use pBBR1 (Szpirer et al., J Bacteriol. 183:2101-2110, 2001) or pVS1 (Vodala et al.,. Mol Cell. 31:104-13, 2008) as origins. To construct the inducible systems, the pBBR1 origin was cloned from the plasmid pBBR1-kan-hyg-ccdB (Wang et al., Nature Protocols 11:1175-1190, 2016). An ampicillin resistance cassette was cloned from the plasmid pAM PAT-ProCPCG. The sfgfp gene was cloned from the plasmid BP-Target-EndyD. The inducible systems were cloned from the plasmids pAJM.336 (Lad), pAJM.657 (cymR), 6 pAJM.661 (TtgR), pAJM.677 (AraE), pAJM.771 (NahR), and pAJM.773 (VanR) respectively (Meyer et al., 2019). The fragments were assembled to generate the plasmids pBBR1-B2, pBBR1-B4, pBBR1-B5, pBBR1-B7, pBBR1-B9, and pBBR1-B11. For the construction of the amplifier, VirG mutants virGNs4D were cloned from A. tumefaciens strains GV3101 and EHA105, respectively. Then virGN54D mutants were introduced into the plasmid pBBR1-B5 to generate pBBR1-CN and pBBR1-BN accordingly. To construct the reporter, the sfgfp and mKate2 were cloned from the plasmid BP-Target-EndyD (Bonnet, et al, PNAS, 109:8884-8889, 2012). The virB and virE promoters were cloned from the strain EHA105. The pVS1 origin were cloned from the plasmid pCAMBIA5105 (Wendt, et al., Transgenic Research, 21:567-578, 2012). Then, these fragments were assembled to generate the plasmids pBVKGM. To screen promoters, fifteen promoters were cloned from the strain EHA105. The sfgfp gene was used as a reporter gene and the pVS1 was used as the origin. The constructed plasmids were named as pBVKP1-15. To construct pleD expressing plasmids, sfgfp from the plasmid pBVK-P3 was replaced by pleD to generate the plasmid pBVKpleD.
Protocols 11:1175-
GFP measurement. An inoculum of A. tumefaciens was grown overnight to stationary phase and then transferred to fresh media at a 1:100 dilution. The new inocula were added with inducers to a final concentration of 1×10−3 to 1×103 μmol/ml when their optical density at 600 nm (OD600) reached 0.5. After 6 h of incubation, cells were collected and resuspended in 1×PBS buffer. Then 100 μL of suitably diluted cultures were added into 96-well microtiter plates. OD600 and relative green fluorescence were measured by a BioTek® microplate reader H1. Congo red plate assay. The dye was dissolved in ddH2O at 20 mg ml−1 and passed through 0.2 μm filters immediately. Four ml of filtered Congo Red was added per L (final concentration: 80 μg/ml) to generate LB-CR agar medium. The overnight cultured cultures were diluted to an OD600 of 1.0. 10 μl cultures were plated on LB-CR agar with appropriate antibiotics, followed by incubation at 30° C. for 24 hours.
Biofilms were determined using the sterile coverslip cultured method with minor modifications (Xu et al., Molec. Microbiol. 89:929-948, 2013). Briefly, for biofilm formation, 18 mm glass coverslips were added into the 12-well polystyrene cell culture plates. One ml of pre-cultured cells were inoculated into the plates at an OD600 of 0.05, and incubated without shaking for 24 h at 30° C. To quantify biofilm formation the culture supernatants were removed and the coverslips were washed twice in ddH2O. The remaining attached bacteria were stained by 0.1% (w/v) crystal violet for 10 min and washed twice in ddH2O. Biomass adhering to the coverslip was extracted with 1 ml of 33% acetic acid to solubilize the biofilm. The optical density (OD) of 150 μl of dilution cultures was measured at OD595 nm (A595) in a microplate reader.
A reliable induction system is required to achieve precise control of gene expression. To develop effective tools for A. tumefaciens, six candidate inducible expression systems (the β-d-1-thiogalactopyranoside (IPTG-), cumic acid (Cum-), vanillic acid (Van-), arabinose (Ara-), naringenin (Nar-), and sodium salicylate (Sal-) inducible systems) were chosen for study. These systems exhibit low background expression and a large dynamic range in E. coli (Meyer et al., Nature Chemical Biology, 15:196-204, 2019). An ori from the broad host-range plasmid pBBR1, which propagates in both E. coli and A. tumefaciens, was utilized for the origin of replication (Szpirer et al., J Bacteriol. 183:2101-2110, 2001). To evaluate the performance of these inducible systems, a green fluorescence protein gene (sfgfp) was utilized as a reporter gene, and placed under the control of the corresponding inducible promoters, Ptac, Pcym, Pvan, PBAD, Pug and Psal, respectively. The repressor genes of the inducible systems (lacI, cymR, vanR, araE, ttgR and nahR) were driven by a common constitutive promoter (Pcon). Schematic diagrams of these inducible systems are shown in
To evaluate these induction systems, green fluorescence intensities of cells transformed with the plasmids were measured at various concentrations of inducers, ranging from 10−3 to 103 μM. As shown in
An “amplifier module” was constructed to increase controllable expression in A. tumefaciens. To enable expression amplification, two VirG variants from the plasmids pTiC58 and pTiBO542 were utilized. These were designated CN and BN respectively, and inserted under the control of the cumic acid-inducible promoter Pcym (
To enable versatile applications of inducible gene expression system, promoters that are controlled by VirGN54D in the amplification module were identified. MEME, a program for ab initio identification of novel motifs (Bailey et al., Nucleic Acids Research, 37(suppl_2), W202-W208, 2009), was utilized to identify VirG-controlled promoters in the upstream regions of all upregulated genes in the plasmid pTiBo542. The consensus VirG-binding motif, RTTDCAWWTGHAAY (SEQ ID NO:47), with up to three mismatches allowed (Haryono et al., Frontiers In Microbiology, 10:1554, 2019) was used in the search, which resulted in 15 putative promoters as shown in
Notably, the promoters of virA, virB, virC, virD, virE, virG, and repABC operons have the consensus VirG-binding motif, consistent with the previous reports that these genes are activated by VirG (Cho & Winans, PNAS 102:14843-14848, 2005). Other putative promoters PBO100, PBO114, and PBO210 also contain the VirG-binding motif, indicating that these promoters may also be controlled by VirG.
To validate these promoters they were cloned from the plasmid pTiBO542 and placed upstream of sfgfp for fluorescence-based quantification. The results (
Complex biosynthetic pathways may require a coordinated, fine balance of expression of individual genes in order to achieve optimal performance. Thus strategies were developed for gene expression fine-tuning in A. tumefaciens. Using simple sequence repeats in the spacer region between the ribosome-binding site and the start codon (ATG) is a simple and effective approach to tune gene expression in E. coli (Egbert & Klavins, PNAS 109:16817-16822, 2012).
To test the feasibility of this approach for modulating translation in A. tumefaciens, various lengths of AT sequence repeats ((AT)0-(AT)10) were inserted in the spacer region between the promoter PvirB and the fluorescence reporter gene sfgfp (
The results showed that altering AT repeats can robustly and predictably tune gene expression levels over a 100-fold range. To demonstrate the application of this fine-tuning strategy, three constructs, PvirB-(AT)0, PvirB-(AT)6 and PvirB-(AT)8 were to drive the expression of pleD which encodes the protein that positively regulates UPP polysaccharide synthesis and biofilm formation in A. tumefaciens (Hengge, Nature Rev. Microbiol. 7:263-273, 2009; Xu et al., Molec. Immunol. 89:929-948, 2013). Strains carrying with the plasmids PvirB-(AT)0-PleD, PvirB-(AT)6-pleD, and PvirB-(AT)8-pleD and tested their polysaccharide production. Visible Congo Red dye staining was used to evaluate the production of polysaccharides, because the intensity of red staining is proportional to the amount of Congo Red-reactive polysaccharide produced (Xu et al., 2013). As shown in
Site-specific mutagenesis of promoters is another powerful strategy to fine-tune gene expression (Qin et al., Appl Environ Microbiol. 77:3600-3608, 2011). As shown in
To test this possibility, the promoter Persi was utilized as a template and mutations were introduced into its binding site while conserving the core region (CAATTG; SEQ ID NO:63) and randomizing other sites, yielding RYTNCAATTGNAAY (SEQ ID NO:64; R=A or G; Y=C or T; N=A, T, G or C) (
Gene clusters such as those produce complex metabolic pathways often involve multiple genes that are expressed at different levels for an optimal realization of function. To demonstrate the utilization of the described controlled gene expression toolkit, the feasibility of using a single inducer to simultaneously regulate multiple genes with different expression was studied. The virB promoter library, the cumic acid-based induction system, and the CN-based amplification module was utilized to control three pairs of PvirB promoter variants with differential activities for driving the expression of the two fluorescence reporter genes, sfgfp and mkate2.
As shown in
To further illustrate application of the toolkit two sets of controllable expression systems to simultaneously drive pleD and sfgfp (
This application claims the benefit of U.S. provisional Application No. 63/215,249, filed Jun. 25, 2021, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63215249 | Jun 2021 | US |