The present invention relates to expression enhancers that are active in plants. The present invention also relates to the expression of proteins of interest in plants and provides methods and compositions for production of proteins of interest in plants.
Plants offer great potential as production systems for recombinant proteins. One approach to producing foreign proteins in plants is to generate stable transgenic plant lines. However this is a time consuming and labor intensive process. An alternative to transgenic plants is the use of plant virus-based expression vectors. Plant virus-based vectors allow for the rapid, high level, transient expression of proteins in plants.
High level transient expression of foreign proteins in plants has been obtained using of vectors based on RNA plant viruses, including comoviruses, such as Cowpea mosaic virus (CPMV; see, for example, WO2007/135480; WO2009/087391; US 2010/0287670, Sainsbury F. et al., 2008, Plant Physiology; 148: 121-1218; Sainsbury F. et al., 2008, Plant Biotechnology Journal; 6: 82-92; Sainsbury F. et al., 2009, Plant Biotechnology Journal; 7: 682-693; Sainsbury F. et al. 2009, Methods in Molecular Biology, Recombinant Proteins From Plants, vol. 483: 25-39).
Modifications of the 5′UTR of the RNA-2 of the comovirus cowpea mosaic virus (CPMV) have resulted in additional expression enhancer activity (as determined level of expression of a nucleic acid of interest or a protein of interest), when compared to the wild type CPMV 5′UTR. For example, mutation of the start codon at position 161 in a CPMV RNA-2 vector (U162C; HT) increases the levels of expression of a protein encoded by a sequence inserted after the start codon at position 512. This permits the production of high levels of foreign proteins without the need for viral replication and was termed the CPMV-HT system (WO2009/087391; Sainsbury and Lomonossoff, 2008, Plant Physiol. 148, 1212-1218). In pEAQ expression plasmids (Sainsbury et al., 2009, Plant Biotechnology Journal, 7, pp 682-693; US 2010/0287670), the sequence to be expressed is positioned between the 5′UTR and the 3′ UTR. The 5′UTR in the pEAQ series carries the U162C (HT) mutation.
Additional modification of the CPMV 5′ UTR region, have been described that further increase expression of a nucleic acid of interest within a plant. For example, “CMPV HT+” (comprising nucleotides 1-160 of the CPMV 5′ UTR with modified ATGs at position 115-117, and at position 161-163; WO2015/143567; which is incorporated herein by reference), and “CPMVX” (where X=160, 155, 150, or 114 nucleic acids in length; WO 2015/103704; which is incorporated herein by reference). An example of CMPVX is the expression enhancer “CPMV 160”. Expression of a nucleic acid sequence operatively linked to CPMV HT+” resulted in a significant increase in production of a protein of interest that was encoded by the nucleic acid sequence, when compared to the production of the same protein of interest using the same nucleic acid sequence operatively linked to the “CPMV HT” expression enhancer (see FIGS. 2 and 3 of WO2015/143567). Furthermore, expression of a nucleic acid sequence operatively linked to the “CPMV 160” expression enhancer resulted in a significant increase in production of a protein of interest encoded by the nucleic acid sequence, when compared to the production of the same protein of interest using the same nucleic acid sequence operatively linked to the “CPMV HT+” expression enhancer (see FIGS. 2 and 3 of WO2015/143567).
Diamos et. al (Frontiers in Plant Science. 2016, vol 7 pp. 1-15; which is incorporated herein by reference) describe several expression enhancers that may be used to increase production of proteins in plants (see Table 2 of Diamos et. al.), including the expression enhancer NbPsaK2 3′. As shown in FIG. 4 of Diamos et. al. (2016), production of protein of interest encoded by a nucleci acid that was operatively linked to NbPsaK2 3′ resulted in enhanced protein production when compared to the production of the same protein encoded by the same nucleic acid sequence operatively linked to other truncated psaK expression enhancers.
The present invention relates to expression enhancers that are active in plants. The present invention also relates to the expression of proteins of interest in plants, and provides methods and compositions for the production of proteins of interest in plants.
It is an object of the invention to provide an improved expression enhancer active in a plant.
According to the present invention there is provided an isolated expression enhancer active in a plant, the expression enhancer selected from the group consisting of:
nbEPI42 (SEQ ID NO:1);
nbSNS46 (SEQ ID NO:2);
nbCSY65 (SEQ ID NO:3);
nbHEL40 (SEQ ID NO:4);
nbSEP44 (SEQ ID NO:5); and
a nucleic acid having from 90-100% sequence identity to the nucleotide sequence set forth in any one of SEQ ID NO's:1-5. Wherein, the expression enhancer, when operatively linked to a nucleic acid of interest, for example a heterologous nucleic acid of interest, results in expression of the nucleic acid of interest. Additionally, the expression enhancer, when operatively linked to a nucleic acid of interest, for example a heterologous nucleic acid of interest, may increase the level of expression of the nucleic acid of interest, or the heterologous nucleic acid of interest, when compared to the level of expression of the same nucleic acid or heterologous nucleic acid of interest that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6).
The present disclosure also provides for a nucleic acid sequence comprising one of the isolated expression enhancers as described above, the expression enhancer operatively linked with a heterologous nucleotide sequence encoding a protein of interest. The heterologous nucleotide sequence may encode a viral protein or an antibody, for example which is not to be considered limiting, the viral protein may be an influenza protein or a norovirus protein. If the protein of interest is an influenza protein then it may include M2, a hemagglutinin protein selected from the group of H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, an influenza type B hemagglutinin, or a combination thereof. If the protein of interest is a norovirus protein, then it may include a VP1 protein, a VP2 protein, or a combination thereof, selected from the group of GI.1, GI.2, GI.3, GI.5, GI.7, GII.1, GII.2, GII.3, GII.4, GII.5, GII.6, GII.7, GII.12, GII.13, GII.14, GII.17 and GII.21.
The present invention also provides a plant expression system comprising one or more than one of the nucleic acid sequence described above. The plant expression system may further comprise a comovirus 3′ UTR.
The present invention also provides a plant expression system comprising one or more than one of the isolated nucleic acid sequence operatively linked with a heterologous nucleic acid, or nucleotide sequence, as described above. The plant expression system may further comprise a comovirus 3′ UTR.
Also disclosed herein is a method of producing a protein of interest in a plant or in a portion of a plant comprising, introducing into the plant or in the portion of a plant the plant expression system as described above, comprising the one or more than one of nucleic acid sequence, and incubating the plant, the portion of a plant, or plant cell, under conditions that permit expression of each of the heterologous nucleotide sequence encoding the protein of interest. For example, the protein of interest may be a viral protein, such as an influenza protein or a norovirus protein. If the protein of interest is an influenza protein then it may include M2, a hemagglutinin protein selected from the group of H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, an influenza type B hemagglutinin, and a combination thereof. If the protein of interest is a norovirus protein, then it may include a VP1 protein, a VP2 protein, or a combination thereof, selected from the group of GI.1, GI.2, GI.3, GI.5, GII.1, GII.2, GII.3, GII.4, GII.5, GII.6, GII.7, GII.12, GII.13, GII.14, GII.17 and GII.21.
A method of producing a multimeric protein of interest, is also described herein. The method involves co-expressing two or more than two of the nucleic acid sequence as described above, in a plant, the portion of a plant, or plant cell, in a transient or stable manner, wherein each of the two or more than two of the nucleic acid sequence encodes a component of the multimeric protein, and incubating the plant, the portion of a plant, or plant cell, under conditions that permit expression of each of the heterologous nucleotide sequence encoding the multimeric protein of interest.
Also provided herein is a plant, a portion of a plant, or plant cell that is transiently transformed, or stably transformed, with plant expression system as described above.
A plant-based expression system comprising an expression enhancer as described herein results in expression of the nucleic acid of interest. Furthermore, the plant-based expression system comprising an expression enhancer as described herein may result in increasing or enhancing expression of a nucleotide sequence encoding a heterologous open reading frame that is operatively linked to the expression enhancer, as described herein. The increase in expression may be determined by comparing the level of expression obtained using the expression enhancer as described herein with the level of expression of the same nucleotide sequence encoding the heterologous open reading frame but not operatively linked to an expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO: 6).
The plant based expression systems, vectors, constructs and nucleic acids comprising one or more than one expression enhancer as described herein may also have a number of properties such as, for example, containing convenient cloning sites for genes or nucleotide sequences of interest, they may be used to easily transform plants in a cost-effective manner, they may cause efficient local or systemic transformation of inoculated plants. In addition, the transformation of a plant should provide a good yield of useful protein material.
This summary of the invention does not necessarily describe all features of the invention.
These and other features of the invention will become more apparent from the following description in which reference is made to the appended drawings wherein:
The following description is of a preferred embodiment.
As used herein, the terms “comprising,” “having,” “including” and “containing,” and grammatical variations thereof, are inclusive or open-ended and do not exclude additional, un-recited elements and/or method steps. The term “consisting essentially of” when used herein in connection with a use or method, denotes that additional elements and/or method steps may be present, but that these additions do not materially affect the manner in which the recited method or use functions. The term “consisting of” when used herein in connection with a use or method, excludes the presence of additional elements and/or method steps. A use or method described herein as comprising certain elements and/or steps may also, in certain embodiments, consist essentially of those elements and/or steps, and in other embodiments consist of those elements and/or steps, whether or not these embodiments are specifically referred to. In addition, the use of the singular includes the plural, and “or” means “and/or” unless otherwise stated. The term “plurality” as used herein means more than one, for example, two or more, three or more, four or more, and the like. Unless otherwise defined herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. As used herein, the term “about” refers to an approximately +/−10% variation from a given value. It is to be understood that such a variation is always included in any given value provided herein, whether or not it is specifically referred to. The use of the word “a” or “an” when used herein in conjunction with the term “comprising” may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one” and “one or more than one.”
The term “plant”, “portion of a plant”, “plant portion’, “plant matter”, “plant biomass”, “plant material”, plant extract”, or “plant leaves”, as used herein, may comprise an entire plant, tissue, cells, or any fraction thereof, intracellular plant components, extracellular plant components, liquid or solid extracts of plants, or a combination thereof, that are capable of providing the transcriptional, translational, and post-translational machinery for expression of one or more than one nucleic acids described herein, and/or from which an expressed protein of interest or VLP may be extracted and purified. Plants may include, but are not limited to, agricultural crops including for example canola, Brassica spp., maize, Nicotiana spp., (tobacco) for example, Nicotiana benthamiana, Nicotiana rustica, Nicotiana, tabacum, Nicotiana alata, Arabidopsis thaliana, alfalfa, potato, sweet potato (Ipomoea batatus), ginseng, pea, oat, rice, soybean, wheat, barley, sunflower, cotton, corn, rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), safflower (Carthamus tinctorius).
The term “plant portion”, as used herein, refers to any part of the plant including but not limited to leaves, stem, root, flowers, fruits, callus tissue or cell cultured plant tissue, a cluster of plant cells, a plant cell, for example a plant cell, cluster of plants cells callus or cultured plant tissue obtained from leaves, stem, root, flowers, fruits, a plant extract obtained from leaves, stem, root, flowers, fruits, or a combination thereof. The term plant cell refers to a cell of plant that is bounded by a plasma membrane and may or may not comprise a cell wall. A plant cell includes a protoplast (or spheroplast) that comprises an enzymatically digested cell and that may be obtained using techniques well known in the art (e.g. Davey M R et al., 2005, Biotechnology Advances 23:131-171; which is incorporated herein by reference). Callus plant tissue or cultured plant tissue may be produced using methods well known in the art (e.g. M K Razdan 2nd Ed., Science Publishers, 2003; which is incorporated herein by reference) The term “plant extract”, as used herein, refers to a plant-derived product that is obtained following treating a plant, a portion of a plant, a plant cell, or a combination thereof, physically (for example by freezing followed by extraction in a suitable buffer), mechanically (for example by grinding or homogenizing the plant or portion of the plant followed by extraction in a suitable buffer), enzymatically (for example using cell wall degrading enzymes), chemically (for example using one or more chelators or buffers), or a combination thereof. A plant extract may be further processed to remove undesired plant components for example cell wall debris. A plant extract may be obtained to assist in the recovery of one or more components from the plant, portion of the plant or plant cell, for example a protein (including protein complexes, protein surprastructures and/or VLPs), a nucleic acid, a lipid, a carbohydrate, or a combination thereof from the plant, portion of the plant, or plant cell. If the plant extract comprises proteins, then it may be referred to as a protein extract. A protein extract may be a crude plant extract, a partially purified plant or protein extract, or a purified product, that comprises one or more proteins, protein complexes, protein suprastructures, and/or VLPs, from the plant tissue. If desired a protein extract, or a plant extract, may be partially purified using techniques known to one of skill in the art, for example, the extract may be subjected to salt or pH precipitation, centrifugation, gradient density centrifugation, filtration, chromatography, for example, size exclusion chromatography, ion exchange chromatography, affinity chromatography, or a combination thereof. A protein extract may also be purified, using techniques that are known to one of skill in the art.
By “nucleotide (or nucleic acid) sequence of interest”, or “coding region of interest”, it is meant any nucleotide sequence, or coding region (these terms may be used interchangeably) that is to be expressed within a plant, portion of a plant, or a plant cell, to produce a protein of interest. Such a nucleotide sequence of interest may encode, but is not limited to, native or modified proteins, an industrial enzyme or a modified industrial enzyme, an agricultural protein or a modified agricultural protein, a helper protein, a protein supplement, a pharmaceutically active protein, a nutraceutical, a value-added product, or a fragment thereof for feed, food, or both feed and food use.
The protein of interest may comprise a native, or a non-native signal peptide; the non-native signal peptide may be of plant origin. For example, which is not to be considered limiting, the non-native signal peptide may be obtained from alfalfa protein disulfide isomerase (PDI SP; nucleotides 32-103 of Accession No. Z11499), potato patatin (PatA SP; located nucleotides 1738-1806 of GenBank Accession number A08215), Kiwi actinidin (Act), Tobacco cysteine proteinase 3 precursor (CP23), Corn ΔZein (ΔZein), Papaya proteinase I (Papain; Pap) and Thale cress cysteine proteinase RD21A (RD21). The native signal peptide may correspond to that of the protein of interest being expressed.
The nucleotide sequence of interest, or coding region of interest may also include a nucleotide sequence that encodes a pharmaceutically active protein, for example growth factors, growth regulators, antibodies, antigens, and fragments thereof, or their derivatives useful for immunization or vaccination and the like. Such proteins include, but are not limited to a protein that is a human pathogen, a viral protein, for example but not limited to virus like particle (VLP)-forming antigens, one or more proteins from Norovirus, Respiratory syncytial virus (RSV), Rotavirus, influenza virus, human immunodeficiency virus (HIV), Rabies virus, human papiloma virus (HPV), Enterovirus 71 (EV71), or interleukins, for example one or more than one of IL-1 to IL-24, IL-26 and IL-27, cytokines, Erythropoietin (EPO), insulin, G-CSF, GM-CSF, hPG-CSF, M-CSF or combinations thereof, interferons, for example, interferon-alpha, interferon-beta, interferon-gama, blood clotting factors, for example, Factor VIII, Factor IX, or tPA hGH, receptors, receptor agonists, antibodies for example but not limited to rituximab, neuropolypeptides, insulin, vaccines, growth factors for example but not limited to epidermal growth factor, keratinocyte growth factor, transformation growth factor, growth regulators, antigens, autoantigens, fragments thereof, or combinations thereof.
The protein of interest may include an influenza hemagglutinin (HA; see WO 2009/009876, WO 2009/076778, WO 2010/003225, which are incorporated herein by reference). HA is a homotrimeric membrane type I glycoprotein, generally comprising a signal peptide, an HA1 domain, and an HA2 domain comprising a membrane-spanning anchor site at the C-terminus and a small cytoplasmic tail. Nucleotide sequences encoding HA are well known and are available (see, for example, the BioDefense and Public Health Database (Influenza Research Database; Squires et al., 2008, Nucleic Acids Research 36:D497-D503) at URL: biohealthbase.org/GSearch/home.do?decorator=Influenza; or the databases maintained by the National Center for Biotechnology Information (see URL: ncbi.nlm.nih.gov), both of which are incorporated herein by reference).
An HA protein may be of a type A influenza, a type B influenza, or is a subtype of type A influenza HA selected from the group of H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16. H17 and H18. In some aspects of the invention, the HA may be from a type A influenza, selected from the group H1, H2, H3, H5, H6, H7 and H9. Fragments of the HAs listed above may also be considered a protein of interest. Furthermore, domains from an HA type or subtype listed above may be combined to produce chimeric HA's (see for example WO2009/076778 which is incorporated herein by reference).
Examples of subtypes comprising HA proteins include A/New Caledonia/20/99 (H1N1), A/Indonesia/5/2006 (H5N1), A/PuertoRico/8/34 (H1N1), A/Brisbane/59/2007 (H1N1), A/Solomon Islands 3/2006 (H1N1), A/California/04/2009 (H1N1), A/California/07/2009 (H1N1), A/chicken/New York/1995, A/Singapore/1/57 (H2N2), A/herring gull/DE/677/88 (H2N8), A/Texas/32/2003, A/mallard/MN/33/00, A/duck/Shanghai/1/2000, A/northern pintail/TX/828189/02, A/Brisbane 10/2007 (H3N2), A/Wisconsin/67/2005 (H3N2), A/Victoria/361/2011 (H3N2), A/Texas/50/2012 (H3N2), A/Hawaii/22/2012 (H3N2), A/New York/39/2012 (H3N2), A/Perth/16/2009 (H3N2), C/Johannesburg/66, A/Anhui/1/2005 (H5N1), A/Vietnam/1194/2004 (H5N1), A/Teal/HongKong/W312/97 (H6N1), A/Equine/Prague/56 (H7N7), A/Turkey/Ontario/6118/68 (H8N4), H7 A/Hangzhou/1/2013, A/Anhui/1/2013 (H7N9), A/Shanghai/2/2013 (H7N9), A/shoveler/Iran/G54/03, A/HongKong/1073/99 (H9N2), A/chicken/Germany/N/1949 (H10N7), A/duck/England/56 (H11N6), A/duck/Alberta/60/76 (H12N5), A/Gull/Maryland/704/77 (H13N6), A/Mallard/Gurjev/263/82, A/duck/Australia/341/83 (H15N8), A/black-headed gull/Sweden/5/99 (H16N3), B/Malaysia/2506/2004, B/Florida/4/2006, B/Brisbane/60/08, B/Massachusetts/2/2012-like virus (Yamagata lineage), B/Wisconsin/1/2010 (Yamagata lineage), or B/Lee/40.
The HA may also be a modified or chimeric HA, for example, the native transmembrane domain of the HA may be replaced with a heterologous transmembrane domain (WO 2010/148511; which is incorporated herein by reference), the HA may comprise a chimeric ectodomain (WO2012/083445, which is incorporated herein by reference), or the HA may comprises a proteolytic loop deletion (WO 2014/153647, which is incorporated herein by reference).
The protein of interest may also include a norovirus protein or a modified norovirus protein as described in WO2018/170603; which is incorporated herein by reference), or U.S. provisional application 62/593,006 (filed Nov. 11, 2017; which is incorporated herein by reference). Norovirus is a non-enveloped viral strain of the genus norovirus of the family Caliciviridae that is characterized as having a single-stranded, positive-sense RNA. Norovirus strains may include any known norovirus strain, but also modifications to known norovirus strains that are known to develop on a regular basis over time. For example, norovirus strains may include, GI.1, GI.2, GI.3, GI.5, GI.7, GII.1, GII.2, GII.3, GII.4, GII.5, GII.6, GII.7, GII.12, GII.13, GII.14, GII.17 and GII.21, for example but not limited to Hu/GI.1/United States/Norwalk/1968, Hu/GI.2/Leuven/2003/BEL, Hu/GI.3/S29/2008/Lilla Edet/Sweden, Hu/GI.5/Siklos/Hun5407/2013/HUN, Hu/GII.1/Ascension208/2010/USA, Hu/GII.2/CGMH47/2011/TW, Hu/GII.3/Jingzhou/2013402/CHN, Hu/GII.4/Sydney/NSW0514/2012/AU, US96/GII.4/Dresden174/1997/DE AY741811, FH02/GII.4/FarmingtonHills/2002/US AY502023, Hnt04:GII.4/Hunter-NSW504D/2004/AU_DQ078814, 2006b: GII.4/Shellharbour-NSW696T/2006/AU_EF684915, N009: GII.4/Orange-NSW001P/2008/AU_GQ845367, Hu/GII.5/AlbertaEI390/2013/CA, Hu/GII.6/Ohio/490/2012/USA, GII.7/Musa/2010/A1173774, Hu/GII.12/H5206/2010/USA, GII.13/VA173/2010/H9AWU4, GII.14_Saga_2008_JPN_ADE28701 native VP1, Hu/GII.17/Kawasaki323/2014/JP, and Hu/GII.21/Salisbury150/2011/USA. Norovirus strains also include strains having from about 30-100% or any amount therebetween, amino acid sequence identity, to the VP1 protein, the VP2 protein, or both the VP1 and the VP2 proteins, with any of the above norovirus strains.
The terms “percent similarity”, “sequence similarity”, “percent identity”, or “sequence identity”, when referring to a particular sequence, are used for example as set forth in the University of Wisconsin GCG software program, or by manual alignment and visual inspection (see, e.g., Current Protocols in Molecular Biology, Ausubel et al., eds. 1995 supplement). Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, using for example the algorithm of Smith & Waterman, (1981, Adv. Appl. Math. 2:482), by the alignment algorithm of Needleman & Wunsch, (1970, J. Mol. Biol. 48:443), by the search for similarity method of Pearson & Lipman, (1988, Proc. Natl. Acad. Sci. USA 85:2444), by computerized implementations of these algorithms (for example: GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis.).
An example of an algorithm suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al., (1977, Nuc. Acids Res. 25:3389-3402) and Altschul et al., (1990, J. Mol. Biol. 215:403-410), respectively. BLAST and BLAST 2.0 are used, with the parameters described herein, to determine percent sequence identity for the nucleic acids and proteins of the invention. For example the BLASTN program (for nucleotide sequences) may use as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=−4 and a comparison of both strands. For amino acid sequences, the BLASTP program may use as defaults a word length of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, 1989, Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (see URL: ncbi.nlm.nih.gov/).
The term “nucleic acid segment” as used herein refers to a sequence of nucleic acids that encodes a protein of interest. In addition to the sequence of nucleic acids, the nucleic acid segment comprise a regulatory region and a terminator that are operatively linked to the sequence of nucleic acids. The regulatory region may comprise a promoter, and an enhancer element (expression enhancer) operatively linked to the promoter.
The term “nucleic acid complex” as used herein refers to a combination of two or more than two nucleic acid segments. The two or more than two nucleic acid segments may be present in a single nucleic acid, so that the nucleic acid complex comprises two, or more than two nucleic acid segments, with each nucleic acid segment under the control of a regulatory region and a terminator. Alternatively, the nucleic acid complex may comprise two or more separate nucleic acids, each of the nucleic acids comprising one or more than one nucleic acid segment, where each nucleic acid segment is under the control of a regulatory region and a terminator. For example a nucleic acid complex may comprise one nucleic acid that comprises two nucleic acid segments, a nucleic acid complex may comprise two nucleic acids, each nucleic acid comprising one nucleic acid segment, or a nucleic acid complex may comprise two or more than two nucleic acids, with each nucleic acid comprising one or more than one nucleic acid segment.
The terms “vector” or “expression vector” as used herein, refer to a recombinant nucleic acid for transferring exogenous nucleic acid sequences into host cells (e.g. plant cells) and directing expression of the exogenous nucleic acid sequences in the host cells. The vector may be introduced to the plant, the portion of the plant, or a plant cell, directly, or the vectors may be introduced in the plant, the portion of the plant, or a plant cell as part of a plant expression system. The vector or expression vector comprise a construct or an expression construct. The construct or expression construct comprises a nucleotide sequence comprising a nucleic acid of interest under the control of, and operably (or operatively) linked to, an appropriate promoter, an expression enhancer, or other regulatory elements for transcription of the nucleic acid of interest in a host cell. As one of skill in the art would appreciate, the construct or expression cassette may comprise a termination (terminator) sequence that is any sequence that is active in the plant host. For example the termination sequence may be derived from the RNA-2 genome segment of a bipartite RNA virus, e.g. a comovirus, the termination sequence may be a NOS terminator, the terminator sequence may be obtained from the 3′UTR of the alfalfa plastocyanin gene, or a combination thereof.
The constructs of the present disclosure may further comprise a 3′ untranslated region (UTR). A 3′ untranslated region contains a polyadenylation signal and any other regulatory signals capable of effecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by effecting the addition of polyadenylic acid tracks to the 3′ end of the mRNA precursor. Polyadenylation signals are commonly recognized by the presence of homology to the canonical form 5′ AATAAA-3′ although variations are not uncommon. Non-limiting examples of suitable 3′ regions are the 3′ transcribed non-translated regions containing a polyadenylation signal of Agrobacterium tumor inducing (Ti) plasmid genes, such as the nopaline synthase (Nos gene) and plant genes such as the soybean storage protein genes, the small subunit of the ribulose-1, 5-bisphosphate carboxylase gene (ssRUBISCO; U.S. Pat. No. 4,962,028; which is incorporated herein by reference), the promoter and/or terminator used in regulating plastocyanin expression.
For example, which is not to be considered limiting, a CPMV 3′UTR+NOS terminator may be used as a 3′UTR sequence that is operatively linked to the 3′ end of the nucleic acid sequence encoding the protein of interest.
By “regulatory region” “regulatory element” or “promoter” it is meant a portion of nucleic acid typically, but not always, upstream of the protein coding region of a gene, which may be comprised of either DNA or RNA, or both DNA and RNA. When a regulatory region is active, and in operative association, or operatively linked, with a nucleotide sequence of interest, this may result in expression of the nucleotide sequence of interest. A regulatory element may be capable of mediating organ specificity, or controlling developmental or temporal gene activation. A “regulatory region” includes promoter elements, core promoter elements exhibiting a basal promoter activity, elements that are inducible in response to an external stimulus, elements that mediate promoter activity such as negative regulatory elements or transcriptional enhancers. “Regulatory region”, as used herein, also includes elements that are active following transcription, for example, regulatory elements that modulate gene expression such as translational and transcriptional enhancers, translational and transcriptional repressors, upstream activating sequences, and mRNA instability determinants. Several of these latter elements may be located proximal to the coding region.
In the context of this disclosure, the term “regulatory element” or “regulatory region” typically refers to a sequence of DNA, usually, but not always, upstream (5′) to the coding sequence of a structural gene, which controls the expression of the coding region by providing the recognition for RNA polymerase and/or other factors required for transcription to start at a particular site. However, it is to be understood that other nucleotide sequences, located within introns, or 3′ of the sequence may also contribute to the regulation of expression of a coding region of interest. An example of a regulatory element that provides for the recognition for RNA polymerase or other transcriptional factors to ensure initiation at a particular site is a promoter element. Most, but not all, eukaryotic promoter elements contain a TATA box, a conserved nucleic acid sequence comprised of adenosine and thymidine nucleotide base pairs usually situated approximately 25 base pairs upstream of a transcriptional start site. A promoter element may comprise a basal promoter element, responsible for the initiation of transcription, as well as other regulatory elements that modify gene expression.
There are several types of regulatory regions, including those that are developmentally regulated, inducible or constitutive. A regulatory region that is developmentally regulated or controls the differential expression of a gene under its control, is activated within certain organs or tissues of an organ at specific times during the development of that organ or tissue. However, some regulatory regions that are developmentally regulated may preferentially be active within certain organs or tissues at specific developmental stages, they may also be active in a developmentally regulated manner, or at a basal level in other organs or tissues within the plant as well. Examples of tissue-specific regulatory regions, for example seed-specific a regulatory region, include the napin promoter, and the cruciferin promoter (Rask et al., 1998, J. Plant Physiol. 152: 595-599; Bilodeau et al., 1994, Plant Cell 14: 125-130). An example of a leaf-specific promoter includes the plastocyanin promoter (see U.S. Pat. No. 7,125,978, which is incorporated herein by reference).
An inducible regulatory region is one that is capable of directly or indirectly activating transcription of one or more DNA sequences or genes in response to an inducer. In the absence of an inducer the DNA sequences or genes will not be transcribed. Typically, the protein factor that binds specifically to an inducible regulatory region to activate transcription may be present in an inactive form, which is then directly or indirectly converted to the active form by the inducer. However, the protein factor may also be absent. The inducer can be a chemical agent such as a protein, metabolite, growth regulator, herbicide or phenolic compound or a physiological stress imposed directly by heat, cold, salt, or toxic elements or indirectly through the action of a pathogen or disease agent such as a virus. A plant cell containing an inducible regulatory region may be exposed to an inducer by externally applying the inducer to the cell or plant such as by spraying, watering, heating or similar methods. Inducible regulatory elements may be derived from either plant or non-plant genes (e.g. Gatz, C. and Lenk, I. R. P., 1998, Trends Plant Sci. 3, 352-358). Examples, of potential inducible promoters include, but not limited to, tetracycline-inducible promoter (Gatz, C., 1997, Ann. Rev. Plant Physiol. Plant Mol. Biol. 48, 89-108), steroid inducible promoter (Aoyama, T. and Chua, N. H., 1997, Plant J. 2, 397-404) and ethanol-inducible promoter (Salter, M. G., et al, 1998, Plant Journal 16, 127-132; Caddick, M. X., et al, 1998, Nature Biotech. 16, 177-180) cytokinin inducible IB6 and CKI1 genes (Brandstatter, I. and Kieber, J. J., 1998, Plant Cell 10, 1009-1019; Kakimoto, T., 1996, Science 274, 982-985) and the auxin inducible element, DRS (Ulmasov, T., et al., 1997, Plant Cell 9, 1963-1971).
A constitutive regulatory region directs the expression of a gene throughout the various parts of a plant and continuously throughout plant development. Examples of known constitutive regulatory elements include promoters associated with the CaMV 35S transcript. (p35S; Odell et al., 1985, Nature, 313: 810-812; which is incorporated herein by reference), the rice actin 1 (Zhang et al, 1991, Plant Cell, 3: 1155-1165), actin 2 (An et al., 1996, Plant J., 10: 107-121), or tms 2 (U.S. Pat. No. 5,428,147), and triosephosphate isomerase 1 (Xu et. al., 1994, Plant Physiol. 106: 459-467) genes, the maize ubiquitin 1 gene (Cornejo et al, 1993, Plant Mol. Biol. 29: 637-646), the Arabidopsis ubiquitin 1 and 6 genes (Holtorf et al, 1995, Plant Mol. Biol. 29: 637-646), the tobacco translational initiation factor 4A gene (Mandel et al, 1995 Plant Mol. Biol. 29: 995-1004); the Cassava Vein Mosaic Virus promoter, pCAS, (Verdaguer et al., 1996); the promoter of the small subunit of ribulose biphosphate carboxylase, pRbcS: (Outchkourov et al., 2003), the pUbi (for monocots and dicots).
The term “constitutive” as used herein does not necessarily indicate that a nucleotide sequence under control of the constitutive regulatory region is expressed at the same level in all cell types, but that the sequence is expressed in a wide range of cell types even though variation in abundance is often observed.
The constructs or expression constructs as described above may be present in a vector (or an expression vector). The vector may comprise border sequences which permit the transfer and integration of the expression cassette into the genome of the organism or host. The construct may be a plant binary vector, for example a binary transformation vector based on pPZP (Hajdukiewicz, et al. 1994). Other example constructs include pBin19 (see Frisch, D. A., L. W. Harris-Haller, et al. 1995, Plant Molecular Biology 27: 405-409).
A nucleotide sequence interest that encodes a protein requires the presence of a “translation initiation site” or “initiation site” or “translation start site” or “start site” or “start codon” located upstream of the gene to be expressed. Such initiation sites may be provided either as part of an enhancer sequence or as part of a nucleotide sequence encoding the protein of interest.
The term “native”, “native protein” or “native domain”, as used herein, refers to a protein or domain having a primary amino acid sequence identical to wildtype. Native proteins or domains may be encoded by nucleotide sequences having 100% sequence similarity to the wildtype sequence. A native amino acid sequence may also be encoded by a human codon (hCod) optimized nucleotide sequence or a nucleotide sequence comprising an increased GC content when compared to the wild type nucleotide sequence provided that the amino acid sequence encoded by the hCod-nucleotide sequence exhibits 100% sequence identity with the native amino acid sequence.
By a nucleotide sequence that is “human codon optimized” or an “hCod” nucleotide sequence, it is meant the selection of appropriate DNA nucleotides for the synthesis of an oligonucleotide sequence or fragment thereof that approaches the codon usage generally found within an oligonucleotide sequence of a human nucleotide sequence. By “increased GC content” it is meant the selection of appropriate DNA nucleotides for the synthesis of an oligonucleotide sequence or fragment thereof in order to approach codon usage that, when compared to the corresponding native oligonucleotide sequence, comprises an increase of GC content, for example, from about 1 to about 30%, or any amount therebetween, over the length of the coding portion of the oligonucleotide sequence. For example, from about 1, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30%, or any amount therebetween, over the length of the coding portion of the oligonucleotide sequence. As described below, a human codon optimized nucleotide sequence, or a nucleotide sequence comprising an increased GC content (when compared to the wild type nucleotide sequence) exhibits increased expression within a plant, portion of a plant, or a plant cell, when compared to expression of the non-human optimized (or lower GC content) nucleotide sequence.
The term “single construct” or “single constructs”, as used herein, refers to a nucleic acid comprising a single nucleic acid sequence. The term “dual construct” or “dual constructs”, as used herein, refers to a nucleic acid comprising two nucleic acid sequences.
By co-expression it is meant the introduction and expression of two or more nucleotide sequences, each of the two or more nucleotide sequences encoding a protein of interest, or a fragment of a protein of interest within a plant, portion of a plant or a plant cell. The two or more nucleotide sequences may be introduced into the plant, portion of the plant or the plant cell within one vector, so that each of the two or more nucleotide sequences is under the control of a separate regulatory region (e.g. comprising a dual construct). Alternatively, the two or more nucleotide sequences may be introduced into the plant, portion of the plant or the plant cell within separate vectors (e.g. comprising single constructs), and each vector comprising appropriate regulatory regions for the expression of the corresponding nucleic acid. For example, two nucleotide sequences, each on a separate vector and introduced into separate Agrobacterium tumefaciens hosts, may be co-expressed by mixing suspensions of each A. tumefaciens host in a desired volume (for example, an equal volume, or the ratios of each A. tumefaciens host may be altered) before vacuum infiltration. In this manner, co-infiltration of multiple A. tumifaciens suspensions permits co-expression of multiple transgenes.
The nucleic acid encoding a protein of interest as described herein may further comprise sequences that enhance expression of the protein of interest in the plant, portion of the plant, or plant cell. Sequences that enhance expression are described herein and for example, may include one or more of, an expression enhancer element obtained from a nucleic acid encoding a secretory protein (SPEE) or an expression enhancer element obtained from a nucleic acid encoding a cytosolic protein (CPEE), in operative association with the nucleic acid encoding the protein of interest. Non-limiting examples of using the expression enhancer as described herein for the expression of a secreted protein includes any protein of interest comprising a signal peptide or signal sequence that targets the protein of interest to the extracellular compartment, for example an antibody (see
The sequence encoding the protein of interest may also be optimized for human codon usage, increased GC content, or a combination thereof. Co-expression of a nucleic acid encoding a second protein of interest may lead to functional multimeric protein, for example an antibody comprising heavy and light chain components, or to an increased yield of protein. If the protein of interest results in the production of a VLP, then co-expression of two or more proteins may result in an increase yield, increased density, increased integrity, or combination thereof, of the VLPs that comprise the protein of interest. The increase in yield, density, integrity, or combination thereof, may be determined by comparing the yield, density, integrity, or combination thereof, obtained using the expression enhancer as described herein with the yield, density, integrity, or combination thereof, of the same nucleotide sequence encoding the heterologous open reading frame but not operatively linked to an expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6).
A plant expression system comprising a nucleic acid comprising a regulatory region, operatively linked with one or more than one expression enhancer as described herein and a nucleotide sequence of interest is also provided. The plant expression system may comprise one or more than one vector, one or more than one construct or one or more than one nucleic acid, that comprises the regulatory region operatively linked with one or more than one expression enhancer as described herein and the nucleotide sequence or nucleic acid of interest, along with other components that may be introduced into the plant, the portion of the plant or a plant cell. For example, the plant expression system may also comprise additional vectors, constructs, or nucleic acids, additional Agrobacteria comprising vectors, constructs or nucleic acids for co-expression, one or more than one chemical compound to modify the efficiency of transformation, other components, or a combination thereof.
Furthermore, a nucleic acid comprising a promoter (regulatory region) sequence, operatively linked with an expression enhancer comprising an expression enhancer as described herein, and a nucleotide sequence of interest is described. The nucleic acid may further comprise a sequence encoding a 3′UTR, for example a comovirus 3′ UTR, or a plastocyanin 3′ UTR, and a terminator sequence, for example a NOS terminator, so that the nucleotide sequence of interest is inserted upstream from the 3′UTR.
“Expression enhancer(s)”, “enhancer sequence(s)” or “enhancer element(s)”, as referred to herein, when operatively linked to a nucleic acid of interest, for example a heterologous nucleic acid of interest, results in expression of the nucleic acid of interest. The expression enhancer may also enhance or increase expression of a downstream heterologous open reading frame (ORF) to which they are attached. The expression enhancer may be operatively linked at the 5′end of the enhancer sequence with a regulatory region that is active in a plant, and operatively linked to a nucleotide sequence of interest at the 3′end of the expression enhancer in order to drive expression of the nucleotide sequence of interest within a host, for example a plant, portion of a plant or a plant cell. Expression enhancers described herein include sequences derived from, or that share sequence similarity with, a nucleotide sequence selected from nbEPI42 (SEQ ID NO:1); nbSNS46 (SEQ ID NO:2); nbCSY65 (SEQ ID NO:3); nbHEL40 (SEQ ID NO:4); and nbSEP44 (SEQ ID NO:5).
By “operatively linked” it is meant that the particular sequences interact either directly or indirectly to carry out an intended function, such as mediation or modulation of expression of a nucleic acid sequence. The interaction of operatively linked sequences may, for example, be mediated by proteins that interact with the operatively linked sequences.
The term “5′UTR” or “5′ untranslated region” or “5′ leader sequence” refers to regions of an mRNA that are not translated. The 5′UTR typically begins at the transcription start site and ends just before the translation initiation site or start codon (usually AUG in an mRNA, ATG in a DNA sequence) of the coding region. The 5′ UTR may modulate the stability and/or translation of an mRNA transcript. If desired, the length of the 5′UTR may be modified by mutation for example substitution, deletion or insertion of the 5′UTR.
The expression enhancer may further comprise one or more “restriction site(s)” or “restriction recognition site(s)”, “multiple cloning site”, “MCS”, “cloning site(s)”, “polylinker sequence”, or “polylinker’ to facilitate the insertion of the nucleotide of interest into the plant expression system. Restrictions sites are specific sequence motifs that are recognized by restriction enzymes and are well known in the art. The expression enhancer may comprise one or more restriction sites or cloning sites that are located downstream (3′) of the 5′UTR. The polylinker sequence (multiple cloning site) may comprise any sequence of nucleic acids that are useful for adding and removing nucleic acid sequences, including a nucleotide sequence encoding a protein of interest, to the 3′ end of the 5′UTR. A polylinker sequence may comprise from 4 to about 100 nucleic acids, or any amount therebetween. As would be evident to one of skill in the art, any multiple cloning site (MCS), or an MCS of different length (either shorter or longer) may be used.
Expression systems, or vectors, to produce one or more proteins of interest in a plant using one or more than one expression enhancers as described herein are also provided. The expression systems described herein comprise an expression cassette comprising one or more than one expression enhancer, or a sequence that comprises 80-100% sequence similarity, or any amount therebetween, with the one or more than one expression enhancer. The expression cassette comprising the expression enhancer may further comprise a regulatory region that is active in a plant that is operatively linked to the 5′end of the expression enhancer. A nucleotide sequence of interest may be operatively linked to the 3′end of the expression cassette so that when introduced within a plant, portion of the plant or a plant cell, expression of the nucleotide sequence of interest within a plant is achieved.
Plants, portions of plants, plant cells, plant tissues, whole plants, inoculum, nucleic acids, constructs comprising nucleotide sequences of interest encoding proteins of interest, expression cassettes or expression systems comprising the one or more than one expression enhancer as described herein, and methods of expressing a protein of interest in plants, portions of plants, or plant cells are also provided.
The nucleotide sequence of interest may be fused (operatively linked) to the enhancer sequence comprising a plant regulatory region, using a variety of approaches. For example, which are not to be considered limiting, a nucleotide sequence of interest encoding a protein of interest may be fused to the 3′ end of an expression enhancer immediately after the 5′UTR sequence.
The constructs of the present invention can be introduced into plant cells in a stable or transient manner using Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, micro-injection, electroporation, infiltration, and the like. For reviews of such techniques see for example Weissbach and Weissbach, Methods for Plant Molecular Biology, Academy Press, New York VIII, pp. 421-463 (1988); Geierson and Corey, Plant Molecular Biology, 2d Ed. (1988); and Miki and Iyer, Fundamentals of Gene Transfer in Plants. In Plant Metabolism, 2d Ed. D T. Dennis, D H Turpin, D D Lefebrve, D B Layzell (eds), Addison-Wesley, Langmans Ltd. London, pp. 561-579 (1997). Other methods include direct DNA uptake, the use of liposomes, electroporation, for example using protoplasts, micro-injection, microprojectiles or whiskers, and vacuum infiltration. See, for example, Bilang, et al. (Gene 100: 247-250 (1991), Scheid et al. (Mol. Gen. Genet. 228: 104-112, 1991), Guerche et al. (Plant Science 52: 111-116, 1987), Neuhause et al. (Theor. Appl Genet. 75: 30-36, 1987), Klein et al., Nature 327: 70-73 (1987); Howell et al. (Science 208: 1265, 1980), Horsch et al. (Science 227: 1229-1231, 1985), DeBlock et al., Plant Physiology 91: 694-701, 1989), Methods for Plant Molecular Biology (Weissbach and Weissbach, eds., Academic Press Inc., 1988), Methods in Plant Molecular Biology (Schuler and Zielinski, eds., Academic Press Inc., 1989), Liu and Lomonossoff (J. Virol Meth, 105:343-348, 2002), U.S. Pat. Nos. 4,945,050; 5,036,006; 5,100,792; 6,403,865; 5,625,136, (all of which are hereby incorporated by reference).
Transient expression methods may be used to express the constructs of the present invention (see Liu and Lomonossoff, 2002, Journal of Virological Methods, 105:343-348; which is incorporated herein by reference). Alternatively, a vacuum-based transient expression method, as described by Kapila et al. 1997 (incorporated herein by reference) may be used. These methods may include, for example, but are not limited to, a method of Agro-inoculation or Agro-infiltration, however, other transient methods may also be used as noted above. With either Agro-inoculation or Agro-infiltration, a mixture of Agrobacteria comprising the desired nucleic acid enter the intercellular spaces of a tissue, for example the leaves, aerial portion of the plant (including stem, leaves and flower), other portion of the plant (stem, root, flower), or the whole plant. After crossing the epidermis the Agrobacterium infect and transfer t-DNA copies into the cells. The t-DNA is episomally transcribed and the mRNA translated, leading to the production of the protein of interest in infected cells, however, the passage oft-DNA inside the nucleus is transient.
If the nucleotide sequence of interest encodes a product that is directly or indirectly toxic to the plant, then by using the method of the present invention, such toxicity may be reduced throughout the plant by selectively expressing the nucleotide sequence of interest within a desired tissue or at a desired stage of plant development. In addition, the limited period of expression resulting from transient expression may reduce the effect when producing a toxic product in the plant. An inducible promoter, a tissue-specific promoter, or a cell specific promoter may be used to selectively direct expression of the sequence of interest.
U.S. Provisional Application No. 62/643,053 (filed Mar. 14, 2018; which is incorporated herein by reference) describes the identification of 15 plant-derived expression enhancer elements (UTRs) identified using polysomal analysis. Two of these UTRs (nbMT78, SEQ ID NO:7; and atHSP69, SEQ ID NO:8) are included herein for comparative purposes. The expression enhancers described herein (endogenous expression enhancers) were identified by analysing the 5′ region of UTRs that were identified using the polysomal analysis described in U.S. Provisional Application No. 62/643,053. However, the endogenous expression enhancers described herein were not identified within the group of UTRs identified as a result of the polysomal analysis (described in U.S. Provisional Application No. 62/643,053), nor were the endogenous expression enhancers described herein identified using polysomal analysis.
Examples of expression enhancers (endogenous expression enhancers) as described herein include:
The enhancer sequence may be selected from any one of SEQ ID NO's:1-5, or a nucleotide sequence that exhibits 100%, 99%, 98%, 97%, 96%, 95%, or 90%, or any amount therebetween, sequence identity to the sequence as set forth in any one of SEQ ID NO's:1-15, wherein, the expression enhancer, when operatively linked to a nucleic acid of interest, results in the expression of the nucleic acid of interest, or increases the level of expression of the nucleic acid of interest when compared to the level of expression of the same nucleic acid of interest that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6). Each of the enhancer sequences shown in SEQ ID NO's:1-5 may be modified using methods known to one of skill in the art, including deletion, insertion, and/or substitution of one or more than one nucleotide of the enhancer sequence, to produce an expression enhancer that results in a similar or increased enhancer activity, or that results in another beneficial property of the expression enhancer (see for example Diamos et. al., Frontiers in Plant Science. 2016, vol 7 pp. 1-15; Dvir S. et. al., 2013, PNAS published online Jul. 15, 2013; Leppek K et. al., 2018, Nature Reviews Mol. Cell Biol. 19:158-174; which are incorporated herein by reference). For example, a beneficial property may include improved transcriptional initiation, improved mRNA stability, improved mRNA translation, or a combination thereof.
Use of one or more than one of the above expression enhancer of SEQ ID NO's:1-5, was observed to result in the expression of the nucleic acid of interest, or result in an increased expression of a nucleic acid of interest, or a protein of interest (an increase in the yield, the activity, or both the yield and activity, of the protein of interest) as shown with reference to
With reference to
The activity of the prior art expression enhancer CPMV 160 (SEQ ID NO:6), operatively linked to a nucleic acid sequence encoding a protein of interest, is shown relative to the prior art expression enhancer, CPMV-HT, in
Therefore, the expression enhancers described herein may be used within a plant expression system comprising a regulatory region that is operatively linked with the expression enhancer sequence and a nucleotide sequence of interest.
For example, the present invention provides a method of producing a protein of interest, or increasing production of a protein of interest, for example but not limited to an influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein, in plants. The method involves introducing a nucleic acid comprising an expression enhancer as described herein operatively linked to nucleotide sequence encoding the protein of interest, for example the influenza HA protein, a modified influenza HA protein, a norovirus protein, or a modified norovirus protein, into the plant, portion of the plant, or plant cell, and expressing the protein in the plant, portion of the plant, or plant cell in a transient or stable manner. Where, the increase in expression may be determined by comparing the level of expression of the nucleotide sequence operatively linked to an expression enhancer, with the level of expression of the same nucleotide sequence that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6). Alternatively, the method may comprise providing a plant, portion of the plant, or plant cell that comprises an expression enhancer as described herein operatively linked to nucleotide sequence encoding the protein of interest, for example the influenza HA protein, modified influenza HA protein, norovirus protein, modified norovirus protein, or multimeric protein, and expressing the nucleic acid encoding the protein in the plant, portion of the plant or plant cell in a transient or stable manner.
Furthermore, the present invention provides plant matter, a plant extract, or a protein extract comprising a protein of interest, for example an influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein. The plant matter, plant extract, or protein extract may be used to induce immunity, for example, to influenza or norovirus infection in a subject. Alternatively, the protein of interest, for example influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein may be purified or partially purified, and the purified or partially purified preparation may be used to induce immunity, for example, to influenza or norovirus infection in a subject, or the protein of interest, for example the influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein may be used within a composition for inducing an immune response, and a pharmaceutically acceptable carrier, adjuvant, vehicle, or excipient.
The expression enhancers described herein may be used for the production of any protein of interest or for the production of virus like particles (VLPs). For example, with reference to
Therefore, the present invention also provides a method of producing, or increasing production of VLPs comprising an influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein, in plants. For example, the method may involve introducing a nucleic acid comprising an expression enhancer as described herein, for example, one or more than one of SEQ ID NO:1-5, operatively linked to nucleotide sequence encoding an influenza HA protein, a modified influenza HA protein, a norovirus protein, or a modified norovirus protein, into the plant, portion of the plant, or plant cell in a transient or stable manner, and expressing the protein in the plant, portion of the plant, or plant cell, in order to produce a VLP comprising the influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein. The increase in expression may be determined by comparing the level of expression of the nucleotide sequence operatively linked to an expression enhancer, with the level of expression of the same nucleotide sequence that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6). Alternatively, the method may comprise providing a plant, portion of the plant, or plant cell that comprises an expression enhancer as described herein operatively linked to nucleotide sequence encoding the influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein, and expressing the nucleic acid encoding the protein in order to produce a VLP comprising the influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein.
Furthermore, the present invention provides plant matter, a plant extract, or a protein extract comprising VLPs comprising an influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein. The plant matter, plant extract, or protein extract may be used to induce immunity to norovirus infection in a subject. Alternatively, VLPs comprising the influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein may be purified or partially purified, and the purified or partially purified preparation may be used to induce immunity to a norovirus infection in a subject, or the VLPs comprising influenza HA protein, modified influenza HA protein, norovirus protein, or modified norovirus protein may be used within a composition for inducing an immune response, and a pharmaceutically acceptable carrier, adjuvant, vehicle, or excipient.
The expression enhancers described herein may also be used for the production of a multimeric protein of interest, for example an antibody. As shown with reference to
Therefore, the present invention also provides a method of producing, or increasing production of a multimeric protein in plants. For example, the method may involve introducing, in a transient or stable manner, a first nucleic acid comprising an expression enhancer as described herein operatively linked to nucleotide sequence encoding a first protein component, and a second nucleic acid comprising an expression enhancer as described herein operatively linked to nucleotide sequence encoding a second protein component, into the plant, portion of the plant, or plant cell, and co-expressing the first and second nucleic acids in the plant, portion of the plant, or plant cell, in order to produce the multimeric protein. The increase in expression may be determined by comparing the level of expression of the first and second nucleic acid, each operatively linked to an expression enhancer(s), with the level of expression of the same first and second nucleic acids, each of which are not operatively linked to the expression enhancer(s), or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6). Alternatively, the method may comprise providing a plant, portion of the plant, or plant cell that comprises the first nucleic acid comprising an expression enhancer as described herein operatively linked to nucleotide sequence encoding a first protein component, and a second nucleic acid comprising an expression enhancer as described herein operatively linked to nucleotide sequence encoding a second protein component and co-expressing the first and second nucleic acid sequences to produce the multimeric protein.
Furthermore, the present invention provides plant matter, a plant extract, or a protein extract comprising the multimeric protein, or the multimeric protein may be purified or partially purified.
As described herein, there is provided a nucleic acid construct comprising an expression enhancer sequence operatively linked to a nucleotide sequence of interest encoding a protein of interest. Also provided are plant expression systems and vectors, comprising the construct or one or more than one nucleic acid comprising an enhancer sequence as described herein. Also provided is a plant expression system, a vector, a construct, or a nucleic acid, comprising a plant regulatory region, in operative association with an enhancer sequence that is operatively linked to a nucleotide sequence of interest, the nucleotide sequence of interest encoding a protein of interest. The enhancer sequence may be selected from any one of SEQ ID NO's:1, 5, or a nucleotide sequence that exhibits 100%, 99%, 98%, 97%, 96%, 95%, or 90%, or any amount therebetween, sequence identity to the sequence as set forth in any one of SEQ ID NO's:1-5, wherein, the expression enhancer, when operatively linked to a nucleic acid of interest, results in the expression of the nucleic acid of interest, or increases the level of expression of the nucleic acid of interest when compared to the level of expression of the same nucleic acid of interest that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6).
The enhancer sequence of any one of SEQ ID NO's:1-5 may be modified using methods known to one of skill in the art, including deletion, insertion, and/or substitution of one or more than one nucleotide of the enhancer sequence, to produce an expression enhancer that results in a similar or increased enhancer activity, or that results in another beneficial property of the expression enhancer. For example, a beneficial property may include improved transcriptional initiation, improved mRNA stability, improved mRNA translation, or a combination thereof.
The enhancer sequence of the present invention may be used to express a protein of interest in a host organism for example a plant. In this case, the protein of interest may also be heterologous to the host organism in question and introduced into the plant cells using transformation techniques know in the art. A heterologous gene in an organism may replace an endogenous equivalent gene, i.e. one which normally performs the same or a similar function, or the inserted sequence may be additional to the endogenous gene or other sequence.
The invention further provides an expression cassette comprising in series, a promoter or plant regulatory region, operatively linked to an expression enhancer sequence as described herein which is fused with a nucleotide sequence of interest, a 3′UTR sequence, and a terminator sequence. The enhancer sequence may be defined by, any one of SEQ ID NO's:1-5, or a nucleotide sequence that exhibits 100%, 99%, 98%, 97%, 96%, 95%, or 90%, sequence identity, or any amount therebetween, to the sequence as set forth in any one of SEQ ID NO's:1-5. The enhancer sequence may also be modified using techniques known to one of skill in the art, provided that the enhancer sequence results in the expression of the nucleic acid of interest, or increases the level of expression of the nucleotide sequence of interest, for example, determined by comparing the level of expression of the nucleotide sequence operatively linked to an expression enhancer, with the level of expression of the same nucleotide sequence that is not operatively linked to the expression enhancer, or for example, when operatively linked to the prior art expression enhancer CPMV 160 (SEQ ID NO:6).
The sequences described in the present application are listed in Table 1.
The present invention will be further illustrated in the following examples.
The following constructs comprising the enhancers identified above were prepared as follows:
A sequence encoding Dasher fluorescent protein (Atum, Cat #FPB-27-609) fused to the nbMT78 5′UTR was cloned into 2X35S promoter+CPMV 3′UTR/NOS expression system using the following PCR-based method. In a first round of PCR, a fragment containing the Dasher fluorescent protein was amplified using primers nbMT78_Dasher.c (SEQ ID NO:15) and IF-Dasher (27-609).r (SEQ ID NO: 10) using Dasher gene sequence (SEQ ID NO: 11;
Primers, templates as well as nucleic acid and protein sequences for all constructs described herein, are presented in Table 2.
For influenza H1 construct, the same cloning approach described above for the Dasher fluorescent protein was used.
For Influenza H3 constructs, the cloning vector used, integrates an influenza M2 ion channel gene under the control of Alfalfa Plastocyanin promoter and terminator in addition to the 2X35S promoter+CPMV 3′UTR/NOS-based expression cassette. Plasmid number 4160 (SEQ ID NO: 64;
For Norovirus VP1 and Rituximab HC and LC constructs, the cloning vector integrates a Matrix attachment region (MAR) regulatory element from the tobacco RB7 gene after the NOS terminator in addition to the 2X35S promoter+CPMV 3′UTR/NOS-based expression cassette. Plasmid number 4170 (SEQ ID NO: 65;
Agrobacterium tumefaciens Transfection
Agrobacterium tumefaciens strain AGL1 was transfected (transformed) by electroporation with the different expression vectors using the methods described by D'Aoust et al., 2008 (Plant Biotech. 1 6:930-40). Transfected Agrobacterium were grown in LB medium supplemented with 10 mM 2-(N-morpholino)ethanesulfonic acid (MES) and 50 μg/ml kanamycin pH5.6 to an OD600 between 0.6 and 1.6 and frozen in 100 μl aliquots.
N. benthamiana plants were grown from seeds in flats filled with a commercial peat moss substrate. The plants were allowed to grow in the greenhouse under a 16/8 photoperiod and a temperature regime of 25° C. day/20° C. night. Three weeks after seeding, individual plantlets were picked out, transplanted in pots and left to grow in the greenhouse for three additional weeks under the same environmental conditions
Agrobacteria transfected (transformed) with each expression vector were grown in a LB medium supplemented with 10 mM 2-(N-morpholino)ethanesulfonic acid (MES) and 50 μg/ml kanamycin pH5.6 until they reached an OD600 between 0.6 and 1.6. Agrobacterium suspensions were centrifuged before use and resuspended in infiltration medium (10 mM MgCl2 and 10 mM MES pH 5.6) and stored overnight at 4° C. On the day of infiltration, culture batches were diluted in 2.5 culture volumes and allowed to warm before use. Whole plants of N. benthamiana were placed upside down in the bacterial suspension in an air-tight stainless steel tank under a vacuum of 20-40 Torr for 2-min. Plants were returned to the greenhouse for a 6 or 9 day incubation period until harvest.
Following incubation, the aerial part of plants was harvested, frozen at −80° C. and crushed into pieces. Total soluble proteins were extracted by homogenizing (Polytron) each sample of frozen-crushed plant material in 2 volumes of cold 50 mM Tris buffer pH 8.0+500 mM NaCl, 0.4 μg/ml Metabisulfite and 1 mM phenylmethanesulfonyl fluoride. After homogenization, the slurries were centrifuged at 10,000 g for 10 min at 4° C. and these clarified crude extracts (supernatant) kept for analyses.
The total protein content of clarified crude extracts was determined by the Bradford assay (Bio-Rad, Hercules, Calif.) using bovine serum albumin as the reference standard. Proteins were separated by SDS-PAGE under reducing conditions using Criterion™ TGX Stain-Free™ precast gels (Bio-Rad Laboratories, Hercules, Calif.). Proteins were visualized by staining the gels with Coomassie Brilliant Blue. Alternatively, proteins were visualized with Gel Doc™ EZ imaging system (Bio-Rad Laboratories, Hercules, Calif.) and electrotransferred onto polyvinylene difluoride (PVDF) membranes (Roche Diagnostics Corporation, Indianapolis, Ind.) for immunodetection. Prior to immunoblotting, the membranes were blocked with 5% skim milk and 0.1% Tween-20 in Tris-buffered saline (TBS-T) for 16-18 h at 4° C.
Dasher Expression as Determined by Direct Fluorescence in Crude Extract
Dasher expression was quantified by direct measure of fluorescence in crude extracts. Frozen biomass was extracted using 50 mM Tris+150 mM NaCl pH 7.4 extraction buffer by mechanical extraction and centrifuged 10 minutes at 10000 g at 4° C. to remove insoluble debris. Clarified crude extracts were diluted 1/16, 1/48 and 1/144 in PBS and fluorescence was measured using a Fluoroskan (Ascent) instrument using 485 nm as excitation filter and 518 nm as emission filter.
Hemagglutination assay was based on a method described by Nayak and Reichl (2004, J. Virol. Methods 122:9-15). Serial double dilutions of the test samples (100 μL) were made in V-bottomed 96-well microtiter plates containing 100 μL PBS, leaving 100 μL of diluted sample per well. One hundred microliters of a 0.25% turkey red blood cells suspension (Bio Link Inc., Syracuse, N.Y.; for all B strains, H1, H5 and H7) or 0.5% guinea pig red blood cells suspension (for H3) were added to each well, and plates were incubated for 2h at room temperature. The reciprocal of the highest dilution showing complete hemagglutination was recorded as HA activity.
For Rituximab expression analysis, crude protein extracts (2 g biomass/EU) were produced from leaves by mechanical extraction in 150 mM Tris, pH 7.4 buffer with 150 mM NaCl and the extracts were electrophoresed on SDS-PAGE under non-reducing conditions for in-gel densitometry quantification of the band corresponding to the fully assembled H2L2 form of the antibody. Protein electrophoresis was performed in Stain-Free gels from Bio-Rad and gel imaging system was carried-out using Gel Doc XR+ system, including the Image Lab software for image analyses and in-gel quantification.
Proteins were extracted from frozen biomass by mechanical extraction in a blender with 2 volumes of extraction buffer (100 mM phosphate buffer pH 7.2+150 mM NaCl). The slurry was filtered through a large pore nylon filter to remove large debris and centrifuged 5000 g for 5 min at 4° C. The supernatant was collected and centrifuged again at 5000 g for 30 min (4° C.) to remove additional debris. The supernatant is then loaded on a discontinuous iodixanol density gradient. Analytical density gradient centrifugation was performed as follows: 38 ml tubes containing discontinuous iodixanol density gradient in acetate buffer (1 ml at 45%, 2 ml at 35%, 2 ml at 33%, 2 ml at 31%, 2 ml at 29% and 5 ml at 25% of iodixanol) were prepared and overlaid with 25 ml of the extracts containing the virus-like particles. The gradients were centrifuged at 175 000 g for 4 hours (4° C.). After centrifugation, 1 ml fractions were collected from the bottom to the top and fractions were analyzed by SDS-PAGE combined with protein staining or Western blot.
N. benthamiana leaves were, vacuum infiltrated, as described in Example 2, with Agrobacterium tumefaciens comprising expression vectors encoding the protein of interest operatively linked to the defined expression enhancer, to permit expression of the protein of interest, and the leaves examined for the protein of interest production. After 9 days post infiltration (DPI), total crude protein extracts were prepared from leaf homogenates, and hemagglutinin titer was determined as described above.
With reference to
With reference to
These results demonstrate the expression enhancer sequences described herein may be used for the expression of a protein of interest that is operatively linked to the expression enhancer, in a plant, portion of plant, or a plant cell.
N. benthamiana leaves were vacuum infiltrated, as described in Example 2, with Agrobacterium tumefaciens comprising expression vectors encoding norovirus VP1 from GII.4 genotype, and the leaves examined for VLP production. After 9 days post infiltration (DPI), total crude protein extracts were separated by SDS-PAGE, and stained with Coomassie (VP1 production), or separated using discontinuous iodixanol density gradients as described in Example 2, above (VLP production). Fractions from the density gradients were examined using Coomassie-stained SDS-PAGE. Norovirus VP1 proteins appear at an approximate 55-60 kDa band. The occurrence of the VP1 protein within a fraction of the density gradients is indicative of the fraction(s) to which the VLPs equilibrate during density gradient centrifugation. The yield of VLPs obtained from peak fractions after density gradient centrifugation was also determined.
As shown in
The expression enhancers described herein may also be used for the production of a multimeric protein of interest, for example an antibody. N. benthamiana leaves were, vacuum infiltrated, as described in Example 2, with Agrobacterium tumefaciens comprising expression vectors encoding the protein of interest operatively linked to the defined expression enhancer, to permit expression of the protein of interest, and the leaves examined for the protein of interest production. After 9 days post infiltration (DPI), total crude protein extracts were prepared from leaf homogenates were separated by SDS-PAGE and expression levels were determined by in-gel densitometry of complete IgG, as described above.
As shown with reference to
These results demonstrate that the expression enhancers described herein may be used to produce multimeric proteins, such as an antibody, and that the same or different the expression enhancers as described herein may be operatively linked to each of the nucleic acid sequences used to encode the components of the multimeric protein.
All citations are hereby incorporated by reference.
The present invention has been described with regard to one or more embodiments. However, it will be apparent to persons skilled in the art that a number of variations and modifications can be made without departing from the scope of the invention as defined in the claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2019/050319 | 3/14/2019 | WO | 00 |