The present disclosure relates to a fusion protein, a method for producing a substance, a vector, a transgenic cell, a method for producing a pneumatic tire, and a method for producing a rubber product.
Nowadays, natural rubber (a type of polyisoprenoid) for use in industrial rubber products may be obtained by cultivating rubber-producing plants, such as para rubber trees (Hevea brasiliensis) belonging to the family Euphorbiaceae or Indian rubber trees (Ficus elastica) belonging to the family Moraceae, to cause the laticifer cells of the plants to biosynthesize natural rubber, and then manually harvesting the natural rubber from the plants.
At present, Hevea brasiliensis is virtually the only source of natural rubber used in industrial rubber products. Hevea brasiliensis is a plant that can grow only in limited regions, including Southeast Asia and South America. Moreover, Hevea brasiliensis trees take about seven years from planting to grow mature enough to yield rubber, and they yield natural rubber only for a period of 20 to 30 years. Demand for natural rubber is expected to increase in years to come, especially in developing countries, but for the reasons discussed above it is difficult to greatly increase natural rubber production from Hevea brasiliensis. Therefore, there has been a concern that natural rubber resources might dry up, and it has been desirable to provide stable natural rubber sources other than mature Hevea brasiliensis trees and to improve productivity of natural rubber from Hevea brasiliensis.
Natural rubber has a cis-1,4-polyisoprene structure formed of isopentenyl diphosphate (IPP) as an elementary unit. This structure suggests that cis-prenyltransferase (CPT) may be involved in natural rubber biosynthesis. For example, several CPTs have been found in Hevea brasiliensis, including Hevea rubber transferase 1 (HRT1) and Hevea rubber transferase 2 (HRT2) (see, for example, Non-Patent Literatures 1 and 2). It is also known that rubber synthesis may be reduced in a dandelion species, Taraxacum brevicorniculatum, by suppressing the CPT expression (see, for example, Non-Patent Literature 3).
Non-Patent Literature 1: Rahaman et al., BMC Genomics, 2013, vol. 14
Non-Patent Literature 2: Asawatreratanakul et al., European Journal of Biochemistry, 2003, vol. 270, 4671-4680
Non-Patent Literature 3: Post et al., Plant Physiology, 2012, vol. 158, 1406-1417
Non-Patent Literature 4: Karine et al., Biochimica et Biophysica Acta 1838 (2014), 287-299
As discussed above, there have been needs to develop stable natural rubber sources other than mature Hevea brasiliensis trees and to improve productivity of natural rubber from Hevea brasiliensis.
As a solution to these problems, the present inventors discovered that a polyisoprenoid (natural rubber) may be accumulated in rubber particles by binding a cis-prenyltransferase (CPT) family protein, which is a rubber synthase, to rubber particles, and causing an enzymatic reaction (see, for example, WO 2017/002818).
As a result of extensive research and experimentation, the present inventors have found that the above approach is disadvantageous in that the use of rubber particles limits the range of applications because when considering rubber synthesis in cells, limited cells (plants) can produce rubber particles.
Then, the present inventors have conducted extensive research and experimentation to arrive at the use of lipid droplets present in all plants, instead of rubber particles possessed only by special plants.
As shown in
Rubber particles and lipid droplets are similar in that: the membranes have a single lipid membrane structure; hydrophobic substances are stored inside; and multiple proteins exist on the membranes. Thus, the inventors expected that any enzyme capable of binding to rubber particles could also bind to lipid droplets. Actually, however, it was found that enzymes capable of binding to rubber particles do not bind to lipid droplets.
The present disclosure aims to solve the new problem found by the present inventors and provide a fusion protein capable of binding to lipid droplets while having an enzymatic activity to synthesize a hydrophobic compound, a method for producing a substance including accumulating a hydrophobic compound in lipid droplets using the fusion protein, a vector which can enhance production of a hydrophobic compound when it is introduced into cells using genetic recombination techniques, and a transgenic cell into which the vector or a gene coding for the fusion protein has been introduced.
The present inventors have investigated why enzymes capable of binding to rubber particles, such as rubber synthases, do not bind to lipid droplets. As a result, they have arrived at a hypothesis that the reason is the difference in the type of lipid forming the membrane between rubber particles and lipid droplets, or in other words that enzymes such as rubber synthases hardly bind to lipid droplets due to the difference in the type of lipid forming the membrane between rubber particles and lipid droplets.
It is reported that the binding ability of rubber elongation factors (REFs) capable of binding to rubber particles depends on the type of lipid (Non-Patent Literature 4).
Thus, an attempt was made to fuse an amino acid sequence having an enzymatic activity to synthesize a hydrophobic compound with an amino acid sequence capable of binding to lipid droplets as an anchor (for example, an amino acid sequence derived from a protein originally present in lipid droplets) in order to enhance the lipid droplet-binding ability of an enzyme that has an enzymatic activity to synthesize a hydrophobic compound and that itself is not inherently capable of binding to lipid droplets, such as a rubber synthase.
The inventors revealed that binding to lipid droplets in some cases failed depending on the lipid droplet-binding protein used. They have investigated the cause of the failure and then found that lipid droplet-binding proteins are classified into class I proteins which are originally bound to the membrane upon the formation of lipid droplets and class II proteins which bind to the formed lipid droplets, and that in order to bind a protein to the formed lipid droplets, it is necessary to use a class II protein because class I proteins cannot allow the protein to bind to the formed lipid droplets.
Then, they have found that fusion proteins having a structure as shown in
Specifically, the present disclosure relates to a fusion protein, having: an amino acid sequence (first amino acid sequence) capable of binding to lipid droplets; and an amino acid sequence (second amino acid sequence) having an enzymatic activity to synthesize a hydrophobic compound, with the enzymatic activity of the second amino acid sequence being maintained.
Preferably, the first amino acid sequence is an amino acid sequence derived from a protein capable of binding to lipid droplets which is a class II protein.
Preferably, the second amino acid sequence is an amino acid sequence derived from an enzyme that has an enzymatic activity to synthesize a hydrophobic compound and that itself is not capable of binding to lipid droplets.
Preferably, the first amino acid sequence is an amino acid sequence derived from a lipid droplet-associated protein (LDAP)/small rubber particle protein (SRPP) family protein.
Preferably, the first amino acid sequence is an amino acid sequence derived from a LDAP/SRPP family protein of plant origin.
Preferably, the first amino acid sequence is an amino acid sequence derived from a LDAP/SRPP family protein derived from at least one selected from the group consisting of plants of the genera Persea, Hevea, and Taraxacum.
Preferably, the first amino acid sequence is an amino acid sequence derived from a LDAP/SRPP family protein derived from at least one plant selected from the group consisting of Persea americana, Hevea brasiliensis, and Taraxacum kok-saghyz.
Preferably, the second amino acid sequence is an amino acid sequence derived from a prenyltransferase family protein.
Preferably, the second amino acid sequence is an amino acid sequence derived from a cis-prenyltransferase family protein.
Preferably, the second amino acid sequence is an amino acid sequence derived from a cis-prenyltransferase family protein derived from a plant of the genus Hevea or Taraxacum.
Preferably, the number of amino acids between the first amino acid sequence and the second amino acid sequence is three or less.
Preferably, the number of amino acids between the first amino acid sequence and the second amino acid sequence is two or less.
Preferably, the first amino acid sequence is directly bound to the second amino acid sequence.
The present disclosure also relates to a method for producing a substance, the method including
binding the above fusion protein to lipid droplets, and
accumulating a product in the lipid droplets by the enzymatic activity of the second amino acid sequence.
The present disclosure also relates to a vector into which a gene coding for the above fusion protein has been introduced.
The present disclosure also relates to a transgenic cell into which a gene coding for the above fusion protein has been introduced.
The present disclosure also relates to a method for producing a substance, the method including
using the above transgenic cell to accumulate a product in lipid droplets in the cell by the enzymatic activity of the second amino acid sequence.
The present disclosure also relates to a method for producing a pneumatic tire, the method including:
producing a polyisoprenoid by the above method for producing a substance;
kneading the polyisoprenoid with an additive to obtain a kneaded mixture;
forming a green tire from the kneaded mixture; and
vulcanizing the green tire.
The present disclosure also relates to a method for producing a rubber product, the method including:
producing a polyisoprenoid by the above method for producing a substance;
kneading the polyisoprenoid with an additive to obtain a kneaded mixture;
forming a raw rubber product from the kneaded mixture; and
vulcanizing the raw rubber product.
The fusion protein according to the present disclosure has an amino acid sequence (first amino acid sequence) capable of binding to lipid droplets, and an amino acid sequence (second amino acid sequence) having an enzymatic activity to synthesize a hydrophobic compound, with the enzymatic activity of the second amino acid sequence being maintained. This fusion protein is capable of binding to lipid droplets while having an enzymatic activity to synthesize a hydrophobic compound.
Thus, when the fusion protein is used with lipid droplets, the fusion protein binds to the lipid droplets, and the hydrophobic compound synthesized by the fusion protein can be accumulated in the lipid droplets.
Moreover, the fusion protein which can be suitably used in cells containing lipid droplets can be used in a wide range of cells to enhance production of the hydrophobic compound in the wide range of cells because lipid droplets exist in both prokaryotes and eukaryotes.
Moreover, a vector into which a gene coding for the fusion protein has been introduced can enhance production of the hydrophobic compound when it is introduced into cells using genetic recombination techniques. In a transgenic cell into which the vector or a gene coding for the fusion protein has been introduced, the production of the hydrophobic compound can be enhanced.
The method for producing a pneumatic tire of the present disclosure includes producing a polyisoprenoid by the method for producing a substance of the present disclosure, kneading the polyisoprenoid with an additive to obtain a kneaded mixture, forming a green tire from the kneaded mixture, and vulcanizing the green tire. This method produces a pneumatic tire from a polyisoprenoid produced by a method that produces a polyisoprenoid with high productivity. Thus, plant resources can be effectively used, and environmentally friendly pneumatic tires can be produced.
The method for producing a rubber product of the present disclosure includes producing a polyisoprenoid by the method for producing a substance of the present disclosure, kneading the polyisoprenoid with an additive to obtain a kneaded mixture, forming a raw rubber product from the kneaded mixture, and vulcanizing the raw rubber product. This method produces a rubber product from a polyisoprenoid produced by a method that produces a polyisoprenoid with high productivity. Thus, plant resources can be effectively used, and environmentally friendly rubber products can be produced.
The fusion protein of the present disclosure has an amino acid sequence (first amino acid sequence) capable of binding to lipid droplets, and an amino acid sequence (second amino acid sequence) having an enzymatic activity to synthesize a hydrophobic compound, with the enzymatic activity of the second amino acid sequence being maintained.
As described earlier, the fusion protein of the present disclosure has a structure as shown in
Here, the fusion protein of the present disclosure may have the first amino acid sequence and the second amino acid sequence in the stated order from the N-terminal side, or may have the first amino acid sequence and the second amino acid sequence in the stated order from the C-terminal side, as shown in
The first amino acid sequence and the second amino acid sequence are now described. Firstly, the second amino acid sequence that has an enzymatic activity is described.
The second amino acid sequence is an amino acid sequence having an enzymatic activity to synthesize a hydrophobic compound.
Although the amino acid sequence having an enzymatic activity to synthesize a hydrophobic compound may be any amino acid sequence that has an enzymatic activity to synthesize a hydrophobic compound, it is preferably an amino acid sequence derived from an enzyme having an enzymatic activity to synthesize a hydrophobic compound, more preferably an amino acid sequence derived from an enzyme that has an enzymatic activity to synthesize a hydrophobic compound and that itself is not capable of binding to lipid droplets. Here, since it is sufficient that the amino acid sequence derived from an enzyme having an enzymatic activity to synthesize a hydrophobic compound has such enzymatic activity, it may be the entire amino acid sequence of the enzyme having an enzymatic activity to synthesize a hydrophobic compound or a part of the amino acid sequence, e.g., from which a transport signal sequence has been removed.
The hydrophobic compound may be any hydrophobic compound that can be stored in lipid droplets. Examples include isoprenoid compounds, fatty acids, fat-soluble vitamins, and hydrophobic polymers (excluding isoprenoid compounds). Isoprenoid compounds are preferred among these, with polyisoprenoids (natural rubber) being more preferred.
Non-limiting examples of enzymes corresponding to the above-mentioned hydrophobic compounds (enzymes having an enzymatic activity to synthesize the hydrophobic compounds) include rubber synthases, lycopene cyclases, and polyhydroxyalkanoate (PHA) synthases. Rubber synthases are preferred among these.
Rubber synthases themselves are usually not capable of binding to lipid droplets and thus correspond to enzymes which have an enzymatic activity to synthesize a hydrophobic compound and which themselves are not capable of binding to lipid droplets.
Preferred examples of rubber synthases are prenyltransferase family proteins. Prenyltransferase family proteins include cis-prenyltransferase family proteins and trans-prenyltransferase family proteins, with cis-prenyltransferase family proteins being preferred. In other words, the second amino acid sequence is preferably an amino acid sequence derived from a prenyltransferase family protein, more preferably an amino acid sequence derived from a cis-prenyltransferase family protein.
The gene coding for the prenyltransferase family protein or the gene coding for the cis-prenyltransferase (CPT) family protein may be of any origin and may be of microbial, animal, or plant origin, preferably of plant origin. More preferably, it is derived from at least one selected from the group consisting of plants of the genera Hevea, Sonchus, Taraxacum, and Parthenium, still more preferably from a plant of the genus Hevea or Taraxacum, particularly preferably Hevea brasiliensis or Taraxacum kok-saghyz, most preferably Hevea brasiliensis.
Non-limiting examples of the plants include plants of the genus Hevea such as Hevea brasiliensis; plants of the genus Sonchus such as Sonchus oleraceus, Sonchus asper, and Sonchus brachyotus; plants of the genus Solidago such as Solidago altissima, Solidago virgaurea subsp. asiatica, Solidago virgaurea subsp. leipcarpa, Solidago virgaurea subsp. leipcarpa f. paludosa, Solidago virgaurea subsp. gigantea, and Solidago gigantea Ait. var. leiophylla Fernald; plants of the genus Helianthus such as Helianthus annuus, Helianthus argophyllus, Helianthus atrorubens, Helianthus debilis, Helianthus decapetalus, and Helianthus giganteus; plants of the genus Taraxacum such as dandelion (Taraxacum), Taraxacum venustum H. Koidz, Taraxacum hondoense Nakai, Taraxacum platycarpum Dahlst, Taraxacum japonicum, Taraxacum officinale Weber, Taraxacum kok-saghyz, and Taraxacum brevicorniculatum; plants of the genus Ficus such as Ficus carica, Ficus elastica, Ficus pumila L., Ficus erecta Thumb., Ficus ampelas Burm. f., Ficus benguetensis Merr., Ficus irisana Elm., Ficus microcarpa L. f., Ficus septica Burm. f., and Ficus benghalensis; plants of the genus Parthenium such as Parthenium argentatum, Parthenium hysterophorus, and annual ragweed (Ambrosia artemisiifolia); lettuce (Lactuca sativa), Ficus benghalensis, and Arabidopsis thaliana.
As used herein, the term “prenyltransferase (CPT) family protein” refers to an enzyme that catalyzes a reaction of chain elongation of an isoprenoid compound.
Also as used herein, the term “trans-prenyltransferase (CPT) family protein” refers to an enzyme that catalyzes a reaction of trans-chain elongation of an isoprenoid compound.
Also as used herein, the term “cis-prenyltransferase (CPT) family protein” refers to an enzyme that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
Specifically, in plants, for example, a polyisoprenoid is biosynthesized via a polyisoprenoid biosynthesis pathway as shown in
As used herein, the term “isoprenoid compound” refers to a compound containing an isoprene unit (C5H9). Also, the term “cis-isoprenoid” refers to a compound including an isoprenoid compound in which isoprene units are cis-bonded, and examples include cis-farnesyl diphosphate, undecaprenyl diphosphate, and natural rubber.
Here,
The multiple sequence alignment can be performed as described in WO 2017/002818.
Thus, the CPT family protein preferably has an aspartic acid residue at position 41 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at a corresponding position, a glycine residue at position 42 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at a corresponding position, an arginine residue at position 45 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at a corresponding position, and an asparagine residue at position 89 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at a corresponding position. As described above, the CPT family protein having such a sequence is considered to have the functions of CPT family proteins and can function as an enzyme that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
More preferably, the CPT family protein has, at positions 41 to 49 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at corresponding positions, the following amino acid sequence (A):
DGNX1RX2AKK (A)
wherein X1 and X2 are the same as or different from each other and each represent any amino acid residue, or an amino acid sequence that is identical to the amino acid sequence (A) regarding at least five out of the seven amino acid residues other than X, and X2. Still more preferably, in the amino acid sequence (A), X1 represents H, G, or R, and X2 represents W, F, or Y.
The amino acid sequence that is identical to the amino acid sequence (A) regarding at least five out of the seven amino acid residues other than X1 and X2 is more preferably an amino acid sequence that is identical regarding at least six out of the seven amino acid residues other than X1 and X2.
Also more preferably, the CPT family protein has, at positions 81 to 97 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at corresponding positions, the following amino acid sequence (B):
TX11X12AFSX13X14NX15X16RX17X18X19EV (B)
wherein X11 to X19 are the same as or different from each other and each represent any amino acid residue, or an amino acid sequence that is identical to the amino acid sequence (B) regarding at least five out of the eight amino acid residues other than Xu to X1.
Still more preferably, in the amino acid sequence (B), X11 represents L, V, A, or I; X12 represents Y, F, or H; X13 represents S, T, I, M, or L; X14 represents E, D, or H; X15 represents W or F; X16 represents N, S, K, G, or R; X17 represents P, S, H, G, R, K, or Q; X18 represents A, K, S, or P; and X1 represents Q, D, R, I, E, H, or S.
The amino acid sequence that is identical to the amino acid sequence (B) regarding at least five out of the eight amino acid residues other than X11 to X19 is more preferably an amino acid sequence that is identical regarding at least six, still more preferably at least seven, out of the eight amino acid residues other than Xu, to X19.
Further, the CPT family protein particularly preferably has, at positions 41 to 49 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at corresponding positions, an amino acid sequence that is identical to the amino acid sequence at positions 41 to 49 (DGNRRFAKK) of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 regarding at least six out of the nine amino acid residues. The amino acid sequence is more preferably identical regarding at least seven, still more preferably at least eight, out of the nine amino acid residues.
Further, the CPT family protein particularly preferably has, at positions 81 to 97 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 or at corresponding positions, an amino acid sequence that is identical to the amino acid sequence at positions 81 to 97 (TIYAFSIDNFRRKPHEV) of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 regarding at least 14 out of the 17 amino acid residues. The amino acid sequence is more preferably identical regarding at least 15, still more preferably at least 16, out of the 17 amino acid residues.
Specifically, the conserved region corresponding to positions 41 to 49 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
positions 25 to 33 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
positions 29 to 37 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
positions 75 to 83 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
positions 79 to 87 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
positions 43 to 51 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
positions 42 to 50 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
positions 41 to 49 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
positions 41 to 49 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
positions 42 to 50 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
positions 41 to 49 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
positions 58 to 66 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
positions 58 to 66 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
positions 34 to 42 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
positions 34 to 42 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Moreover, the conserved region corresponding to positions 81 to 97 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
positions 65 to 81 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
positions 69 to 85 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
positions 115 to 131 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
positions 119 to 135 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
positions 84 to 100 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
positions 82 to 98 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
positions 81 to 97 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
positions 81 to 97 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
positions 82 to 98 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
positions 81 to 97 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
positions 98 to 114 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
positions 98 to 114 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
positions 74 to 90 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
positions 74 to 90 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Moreover, the aspartic acid residue corresponding to position 41 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
the aspartic acid residue at position 25 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
the aspartic acid residue at position 29 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
the aspartic acid residue at position 75 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
the aspartic acid residue at position 79 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
the aspartic acid residue at position 43 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
the aspartic acid residue at position 42 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
the aspartic acid residue at position 41 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
the aspartic acid residue at position 41 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
the aspartic acid residue at position 42 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
the aspartic acid residue at position 41 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
the aspartic acid residue at position 58 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
the aspartic acid residue at position 58 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
the aspartic acid residue at position 34 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
the aspartic acid residue at position 34 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Moreover, the glycine residue corresponding to position 42 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
the glycine residue at position 26 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
the glycine residue at position 30 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
the glycine residue at position 76 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
the glycine residue at position 80 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
the glycine residue at position 44 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
the glycine residue at position 43 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
the glycine residue at position 42 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
the glycine residue at position 42 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
the glycine residue at position 43 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
the glycine residue at position 42 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
the glycine residue at position 59 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
the glycine residue at position 59 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
the glycine residue at position 35 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
the glycine residue at position 35 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Moreover, the arginine residue corresponding to position 45 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
the arginine residue at position 29 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
the arginine residue at position 33 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
the arginine residue at position 79 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
the arginine residue at position 83 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
the arginine residue at position 47 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
the arginine residue at position 46 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
the arginine residue at position 45 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
the arginine residue at position 45 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
the arginine residue at position 46 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
the arginine residue at position 45 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
the arginine residue at position 62 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
the arginine residue at position 62 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
the arginine residue at position 38 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
the arginine residue at position 38 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Moreover, the asparagine residue corresponding to position 89 of HRT1 from Hevea brasiliensis represented by SEQ ID NO:2 corresponds to, for example:
the asparagine residue at position 73 of undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli represented by SEQ ID NO:45 disclosed in WO 2017/002818;
the asparagine residue at position 77 of undecaprenyl diphosphate synthase (UPS) from Micrococcus represented by SEQ ID NO:46 disclosed in WO 2017/002818;
the asparagine residue at position 123 of SRT1 from yeast represented by SEQ ID NO:47 disclosed in WO 2017/002818;
the asparagine residue at position 127 of AtCPT5 from Arabidopsis thaliana represented by SEQ ID NO:44 disclosed in WO 2017/002818;
the asparagine residue at position 92 of AtCPT8 from Arabidopsis thaliana represented by SEQ ID NO:22 disclosed in WO 2017/002818;
the asparagine residue at position 90 of DDPS from tobacco represented by SEQ ID NO:48 disclosed in WO 2017/002818;
the asparagine residue at position 89 of HRT2 from Hevea brasiliensis represented by SEQ ID NO:32 disclosed in WO 2017/002818;
the asparagine residue at position 89 of CPT3 from Hevea brasiliensis represented by SEQ ID NO:36 disclosed in WO 2017/002818;
the asparagine residue at position 90 of CPT4 from Hevea brasiliensis represented by SEQ ID NO:37 disclosed in WO 2017/002818;
the asparagine residue at position 89 of CPT5 from Hevea brasiliensis represented by SEQ ID NO:41 disclosed in WO 2017/002818;
the asparagine residue at position 106 of LsCPT3 from lettuce represented by SEQ ID NO:14 disclosed in WO 2017/002818;
the asparagine residue at position 106 of TbCPT1 from Taraxacum brevicorniculatum represented by SEQ ID NO:43 disclosed in WO 2017/002818;
the asparagine residue at position 82 of DDPS from mouse represented by SEQ ID NO:49 disclosed in WO 2017/002818; and
the asparagine residue at position 82 of HDS from human represented by SEQ ID NO:50 disclosed in WO 2017/002818.
Examples of the CPT family protein include CPT from Hevea brasiliensis (HRT1, HRT2, CPT3 to CPT5), AtCPT1 to AtCPT9 from Arabidopsis thaliana, CPT1 to CPT3 from lettuce, CPT1 to CPT3 from Taraxacum brevicorniculatum, undecaprenyl pyrophosphate synthase (UPPS) from Escherichia coli, undecaprenyl diphosphate synthase (UPS) from Micrococcus, SRT1 from yeast, DDPS from tobacco, DDPS from mouse, and HDS from human.
Specific examples of the CPT family protein include the following protein [11]:
[11] a protein having the amino acid sequence represented by SEQ ID NO:2.
Moreover, it is known that proteins having one or more amino acid substitutions, deletions, insertions, or additions relative to the original amino acid sequence can have the inherent function. Thus, another specific example of the CPT family protein is the following protein [12]:
[12] a protein having an amino acid sequence containing one or more amino acid substitutions, deletions, insertions, and/or additions relative to the amino acid sequence represented by SEQ ID NO:2, and having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
To maintain the function of the CPT family protein, the protein preferably has an amino acid sequence containing one or more, more preferably 1 to 58, still more preferably 1 to 44, further preferably 1 to 29, particularly preferably 1 to 15, most preferably 1 to 6, still most preferably 1 to 3 amino acid substitutions, deletions, insertions, and/or additions relative to the amino acid sequence represented by SEQ ID NO:2.
Herein, preferred examples of amino acid substitutions are conservative substitutions. Specific examples include substitutions within each of the following groups in the parentheses: (glycine, alanine), (valine, isoleucine, leucine), (aspartic acid, glutamic acid), (asparagine, glutamine), (serine, threonine), (lysine, arginine), and (phenylalanine, tyrosine).
It is also known that proteins with amino acid sequences having high sequence identity to the original amino acid sequence can also have similar functions. Thus, another specific example of the CPT family protein is the following protein [13]:
[13] a protein having an amino acid sequence with at least 80% sequence identity to the amino acid sequence represented by SEQ ID NO:2, and having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
To maintain the function of the CPT family protein, the sequence identity to the amino acid sequence represented by SEQ ID NO:2 is preferably at least 85%, more preferably at least 90%, still more preferably at least 95%, particularly preferably at least 98%, most preferably at least 99%.
Specific examples of the CPT family protein also include the following proteins [14] to [16].
[14] a protein having the amino acid sequence represented by SEQ ID NO:3
[15] a protein having an amino acid sequence containing one or more amino acid substitutions, deletions, insertions, and/or additions relative to the amino acid sequence represented by SEQ ID NO:3, and having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound
To maintain the function of the CPT family protein, the protein preferably has an amino acid sequence containing one or more, more preferably 1 to 60, still more preferably 1 to 45, particularly preferably 1 to 30, most preferably 1 to 15, still most preferably 1 to 6, further most preferably 1 to 3 amino acid substitutions, deletions, insertions, and/or additions relative to the amino acid sequence represented by SEQ ID NO:3.
[16] a protein having an amino acid sequence with at least 80 sequence identity to the amino acid sequence represented by SEQ ID NO:3, and having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound
To maintain the function of the CPT family protein, the sequence identity to the amino acid sequence represented by SEQ ID NO:3 is preferably at least 85%, more preferably at least 90%, still more preferably at least 95%, particularly preferably at least 98%, most preferably at least 99%.
Herein, the sequence identity between amino acid sequences or between nucleotide sequences may be determined using the algorithm BLAST [Pro. Natl. Acad. Sci. USA, 90, 5873 (1993)] developed by Karlin and Altschul or FASTA [Methods Enzymol., 183, 63 (1990)].
Whether it is a protein having the above enzymatic activity may be determined by, for example, conventional techniques, such as by expressing a target protein in a transformant produced by introducing a gene coding for the target protein into Escherichia coli or other host organism, and determining the presence or absence of the function of the target protein by the corresponding activity measuring method.
The gene coding for a protein having the second amino acid sequence may be any gene that codes for the protein having the second amino acid sequence to express and produce the protein having the second amino acid sequence.
The gene coding for the CPT family protein may be any gene that codes for the CPT family protein to express and produce the CPT family protein. Specific examples of the gene include the following DNAs [11] and [12]:
[11] a DNA having the nucleotide sequence represented by SEQ ID NO:1; and
[12] a DNA which hybridizes under stringent conditions to a DNA having a nucleotide sequence complementary to the nucleotide sequence represented by SEQ ID NO:1, and which codes for a protein having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
Specific examples of the gene coding for the CPT family protein also include the following DNAs [13] and [14]:
[13] a DNA having the nucleotide sequence represented by SEQ ID NO:6; and
[14] a DNA which hybridizes under stringent conditions to a DNA having a nucleotide sequence complementary to the nucleotide sequence represented by SEQ ID NO:6, and which codes for a protein having an enzymatic activity that catalyzes a reaction of cis-chain elongation of an isoprenoid compound.
As used herein, the term “hybridize” means a process in which a DNA hybridizes to a DNA having a specific nucleotide sequence or a part of the DNA. Thus, the DNA having a specific nucleotide sequence or part of the DNA may have a nucleotide sequence long enough to be usable as a probe in Northern or Southern blot analysis or as an oligonucleotide primer in polymerase chain reaction (PCR) analysis. The DNA used as a probe may have a length of at least 100 bases, preferably at least 200 bases, more preferably at least 500 bases, but it may be a DNA of at least 10 bases, preferably of at least 15 bases in length.
Techniques to perform DNA hybridization experiments are well known. The hybridization conditions under which experiments are carried out may be determined according to, for example, Molecular Cloning, 2nd ed. and 3rd ed. (2001), Methods for General and Molecular Bacteriology, ASM Press (1994), Immunology methods manual, Academic press (Molecular), or many other standard textbooks.
Herein, the stringent conditions may include, for example, an overnight incubation at 42° C. of a DNA-immobilized filter and a DNA probe in a solution containing 50% formamide, 5×SSC (750 mM sodium chloride, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 μg/L denatured salmon sperm DNA, followed by washing the filter for example in a 0.2×SSC solution at approximately 65° C. Less stringent conditions may also be used. Changes in the stringency may be accomplished by manipulating the formamide concentration (lower percentages of formamide result in lower stringency), salt concentrations, or temperature. For example, low stringent conditions include an overnight incubation at 37° C. in a solution containing 6×SSCE (20×SSCE: 3 mol/L sodium chloride, 0.2 mol/L sodium dihydrogen phosphate, 0.02 mol/L EDTA, pH 7.4), 0.5% SDS, 30% formamide, and 100 μg/L denatured salmon sperm DNA, followed by washing in a 1×SSC solution containing 0.1% SDS at 50° C. In addition, to achieve even lower stringency, washes performed following hybridization may be done at higher salt concentrations (e.g., 5×SSC) in the above-mentioned low stringent conditions.
Variations in the above various conditions may be accomplished through the inclusion or substitution of blocking reagents used to suppress background in hybridization experiments. The inclusion of blocking reagents may require modification of the hybridization conditions for compatibility.
The DNA capable of hybridizing under stringent conditions as described above may have a nucleotide sequence with at least 80%, preferably at least 90%, more preferably at least 95%, still more preferably at least 98%, particularly preferably at least 99% sequence identity to the nucleotide sequence represented by SEQ ID NO:1 or 6 as calculated using a program such as BLAST or FASTA with the parameters mentioned above.
Whether the DNA which hybridizes to the above-mentioned DNA under stringent conditions codes for a protein having a predetermined enzymatic activity may be determined by conventional techniques, such as by expressing a target protein in a transformant produced by introducing a gene coding for the target protein into Escherichia coli or other host organism, and determining the presence or absence of the function of the target protein by the corresponding activity measuring method.
Moreover, conventional techniques may be employed to identify the amino acid sequence or nucleotide sequence of the protein. For example, total RNA is extracted from a growing plant, the mRNA is optionally purified, and a cDNA is synthesized by a reverse transcription reaction. Subsequently, degenerate primers are designed based on the amino acid sequence of a known protein corresponding to the target protein, a DNA fragment is partially amplified by RT-PCR, and the sequence is partially identified. Then, the RACE method or the like is performed to identify the full-length nucleotide sequence or amino acid sequence. The RACE method (rapid amplification of cDNA ends method) refers to a method in which, when the nucleotide sequence of cDNA is partially known, PCR is performed based on the nucleotide sequence information of such a known region to clone the unknown region extending to the cDNA terminal. This method can clone full-length cDNA by PCR without preparing a cDNA library.
The degenerate primers may preferably be prepared from plant-derived sequences having a highly similar sequence part to the target protein.
Moreover, if the nucleotide sequence coding for the protein is known, the full-length nucleotide sequence or amino acid sequence can be identified by designing a primer containing a start codon and a primer containing a stop codon using the known nucleotide sequence, followed by performing RT-PCR using a synthesized cDNA as a template.
The first amino acid sequence is an amino acid sequence capable of binding to lipid droplets.
As used herein, binding of a protein to lipid droplets means, for example, but not limited to, that the protein is fully or partially incorporated into the lipid droplets or inserted into the membrane structure of the lipid droplets, and also means embodiments in which, for example, the protein localizes to the surface or inside of the lipid droplets. Moreover, the concept of binding to lipid droplets also includes embodiments in which the protein forms a complex with another protein bound to the lipid droplets to be present as the complex on the lipid droplets.
Although the amino acid sequence capable of binding to lipid droplets may be any amino acid sequence capable of binding to lipid droplets, it is preferably an amino acid sequence derived from a protein capable of binding to lipid droplets, more preferably an amino acid sequence derived from a protein capable of binding to lipid droplets which is a class II protein. Here, since it is sufficient that the amino acid sequence derived from a protein capable of binding to lipid droplets is capable of binding to lipid droplets, it may be the entire amino acid sequence of the protein capable of binding to lipid droplets or a part of the amino acid sequence. The amino acid sequence capable of binding to lipid droplets is not limited, but is desirably an amino acid sequence of a transmembrane domain of a protein that is inherently bound to lipid droplets in the natural world.
Here, the term “class II protein refer to a lipid droplet-binding protein characterized by being capable of binding to lipid droplets, and further by localizing to the cytosol when lipid droplets are absent in the cells (c.f., Gidda et al., Plant Physiology, April 2016, Vol. 170, pp. 2052-2071). Unlike class I proteins which localize to ER fractions when lipid droplets are absent, class II proteins can transfer between the cytosol and lipid droplets.
Non-limiting examples of the protein capable of binding to lipid droplets which is a class II protein include lipid droplet-associated protein (LDAP)/small rubber particle protein (SRPP) family proteins. Preferred are lipid droplet-associated protein (LDAP)/small rubber particle protein (SRPP) family proteins, among others.
Thus, the first amino acid sequence is preferably an amino acid sequence derived from a lipid droplet-associated protein (LDAP)/small rubber particle protein (SRPP) family protein.
The gene coding for the LDAP/SRPP family protein may be of any origin and may be of microbial, animal, or plant origin, preferably of plant origin.
Examples of the plant include plants as listed above, as well as plants of the genus Persea such as Persea americana, plants of the genus Sesamum such as Sesamum indicum, plants of the genus Brassica such as Brassica napus, and plants of the genus Camellia such as Camellia japonica.
The gene coding for the LDAP/SRPP family protein is preferably derived from a plant, more preferably from at least one selected from the group consisting of plants of the genera Persea, Sesamum, Brassica, Camellia, Hevea, Sonchus, Taraxacum, and Parthenium, still more preferably from at least one selected from the group consisting of plants of the genera Persea, Hevea, and Taraxacum, particularly preferably from at least one plant selected from the group consisting of Persea americana, Hevea brasiliensis, and Taraxacum kok-saghyz, most preferably from Persea americana or Hevea brasiliensis.
Thus, the first amino acid sequence is preferably an amino acid sequence derived from a LDAP/SRPP family protein of plant origin, more preferably an amino acid sequence derived from a LDAP/SRPP family protein derived from at least one selected from the group consisting of plants of the genera Persea, Sesamum, Brassica, Camellia, Hevea, Sonchus, Taraxacum, and Parthenium, still more preferably an amino acid sequence derived from a LDAP/SRPP family protein derived from at least one selected from the group consisting of plants of the genera Persea, Hevea, and Taraxacum, particularly preferably an amino acid sequence derived from a LDAP/SRPP family protein derived from at least one plant selected from the group consisting of Persea americana, Hevea brasiliensis, and Taraxacum kok-saghyz, most preferably an amino acid sequence derived from a LDAP/SRPP family protein derived from Persea americana or Hevea brasiliensis.
Examples of the LDAP/SRPP family protein include LDAP from Persea americana (LDAP1, LDAP2), SRPP or REF from Hevea brasiliensis, SRPP from Taraxacum kok-saghyz, and REF or SRPP from plants of the genus Ficus.
Specific examples of the LDAP/SRPP family protein include the following proteins [101] to [103]. The protein [101] or the protein [102] is preferred among these, with the protein [101] being more preferred.
[101] a protein having the amino acid sequence represented by SEQ ID NO:4
[102] a protein having the amino acid sequence represented by SEQ ID NO:7
[103] a protein having the amino acid sequence represented by SEQ ID NO:8
It is also known that proteins with amino acid sequences having high sequence identity to the original amino acid sequence can also have similar functions. Thus, specific examples of the LDAP/SRPP family protein also include the following proteins [104] to [106]. The protein [104] or the protein [105] is preferred among these, with the protein [104] being more preferred.
[104] a protein having an amino acid sequence with at least 80, sequence identity to the amino acid sequence represented by SEQ ID NO:4 and being capable of binding to lipid droplets
[105] a protein having an amino acid sequence with at least 805 sequence identity to the amino acid sequence represented by SEQ ID NO:7 and being capable of binding to lipid droplets
[106] a protein having an amino acid sequence with at least 80% sequence identity to the amino acid sequence represented by SEQ ID NO:8 and being capable of binding to lipid droplets
To maintain the function of the LDAP/SRPP family protein, the sequence identity to the amino acid sequence represented by SEQ ID NO:4, 7, or 8 is preferably at least 85%, more preferably at least 90%, still more preferably at least 95%, particularly preferably at least 98%, most preferably at least 99%.
Whether it is a protein capable of binding to lipid droplets may be determined by, for example, conventional techniques, such as by expressing a target protein in a transformant produced by introducing a gene coding for the target protein into Escherichia coli or other host organism, and determining the presence or absence of the function of the target protein.
The gene coding for a protein having the first amino acid sequence may be any gene that codes for the protein having the first amino acid sequence to express and produce the protein having the first amino acid sequence.
The fusion protein of the present disclosure has the first amino acid sequence and the second amino acid sequence. Moreover, the fusion protein maintains the lipid droplet-binding ability of the first amino acid sequence and the enzymatic activity of the second amino acid sequence.
Once the sequences of the first and second amino acid sequences are determined, a person skilled in the art can produce the fusion protein by known techniques.
The method for producing the fusion protein preferably includes a gene preparation step involving linking DNA fragments of a gene coding for a protein having the first amino acid sequence and a gene coding for a protein having the second amino acid sequence to prepare a gene coding for the fusion protein.
Moreover, when the fusion protein is produced by cell-free protein synthesis, the method preferably includes, in addition to the gene preparation step, preparing a mRNA based on the gene coding for the fusion protein, and performing protein synthesis using a cell-free protein synthesis solution containing the mRNA coding for the fusion protein to produce the fusion protein.
When the fusion protein is produced using a transgenic cell, the method preferably includes, in addition to the gene preparation step,
preparing a transgenic cell into which the gene coding for the fusion protein has been introduced, and
culturing the transgenic cell to produce the fusion protein.
The number of amino acids between the first amino acid sequence and the second amino acid sequence is preferably three or less, more preferably two or less, still more preferably one or less, particularly preferably zero.
The phrase “the number of amino acids between the first amino acid sequence and the second amino acid sequence is zero” means that the first amino acid sequence is directly bound to the second amino acid sequence.
Although the fusion protein may contain amino acid sequences other than the first amino acid sequence and the second amino acid sequence, the proportion of the amino acids of the first and second amino acid sequences relative to the total amino acids of the fusion protein is preferably 90% or higher, more preferably 95% or higher, still more preferably 98% or higher, particularly preferably 99% or higher.
Although the fusion protein of the present disclosure has the first amino acid sequence and the second amino acid sequence, as described above, the fusion protein of the present disclosure is preferably a protein other than the following five embodiments:
(1) a protein having the amino acid sequence of LDAP1 from Persea americana (the amino acid sequence represented by SEQ ID NO:4) and the amino acid sequence of HRT1 from Hevea brasiliensis (the amino acid sequence represented by SEQ ID NO:2) in the stated order from the N-terminal side;
(2) a protein having the amino acid sequence of LDAP2 from Persea americana (the amino acid sequence represented by SEQ ID NO:7) and the amino acid sequence of HRT1 from Hevea brasiliensis (the amino acid sequence represented by SEQ ID NO:2) in the stated order from the N-terminal side;
(3) a protein having the amino acid sequence of SRPP from Hevea brasiliensis (the amino acid sequence represented by SEQ ID NO:8) and the amino acid sequence of HRT1 from Hevea brasiliensis (the amino acid sequence represented by SEQ ID NO:2) in the stated order from the N-terminal side;
(4) a protein having the amino acid sequence of LDAP1 from Persea americana (the amino acid sequence represented by SEQ ID NO:4) and the amino acid sequence of AtCPT5 from Arabidopsis thaliana (the amino acid sequence represented by SEQ ID NO:3) in the stated order from the N-terminal side; and
(5) a protein having the amino acid sequence of AtCPT5 from Arabidopsis thaliana (the amino acid sequence represented by SEQ ID NO:3) and the amino acid sequence of LDAP1 from Persea americana (the amino acid sequence represented by SEQ ID NO:4) in the stated order from the N-terminal side.
Moreover, the first amino acid sequence in the fusion protein of the present disclosure is preferably an amino acid sequence derived from the protein [102] or the protein [105].
Further, in this case the second amino acid sequence is preferably an amino acid sequence derived from a CPT family protein other than the amino acid sequence of HRT1 from Hevea brasiliensis (the amino acid sequence represented by SEQ ID NO:2), more preferably an amino acid sequence derived from any of the proteins [14] to [16].
Further, in this case the number of amino acids between the first amino acid sequence and the second amino acid sequences is preferably as described above.
The method for producing a substance (hydrophobic compound) of the present disclosure includes binding the above-described fusion protein to lipid droplets and accumulating a product (hydrophobic compound) in the lipid droplets by the enzymatic activity of the second amino acid sequence.
Since the fusion protein maintains the lipid droplet-binding ability of the first amino acid sequence and the enzymatic activity of the second amino acid sequence, the fusion protein can bind to lipid droplets and catalyze an enzymatic reaction on the lipid droplets by the enzymatic activity of the second amino acid sequence to produce a product (hydrophobic compound) which is accumulated in the lipid droplets.
As used herein, binding of the fusion protein to lipid droplets means, for example, but not limited to, that the fusion protein is fully or partially incorporated into the lipid droplets or inserted into the membrane structure of the lipid droplets, and also means embodiments in which, for example, the fusion protein localizes to the surface or inside of the lipid droplets. Moreover, the concept of binding to lipid droplets also includes embodiments in which the fusion protein forms a complex with another protein bound to the lipid droplets to be present as the complex on the lipid droplets.
As used herein, the term “lipid droplets” refers to inclusion bodies or cell organelles which have a membrane structure including phospholipids and membrane proteins (for example, oleosins or lipid droplet-associated protein (LDAP)/small rubber particle protein (SRPP) family proteins) and store triacylglycerols therein, as shown in
Lipid droplets are also called oil droplets, fat droplets, oil bodies, or LD.
Although the lipid droplets may be of any origin, the lipid droplets are preferably of plant origin.
Examples of the plant include plants as described for the origin of the gene coding for the LDAP/SRPP family protein.
Among the plants, the lipid droplets are more preferably derived from at least one selected from the group consisting of plants of the genera Persea, Sesamum, Brassica, and Camellia, still more preferably from a plant of the genus Persea, particularly preferably from Persea americana. Moreover, the origins of the first amino acid sequence and the lipid droplets are preferably the same.
The lipid droplets may be prepared by any method including known methods. For example, a plant piece may be crushed in a buffer, followed by filtration and centrifugation to collect fat pads. The fat pads may also be washed, if necessary. Then, centrifugation may be performed to separate lipid droplets from the fat pads.
The centrifugation may be carried out, for example, at 15,000 to 20,000×g for 15 to 60 minutes.
Moreover, the centrifugation temperature is preferably 0 to 10° C., more preferably 2 to 8° C., particularly preferably 4° C.
The fusion protein may be bonded to lipid droplets by any method that can bind the fusion protein to the lipid droplets. An exemplary method includes performing protein synthesis in the presence of both a cell-free protein synthesis solution containing a mRNA coding for the fusion protein and lipid droplets to bind the fusion protein to the lipid droplets.
Thus, it is preferred to perform protein synthesis in the presence of both a cell-free protein synthesis solution containing a mRNA coding for the fusion protein and lipid droplets (more specifically, while mixing a cell-free protein synthesis solution containing a mRNA coding for the fusion protein with lipid droplets) to obtain the lipid droplets bound to the fusion protein.
Here, binding of the fusion protein to lipid droplets by protein synthesis in the presence of both the cell-free protein synthesis solution and the lipid droplets means, for example, but not limited to, that the fusion protein synthesized by the protein synthesis is fully or partially incorporated into the lipid droplets or inserted into the membrane structure of the lipid droplets, and also means embodiments in which, for example, the fusion protein localizes to the surface or inside of the lipid droplets. Moreover, the concept of binding to lipid droplets also includes embodiments in which the fusion protein forms a complex with another protein bound to the lipid droplets to be present as the complex on the lipid droplets as described above.
The mRNA coding for the fusion protein serves as a translation template that can be translated to synthesize the fusion protein. Moreover, the mRNA coding for the fusion protein may be prepared by any method as long as it serves as a translation template that can be translated to synthesize the fusion protein. For example, the mRNA may be prepared by determining a nucleotide sequence coding for the fusion protein based on the amino acid sequence information of the fusion protein, obtaining a DNA fragment of a gene coding for the fusion protein based on the determined nucleotide sequence information, and performing an ordinary in vitro transcription reaction of the DNA fragment.
Alternatively, the mRNA may be prepared by linking DNA fragments of a gene coding for a protein having the first amino acid sequence and a gene coding for a protein having the second amino acid sequence, obtaining a DNA fragment of a gene coding for the fusion protein based on the nucleotide sequence information of the linked DNA fragments, and performing an ordinary in vitro transcription reaction of the DNA fragment.
As long as the cell-free protein synthesis solution contains the mRNA coding for the fusion protein, it may contain mRNAs coding for other proteins.
The mRNAs coding for other proteins may be those that can be translated to express the other proteins.
In addition to the mRNA (translation template) coding for the fusion protein, the cell-free protein synthesis solution preferably contains ATP, GTP, creatine phosphate, creatine kinase, L-amino acids, potassium ions, magnesium ions, and other components required for protein synthesis, as well as activity enhancers, if necessary. The use of such a cell-free protein synthesis solution provides a cell-free protein synthesis reaction system.
Examples of reaction systems or apparatuses for protein synthesis that can be used in the cell-free protein synthesis include a batch method (Pratt, J. M. et al., Transcription and Translation, Hames, 179-209, B. D. & Higgins, S. J., eds, IRL Press, Oxford (1984)), a continuous cell-free protein synthesis system in which amino acids, energy sources, and other components are supplied continuously to the reaction system (Spirin, A. S. et al., Science, 242, 1162-1164 (1988)), a dialysis method (Kigawa et al., 21st Annual Meeting of the Molecular Biology Society of Japan, WID 6), and an overlay method (instruction manual of PROTEIOS™ wheat germ cell-free protein synthesis core kit, Toyobo Co., Ltd.). Other methods may include supplying template RNA, amino acids, energy sources, and other components as necessary to the protein synthesis reaction system, and discharging the synthesis product or decomposition product as required.
A person skilled in the art can perform the protein synthesis in the presence of both the cell-free protein synthesis solution and lipid droplets with reference to the method described in WO 2017/002818, for example.
The production method of the present disclosure may include, after binding the fusion protein to lipid droplets, collecting the lipid droplets bound to the fusion protein, if necessary.
The lipid droplet-collecting step may be carried out by any method that can collect the lipid droplets, including methods usually used to collect lipid droplets. Specific examples include methods involving centrifugation. When the lipid droplets are collected by such a centrifugation method, the centrifugal force, centrifugation time, and centrifugation temperature may be appropriately selected so as to be able to collect the lipid droplets.
The centrifugation may be carried out, for example, at 15,000 to 20,000×g for 15 to 60 minutes.
Moreover, from the standpoint of maintaining the protein activity of the fusion protein bound to lipid droplets, the centrifugation temperature is preferably 0 to 10° C., more preferably 2 to 8° C., particularly preferably 4° C.
For example, when the cell-free protein synthesis is performed, the centrifugation may be performed to separate the lipid droplets and the cell-free protein synthesis solution into the upper and lower layers, respectively. Then, the upper lipid droplet layer may be collected to recover the lipid droplets bound to the fusion protein. The collected lipid droplets may be re-suspended in an appropriate buffer with a neutral pH for storage.
Then, a hydrophobic compound (e.g., a polyisoprenoid) may be synthesized using the lipid droplets bound to the fusion protein. Specifically, the hydrophobic compound may be synthesized by adding a substrate corresponding to the enzymatic activity of the second amino acid sequence to the lipid droplets bound to the fusion protein to cause an enzymatic reaction. For example, when the second amino acid sequence used is an amino acid sequence derived from a cis-prenyltransferase family protein, isopentenyl diphosphate (IPP), optionally together with farnesyl diphosphate (FPP), may be added as a substrate to synthesize a polyisoprenoid (natural rubber). Then, the synthesized hydrophobic compound is accumulated in the lipid droplets.
As described above, the method for producing a substance (hydrophobic compound) of the present disclosure includes binding the fusion protein to lipid droplets and accumulating a product (hydrophobic compound) in the lipid droplets by the enzymatic activity of the second amino acid sequence.
Moreover, the hydrophobic compound (e.g., polyisoprenoid (natural rubber)) produced by the production method of the present disclosure may be collected, for example, by subjecting the lipid droplets to a solidification step as described below.
The solidification step may be carried out by any solidification method, such as by adding the lipid droplets to a solvent that does not dissolve the polyisoprenoid (natural rubber), such as ethanol, methanol, or acetone, or adding an acid to the lipid droplets. The rubber (natural rubber) can be recovered as solids from the lipid droplets by the solidification step. The recovered rubber (natural rubber) may be dried if necessary before use.
As used herein, the term “polyisoprenoid” is a collective term for polymers composed of isoprene units (C5H9). Examples of the polyisoprenoid include sesterterpenes (C25), triterpenes (C30), tetraterpenes (C40), natural rubber, and other polymers. Also as used herein, the term “isoprenoid” refers to a compound containing an isoprene unit (C5H8), and conceptually includes polyisoprenoids.
The weight average molecular weight (Mw) of the polyisoprenoid is preferably 1,000 or more, more preferably 10,000 or more, still more preferably 100,000 or more, and the upper limit is not critical.
The weight average molecular weight (Mw) is determined by gel permeation chromatography (GPC) under the following conditions (1) to (7).
(1) Apparatus: HLC-8020 available from Tosoh Corporation
(2) Separation column: GMH-XL available from Tosoh Corporation
(3) Measurement temperature: 40° C.
(4) Carrier: tetrahydrofuran
(5) Flow rate: 0.6 mL/min
(6) Detector: differential refractometer, UV
(7) Molecular weight standards: polystyrene standards
The method for producing a rubber product of the present disclosure includes: producing a polyisoprenoid by the method for producing a substance (using droplets bound to the fusion protein); kneading the polyisoprenoid with an additive to obtain a kneaded mixture; forming a raw rubber product from the kneaded mixture; and vulcanizing the raw rubber product.
The rubber product may be any rubber product producible from rubber, preferably natural rubber. Examples include pneumatic tires, rubber rollers, rubber fenders, gloves, and medical rubber tubes.
When the rubber product is a pneumatic tire or, in other words, when the method for producing a rubber product of the present disclosure is the method for producing a pneumatic tire of the present disclosure, the raw rubber product forming step corresponds to forming a green tire from the kneaded mixture, and the vulcanization step corresponds to vulcanizing the green tire. Thus, the method for producing a pneumatic tire of the present disclosure includes: producing a polyisoprenoid as described above; kneading the polyisoprenoid with an additive to obtain a kneaded mixture; forming a green tire from the kneaded mixture; and vulcanizing the green tire.
In the kneading step, the polyisoprenoid produced by the production method (using droplets bound to the fusion protein) is kneaded with an additive to obtain a kneaded mixture.
Any additive may be used, including additives used in the production of rubber products. For example, in the case where the rubber product is a pneumatic tire, examples of the additive include rubber components other than the polyisoprenoid, reinforcing fillers such as carbon black, silica, calcium carbonate, alumina, clay, and talc, silane coupling agents, zinc oxide, stearic acid, processing aids, various antioxidants, softeners such as oils, waxes, vulcanizing agents such as sulfur, and vulcanization accelerators.
In the kneading step, kneading may be carried out using an open roll mill, a Banbury mixer, an internal mixer, or other rubber kneading machines.
In the raw rubber product forming step, a raw rubber product (green tire in the case of tire) is formed from the kneaded mixture obtained in the kneading step.
The raw rubber product may be formed by any method, including appropriate methods used to form raw rubber products. For example, in the case where the rubber product is a pneumatic tire, the kneaded mixture obtained in the kneading step may be extruded into the shape of a tire component and then formed and assembled with other tire components in a usual manner on a tire building machine to form a green tire (unvulcanized tire).
In the vulcanization step, the raw rubber product obtained in the raw rubber product forming step is vulcanized to obtain a rubber product.
The raw rubber product may be vulcanized by any method, including appropriate methods used to vulcanize raw rubber products. For example, in the case where the rubber product is a pneumatic tire, the green tire (unvulcanized tire) obtained in the raw rubber product forming step may be vulcanized by heating and pressing in a vulcanizer to obtain a pneumatic tire.
The vector of the present disclosure is a vector into which a gene coding for the above-described fusion protein has been introduced.
By introducing such a vector into cells for transformation, the gene coding for the fusion protein in the vector can be expressed to enhance production of a hydrophobic compound (e.g., a polyisoprenoid) in the cells.
The vector of the present disclosure may be prepared by inserting a nucleotide sequence of a gene coding for the fusion protein, preferably a nucleotide sequence of a promoter and a nucleotide sequence of a gene coding for the fusion protein, into a vector commonly known as a transformation vector by conventional techniques.
Examples of vectors that can be used to prepare the vector of the present disclosure include pBI vectors, binary vectors such as pGA482, pGAH, and pBIG, intermediate plasmids such as pLGV23Neo, pNCAT, and pMON200, and pH35GS containing GATEWAY cassette.
Examples of promoters that can be used to prepare the vector of the present disclosure include promoters generally used in genetic recombination-related fields, such as CaMV35 and NOS promoters.
As long as the vector of the present disclosure contains a nucleotide sequence of a gene coding for the fusion protein, it may contain other nucleotide sequences. In addition to these nucleotide sequences, the vector usually contains sequences originated from the vector as well as other sequences such as a restriction enzyme recognition sequence, a spacer sequence, a marker gene sequence, and a reporter gene sequence.
Moreover, the vector of the present disclosure may contain a nucleotide sequence of a gene coding for a coenzyme as necessary.
Examples of the marker gene include drug-resistant genes such as a kanamycin-resistant gene, a hygromycin-resistant gene, and a bleomycin-resistant gene. Moreover, the reporter gene may be introduced to determine the expression site in a plant. Examples of the reporter gene include a luciferase gene, a S-glucuronidase (GUS) gene, a green fluorescent protein (GFP), and a red fluorescent protein (RFP).
A transgenic cell transformed to express the above-described fusion protein can be obtained, for example, by introducing the vector of the present disclosure into a cell such as a plant cell. Thus, a transgenic cell into which a gene coding for the fusion protein has been introduced can be obtained. Then, as the fusion protein is expressed in the transgenic cell, the functions including the certain enzymatic activity of the protein are newly exhibited in the cell with the introduced gene coding for the fusion protein, and thus the fusion protein binds to lipid droplets in the cell to synthesize a hydrophobic compound (e.g., a polyisoprenoid), thereby enhancing production of the hydrophobic compound (e.g., polyisoprenoid) in the cell. For example, when the second amino acid sequence used is an amino acid sequence derived from a cis-prenyltransferase family protein, the production of a polyisoprenoid (natural rubber) can be enhanced. Then, the synthesized hydrophobic compound is accumulated in the lipid droplets in the cell.
Moreover, a vector containing a nucleotide sequence of a gene coding for a coenzyme may be introduced, if necessary, together with the vector of the present disclosure into a cell such as a plant cell. Needless to say, a vector containing a nucleotide sequence of a gene coding for a coenzyme may be used as the vector of the present disclosure.
The host into which a gene coding for the fusion protein is to be introduced may be any organism containing lipid droplets. Examples include microorganisms such as algae and microalgae, plants, and animals. Among these, the host is preferably a microorganism or a plant, more preferably a microorganism, still more preferably a microalga.
In general, it is difficult to synthesize a hydrophobic compound using an organism. However, the synthesis of a hydrophobic compound can be facilitated when a gene coding for the fusion protein is introduced into the host described above.
The microorganisms may be either prokaryotes or eukaryotes. Examples include prokaryotes such as microorganisms of the genera Escherichia, Bacillus, Synechocystis, and Synechococcus; and eukaryotic microorganisms such as yeasts and filamentous fungi. Preferred among these are Escherichia coli, Bacillus subtilis, Rhodosporidium toruloides, and Mortierella sp., with Escherichia coli being more preferred.
Among the algae and microalgae, algae of the genera Chlamydomonas, Chlorella, Phaeodactylum, and Nannochloropsis are preferred because there are established genetic recombination techniques. More preferred are algae of the genus Nannochloropsis. Specific examples of the algae of the genus Nannochloropsis include Nannochloropsis oculata, Nannochloropsis gaditana, Nannochloropsis salina, Nannochloropsis oceanica, Nannochloropsis atomus, Nannochloropsis maculata, Nannochloropsis granulata, and Nannochloropsis sp. Nannochloropsis oculata or Nannochloropsis gaditana is preferred among these, with Nannochloropsis oculata being more preferred.
Among the plants, Arabidopsis thaliana, Brassica napus, Brassica rapa, Cocos nucifera, Elaeis guineensis, Cuphea, Glycine max, Zea mays, Oryza sativa, Helianthus annuus, Cinnamomum camphora, and Jatropha curcas are preferred, with Arabidopsis thaliana being more preferred.
Although a method for preparing the transgenic cell is briefly described below, such a transgenic cell may be prepared by conventional techniques.
The vector of the present disclosure may be introduced into a plant (including plant cells, such as calluses, cultured cells, spheroplasts, or protoplasts) by any method that can introduce DNA into plant cells. Examples include methods using agrobacterium (JP 559-140885 A, JP S60-70080 A, WO94/00977), electroporation methods (JP S60-251887 A), and methods using particle guns (gene guns) (JP 2606856 B, JP 2517813 B).
In addition, the transgenic cell may be prepared by introducing the vector of the present disclosure into, for example, an organism (e.g., a microorganism, yeast, animal cell, or insect cell) or a part thereof, an organ, a tissue, a cultured cell, a spheroplast, or a protoplast, e.g., by any of the above-described DNA introduction methods.
The transgenic cell can be produced by the above or other methods. Here, the form of the transgenic cell is not limited as long as it includes a transformed cell. The transgenic cell may be a single cell or may be a tissue or transformant of combined cells. The term “transgenic cell” conceptually includes not only transgenic cells produced by the above methods, but also all of their progeny or clones and even progeny organisms obtained by passaging these cells.
For example, once obtaining transgenic plant cells into which the vector of the present disclosure has been introduced, progeny or clones can be produced from the transgenic plant cells by sexual or asexual reproduction, tissue culture, cell culture, cell fusion, or other techniques. Moreover, the transgenic plant cells, or their progeny or clones may be used to obtain reproductive materials (e.g., seeds, fruits, cuttings, stem tubers, root tubers, shoots, adventitious buds, adventitious embryos, calluses, protoplasts), which may then be used to produce the transgenic plant on a large scale.
Techniques to regenerate plants (transgenic plants) from transgenic plant cells are known; for example, Doi et al. disclose techniques for eucalyptus (Japanese patent application No. H11-127025), Fujimura et al. disclose techniques for rice (Fujimura et al., (1995), Plant Tissue Culture Lett., vol. 2: p. 74-), Shillito et al. disclose techniques for corn (Shillito et al., (1989), Bio/Technology, vol. 7: p. 581-), Visser et al. disclose techniques for potato (Visser et al., (1989), Theor. Appl. Genet., vol. 78: p. 589-), and Akama et al. disclose techniques for Arabidopsis thaliana (Akama et al., (1992), Plant Cell Rep., vol. 12: p. 7-). A person skilled in the art can regenerate plants from transgenic plant cells according to these documents.
Whether a target protein gene is expressed in a regenerated plant may be determined by well-known methods. For example, Western blot analysis may be used to assess the expression of a target protein.
Seeds can be obtained from the transgenic plant, for example, as follows. The transgenic plant may be rooted in an appropriate medium and transplanted to water-containing soil in a pot. It may be grown under proper cultivation conditions to finally produce seeds, which are then collected. Moreover, plants can be grown from seeds, for example, as follows: the seeds obtained from the transgenic plant as described above may be sown in water-containing soil and grown under proper cultivation conditions into plants.
In the present disclosure, by introducing the vector of the present disclosure into cells such as plant cells, the gene coding for the fusion protein in the vector can be expressed to enhance production of a hydrophobic compound (e.g., a polyisoprenoid) in the cells. Specifically, a hydrophobic compound (e.g., a polyisoprenoid) may be produced by culturing, for example, transgenic cells produced as described above, calluses obtained from the transgenic plant cells, or cells redifferentiated from the calluses in an appropriate medium, or by growing, for example, transgenic plants redifferentiated from the transgenic plant cells, or plants grown from seeds collected from these transgenic plants under proper cultivation conditions.
Thus, another aspect of the present disclosure relates to a method of introducing the vector of the present disclosure into cells such as plant cells to enhance production of a hydrophobic compound (e.g., a polyisoprenoid) in the cells.
The method for producing a rubber product of the present disclosure includes: producing a polyisoprenoid by the method for producing a substance (using a transgenic cell into which a gene coding for the fusion protein has been introduced); kneading the polyisoprenoid with an additive to obtain a kneaded mixture; forming a raw rubber product from the kneaded mixture; and vulcanizing the raw rubber product.
The rubber product is as described above in the present disclosure.
When the rubber product is a pneumatic tire or, in other words, when the method for producing a rubber product of the present disclosure is the method for producing a pneumatic tire of the present disclosure, the raw rubber product forming step corresponds to forming a green tire from the kneaded mixture, and the vulcanization step corresponds to vulcanizing the green tire. Thus, the method for producing a pneumatic tire of the present disclosure includes: producing a polyisoprenoid by the method for producing a substance (using a transgenic cell into which a gene coding for the fusion protein has been introduced); kneading the polyisoprenoid with an additive to obtain a kneaded mixture; forming a green tire from the kneaded mixture; and vulcanizing the green tire.
In the kneading step, the polyisoprenoid produced by the method for producing a substance (using a transgenic cell into which a gene coding for the fusion protein has been introduced) is kneaded with an additive to obtain a kneaded mixture.
The polyisoprenoid produced using a transgenic cell into which a gene coding for the fusion protein has been introduced may be obtained by harvesting lipid droplets from the transgenic cell, and subjecting the harvested lipid droplets to a solidification step as described below.
The lipid droplets may be harvested from the transgenic cell by any method including usual methods. For example, the lipid droplets may be harvested by cutting a part of the transgenic cell, crushing the cut tissue, and extracting the crushed tissue with an organic solvent.
The harvested lipid droplets are subjected to a solidification step. The solidification may be carried out by any method, such as by adding the lipid droplets to a solvent that does not dissolve the polyisoprenoid (natural rubber), such as ethanol, methanol, or acetone, or adding an acid to the lipid droplets. The rubber (natural rubber) can be recovered as solids from the lipid droplets by the solidification step. The recovered rubber (natural rubber) may be dried if necessary before use.
Any additive may be used including additives used in the production of rubber products. For example, in the case where the rubber product is a pneumatic tire, examples of the additive include rubber components other than the rubber obtained from the lipid droplets, reinforcing fillers such as carbon black, silica, calcium carbonate, alumina, clay, and talc, silane coupling agents, zinc oxide, stearic acid, processing aids, various antioxidants, softeners such as oils, waxes, vulcanizing agents such as sulfur, and vulcanization accelerators.
In the kneading step, kneading may be carried out using an open roll mill, a Banbury mixer, an internal mixer, or other rubber kneading machines.
The raw rubber product forming step is as described above in the present disclosure.
The vulcanization step is as described above in the present disclosure.
The present disclosure is specifically described with reference to examples, but the present disclosure is not limited to these examples.
[Synthesis of Persea americana cDNA]
An avocado (Hass avocado) from Mexico, purchased from Shotei, was used. The avocado was preliminarily preserved at −80° C. The avocado was crushed in liquid nitrogen and the total RNA was extracted with TRIzol reagent (Invitrogen), followed by sugar precipitation. To 100 μL of the RNA solution was added 10 μL of 3M NaOAc, and the mixture was left on ice for one hour. Then, the mixture was centrifuged at 13,000×g at 4° C. for 15 minutes, and the supernatant was collected. To the supernatant was added 250 μL of 100% ethanol, and the mixture was left at −30° C. for one hour. The resulting mixture was again centrifuged at 13,000×g at 4° C. for 10 minutes, and the supernatant was collected. Thereto was added 70% ethanol, and the mixture was centrifuged at 13,000×g at 4° C. for five minutes. The supernatant was discarded, followed by drying. To the dried product was added 50 μL of DEPC. A cDNA was synthesized from the obtained RNA using Fast Gene Scriptase II (NIPPON Genetics EUROPE).
DNA fragments of LDAP1 and LDAP2 were amplified by PCR using the cDNA as a template and KOD-Plus-Neo (TOYOBO). The PCR was carried out using the following primer combinations.
The PCR-amplified product was separated by 0.8% agarose gel electrophoresis, and the gel was cut out, followed by purification using Fast Gene™ Gel/PCR Extraction Kit (Nippon GENETICS). The purified DNA fragment was inserted into a pGEM-T EASY vector by TA cloning. Next, Escherichia coli DH5a was transformed and then applied to LB agar medium and cultured overnight at 37° C., followed by blue-white screening. Multiple white colonies were selected to isolate a single colony into which the target sequence had been introduced. The colony was cultured on LB liquid medium containing ampicillin (Amp) (50 μg/mL Amp) at 37° C. for 16 hours. The cultured cells were collected, and the plasmid was recovered using Fast Gene™ Plasmid Mini Kit (GENETICS). Finally, the nucleotide sequence was confirmed by DNA sequencing.
LDAP genes (LDAP1 and LDAP2) were produced as described above. The genes were sequenced to identify the full-length nucleotide sequence and amino acid sequence. The nucleotide sequence of LDAP1 is given by SEQ ID NO:5. The amino acid sequence of LDAP1 is given by SEQ ID NO:4. The nucleotide sequence of LDAP2 is given by SEQ ID NO:9. The amino acid sequence of LDAP2 is given by SEQ ID NO:7.
[Acquisition of CPT Gene from Arabidopsis thaliana]
A CPT gene (AtCPT5) was obtained from Arabidopsis thaliana in the same manner as described above. The gene was sequenced to identify the full-length nucleotide sequence and amino acid sequence. The nucleotide sequence of AtCPT5 is given by SEQ ID NO:6. The amino acid sequence of AtCPT5 is given by SEQ ID NO:3.
The LDAP1 and LDAP2 genes obtained as above were each introduced into a cell-free expression plasmid (pEU-E01-His-TEV-MCS-N2 vector, CellFree Science, Matsuyama, Japan). The LDAP1 and LDAP2 genes were introduced into the BamHI-SpeI site in the multicloning site.
AtCPT5 with BamHI and NotI sites was amplified by PCR using the primers 5 and 6 below, treated with the restriction enzymes BamHI and NotI, and introduced into a cell-free expression plasmid (pEU-E01-His-TEV-MCS-N2 vector, CellFree Science, Matsuyama, Japan) treated with the same restriction enzymes to prepare pEU-AtCPT5. Then, LDAP2 with XhoI and BamHI sites was amplified by PCT using the primers 7 and 8 below, treated with the restriction enzymes XhoI and BamHI, and introduced into pEU-AtCPT5 treated with the same restriction enzymes to prepare pEU-LDAP2-AtCPT5.
LDAP2 with NotI and SpeI sites was amplified by PCR using the primers 9 and 10 below, treated with the restriction enzymes NotI and SpeI, and introduced into pEU-AtCPT5 treated with the same enzymes to prepare pEU-AtCPT5-LDAP2.
AtCPT5 with EcoRV and BamHI sites was amplified by PCR using the primers 11 and 12 below, treated with the restriction enzymes EcoRV and BamHI, and introduced into a cell-free expression plasmid (pEU-E01-His-TEV-MCS-N2 vector, CellFree Science, Matsuyama, Japan) treated with the same restriction enzymes to prepare pEU-AtCPT5-2. Then, LDAP1 with BamHI and SpeI sites was amplified by PCT using the primers 13 and 14 below, treated with the restriction enzymes BamHI and SpeI, and introduced into pEU-AtCPT5-2 treated with the same restriction enzymes to prepare pEU-AtCPT5-LDAP1.
Lipid droplets (LD) were extracted from the mesocarp of an avocado. First, 10 g of the mesocarp of an avocado (Persea americana) was crushed together with 20 mL (twice the amount of the avocado) of buffer A (shown in the table below) using a blender and then further disrupted in a mortar. The disrupted solution was filtered through Miracloth, and the filtrate was centrifuged at 15,000×g at 4° C. for 30 minutes. The fat pads floating on the upper layer were collected and resuspended. Thereto was added 5 mL of buffer B (shown in the table below), and the mixture was centrifuged at 15,000×g at 4° C. for 30 minutes. Again, the fat pads were collected and combined with buffer B, and the mixture was centrifuged at 15,000×g at 4° C. for 30 minutes. Next, to cause separation at the particle size of LD, the fat pads were collected and centrifuged at 1,000×g at 4° C. for 10 minutes, followed by collecting the lower layer solution with a syringe. The solution was centrifuged at 15,000×g at 4° C. for 30 minutes, and the lower layer was removed. Then, 60 μL of TD buffer (shown in the table below) was added to obtain a LD solution.
[Synthesis and Extraction of mRNA in Cell-Free Expression System]
mRNA was synthesized from the construct prepared as described above using WEPRO7240H Expression Kit (ENDEXT Technology, CellFree Science). The reaction was performed using the following formulation at 37° C. for three hours.
The reaction was followed by precipitation with ethanol. Finally, 25 μL of 1×DB buffer (shown in the table below) at room temperature was added to dissolve the precipitate. The collected mRNA was stored at −80° C.
4× DB formulation
In order to express each protein on the LD purified as described above using the mRNA prepared as described above, a translation reaction was performed using a wheat germ-derived cell-free protein expression kit (WEPRO7240H Expression Kit, ENDEXT Technology, CellFree Sciences).
First, 15 μL of 1×DB was added to 7.5 μL of the mRNA prepared as described above to prepare a mRNA premix. Then, a translation reaction solution was prepared using the following formulation.
Here, the mRNA of a coenzyme was also added in the examples using the coenzyme in Table 6.
X: LD was prepared in an amount corresponding to 12.5 μg.
An amount of 650 μL of SDW was added to a plastic tube (PP container, 4.5 mL, 5-094-02, AS ONE). A dialysis cup (MWCO12000, COSMO BIO CO., LTD.) was placed in the tube so that air did not enter the dialysis membrane, and then allowed to stand for 10 minutes or longer. Then, the SDW was discarded from the tube, 650 μL of 1×DB buffer was added as an external fluid, and 50 μL of the prepared translation reaction solution as an internal fluid was added into the dialysis cup. The dialysis cup was placed in the plastic tube so that air did not enter the dialysis membrane, and the cup was covered with a parafilm, followed by performing a reaction at 26° C. for five hours. After the reaction, the external fluid was replaced with new 1×DB, and 5 μL of the mRNA premix was added to the internal fluid, followed by further reaction for 13 hours. After completion of the reaction, the reaction solution was collected, 50 μL of which was transferred to a 1.5 mL tube, and 50 μL of 1×DB was added, followed by centrifugation at 15,000×g at 4° C. for 20 minutes. The LD layer and aqueous layer separated by centrifugation were transferred to a new tube, to which was added 50 μL of 1×DB, and the mixture was centrifuged at 15,000×g at 4° C. for 20 minutes. The aqueous layer was drawn out from the centrifuged solution using a syringe (TERUMO needle, NN-2719S, Terumo Corporation; 1 mL TERUMO tuberculin syringe, SS-01T, Terumo Corporation). The remaining LD layer was diluted with 100 μL of TD buffer to prepare a purified LD solution. Separately, to the tube from which the LD and aqueous layers were removed was added 100 μL of a buffer for resuspension to prepare a pellet solution.
The purified LD solution prepared as described above was shaken as the reaction composition shown below containing [4-14C]IPP (NEC773, Perkin Elmer) in a bath at 30° C. for 18 hours. Here, to align the assay input of the purified LD solution, the protein concentration was measured by the Bradford method using Gene Spec, and an amount corresponding to 2 μg of proteins was input. In addition, in order to measure the background, ultrapure water was added instead of the purified LD solution to prepare a sample, which was then shaken as above.
X: The purified LD was prepared in an amount corresponding to 2 μg.
After the reaction, 200 μL of saturated saline was added and stirred to terminate the reaction. To the reaction mixture was added 1 mL of saturated saline-saturated n-butanol, followed by agitation for one minute in a vortex mixer. The resulting mixture was centrifuged at 15,000 rpm for one minute at room temperature, and then the upper butanol layer was collected to extract a polyisoprenoid. An amount of 50 μL of the extract was added to 3 mL of clear-sol, and the radioactivity was measured using a liquid scintillation counter (LD6500, BECKMAN COULTER). The background value was subtracted from the measured value and, since 50 μL out of 1 mL was measured, the difference was multiplied by 20 times to calculate the count of the extract as a whole. A higher radioactivity (dpm) means a larger production of the polyisoprenoid (natural rubber), indicating a higher rubber synthesis activity.
Table 6 shows the results.
As shown in Table 6, the fusion proteins exhibited higher activity on lipid droplets than AtCPT5 alone.
The reaction solution after protein expression and a lipid droplet fraction obtained by fractionating the reaction solution were subjected to SDS-PAGE electrophoresis to confirm expression of the protein and whether the protein was expressed on the purified LD. A mixture of 2 μL of a sample with 10 μL of a 2×SDS sample buffer and 8 μL of water was applied in 20 μL portions. The acrylamide concentration of the separation gel was 12% (w/v) or 15% (w/v), and Coomassie brilliant blue R-250 was used for staining.
The results of the lipid droplet-binding ability test of LDAP1, LDAP2, and SRPP shown in
The results of the lipid droplet-binding ability test of LDAP1, LDAP2, and SRPP fusion proteins shown in
The results of the lipid droplet-binding test of LDAP1-AtcPT5 and AtCPT5-LDAP1 shown in Table 8 demonstrated that LDAP1 fused with a protein other than HRT1 (AtCPT5) maintained the lipid droplet-binding ability, and the ability was not lost by the fusion orientation. This reveals that the enzyme (second amino acid) sequence to be linked to LDAP1 (lipid droplet-binding protein) is not limited, and the fusion orientation is not limited either. It is also revealed that AtCPT5 not fused with LDAP1 was not bound to lipid droplets.
The above results proved that a fusion protein which has an amino acid sequence (first amino acid sequence) capable of binding to lipid droplets, and an amino acid sequence (second amino acid sequence) having an enzymatic activity to synthesize a hydrophobic compound, with the enzymatic activity of the second amino acid sequence being maintained, can accumulate a hydrophobic compound (e.g., a polyisoprenoid) in lipid droplets by the enzymatic activity of the second amino acid sequence.
Moreover, binding of the fusion protein to lipid droplets was also confirmed by the fact that enzymatic activity was exhibited in the reaction test using the purified LD solution and also by the electrophoresis results.
SEQ ID NO:1: nucleotide sequence of gene coding for HRT1 from Hevea brasiliensis
SEQ ID NO:2: amino acid sequence of HRT1 from Hevea brasiliensis
SEQ ID NO:3: amino acid sequence of AtCPT5 from Arabidopsis thaliana
SEQ ID NO:4: amino acid sequence of LDAP1 from Persea americana
SEQ ID NO:5: nucleotide sequence of gene coding for LDAP1 from Persea americana
SEQ ID NO:6: nucleotide sequence of gene coding for AtCPT5 from Arabidopsis thaliana
SEQ ID NO:7: amino acid sequence of LDAP2 from Persea americana
SEQ ID NO:8: amino acid sequence of SRPP from Hevea brasiliensis
SEQ ID NO:9: nucleotide sequence of gene coding for LDAP2 from Persea americana
SEQ ID NO:10: Primer 1
SEQ ID NO:11: Primer 2
SEQ ID NO:12: Primer 3
SEQ ID NO:13: Primer 4
SEQ ID NO:14: Primer 5
SEQ ID NO:15: Primer 6
SEQ ID NO:16: Primer 7
SEQ ID NO:17: Primer 8
SEQ ID NO:18: Primer 9
SEQ ID NO:19: Primer 10
SEQ ID NO:20: Primer 11
SEQ ID NO:21: Primer 12
SEQ ID NO:22: Primer 13
SEQ ID NO:23: Primer 14
Number | Date | Country | Kind |
---|---|---|---|
2019-208662 | Nov 2019 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/040253 | 10/27/2020 | WO |