The present invention relates to a translation system for expressing a polypeptide using a new genetic code different from the universal genetic code.
Proteins are synthesized from genes according to a genetic code. Since most living organisms use a common genetic code, living organisms can obtain genes from other living organisms by horizontal gene transfer, which allows the synthesis of useful proteins. In the engineering point of view, this common genetic code made it possible to introduce genes obtained from any living organism into Escherichia coli to produce useful proteins. Meanwhile, harmful genes may also transfer into and invade other living organisms and therefore it is feared, for example, exogenous genes introduced in genetically modified microorganisms flow into microorganisms in the natural world, or exogenous genes in genetically modified plants transfer to surrounding plants in the natural world.
Genes in living organisms having a genetic code greatly different from other living organisms do not encode proteins which are functional when transferred to other living organisms and therefore it is considered that such living organisms would be safe artificial living organisms. For example, if genetic codes of genetically modified microorganisms or plants are greatly different from the universal genetic codes, there is no risk of transferring modified genes as sense genes to living organisms in the natural world.
Moreover, in cell-free translation systems, use of a new genetic code different from the universal genetic code allows safe synthesis of proteins that require attention in handling the genes thereof.
Reported examples of use of a genetic code different from the universal genetic code include a study involving replacing all 314 TAG stop codons in the Escherichia coli genome with TAA stop codons (Non Patent Literature 1-3). This study also proposes making a TAG nonsense codon by removing the RF1 gene from this Escherichia coli genome and reassigning the TAG codon to an unnatural amino acid. However, genes based on this genetic code cannot be called as genes orthogonal to existing living organisms since such genes become easily translatable only by acquisition of a suppressor tRNA that reads the TAG codon by the living organism.
Meanwhile, the present inventors have succeed in designating Leu to a codon for Ala by replacing an anticodon loop of a tRNALeu with the anticodon loop same as that of a tRNAAla (Non Patent Literature 4).
An object of the present invention is to provide a translation system that can express polypeptide according to a genetic code different from the universal genetic code, and the like.
To achieve the aforementioned object, the present inventors considered interchanging the relations between codons and amino acids to produce new genetic codes different from the universal genetic code by interchanging the anticodon loops of seryl-tRNA, leucyl-tRNA, and/or alanyl-tRNA, since it is well known that seryl-tRNA synthetase, leucyl-tRNA synthetase, and alanyl-tRNA synthetase do not recognize the anticodon portion of tRNA when catalyzing the aminoacylation of tRNAs.
More specifically, the present inventors interchanged codons for these amino acids in a gene encoding a protein and interchanged the relations between the anticodons and the amino acids in the aminoacyl-tRNAs correspondingly and succeeded in expressing polypeptides having predetermined amino acid sequences according to a new genetic code different from the universal genetic code, thereby completing the present invention.
Accordingly, the present invention relates to:
[1]
A translation system for expressing a polypeptide having a predetermined amino acid sequence according to a code different from the universal genetic code, comprising
a template nucleic acid having a sequence modified from the nucleic acid sequence encoding the polypeptide by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine; and
at least either (i) or (ii):
(i) an aminoacyl-tRNA having an anticodon to a codon after interchange and to which an amino acid encoded by a codon before the interchange is bound;
(ii) a tRNA having an anticodon to a codon after interchange and recognized by an aminoacyl-tRNA synthetase corresponding to an amino acid encoded by a codon before the interchange, or a nucleic acid encoding the tRNA, the amino acids, and the aminoacyl-tRNA synthetase or a nucleic acid encoding the aminoacyl-tRNA synthetase.
[2]
The translation system according to [1], wherein the template nucleic acid has a sequence modified from the nucleic acid sequence encoding the polypeptide by interchanging a codon encoding serine and a codon encoding leucine, and the translation system comprises
at least either (i) or (ii) and
at least either (iii) or (iv):
(i) an aminoacyl-tRNA having an anticodon to a codon encoding leucine and to which serine is bound;
(ii) a aminoacyl-tRNA having an anticodon to a codon encoding leucine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase;
(iii) an aminoacyl-tRNA having an anticodon to a codon encoding serine and to which leucine is bound;
(iv) a tRNA having an anticodon to a codon encoding serine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and a leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase.
[3]
The translation system according to [1], wherein the template nucleic acid has a sequence modified from the nucleic acid sequence encoding the polypeptide by interchanging a codon encoding serine and a codon encoding alanine, and the translation system comprises
at least either (i) or (ii) and
at least either (iii) or (iv):
(i) an aminoacyl-tRNA having an anticodon to a codon encoding alanine and to which serine is bound;
(ii) a tRNA having an anticodon to a codon encoding alanine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase;
(iii) an aminoacyl-tRNA having an anticodon to a codon encoding serine and to which alanine is bound;
(iv) a tRNA having an anticodon to a codon encoding serine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and an alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase.
[4]
The translation system according to [1], wherein the template nucleic acid has a sequence modified from the nucleic acid sequence encoding the polypeptide by interchanging a codon encoding leucine and a codon encoding alanine, and the translation system comprises
at least either (i) or (ii) and
at least either (iii) or (iv):
(i) an aminoacyl-tRNA having an anticodon to a codon encoding alanine and to which leucine is bound;
(ii) a tRNA having an anticodon to a codon encoding alanine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and a leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase;
(iii) an aminoacyl-tRNA having an anticodon to a codon encoding leucine and to which alanine is bound;
(iv) a tRNA having an anticodon to a codon encoding leucine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and an alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase.
[5]
The translation system according to [1], wherein the template nucleic acid has a sequence modified from the nucleic acid sequence encoding the polypeptide by exchanging a codon encoding serine for a codon encoding leucine, a codon encoding leucine for a codon encoding alanine, and a codon encoding alanine for a codon encoding serine, and the translation system comprises;
at least either (i) or (ii),
at least either (iii) or (iv), and
at least either (v) or (vi):
(i) an aminoacyl-tRNA having an anticodon to a codon encoding leucine and to which serine is bound;
(ii) a tRNA having an anticodon to a codon encoding leucine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase;
(iii) an aminoacyl-tRNA having an anticodon to a codon encoding alanine and to which leucine is bound;
(iv) a tRNA having an anticodon to a codon encoding alanine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and a leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase;
(v) an aminoacyl-tRNA having an anticodon to a codon encoding serine and to which alanine is bound;
(vi) a tRNA having an anticodon to a codon encoding serine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and an alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase.
[6]
The translation system according to [1], wherein the template nucleic acid has a sequence modified from the nucleic acid sequence encoding the polypeptide by exchanging a codon encoding serine for a codon encoding alanine, a codon encoding alanine for a codon encoding leucine, and a codon encoding leucine for a codon encoding serine, and the translation system comprises;
at least either (i) or (ii),
at least either (iii) or (iv), and
at least either (v) or (vi):
(i) an aminoacyl-tRNA having an anticodon to a codon encoding alanine and to which serine is bound;
(ii) a tRNA having an anticodon to a codon encoding alanine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase;
(iii) an aminoacyl-tRNA having an anticodon to a codon encoding leucine and to which alanine is bound;
(iv) a tRNA having an anticodon to a codon encoding leucine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and an alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase.
(v) an aminoacyl-tRNA having an anticodon to a codon encoding serine and to which leucine is bound;
(vi) a tRNA having an anticodon to a codon encoding serine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and a leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase.
[7]
A cell comprising the translation system according to any of [1] to [6].
[8]
A living organism comprising the translation system according to any of [1] to [6].
[9]
A method for producing a polypeptide having a predetermined amino acid sequence, comprising a step of:
expressing the polypeptide from the template nucleic acid in the translation system according to any of [1] to [6].
[10]
A method for producing a polypeptide having a predetermined amino acid sequence, comprising a step of:
expressing, in a translation system for expressing a polypeptide having a predetermined amino acid sequence according to a code different from the universal genetic code, the polypeptide from a template nucleic acid,
the translation system comprising:
the template nucleic acid having a sequence modified from a nucleic acid sequence encoding the polypeptide by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine; and
at least either (i) or (ii):
(i) an aminoacyl-tRNA having an anticodon to a codon after interchange and to which an amino acid encoded by a codon before the interchange is bound;
(ii) a tRNA having an anticodon to a codon after interchange and recognized by an aminoacyl-tRNA synthetase corresponding to an amino acid encoded by a codon before the interchange or a nucleic acid encoding the tRNA, the amino acid, and the aminoacyl-tRNA synthetase or a nucleic acid encoding the aminoacyl-tRNA synthetase.
[11]
A kit comprising
a translation system for expressing a polypeptide having a predetermined amino acid sequence according to a code different from the universal genetic code,
the translation system comprising:
a template nucleic acid having a sequence modified from a nucleic acid sequence encoding the polypeptide by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine; and
at least either (i) or (ii):
(i) an aminoacyl-tRNA having an anticodon to a codon after interchange and to which an amino acid encoded by a codon before the interchange is bound;
(ii) a tRNA having an anticodon to a codon after interchange and recognized by an aminoacyl-tRNA synthetase corresponding to an amino acid encoded by a codon before the interchange or a nucleic acid encoding the tRNA, the amino acid, and the aminoacyl-tRNA synthetase or a nucleic acid encoding the aminoacyl-tRNA synthetase.
With the translation system according to the present invention, polypeptides having predetermined amino acid sequences can be expressed according to a new genetic code different from the universal genetic code. Use of genes contained in such a translation system can prevent horizontal gene transfer of harmful genes, because such genes can be kept orthogonal to living organisms in the natural world. Polypeptides requiring attentions in handling their genes can be synthesized safely by applying the translation system according to the present invention to a cell-free translation system or a translation system using Escherichia coli,
Moreover, by applying the translation system according to the present invention to genetically modified living organisms such as genetically modified microorganisms or genetically modified plants, transmission of modified genes to living organisms in the natural world and expression of harmful polypeptides can be prevented.
The outline of the translation system according to the present invention is described. The translation system according to the present invention is a translation system to express a polypeptide having a predetermined amino acid sequence according to a code different from the universal genetic code and comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding the polypeptide by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine; and
aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound.
As used herein, the “polypeptide” refers to a molecule composed of amino acids linked by the peptide bond and is not particularly limited as long as the number of the amino acids thereof is 2 or more. The polypeptides include those called as proteins and peptides. As used herein, the “polypeptide having a predetermined amino acid sequence” refers to a polypeptide whose amino acid sequence is known and that is to be expressed in the translation system according to the present invention.
As used herein, “to express polypeptide” is, unless otherwise specified, used as a term including both transcription, by which mRNA is synthesized based on the sequence of DNA, and translation, by which polypeptide is synthesized based on the sequence of mRNA.
As used herein, the term “aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound” means aminoacyl-tRNAs, for example when codons for serine are interchanged with codons for leucine, aminoacyl-tRNAs having anticodons to the codons for leucine in the universal genetic code after the interchange and to which serine is bound, which is the amino acid encoded by the codons before the interchange. By using such aminoacyl-tRNAs, a polypeptide having the original amino acid sequence is expressed based on the template nucleic acid, in which the codons have been interchanged. Therefore, in this translation system, the relation between the codons and the amino acids will follow the genetic code different from the universal genetic code.
Moreover, the translation system according to the present invention may comprise, instead of “aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound” tRNAs or nucleic acids encoding the tRNAs having anticodons to codons after the interchange and recognized by aminoacyl-tRNA synthetases corresponding to the amino acids encoded by the codons before the interchange or nucleic acids encoding the tRNAs, the amino acids, and the aminoacyl-tRNA synthetases or nucleic acids encoding the aminoacyl-tRNA synthetases. With such a configuration, “aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound” are produced by the aminoacyl-tRNA synthetases in the translation system.
As used herein, the tRNA means transfer RNA. As used herein, the seryl-tRNA (tRNASer), the leucyl-tRNA (tRNALeu), and the alanyl-tRNA (tRNAAla) mean the natural tRNAs that transport serine, leucine, and alanine, respectively. Therefore, they have the anticodons to the codons for serine, leucine, and alanine, respectively.
As used herein, the “codon” means a sequence that specifies one amino acid, and consists of 3 nucleotides selected from T (thymine), C (cytosine), A (adenine), and G (guanine), in the case of DNA. In the codons in mRNA, T (thymine) is replaced with U (uracil).
As used herein, the “nucleic acid sequence encoding X” means a nucleic acid sequence encoding the amino acid sequence of X according to the universal genetic code. As used herein, the “nucleic acid” may be DNA or RNA and may include chimera of DNA and RNA and artificial nucleic acids, as long as an object of the present invention is achieved.
As used herein, the “universal genetic code” refers to the following genetic code in the case of DNA and genes of most species of organisms on the earth encode amino acid sequences according to this code. In the case of RNA, T is replaced with U. Hereinafter, the codons are described basically as DNA sequences, but in the case of RNA, T should be replaced with U.
As used herein, genes to be decoded according to the universal genetic code may be referred to as universal genes and tRNAs that decode mRNA according to the universal genetic code may be referred to as universal tRNAs.
Hereinafter, the first aspect wherein codons for serine and leucine are interchanged; the second aspect wherein codons for serine and alanine are interchanged; the third aspect wherein codons for leucine and alanine are interchanged; the fourth aspect wherein a codon for serine is exchanged with a codon for leucine, a codon for leucine is exchanged with a codon for alanine, and a codon for alanine is exchanged with a codon for serine; and the fifth aspect wherein a codon for serine is exchanged with a codon for alanine, a codon for alanine is exchanged with a codon for leucine, and a codon for leucine is exchanged with a codon for serine will each be described in detail.
The first aspect of the translation system according to the present invention comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons for serine and leucine.
In the universal genetic code, serine is encoded by TCT, TCC, TCA, TCG, AGT, or AGC. Meanwhile, leucine is encoded by TTA, TTG, CTT, CTC, CTA, or CTG. In a sequence modified from a nucleic acid sequence encoding a predetermined protein by interchanging codons for serine and leucine, TCT, TCC, TCA, TCG, AGT, or AGC is replaced with any of TTA, TTG, CTT, CTC, CTA, and CTG and TTA, TTG, CTT, CTC, CTA, or CTG is replaced with any of TCT, TCC, TCA, TCG, AGT, and AGC in the nucleic acid sequence encoding the protein.
Such a nucleic acid can be synthesized by a usual method of nucleic acid synthesis.
The first aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA. With such an aminoacyl-tRNA, a codon for leucine, that is to say, TTA, TTG, CTT, CTC, CTA, or CTG is translated to serine. Since a codon for serine and a codon for leucine are interchanged in the nucleic acid sequence of the first aspect as described above, the positions where the codon for serine was present before the interchange are thereby translated to serine.
In the case where codons for serine and leucine are interchanged in the present invention, it is not necessary that all codons are interchanged. For example, in a sequence in which some codons for leucine and serine are interchanged, AGT and AGC are replaced with TTA or TTG and TTA and TTG are replaced with any of AGT and AGC in a nucleic acid sequence encoding the protein. In this case, the correct protein can be translated by interchanging the anticodons of tRNASer and tRNALeu corresponding to these codons. This is true for the case where codons for serine and alanine, alanine and leucine, or serine, alanine, and leucine are interchanged.
The first aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA, a tRNA having an anticodon to a codon for leucine and is recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for leucine and recognized by a seryl-tRNA synthetase and the seryl-tRNA synthetase are expressed in the translation system and the tRNA and serine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA.
The “tRNA having an anticodon to a codon for leucine and recognized by a seryl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNASer into a sequence containing an anticodon to a codon for leucine. Only the sequence of the anticodon part in the anticodon arm may be modified or a part including a neighboring sequence may also be modified. For example, the anticodon loop of the tRNASer may be exchanged for the anticodon loop of the tRNALeu or the anticodon arm of the tRNASer may be exchanged for the anticodon arm of the tRNALeu. Since the seryl-tRNA synthetase, which binds serine to tRNASer, recognizes parts other than the anticodon arm of tRNASer, it can bind serine to modified tRNASers in which the sequence of the anticodon arm is modified.
The first aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA. With such an aminoacyl-tRNA, a codon for serine, that is, TCT, TCC, TCA, TCG, AGT, or AGC is translated to leucine. Since a codon for serine and a codon for leucine are interchanged in the nucleic acid sequence of the first aspect as described above, the positions where the codon for leucine was present before the interchange are thereby translated to leucine.
The first aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA, a tRNA having an anticodon to a codon for serine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA and a leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for serine and recognized by a leucyl-tRNA synthetase and the leucyl-tRNA synthetase are expressed in the translation system and the tRNA and leucine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA.
The “tRNA having an anticodon to a codon for serine and recognized by a leucyl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNALeu into a sequence containing an anticodon to a codon for serine. Only the sequence of the anticodon part in the anticodon arm and a neighboring sequence may be modified. For example, the anticodon loop of the tRNALeu may be exchanged for the anticodon loop of the tRNASer or the anticodon arm of the tRNALeu may be exchanged for the anticodon arm of the tRNASer. Since the leucyl-tRNA synthetase, which binds leucine to tRNALeu, also recognizes parts other than the anticodon arm of tRNALeu, it can bind leucine to modified tRNALeus in which the sequence of the anticodon arm is modified.
The second aspect of the translation system according to the present invention comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons for serine and alanine. In the universal genetic code, serine is encoded by TCT, TCC, TCA, TCG, AGT, or AGC and alanine is encoded by GCT, GCC, GCA, or GCG. Therefore, in a sequence modified from a nucleic acid sequence encoding a predetermined protein by interchanging codons for serine and alanine, TCT, TCC, TCA, TCG, AGT, and AGC are replaced with any of GCT, GCC, GCA, and GCG and GCT, GCC, GCA, and GCG are replaced with any of TCT, TCC, TCA, TCG, AGT, and AGC in the nucleic acid sequence encoding the protein.
The second aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA. With such an aminoacyl-tRNA, a codon for alanine is translated to serine. Since a codon for serine and a codon for alanine are interchanged in the nucleic acid sequence of the second aspect as described above, the positions where the codon for serine was present before the interchange are thereby translated to serine.
The second aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA, a tRNA having an anticodon to a codon for alanine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and the seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for alanine and is recognized by a seryl-tRNA synthetase and the seryl-tRNA synthetase are expressed in the translation system and the tRNA and serine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA.
The “tRNA having an anticodon to a codon for alanine and is recognized by a seryl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNASer into a sequence containing an anticodon to a codon for alanine. Only the sequence of the anticodon part in the anticodon arm may be modified or a part including a neighboring sequence may also be modified. For example, the anticodon loop of the tRNASer may be exchanged for the anticodon loop of the tRNAAla or the anticodon arm of the tRNASer may be exchanged for the anticodon arm of the tRNAAla.
The second aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA. With such an aminoacyl-tRNA, a codon for serine is translated to alanine. Since a codon for serine and a codon for alanine are interchanged in the nucleic acid sequence of the second aspect as described above, the positions where the codon for alanine was present before the interchange are thereby translated to alanine.
The second aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA, a tRNA having an anticodon to a codon for serine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA and the alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for serine and recognized by an alanyl-tRNA synthetase and the alanyl-tRNA synthetase are expressed in the translation system and the tRNA and alanine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA.
The “tRNA having an anticodon to a codon for serine and recognized by an alanyl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNAAla into a sequence containing an anticodon to a codon for serine. Only the sequence of the anticodon part in the anticodon arm and a neighboring sequence may be modified. For example, the anticodon loop of the tRNAAla may be exchanged for the anticodon loop of the tRNASer or the anticodon arm of the tRNAAla may be exchanged for the anticodon arm of the tRNASer. Since the alanyl-tRNA synthetase, which binds alanine to tRNAAla, also recognizes parts other than the anticodon arm of tRNAAla, it can bind alanine to modified tRNAAla in which the sequence of the anticodon arm is modified.
The third aspect of the translation system according to the present invention comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons for leucine and alanine. In the universal genetic code, leucine is encoded by TTA, TTG, CTT, CTC, CTA, or CTG and alanine is encoded by GCT, GCC, GCA, or GCG. Therefore, in a sequence modified from a nucleic acid sequence encoding a predetermined protein by interchanging codons for leucine and alanine, TTA, TTG, CTT, CTC, CTA, and CTG are replaced with any of GCT, GCC, GCA, and GCG and GCT, GCC, GCA, and GCG are replaced with any of TTA, TTG, CTT, CTC, CTA, and CTG in the nucleic acid sequence encoding the protein.
The third aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA. With such an aminoacyl-tRNA, a codon for alanine is translated to leucine. Since a codon for alanine and a codon for leucine are interchanged in the nucleic acid sequence of the third aspect as described above, the positions where the codon for leucine was present before the interchange are thereby translated to leucine.
The third aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA, a tRNA having an anticodon to a codon for alanine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and the leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for alanine and recognized by a leucyl-tRNA synthetase and the leucyl-tRNA synthetase are expressed in the translation system and the tRNA and leucine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA.
The “tRNA having an anticodon to a codon for alanine and recognized by a leucyl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNALeu into a sequence containing an anticodon to a codon for alanine. Only the sequence of the anticodon part in the anticodon arm may be modified or a part including a neighboring sequence may also be modified.
The third aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA. With such an aminoacyl-tRNA, a codon for leucine is translated to alanine. Since a codon for leucine and a codon for alanine are interchanged in the nucleic acid sequence of the third aspect as described above, the positions where the codon for alanine was present before the interchange are thereby translated to alanine.
The third aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA, a tRNA having an anticodon to a codon for leucine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and the alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for leucine and recognized by an alanyl-tRNA synthetase and the alanyl-tRNA synthetase are expressed in the translation system and the tRNA and alanine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA.
The “tRNA having an anticodon to a codon for leucine and recognized by an alanyl-tRNA synthetase” can be produced by modifying the sequence of the anticodon arm of the tRNAAla into a sequence containing an anticodon to a codon for leucine. Only the sequence of the anticodon part in the anticodon arm and a neighboring sequence may be modified. For example, the anticodon loop of the tRNAAla may be exchanged for the anticodon loop of the tRNALeu or the anticodon arm of the tRNAAla may be exchanged for the anticodon arm of the tRNALeu.
The fourth aspect of the “translation system for producing a polypeptide having a predetermined amino acid sequence” according to the present invention comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding the predetermined protein by exchanging a codon for serine with a codon for leucine, a codon for leucine with a codon for alanine, and a codon for alanine with a codon for serine. In the universal genetic code, serine is encoded by TCT, TCC, TCA, TCG, AGT, or AGC, leucine is encoded by TTA, TTG, CTT, CTC, CTA, or CTG, and alanine is encoded by GCT, GCC, GCA, or GCG. Therefore, in a sequence modified from a nucleic acid sequence encoding the predetermined protein by exchanging a codon for serine with a codon for leucine, a codon for leucine with a codon for alanine, and a codon for alanine with a codon for serine, TCT, TCC, TCA, TCG, AGT, and AGC are replaced with any of TTA, TTG, CTT, CTC, CTA, and CTG; TTA, TTG, CTT, CTC, CTA, and CTG are replaced with any of GCT, GCC, GCA, and GCG; GCT, GCC, GCA, and GCG are replaced with any of TCT, TCC, TCA, TCG, AGT, and AGC in the nucleic acid sequence encoding the protein.
The fourth aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA; an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA; and an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA. In this way, a codon for leucine is translated to serine, a codon for alanine is translated to leucine, and a codon for serine is translated to alanine. Since a codon for serine is exchanged with a codon for leucine, a codon for leucine is exchanged with a codon for alanine, and a codon for alanine is exchanged with a codon for serine in the fourth aspect as described above, the positions where the codon for serine was present before exchange are translated to serine, the positions where the codon for leucine was present are translated to leucine, and the positions where the codon for alanine was present are translated to alanine in this way to produce a predetermined polypeptide having an original amino acid sequence from the template nucleic acid by translation.
The forth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA, a tRNA having an anticodon to a codon for leucine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and a seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for leucine and recognized by a seryl-tRNA synthetase and the seryl-tRNA synthetase are expressed in the translation system and the tRNA and serine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and serine bound to the tRNA.
The fourth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA, a tRNA having an anticodon to a codon for alanine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and the leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for alanine and recognized by a leucyl-tRNA synthetase and the leucyl-tRNA synthetase are expressed in the translation system and the tRNA and leucine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and leucine bound to the tRNA.
The fourth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA, a tRNA having an anticodon to a codon for serine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA and the alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for serine and recognized by an alanyl-tRNA synthetase and the alanyl-tRNA synthetase are expressed in the translation system and the tRNA and alanine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and alanine bound to the tRNA.
The fifth aspect of the “translation system for producing a polypeptide having a predetermined amino acid sequence” according to the present invention comprises a template nucleic acid having a sequence modified from a nucleic acid sequence encoding the predetermined protein by exchanging a codon for serine with a codon for alanine, a codon for alanine with a codon for leucine, and a codon for leucine with a codon for serine. In the universal genetic code, serine is encoded by TCT, TCC, TCA, TCG, AGT, or AGC, alanine is encoded by GCT, GCC, GCA, or GCG, and leucine is encoded by TTA, TTG, CTT, CTC, CTA, or CTG. Therefore, in a sequence modified from a nucleic acid sequence encoding the predetermined protein by exchanging a codon for serine with a codon for alanine, a codon for alanine with a codon for leucine, and a codon for leucine with a codon for serine, TCT, TCC, TCA, TCG, AGT, and AGC are replaced with any of GCT, GCC, GCA, and GCG; GCT, GCC, GCA, and GCG are replaced with any of TTA, TTG, CTT, CTC, CTA, and CTG; TTA, TTG, CTT, CTC, CTA, and CTG are replaced with any of TCT, TCC, TCA, TCG, AGT, and AGC in the nucleic acid sequence encoding the protein.
The fifth aspect of the translation system according to the present invention comprises an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA; an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA; and an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA. In this way, a codon for alanine is translated to serine, a codon for leucine is translated to alanine, and a codon for serine is translated to leucine. Since a codon for serine is exchanged with a codon for alanine, a codon for alanine is exchanged with a codon for leucine, and a codon for leucine is exchanged with a codon for serine in the fifth aspect as described above, the positions where the codon for serine was present before exchange are translated to serine, the positions where the codon for leucine was present are translated to leucine, and the positions where the codon for alanine was present are translated to alanine in this way to produce a predetermined polypeptide having an original amino acid sequence from the template nucleic acid by translation.
The fifth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA, a tRNA having an anticodon to a codon for alanine and recognized by a seryl-tRNA synthetase or a nucleic acid encoding the tRNA, serine, and the seryl-tRNA synthetase or a nucleic acid encoding the seryl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for alanine and is recognized by a seryl-tRNA synthetase and the seryl-tRNA synthetase are expressed in the translation system and the tRNA and serine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for alanine and serine bound to the tRNA.
The fifth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA, a tRNA having an anticodon to a codon for leucine and recognized by an alanyl-tRNA synthetase or a nucleic acid encoding the tRNA, alanine, and the alanyl-tRNA synthetase or a nucleic acid encoding the alanyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for leucine and is recognized by an alanyl-tRNA synthetase and the alanyl-tRNA synthetase are expressed in the translation system and the tRNA and alanine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for leucine and alanine bound to the tRNA.
The fifth aspect of the translation system according to the present invention may comprise, instead of an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA, a tRNA having an anticodon to a codon for serine and recognized by a leucyl-tRNA synthetase or a nucleic acid encoding the tRNA, leucine, and the leucyl-tRNA synthetase or a nucleic acid encoding the leucyl-tRNA synthetase. In this way, a tRNA having an anticodon to a codon for serine and recognized by a leucyl-tRNA synthetase and the leucyl-tRNA synthetase are expressed in the translation system and the tRNA and leucine are bound by the reaction catalyzed by the enzyme to produce an aminoacyl-tRNA comprising a tRNA having an anticodon to a codon for serine and leucine bound to the tRNA.
The translation system according to the present invention may be a translation system using cells or a cell-free translation system or an in vivo translation system. Moreover, the present invention may be a kit comprising the translation system. The kit may comprise articles same as those in the translation system or may further comprise a reaction buffer, a reaction vessel, an instruction, or the like.
For example, the translation system according to the present invention may be a translation system using cultured cells derived from Escherichia coli, yeast, insect cells, or an animal. For example, in the case of Escherichia coli, a translation system can be obtained by replacing the existing tRNA genes in the genome with genes of tRNAs having anticodons to codons after the interchange and recognized by aminoacyl-tRNA synthetases corresponding to the amino acids encoded by the codons before the interchange. Aminoacyl-tRNA synthetases and amino acids to be bound to tRNA may be those that Escherichia coli originally has. To obtain a protein, Escherichia coli is transformed with an expression vector in which a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine is inserted.
When a translation system according to the present invention is a translation system using eukaryotic cells such as insect cells, a recombination baculovirus or a recombination adenovirus comprising a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine is produced according to a known method and the cells are infected with the virus. In this case, proteins encoded in the genome and tRNAs are necessary to be modified similarly according to the corresponding genetic code. The tRNAs are tRNAs having anticodons to codons after the interchange and recognized by aminoacyl-tRNA synthetases corresponding to the amino acids encoded by the codons before the interchange. A polypeptide having a predetermined amino acid sequence may be expressed by using other components such as aminoacyl-tRNA synthetases that the insect cells originally have.
When a translation system according to the present invention is a translation system using animal cells, (i) a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine is inserted into an expression vector and the expression vector is introduced into the animal cells by electroporation, the calcium phosphate method, lipofection, or the like. In this case, proteins encoded in the genome and tRNAs are necessary to be modified similarly according to the corresponding genetic code. The tRNAs are tRNAs having anticodons to codons after the interchange and recognized by aminoacyl-tRNA synthetases corresponding to the amino acids encoded by the codons before the interchange. A polypeptide having a predetermined amino acid sequence may be expressed by using other components such as aminoacyl-tRNA synthetases that the animal cells originally have.
When a cell-free translation system is used, for example, a known system using an Escherichia coli extract and a wheat germ extract may be used. Besides these, a rabbit erythrocyte extract and an insect cell extract may be used. In this case, a design to prevent an amino acid whose codon is exchanged from binding to the tRNAs that bind to the amino acid in the natural world. More specifically, the translation system of the first aspect is designed to prevent the aminoacyl-tRNAs in which serine is bound to tRNASer and the aminoacyl-tRNAs in which leucine is bound to tRNALeu from being contained in the translation system and from being produced in the translation system.
The translation system according to the present invention may be a reconstituted cell-free translation system. The reconstituted cell-free translation system is a translation system constructed by reconstitution involving purified ribosomal proteins, aminoacyl-tRNA synthetases, ribosomal RNAs, amino acids, rRNAs, GTP, ATP, translation initiation factors (IF), elongation factors (EF), release factors (RF) and a ribosome recycling factor (RRF) and other factors necessary for translation. To perform transcription from DNA as well, the system may be a system containing a RNA polymerase.
When the translation system according to the present invention is a reconstituted cell-free translation system, (i) a template nucleic acid having a sequence modified from a nucleic acid sequence encoding a polypeptide having a predetermined amino acid sequence by interchanging codons encoding at least 2 amino acids selected from serine, leucine, and alanine; and (ii) aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound are added to the translation system. Instead of (ii) aminoacyl-tRNAs having anticodons to codons after the interchange and to which the amino acids encoded by the codons before the interchange are bound, (iii) tRNAs having anticodons to codons after the interchange and recognized by aminoacyl-tRNA synthetases corresponding to the amino acids encoded by the codons before the interchange or nucleic acids encoding the tRNAs, the amino acids, and the aminoacyl-tRNA synthetases or nucleic acids encoding the aminoacyl-tRNA synthetases may be added.
Other amino acids may be added to the translation system as aminoacyl-tRNAs in which the amino acids are bound to the tRNAs to which the amino acids are bound in the natural world or the tRNAs corresponding to the amino acids, to which the amino acids are bound in the natural world, or nucleic acids encoding the tRNAs, aminoacyl-tRNA synthetases or nucleic acids encoding the aminoacyl-tRNA synthetases, and the amino acids may be added to the translation system. In the latter case, amino acids and corresponding tRNAs are bound by aminoacyl-tRNA synthetases to produce aminoacyl-tRNAs.
In the case of a reconstituted cell-free translation system, it is possible to prevent the system from containing unnecessary components depending on the purpose and therefore, for example in the translation system of the first aspect, tRNASer and tRNALeu can be excluded from the beginning. In this way, translation into proteins in which serine and leucine are interchanged according to the interchanged codons can be prevented.
When a translation system according to the present invention is a cell-free translation system, the amino acids other than those for which codons are interchanged are translated according to the universal genetic code as usual and serine, leucine and/or alanine are encoded by interchanged codons by incubating the system under the conditions that allows the expression of protein and as a result, a predetermined protein are produced by translation as the original amino acid sequence.
The translation system using cells or cell-free translation system according to the present invention allows safe expression of proteins (for example, antibiotics resistance enzymes and toxic proteins) of which gene flow into the natural world is feared.
The translation system according to the present invention may be an in vivo translation system. For example, J. Craig Venter et al. generated a bacterium having a chemical synthesized genome. Specifically, the genome of Mycoplasma mycoides consisting of 106 base pairs is subdivided and chemically synthesized, and ligated using yeast to generate the genome (Gibson D G et al., Science Vol. 329, p. 52-56 (2010)). Interchanging at least 2 codons selected from serine, leucine, and alanine on the genome and placing genes of tRNAs in which the anticodon is modified on the genome in this method would allow the generation of a living organism (a microorganism such as microbes, a plant, or an animal) according to a new genetic code.
Living organisms having the translation system according to the present invention are safe artificial living organisms that do not exchange any genes with other living organisms on the earth. Application of such living organisms to, for example, genetically modified living organisms such as genetically modified microorganisms and genetically modified plants prevents transfer of modified genes to plants in the natural world to express harmful polypeptides and transfer of a harmful gene (for example, an infectious virus) into the genetically modified living organisms to express a harmful polypeptide.
The disclosure of all Patent Literature and Non-Patent Literature cited herein is incorporated herein by reference in their entirety.
The present invention will be specifically described based on Examples, but the present invention is not limited to these by any means. Those skilled in the art can modify the present invention to various aspects without departing from the spirit of the present invention and such modifications are also within the scope of the present invention.
1. Outline of Design of Genetic Code and tRNA
An outline of new genetic code 1 and 2 used in Examples are illustrated in
In new genetic code 1 (
In new genetic code 2 (
Creatine kinase, creatine phosphatase, and Escherichia coli tRNAs were purchased from Roche Diagnostics. Myokinase was purchased from Sigma-Aldrich Co. LLC. Ribosomes were purified from E. coli strain A19 by using a method following the method of Clemons et al. (Clemons, W. M. et al., J. Mol. Biol., 310, 827-843, doi:10.1006/jmbi.2001.4778 (2001).). Compounds and proteins for translation were prepared according to known methods (Josephson, K., Hartman, M. C. T. & Szostak, J. W., J. Am. Chem. Soc. 127, 11727-11735, doi:10.1021/ja0515809 (2005).; Baggott, J. E., Biochem. J. 308, 1031-1036 (1995).; Shimizu, Y. et al. Nat. Biotechnol. 19, 751-755, doi:10.1038/90802 (2001).). These translation factors were stored according to the method of Ishizawa (Ishizawa, T., Kawakami, T., Reid, P. C. & Murakami, H., J. Am. Chem. Soc., 135, 5433-5440 (2013).) and used at final concentrations as set forth in the table below.
Escherichia coli (E. coli Rosetta 2) was transformed with the plasmid pET21a containing the gene of C5 protein by a conventional method. The transformed Escherichia coli was suspended on an agar plate containing 100 μg/mL ampicillin, 20 μg/mL chloramphenicol and 1% (w/v) glucose and incubated at 37° C. overnight. The culture of Escherichia coli was grown until an OD590 of over approximately 2.0 in LB medium (40 mL) containing 100 μg/mL ampicillin, 20 μg/mL chloramphenicol, and 1% (w/v) glucose at 37° C. 20 mL of the Escherichia coli culture was added to LB medium (500 mL) containing 100 μg/mL ampicillin, 20 μg/mL chloramphenicol, and 1% (w/v) glucose and grown until an OD590 of 0.79 at 37° C. 0.5 mM IPTG was added to induce the expression of C5 protein and incubated at 25° C. overnight. Cells were collected by centrifugation at 9000 rpm at 4° C. for 2 minutes, and resuspended in 70 mL of B1 buffer (300 mM KCl, 50 mM Tris-HCl pH 8.0, 5 mM imidazole pH 7.5, 0.2 mM DTT, 10 mM MgCl2, 1 M NH4Cl, 0.25% (v/v) Tween 20) containing 12 mg of phenylmethylsulfonyl fluoride dissolved in 600 μL of ethanol. Cells were disrupted by 6-minute sonication and disrupted cells were centrifuged at 4° C. at 13,000 rpm, for 15 minutes. The supernatant was collected, 21 g of (NH4)2SO4 was added thereto, and the mixture was centrifuged at 4° C. at 13,000 rpm, for 5 minutes. The precipitate was dissolved in 10 mL of B1 buffer. The C5 protein was purified with a Ni-affinity column and eluted with the elution buffer (300 mM KCl, 20 mM Hepes-K pH 7.5, 250 mM imidazole pH 7.5, 0.2 mM DTT, 10 mM MgCl2, 1 M NH4Cl, 0.25% (v/v) Tween 20). The concentration of C5 protein was measured by the absorbance at 280 nm and then the C5 protein was stored in 20% glycerol at −80° C.
2-3. Preparation of tRNA
The elongation (10 μL) was performed under the following conditions: 1×PCR buffer (10 mM Tris-HCl pH 8.4, 100 mM KCl, 0.1% (v/v) Triton X-100, 2 mM MgSO4, 0.2 mM each of dNTPs (dATP, dTTP, dGTP, dCTP)), 1 μM each of eX.F primers, 1 μM each of eX.R primers, 1.6 nM Phusion DNA polymerase (see Table below).
The aforementioned solution was heated at 94° C. for 3 minutes and then elongation was performed by repeating 5 cycles of 50° C. for 1 minute and 72° C. for 2 minutes.
To prepare pre-tDNAGln, pre-tDNAPro, and pre-tDNATrp, the first PCR (800 μL) was performed under the following conditions: 1×PCR buffer, 0.5 μM T7pro-leader.F36, 0.5 μM each of PCR1.R primers, 1.6 nM Phusion DNA polymerase, 2.5% (v/v) of the extension mixture (see Table above). To prepare other tDNAs, the first PCR (200 μL) was performed under the following conditions: 1×PCR buffer, 0.5 μM T7ex5.F22, 0.5 μM each of PCR1.R primers, 1.6 nM Phusion DNA polymerase, 2.5% (v/v) of the extension mixture.
The aforementioned solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 12 cycles of 94° C. for 20 seconds, 50° C. for 20 seconds, and 72° C. for 30 seconds.
The second PCR (3.2 mL of tDNALys, 2.4 mL of pre-tDNAGln, pre-tDNAPro, pre-tDNATrp, 1.6 mL of other tDNAs) was performed under the following conditions: 1×PCR buffer, 1 μM T7ex5.F22, 1 μM each of PCR2.R primers, 1.6 nM Phusion DNA polymerase, 0.5% (v/v) of the first PCR solution (see Tables 3 and 4).
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 12 cycles of 94° C. for 20 seconds, 50° C. for 20 seconds, and 72° C. for 30 seconds.
In vitro transcription (tRNALys: 24 mL, pre-tRNAGln: 12 mL, pre-tRNAPro, pre-tRNATrp and other tRNAs: 8 mL) was performed under the following conditions: 1×T7 buffer (40 mM Tris-HCl pH 8.0, 1 mM spermidine, 5 mM KCl, 0.01% (v/v) Triton X-100, 10 mM DTT), 22.5 mM MgCl2, 3.25 mM each of NTPs (ATP, TTP, GTP, CTP), 5 mM GMP, 0.3 μM T7 RNA polymerase, about 20% (v/v) of the second PCR solution. The reaction solution was incubated at 37° C. for 12 hours. pre-tRNAGln pre-tRNAPro, and pre-tRNATrp were cleaved with RNase P (see below). Other tRNAs were mixed with 16 μL of DNase I (10 u/μL) and 16 μL of 3 M MgCl2) and the mixtures were incubated at 37° C. for 36 hours. After the phenol/chloroform extraction, 400 μL of 0.5 M EDTA pH 8.0 and 800 μL of 3 M NaCl were added to the solution. 8 mL of 2-propanol was added to precipitate the tRNA. The precipitate was dissolved in 10 mL of 0.3 M NaCl and 8 mL of 2-propanol was added to precipitate the tRNA. After washing with 70% ethanol, the tRNA was dissolved in 500 μL of ultrapure water. The concentration of each tRNA was measured by the absorbance at 260 nm. All sequences of the in vitro transcribed tRNAs are set forth in Table below.
The first PCR (50 μL) was performed under the following conditions: 1×KOD-Plus-PCR Buffer (TOYOBO), 1.5 mM MgSO4, 0.2 mM each of dNTPs, 0.25 μM pGEM-Tseq.R20, 0.25 μM M1RNA.R22, 8 ng of M1 RNA template plasmid, 0.05 U/μL KOD DNA polymerase (TOYOBO).
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 25 cycles of 94° C. for 15 seconds, 55° C. for 30 seconds, 68° C. for 1 minute.
The second PCR (1.6 mL) was performed under the following conditions; 1×PCR buffer, 0.5 μM T7ex5.F22, 0.5 μM M1RNA.R22, 1.6 nM Phusion DNA polymerase.
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 25 cycles of 94° C. for 30 seconds, 50° C. for 30 seconds, and 72° C. for 1 min.
In vitro transcription (8 mL) was performed under the following conditions: 1×T7 buffer, 30 mM MgCl2, 20% (v/v) of the second PCR solution, 0.3 μM T7 RNA polymerase.
M1 RNA was purified by a method similar to that of purification of tRNA. The concentration of M1 RNA was measured by the absorbance at 260 nm.
2-7. Cleavage of Pre-tRNA with RNase P
M1 RNA was refolded under the following conditions: 7.5 μM M1 RNA, 1×T7 buffer, 100 mM NH4Cl.
Solution was heated at 94° C. for 1 minute and cooled to 37° C. at 0.5° C./s and incubated it for 5 minutes. 5 mM MgCl2 and 5 μM C5 protein were added thereto and then the solution was incubated at 25° C. for 5 minutes. 1080 μL of the obtained RNase P solution, 500 μL of 3 M NH4Cl, 30 μL of 3 M MgCl2, and 240 μL of DNase I (1 u/μL, Roche) were added to 12 mL of the pre-tRNAGln, pre-tRNAPro, or pre-tRNATrp solution. The solution was incubated at 37° C. for 12 hours. The tRNA was purified by the method described above. The tRNA concentration was measured by the absorbance at 260 nm.
The DNA template for peptide expression was prepared by the following method. Elongation (5 μL) was performed under the following conditions: 1×KOD PCR Buffer (TOYOBO), 1.5 mM MgSO4, 0.2 mM each dNTP, 1 μM T7esD6MYYY.F55, 1 μM MYYYxDDuaaAS.R38 [5′-CGAAG CTTAG TCGTC X GTAGT AGTAC ATGTT TTTCT-3′; (x, X)=(gcu, AGC), (cgu, ACG), (aau, ATT), (gau, ATC), (ugu, ACA), (cag, CTG), (gag, CTC), (ggu, ACC), (cau, ATG), (auu, AAT), (cuu, AAG), (aag, CTT), (aug, CAT), (uuu, AAA), (ccu, AGG), (ucu, AGA) (acu, AGU), (ugg, CCA), (uau, ATA), or (guu, AAC)] (see Tables below), 0.05 U/μL KOD DNA polymerase.
The elongation was performed by heating the aforementioned solution at 94° C. for 3 minutes and then repeating 5 cycles of 50° C. for 1 minute and 68° C. for 1 minute.
PCR (200 μL) was performed under the following conditions: 10 mM Tris-HCl pH 8.4, 50 mM KCl, 0.1% (v/v) Triton X-100, 2 mM MgCl2, 0.25 mM each of dNTPs, 0.5 μM T7ex5.F22, 0.5 μM each MYYYxDDuaaAS.R38, 0.05% (v/v) elongated DNA, 1% (v/v) Taq DNA polymerase.
DNA was amplified by repeating 12 cycles of 94° C. for 40 seconds, 60° C. for 40 seconds, and 72° C. for 40 seconds. After the phenol/chloroform extraction, the DNA product was collected by ethanol precipitation and dissolved in 20 μL of ultrapure water.
Streptavidin gene was synthesized by Eurofins Genomics. According to the corresponding genetic code, genes were each named CGC.stv, NGC1.stv, and NGC2.stv.
Streptavidin gene sequences are illustrated in
The first PCR (5 μL) was performed under the following conditions: 1×KOD-Plus-PCR Buffer, 1.5 mM MgSO4, 0.2 mM each of dNTPs, 2 ng of CGC.stv, NGC1.stv, or NGC2.stv template plasmid, 1 μM T7g10M.F48, 1 μM CGCStv.R20, NGC1Stv.R20, or NGC2Stv.R20, 0.05 U/μL of KOD DNA polymerase.
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 15 cycles of 94° C. for 20 seconds, 55° C. for 30 seconds, and 68° C. for 1 min.
The second PCR (10 μL) was performed under the following conditions; 1×KOD-Plus-PCR Buffer, 1.5 mM MgSO4, 0.2 mM each of dNTPs, 0.025% (v/v) of the first PCR solution, 1 μM T7g10.F26, 1 μM CGCStv.R20, NGC1Stv.R20, or NGC2Stv.R20, 0.05 U/μL of KOD DNA polymerase.
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 15 cycles of 94° C. for 20 seconds, 55° C. for 30 seconds, and 68° C. for 1 min.
The third PCR (8 mL) was performed under the following conditions: 1×PCR buffer, 0.05% (v/v) of the second PCR solution, 1 μM T7g10.F26, 1 μM CGCStv.R20, NGC1Stv.R20, or NGC2Stv.R20, 1.6 nM Phusion DNA polymerase.
This solution was heated at 94° C. for 3 minutes and then DNA was amplified by repeating 12 cycles of 94° C. for 20 seconds, 55° C. for 30 seconds, and 72° C. for 1 min. After the phenol/chloroform extraction, the DNA product was collected by 2-propanol precipitation and dissolved in 200 μL of ultrapure water.
2-10. Analysis of Activity and Accuracy of Native and in Vitro Transcribed tRNA
The reconstituted cell-free translation system that does not contain 20 proteinous amino acids was prepared according to a known method. In vitro transcribed tRNAfMet, tRNAAsp, tRNATyr, and tRNAXaa were mixed and diluted 5 times. After heating at 95° C. for 3 minutes, the tRNAs were precipitated by ethanol precipitation and dissolved in ultrapure water.
Translation (1 μL) was performed under the following conditions: 20% (v/v) of DNA template solution, 250 μM each of 19 proteinous amino acids (other than Asp), 50 μM [14C]-Asp and 1.5 μ/μL native tRNA or 12 μM each of in vitro transcribed tRNAfMet, tRNAAsp, tRNATyr, and tRNAXaa.
In the analysis of the accuracy of translation, the amino acid to which the x codon is assigned was removed from the DNA template in each reaction solution. The concentrations of other components are set forth in Table 2 above.
The solution was incubated at 37° C. for 2 hours. The obtained product was analyzed by Tricine-SDS-PAGE and autoradiography (FLA-5100, Fuji). Moreover, a similar experiment was conducted by using Asp instead of [14C]-Asp and the obtained product was measured according to a known method with autoflex II (BRUKER DALTONICS).
For the universal genetic code, new genetic code 1, and new genetic code 2, 21 in vitro transcribed tRNAs were prepared. In vitro transcription (1.5 μL) was performed under the following conditions: 20% (v/v) streptavidin gene, 250 μM each of 19 proteinous amino acids (other than Asp), 50 μM [14C]-Asp, Asp, and 1.5 μg/μL native tRNA or 12 μM each of in vitro transcribed tRNAs.
The solution was incubated at 37° C. for 2 hours. The obtained product was analyzed by Tricine-SDS-PAGE and autoradiography. Moreover, a similar experiment was conducted by using Asp instead of [14C]-Asp. Asp. The obtained product was diluted 20 times with TBST (25 mM Tris-HCl pH 7.5, 150 mM NaCl, 0.1% Tween 20) containing 0.01% (w/v) BSA. 2 μL of the product after the dilution was spotted (triplicate) on a nitrocellulose membrane (GE Healthcare UK Ltd.). The membrane was blocked with TBST containing 0.2% (w/v) BSA for 10 minutes and then labeling with 15 μg of biotin conjugated horseradish peroxidase (Invitrogen, ultrafiltered three times) in TBST containing 0.2% (w/v) BSA was conducted for 30 minutes. The membrane was washed three times with TBST for 5 minutes. To detect the streptavidin bound to the biotin, the nitrocellulose membrane was incubated with Luminata™ Western HRP Substrate (MILLIPORE). The fluorescence was measured using Ez-Capture MG (ATTO). The fluorescence intensity of each spot was analyzed using Image J.
3-1. Preparation of In Vitro Transcribed tRNAs and Evaluation of their Activity in Translation
tRNAs (Table 5) corresponding to 20 proteinous amino acids were prepared by in vitro transcription. Moreover, 3 tRNAs (tRNATrp, tRNAGln, tRNAPro) were prepared with an additional RNase P13 treatment since bases at position 1 and 72 in these tRNAs are involved in the recognition by the aminoacyl-tRNA synthetases. As a general rule, the in vitro transcribed tRNAs were designed based on the sequences of the native tRNAs in the Escherichia coli strain K12 and some of the tRNAs were mutated as described below. Since the thiouridine derivative of U34 in tRNALysUUU and tRNAGluUUC is important for translation, the wobble position 34 in tRNALysUUU and tRNAGluUUC was changed from U to C. This is known to markedly reduce the kcat/KM of aminoacylation but restore the translation activity of the right codon. Moreover, by mutating 1U72A of tRNAAsnGUU and 1A72U of tRNAIleGAU into 1G72C and 1C of tRNAfMetCAU into 1G, these tRNAs were prepared by in vitro transcription without engineering RNase P. Furthermore, tRNAArgGCG was prepared by mutating A34 of tRNAArgACG into G34. To construct translation systems using codon-shuffled genetic codes 1 and 2 set forth in
To examine the ability of these in vitro transcribed tRNAs to decode right codons in the reconstituted translation system, the following peptide expression assay was performed: in vitro transcribed tRNAfMetCAU, tRNAAspGUC, tRNATyrGUA, tRNAXaa and a peptide template 5′-UTR-ATG-(TAC)3-NNN-(GAC)2-TAA-UTR-3′ (UTR denotes a untranslated region and Xaa denotes the amino acid corresponding to the NNN codon) (
Except the case where a template in which the position of NNN is the TAC codon is used, a single band was found in the gel. In the case of the TAC codon, in addition to an expected main band, a small band, which is considered to be generated by an error of tRNATyrGUA in decoding the UAA stop codon, was found. More importantly, the mobility of each peptide was the same as the peptide translated with native tRNAs (
To examine ability of chimeric tRNAs to translate, similar experiments were conducted using chimeric tRNALeuGGA tRNASerGAG, tRNAAlaGAG, tRNASerGCC and the corresponding DNA templates. From the result of Tricine-SDS-PAGE and the mass spectrometric result in MALDI-TOF-MS, it was indicated that the expected peptides were expressed in all translation reactions (
3-2. Accuracy of Translation with the Native tRNA Set and with In Vitro Transcribed tRNA Sets
Since the bases of the in vitro transcribed tRNAs are not modified, decrease in accuracy of translation with the in vitro transcribed tRNAs in comparison with the native tRNAs was feared. To confirm the possibility, the error of the native tRNA set and in vitro transcribed tRNA sets in decoding was examined.
First, for the translation using the universal genetic code, 20 in vitro transcribed tRNAs corresponding to the 20 proteinous amino acids and tRNAfMet were used in mixture as an in vitro transcribed tRNA set. As the native tRNA set, tRNA extracted from E. coli MRE600 was used. These RNA sets were added to a reconstituted translation system without tRNA and peptide translation was performed in the absence of Xaa corresponding to the NNN codon (
In the result, expression of some peptides was found with the native tRNAs. This indicates that the native tRNAs make translation errors from some codons in this extreme condition. From the mobility of these bands and the result of mass spectrometry, it was found that Ser was incorporated instead of Ala by mistake, Met was incorporated instead of Pro by mistake, and Asn was incorporated instead of Val by mistake (data not shown). These results suggested that tRNASerGGA makes error in translating GCU, tRNAMetCAU makes error in translating CCU, and tRNAAsnGUU makes error in translating GUU.
Therefore, the accuracy of translation with in vitro transcribed tRNA sets was then examined using the same assay system. The result of Tricine-SDS-PAGE analysis indicated that translation error of some codons occurs at the same rate as that with the native tRNA set (
From the foregoing results, it was concluded that the in vitro transcribed tRNA sets have a similar accuracy to that of the native tRNAs and can be used in protein expression.
Streptavidin was chosen as a model protein and a gene encoding streptavidin according to the universal genetic code was constructed.
As control, translation using the native tRNAs and [14C]-Asp Asp was performed in the presence or absence of the universal gene. By Tricine-SDS-PAGE analysis, it was revealed that the streptavidin is expressed in the presence of the universal gene (
The biotin binding activity of the streptavidin was examined by also conducting the dot blot assay using biotin conjugated horseradish peroxidase (biotin-HRP). In the assay, the binding activity was found in the presence of the gene (
Next, the universal streptavidin gene was translated using the universal in vitro transcribed tRNA set (the universal tRNA set, tRNA No. 0 to 20 in Table 5). In comparison with the case with the native tRNA set, the production rate of the streptavidin is decreased by approximately 70% (
To decode codon-shuffled genetic code 1 (CS1 tRNA set), an in vitro transcribed tRNA set using tRNALeuGGA (tRNA No. 21) and tRNASerGAG (tRNA No. 22) instead of tRNALeuGAG (tRNA No. 11 in Supplementary Table 1) and tRNASerGGA (tRNA No. 16) was then prepared.
Since the UCU and UCC codons were assigned to Leu and the CUU and CUC codons were assigned to Ser in the translation system using the CS1 tRNA set (
By the translation of the CS1 gene using the CS1 tRNA set in the translation system, the streptavidin was produced in an amount at the same level as the amount in the case with the universal gene and the universal tRNA set (
Furthermore, an in vitro transcribed tRNA set (CS2 tRNA set) for codon-shuffled genetic code 2 was prepared using tRNAAlaGAG (tRNA No. 23), tRNALeuGGA (tRNA No. 21), and tRNASerGGC (tRNA No. 24) instead of tRNAAlaGGC (tRNA No. 1), tRNALeuGAG (tRNA No. 11) and tRNASerGGA (tRNA No. 16).
In this translation system using the CS2 tRNA set, the CUU and CUC codons were assigned to Ala, the UCU and UCC codons were assigned to Leu, and the GCU and GCC codons were assigned to Ser (
By the translation of the CS2 gene in the translation system using the CS2 tRNA set, the streptavidin was produced in an amount equal to approximately one-third of the amount in the case with the universal gene and the universal tRNA set (
One of the most important aspects of the present invention is orthogonality of genes encoded by codon-shuffled genetic codes. To demonstrate orthogonality of codon-shuffled genetic code 1 to the universal genetic code, a CS1 gene was translated in a translation system using the universal tRNA set. By the Tricine-SDS-PAGE analysis, it was indicated that the protein was expressed from the CS1 gene (
Furthermore, the CS2 gene was similarly translated using the universal tRNA set to examine the orthogonality of codon-shuffled genetic code 2 to the universal genetic code. Since the streptavidin gene has 25 codons that specify Ala, the produced protein should have 47 amino acid substitutions in total (
From the foregoing results, it was demonstrated that the orthogonal expression of CS1 and CS2 genes was obtained, that is, genetic firewall was formed (
Number | Date | Country | Kind |
---|---|---|---|
2015-191730 | Sep 2015 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2016/078921 | 9/29/2016 | WO | 00 |