Novel thermophilic proteins and the nucleic acids encoding them

Information

  • Patent Application
  • 20070202508
  • Publication Number
    20070202508
  • Date Filed
    February 24, 2006
    18 years ago
  • Date Published
    August 30, 2007
    17 years ago
Abstract
The disclosed invention relates to the fields of molecular biology and biochemistry. Thermophilic proteins and the nucleic acids encoding them are disclosed. The thermophilic proteins are from, or derived from, a bacteriophage, YS40, that infects the thermophilic bacterium Thermus thermophilus. These proteins have enhances stability, particularly at high temperatures.
Description

BRIEF DESCRIPTION OF DRAWINGS

The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.



FIG. 1 shows single and multiround transcription at the T7A1-tR′ and galP-tR′ promoters. Two transcriptionally competent open model promoters from different classes, −10/−35 class (T7A1) and extended—10 class (galP1), attached to a rho-independent terminator were used to qualitatively visualize efficiency of run-off transcription. Reaction 1, in 20 μl of transcription buffer (30 mM tris-HCl, pH 8.0, 10 mM MgC12, 40 mM KCl, 1 mM β-mercaptoethanol), contained core enzyme, sigma (σ) and Ys18. Reaction 1 was incubated at 65° C. (for T. th) and 37° C. (for E. coli) for 10 minutes, followed by the addition of promoter DNA fragments. Reaction 2, in 10 μl of transcription buffer, contained core enzyme and sigma, and, in parallel, Ys18 and a promoter DNA fragment. Reaction 2 was incubated at 65° C. (for T. th) and 37° C. (for E. coli) for 10 minutes and then mixed together. For both reactions 1 and 2, after 10 minutes of incubation at the same temperatures, 200 μM ATP, CTP, UTP, 20 μM GTP and 10 μCi of [α-32P] GTP were added, the reactions were incubated for the next 10 minutes, and terminated by an equal volume of 9 M urea loading buffer. In E. coli, if during the transcription reaction heparin is not added with the nucleotides, the RNA polymerase is able to initiate transcription from the promoter repeatedly (multi-round transcription). If heparin is added, the RNA polymerase is only able to transcribe once (single-round transcription). The open complexes formed by T. th RNA polymerase at the promoters used are very sensitive to heparin so single-round transcription cannot be performed.



FIG. 1A shows multi-round transcriptional inhibition by Ys18 of both T7A1-tR′ and galP-tR′ with the T. th RNA polymerase core and σA. With both promoters, increasing amounts of Ys18 (triangle) repressed transcription in a dose dependent manner (run-off bands) when the components were mixed as in Reaction 1 (order 1). When the components were mixed as in Reaction 2, Ys18 did not significantly affect transcription of promoter T7A1-tR′, but repressed transcription of promoter galP-tR′. The similar change in band intensity of run-off bands and terminator tR′ bands indicated that Ys18 did not affect elongation or termination.



FIG. 1B shows single-round (+heparin) and multi-round (−heparin) transcriptional inhibition by Ys18 of both T7A1-tR′ and galP-tR′ with the E. coli RNA polymerase core and σ70. With both promoters, increasing amounts of Ys18 (triangle) repressed transcription in a dose dependent manner (run-off bands). In the presence of heparin, when the components were mixed as in Reaction 1 (order 1), Ys18 was not as active in repressing transcription as when the components were mixed as in Reaction 2 (order2). In the absence of heparin, Ys18 was more active in repressing transcription when the components were mixed as in Reaction 2 (order 2) with the T7A1-tR′ promoter. There was only a slight difference in repressing activity of Ys18 in the absence of heparin between the two reaction mixtures. The similar change in band intensity of run-off bands and terminator tR′ bands indicated that Ys18 did not affect elongation or termination.



FIG. 1C shows the relative transcriptional activity of the run-off assay in graphical form. Transcriptional activity without Ys18 present is 100% (dark bar). The addition of Ys18 in incremental amounts represses transcriptional activity in a dose dependent manner (light bar). The amount of transcriptional repression by Ys18 presences differs with amount of Ys18 added, promoter type, RNA polymerase core, reaction mixture and presence or absence of heparin.



FIG. 2 shows native binding experiments with histidine-tagged phage protein Ys18 and primary sigma factors from T. thermophilus and E. coli. Reactions, containing corresponding proteins in 20 μl of binding buffer (20 mM tris HCl, pH8.0, 0.5 M NaCl, 2 mM imidazole, 5% v/v glycerol), were preincubated for 10′ at 65° C. (for T. th σA) and 37° C. (E. coli σ70). The binding mixtures were then added to Ni—NTA agarose beads equilibrated in the binding buffer. Reactions were incubated for 10′ at room temperature. The agarose beads were pelleted by quick centrifugation and the unbound proteins were withdrawn. The beads were washed 3 times with the binding buffer containing 20 mM imidazole, and the bound proteins were eluted with the binding buffer containing 200 mM imidazole. Fractions were resolved by SDS-PAGE and stained by Coomassie (L=proteins loaded, U=proteins unbound, B=proteins bound to Ni—NTA agarose).



FIG. 2A shows Ys18HIS bound to the primary sigma factor from T. thermophilus A). With both Ys18HIS and σA present in the sample (+lane, L), σA was detected in the unbound (+lane, U) and the bound fractions (+lane, B). In the absence of Ys18HIS (−lane, L), σA was exclusively observed in the unbound fraction (−lane, U). σA cannot bind to the Ni—NTA agarose beads without Ys18HIS, indicating Ys18HIS was capable of binding to σA.



FIG. 2B shows Ys18HIS bound to the primary sigma factor from E. coli 70). When both Ys18HIS and σ70 were present in the sample (+lane, L), σ70 was detected in the unbound (+lane, U) and the bound fractions (+lane, B). In the absence of Ys18HIS (−lane, L), σ70 was exclusively observed in the unbound fraction (−lane, U). σ70 cannot bind to the Ni—NTA agarose beads without Ys18HIS, indicating Ys18HIS was capable of binding to σ70.



FIG. 2C shows Ys18HIS bound to the primary sigma factor from E. coli lacking region 4 (σ701-549). When both Ys18HIS and σ701-549 were present in the sample (+lane, L), σ701-549 was detected in the unbound (+lane, U) and the bound fractions (+lane, B). In the absence of Ys18HIS (−lane, L), σ701-549 was exclusively observed in the unbound fraction (−lane, U). σ701-549 cannot bind to the Ni—NTA agarose beads without Ys18HIS, indicating Ys18HIS was capable of binding to σ701-549 in a region other than region 4.


Claims
  • 1. An isolated protein, including conservatively modified variants thereof, comprising a thermophilic amino acid sequence at least 75% identical to a YS40 amino acid sequence encoded by at least 25 contiguous codons of a YS40 coding sequence selected from the group consisting of SEQ ID NO: 1-170.
  • 2. The protein of claim 1, wherein the thermophilic amino acid sequence is identical to the YS40 amino acid sequence.
  • 3. The protein of claim 1, wherein the YS40 amino acid sequence is encoded by at least 50 contiguous codons of the YS40 coding sequence.
  • 4. The protein of claim 1, wherein the YS40 amino acid sequence is encoded by at least 100 contiguous codons of the YS40 coding sequence.
  • 5. The protein of claim 1, wherein the YS40 coding sequence is from a YS40 structural protein.
  • 6. The protein of claim 5, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 1, 3, 65, 69, 71, 151 and 152.
  • 7. The protein of claim 1, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 2, 4-64, 70-149, 151 and 153-170.
  • 8. The protein of claim 7, wherein the thermophilic amino acid sequence confers to the protein, at a permissible temperature of about 36° C., an enzymatic activity selected from the group consisting of decarboxylase, nuclease, synthase, recombinase, helicase, dehydrogenase, reductase, nucleotide primase, kinase, protease, nucleotidyltransferase, nucleic acid polymerase, deaminase, acyltransferase, terminase, helicase, glycosyltransferase and peptidase.
  • 9. The protein of claim 8, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 5, 8, 9, 12-14, 17, 18, 23-27, 29, 33, 38, 41, 42, 52, 57, 59, 60, 62, 71, 79, 114, 144 and 161.
  • 10. The protein of claim 8, wherein the YS40 coding sequence is SEQ ID NO: 33, and the enzymatic activity is DNA polymerase.
  • 11. The protein of claim 7, wherein the permissible temperature is at least 45° C.
  • 12. The protein of claim 7, wherein the permissible temperature is at least 55° C.
  • 13. The protein of claim 7, wherein the permissible temperature is at least 65° C.
  • 14. The protein of claim 7, wherein the permissible temperature is at least 75° C.
  • 15. An isolated nucleic acid encoding the protein of claim 1.
  • 16. The nucleic acid of claim 15, wherein the thermophilic amino acid sequence of the encoded protein is identical to the YS40 amino acid sequence.
  • 17. The nucleic acid of claim 15, wherein the YS40 amino acid sequence is encoded by at least 50 contiguous codons of the YS40 coding sequence.
  • 18. The nucleic acid of claim 15, wherein the YS40 amino acid sequence is encoded by at least 100 contiguous codons of the YS40 coding sequence.
  • 19. The nucleic acid of claim 15, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 1, 3, 65, 69, 71, 151 and 152.
  • 20. The nucleic acid of claim 15, wherein the thermophilic amino acid sequence confers an enzymatic activity to the encoded protein at a permissible temperature of 36° C., the enzymatic activity selected from the group consisting of decarboxylase, nuclease, synthase, recombinase, helicase, dehydrogenase, reductase, nucleotide primase, kinase, protease, nucleotidyltransferase, nucleic acid polymerase, deaminase, acyltransferase, terminase, helicase, glycosyltransferase and peptidase.
  • 21. The nucleic acid of claim 20, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 2, 4-64, 70-149, 151 and 153-170.
  • 22. The nucleic acid of claim 21, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 5, 8, 9, 12-14, 17, 18, 23-27, 29, 33, 38, 41, 42, 52, 57, 59, 60, 62, 71, 79, 114, 144 and 161.
  • 23. The nucleic acid of claim 21, wherein the YS40 coding sequence is SEQ ID NO: 33, and the enzymatic activity is DNA polymerase.
  • 24. The nucleic acid of claim 20, wherein the permissible temperature is at least 45° C.
  • 25. The nucleic acid of claim 20, wherein the permissible temperature is at least 55° C.
  • 26. The nucleic acid of claim 20, wherein the permissible temperature is at least 65° C.
  • 27. The nucleic acid of claim 20, wherein the permissible temperature is at least 75° C.
  • 28. A recombinant vector comprising the nucleic acid of claim 15 operably linked to a promoter wherein introduction of the vector into an expression system produces a protein having, at a permissible temperature of 36° C., an enzyme activity selected from the group consisting of decarboxylase, nuclease, synthase, recombinase, helicase, dehydrogenase, reductase, nucleotide primase, kinase, protease, nucleotidyltransferase, nucleic acid polymerase, deaminase, acyltransferase, terminase, helicase, glycosyltransferase and peptidase.
  • 29. The vector of claim 28, wherein the promoter is inducible.
  • 30. The vector of claim 28, wherein the permissible temperature is at least 45° C.
  • 31. The vector of claim 28, wherein the permissible temperature is at least 55° C.
  • 32. The vector of claim 28, wherein the permissible temperature is at least 65° C.
  • 33. The vector of claim 28, wherein the permissible temperature is at least 75° C.
  • 34. The vector of claim 28, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 2, 4-64, 70-149, 151 and 153-170.
  • 35. The vector of claim 34, wherein the YS40 coding sequence is selected from the group consisting of SEQ ID NO: 5, 8, 9, 12-14, 17, 18, 23-27, 29, 33, 38, 41, 42, 52, 57, 59, 60, 62, 71, 79, 114, 144 and 161.
  • 36. The vector of claim 34, wherein the YS40 coding sequence is SEQ ID NO: 33, and the enzyme activity is DNA polymerase.
  • 37. A protein expression system comprising the vector of claim 28 wherein incubating the expression system under permissible conditions produces the recombinant protein encoded by the vector.
  • 38. The protein expression system of claim 37, further comprising a cell wherein the vector is within the cell.
  • 39. An isolated nucleic acid comprising an YS40 nucleotide sequence selected from the group consisting of SEQ ID NO: 1-170, wherein the YS40 nucleotide sequence encodes a YS40 structural protein that does not take a random coil structure at a permissible temperature of 36° C., ora YS40 enzyme that displays, at a permissible temperature of 36° C., an enzyme activity selected from the group consisting of decarboxylase, nuclease, synthase, recombinase, helicase, dehydrogenase, reductase, nucleotide primase, kinase, protease, nucleotidyltransferase, nucleic acid polymerase, deaminase, acyltransferase, terminase, helicase, glycosyltransferase and peptidase.
  • 40. The nucleic acid of claim 39, further comprising a regulatory element operably linked to the YS40 nucleotide sequence.
  • 41. The nucleic acid of claim 39, wherein the permissible temperature is at least 45° C.
  • 42. The nucleic acid of claim 39, wherein the permissible temperature is at least 55° C.
  • 43. The nucleic acid of claim 39, wherein the permissible temperature is at least 65° C.
  • 44. The nucleic acid of claim 39, wherein the permissible temperature is at least 75° C.
  • 45. A recombinant cell comprising the nucleic acid of claim 39.
  • 46. An isolated nucleic acid comprising: a) a vector sequence comprising no more than about 99.9% of the nucleotide sequence of SEQ ID NO: 171 andb) a non-YS40 nucleotide sequence of at least 20 contiguous nucleotides having a 3′ end and a 5′ end,wherein the non-YS40 nucleotide sequence is inserted into the vector sequence whereby the non-YS40 nucleotide sequence is flanked on the 3′ end and the 5′ end by at least 10 contiguous nucleotides of the vector sequence.
  • 47. The nucleic acid of claim 46, further comprising a regulatory element operably linked to the non-YS40 nucleotide sequence.
  • 48. A recombinant system comprising a cell including the nucleic acid of claim 46.
  • 49. The recombinant system of claim 48, wherein the cell is Thermus thermophilus.
  • 50. A method of amplifying a nucleic acid comprising contacting the nucleic acid with a PCR reagent mixture including a recombinant protein, including conservatively modified variants thereof, comprising a thermophilic amino acid sequence at least 75% identical to an YS40 amino acid sequence encoded by at least 25 contiguous codons of a YS40 coding sequence selected from the group consisting of SEQ ID NO: 2, 4-64, 70-149, 151 and 153-170, wherein the thermophilic amino acid sequence confers to the recombinant protein, at a permissible temperature of 36° C., an enzyme activity necessary for DNA amplification.
  • 51. The method of claim 50, wherein the YS40 coding sequence of the thermophilic amino acid sequence is selected from the group consisting of SEQ ID NO: 5, 8, 9, 12-14, 17, 18, 23-27, 29, 33, 38, 41, 42, 52, 57, 59, 60, 62, 71, 79, 114, 144 and 161.
  • 52. The method of claim 50, wherein the YS40 coding sequence of the YS40 amino acid sequence is SEQ ID NO: 33, and the enzyme activity is DNA polymerase.
  • 53. A method of amplifying a nucleic acid from a whole cell, the method comprising: contacting the cell with at least one recombinant protein comprising a thermophilic amino acid sequence, including conservatively modified variants thereof, that is at least 75% identical to a YS40 amino acid sequence encoded by at least 25 contiguous codons of a YS40 coding sequence selected from the group consisting of SEQ ID NO: 1-170, wherein the thermophilic amino acid sequence confers to the thermophilic protein, at a permissible temperature of 36° C., an enzyme activity necessary for DNA amplification or DNA entry into the cell.
  • 54. The method of claim 53, wherein DNA entry into the cell comprises lysing the cell.
  • 55. A method for decomposing a biodegradable material comprising contacting the biodegradable material with at least one recombinant protein, including conservatively modified variants thereof, comprising a thermophilic amino acid sequence at least 75% identical to an YS40 amino acid sequence encoded by at least 25 contiguous codons of a YS40 coding sequence selected from the group consisting of SEQ ID NO: 2, 4-64, 70-149, 151 and 153-170, wherein the thermophilic amino acid sequence confers to the recombinant protein, at a permissible temperature of 36° C., an enzyme activity necessary for decomposing the biodegradable material selected from the group consisting of protease, amylase, cellulase, nuclease, lipase, deaminase and peptidase.
  • 56. A kit suitable for use in amplifying a nucleic acid, the kit comprising: a) a reagent comprising at least one recombinant protein, including conservatively modified variants thereof, comprising a thermophilic amino acid sequence at least 75% identical to an YS40 amino acid sequence encoded by at least 25 contiguous codons of a YS40 coding sequence selected from the group consisting of SEQ ID NO: 1-170,wherein the thermophilic amino acid sequence confers to the recombinant protein, at a permissible temperature of 36° C., an enzyme activity necessary for DNA amplification or DNA entry into the cell; and,b) a buffer solution.
  • 57. The kit of claim 56, further comprising primers suitable for hybridization in a polymerase chain reaction mixture with the nucleic acid being amplified.