The present invention relates to novel polynucleotides from Ashbya gossypii; to oligonucleotides hybridizing therewith; to expression cassettes and vectors which comprise these polynucleotides; to microorganisms transformed therewith; to polypeptides encoded by these polynucleotides; and to the use of the novel polypeptides and polynucleotides as targets for modulating the operations of transcription, RNA processing and/or translation and, in particular, improving vitamin B2 production in microorganisms of the genus Ashbya.
Vitamin B2 (riboflavin, lactoflavin) is an alkali- and light-sensitive vitamin which shows a yellowish green fluorescence in solution. Vitamin B2 deficiency may lead to ectodermal damage, in particular cataract, keratitis, corneal vascularization, or to autonomic and urogenital disorders. Vitamin B2 is a precursor for the molecules FAD and FMN which, besides NAD+ and NADP+, are important in biology for hydrogen transfer. They are formed from vitamin B2 by phosphorylation (FMN) and subsequent adenylation (FAD).
Vitamin B2 is synthesized in plants, yeasts and many microorganisms from GTP and ribulose 5-phosphate. The reaction pathway starts with opening of the imidazole ring of GTP and elimination of a phosphate residue. Deamination, reduction and elimination of the remaining phosphate result in 5-amino-6-ribitylamino-2,4-pyrimidinone. Reaction of this compound with 3,4-dihydroxy-2-butanone 4-phosphate leads to the bicyclic molecule 6,7-dimethyl-8-ribityllumazine. This compound is converted into the tricyclic compound riboflavin by dismutation, in which a 4-carbon unit is transferred.
Vitamin B2 occurs in many vegetables and in meat, and to a lesser extent in cereal products. The daily vitamin B2 requirement of an adult is about 1.4 to 2 mg. The main breakdown product of the coenzymes FMN and FAD in humans is in turn riboflavin, which is excreted as such.
Vitamin B2 is thus an important dietary substance for humans and animals. Efforts are therefore being made to make vitamin B2 available on the industrial scale. It has therefore been proposed to synthesize vitamin B2 by a microbiological route. Microorganisms which can be used for this purpose are, for example, Bacillus subtilis, the ascomycetes Eremothecium ashbyii, Ashbya gossypii, and the yeasts Candida flareri and Saccharomyces cerevisiae. The nutrient media used for this purpose comprise molasses or vegetable oils as carbon source, inorganic salts, amino acids, animal or vegetable peptones and proteins, and vitamin additions. In sterile aerobic submerged processes, yields of more than 10 g of vitamin B2 are obtained per liter of culture broth within a few days. The requirements are good aeration of the culture, careful agitation and setting of temperatures below about 30° C. Removal of the biomass, evaporation and drying of the concentrate result in a product enriched in vitamin B2.
Microbiological production of vitamin B2 is described, for example, in WO-A-92101060, EP-A-0 405 370 and EP-A-0 531 708.
A survey of the importance, occurrence, production, biosynthesis and use of vitamin B2 is to be found, for example, in Ullmann's Encyclopaedia of Industrial Chemistry, volume A27, pages 521 et seq.
Transcription
Gene expression in fungi is mainly controlled at the transcription level. The transcription apparatus consists of a number of proteins which can be divided into two groups: RNA polymerase (the operative DNA-transcribing enzyme) and transcription factors (which control gene transcription by guiding the RNA polymerase to specific promoter DNA sequences which recognize these factors). Fungi such as Ashbya gossypii contain a number of different transcription factors which are specific for different promoters, growth phases, environmental conditions, substrates, oxygen levels and the like, allowing the organism to adapt to various environmental and metabolic conditions.
Promoters are specific DNA sequences which serve as docking sites for the RNA polymerase complex and transcription factors. Many promoter elements have conserved sequence elements which can be identified by homology searches; an alternative possibility is to identify promoter regions for a particular gene using standard techniques such as primer extension. Many promoter regions of eukaryotes are known (Guarente, L (1987), Ann. Rev. Biochem., 21; 425-452).
Promoter transcription control is influenced by several repression or activation mechanisms. Specific regulatory proteins (transcription factors), which bind to promoters, have the ability to block the binding of the RNA holoenzyme (repressors) or assist the latter (activators) and thus control transcription. In addition, certain enzymes modify the histones bound to the DNA and thus make it possible for either access of the transcription factors to the promoter to be prevented or made possible for the first time (Loo, S.; Rine, J. (1995); Annu. Rev. Cell. Dev. Biol., 11, 519-548). The binding of the transcription factors is likewise controlled by their interactions with other molecules such as proteins or other metabolic compounds (Evans, R. (1989), Science, 240, 889-895). The ability to control the transcription of genes thus responds to a plurality of environmental or metabolic signs makes it possible for the cells to control exactly when a gene can be expressed and how much of a gene product can be present in the cell at a point in time. This in turn prevents unnecessary expenditure of energy or unnecessary use of possibly rare intermediate compounds or cofactors.
RNA Processing
RNA is synthesized as heterogeneous fragment, with the coding sequence (exons) in eukaryotes frequently being interrupted by noncoding sequences (introns). During RNA processing after transcription, the introns are cut out (splicing) so that the coding sequence (of mRNA) can be read off on the ribosomes (Sharp, P. (1987), Science; 235, 766-771). Since the export of RNA from the cell is also controlled with the splicing, it is possible in this way to control the amount of mRNA available on the ribosomes.
Translation
Translation is the process by which a polypeptide is synthesized from amino acids in accordance with the information contained in an RNA molecule. The main components of this process are ribosomes and specific initiation or elongation factors such as eEF1 and eEF2 (Moldave (1985); Ann. Rev. Biochem., 54, 1109-1149). Ribosomes are composed of RNA (rRNA) and specific proteins. They consist of a large and a small subunit, each of which can be characterized by its sedimentation behavior in an analytical ultracentrifuge. Synthesis of the ribosomes is controlled by coordinated production of the RNA and protein components depending on the physiological state of the cell.
Each codon of the mRNA molecule encodes a particular amino acid. The conversion of mRNA into amino acid is carried out by transfer RNA (tRNA) molecules. These molecules consist of an RNA single strand (between 60 and 100 bases) which is in the form of an L-shaped three-dimensional structure with projecting regions or “arms”. One of these arms forms base pairs with a particular codon sequence on the mRNA molecule. A second arm interacts specifically with a particular amino acid (which is encoded by the codon). Other tRNA arms comprise the variable arm, the TΨC arm (which has thymidylate and pseudouridylate modifications) and the D arm (which has a dihydrouridine modification). The function of these latter structures is still unknown, but their conservation between tRNA molecules suggests a role in protein synthesis.
In order that the nucleic acid-based tRNA molecule pairs with the correct amino acid it is necessary for a family of enzymes, referred to as aminoacyl-tRNA synthetases, to act. There are many different enzymes of this type, and each is specific for a particular tRNA and a particular amino acid. These enzymes bind the 3′-hydroxyl of the terminal tRNA adenosine ribose unit to the amino acid in a two-step reaction. Firstly, the enzyme is activated by reaction with ATP and the amino acid, resulting in an aminoacyl-tRNA synthetase/aminoacyl adenylate complex. Secondly, the aminoacyl group is transferred from the enzyme to the target tRNA, on which it remains in a high-energy state. Binding of the tRNA molecule to its recognition codon on the mRNA molecule then brings the high-energy amino acid bound to the tRNA into contact with the ribosome. Inside the ribosome, the amino acid-loaded tRNA (aminoacyl-tRNA) occupies a binding site (the A site) next to a second site (the P site) which carries a tRNA molecule whose amino acid is bound to the growing polypeptide chain (peptidyl-tRNA). The activated amino acid on the aminoacyl-tRNA is sufficiently reactive for a peptide bond to form spontaneously between this amino acid and the next amino acid on the growing polypeptide chain. GTP hydrolysis supplies the energy to transfer the tRNA, which is now loaded with the polypeptide chain, from the A site to the P site of the ribosome, and the process is repeated until a stop codon is reached.
There is a number of different steps at which translation can be controlled. These include binding of the ribosome to mRNA, the presence of mRNA secondary structure, the codon usage or the frequency of particular tRNAs.
The utilization of genes associated with the mechanisms of transcription, RNA processing and/or translation for generating microorganisms, preferably of the genus Ashbya, in particular of Ashbya gossypii strains, with improved adaptability to external conditions such as environmental and metabolic conditions has not yet been described.
It is an object of the present invention to provide novel targets for influencing the transcription and/or translation mechanisms and/or the mechanisms of RNA processing in microorganisms of the genus Ashbya, in particular in Ashbya gossypii. The object in particular is specific modulation of the transcription, RNA processing and/or translation in such microorganisms. A further object is to improve the vitamin B2 production by such microorganisms.
We have found that this object is achieved by providing encoding nucleic acid sequences which are upregulated or downregulated in Ashbya gossypii during vitamin B2 production (based on results found with the aid of the MPSS analytical method described in detail in the experimental part), in particular a) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of a 26 S proteasome subunit or of a TAT binding homolog 7.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 28”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 28v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 1. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 4 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 28” and “Oligo 28v” have significant homologies with the MIPS tag “Yta7” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 1 or SEQ ID NO: 4. The amino acid sequences derived therefrom have significant sequence homology with a 26 S proteasome subunit or a TAT-binding homolog 7 (TBP-7) from S. cerevisiae.
b) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of a translation initiation factor subunit.
In a preferred embodiment there has been isolation according to the invention of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 45”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 45v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 6. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 10 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 45” and “Oligo 45v” have significant homologies with the MIPS tag “p39” or “Tif34” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 6 or SEQ ID NO: 10. An amino acid sequence derived therefrom has significant sequence homology with a subunit (P39) of the translation initiation factor EIF3 (IF32) from S. cerevisiae.
c) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of a ribosomal protein.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 85”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 85v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 12. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 14 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 85” and “Oligo 85v” have significant homologies with the MIPS tag “Rpl35a” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 12 or SEQ ID NO: 14. The amino acid sequence derived from the coding strand or amino acid part-sequence has significant sequence homology with a ribosomal protein from S. cerevisiae.
d) a, preferably upregulated, nucleic acid sequence codes for a protein having the function of a nucleolar protein.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 133”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 133v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 17. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 19 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 133” and “Oligo 133v” have significant homologies with the MIPS tag “Nop13” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 17 or SEQ ID NO: 19. The amino acid sequence or amino acid part-sequence derived from the corresponding complementary strand of SEQ ID NO: 17 or from the sequence shown in SEQ ID NO: 19 has significant sequence homology with a nucleolar protein from S. cerevisiae.
e) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of a translation initiation protein.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 172”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 172v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 21. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 24 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 172” and “Oligo 172v” have significant homologies with the MIPS tag “Sua5” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 21 or SEQ ID NO: 24. The amino acid sequence or amino acid part-sequence derived from the coding strand has significant sequence homology with a translation initiation protein from S. cerevisiae.
f) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of a precursor of ribosomal protein S 31.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 63”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 63v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 26. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 29 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 63” and “Oligo 63v” have significant homologies with the MIPS tag “Rps25a” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 26 or SEQ ID NO: 29. The amino acid sequence or amino acid part-sequence derived from the corresponding complementary strand of SEQ ID NO: 26 or from the coding strand shown in SEQ ID NO: 29 has significant sequence homology with a precursor of the ribosomal protein S 31 from S. cerevisiae.
g) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of a cell nuclear pore protein.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 132”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 132v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 31. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 36 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 132” and “Oligo 132v” have significant homologies with the MIPS tag “Nic96” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 31 or SEQ ID NO: 36. An amino acid sequence derived therefrom (corresponding to nucleotides 108 to 764 in SEQ ID NO: 31) has significant sequence homology with a cell nuclear pore protein from S. cerevisiae.
h) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of a constituent of the ADH-histone acetyltransferase complex.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 174”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 174v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 38. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 40 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 174” and “Oligo 174v” have significant homologies with the MIPS tag “Ahc1” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 38 or SEQ ID NO: 40. The amino acid sequence or amino acid part-sequence derived from the corresponding complementary strand to SEQ ID NO: 38 or from the coding strand shown in SEQ ID NO: 40 has significant sequence homology with a constituent of the ADH-histone acetyltransferase complex from S. cerevisiae.
i) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of an RNA helicase involved in RNA processing.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 51”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 51v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 42. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 46 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 51” and “Oligo 51v” have significant homologies with the MIPS tag “Rok1” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 42 or SEQ ID NO: 46. The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 42 or from the coding strand of SEQ ID NO: 46 have significant sequence homology with a S. cerevisiae RNA helicase involved in RNA processing.
k) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of the non-essential constituent of RNA poll.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 30”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 30v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 48. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 51 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 30” and “Oligo 30v” have significant homologies with the MIPS tag “Rpa34” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 48 or SEQ ID NO: 51. The amino acid sequences derived in each case from the coding strand have significant sequence homology with the non-essential constituent of RNA poll from S. cerevisiae.
l) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of an RNA helicase.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 124”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 124v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 53. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 56 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 124” and “Oligo 124v” have significant homologies with a MIPS tag “Sub2” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 53 or SEQ ID NO: 56. The amino acid sequence or amino acid part-sequence derived from the coding strand has significant sequence homology with an RNA helicase from S. cerevisiae.
m) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of an mRNA decapping enzyme.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 139”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 139v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 58. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 60 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 139” and “Oligo 139v” have significant homologies with the MIPS tag “DCP1” from S. cerevisiae. The inserts having nucleic acid sequence as shown in SEQ ID NO: 58 or SEQ ID NO: 60. The amino acid sequence or amino acid part-sequence derived from the coding strand has significant sequence homology with an mRNA decapping enzyme from S. cerevisiae.
n) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of a subunit of the translation initiation factor eIF3.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 144”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 144v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 63. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 65 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 144” and “Oligo 144v” have significant homologies with the MIPS tag “PRT1” from S. cerevisiae. The inserts have a nucleic acid sequence as shown in SEQ ID NO: 63 or SEQ ID NO: 65. The amino acid sequence or amino acid part-sequence derives from the coding strand as significant sequence homology with a subunit of the translation initiation factor eIF3 from S. cerevisiae.
o) a, preferably upregulated, nucleic acid sequence which codes for a protein having the function of a U3 small nucleolar ribonucleoprotein-substituted protein which is involved in preribosomal RNA processing.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 168”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 168v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 67. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 70 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The inserts of “Oligo 168” and “Oligo 168v” have significant homologies with the MIPS tag “Rrp9” from S. cerevisiae. The inserts have the nucleic acid sequence as shown in SEQ ID NO: 67 or SEQ ID NO: 70. The amino acid sequence or amino acid part-sequence derived from the coding strand has significant sequence homology with a S. cerevisiae U3 small nucleolar ribonucleoprotein-associated protein which is involved in preribosomal RNA processing.
p) a, preferably downregulated, nucleic acid sequence which codes for a protein having the function of the ribosomal protein L7a.e.B of the large 60 S subunit.
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 160”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 72, which can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotide complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
The insert of “Oligo 160” has significant homologies with the MIPS tag “Rpl8b” from S. cerevisiae. The insert has a nucleic acid sequence as shown in SEQ ID NO: 72. The amino acid sequence derived from the corresponding complementary strand has significant sequence homology with a ribosomal protein (L7a.e.B; large 60S subunit) from S. cerevisiae.
q) We have found that this object is achieved by providing an encoding nucleic acid sequence which is unregulated in Ashbya gossypii during vitamin B2 production (based on results found with the aid of the MPSS analytical method described in detail in the experimental part).
In a preferred embodiment of this aspect of the invention there has been isolation of a DNA clone which codes for a characteristic part-sequence of the nucleic acid sequence of the invention and which bears the internal name “Oligo 18”.
In a further preferred embodiment there has been isolation according to the invention of a DNA clone which codes for the complete sequence of the nucleic acid of the invention and which bears the internal name “Oligo 18v”.
A first aspect of the present invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 75 or the polynucleotide complementary thereto as shown in SEQ ID NO: 74. A further aspect of the invention relates to a polynucleotide comprising a nucleic acid sequence as shown in SEQ ID NO: 77 or a fragment thereof. The polynucleotides can be isolated preferably from a microorganism of the genus Ashbya, in particular A. gossypii. The invention additionally relates to the polynucleotides complementary thereto; and to the sequences derived from these polynucleotides through the degeneracy of the genetic code.
A further aspect of the invention relates to oligonucleotides which hybridize with one of the above polynucleotides, in particular under stringent conditions.
The invention additionally relates to polynucleotides which hybridize with one of the oligonucleotides of the invention and code for a gene product from microorganisms of the genus Ashbya or a functional equivalent of this gene product.
The invention further relates to polypeptides or proteins which are encoded by the polynucleotides described above; and to peptide fragments thereof which have an amino acid sequence which comprises at least 10 consecutive amino acid residues as shown in SEQ ID NO: 2, 3, 5, 7, 8, 9, 11, 13, 15, 16, 18, 20, 22, 23, 25, 27, 28, 30, 32, 33, 34, 35, 37, 39, 41, 43, 44, 45, 47, 49, 50, 52, 54, 55, 57, 59, 61, 62, 64, 66, 68, 69, 71, 73, 76, or SEQ ID NO: 78; and to functional equivalents of the polypeptides or proteins of the invention.
In this connection, functional equivalents differ from the products specifically disclosed in the invention by their amino acid sequence through addition, insertion, substitution, deletion or inversion at a minimum of one, such as, for example, 1 to 30 or 1 to 20 or 1 to 10, sequence positions without the originally observed protein function, which can be deduced by sequence comparison with other proteins, being lost. It is thus possible for equivalents to have essentially identical, higher or lower activities compared with the native protein.
Further aspects of the invention relate to expression cassettes for the recombinant production of proteins of the invention, comprising one of the nucleic acid sequences defined above, operatively linked to at least one regulatory nucleic acid sequence; and to recombinant vectors comprising at least one such expression cassette of the invention.
Also provided according to the invention are prokaryotic or eukaryotic hosts which are transformed with at least one vector of the above type. A preferred embodiment provides prokaryotic or eukaryotic hosts in which the functional expression of at least one gene which codes for a polypeptide of the invention as defined above is modulated (e.g. inhibited or overexpressed); or in which the biological activity of a polypeptide as defined above is reduced or increased. Preferred hosts are selected from ascomycetes, in particular those of the genus Ashbya and preferably strains of A. gossypii.
Modulation of gene expression in the above sense includes both inhibition thereof, for example through blockade of a stage in expression (in particular transcription or translation) or a specific overexpression of a gene (for example through modification of regulatory sequences or increasing the copy number of the coding sequence).
A further aspect of the invention relates to the use of an expression cassette of the invention, of a vector of the invention or of a host of the invention for the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof.
A further aspect of the invention relates to the use of an expression cassette of the invention, of a vector of the invention or of a host of the invention for the recombinant production of a polypeptide of the invention as defined above.
Also provided according to the invention is a method for detecting or for validating an effector target for modulating the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof. This entails treating a microorganism capable of the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof with an effector which interacts with (such as, for example, non-covalently binds to) a target selected from a polypeptide of the invention as defined above or a nucleic acid sequence coding therefor, validating the influence of the effector on the amount of the microbiologically produced vitamin B2 and/or of the precursor and/or of a derivative thereof; and isolating the target where appropriate. The validation in this case takes place preferably by direct comparison with the microbiological vitamin B2 production in the absence of the effector under otherwise identical conditions.
A further aspect of the invention relates to a method for modulating (in relation to the amount and/or rate of) the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof, where a microorganism capable of the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof is treated with an effector which interacts with a target selected from a polypeptide of the invention as defined above or a nucleic acid sequence coding therefor.
Preferred examples of the abovementioned effectors which should be mentioned are:
The invention likewise relates to the abovementioned effectors having specificity for at least one of the targets, according to the invention, defined above.
A further aspect of the invention relates to a method for the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof, where a host as defined above is cultivated under conditions favoring the production of vitamin B2 and/or precursors and/or derivatives thereof, and the desired product(s) is(are) isolated from the culture mixture. It is preferred in this connection that the host is treated with an effector as defined above before and/or during the cultivation. A preferred host is in this case selected from microorganisms of the genus Ashbya; in particular transformed as described above.
A final aspect of the invention relates to the use of a polynucleotide or polypeptide of the invention as target for modulating the production of vitamin B2 and/or precursors and/or derivatives thereof in a microorganism of the genus Ashbya.
The nucleic acid molecules of the invention encode polypeptides or proteins which are referred to here as proteins of the transcription, RNA processing and/or translation (for example with activity in relation to transcription, RNA processing, splicing or translation) or for short as “TT proteins”. These TT proteins have, for example, a function in the adaptation to various growth phases and environmental and metabolic conditions such as substrates, oxygen level and the like.
Owing to the availability of cloning vectors which can be used in Ashbya gossypii, as disclosed, for example, in Wright and Philipsen (1991) Gene, 109, 99-105, and of techniques for genetic manipulation of A. gossypii and the related yeast species, the nucleic acid molecules of the invention can be used for genetic manipulation of these organisms, in particular of A. gossypii, in order to make them better and more efficient producers of vitamin B2 and/or precursors and/or derivatives thereof. This improved production or efficiency may result from a direct effect of the manipulation of a gene of the invention or result from an indirect effect of such a manipulation.
The present invention is based on the provision of novel molecules which are referred to here as TT nucleic acids and TT proteins and are involved in the transcription, RNA processing and/or translation, in particular in Ashbya gossypii (e.g. in the regulation of transcription, RNA processing and/or translation). The activity of the TT molecules of the invention in A. gossypii influences vitamin B2 production by this organism. The activity of the TT molecules of the invention is preferably modulated so that the metabolic and/or energy pathways of A. gossypii in which the TT proteins of the invention are involved are modulated in relation to the yield, production and/or efficiency of vitamin B2 production, which modulates either directly or indirectly the yield, production and/or efficiency of vitamin B2 production in A. gossypii.
The nucleic acid sequences provided by the invention can be isolated, for example, from the genome of an Ashbya gossypii strain which is freely available from the American Type Culture Collection under the number ATCC 10895.
Improvement in Vitamin B2 Production:
There is a number of possible mechanisms by which the yield, production and/or efficiency of production of vitamin B2 by an A. gossypii strain can be influenced directly through changing the amount and/or activity of a TT protein of the invention.
Thus, a more efficient transcription, RNA processing or translation, which adapts expression of the desired gene products to the external conditions, can achieve optimization of the formation of the desired products of value.
Mutagenesis of one or more TT proteins of the invention may also lead to TT proteins with altered (increased or reduced) activities which influence indirectly the production of the required product from A. gossypii. It is possible, for example, with the aid of the TT proteins for the progress of transcription, RNA processing and/or translation to be assisted (e.g by activators) or blocked (e.g. by repressors) at various points, and thus gene expression or protein synthesis to be influenced. The yield of target product can thus be increased or optimized in relation to external conditions.
Polypeptides
The invention relates to polypeptides which comprise the abovementioned amino acid sequences or characteristic part-sequences thereof and/or are encoded by the nucleic acid sequences described herein.
The invention likewise encompasses “junctional equivalents” of the specifically disclosed novel polypeptides.
“Functional equivalents” or analogs of the specifically disclosed polypeptides are for the purposes of the present invention polypeptides which differ therefrom but which still have the desired biological activity (such as, for example, substrate specificity).
“Functional equivalents” mean according to the invention in particular mutants which have in at least one of the abovementioned sequence positions an amino acid which differs from that specifically mentioned but nevertheless have one of the abovementioned biological activities. “Functional equivalents” thus comprise the mutants obtainable by one or more amino acid additions, substitutions, deletions and/or inversions, it being possible for said modifications to occur in any sequence position as long as they lead to a mutant having the profile of properties of the invention. Functional equivalence exists in particular also when there is qualitative agreement between mutant and unmodified polypeptide in the reactivity pattern, i.e. there are differences in the rate of conversion of identical substrates, for example.
“Functional equivalents” in the above sense are also precursors of the polypeptides described, and functional derivatives and salts of the polypeptides. The term “salts” means both salts of carboxyl groups and acid addition salts of amino groups in the protein molecules of the invention. Salts of carboxyl groups can be prepared in a manner known per se and comprise inorganic salts such as, for example, sodium, calcium, ammonium, iron and zinc salts, and salts with organic bases such as, for example, amines such as triethanolamine, arginine, lysine, piperidine and the like. Acid addition salts such as, for example, salts with mineral acids such as hydrochloric acid or sulfuric acid and salts with organic acids such as acetic acid and oxalic acid are also an aspect of the invention.
“Functional derivatives” of polypeptides of the invention can also be prepared at functional amino acid side groups or at their N- or C-terminal end by known techniques. Such derivatives include, for example, aliphatic esters of carboxyl groups, amides of carboxyl groups obtainable by reaction with ammonia or with a primary or secondary amine; N-acyl derivatives of free amino groups prepared by reaction with acyl groups; or O-acyl derivatives of free hydroxyl groups prepared by reaction with acyl groups. “Functional equivalents” naturally also comprise polypeptides which are obtainable from other organisms, and naturally occurring variants. For example homologous sequence regions can be found by sequence comparison, and equivalent enzymes can be established on the basis of the specific requirements of the invention.
“Functional equivalents” likewise comprise fragments, preferably single domains or sequence motifs, of the polypeptides of the invention, which have, for example, the desired biological function.
“Functional equivalents” are additionally fusion proteins which have one of the abovementioned polypeptide sequences or functional equivalents derived therefrom and at least one other heterologous sequence functionally different therefrom in functional N- or C-terminal linkage (i.e. with negligible mutual impairment of the functions of the parts of the fusion proteins). Nonlimiting examples of such heterologous sequences are, for example, signal peptides, enzymes, immunoglobulins, surface antigens, receptors or receptor ligands.
“Functional equivalents” include according to the invention homologs of the specifically disclosed proteins. These have at least 60%, preferably at least 75%, in particular at least 85%, such as, for example, 90%, 95% or 99%, homology to one of the specifically disclosed sequences, calculated by the algorithm of Pearson and Lipman, Proc. Natl. Acad. Sci. (USA) 85(8), 1988, 2444-2448.
In the case where protein glycosylation is possible, equivalents of the invention include proteins of the type defined above in deglycosylated or glycosylated form, and modified forms obtainable by altering the glycosylation pattern.
Homologs of the proteins or polypeptides of the invention can be generated by mutagenesis, for example by point mutation or truncation of the protein. The term “homolog” as used here relates to a variant form of the protein which acts as agonist or antagonist of the protein activity.
Homologs of the proteins of the invention can be identified by screening combinatorial libraries of mutants such as, for example, truncation mutants. It is possible, for example, to generate a variegated library of protein variants by combinatorial mutagenesis at the nucleic acid level, such as, for example, by enzymatic ligation of a mixture of synthetic oligonucleotides. There is a large number of methods which can be used to produce libraries of potential homologs from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic gene can then be ligated into a suitable expression vector. The use of a degenerate set of genes makes it possible to provide all sequences which encode the desired set of potential protein sequences in one mixture. Methods for synthesizing degenerate oligonucleotides are known to the skilled worker (for example Narang, S. A. (1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al., (1984) Science 198:1056; Ike et al. (1983) Nucleic Acids Res. 11:477).
In addition, libraries of fragments of the protein codon can be used to generate a variegated population of protein fragments for screening and for subsequent selection of homologs of a protein of the invention. In one embodiment, a library of coding sequence fragments can be generated by treating a double-stranded PCR fragment of a coding sequence with a nuclease under conditions under which nicking takes place only about once per molecule, denaturing the double-stranded DNA, renaturing the DNA to form double-stranded DNA, which may comprise sense/antisense pairs of different nicked products, removing single-stranded sections from newly formed duplices by treatment with S1 nuclease and ligating the resulting fragment library into an expression vector. It is possible by this method to derive an expression library which encodes N-terminal, C-terminal and internal fragments having different sizes of the protein of the invention.
Several techniques are known in the prior art for screening gene products from combinatorial libraries which have been produced by point mutations or truncation and for screening cDNA libraries for gene products with a selected property. These techniques can be adapted to rapid screening of gene libraries which have been generated by combinatorial mutagenesis of homologs of the invention. The most frequently used techniques for screening large gene libraries undergoing high-throughput analysis comprise the cloning of the gene library into replicable expression vectors, transformation of suitable cells with the resulting vector library and expression of the combinatorial genes under conditions under which detection of the required activity facilitates isolation of the vector which encodes the gene whose product has been detected. Recursive ensemble mutagenesis (REM), a technique which increases the frequency of functional mutants in the libraries, can be used in combination with the screening tests for identifying homologs (Arkin and Yourvan (1992) PNAS 89:7811-7815; Delgrave et al. (1993) Protein Engineering 6(3):327-331).
Recombinant preparation of the polypeptides of the invention is possible (see following sections) or they can be isolated in native form from microorganisms, especially those of the genus Ashbya, by use of conventional biochemical techniques (see Cooper, T. G., Biochemische Arbeitsmethoden, Verlag Walter de Gruyter, Berlin, New York or in Scopes, R., Protein Purification, Springer Verlag, New York, Heidelberg, Berlin.
Nucleic Acid Sequences:
The invention also relates to nucleic acid sequences (single- and double-stranded DNA and RNA sequences such as, for example, cDNA and mRNA), coding for one of the above polypeptides and their functional equivalents which are obtainable, for example, by use of artificial nucleotide analogs.
The invention relates both to isolated nucleic acid molecules which code for polypeptides or proteins of the invention or biologically active sections thereof, and to nucleic acid fragments which can be used, for example, for use as hybridization probes or primers for identifying or amplifying coding nucleic acids of the invention.
The nucleic acid molecules of the invention may additionally comprise untranslated sequences from the 3′ and/or 5′ end of the coding region of the gene.
An “isolated” nucleic acid molecule is separated from other nucleic acid molecules which are present in the natural source of the nucleic acid and may moreover be essentially free of other cellular material or culture medium if it is produced by recombinant techniques, or free of chemical precursors or other chemicals if it is chemically synthesized.
A nucleic acid molecule of the invention can be isolated by using standard techniques of molecular biology and the sequence information provided according to the invention. For example, cDNA can be isolated from a suitable cDNA library by using one of the specifically disclosed complete sequences or a section thereof as hybridization probe and standard hybridization techniques (as described, for example, in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). It is moreover possible for a nucleic acid molecule comprising one of the disclosed sequences or a section thereof to be isolated by polymerase chain reaction using the oligonucleotide primers constructed on the basis of this sequence. The nucleic acid amplified in this way can be cloned into a suitable vector and be characterized by DNA sequence analysis. The oligonucleotides of the invention which correspond to a TT nucleotide sequence can also be produced by standard synthetic methods, for example using an automatic DNA synthesizer.
The invention additionally comprises the nucleic acid molecules which are complementary to the specifically described nucleotide sequences, or a section thereof.
The nucleotide sequences of the invention make it possible to generate probes and primers which can be used for identifying and/or cloning homologous sequences in other cell types and organisms. Such probes and primers usually comprise a nucleotide sequence region which hybridizes under stringent conditions onto at least about 12, preferably at least about 25, such as, for example, 40, 50 or 75, consecutive nucleotides of a sense strand of a nucleic acid sequence of the invention or a corresponding antisense strand.
Further nucleic acid sequences of the invention are derived from SEQ ID NO: 1, 4, 6, 10, 12, 14, 17, 19, 21, 24, 26, 29, 31, 36, 38, 40, 42, 46, 48, 51, 53, 56, 58, 60, 63, 65, 67, 70, 72, 74, 75, or SEQ ID NO: 77 and differ therefrom through addition, substitution, insertion or deletion of one or more nucleotides, but still code for polypeptides having the desired profile of properties.
The invention also encompasses nucleic acid sequences which comprise so-called silent mutations or are modified, by comparison with a specifically mentioned sequence, in accordance with the codon usage of a specific source or host organism, as well as naturally occurring variants such as, for example, splice variants or allelic variants, thereof. It likewise relates to sequences which are obtainable by conservative nucleotide substitutions (i.e. the relevant amino acid is replaced by an amino acid with the same charge, size, polarity and/or solubility).
The invention also relates to molecules derived from the specifically disclosed nucleic acids through sequence polymorphisms. These genetic polymorphisms may exist because of the natural variation between individuals within a population. These natural variations normally result in a variance of from 1 to 5% in the nucleotide sequence of a gene.
The invention additionally encompasses nucleic acid sequences which hybridize with or are complementary to the abovementioned coding sequences. These polynucleotides can be found on screening of genomic or cDNA libraries and, where appropriate, be amplified therefrom by means of PCR using suitable primers, and then, for example, be isolated with suitable probes. Another possibility is to transform suitable microorganisms with polynucleotides or vectors of the invention, multiply the microorganisms and thus the polynucleotides, and then isolate them. An additional possibility is to synthesize polynucleotides of the invention by chemical routes.
The property of being able to “hybridize” onto polynucleotides means the ability of a polynucleotide or oligonucleotide to bind under stringent conditions to an almost complementary sequence, while there are no nonspecific bindings between noncomplementary partners under these conditions. For this purpose, the sequences should be 70-100%, preferably 90-100%, complementary. The property of complementary sequences being able to bind specifically to one another is made use of, for example, in the Northern or Southern blot technique or in PCR or RT-PCR in the case of primer binding. Oligonucleotides with a length of 30 base pairs or more are normally employed for this purpose. Stringent conditions mean, for example, in the Northern blot technique the use of a washing solution at 50-70° C., preferably 60-65° C., for example 0.1×SSC buffer with 0.1% SDS (20×SSC: 3M NaCl, 0.3M Na citrate, pH 7.0) for eluting nonspecifically hybridized cDNA probes or oligonucleotides. In this case, as mentioned above, only nucleic acids with a high degree of complementarity remain bound to one another. The setting up of stringent conditions is known to the skilled worker and is described, for example, in Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6.
A further aspect of the invention relates to antisense nucleic acids. This comprises a nucleotide sequence which is complementary to a coding sense nucleic acid. The antisense nucleic acid may be complementary to the entire coding strand or only to a section thereof. In a further embodiment, the antisense nucleic acid molecule is antisense to a noncoding region of the coding strand of a nucleotide sequence. The term “noncoding region” relates to the sequence sections which are referred to as 5′- and 3′-untranslated regions.
An antisense oligonucleotide may be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides long. An antisense nucleic acid of the invention can be constructed by chemical synthesis and enzymatic ligation reactions using methods known in the art. An antisense nucleic acid can be synthesized chemically, using naturally occurring nucleotides or variously modified nucleotides which are configured so that they increase the biological stability of the molecules or increase the physical stability of the duplex formed between the antisense and sense nucleic acids. Examples which can be used are phosphorothioate derivatives and acridine-substituted nucleotides. Examples of modified nucleosides which can be used for generating the antisense nucleic acid are, inter alia, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxymethyl)uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueuosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueuosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N-6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queuosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, methyl uracil-5-oxyacetate, 3-(3-amino-3-carboxypropyl)uracil, (acp3)w and 2,6-diaminopurine. The antisense nucleic acid may also be produced biologically by using an expression vector into which a nucleic acid has been subcloned in the antisense direction.
The antisense nucleic acid molecules of the invention are normally administered to a cell or generated in situ so that they hybridize with the cellular mRNA and/or a coding DNA or bind thereto, so that expression of the protein is inhibited for example by inhibition of transcription and/or translation.
The antisense molecule can be modified so that it binds specifically to a receptor or to an antigen which is expressed on a selected cell surface, for example through linkage of the antisense nucleic acid molecule to a peptide or an antibody which binds to a cell surface receptor or antigen. The antisense nucleic acid molecule can also be administered to cells by using the vectors described herein. The vector constructs preferred for achieving adequate intracellular concentrations of the antisense molecules are those in which the antisense nucleic acid molecule is under the control of a strong bacterial, viral or eukaryotic promoter.
In a further embodiment, the antisense nucleic acid molecule of the invention is an alpha-anomeric nucleic acid molecule. An alpha-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA, with the strands running parallel to one another, in contrast to normal alpha units (Gaultier et al., (1987) Nucleic Acids Res. 15:6625-6641). The antisense nucleic acid molecule may additionally comprise a 2′-O-methylribonucleotide (Inoue et al., (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analog (Inoue et al. (1987) FEBS Lett. 215:327-330).
The invention also relates to ribozymes. These are catalytic RNA molecules with ribonuclease activity which are able to cleave a single-stranded nucleic acid such as an mRNA to which they have a complementary region. It is thus possible to use ribozymes (for example hammerhead ribozymes (described in Haselhoff and Gerlach (1988) Nature 334:585-591)) for the catalytic cleavage of transcripts of the invention in order thereby to inhibit the translation of the corresponding nucleic acid. A ribozyme with specificity for a coding nucleic acid of the invention can be formed, for example, on the basis of a cDNA specifically disclosed herein. For example a derivative of a tetrahymena-L-19 IVS RNA can be constructed, with the nucleotide sequence of the active site being complementary to the nucleotide sequence to be cleaved in a coding mRNA of the invention. (Compare, for example, U.S. Pat. No. 4,987,071 and U.S. Pat. No. 5,116,742). Alternatively, mRNA can be used for selecting a catalytic RNA with specific ribonuclease activity from a pool of RNA molecules (see, for example, Bartel, D., and Szostak, J. W. (1993) Science 261:1411-1418).
Gene expression of sequences of the invention can alternatively be inhibited by targeting nucleotide sequences which are complementary to the regulatory region of a nucleotide sequence of the invention (for example to a promoter and/or enhancer of a coding sequence) so that there is formation of triple helix structures which prevent transcription of the corresponding gene in target cells (Helene, C. (1991) Anticancer Drug Res. 6(6) 569-584; Helene, C. et al., (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher., L. J. (1992) Bioassays 14(12):807-815).
Expression Constructs and Vectors:
The invention additionally relates to expression constructs comprising, under the genetic control of regulatory nucleic acid sequences, a nucleic acid sequence coding for a polypeptide of the invention; and to vectors comprising at least one of these expression constructs. Such constructs of the invention preferably comprise a promoter 5′-upstream from the particular coding sequence, and a terminator sequence 3′-downstream, and, where appropriate, other usual regulatory elements, in particular each operatively linked to the coding sequence. “Operative linkage” means the sequential arrangement of promoter, coding sequence, terminator and, where appropriate, other regulatory elements in such a way that each of the regulatory elements is able to comply with its function as intended for expression of the coding sequence. Examples of sequences which can be operatively linked are targeting sequences and enhancers, polyadenylation signals and the like. Other regulatory elements comprise selectable markers, amplification signals, origins of replication and the like. Suitable regulatory sequences are described, for example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990).
In addition to the artificial regulatory sequences it is possible for the natural regulatory sequence still to be present in front of the actual structural gene. This natural regulation can, where appropriate, be switched off by genetic modification, and expression of the genes can be increased or decreased. The gene construct can, however, also have a simpler structure, that is to say no additional regulatory signals are inserted in front of the structural gene, and the natural promoter with its regulation is not deleted. Instead, the natural regulatory sequence is mutated so that regulation no longer takes place, and gene expression is enhanced or diminished. The nucleic acid sequences may be present in one or more copies in the gene construct.
Examples of promoters which can be used are: cos, tac, trp, tet, trp-tet, lpp, lac, lpp-lac, laclq, T7, T5, T3, gal, trc, ara, SP6, λ-PR or λ-PL promoter, which are advantageously used in Gram-negative bacteria; and the Gram-positive promoters amy and SPO2, the yeast promoters ADC1, MFα, AC, P-60, CYC1, GAPDH or the plant promoters CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33, not or the ubiquitin or phaseolin promoter. The use of inducible promoters is particularly preferred, such as, for example, light- and, in particular, temperature-inducible promoters such as the PrPl promoter. It is possible in principle for all natural promoters with their regulatory sequences to be used. In addition, it is also possible advantageously to use synthetic promoters.
Said regulatory sequences are intended to make specific expression of the nucleic acid sequences possible. This may mean, for example, depending on the host organism, that the gene is expressed or overexpressed only after induction or that it is immediately expressed and/or overexpressed.
The regulatory sequences or factors may moreover preferably influence positively, and thus increase or reduce, expression. Thus, enhancement of the regulatory elements can take place advantageously at the level of transcription by using strong transcription signals such as promoters and/or enhancers. However, it is also possible to enhance translation by, for example, improving the stability of the mRNA.
An expression cassette is produced by fusing a suitable promoter to a suitable nucleotide sequence of the invention and to a terminator signal or polyadenylation signal. Conventional techniques of recombination and cloning are used for this purpose, as described, for example, in T. Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1982) and in T. J. Silhavy, M. L. Berman and L. W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1984) and in Ausubel, F. M. et al., Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience (1987).
For expression in a suitable host organism, the recombinant nucleic acid construct or gene construct is advantageously inserted into a host-specific vector, which makes optimal expression of the genes in the host possible. Vectors are well known to the skilled worker and can be found, for example, in “Cloning Vectors” (Pouwels P. H. et al., eds, Elsevier, Amsterdam-New York-Oxford, 1985). Vectors also mean not only plasmids but also all other vectors known to the skilled worker, such as, for example, phages, viruses, such as SV40, CMV, baculovirus and adenovirus, transposons, IS elements, phasmids, cosmids, and linear or circular DNA. These vectors may undergo autonomous replication in the host organism or chromosomal replication.
Examples of suitable expression vectors which may be mentioned are:
Conventional fusion expression vectors such as pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT 5 (Pharmacia, Piscataway, N.J.), with which respectively glutathione S-transferase (GST), maltose E-binding protein and protein A are fused to the recombinant target protein.
Non-fusion protein expression vectors such as pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al. Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 60-89).
Yeast expression vector for expression in the yeast S. cerevisiae, such as pYepSec1 (Baldari et al., (1987) Embo J. 6:229-234), pMFα (Kurjan and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz et al. (1987) Gene 54:113-123) and pYES2 (Invitrogen Corporation, San Diego, Calif.). Vectors and methods for constructing vectors suitable for the use in other fungi such as filamentous fungi comprise those which are described in detail in: van den Hondel, C. A. M. J. J. & Punt, P. J. (1991) “Gene transfer systems and vector development for filamentous fungi, in: Applied Molecular Genetics of Fungi, J. F. Peberdy et al., eds, pp. 1-28, Cambridge University Press: Cambridge.
Baculovirus vectors which are available for expression of proteins in cultured insect cells (for example Sf9 cells) comprise the pAc series (Smith et al., (1983) Mol. Cell Biol. 3:2156-2165) and pVL series (Lucklow and Summers (1989) Virology 170:31-39).
Plant expression vectors such as those described in detail in: Becker, D., Kemper, E., Schell, J. and Masterson, R. (1992) “New plant binary vectors with selectable markers located proximal to the left border”, Plant Mol. Biol. 20:1195-1197; and Bevan, M. W. (1984) “Binary Agrobacterium vectors for plant transformation”, Nucl. Acids Res. 12:8711-8721.
Mammalian expression vectors such as pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195).
Further suitable expression systems for prokaryotic and eukaryotic cells are described in chapters 16 and 17 of Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular cloning:
A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
Recombinant Microorganisms:
The vectors of the invention can be used to produce recombinant microorganisms which are transformed, for example, with at least one vector of the invention and can be employed for producing the polypeptides of the invention. The recombinant constructs of the invention described above are advantageously introduced and expressed in a suitable host system. Cloning and transfection methods familiar to the skilled worker, such as, for example, coprecipitation, protoplast fusion, electroporation, retroviral transfection and the like, are preferably used to bring about expression of said nucleic acids in the particular expression system. Suitable systems are described, for example, in Current Protocols in Molecular Biology, F. Ausubel et al., eds, Wiley Interscience, New York 1997, or Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
It is also possible according to the invention to produce homologously recombined microorganisms. This entails production of a vector which contains at least one section of a gene of the invention or a coding sequence, in which, where appropriate, at least one amino acid deletion, addition or substitution has been introduced in order to modify, for example functionally disrupt, the sequence of the invention (knockout vector). The introduced sequence may, for example, also be a homolog from a related microorganism or be derived from a mammalian, yeast or insect source. The vector used for homologous recombination may alternatively be designed so that the endogenous gene is mutated or otherwise modified during the homologous recombination but still encodes the functional protein (for example the regulatory region located upstream may be modified in such a way that this modifies expression of the endogenous protein). The modified section of the TT gene is in the homologous recombination vector. The construction of suitable vectors for homologous recombination is, for example, described in Thomas, K. R. and Capecchi, M. R. (1987) Cell 51:503.
Suitable host organisms are in principle all organisms which enable expression of the nucleic acids of the invention, their allelic variants, their functional equivalents or derivatives. Host organisms mean, for example, bacteria, fungi, yeasts, plant or animal cells. Preferred organisms are bacteria, such as those of the genera Escherichia, such as, for example, Escherichia coli, Streptomyces, Bacillus or Pseudomonas, eukaryotic microorganisms such as Saccharomyces cerevisiae, Aspergillus, higher eukaryotic cells from animals or plants, for example Sf9 or CHO cells. Preferred organisms are selected from the genus Ashbya, in particular from A. gossypii strains.
Successfully transformed organisms can be selected through marker genes which are likewise present in the vector or in the expression cassette. Examples of such marker genes are genes for antibiotic resistance and for enzymes which catalyze a color-forming reaction which causes staining of the transformed cell. These can then be selected by automatic cell sorting. Microorganisms which have been successfully transformed with a vector and harbor an appropriate antibiotic resistance gene (for example G418 or hygromycin) can be selected by appropriate antibiotic-containing media or nutrient media. Marker proteins present on the surface of the cell can be used for selection by means of affinity chromatography.
The combination of the host organisms and the vectors appropriate for the organisms, such as plasmids, viruses or phages, such as, for example, plasmids with the RNA polymerase/promoter system, phages λ or μ or other temperate phages or transposons and/or other advantageous regulatory sequences forms an expression system. The term “expression system” means, for example, the combination of mammalian cells, such as CHO cells, and vectors, such as pcDNA3neo vector, which are suitable for mammalian cells.
If desired, the gene product can also be expressed in transgenic organisms such as transgenic animals such as, in particular, mice, sheep or transgenic plants.
Recombinant Production of the Polypeptides:
The invention further relates to methods for the recombinant production of a polypeptide of the invention or functional, biologically active fragments thereof, wherein a polypeptide-producing microorganism is cultured, expression of the polypeptides is induced where appropriate, and they are isolated from the culture. The polypeptides can also be produced on the industrial scale in this way if desired.
The recombinant microorganism can be cultured and fermented by known methods. Bacteria can be grown, for example, in TB or LB medium and at a temperature of 20 to 40° C. and a pH of from 6 to 9. Details of suitable culturing conditions are described, for example, in T. Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1982).
If the polypeptides are not secreted into the culture medium, the cells are then disrupted and the product is obtained from the lysate by known protein isolation methods. The cells may alternatively be disrupted by high-frequency ultrasound, by high pressure, such as, for example, in a French pressure cell, by osmolysis, by the action of detergents, lytic enzymes or organic solvents, by homogenizers or by a combination of a plurality of the methods mentioned.
The polypeptides can be purified by known chromatographic methods such as molecular sieve chromatography (gel filtration), such as Q-Sepharose chromatography, ion exchange chromatography and hydrophobic chromatography, and by other usual methods such as ultrafiltration, crystallization, salting out, dialysis and native gel electrophoresis. Suitable methods are described, for example, in Cooper, T. G., Biochemische Arbeitsmethoden, Verlag Walter de Gruyter, Berlin, New York or in Scopes, R., Protein Purification, Springer Verlag, New York, Heidelberg, Berlin.
It is particularly advantageous for isolation of the recombinant protein to use vector systems or oligonucleotides which extend the cDNA by particular nucleotide sequences and thus code for modified polypeptides or fusion proteins which serve, for example, for simpler purification. Suitable modifications of this type are, for example, so-called tags which act as anchors, such as, for example, the modification known as hexa-histidine anchor, or epitopes which can be recognized as antigens by antibodies (described, for example, in Harlow, E. and Lane, D., 1988, Antibodies: A Laboratory Manual. Cold Spring Harbor (N.Y.) Press). These anchors can be used to attach the proteins to a solid support, such as, for example, a polymer matrix, which can, for example, be packed into a chromatography column, or can be used on a microtiter plate or another support.
These anchors can at the same time also be used for recognition of the proteins. It is also possible to use for recognition of the proteins conventional markers such as fluorescent dyes, enzyme markers which form a detectable reaction product after reaction with a substrate, or radioactive labels, alone or in combination with the anchors for derivatizing the proteins.
The invention additionally relates to a method for the microbiological production of vitamin B2 and/or precursors and/or derivatives thereof.
If the conversion is carried out with a recombinant microorganism, the microorganisms are preferably initially cultured in the presence of oxygen and in a complex medium, such as, for example, at a culturing temperature of about 20° C. or more, and at a pH of about 6 to 9 until an adequate cell density is reached. In order to be able to control the reaction better, it is preferred to use an inducible promoter. The culturing is continued in the presence of oxygen for 12 hours to 3 days after induction of vitamin B2 production.
The following nonlimiting examples describe specific embodiments of the invention.
General Experimental Details
a) General Cloning Methods
The cloning steps carried out for the purpose of the present invention, such as, for example, restriction cleavages, agarose gel electrophoresis, purification of DNA fragments, transfer of nucleic acids to nitrocellulose and nylon membranes, linkage of DNA fragments, transformation of E. coli cells, culturing of bacteria, replication of phages and sequence analysis of recombinant DNA, were carried out as described by Sambrook et al. (1989) loc. cit.
b) Polymerase Chain Reaction (PCR)
PCR was carried out in accordance with a standard protocol with the following standard mixture:
8 μl of dNTP mix (200 μM), 10 μl of Taq polymerase buffer (10×) without MgCl2, 8 μl of MgCl2 (25 mM), 1 μl of each primer (0.1 μM), 1 μl of DNA to be amplified, 2.5 U of Taq polymerase (MBI Fermentas, Vilnius, Lithuania), demineralized water ad 100 μl.
c) Culturing of E. coli
The recombinant E. coli DH5α strain was cultured in LB-amp medium (tryptone 10.0 g, NaCl 5.0 g, yeast extract 5.0 g, ampicillin 100 g/ml, H2O ad 1000 ml) at 37° C. For this purpose, in each case one colony was transferred, using an inoculating loop, from an agar plate into 5 ml of LB-amp. After culturing for about 18 hours shaking at a frequency of 220 rpm, 400 ml of medium in a 2 l flask were inoculated with 4 ml of culture. Induction of P450 expression in E. coli took place after the OD578 reached a value between 0.8 and 1.0 by heat-shock induction at 42° C. for three to four hours.
The required product can be isolated from the microorganism or from the culture supernatant by various methods known in the art. If the required product is not secreted by the cells, the cells can be harvested from the culture by slow centrifugation, and the cells can be lysed by standard techniques such as mechanical force or ultrasound treatment.
The cell detritus is removed by centrifugation, and the supernatant fraction which contains the soluble proteins is obtained for further purification of the required compound. If the product is secreted by the cells, the cells are removed from the culture by slow centrifugation, and the supernatant fraction is retained for further purification.
The supernatant fraction from the two purification methods is subjected to a chromatography with a suitable resin, with the required molecule either being retained on the chromatography resin, or passing through the latter, with greater selectivity than the impurities. These chromatography steps can be repeated if necessary, using the same or different chromatography resins. The skilled worker is proficient in the selection of suitable chromatography resins and their most effective use for a particular molecule to be purified. The purified product can be concentrated by filtration or ultrafiltration and be stored at a temperature at which the stability of the product is maximal.
Many purification methods are known in the art. These purification techniques are described, for example, in Bailey, J. E. & Ollis, D. F. Biochemical Engineering Fundamentals, McGraw-Hill: New York (1986).
The identity and purity of the isolated compounds can be determined by prior art techniques. These comprise high performance liquid chromatography (HPLC), spectroscopic methods, staining methods, thin layer chromatography, NIRS, enzyme assay or microbiological assays. These analytical methods are summarized in: Patek et al. (1994) Appl. Environ. Microbiol. 60:133-140; Malakhova et al. (1996) Biotekhnologiya 11 27-32; and Schmidt et al. (1998) Bioprocess Engineer. 19:67-70. Ullmann's Encyclopedia of Industrial Chemistry (1996) Vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, pp. 575-581 and pp. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of HPLC in Biochemistry in: Laboratory Techniques in Biochemistry and Molecular Biology, Vol.17.
e) General Description of the MPSS Method, Clone Identification and Homology Search
The MPSS technology (Massive Parallel Signature Sequencing as described by Brenner et al, Nat. Biotechnol. (2000) 18, 630-634; to which express reference is hereby made) was applied to the filamentous, vitamin B2-producing fungus Ashbya gossypii. It is possible with the aid of this technology to obtain with high accuracy quantitative information about the level of expression of a large number of genes in a eukaryotic organism. This entails the mRNA of the organism being isolated at a particular time X, being transcribed with the aid of the enzyme reverse transcriptase into cDNA and then being cloned into special vectors which have a specific tag sequence. The number of vectors with a different tag sequence is chosen to be high enough (about 1000 times higher) for statistically each DNA molecule to be cloned into a vector which is unique through its tag sequence.
The vector inserts are then cut out together with the tag. The DNA molecules obtained in this way are then incubated with microbeads which possess the molecular counterparts of the tags mentioned. After incubation it can be assumed that each microbead is loaded via the specific tags or counterparts with only one type of DNA molecules. The beads are transferred into a special flow cell and fixed there so that it is possible to carry out a mass sequencing of all the beads with the aid of an adapted sequencing method based on fluorescent dyes and with the aid of a digital color camera. Although numerically high analysis is possible with this method, it is limited by a reading width of about 16 to 20 base pairs. The sequence length is, however, sufficient to make an unambiguous correlation between sequence and gene possible for most organisms (20 bp have a sequence frequency of ˜1×1012; compared with this, the human genome has a size of “only” ˜3×109 bp).
The data obtained in this way are analyzed by counting the number of identical sequences and comparing their frequencies with one another. Frequently occurring sequences reflect a high level of expression, and sequences which occur singly a low level of expression. If the mRNA was isolated at two different time points (X and Y), it is possible to construct a chronological expression pattern of individual genes.
Isolation of mRNA from Ashbya gossypii
Ashbya gossypii was cultured in a manner known per se (nutrient medium: 27.5 g/l yeast extract; 0.5 g/l magnesium sulfate; 50 ml/l soybean oil; pH 7). Ashbya gossypii mycelium samples are taken at various times during the fermentation (24 h, 48 h and 72 h), and the corresponding RNA or mRNA is isolated therefrom according to the protocol of Sambrook et al. (1989).
Application of the MPSS
Isolated mRNA from A. gossypii is then subjected to an MPSS analysis as explained above.
The sets of data found are subjected to a statistical analysis and categorized according to the significance of the differences in expression. This entailed examination both in relation to an increase and a reduction in the level of expression. A division is made by classifying the change in expression into a) monotonic change, b) change after 24 h, and c) change after 48 h.
The 20 bp sequences representing a change in expression and found by MPSS analysis are then used as probes and hybridized with a gene library from Ashbya gossypii, with an average insert size of about 1 kb. The hybridization temperature in this case was in the range from about 30 to 57° C.
Construction of a Genomic Gene Library from Ashbya gossypii
To construct a genomic DNA library, initially chromosomal DNA is isolated by the method of Wright and Philippsen (Gene (1991) 109: 99-105) and Mohr (1995, PhD Thesis, Biozentrum Universitat Basel, Switzerland).
The DNA is partially digested with Sau3A. For this purpose, 6 μg of genomic DNA are subjected to a Sau3A digestion with various amounts of enzyme (0.1 to 1 U). The fragments are fractionated in a sucrose density gradient. The 1 kb region is isolated and subjected to a QiaEx extraction. The largest fragments are ligated to the BamHI-cut vector pRS416 (Sikorski and Hieter, Genetics (1988) 122; 19-27) (90 ng of BamHI-cut, dephosphorylated vector; 198 ng of insert DNA; 5 ml of water; 2 μl of 10× ligation buffer; 1 U ligase). This ligation mixture is used to transform the E. coli laboratory strain XL-1 blue, and the resulting clones are employed for identifying the insert.
Preparation of an Ordered Gene Library (CHIP Technology)
About 25,000 colonies of the Ashbya gossypii gene library (this corresponds to approximately a 3-fold coverage of the genome) were transferred in an ordered manner to a nylon membrane and then treated by the method of colony hybridization as described in Sambrook et al. (1989). Oligonucleotides were synthesized from the 20 bp sequences found by MPSS analysis and were radiolabeled with 32P. In each case 10 labeled oligonucleotides with a similar melting point are combined and hybridized together with the nylon membranes. After hybridization and washing steps, positive clones are identified by autoradiography and analyzed directly by PCR sequencing.
In this way, a clone which harbors an insert with the internal name “Oligo 28” and has significant homology with the MIPS tag “Yta7” or TBP7 from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 1.
In this way, a further clone which harbors an insert with the internal name “Oligo 45” and has significant homology with the MIPS tag “p39” or “Tif34” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 6.
In this way, a further clone which harbors an insert with the internal name “Oligo 85” and has significant homology with the MIPS tag “Rpl35a” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 12.
In this way, a further clone which harbors an insert with the internal name “Oligo 133” and has significant homology with the MIPS tag “Nop13” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 17.
In this way, a further clone which harbors an insert with the internal name “Oligo 172” and has significant homology with the MIPS tag “Sua5” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 21.
In this way, a further clone which harbors an insert with the internal name “Oligo 63” and has significant homology with the MIPS tag “Rps25a” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 26.
In this way, a further clone which harbors an insert with the internal name “Oligo 132” and has significant homology with the MIPS tag “Nic96” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 31.
In this way, a further clone which harbors an insert with the internal name “Oligo 174” and has significant homology with the MIPS tag “Ahcl” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 38.
In this way, a further clone which harbors an insert with the internal name “Oligo 51” and has significant homology with the MIPS tag “Rok1” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 42.
In this way, a further clone which harbors an insert with the internal name “Oligo 30” and has significant homology with the MIPS tag “Rpa34” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 48.
In this way, a further clone which harbors an insert with the internal name “Oligo 124” and has significant homology with the MIPS tag “Sub2” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 53.
In this way, a further clone which harbors an insert with the internal name “Oligo 139” and has significant homology with the MIPS tag “DCP1” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 58.
In this way, a further clone which harbors an insert with the internal name “Oligo 144” and has significant homology with the MIPS tag “PRT1” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 63.
In this way, a further clone which harbors an insert with the internal name “Oligo 168” and has significant homology with the MIPS tag “Rrp9” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 67.
In this way, a further clone which harbors an insert with the internal name “Oligo 160” and has significant homology with the MIPS tag “Rpl8b” from S. cerevisiae was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 72.
In this way, a further clone which harbors an insert with the internal name “Oligo 18” was identified. The insert has a nucleic acid sequence as shown in SEQ ID NO: 75 (complementary strand with SEQ ID NO: 74). A potential ORF is located between positions 958 and 1272 shown in SEQ ID NO: 75.
Analysis of the Sequence Data by Means of a BLASTX Search
An analysis of the resulting nucleic acid sequences, i.e. their functional assignment to a functional amino acid sequence took place by means of a BLASTX search in sequence databases. Almost all of the amino acid sequence homologies found related to Saccharomyces cerevisiae (baker's yeast). Since this organism had already been completely sequenced, more detailed information about these genes could be referred to under:
http://www.mips.gsf.de/proj/yeast/search/code search.htm.
The following homologies with amino acid fragments from S. cerevisiae were found in this way. The corresponding alignments are shown in FIGS. 1 to 15.
a) Amino acid sequences (corresponding to nucleotides 3 to 374 and 373 to 1479) derived from SEQ ID NO:1 have significant sequence homologies with a 26 S proteasome subunit or the TAT-binding homolog 7 (TBP7) from S. cerevisiae. A corresponding alignment is shown in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a 26 S proteasome subunit or a TAT-binding homolog 7 (TBP7).
b) An amino acid sequence derived from SEQ ID NO: 6 (cf. SEQ ID NO: 7; corresponding to nucleotides 5 to 463 in SEQ ID NO: 6) has significant sequence homology with a translation initiation factor (EIF3) subunit (P39) from S. cerevisiae. A corresponding alignment is shown in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a translation initiation factor subunit.
c) The amino acid sequence derived from the coding strand to SEQ ID NO: 12 has significant sequence homology with a ribosomal protein from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 469 to 825 from SEQ ID NO: 12) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a ribosomal protein.
d) The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 17 has significant sequence homology with a nucleolar protein from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 114 to 1 from SEQ ID NO: 17) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a nucleolar protein.
e) The amino acid sequence derived from the coding strand to SEQ ID NO: 21 has significant sequence homology with a translation initiation protein from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 2 to 349 from SEQ ID NO: 21) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a translation initiation protein.
f) The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 26 has significant sequence homology with a precursor of ribosomal protein S 31 from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 609 to 562 from SEQ ID NO: 26) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a precursor of ribosomal protein S 31.
g) An amino acid sequence derived from SEQ ID NO: 31 (cf. SEQ ID NO: 32, corresponding to nucleotides 108 to 764 in SEQ ID NO: 31) has significant sequence homology with a cell nuclear pore protein from S. cerevisiae.
The A. gossypii nucleic acid sequence found could thus be assigned the function of a cell nuclear pore protein.
h) The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 38 has significant sequence homology with a constituent of the ADH-histone acetyltransferase complex from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 174 to 1 from SEQ ID NO: 38) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a constituent of the ADH-histone acetyltransferase complex.
i) The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 42 has significant sequence homology with an S. cerevisiae RNA helicase which is involved in RNA processing. An amino acid part-sequence derived therefrom (corresponding to nucleotides 1086 to 1012 from SEQ ID NO: 42) with a part-sequence of the S. cerevisiae enzyme is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of an RNA helicase which is involved in RNA processing.
k) The amino acid sequence derived from the coding strand to SEQ ID NO: 48 has significant sequence homology with the nonessential constituent of RNA poll from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 1 to 102 from SEQ ID NO: 48) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned a function of the nonessential constituent of RNA poll.
l) The amino acid sequence derived from the coding strand to SEQ ID NO: 53 has significant sequence homology with an RNA helicase from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 2 to 148 from SEQ ID NO: 53) with a part-sequence of the S. cerevisiae enzyme is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of an RNA helicase.
m) The amino acid sequence derived from the coding strand to SEQ ID NO: 58 has significant sequence homology with an mRNA decapping enzyme from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 2 to 82 from SEQ ID NO: 58) with a part-sequence of the S. cerevisiae enzyme is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of an mRNA decapping enzyme.
n) The amino acid sequence derived from the coding strand to SEQ ID NO: 63 has significant sequence homology with an S. cerevisiae subunit to translation initiation factor eIF3. An amino acid part-sequence derived therefrom (corresponding to nucleotides 21 to 695 from SEQ ID NO: 63) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a subunit of translation initiation factor eIF3.
o) The amino acid sequence derived from the coding strand to SEQ ID NO: 67 has significant sequence homology with an S. cerevisiae U3 small nucleolar ribonucleoprotein-associated protein which is involved in preribosomal RNA processing. An amino acid part-sequence derived therefrom (corresponding to nucleotides 1 to 111 from SEQ ID NO: 67) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of a U3 small nucleolar ribonucleoprotein-associated protein which is involved in preribosomal RNA processing.
p) The amino acid sequence derived from the corresponding complementary strand to SEQ ID NO: 72 has significant sequence homology with a ribosomal protein (L7a.e.B/large 60 S subunit) from S. cerevisiae. An amino acid part-sequence derived therefrom (corresponding to nucleotides 508 to 176 from SEQ ID NO: 72) with a part-sequence of the S. cerevisiae protein is depicted in
The A. gossypii nucleic acid sequence found could thus be assigned the function of the ribosomal protein (L7a.e.B/large 60 S subunit).
Isolation of Full-Length DNA
a) Construction of an A. gossypii Gene Library
High molecular weight cellular complete DNA from A. gossypii was prepared from a 2-day old 100 ml culture grown in a liquid MA2 medium (10 g of glucose, 10 g of peptone, 1 g of yeast extract, 0.3 g of myo-inositol ad 1000 ml). The mycelium was filtered off, washed twice with distilled H2O, suspended in 10 ml of 1 M sorbitol, 20 mM EDTA, containing 20 mg of zymolyase 20T, and incubated at 27° C., shaking gently, for 30 to 60 min. The protoplast suspension was adjusted to 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 100 mM EDTA and 0.5% strength sodium dodecyl sulfate (SDS) and incubated at 65° C. for 20 min. After two extractions with phenol/chloroform (1:1 vol/vol), the DNA was precipitated with isopropanol, suspended in TE buffer, treated with RNase, reprecipitated with isopropanol and resuspended in TE.
An A. gossypii cosmid gene library was produced by binding genomic DNA which had been selected according to size and partially digested with Sau3A to the dephosphorylated arms of the cosmid vector Super-Cos1 (Stratagene). The Super-Cos1 vector was opened between the two cos sites by digestion with XbaI and dephosphorylation with calf intestinal alkaline phosphatase (Boehringer), followed by opening of the cloning site with BamHI. The ligations were carried out in 20 μl, containing 2.5 μg of partially digested chromosomal DNA, 1 μg of Super-Cos1 vector arms, 40 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 1 mM dithiothreitol, 0.5 mM ATP and 2 Weiss units of T4-DNA ligase (Boehringer) at 15° C. overnight. The ligation products were packaged in vitro using the extracts and the protocol of Stratagene (Gigapack II Packaging Extract). The packaged material was used to infect E. coli NM554 (recA 13, araD139, Δ(ara,leu)7696, Δ(lac)17A, galU, galK, hsrR, rps(strr), mcrA, mcrB) and distributed on LB plates containing ampicillin (50 μg/ml). Transformants containing an A. gossypii insert with an average length of 30-45 kb were obtained.
b) Storage and Screening of the Cosmid Gene Library
In total, 4×104 fresh single colonies were inoculated singly into wells of 96-well microtiter plates (Falcon, No. 3072) in 100 μl of LB medium, supplemented with the freezing medium (36 mM K2HPO4/13.2 mM KH2PO4, 1.7 mM sodium citrate, 0.4 mM MgSO4, 6.8 mM (NH4)2SO4, 4.4% (w/v) glycerol) and ampicillin (50 μg/ml), allowed to grow at 37° C. overnight with shaking, and frozen at −70° C. The plates were rapidly thawed and then duplicated in fresh medium using a 96-well replicator which had been sterilized in an ethanol bath with subsequent evaporation of the ethanol on a hot plate. Before the freezing and after the thawing (before any other measures) the plates were briefly shaken in a microtiter shaker (Infors) in order to ensure a homogeneous suspension of cells. A robotic system (Bio-Robotics) with which it is possible to transfer small amounts of liquid from 96 wells of a microtiter plate to nylon membrane (GeneScreen Plus, New England Nuclear) was used to place single clones on nylon membranes. After the culture had been transferred from the 96-well microtiter plates (1920 clones), the membranes were placed on the surface of LB agar with ampicillin (50 μg/ml) in 22×22 cm culture dishes (Nunc) and incubated at 37° C. overnight. Before cell confluence was reached, the membranes were processed as described by Herrmann, B. G., Barlow, D. P. and Lehrach, H. (1987) in Cell 48, pp. 813-825, including as additional treatment after the first denaturation step a 5-minute exposure of the filters to vapors on a pad impregnated with denaturation solution on a boiling water bath.
The random hexamer primer method (Feinberg, A. P. and Vogelstein, B. (1983), Anal. Biochem. 132, pp. 6-13) was used to label double-stranded probes by uptake of [alpha-32P]dCTP with high specific activity. The membranes were prehybridized and hybridized at 42° C. in 50% (vol/vol) formamide, 600 mM sodium phosphate, pH 7.2, 1 mM EDTA, 10% dextran sulfate, 1% SDS, and 10× Denhardt's solution, containing salmon sperm DNA (50 μg/ml) with 32P-labeled probes (0.5-1×106 cpm/ml) for 6 to 12 h. Typically, washing steps were carried out at 55 to 65° C. in 13 to 30 mM NaCl, 1.5 to 3 mM sodium citrate, pH 6.3, 0.1% SDS for about 1 h and the filters were autoradiographed at −70° C. with Kodak intensifying screens for 12 to 24 h. To date, individual membranes have been reused successfully more than 20 times. Between the autoradiographies, the filters were stripped by incubation at 95° C. in 2 mM Tris-HCl, pH 8.0, 0.2 mM EDTA, 0.1% SDS for 2×20 min.
c) Recovery of Positive Colonies from the Stored Gene Library
Frozen bacterial cultures in microtiter wells were scraped out using sterile disposable lancets, and the material was streaked onto LB agar Petri dishes containing ampicillin (50 μg/ml). Single colonies were then used to inoculate liquid cultures to produce DNA by the alkaline lysis method (Bimboim, H. C. and Doly, J. (1979), Nucleic Acids Res. 7, pp. 1513-1523).
d) Full-Length DNA
It was possible as described above to identify clones which harbor an insert with the appropriate complete sequence. These clones have the internal names given below:
“Oligo 28v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 4.
“Oligo 45v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 10.
“Oligo 85v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 14. The protein encoded thereby preferably comprises at least one of the amino acid sequences shown in SEQ ID NO: 15 and 16.
“Oligo 133v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 19.
“Oligo 172v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 24.
“Oligo 63v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 29.
“Oligo 132v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 36.
“Oligo 174v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 40.
“Oligo 51v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 46.
“Oligo 30v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 51.
“Oligo 124v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 56.
“Oligo 139v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 60. The protein encoded thereby preferably comprises at least one of the amino acid sequences as shown in SEQ ID NO: 61 and 62.
“Oligo 144v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 65.
“Oligo 168v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 70.
“Oligo 18v”. The insert comprising the complete sequence has a nucleic acid sequence as shown in SEQ ID NO: 77.
Detection of a Modulating Effect of Oligo 18 on Vitamin B2 Production
In order to test whether integration of DNA in the vicinity of the potential reading frame of oligo 18 has adverse effects on riboflavin synthesis, a DNA fragment was integrated by means of homologous recombination into the genome of the Ashbya gossypii strain used (Ashbya TEF promoter+G418 resistance gene—cf.
Shaken flask experiments for riboflavin determination:
10 ml of preculture medium (9.5 ml [9.5 g] of medium+0.5 ml soybean oil) in a 100-mi 2-baffle Erlenmeyer flask are inoculated with 0.5 ml of a glycerol culture or with about one inoculating loop of mycelium from a 5-day old, well grown SP agar plate, and shaken at 180 rpm (cabinet shaker, excursion 2.5 cm) and 28° C. for 40 hours.
1.1 ml of this culture are used to inoculate 25.7 ml of main culture medium (21.2 ml [21.2 g] of medium, 1 ml of urea [10 g/45 ml]+3.5 ml [3.2 g] of soybean oil, final volume=26.8 ml, of which 4.4 ml compensates for evaporation during shaking without humidification) or 21.8 ml of main culture medium (17.3 ml [17.3 g] of medium, 1 ml of urea [10 g/45 ml]+3.5 ml [3.2 g] of soybean oil, final volume=22.9 ml, of which 0.5 ml compensates for evaporation during shaking in an environment without artificial humidification) in a 250 ml Erlenmeyer flask and shaken at 220 rpm (industrial shaker, excursion 5 cm) or 300 rpm (cabinet shaker, excursion 2.5 cm) and 28° C. for 5 days.
0.5 ml of the main culture is vigorously shaken with 4.5 ml [5 g] of a 40% strength nicotinamide solution (dilution factor 10) or 0.25 ml with 4.75 ml [5.27 g] of a 40% strength nicotinamide solution (dilution factor 20) in a test tube and incubated in a water bath at 70° C. for about 2×20 minutes (cells lyzed, shaking in between). After cooling, 40 μl are put in a macro dispersible cuvette, mixed with 3 ml of deionized water and measured as quickly as possible in a photometer, because vitamin B2 decomposes very rapidly. This entails measurement of the extinctions at 402, 446 and 550 nm and calculation as follows:
V=(W1−W2×C+W3×(C−1)):(B1−B2×C)
with
The A. gossypii nucleic acid sequence found could on the basis of the above observations be assigned the function of a protein for modulating the vitamin B2 productivity.
Number | Date | Country | Kind |
---|---|---|---|
10142056.6 | Aug 2001 | DE | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP02/09656 | 8/29/2002 | WO |