The present invention relates to the field of biology, particularly to the field of biotechnology. More particular, the present invention relates to the field of RNA-Biology and its implication in regulating gene expression, e.g. for expressing genes of interest in a host cell.
In bacteria all steps of transcription, from initiation to elongation and termination, are prone to regulation by trans-acting proteins and small RNAs. Additionally, the nascent RNA itself contains many cis-acting regulatory signals. Riboswitches are cis-acting RNA elements mainly encoded in the 5′ UTR of the nascent RNA that directly sense small metabolites, ions, temperature, or pH, leading to regulation of transcription, translation, or RNA processing (A. Serganov, E. Nudler, Cell. 152, 17-24 (2013); and S. Proshkin, A. Mironov, E. Nudler, Biochim. Biophys. Acta (2014), doi:10.1016/j.bbagrm.2014.04.002). Termination signals are another example of regulatory RNA signals that can be categorized into two major groups: intrinsic and factor-dependent terminators (E. Nudler, M. E. Gottesman, Genes to Cells. 7, 755-768 (2002); and J. M. Peters, A. D. Vangeloff, R. Landick, J. Mol. Biol. 412, 793-813 (2011)). GC-rich RNA hairpins followed by a stretch of uridines are intrinsic signals that cause termination independently of cofactors. Termination at factor-dependent signals requires the presence of protein factors, such as Rho, recognizing particular RNA regions. Nascent RNA can also increase the processivity of RNAP providing factor-dependent or -independent antitermination signals. For example, phage HK022 put RNA binds directly to RNAP throughout elongation rendering it resistant to termination (R. Sen, R. A. King, R. A. Weisberg, Mol. Cell. 7, 993-1001 (2001); and N. Komissarova et al., Mol. Cell. 31, 683-94 (2008)). Factor-dependent cis-acting nut RNA (box A+box B) assists phage λ N protein during processive antitermination together with bacterial NusA, NusB, S10. No common attributes could until today be found. All of the known regulatory elements, however, request a distinct secondary structure which makes them difficult to design. Hence, there is a need for regulatory sequences of RNA not needing distinct secondary structures.
Heterogenous expression of sequences in host cells is often prone to sequences having regulatory effects when introduced into a host cell for expression. They may inhibit expression, e.g. in terms of termination of transcription or translation. For example, a heterogenous sequence may contain sequences having inhibitory effects on expression in the host cell. This would result in an unwanted restriction on expression. Furthermore, output of the target sequence expression in host cells can be affected by surrounding sequences with regulatory effects in the host cell. Moreover, unwanted transcription of the reverse complement strand, e.g. by unidentified promoter sequences, may cause a loss transcription efficiency due to mutual hindrance of the polymerases transcribing the opposing strands. There is hence a need for a tool to allow proper expression of sequence of interest in a host cell irrespective of the potential presence of inhibitory elements within the sequence to be expressed. In particular tools are missing or desired that allow an efficient production of recombinant proteins in host cells.
While start and amount of transcription may often be regulated or enhanced using promoters, such as inducible promoters, the regulation of translation efficiency is yet insufficiently solved.
Furthermore, the expression of a certain sequence, e.g. leading to a protein, in a heterologous host cell may cause difficulties when the protein has deleterious effects on the host cell. Nowadays, expression systems are employed which make use of inducible promoters. Such promoters effectively initiate transcription only in case a certain stimulus is present, e.g. a certain compound is added to the culturing medium. However, such promoters often promote basal expression in the un-stimulated environment leading to a certain leakiness of the promoter. In case of proteins with deleterious effects this may have adverse effects and may lead to ineffective expression once the expression shall be induced.
The inventors now provide a novel class of regulating RNA aptamers which provide a powerful in overcoming the above outlined drawbacks.
The inventors interestingly found short RNA-polymerase binding aptamers (termed RAPs) that interact with RNA-polymerases with high affinity. The inventors did identify several naturally occurring RAPs. They furthermore, showed that these aptamers are able to regulate expression of the DNA-sequences they are encoded on. Thereby a completely new tool is provided to regulate RNA-polymerases and thereby a tool for regulating expression of DNA-encoded molecules, like RNAs and/or Proteins, in a host cell.
Hence, the present invention in one aspect relates to a RNA molecule comprising a RNA-polymerase binding aptamer, wherein said RNA-polymerase binding aptamer has a length of 15 to 60 nt, and wherein said RNA-polymerase binding aptamer binds RNA-polymerase with a high affinity, preferably with a KD of 50 nM or lower, preferably with a KD of 10 nM or lower.
The invention also relates to a DNA-molecule encoding a RNA-polymerase binding aptamer according to the present invention. The invention further relates to an expression cassette comprising a sequence encoding the RNA-polymerase binding aptamer according to the present invention and a promoter, wherein the sequence encoding said aptamer is operatively linked to said promoter.
The invention furthermore relates to a vector, preferably a vector for expression of a sequence of interest, said vector comprising a sequence encoding a RNA-polymerase binding aptamer according to the present invention. The vector in a preferred embodiment further comprises a promoter, a multiple cloning site for integration of the sequence of interest. The vector may comprise a DNA-molecule according to the invention, preferably an expression cassette according to the invention. Furthermore, in one aspect of the invention the expression cassette comprises the reverse complement sequence of the sequence encoding an inhibitory RNA-polymerase binding aptamer downstream of the promoter. In one aspect of the invention, the expression cassette comprises a promoter, a DNA sequence to be expressed or a multiple cloning site for inserting a DNA sequence to be expressed and a DNA sequence coding for a 3′UTR downstream of said DNA sequence to be expressed or said multiple cloning site, wherein the DNA sequence coding for a 3′UTR comprises a the reverse complement of a DNA sequence encoding for an inhibiting RNA-Polymerase binding aptamer according to the invention.
Also encompassed by the present invention is a host cell comprising a RNA-polymerase binding aptamer according to the present invention. Furthermore, the invention relates to a host cell comprising DNA-molecule comprising a sequence encoding a RNA-polymerase binding aptamer according to the present invention.
Furthermore, the invention relates to a method for expression of a sequence of interest, preferably a DNA sequence, comprising the steps of:
The invention, furthermore, relates to the use of RNA-molecule or the DNA-molecule or the expression cassette or the vector of the present invention to regulate expression of a sequence to be expressed. In other words, the RNA-polymerase binding aptamer according to the invention is employed in methods for expressing a sequence to be expressed, e.g. in cis by encoding the aptamer on the same DNA molecule as the sequence to be expressed is encoded on.
(A) Design of the experiments for evaluating effect of an activating RAP (RAP ID #5713; SEQ ID NO:2) on transcription in trans. E. coli cells were re-transformed with two compatible plasmids carrying different resistance markers (shown schematically as circles). The first plasmid (pMW) encodes GFP reporter gene with one of two well characterized terminators; i.e. either carrying the rho-dependent rrnB (pWM_rrnB) or the rut intrinsic terminator (pWM_rut). The second plasmid carries an arabinose-inducible promoter for overexpression of RAP #5713 in a form of bacterial small (sRNA). (B) Left panel: sRNA design, sRNA without (sRNA) and with (RAP_sRNA). The scaffold of DsrA, RprA and their hybrid was used to create 3 different RAP-containing sRNAs (RAP_sRNA1, RAP_sRNA2 and RAP_sRNA3. respectively) by replacing the base-pairing region with the corresponding RAP sequence. The location of a probe for RAP-containing sRNA detection is depicted as an elliptic bar. Right panel: Northern blot analysis of total RNA from transformed E. coli cells using a probe hybridizing to RAP #5713. The Northern blot shows the accumulation of RAP#5713-containing full-length stable transcripts (thicker (major) band indicated with an arrow), overexpressed from RAP_sRNA (pBAD30-based) plasmid upon induction with L-arabinose (1%). Probing for the 5S RNA was performed for the loading control.
(C)-(D) The results of co-transformation with two plasmids: pWM with intrinsic rrnB terminator (C) or rut site (D) (schematically shown as hairpin or rectangle, respectively) upstream of the reporter GFP. Further, pBAD30-based RAP_sRNA plasmid for overxpression of RAP-containing sRNAs (upper panel). “P” (grey triangle) indicates the location of constitutive bacterial RNAP promoter. TSS: transcription start site; RBS: ribosomal binding site. Middle and lower panels: E. coli cells co-transformed with described GFP reporter plasmid and different plasmids for RAP_sRNA overexpression were grown on LB agar plates followed by fluorescence intensity measurements (middle panel—GFP mode). The same plate was also captured under visible light showing bacterial density of different strains (lower panel—light mode).
(D) E. coli growth experiment with real-time fluorescence detection. Cells transformed with nad(RAP15)-GFP or nad(rev #15)-GFP reporter constructs (shown in blue and green) were grown in LB media with simultaneous cell density (upper panel) and GFP intensities (lower panel) measurements. Grey vertical line separates exponential and stationary phases. At each time point values represent means±SD, n=3.
(E) A model for RAP-mediated transcription termination. (1) Coupling between the leading ribosome and RNAP leaves no room for Rho to load on the nascent transcript (S. Proshkin, A. R. Rahmouni, A. Mironov, E. Nudler, Science. 328, 504-8 (2010), B. M. Burmann et al., Science. 328, 501-4 (2010)); (2) An inhibitory RAP emerging from the RNAP exit channel interacts with RNAP, resulting in ribosome pausing and RNA looping; (3) Naked unstructured RNA becomes available for Rho to load and terminate transcription.
The inventors interestingly found that the short RNA-polymerase binding aptamers interacting with RNA-polymerases with high affinity regulate expression of the DNA-sequences they are encoded on. Thereby a completely new tool is provided to regulate RNA-polymerases and thereby a tool for regulating expression of DNA-encoded molecules, like RNAs and/or Proteins, in a host cell. Hence, the present invention in particular relates to the use of the RNA-polymerase binding aptamers as disclosed herein for the use in regulating expression of a sequence of interest. The inventors found that the inventive RNA-polymerase binding aptamers surprisingly act in cis, i.e. they regulate the expression of nucleic acid they are encoded on and transcribed with, e.g. of the DNA strand they are encoded on. Hence, the RNA-polymerase binding aptamer according to the invention in all its embodiments and aspects is preferably used in cis. Be used in cis in context of the present invention means that the sequence encoding the RNA-polymerase binding aptamer is positioned such that it is transcribed together with the sequence to be expressed, i.e. it forms part of the same RNA when transcribed. Generally, this means that the sequence encoding the RNA-polymerase binding aptamer is positioned downstream of the transcription starting site and upstream of a transcription terminating signal of the constructs of the present invention.
The RNA-polymerase binding aptamer according to the present invention is preferably RNA. Further preferred, said RNA-polymerase binding aptamer may be encoded on another nucleic acid, e.g. and preferably DNA. Hence, the invention also relates to a DNA molecule comprising a sequence encoding said RNA-polymerase binding aptamer.
Methods to determine the degree of affinity are well established and known at the RNA biochemistry field. In particular methods are known to determine the affinity between a nucleic acid molecule, like RNA or DNA, and a protein or polypeptide, like RNA-Polymerase. Affinity of interactions can be expressed in terms of dissociation constant. The dissociation constant is commonly used to describe the affinity between a ligand (“L”; in the present case the RNA-polymerase binding aptamer) and a protein (“P”; in the present case RNA-polymerase). In other words, the dissociation constant expresses how tightly a ligand binds to a particular protein. Ligand-protein affinities are influenced by non-covalent intermolecular interactions between the two molecules such as hydrogen bonding, electrostatic interactions, hydrophobic and Van der Waals forces. The formation of a ligand-protein complex (“C”) can be described by a two-state process
C
P+L
The respective dissociation constant (KD) can be expressed as
[P], [L] and [C] represent molar concentrations of the protein, ligand and complex, respectively. The unit of the dissociation constant is molarity (M) and corresponds to the concentration of ligand at which the binding site on a particular protein is half occupied, i.e. the concentration of ligand, at which the concentration of protein with ligand bound [C], equals the concentration of protein with no ligand bound [P]. The smaller the dissociation constant, the more tightly bound the ligand is, or the higher the affinity between ligand and protein. For example, a ligand with a nanomolar (nM) dissociation constant binds more tightly to a particular protein than a ligand with a micromolar (μM) dissociation constant. The determination of KD for an RNA-polymerase aptamer according to the present invention may be performed with methods well known to the skilled person. For example the RNA-polymerase aptamer (being RNA) could be analysed by RNA electrophoretic mobility shift assays (RNA EMSAs). In this case EMSA experiments are performed with an in vitro transcribed RNA aptamer and purified RNA polymerase. To determine the KD values one several binding reactions are performed with a fixed concentration of radioactively labeled RNA and increasing concentrations of the protein of choice (RNAP). The detailed step-by-step protocol is provided in “Handbook of RNA Biochemistry” (2014), Edited by R. K. Hartmann, A. Bindereif, A. Schön, E. Westhof. Volume 2, pp. 975-983, Wiley, which is incorporated herein by reference. The inventors have ascertained that the aptamers bind to the RNA-polymerase with the desired high affinity by repeatedly applying the SELEX procedure (7 cycles of genomic SELEX resulting in dissociation constant of about 10 nM). Hence, the skilled person may perform EMSA experiments for the RNA aptamer of choice (e.g. in vitro transcribed) in order to determine the exact Kd if required. Such procedure is for example described in N. Windbichler, F. von Pelchrzim, O. Mayer, E. Csaszar, R. Schroeder, RNA Biol. 5, 30-40, and genomic SELEX is described in C. Lorenz, F. von Pelchrzim, R. Schroeder, Nat. Protoc. 1, 2204-12 (2006), which are both incorporated herein by reference.
The RNA-polymerase binding aptamer according to the present invention binds the RNA-Polymerase with a KD of 50 nM or less, more preferably with a KD of 10 nM or less. The RNA-polymerase binding aptamer according to the present invention preferably binds the RNA-polymerase with a KD of 50 nM or less, more preferably with a KD of 10 nM or less as a single stranded RNA. It will be understood by the skilled person that the RNA-polymerase binding aptamer according to the present invention is able to bind the RNA-polymerase as single stranded RNA, i.e. without the need for being hybridized to other nucleic acid, such as DNA or another RNA strand. Hence, preferably the RNA-polymerase binding aptamer according to the present invention binds the RNA-polymerase according to the present invention as single stranded RNA with a KD of 50 nM or less, more preferably with a KD of 10 nM or less. The RNA-polymerase binding aptamer according to the present invention preferably binds the RNA-polymerase with a KD of 50 nM or less, more preferably with a KD of 10 nM or less as a single stranded RNA.
The inventors found RNA-polymerase binding aptamers binding RNA-polymerase from Escherichia coli. However, as will be apparent to the skilled person, it is plausible that the invention is likewise applicable for other RNA-polymerases, e.g. RNA-polymerases of other prokaryotic or eukaryotic organisms. The skilled person will acknowledge that applicable also to the eukaryotic RNA-polymerases, as they share high structural similarities with prokaryotic in general and bacterial RNA-polymerases in particular. Furthermore, the inventors have proven in the enclosed Example section that also RNA-polymerase of eukaryotic organisms binding aptamers are a tool for regulating the expression of a sequence to be expressed. Hence, in one embodiment of the present invention said RNA-polymerase binding aptamer is binding to a RNA-polymerase selected from the group consisting of a prokaryotic RNA-polymerase, and a eukaryotic RNA-polymerase, particularly preferred the RNA-polymerase is a prokaryotic RNA-polymerase, more preferably RNA-polymerase from E. coli. Eukaryotic RNA polymerases II and III are the most closely related to bacterial RNA-polymerases (see e.g. Ebright R H (2000), “RNA polymerase: structural similarities between bacterial RNA polymerase and eukaryotic RNA polymerase II”. J. Mol. Biol. 304(5): 687-698; and Nielsen S, Yuzenkova Y, Zenkin N. (2013), Mechanism of eukaryotic RNA polymerase III transcription termination Science, 340(6140):1577-1580) for which the Examples show data. Hence, the eukaryotic RNA-polymerase according to the present invention is preferably selected from the group consisting of eukaryotic RNA-polymerase II, and eukaryotic RNA-polymerase III, particularly preferred eukaryotic RNA-polymerase II, such as from yeast or mammalian origin.
The RNA-polymerase is preferably of the organism in which the expression by RNA-polymerase shall be regulated, e.g. if the RNA-polymerase binding aptamer is intended to regulate expression of a sequence to be expressed in yeast it preferably binds RNA-polymerase of yeast or mammalian cells, such as human cells, non-human primate cells and rodent cells, including hamster cells such as BHK21, BHK, CHO, DG44, or derivatives/descendants of these cell lines. Also suitable are myeloma cells from the mouse, preferably NS0 and Sp2/0-AG14 cells and human cell lines such as HEK293 or PER.C6, as well as derivatives/descendants of these mouse and human cell line, e.g. RNA-polymerase II or RNA-polymerase III from yeast or these mammalian cell lines. If the RNA-polymerase shall be regulated in a bacterium, e.g. E. coli, the RNA-polymerase the aptamer is binding to is the RNA-polymerase of this organism, e.g. E. coli.
In one embodiment, the prokaryotic RNA-polymerase is a bacterial RNA-polymerase, more preferably a bacterial RNA-polymerase derived from gram negative bacteria or gram positive bacteria, most preferably gram negative bacteria. In one particular preferred embodiment the RNA-polymerase according to the present invention is of Escherichia coli. It will be understood by the skilled artisan that the RNA-polymerase binding aptamer may interact with any subunit of the RNA-polymerases according to the present invention. However, in a preferred embodiment of the invention the RNA-polymerase binding aptamer binds the RNA-Polymerase holoenzyme, preferably with a KD of 50 nM or less, more preferably with a KD of 10 nM or less. For example, such RNA-polymerase holoenzyme of the RNA-polymerase of a bacterium, e.g. E. coli, may comprise or consist of RNA-polymerase core and the sigma 70.
In one embodiment, the eukaryotic RNA-polymerase is a RNA-polymerase of fungal or mammalian origin. In one preferred embodiment the RNA-polymerase according to the present invention is of yeast or a mammalian cell line such as human cells, non-human primate cells and rodent cells, including hamster cells such as BHK21, BHK, CHO, DG44, or derivatives/descendants of these cell lines. Also suitable are myeloma cells from the mouse, preferably NS0 and Sp2/0-AG14 cells and human cell lines such as HEK293 or PER.C6, as well as derivatives/descendants of these mouse and human cell. It will be understood by the skilled artisan that the RNA-polymerase binding aptamer may interact with any subunit of the RNA-polymerases according to the present invention. However, in a preferred embodiment of the invention the RNA-polymerase binding aptamer binds a eukaryotic RNA-Polymerase II, preferably with a KD of 50 nM or less, more preferably with a KD of 10 nM or less. For example, such RNA-polymerase II is from yeast or a mammalian cell line, such as human cells or the cell lines listed above.
The RNA-polymerase binding aptamers according to the present invention are short. Nowadays, bacteriophage RNA sequences are known that regulate expression DNA sequences. However, such sequences are rather long and require particular and distinct secondary structures. Examples from phages like HK022 exist where the phage RNA sequences have a length of significantly more than 60 nt and show very particular secondary structures with two long stems. However, the present invention is based on the finding that shorter RNA aptamers, e.g. having a length of 23 to 50 nt, are able to regulate RNA-Polymerase activity when they are directly interacting with said RNA-Polymerase. Hence, the RNA-polymerase binding aptamer according to the present invention have a length of 15 to 60 nt, preferably 20 to 50 nt, more preferably 23 to 50 nt.
The need for structural complexity for the previously described regulatory bacteriophage RNAs make them prone to misfolding dependent on the sequence they are attached to. However, the inventors now found that the natural bacterial or eukaryotic RNA-polymerase binding aptamer according to the present invention supersedes the need for distinct secondary structures. Without being bound by theory, it is believed that this property is conveyed by the high affinity to RNA-polymerases. Hence, in one embodiment the RNA-polymerase binding aptamer according to the present invention do not posses a defined secondary structure, preferably as predicted by RNfold Software (Viena RNA Package, RNAfold—http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi (R. Lorenz, S. H. Bernhart, C. Hoener zu Siederdissen, H. Tafer, C. Flamm, P. F. Stadler and I. L. Hofacker (2011), “ViennaRNA Package 2.0”, Algorithms for Molecular Biology: 6:26; and I. L. Hofacker, W. Fontana, P. F. Stadler, S. Bonhoeffer, M. Tacker, P. Schuster (1994), “Fast Folding and Comparison of RNA Secondary Structures”, Monatshefte f. Chemie: 125, pp 167-188) and/or Mfold (http://mfold.rna.albany.edu/?q=mfold (Zuker, M (2003), Mfold web server for nucleic acid folding and hybridization prediction, Nucl Acid Res, 31:3406-3415)).
The inventors have identified several preferred aptamers. Hence, the RNA-polymerase binding aptamer according to the present invention is in one embodiment encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a further embodiment the RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In yet a further preferred embodiment the RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a particular preferred embodiment the RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a further particular embodiment the RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. A particular preferred RNA-polymerase binding aptamer according to the present invention is encoded by the sequence of SEQ ID NO:2 or a sequence having at least 80% identity thereto, preferably at least 90% identity, more preferably at least 95% identity, yet more preferred at least 97% identity, even more preferred at least 99% identity. The invention also relates to a nucleic acid, e.g. RNA or DNA, hybridizing to any of these sequences under stringent conditions.
The inventors unexpectedly found that expression of DNA molecules can be regulated by introducing a sequence encoding a RNA-polymerase binding aptamer according to the present invention. The above outlined properties of the RNA-polymerase binding aptamer according to the present invention are properties of the RNA-molecule. Hence, in a particular preferred embodiment the RNA-polymerase binding aptamer according to the present invention is RNA.
The sequences given herein show the DNA-sequence encoding the preferred aptamers. Hence, whenever a DNA-sequence or an encoding sequence is referred to, the sequence is preferably the one given under the respective SEQ ID NO. By being encoded by a certain SEQ ID NO: it is meant that the RNA aptamer has the sequence shown in the referred sequence except for the nucleobase thymin (T) being exchanged by an uracil (U) and the molecule being an RNA molecule rather than a DNA molecule.
However, as the skilled person will readily acknowledge, these aptamers are particularly useful to control and/or regulate expression of a DNA molecule, e.g. a DNA molecule the aptamer is encoded on. The inventors found that the aptamer is particularly useful for regulating expression of RNA from DNA in cis. Hence, the present invention provides for DNA molecules comprising a sequence encoding the RNA-polymerase binding aptamer according to the present invention and allows thereby regulation of the expression of and from said DNA molecule. The invention, hence, relates to a DNA molecule encoding an RNA-polymerase binding aptamer according to the present invention.
The DNA molecule preferably also comprises a sequence to be expressed. Such sequence to be expressed according to the present invention is preferably a nucleotide sequence, particularly preferred a DNA sequence, encoding a RNA molecule or protein of interest. For example the encoded molecule of interest may be a regulatory RNA or a protein or polypeptide. The skilled person will instantly acknowledge that the invention is not limited to a particular type of sequence to be expressed, as the inventors have shown that expression is regulated and influenced through the RNA-polymerase binding aptamers on both levels, the transcription product level (e.g. mRNA) as well as on the subsequent translation product level (e.g. a protein or polypeptide). Nevertheless, DNA sequences are particular preferred sequences to be expressed, preferably encoding a RNA molecule or protein (polypeptide) of interest.
Several types of regulatory RNA-elements are known, for example some can down-regulate gene expression by being complementary to a part of an mRNA or a gene's DNA. MicroRNAs (miRNA; 21-22 nt) are found in eukaryotes and act through RNA interference (RNAi), where an effector complex of miRNA and enzymes can cleave complementary mRNA, block the mRNA from being translated, or accelerate its degradation. While small interfering RNAs (siRNA; 20-25 nt) are often produced by breakdown of viral RNA, there are also endogenous sources of siRNAs. siRNAs act through RNA interference in a fashion similar to miRNAs. Some miRNAs and siRNAs can cause genes they target to be methylated, thereby decreasing or increasing transcription of those genes. Animals have Piwi-interacting RNAs (piRNA; 29-30 nt) that are active in germline cells and are thought to be a defense against transposons and play a role in gametogenesis. Many prokaryotes have CRISPR RNAs, a regulatory system similar to RNA interference. Antisense RNAs are widespread; most downregulate a gene, but a few are activators of transcription. One way antisense RNA can act is by binding to an mRNA, forming double-stranded RNA that is enzymatically degraded. There are many long non-coding RNAs that regulate genes in eukaryotes, one such RNA is Xist, which coats one X chromosome in female mammals and inactivates it.
An mRNA may further contain regulatory elements itself, such as riboswitches, in the 5′ untranslated region or 3′ untranslated region; these cis-regulatory elements regulate the activity of that mRNA. The untranslated regions can also contain elements that regulate other genes. Hence, in one embodiment of the present invention the DNA-molecule comprises a sequence encoding for one or more further regulatory RNA-elements.
The DNA-molecule or expression cassette or vector, infra, can further comprise expression control sequences operably linked to said sequence to be expressed. These expression control sequences may be suited to ensure transcription and synthesis of RNA, like translatable RNA, in bacteria; such as E. coli; or eukaryotes; such as fungi or mammalian cell lines.
In a preferred embodiment of the present invention the sequence to be expressed encodes a polypeptide or protein of interest, e.g. a protein which is to be expressed.
“Expression” refers to transcription and translation occurring within a host cell. The level of expression of a DNA molecule in a host cell may be determined on the basis of either the amount of corresponding mRNA that is present within the cell or the amount of the DNA-encoded protein produced by the host cells. Further detail for the term “expression” within the context of the present invention can be obtained via a review of Sambrook et al. (2012), “Molecular Cloning: A Laboratory Manual”, 4th edition, Cold Spring Harbor Laboratory Press, ISBN: 978-1-936113-42-2, which is incorporated herein by reference.
The inventors found that the sequence encoding the RNA-polymerase binding aptamer according to the present invention may be located at differing positions within a DNA-molecule in order to regulate its expression. However, in a particular preferred embodiment the sequence encoding the RNA-polymerase binding aptamer according to the present invention is operatively linked to a promoter, preferably located downstream of a promoter. This is particularly preferred, as the results presented show that the effect achieved by the invention is present in particular if the aptamer is located downstream of the promoter and thus is being transcribed.
The DNA-molecule according to the present invention at least comprises a sequence encoding the RNA-polymerase binding aptamer according to the present invention. It may contain further elements as outlined herein in great detail. However, these elements may themselves not have any functional properties, like being a promoter, transcription start site, being a sequence to be expressed. For example, the DNA molecule according to the present invention may also be used for integration purposes into the genome of a host cell or organism. In such case one or more sequences flanking the RNA-polymerase binding aptamer encoding sequences may be sufficient to trigger integration into the host's genome, preferably site specific. The mechanism used for integration may be for instance homologous recombination, non-specific incorporation of viral DNA throughout the host genome (Youngsuk et al., “Current Advances in Retroviral Gene Therapy”, Curr Gene Ther. 2011 June; 11(3): 218-228), adaptation of CRISP/Cas9 system (Harrison et al, “A CRISPR view of development”, 10.1101/gad.248252.114 Genes & Dev. 2014. 28: 1859-1872), etc.
The sequence encoding the RNA-polymerase binding aptamer according to the present invention is preferably operatively linked to a promoter on the DNA molecule or vector according to present invention to regulate expression from said promoter. The sequence encoding the RNA-polymerase binding aptamer according to the present invention is another embodiment is positioned on the DNA-molecule according to the present invention to operatively link said sequence to a promoter after integration of the DNA-molecule or fragments thereof in a nucleic acid, e.g. a host's genome, to regulate expression from said promoter.
The term “operatively linked” or “operably linked”, as used in connection with the present invention, refers to a linkage between one or more expression control sequences and/or the coding region in the polynucleotide to be expressed in such a way that expression is achieved under conditions compatible with the expression control sequence, e.g. so that the transcription enhancing RNA-polymerase binding aptamer increases expression of a sequence, or so that a transcription inhibiting RNA-polymerase binding aptamer inhibits or decreases expression of a sequence.
In a particular preferred embodiment the sequence to be expressed codes for a polypeptide or protein of interest and the RNA-polymerase binding aptamer is located in the 5′UTR region of the sequence encoding said polypeptide or protein of interest.
The DNA-molecule according to the present invention may also comprise an (entire) expression cassette. An expression cassette may be a part of a vector DNA used for cloning and transformation. The expression cassette directs the cell's machinery to make RNA and protein. An expression cassette is composed of one or more genes and the sequences controlling their expression. An expression cassette usually comprises: a promoter sequence, an open reading frame, and optionally a 3′-untranslated region that, in eukaryotes, usually contains a polyadenylation site. As outlined in great detail herein, the invention provides for a novel mechanism to regulate expression of DNA sequences, e.g. from an expression cassette. Hence, the present invention also relates to a DNA-molecule comprising an expression cassette, said expression cassette comprising a sequence encoding an RNA-polymerase binding aptamer according to the present invention, a promoter, and an open reading frame encoding a protein to be expressed, preferably the sequences encoding the aptamer, the promoter and the open reading frame are operatively linked within said expression cassette on said DNA-molecule. The expression cassette may further comprise a 3′untranslated region. The inventors have shown that the position of the sequence encoding the aptamer is not crucial as long as it is transcribed. Hence, in a preferred embodiment of the present invention the DNA-molecule according to the present invention comprises an expression cassette comprising a promoter, an open reading frame encoding a protein to be expressed, and a sequence encoding a RNA-polymerase binding aptamer according to the present invention, wherein the open reading frame and the sequence encoding said RNA-polymerase binding aptamer are positioned downstream of said promoter. The skilled person will acknowledge that the sequence encoding said aptamer may be positioned between the promoter and the open reading frame (e.g. in the 5′-untranslated region) or may be positioned within the open reading frame. The skilled person will be able to redesign coding sequences accordingly. However, in a preferred embodiment of the DNA-molecule comprising said expression cassette according to the present invention the sequence encoding the RNA-polymerase binding aptamer according to the present invention is located downstream of the promoter and upstream of the open reading frame of the protein to be expressed, i.e. in the 5′-untranslated region, particularly preferred downstream of the transcription start site of the promoter and the translation initiation site of the open reading from. Furthermore, in a preferred embodiment said sequence encoding a RNA-polymerase binding aptamer according to the present invention encodes two or more, preferably three or more consecutive aptamers, e.g. as tandem repeats, optionally the aptamers being separated through linker sequences. Preferably, the RNA-polymerase binding aptamer according to the invention is encoded downstream of the promoter and upstream of the sequence to be expressed is an activating RNA-polymerase binding aptamer according to the invention. For increasing the expression, this RNA-polymerase binding aptamer is preferably an activating RNA-polymerase binding aptamer according to the invention. This activating construct may further be enhanced by introducing the reverse complement of a sequence encoding an inhibiting RNA-polymerase binding aptamer according to the invention at the 3′ end of the expression cassette, preferably in the 3′ untranslated region (also referred to as 3′UTR). Without being bound by theory that this inhibiting aptamer represses unwanted expression from the complement strand (e.g. during pervasive transcription) and thereby avoids blocking of the expression from the desired strand by opposing transcription.
As will be readily understood, the present invention is not limited to a specific promoter or type or sequence to be expressed, e.g. a specific protein or RNA. Expression cassettes are designed for modular cloning of (protein-encoding) sequences so that the same cassette can easily be altered to make different proteins. To this end, an expression cassette may be provided which does not contain an open reading frame but a multiple cloning site allowing the insertion of a sequence to be expressed, e.g. a sequence encoding a protein of interest or a RNA of interest. Hence, the invention further relates to DNA-molecule comprising an expression cassette, said expression cassette comprising a sequence encoding an RNA-polymerase binding aptamer according to the present invention, a promoter, and an multiple cloning site for inserting a sequence to be expressed, preferably said sequence encoding the aptamer, said promoter and said multiple cloning site are operatively linked. In a preferred embodiment of the present invention the DNA-molecule according to the present invention comprises an expression cassette comprising a promoter, a multiple cloning site for inserting a sequence to be expressed, and a sequence encoding a RNA-polymerase binding aptamer according to the present invention, wherein said multiple cloning site and said sequence encoding said RNA-polymerase binding aptamer are positioned downstream of said promoter. In a preferred embodiment of the DNA-molecule comprising said expression cassette according to the present invention the sequence encoding the RNA-polymerase binding aptamer according to the present invention is located downstream of the promoter and upstream of the multiple cloning site for inserting a sequence to be expressed.
The expression cassette may also contain more than one sequence to be expressed, e.g. being a polycistronic expression cassette. That means that transcription is initiated from a single promoter. The mRNA found in bacteria is mainly polycistronic. This means that a single bacterial mRNA strand can be translated into several different proteins. This will occur if spacers separate the different proteins, and each spacer has to have a translation initiation site (ribosome binding site, Shine-Dalgarno sequence) located upstream of the respective start codon. The transcribed mRNA than comprises a plurality of sequences to be expressed, e.g. a two or more open reading frames encoding for a protein. The skilled person will acknowledge that the RNA-polymerase binding aptamers according to the present invention are also suited to enhance expression of such polycistronic transcripts. In one embodiment the expression cassette according to the present invention comprises a promoter, two or more open reading frames transcribed from said promoter, wherein the expression cassette comprises two or more sequences encoding a RNA-polymerase binding aptamer according to the present invention, wherein said sequences encoding said aptamer are operatively linked to said promoter and said open reading frames to allow expression of all open reading frames. In a preferred embodiment each open reading frame is operatively linked to one or more sequences encoding a RNA-polymerase binding aptamer according to the present invention, particularly preferred the two or open reading frames are separated by a sequence comprising one or more sequences encoding a RNA binding aptamer according to the present invention, and a translation initiation site operatively linked thereto, preferably a ribosomal binding site downstream of said sequence encoding the aptamer. In a preferred embodiment the two or more sequences encode transcription enhancing RNA-polymerase binding aptamers according to the present invention. Each open reading frame may furthermore been linked to a translation initiation site (e.g. a ribosomal binding site, Shine-Dalgarno sequence). In a preferred embodiment the expression cassette hence comprises a promoter, and two or more open reading frames, wherein the expression cassette comprises upstream of each open reading frame a sequence encoding a RNA-polymerase binding aptamer according to the present invention, preferably a transcription enhancing RNA-polymerase binding aptamer according to the present invention, and a translation initiation site. The open reading frames may encode all the same protein or may encode for different proteins.
Different expression cassettes can be transformed into different organisms including bacteria, yeast, plants, and mammalian cells as long as the correct regulatory sequences are used.
Particularly preferred are expression cassettes for bacteria, preferably E. coli. A promoter is a region of DNA that initiates transcription of the downstream sequence. Promoters are located near the transcription start sites of genes, on the same strand and upstream on the DNA (towards the 5′ region of the sense strand). Promoters can be of differing sized, e.g. about 100-1000 nt long. For the transcription to take place, the RNA polymerase binds to the promoter sequence or a sequence nearby. Promoters may contain specific DNA sequences such as response elements that provide a secure initial binding site for RNA polymerase and for proteins called transcription factors that recruit RNA polymerase. These transcription factors have specific activator or repressor sequences of corresponding nucleotides that attach to specific promoters and regulate gene expression. In bacteria the promoter is recognized by RNA polymerase and an associated sigma factor, which in turn are often brought to the promoter DNA by an activator protein's binding to its own DNA binding site nearby. In eukaryotes the process of transcription initiation is more complex, and at least seven different factors are necessary for the binding of an RNA polymerase II to the promoter. In bacteria, the promoter contains two short sequence elements approximately −10 and −35 nucleotides upstream from the transcription start site. The sequence at −10 (the −10 element) has the consensus sequence TATAAT. The sequence at −35 (the −35 element) has the consensus sequence TTGACA. These consensus sequences, while conserved on average, are not found intact in most promoters. On average, only 3 to 4 of the 6 base pairs in each consensus sequence are found in any given promoter. Few natural promoters have been identified to date that possess intact consensus sequences at both the −10 and −35; artificial promoters with complete conservation of the −10 and −35 elements have been found to transcribe at lower frequencies than those with a few mismatches with the consensus. The optimal spacing between the −35 and −10 sequences is 17 bp. It should be noted that the above promoter sequences are recognized only by RNA polymerase holoenzyme containing sigma-70. RNA polymerase holoenzymes containing other sigma factors recognize different core promoter sequences and are likewise covered by the present invention. As has been shown by the inventors, the superior technical effects of the RNA-polymerase binding aptamers according to the present invention are achieved independent of the promoter used. As will be understood by the skilled person, the promoter shall be chosen according to the intended host cell, i.e. when the host cell is a bacterium a bacterial promoter shall be used, and, likewise, in case the host cell is a eukaryotic cell a eukaryotic promoter shall be used. Preferred promoters are promoters being active in E. coli, e.g. E. coli promoters. Preferred promoters according to the present invention include constitutive and inducible promoters. Promoters for use in connection with the DNA molecule and the expression cassette, respectively, may be homologous or heterologous with regard to its origin and/or with regard to the gene or sequence to be expressed. Suitable promoters are for instance promoters which lend themselves to constitutive expression. However, promoters which are only activated at a point in time determined by external influences can also be used. Artificial and/or chemically inducible promoters may be used in this context. Inducible promoters comprise arabinose-, glucose-, IPTG-, alcohol-, metal-inducible promoters.
One example of a constitutive promoter is the one used in pWM3110 (see Example section). In one embodiment the expression cassette according to the present invention comprises a promoter having at least 80% identity to SEQ ID NO:66, preferably at least 90% identity to SEQ ID NO:66, more preferably at least 95% identity to SEQ ID NO:66, yet more preferred at least 99% identity to SEQ ID NO:66. In a particular preferred embodiment the expression cassette according to the present invention comprises a promoter having the sequence of SEQ ID NO:66.
One example of an inducible promoter is an arabinose inducible promoter, e.g. the arabinose-inducible promoter as used in the examples and having the sequence of SEQ ID NO:67. Hence, in one embodiment the expression cassette according to the present invention comprises a promoter having at least 80% identity to SEQ ID NO:67, preferably at least 90% identity to SEQ ID NO:67, more preferably at least 95% identity to SEQ ID NO:67, yet more preferred at least 99% identity to SEQ ID NO:67. In a particular preferred embodiment the expression cassette according to the present invention comprises a promoter having the sequence of SEQ ID NO:67.
It has been shown herein, that the RNA-polymerase binding aptamers are able to regulate transcription of a transcript comprising the aptamer. As outlined herein, the sequence encoding the RNA-polymerase binding aptamer according to the present invention is preferably operatively linked to the promoter sequence and the transcription starting site of the promoter. It is preferably positioned downstream of the promoter and the transcription starting site on the DNA-molecule according to the present invention. The distance of the RNA-polymerase binding aptamer encoding sequence to the transcription starting site is believed not being crucial. However, with certain distances superior effects in terms of regulating the transcription may be observed. Hence, in one embodiment the sequence encoding the RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt downstream of the transcription starting site, preferably within a distance of 50 to 500 nt downstream of the transcription starting site, more preferred within a distance of 60 to 250 nt downstream of the transcription starting site, particularly preferred within a distance of 80 to 103 nt downstream of the transcription starting site.
The sequence encoding the RNA-polymerase binding aptamer according to the present invention is preferably also operatively linked to the sequence to be expressed, e.g. an open reading frame, or a multiple cloning site. The distance of the RNA-polymerase binding aptamer encoding sequence to the sequence to be expressed or the multiple cloning site is not crucial. However, with certain distances superior effects in terms of regulating the transcription and expression may be observed. Hence, in one embodiment the sequence encoding the RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt upstream of the sequence to be expressed or the multiple cloning site, preferably within a distance of 50 to 500 nt upstream of the sequence to be expressed or the multiple cloning site, more preferred within a distance of 60 to 250 nt upstream of the sequence to be expressed or multiple cloning site, particularly preferred within a distance of 70 to 108 nt upstream of the sequence to be expressed or multiple cloning site.
Any combination of the above mentioned preferred distances may be used, e.g. a distance of 10 to 1000 nt downstream of the transcription starting site and a distance of 10 to 1000 nt upstream of the sequence to be expressed, or distance of 50 to 500 nt downstream of the transcription starting site and a distance of 50 to 500 nt upstream of the sequence to be expressed, or the multiple cloning site, or a distance of 60 to 250 nt downstream of the transcription starting site and a distance of 60 to 250 nt upstream of the sequence to be expressed, or the multiple cloning site, or a distance of 80 to 103 nt downstream of the transcription starting site and a distance of 70 to 108 nt upstream of the sequence to be expressed, or the multiple cloning site, or any other possible combination.
The skilled person will acknowledge that the DNA molecule according to the present invention is applicable to a variety of applications. Hence, the present invention also relates to the use of a sequence encoding a RNA-polymerase binding aptamer according to the present invention for regulating expression from a DNA molecule.
In a preferred embodiment the invention relates to the use of the DNA-molecule according to the present invention for regulating expression of an endogenous sequence to be expressed or an exogenous sequence to be expressed in a host cell or organism.
The skilled person is in the position to choose the specific embodiments for the respective use and application when considering the disclosure of the present application. For instance, the DNA-molecule may comprise a part or the entire sequence to be expressed. It may for purpose of site specific integration at the proper site of the host's genome, e.g. by homologous recombination, comprise only a part of the endogenous sequence to be expressed in order to allow integration at the proper and desired site to secure proper expression. Hence, in one embodiment the DNA-molecule according to the present invention comprises one or more elements (e.g. sequences) for site specific integration into a host's genome.
The skilled person, when considering the disclosure of the present invention, is in the position to choose the proper site of integration and to design the DNA-molecule accordingly. For such design he will in particular by consider certain embodiments of the present invention as described herein in great detail, e.g. with respect to the distance between the sequence encoding the RNA-polymerase binding aptamer according to the present invention to, for example, the promoter/transcription starting site and/or the ribosome binding site and/or the start codon of a gene to be expressed. As outlined herein above, the sequence encoding the RNA-polymerase binding aptamer according to the present invention for regulating the expression of the sequence to be expressed is preferably located downstream of the transcription starting site of a promoter. The promoter, however, may be present in the DNA-molecule itself or the DNA-molecule may contain sequences for site specific integration of the construct into the host's genome to position the sequence encoding the RNA-polymerase binding aptamer downstream of an endogenous transcription start site of a promoter. Hence, in one embodiment of the present invention, the DNA-molecule according to the present invention comprises a sequence for site specific integration into a host's genome, preferably for site specific integration downstream of a promoter and/or transcription start site.
In a one embodiment the DNA-molecule comprises a for site specific integration upstream of an endogenous sequence to be expressed. In such embodiment the DNA-molecule may comprise an endogenous or exogenous promoter and/or a transcription start site, preferably upstream of the sequence encoding the RNA-polymerase binding aptamer according to the present invention.
The RNA-polymerase binding aptamer may be chosen according to the needs, e.g. an inhibiting RNA-polymerase binding aptamer according to the present invention may be employed, i.e. encoded on said DNA-molecule, if the expression of said endogenous sequence is to be downregulated. Hence, the invention relates to the use of an inhibiting RNA-polymerase binding aptamer according to the present invention or a DNA-molecule encoding the same for inhibiting the expression of an endogenous sequence within a host cell. The host cell preferably being a non-human host cell, preferably bacteria origin. The DNA-molecule encoding inhibiting RNA-polymerase binding aptamer according to the present invention is preferably inserted downstream of the promoter from which said endogenous sequence is transcribed. The invention also relates to a DNA-molecule according to the present invention for use in treating a human through gene therapy.
The skilled person will acknowledge that the DNA-molecule according to the present invention may be used for regulating the expression of an endogenous sequence to be expressed or for regulating the expression of an exogenous sequence through the RNA-polymerase binding aptamer according to the present invention by employing an endogenous promoter of the host. In case the DNA-molecule according to the present invention is used for regulating the expression of an endogenous sequence to be expressed this may be performed by using the endogenous promoter of the sequence to be expressed or by introducing an exogenous promoter to control the start of the transcription. In the latter case, the skilled artisan will understand that the DNA-molecule according to the present invention preferably comprises a promoter, preferably upstream of the sequence encoding the RNA-polymerase binding aptamer and optionally additional sequences for proper integration.
Furthermore, the skilled person will acknowledge the plurality of different uses and applications of the aptamers according to the invention. The aptamers or the DNA-molecules according to the present invention may be used for regulating expression of sequences to be expressed within a host cell or in an in vitro expression system. By application of the different aptamers the skilled person may regulate the expression with a finer tuning than known from prior art. When using gene silencing or knock out techniques to inhibit expression of a gene, e.g. for functional studies, the result is usually the complete loss of the gene product, i.e. the RNA or the encoded protein. However, for some purposes it is desirable to not completely diminish expression and/or activity of a certain gene product. For instance if the gene and/or gene product is essential for cell survival. The skilled person will instantly acknowledge that the present invention provides a superior tool allowing down-regulation of gene expression, e.g. of endogenous sequences to be expressed, without completely depleting the cell for the encoded molecule. The aptamers according to the present invention are therefore particularly applicable for scientific research, e.g. by using them to down or up regulate the expression of a sequence, e.g. encoding a molecule of interest. The skilled person will acknowledge that this is the case for prokaryotic as well as eukaryotic cells. For the latter the present invention provides for a superior alternative to gene-silencing through siRNAs and the like. Hence, the disclosed methods and uses herein as they relate on the inhibiting RNA-polymerase binding aptamers in a particular preferred embodiment are methods or uses for knock down of a sequence of interest in a host cell (e.g. a gene). Preferred methods are outlined further below.
In one embodiment of the invention the use of the DNA-molecule relates to the use for regulating the expression of an endogenous sequence to be expressed by employing the endogenous promoter of said sequence. In such embodiment the DNA-molecule preferably comprises a sequence encoding a RNA-polymerase binding aptamer and one or more sequences for site specific integration of said sequence encoding a RNA-polymerase binding aptamer downstream of said endogenous promoter and upstream of said sequence to be expressed.
“Endogenous” refers to any sequence, promoter, transcription start site, gene, open reading frame or the like that is present in the host cell or organism without insertion or application of external nucleic acid sequence, e.g. before external application of the DNA-molecule according to the present invention. An endogenous sequence is present in a cell or organism without any external manipulation, e.g. transfection or transformation of nucleic acids.
“Exogenous ” refers to any sequence, promoter, transcription start site, gene or the like that is not present in the host cell or organism or not present at the specific locus (site) of the host's genome without insertion or application of external nucleic acid sequence, e.g. the “exogenous” element is only present at the specific site of the host's genome after external application of the DNA-molecule according to the present invention.
“Sequence to be expressed” as used herein refers to a DNA sequence encoding a molecule of interest. As outlined herein, the RNA-polymerase binding aptamers according to the present invention regulate the activity of RNA-polymerases, e.g. and preferably DNA-dependent RNA-polymerases, i.e. they influence the transcription of a DNA sequence into RNA. Thereby also the yield of subsequent steps, like translation into a polypeptide, can be influenced. Hence, the “sequence to be expressed” as used herein preferably refers to a DNA sequence encoding a molecule of interest, the molecule of interest preferably being a RNA molecule or a polypeptide or protein molecule of interest.
“Host cell” or “host cells” refers to cells expressing or capable of expressing a sequence to be expressed. Host cells of the present invention express polynucleotides encoding polypeptides or RNAs having any number of uses, including biotechnological, molecular biological and clinical settings. Examples of suitable host cells in the present invention include, but are not limited to, bacterial, yeast, insect, animal and mammalian cells. Specific examples of such cells include E. coli DH5a cells and BL21(DE3), as well as various other bacterial cell sources, for example the E. coli strains: DH10b cells, XL1Blue cells, XL2Blue cells, Top10 cells, HB101 cells and DH12S cells; yeast host cells from the genera including Saccharomyces, Pichia, and Kluveromyces, animal cell lines, e.g. mammal and even human cell lines, including CHO cells, HEK293 cells, and HeLa. Preferred host cells are those using bacterial RNA-polymerase, or eukaryotic DNA-dependent RNA-polymerase II and/or III; preferably RNA-polymerase II; for expression of DNA sequences. In one particular preferred embodiment the host cell is a bacterial cell, preferably E. coli; even more preferred selected from the group of E. coli strains consisting of DH5a, BL21(DE3), DH10b cells, XL1Blue cells, XL2Blue cells, Top10 cells, HB101 cells and DH12S cells, yet more preferred DH5α, or BL21(DE3). In another preferred embodiment the host cell is a fungus, more preferably a fungus of the genus Saccharomyces, Schizosaccharomyces, Aspergillus, Trichoderma, Kluyveromyces or Pichia and even more preferably of the species Saccharomyces cerevisiae, Schizosaccharomyces pombe, Aspergillus niger, Trichoderma reesei, Kluyveromyces marxianus, Kluyveromyces lactis, Pichia pastoris, Pichia torula or Pichia utilis. In another preferred embodiment the host cell is a eukaryotic host cell selected from the group of yeasts (e.g. Saccharomyces cervisiae), fungal host cells, insect host cells, mammalian host cells such as primate, human or rodent host cells, including CHO or COS cells, HEK293 cells, and HeLa The skilled artisan will acknowledge that the host cell may comprise further modifications, like additional genes, mutations or deletions, e.g. to enhance expression efficiency.
Even though the identified aptamers may play a role in gene expression in nature, one embodiment of the present invention is the provision of a DNA-molecule, a vector, and/or a host cell for regulated expression of a sequence of interest (a sequence to be expressed). Hence, in a preferred embodiment the subject-matter of the present invention relates to recombinant products, like recombinant RNA- or DNA-molecules, vectors and/or host cells. Hence, the invention preferably excludes naturally occurring host cells as found in nature expressing employing the regulatory mechanism of RNA-polymerase binding aptamers for the sequence to be expressed. Instead, the host cell of the present invention and employed in a method or use of the present invention is preferably a non-naturally occurring host cell, whether it has been genetically modified to express (including overexpression and repression/inhibition of expression) a sequence to be expressed in a context (regulatory mechanism through employment of the aptamers according to the present invention) not normally existing in its genome or whether it has been engineered to overexpress an exogenous/heterologous enzyme.
Thus, the DNA-molecules, vectors and host cells employed in connection with the present invention are preferably non-naturally occurring DNA-molecules, vectors and host cells, respectively, i.e. they are DNA-molecules, vectors and host cells which differ significantly from naturally occurring DNA-molecules, vectors and host cells and which do not occur in nature.
Also variants of the host cells are included. Such variants include preferably genetically modified organisms as described herein above which differ from naturally occurring organisms due to a genetic modification. Genetically modified organisms are organisms which do not naturally occur, i.e., which cannot be found in nature, and which differ substantially from naturally occurring organisms due to the introduction of a foreign (recombinant) nucleic acid molecule. Such organisms are also referred to as recombinant.
In a further embodiment the invention relates to the use of the DNA-molecule according to the present invention for regulating expression from a DNA-molecule. It will be understood by those of ordinary skills in the art that in this embodiment the DNA-molecule in addition to the sequence encoding the RNA-polymerase binding aptamer according to the present invention further may comprise the sequence to be expressed.
The invention also relates to a nucleic acid hybridizing to an RNA-molecule or a DNA-molecule according to the present invention under stringent conditions. Hybridization conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 6.3.1-6.3.6, 1991. Stringent conditions are defined as equivalent to hybridization in 6× sodium chloride/sodium citrate (SSC) at 45° C., followed by a wash in 0.2×SSC, 0.1% SDS at 65° C.
The DNA molecule according to the present invention is preferably comprised in a vector. In case it comprises an expression cassette the vector contains said DNA-molecule is also referred to as an expression vector or a vector for expression.
The term “vector” refers to a polynucleotide or a mixture of polynucleotides and polypeptides/proteins which is capable of being introduced or of introducing the nucleic acid comprised into a cell. It is preferred that the sequences to be expressed and encoded by the introduced polynucleotide are expressed within the cell upon introduction of the vector.
In a preferred embodiment the vector of the present invention comprises plasmids, phagemids, phages, cosmids, artificial mammalian chromosomes, knock-out or knock-in constructs, viruses, in particular adenoviruses, vaccinia viruses, attenuated vaccinia viruses, canary pox viruses, lentivirus (Chang, L. J. and Gay, E. E. (20001) Curr. Gene Therap. 1:237-251), herpes viruses, in particular Herpes simplex virus (HSV-1, Carlezon, W. A. et al. (2000) Crit. Rev. Neurobiol.), baculovirus, retrovirus, adeno-associated-virus (AAV, Carter, P. J. and Samulski, R. J. (2000) J. Mol. Med 6:17-27), rhinovirus, human immune deficiency virus (HIV), filovirus and engineered versions thereof (see, for example, Cobinger G. P. et al (2001) Nat. Biotechnol. 19:225-30), virosomes, “naked” DNA liposomes, and nucleic acid coated particles, in particular gold spheres. Particularly preferred are viral vectors like adenoviral vectors or retroviral vectors (Lindemann et al. (1997) Mol. Med 3:466-76 and Springer et al. (1998) Mol. Cell. 2:549-58). Liposomes are usually small unilamellar or multilamellar vesicles made of cationic, neutral and/or anionic lipids, for example, by ultrasound treatment of liposomal suspensions. The DNA can, for example, be ionically bound to the surface of the liposomes or internally enclosed in the liposome. Suitable lipid mixtures are known in the art and comprise, for example, DOTMA (1,2-Dioleyloxpropyl-3-trimethylammoniumbromid) and DPOE (Dioleoylphosphatidyl-ethanolamin) which both have been used on a variety of cell lines. The vector according to the present invention may serve for different purposes as outlined herein. It may for example be used to introduce a DNA molecule comprising a sequence encoding a RNA-polymerase binding aptamer according to the present invention into an existing gene in order to regulate, e.g. enhance/increase or inhibit/repress/decrease, expression of the gene. Hence, vector according to the present invention at least comprises a DNA-molecule comprising a sequence encoding a RNA-polymerase binding aptamer according to the present invention. In a preferred embodiment the vector comprises a DNA-molecule, said DNA-molecule comprising an expression cassette according to the present invention.
Furthermore, the DNA-molecule of the vector of the present invention may, in addition to the sequence encoding the RNA-polymerase binding aptamer of the invention, comprise further expression control elements, allowing proper expression of the coding regions in suitable hosts. Such control elements are known to the skilled person and may include a promoter, a splice cassette, translation initiation codon, translation and insertion site for introducing an insert into the vector (also referred to as multiple cloning site). Preferably, the sequence encoding the RNA-polymerase binding aptamer of the invention is operatively linked to said further expression control sequences allowing expression of a sequence to be expressed and to be inserted into said multiple cloning site in eukaryotic or prokaryotic cells. Accordingly, the present invention relates to a vector comprising DNA molecule comprising a sequence encoding the RNA-polymerase binding aptamer of the invention, wherein the nucleic acid is operably linked to a promoter that is recognized by a host cell when the eukaryotic and/or prokaryotic (host) cell is transfected with the vector. In a preferred embodiment the vector preferably comprises a ribosomal binding site, wherein said ribosomal binding site is located downstream of said sequence encoding the RNA-polymerase binding aptamer.
Furthermore, the vector of the present invention is preferably an expression vector, i.e. comprising a DNA-molecule, said DNA-molecule comprising an expression cassette according to the present invention. The DNA-molecules and vectors of the invention may be designed for direct introduction or for introduction via liposomes, viral vectors (e.g. adenoviral, retroviral), electroporation, ballistic (e.g. gene gun) or other delivery systems into the cell. Additionally, a baculoviral system can be used as eukaryotic expression system of the invention. An expression vector, otherwise known as an expression construct, is usually a plasmid or virus designed for protein expression in cells. The vector is used to introduce a specific gene into a target cell, and can commandeer the cell's mechanism for protein synthesis to produce the protein encoded by the gene. The plasmid is engineered to contain regulatory sequences that act as enhancer and/or promoter regions and lead to efficient transcription of the gene carried on the expression vector. The aim of an expression vector is the production of significant amount of stable messenger RNA, and consequently proteins or polypeptides. An expression vector has features that any vector may have, such as an origin of replication, a selectable marker, and a suitable site for the insertion of a gene such as the multiple cloning site. The cloned gene may be transferred from a specialized cloning vector to an expression vector, although it is possible to clone directly into an expression vector. The cloning process is normally performed in bacteria, e.g. Escherichia coli, and vectors used for protein expression in organisms other than the one used for cloning, e.g. E. coli, may have, in addition to a suitable origin of replication for its propagation in the cloning organism, elements that allow them to be maintained in another organism, and these vectors are called shuttle vectors. An expression vector has elements necessary for protein expression. These may include a promoter, constitutive or inducible, the correct translation initiation sequence such as a ribosomal binding site and/or start codon, a termination codon, and a transcription termination sequence. There are differences in the machinery for protein synthesis between prokaryotes and eukaryotes, therefore the expression vectors must have the elements for expression that is appropriate for the chosen host. For example, prokaryotes expression vectors may have a Shine-Dalgarno sequence at its translation initiation site for the binding of ribosomes, while eukaryotes expression vectors would contain the Kozak consensus sequence. The vector according to the present invention is preferably a vector is for expressing a sequence of interest in bacteria, preferably in Escherichia coli.
The promoter initiates the transcription and is therefore a point of control for the expression of the cloned gene. The promoters used in expression vector are constitutive or inducible. The latter promoters meaning that protein synthesis is only initiated or fully initiated, when required, e.g. by the introduction of an inducer, such as IPTG or arabinose. Protein expression however may also be constitutive (i.e. transcription is constant and the protein is constantly expressed) in some expression vectors. Low level of constitutive protein synthesis may occur even in expression vectors with tightly controlled promoters.
After the expression of the gene product, it is usually necessary to purify the expressed protein; however, separating the protein of interest from the great majority of proteins of the host cell can be a protracted process. To make this purification process easier, a purification tag may be added to the cloned gene or being present in an expression cassette or vector to allow cloning of a sequence of interest such that the gene and the tag are in frame and expressed as a single polypeptide chain. Such tags optionally being present in the vector according to the present invention include histidine (His) tag, marker peptides, and fusion partners, such as glutathione S-transferase or maltose-binding protein or GFP or β-galactosidase. Some of these fusion partners may also help to increase the solubility of some expressed proteins. Other fusion proteins such as green fluorescent protein may act as a reporter gene for the identification of successful cloned genes.
The vector or DNA-molecule comprising an expression cassette may be transformed or transfected (also referred to herein as introduced) into the host cell for protein synthesis. The vectors of the invention may have element for transformation or the insertion of DNA into the host chromosome, for example the vir genes for plant transformation, and integrase sites for chromosomal insertion. The vector of the invention may include targeting sequence that may target the expressed protein to a specific location such as the periplasmic space of bacteria.
The vector may include further control elements ensuring and/or regulating expression. Control elements ensuring expression in eukaryotic and prokaryotic (host) cells are well known to those skilled in the art. As mentioned herein above, they usually comprise regulatory sequences ensuring initiation of transcription and optionally signals ensuring termination of transcription and stabilization of the transcript. Additional regulatory elements may include transcriptional as well as translational enhancers, and/or naturally-associated or heterologous promoter regions.
The promoter according to the invention is preferably constitutive or inducible as outlined herein. Preferred promoters according to the present invention include prokaryotic and eukaryotic promoters, such as bacterial, fungal or mammalian promoters. Preferred mammalian promoters are selected from the group consisting of CMV-HSV thymidine kinase promoter, SV40, RSV-promoter (Rous Sarcoma Virus), human elongation factor 1α-promoter, the glucocorticoid-inducible MMTV-promoter (Moloney Mouse Tumor Virus), neurofilament-promoter, PGDF-promoter, NSE-promoter, PrP-promoter, thy-1-promoter, metallothionein-inducible promoter, and tetracyclin-inducible promoter. Furthermore, enhancers may be present in the vector or DNA-molecule according to the present invention. Preferred enhancers are selected from the group consisting of CMV enhancer, and SV40-enhancer. For expression in neural cells, it is envisaged that neurofilament-, PGDF-, NSE-, PrP-, or thy-1-promoters can be employed. Said promoters are known in the art and, inter alia, described in Charron J. Biol. Chem. 270 (1995), 25739-25745. Preferred promoters are outlined herein above for the DNA-molecule according to the present invention and apply also to the vectors and plasmids.
Besides elements which are responsible for the initiation of transcription such regulatory elements may also comprise transcription termination signals, such as SV40-poly-A site or the tk-poly-A site, downstream of the sequence to be expressed. In this context, suitable expression vectors are known in the art such as Okayama-Berg cDNA expression vector pcDV1 (Pharmacia), pRc/CMV, pcDNA1, pcDNA3 (In-vitrogene), pSPORT1 (GIBCO BRL), pX (Pagano, Science 255 (1992), 1144-1147), yeast two-hybrid vectors, such as pEG202 and dpJG4-5 (Gyuris, Cell 75 (1995), 791-803), or preferred prokaryotic expression vectors, such as pET21a (Novagen), pWM015 (W. G. Miller et al., Appl. Environ. Microbiol. 66, 5426-5436 (2000)), pBAD24 (L. M. Guzman et al., 177 (1995), Lambda gt11 or pGEX (Amersham-Pharmacia). The vector may further comprise nucleic acid sequences encoding for a secretion signal. Such sequences are well known to the person skilled in the art. Furthermore, depending on the expression system used, leader sequences capable of directing the peptides of the invention to a cellular compartment may be added to the coding sequence of the nucleic acid molecules of the invention and are well known in the art. The leader sequence(s) is (are) assembled in appropriate phase with translation, initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein, or a protein thereof, into the periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including a C- or N-terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant products. Once the vector has been incorporated into the appropriate host, the host is maintained under conditions suitable maintaining the vector and optionally expression of the sequence to be expressed, and, as desired, the collection and purification of the so expressed molecules may follow.
The present invention also relates to a host cell transfected or transformed, i.e. comprising with the vector or DNA-molecule of the invention or a non-human host organism carrying the vector of the present invention, i.e. to a host cell or host which is genetically modified with a nucleic acid molecule according to the invention or with a vector comprising such a nucleic acid molecule. The term “genetically modified” means that the host cell or host comprises in addition to its natural genome a nucleic acid molecule or vector according to the invention which was introduced into the cell or host or into one of its predecessors/parents. The nucleic acid molecule or vector may be present in the genetically modified host cell or host either as an independent molecule outside the genome, preferably as a molecule which is capable of replication, or it may be stably integrated into the genome of the host cell or host.
The skilled person will acknowledge that the DNA-molecules according to the present invention are also applicable for gene therapy in a subject. Hence, the present invention also relates to the DNA-molecule according to the present invention for use in gene therapy in a subject. “Subject” in the meaning of the invention is understood to be all persons, animals, plants or microorganisms, irrespective whether or not they exhibit pathological changes, unless stated otherwise. In a preferred embodiment the patient according to the invention is a human. In a further preferred embodiment of the invention the patient is a human suspected to have a disease to be treated by gene therapy.
For medical use the DNA-molecule or vector or plasmid may be presented in as a pharmaceutical composition. The pharmaceutical composition may comprise further pharmaceutical excipients deemed necessary, e.g. for stabilizing the DNA-molecule, for diminishing adverse effects of the subject upon application of the composition etc. The DNA-molecule or vector or plasmid may be present in pure form or as a pharmaceutically acceptable salt thereof. The phrase “pharmaceutically acceptable salt(s)”, as used herein, denotes those salts of compounds of the invention that are nontoxic, safe and effective for medical use in mammals and that possess the desired biological activity, including, but not limited to, acid addition and/or base salts. The acid addition salts are formed from basic invention compounds, whereas the base addition salts are formed from acidic invention compounds. All of these forms are within the scope of the compounds useful in the invention. For a review on pharmaceutically acceptable salts see Berge et al. (1977), J. Pharm. Sci. 66:1-19, incorporated herein by reference.
The present invention furthermore relates to the use of a DNA-molecule comprising sequence encoding a RNA-polymerase binding aptamer according to the present invention for regulating expression of a protein, preferably for regulating expression of a protein in a microorganism, such as prokaryotic or eukaryotic cell lines, including bacteria and mammal cell lines. The protein to be expressed is preferably encoded on the same DNA-molecule as the RNA-polymerase binding aptamer, downstream of the RNA-polymerase binding aptamer sequence and preferably positioned such that it is transcribed into the same RNA molecule. In a preferred embodiment of the present invention said sequence encoding said RNA-polymerase binding aptamer is positioned downstream of the promoter from which the expression of said protein is initiated. This means that the RNA-polymerase binding aptamer is part of the mRNA encoding the protein to be expressed. The use according to the present invention encompasses the use for regulating the expression of any protein of interest. A protein of interest may be any protein that can be expressed in expression systems. Such expression systems usually employs host cells transfected or transformed with a DNA-molecule, e.g. a vector, according to the present invention. Hence, the invention also encompasses the use of a host cell according to the present invention for expression of a sequence of interest, e.g. a protein.
The present invention relates to methods of producing a protein of interest comprising providing a vector according to the present invention, preferably comprising an expression cassette according to the present invention, said vector comprising a sequence encoding the protein of interest operatively linked to the promoter and the sequence encoding the RNA-polymerase binding aptamer, introducing said vector into a host cell, and culturing said host cell in culture medium under conditions inducing transcription from the promoter of the expression vector, and optionally recovering the protein of interest from the host cell or culture medium.
It will be understood by the skilled artisan that the provision of the vector comprising the expression cassette may include the cloning of the sequence to be expressed into an expression cassette according to the present invention. Hence, in one embodiment the step of providing a vector according to the present invention may include cloning of the sequence encoding said protein of interest into said vector such that the sequence is operatively linked to the promoter and the sequence encoding the RNA-polymerase binding aptamer to obtain the final expression vector, preferably cloning into a multiple cloning site of the expression cassette. Hence, the invention also relates to a method methods of producing a protein of interest comprising the steps of providing a vector according to the present invention, preferably comprising an expression cassette according to the present invention, cloning the sequence encoding a protein of interest into said vector, preferably into a multiple cloning site, such that the sequence is operatively linked to the promoter and the sequence encoding the RNA-polymerase binding aptamer to obtain the final expression vector, introducing said expression vector into a host cell, and culturing said host cell in culture medium under conditions inducing transcription from the promoter of the expression vector, and optionally recovering the protein of interest from the host cell or culture medium.
It has been found by the inventors that a subgroup of the RNA-polymerase binding aptamers according to the present invention increase the activity of RNA polymerase when transcribed from a DNA strand encoding said aptamer. The efficiency of transcription of a DNA segment is increased if the DNA segment comprises a sequence encoding a RNA-polymerase binding aptamer according to the invention. Hence, in one preferred embodiment of the present invention the RNA-polymerase binding aptamer increases the activity of the RNA-polymerase. The skilled person is aware of methods, assays and systems to determine activity of RNA-polymerases. Such methods include the expression of reporter genes in a host cell or may apply in vitro transcription assays as both outlined in detail in the enclosed Examples. In host cell based assays basically a DNA construct is used which comprises a promoter to allow start of transcription and a reporter, e.g. a reporter gene. To test for the increase of RNA-polymerase activity the expression of the construct starting from the promoter is measured and compared between a construct comprising a sequence encoding an RNA-polymerase binding aptamer according to the present invention and the construct not comprising said aptamer. Expression can be measured using methods commonly known by the skilled artisan. For example, the activity of a protein encoded by the reporter gene can be tested, e.g. fluorescence or enzymatic activity. Additionally or alternatively, mRNA amounts as the direct transcription product can be determined using common techniques, like Northern Blot or quantitative reverse transcription (qRT) real-time PCR. For example, to investigate the effect of a sequence encoding a RNA-polymerase binding aptamer on RNA-polymerase activity and thereby on transcription a plasmid-based GFP reporter system may be used. Possible systems are used in the presented Examples (see e.g.
The inventors further found that some of the RNA-polymerase binding aptamer increasing the activity of the RNA-polymerase are increasing the activity of the polymerase itself (also referred to herein as “activating RNA-polymerase binding aptamers”) and others or the same additionally increase the expression in a terminator-dependent fashion, i.e. the suppress the terminating effect of terminator sequences (also referred to herein as “antiterminating RNA-polymerase binding aptamers”). Both, activating and antiterminating, types of RNA-polymerase binding aptamers, are referred to as “transcription enhancing aptamer”.
Two classes of transcription terminators in prokaryotes, Rho-dependent and Rho-independent, have been identified throughout prokaryotic genomes. These widely distributed sequences are responsible for triggering the end of transcription upon normal completion of gene or operon transcription, mediating early termination of transcripts as a means of regulation such as that observed in transcriptional attenuation, and to ensure the termination of runaway transcriptional complexes that manage to escape earlier terminators by chance, which prevents unnecessary energy expenditure for the cell. Rho-dependent transcription terminators require a protein called Rho factor, which exhibits RNA helicase activity, to disrupt the mRNA-DNA-RNA polymerase transcriptional complex. Rho-dependent terminators are found in bacteria and phages. The Rho-dependent terminator occurs downstream of translational stop codons and consists of an unstructured, cytosine-rich sequence on the mRNA known as a Rho utilization site (rut), and a downstream transcription stop point (tsp). The rut serves as an mRNA loading site and as an activator for Rho; activation enables Rho to efficiently hydrolyze ATP and translocate down the mRNA while it maintains contact with the rut site. Rho is able to catch up with the RNA polymerase, which is stalled at the downstream tsp sites (see e.g. Richardson, J. P. (1996). “Rho-dependent Termination of Transcription Is Governed Primarily by the Upstream Rho Utilization (rut) Sequences of a Terminator”. Journal of Biological Chemistry 271 (35): 21597-21603). Contact between Rho and the RNA polymerase complex stimulates dissociation of the transcriptional complex through a mechanism involving allosteric effects of Rho on RNA polymerase (see e.g. Ciampi, M S. (September 2006). “Rho-dependent terminators and transcription termination”, Microbiology, 152(Pt 9): 2515-2528; and Epshtein, V; Dutta, D; Wade, J; Nudler, E (Jan. 14, 2010). “An allosteric mechanism of Rho-dependent transcription termination”, Nature, 463 (7278): 245-249).
Intrinsic transcription terminators or Factor/Rho-independent terminators require the formation of a self-annealing hairpin structure on the elongating transcript, which results in the disruption of the mRNA-DNA-RNA polymerase ternary complex. The terminator sequence contains a 20 basepair GC-rich region of dyad symmetry followed by a short poly-T tract or “T stretch” which is transcribed to RNA to form the terminating hairpin and a 7-9 nucleotide “U track” respectively. The mechanism of termination is hypothesized to occur through a combination of direct promotion of dissociation through allosteric effects of hairpin binding interactions with the RNA polymerase and “competitive kinetics”. The hairpin formation causes RNA polymerase stalling and destabilization, leading to a greater likelihood that dissociation of the complex will occur at that location due to an increased time spent paused at that site and reduced stability of the complex (see e.g. von Hippel, P. H. (1998). “An Integrated Model of the Transcription Complex in Elongation, Termination, and Editing” Science 281 (5377): 660-665; and Gusarov, Ivan; Nudler, Evgeny (1999). “The Mechanism of Intrinsic Transcription Termination” Molecular Cell 3 (4): 495-504).
The antiterminating RNA-polymerase binding aptamers suppress termination as shown in the examples. Such suppression may be a suppression of termination through intrinsic terminators (factor independent) and factor-dependent terminators. Two types have been for example described in Santangelo and Artsimovitch (2011), Nat. Rev. Microbiol. 9:319-329, which is incorporated herein by reference. The skilled person when considering the disclosure of the present invention and his common knowledge is in the position to test the ability of a certain aptamer for suppressing termination of transcription. The skilled person is aware of methods, assays and systems to determine activity of RNA-polymerases, supra. Such methods include the expression of reporter genes in a host cell or may apply in vitro transcription assays as outlined in detail in the enclosed Examples. In host cell based assays basically a DNA construct is used which comprises a promoter to allow start of transcription and a reporter, e.g. a reporter gene. To test for the increase of RNA-polymerase activity the expression of the construct starting from the promoter is measured and compared between a construct comprising a sequence encoding an RNA-polymerase binding aptamer according to the present invention and the construct not comprising said aptamer. When testing for the suppression of termination, the construct shall further comprise, in both constructs, the control and the aptamer containing construct, the terminator sequence, e.g. rrnB or T7t terminators (see e.g. Santangelo and Artsimovitch (2011), Nat. Rev. Microbiol. 9:319-329). The terminator is preferably located downstream of the sequence encoding the RNA-polymerase binding aptamer. Expression can be measured using methods commonly known by the skilled artisan, supra. For example, the activity of a protein encoded by the reporter gene can be tested, e.g. fluorescence or enzymatic activity. Additionally or alternatively, mRNA amounts as the direct transcription product can be determined using common techniques, like Northern Blot or quantitative reverse transcription (qRT) real-time PCR. For example, to investigate the effect of a sequence encoding a RNA-polymerase binding aptamer on RNA-polymerase activity and thereby on transcription a plasmid-based GFP reporter system may be used. Possible systems are used in the presented Examples (see e.g.
Even though activating and antiterminating mechanisms could be identified, several identified aptamers have both activities, i.e. enhancing RNA-polymerase transcription activity in a terminator dependent as well as a terminator independent way. Examples of such enhancing aptamers acting through both mechanisms are the RNA-polymerase binding aptamers encoded by SEQ ID NOs:2 to 24, which are particularly preferred herein.
However, in one embodiment of the present invention the enhancing RNA-polymerase binding aptamer according to the invention is an anti-terminating RNA-polymerase binding aptamers and enhances the activity of the RNA-polymerase in a terminator-dependent fashion. In a further embodiment the enhancing RNA-polymerase binding aptamer according to the invention is an activating RNA-polymerase binding aptamers and enhances the activity of the RNA-polymerase in a terminator-independent fashion. In yet a particular preferred embodiment, the enhancing RNA-polymerase binding aptamer according to the invention is an activating and antiterminating RNA-polymerase binding aptamers and enhances the activity of the RNA-polymerase in a terminator-dependent and terminator-independent fashion.
Transcription terminators are well known by the skilled person and include factor-dependent as well as intrinsic terminator sequences. Examples of prokaryotic terminators are for example disclosed in Santangelo and Artsimovitch (2011), Nat. Rev. Microbiol. 9:319-329. In a preferred embodiment of the present invention the term “terminator” refers to a terminator selected from rut (Rho dependent terminator), rrnB, 77t and tR2 terminators (intrinsic terminator sequences). Preferred terminator sequences are shown in Table 3 and have a sequence selected from the group consisting of SEQ ID NO:42, SEQ ID NO:43, and SEQ ID NO:44.
In a preferred embodiment of the present invention the RNA-polymerase binding aptamer increases expression of a sequence to be expressed (also referred to as “transcription enhancing aptamer” or “transcription enhancing RNA-polymerase binding aptamer”), wherein the increase of expression is at least 10% as compared to the expression of said sequence to be expressed when not comprising said RNA-polymerase binding aptamer, preferably by at least 20%, more preferably at least 100%. Preferably the RNA-polymerase binding aptamer increases expression of a sequence to be expressed in a reporter assay as outlined herein.
Antiterminating activity of a RNA-polymerase binding aptamer can be tested by the skilled person when considering the disclosure of the present application. In a preferred embodiment of the present invention the antiterminating RNA-polymerase binding aptamer increases expression of a sequence to be expressed comprising a terminator sequence, wherein the increase of expression is at least 10% as compared to the expression of said sequence to be expressed when not comprising said antiterminating RNA-polymerase binding aptamer, preferably by at least 20%, more preferably at least 100%. Preferably the antiterminating RNA-polymerase binding aptamer increases expression of a sequence to be expressed comprising a terminator sequence in a reporter assay as outlined herein.
The inventors identified a plurality of RNA-polymerase binding aptamers (SEQ ID NOs:2 to 24) that possess the feature of increasing the expression of a sequence to be expressed, these are preferably of bacterial origin. Detailed analysis of the sequences revealed a common motif comprised in these sequences. This RNA sequence is encoded by the DNA sequence of G[A/G][C/A/T][A/T][A/G/T][C/G]AT[G/T/C][A/C]G[G/A][G/T][C/A][A/G][G/A/C][T/G][T/A/G][C/T/G]AGCA and outlined herein as SEQ ID NO:1 using the expanded one letter code (see Table 2). Hence, in one embodiment of the RNA-polymerase binding aptamer, the RNA-polymerase binding aptamer is an transcription enhancing aptamer and encoded by a sequence of SEQ ID NO:1. In a further preferred embodiment the RNA-polymerase binding aptamer is an activating RNA-polymerase binding aptamer and comprises a sequence encoded by a sequence of SEQ ID NO:1. In yet a further embodiment the RNA-polymerase binding aptamer is an antiterminating RNA-polymerase binding aptamer and comprises a sequence encoded by a sequence of SEQ ID NO:1. It has been found that this consensus sequence is present in DNAs encoding transcription enhancing RNA-polymerase binding aptamers derived of bacterial origin, e.g. E. coli. A further consensus sequence has been identified in DNA encoding for transcription enhancing RNA-polymerase binding aptamers derived from eukaryotic origin, i.e. SEQ ID NO:138. The thereby encoded transcription enhancing RNA-polymerase binding aptamers have also been shown to haven transcription enhancing properties in bacterial expression systems, such as E. coli; see Examples. Further, it has been found using GLAM2 that the enhancing sequences, i.e. the sequences sharing motif of SEQ ID NO:1 or SEQ ID NO:138, all share the common motif of SEQ ID NO:139. This motif furthermore fits the identified essential nt as tested for the aptamer encoded by SEQ ID NO:2; see Examples.
Hence, in a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer is encoded by a sequence according to SEQ ID NO:139 (CAN0-3[AC][CA]N[GC][AT]N[CA][CA][CAT]). A sequence according to SEQ ID NO:139 is preferably selected from the group consisting of SEQ ID NO:1 and SEQ ID NO:138. Particular preferred sequences of transcription enhancing RNA-polymerase binding aptamers are encoded by a sequence according to SEQ ID NO:1. Preferred sequences falling within the consensus of SEQ ID NO:1 are selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75, or a sequence at least 80% identical to any of these sequences, more preferably SEQ ID NO:2 or a sequence at least 80% identical thereto. Preferred sequences falling within the consensus of SEQ ID NO:138 are selected from the group consisting of SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137 or a sequence at least 80% identical to any of these sequences.
The inventors found a plurality of RNA-polymerase binding aptamers which enhance the activity of the RNA-polymerase they are binding to. Furthermore, a common motif has been identified as SEQ ID NO:1. Furthermore, the inventors demonstrated that variations within the sequence of one example aptamer are possible at certain positions without interfering with its properties. Hence the skilled person will instantly acknowledge that the aptamers of certain sequences as disclosed herein retain their properties. Hence, the transcription enhancing RNA-polymerase binding aptamer according to the present invention is in one embodiment encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75 SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. In a further embodiment the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. In yet a further embodiment the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75 SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. In a preferred embodiment the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. In a particular embodiment the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, and SEQ ID NO:137; preferably selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. A particular preferred transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by the sequence of SEQ ID NO:2 or a sequence having at least 80% identity thereto, preferably at least 90% identity, more preferably at least 95% identity, yet more preferred at least 97% identity, even more preferred at least 99% identity. The invention also relates to a nucleic acid hybridizing to any of these sequences under stringent conditions.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:2, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:2. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:2. In a preferred embodiment of the transcription enhancing RNA-polymerase binding aptamer being encoded by a sequence having a certain identity to SEQ ID NO:2, the sequence encoding the aptamer is not altered at the nucleotides at positions 1, 7, 8, 11, and 20 to 23 of SEQ ID NO:2. The inventors have shown that the sequences encoding the aptamer may be mutated without loosing its activity. In particular, it has been shown that variants of RAP ID #5713 as encoded by SEQ ID NO:2, i.e. SEQ ID NOs: 72 to 75 retain their property as transcription enhancing RNA-polymerase binding aptamers. Hence, in one embodiment the (transcription enhancing) RNA-polymerase binding aptamer according to the present invention is encoded by a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. The sequence of SEQ ID NO:75 is the sequence of SEQ ID NO:2 shortened at its 3′-end by 6 nt and thereby only comprising the part of SEQ ID NO:2 corresponding to the minimal consensus sequence of SEQ ID NO:1. The thereby encoded aptamer retains its function. Hence, in a particular preferred embodiment the (transcription enhancing) RNA-polymerase binding aptamer according to the present invention is encoded by a sequence comprising a sequence having at least 80% identity to SEQ ID NO:75, preferably comprising a sequence having at least 90% identity, more preferably comprising a sequence having at least 95% identity, even more preferred comprising a sequence having at least 99% identity to SEQ ID NO:75. In a particular embodiment the transcription enhancing RNA-polymerase binding aptamer is encoded by a sequence comprising the sequence of SEQ ID NO:75. In yet a further particular preferred embodiment the transcription enhancing RNA-polymerase binding aptamer is encoded by a sequence having the sequence of SEQ ID NO:75. In a preferred embodiment of the transcription enhancing RNA-polymerase binding aptamer being encoded comprising or consisting of a sequence having a certain identity to SEQ ID NO:75, the sequence encoding the aptamer is not altered at the nucleotides at positions 1, 7, 8, 11, and 20 to 23 of SEQ ID NO:75.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:3, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:3. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:3.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:4, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:4. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:4.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:5, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:5. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:5.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:6, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:6. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:6.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:7, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:7. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:7.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:8, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:8. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:8.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:9, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:9. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:9.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:10, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:10. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:10.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:11, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:11. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:11.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:12, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:12. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:12.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:13, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:13. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:13.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:14, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:14. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:14.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:15, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:15. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:15.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:16, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:16. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:16.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:17, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:17. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:17.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:18, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:18. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:18.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:19, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:19. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:19.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:20, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:20. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:20.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:21, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:21. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:21.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:22, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:22. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:22.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:23, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:23. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:23.
In a preferred embodiment of the present invention the transcription enhancing RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:24, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:24. In a particular embodiment the RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:24.
For the aptamer encoded by the sequences of SEQ ID NOs: 2 to 14, and 72 to 75 antiterminating activity has been observed. Hence, in a preferred embodiment of the present invention the antiterminating RNA-polymerase binding aptamer is encoded by a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75, or a sequence having at least 80% identity thereto, preferably at least 90%, more preferably at least 95%, even more preferred at least 99% identity.
The determination of percent identity between two sequences is accomplished using the mathematical algorithm of Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877. Such an algorithm is incorporated into the BLASTN and BLASTP programs of Altschul et al. (1990) J. Mol. Biol. 215: 403-410. BLAST nucleotide searches are performed with the BLASTN program, score=100, word length=12, to obtain nucleotide sequences homologous to the respective nucleotide sequence. BLAST protein searches are performed with the BLASTP program, score=50, word length=3, to obtain amino acid sequences homologous to the respective amino acid sequence sequence, respectively. To obtain gapped alignments for comparative purposes, Gapped BLAST is utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25: 3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs are used.
The above outlined embodiments for the DNA-molecule, the expression cassettes and vectors apply to the invention as a whole, i.e. also specifically to the transcription enhancing RNA-polymerase binding aptamers. However, further embodiments in particular apply to the subject-matter relating to said transcription enhancing RNA-polymerase binding aptamers.
As outlined in great detail, the sequence encoding the RNA-polymerase binding aptamer is preferably operatively linked to a promoter and/or a sequence to be expressed. In particularly preferred embodiment of the present invention, the sequence encoding the transcription enhancing RNA-polymerase binding aptamer is operatively linked to a promoter, and optionally to a sequence to be expressed. It has been shown herein, that the transcription enhancing RNA-polymerase binding aptamers are able to increase transcription/expression of a transcript comprising the aptamer. The sequence encoding the transcription enhancing RNA-polymerase binding aptamer according to the present invention is preferably operatively linked to the promoter sequence and the transcription starting site. It is preferably positioned downstream of the promoter and the transcription starting site on the DNA-molecule according to the present invention. The distance of the transcription enhancing RNA-polymerase binding aptamer encoding sequence to the transcription starting site is not crucial. However with certain distances superior effects in terms of regulating the transcription may be observed. Hence, in one embodiment the sequence encoding the transcription enhancing RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt downstream of the transcription starting site, preferably within a distance of 50 to 500 nt downstream of the transcription starting site, more preferred within a distance of 60 to 250 nt downstream of the transcription starting site, particularly preferred within a distance of 80 to 103 nt downstream of the transcription starting site. The sequence encoding the transcription enhancing RNA-polymerase binding aptamer according to the present invention is preferably also operatively linked to the sequence to be expressed, e.g. an open reading frame. The distance of the transcription enhancing RNA-polymerase binding aptamer encoding sequence to the sequence to be expressed is not crucial. However, with certain distances superior effects in terms of regulating the transcription and expression may be observed. Hence, in one embodiment the sequence encoding the transcription enhancing RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt upstream of the sequence to be expressed, or the multiple cloning site, preferably within a distance of 50 to 500 nt upstream of the sequence to be expressed, or the multiple cloning site, more preferred within a distance of 60 to 250 nt upstream of the sequence to be expressed, or the multiple cloning site, particularly preferred within a distance of 70 to 108 nt upstream of the sequence to be expressed, or the multiple cloning site. Any combination of the above mentioned preferred distances may be used, e.g. a distance of 10 to 1000 nt downstream of the transcription starting site and a distance of 10 to 1000 nt upstream of the sequence to be expressed, or the multiple cloning site, or distance of 50 to 500 nt downstream of the transcription starting site and a distance of 50 to 500 nt upstream of the sequence to be expressed, or the multiple cloning site, or a distance of 60 to 250 nt downstream of the transcription starting site and a distance of 60 to 250 nt upstream of the sequence to be expressed, or the multiple cloning site, of a distance of 80 to 103 nt downstream of the transcription starting site and a distance of 70 to 108 nt upstream of the sequence to be expressed, or the multiple cloning site, or any other possible combination. It has to be noted again that the term “transcription enhancing RNA-polymerase binding aptamer” refers to both, “activating RNA-polymerase binding aptamers” and “antiterminating RNA-polymerase binding aptamers”. Preferred transcription enhancing aptamers according to the present invention possess both features, i.e. are activating and antiterminating RNA-polymerase binding aptamers; particularly preferred are at least antiterminating RNA-polymerase binding aptamers.
It may further be advantageous to avoid hybridizing sequences in the region of the DNA sequence encoding the RNA-polymerase binding aptamer or the RNA-polymerase binding aptamer after transcription, such as reverse complement sequences. Without being bound by theory it is thought that such sequences under certain circumstances might hybridize to the DNA sequence or transcribed RNA-polymerase binding aptamer and thereby hinder it activity. Hence, in a preferred embodiment the DNA molecule or vector comprising the RNA-polymerase binding aptamer according to the present invention does not comprise a sequence that has 100% identity over a stretch of 20 nt or more. In a preferred embodiment the DNA molecule or vector does not include such hybridizing sequence on the same strand as the strand encoding the RNA-polymerase binding aptamer, preferably at least not on the strand section transcribed with the sequence encoding the RNA-polymerase binding aptamer. In a further preferred embodiment the DNA molecule or vector does not code for an RNA sequence hybridizing RNA-polymerase binding aptamer on the same RNA transcript, preferably at least not within a range of 30 nt surrounding the sequence encoding the RNA-polymerase binding aptamer is coded on.
Furthermore, the transcription enhancing RNA-polymerase binding aptamers exhibit anti-terminating activity (antiterminating RNA-polymerase binding aptamers), preferably when positioned upstream of a terminator sequence. In a preferred embodiment of the present invention the antiterminating RNA-polymerase binding aptamer is positioned upstream of a terminator sequence within the expression cassette. The terminator sequence may be positioned upstream, within or downstream of the sequence to be expressed. It may for example be part of the sequence encoding a protein of interest In such case it may be particularly desirable to include an antiterminating RNA-polymerase binding aptamer according to the present invention to allow sufficient expression of the protein without the need of interfering with the coding sequence comprising the terminator. Hence, the expression cassette according to the present invention preferably comprises a sequence encoding an antiterminating RNA-polymerase binding aptamer according to the present invention, preferably comprising a sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75, more preferred selected from the group consisting of SEQ ID NO:2, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, and SEQ ID NO:75. Embodiments of the sequences as outlined herein above apply likewise, e.g. with respect to identity.
The sequence encoding the antiterminating RNA-polymerase binding aptamer is preferably operatively linked to the terminator sequence, more preferably is positioned upstream of the terminator sequence. The distance of the antiterminating RNA-polymerase binding aptamer encoding sequence to the terminator sequence is not crucial. However, with certain distances superior effects in terms of regulating the transcription and expression may be observed. Hence, in one embodiment the sequence encoding the antiterminating RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt upstream of the terminator sequence, preferably within a distance of 15 to 500 nt upstream of the terminator sequence, more preferred within a distance of 15 to 250 nt upstream of the terminator sequence, particularly preferred within a distance of 20 to 111 nt upstream of the terminator sequence. Any combination of the above mentioned preferred distances may be used. In one particular preferred embodiment the antiterminating RNA-polymerase binding aptamer coding sequence is positioned 80 to 103 nt downstream of the transcription starting site and 20 to 111 nt upstream of a terminator sequence. Preferred terminator sequences are selected from the group consisting of intrinsic terminator sequences, and factor dependent terminator sequences as further outlined herein.
In a particular preferred embodiment the expression cassette according to the present invention comprises a sequence encoding a transcription enhancing RNA-polymerase binding aptamer according to the present invention, a promoter and a multiple cloning site, wherein the sequence encoding said transcription enhancing RNA-polymerase binding aptamer is located downstream of the transcription starting site of said promoter and upstream of said multiple cloning site. The expression cassette may be comprised in a DNA-molecule, preferably comprised in a vector. Particularly preferred the transcription enhancing RNA-polymerase binding aptamer is an antiterminating RNA-polymerase binding aptamer according to the present invention.
Furthermore, the invention also relates to the use of a transcription enhancing RNA-polymerase binding aptamer according to the present invention or a DNA-molecule encoding the same for enhancing the expression of an endogenous sequence within a host cell. The host cell preferably being a non-human host cell more preferably bacteria, particularly preferred E. coli. The DNA-molecule encoding said transcription enhancing RNA-polymerase binding aptamer according to the present invention is preferably inserted downstream of the promoter from which said endogenous sequence is transcribed. The invention also relates to a method for enhancing the expression of an endogenous sequence in a host cell comprising the step of inserting a DNA-molecule comprising a sequence encoding an transcription enhancing RNA-polymerase binding aptamer according to the present invention into the genome of said host cell such that the sequence encoding said transcription enhancing RNA-polymerase binding aptamer according to the present invention is operatively linked to the promoter of said endogenous sequence, preferably inserted downstream of said promoter. Further preferred said aptamer is inserted downstream of said promoter and upstream of said endogenous sequence.
Furthermore, the invention relates to the use of antiterminating RNA-polymerase binding aptamer or a DNA-molecule comprising a sequence encoding the same for suppressing for the presence of a terminator in an endogenous sequence of a host cell.
The skilled person may choose the desired transcription enhancing RNA-polymerase binding aptamer, in particular regarding the strength of enhancement. He may consider certain aspects therefore, like strength of the promoter, i.e. the degree of transcription taking place without the aptamer. Furthermore, the skilled person will consider the amount of expression desired, e.g. upon induction of transcription. To this end, he may choose from the transcription enhancing RNA-polymerase binding aptamers provided herewith, as they have different degrees of enhancement. Furthermore, a plurality of sequences encoding transcription enhancing RNA-polymerase binding aptamers may be used. Hence, in one embodiment of the present invention the expression cassette comprises a sequence encoding two or more transcription enhancing RNA-polymerase binding aptamer, preferably three or more sequences encoding an transcription enhancing RNA-polymerase binding aptamer. The sequence comprises said two or more, or said three or more RNA-polymerase binding aptamer encoding sequences preferably as consecutive sequences, e.g. tandem repeats, optionally separated by spacer sequences. The aptamers encoded by said sequence may all be the same transcription enhancing RNA-polymerase binding aptamer or they may encode different transcription enhancing RNA-polymerase binding aptamers.
The transcription enhancing RNA-polymerase binding aptamers according to the present invention are applicable to a variety of uses and methods. The present invention, hence, relates to the use of transcription enhancing RNA-polymerase binding aptamers according to the present invention for regulating expression of nucleic acids, preferably to the use of a DNA sequence encoding a transcription enhancing RNA-polymerase binding aptamers according to the present invention for regulating expression of a sequence to be expressed. Furthermore, the present invention relates to the use of a DNA-molecule according to the present invention for enhancing expression of a sequence to be expressed. Furthermore, the invention relates to the use of an expression cassette according to the present invention for the expression of sequence of interest, preferably a protein. Likewise the invention relates to the use of a vector according to the present invention for the expression of a sequence of interest, preferably a protein. The invention likewise relates to the use of an expression cassette according to the present invention, i.e. comprising a sequence encoding the transcription enhancing RNA-polymerase binding aptamers according to the present invention, for expression of a sequence of interest, e.g. a sequence encoding a protein of interest. The expression may be performed in a host cell or in in vitro expression systems. The DNA-molecule, expression cassette and/or vector according to the present invention may used in either a host cell or an in vitro expression system.
In vitro expression systems are known by those of ordinary skills in the art and are widely used in the laboratory practice and the assays (see e.g. Spirin, A.; Baranov, V.; Ryabova, L.; Ovodov, S.; Alakhov, Y. (1988). “A continuous cell-free translation system capable of producing polypeptides in high yield”. Science 242 (4882): 1162; Jackson, A M et al. (2004). “Cell-free protein synthesis for proteomics”. Briefings in Functional Genomics and Proteomics 2 (4): 308-319; Carlson, E D et al. (2011). “Cell-free protein synthesis: Applications come of age”. Biotechnol Adv 30 (5): 1185; and Bundy, Bradley C.; Franciszkowicz, Marc J.; Swartz, James R. (2008). “Escherichia coli-based cell-free synthesis of virus-like particles”. Biotechnology and Bioengineering 100 (1): 28-37). One example of an in vitro expression system is PURExpress® In Vitro Protein Synthesis Kit (NEB; https://www.neb.com/products/e6800-purexpress-invitro-protein-synthesis-kit), which may be employed in connection with the present application. In vitro expression systems employ a DNA-dependent RNA polymerase for transcription of a DNA molecule into RNA and optionally may employ ribosomes for translation of the RNA into a polypeptide if desired. The systems are designed predominantly for the phage polymerases (T7, SP6, etc.) but may also be further optimised for the bacterial RNAP, preferred RNAP from E. coli. It will be apparent that the RNA-binding aptamer according to the invention preferably binds to the polymerase of the in vitro expression system, preferably with the affinities as outlined herein, preferably a non-phage polymerase, more preferably a bacterial RNA polymerase of the in vitro expression or transcription system. Nowadays, most of the in vitro systems use phage polymerases due to high yield of transcription. Nevertheless, they are prone to affect secondary structures in the transcribed RNAs and RNA misfolding. Currently in vitro systems with bacterial RNA polymerases are available but not commonly used, as the RNA yield is usually low. It is apparent to the skilled person that this is improved by the present invention. The invention hence in a preferred embodiment relates to the use of DNA-molecule according to the present invention, preferably comprising a sequence encoding an transcription enhancing RNA-polymerase binding aptamer according to the invention, for use in in vitro transcription and/or expression. Preferably the transcription enhancing RNA-polymerase binding aptamer binds to the bacterial RNA polymerase used for in vitro transcription and/or expression. The invention, furthermore, relates to a method for in vitro transcription of a RNA of interest comprising providing a DNA-molecule according to the present invention, the DNA-molecule comprising a sequence encoding RNA-polymerase binding aptamer according to the present invention, preferably the DNA-molecule comprising an expression cassette according to the present invention, said DNA-molecule comprising a sequence encoding the RNA of interest operatively linked to a promoter and the sequence encoding the RNA-polymerase binding aptamer, incubating said DNA-molecule with a RNA-polymerase according to the present invention under conditions allowing transcription from said promoter, and optionally recovering the RNA of interest. The RNA-polymerase binding aptamer may be a transcription enhancing RNA-polymerase binding aptamer or an inhibiting RNA-polymerase binding aptamer according to the present invention, and combinations thereof. The nature of the aptamer may be chosen according to the needs, e.g. whether the transcription is to be enhanced or inhibited. In a particular preferred embodiment the RNA-polymerase binding aptamer is a transcription enhancing RNA-polymerase binding aptamer according to the invention.
The skilled person knows conditions suited for in vitro transcription. They include the presence of nucleoside triphosphates (NTP) and suited buffers. Nucleoside triphosphade include natural nucleoside triphosphates, like adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP), 5-methyluridine triphosphate (m5UTP), and uridine triphosphate (UTP), but are not limited thereto. Furthermore, cofactors such as magnesium may be added. In one embodiment incubation is performed using a cell extract. Furthermore, transcription inhibitors may be added, e.g. Rho inhibitor of E. coli. This allows finer controlling of the expression.
Furthermore, the invention relates to a method for in vitro expression of a protein of interest comprising providing a DNA-molecule according to the present invention, the DNA-molecule comprising a sequence encoding a transcription enhancing RNA-polymerase binding aptamer according to the present invention, preferably the DNA-molecule comprising an expression cassette according to the present invention, said DNA-molecule comprising a sequence encoding the protein of interest operatively linked to a promoter and the sequence encoding a transcription enhancing RNA-polymerase binding aptamer, incubating said DNA-molecule with components and under conditions allowing transcription from said promoter and translation of the transcript, and optionally recovering the protein of interest. In vitro expression is also known as cell free expression. Common components of a cell free reaction include a cell extract, an energy source, a supply of amino acids, cofactors of the translation and/or translation machinery of the cell extract, and the DNA-molecule according to the invention. Such cofactors may be divalent cations, such as magnesium. A cell extract may be obtained by lysing the cell of interest and centrifuging out the cell walls, DNA genome, and other debris. The remains are the necessary cell machinery including ribosomes, aminoacyl-tRNA synthetases, translation initiation and elongation factors, nucleases, etc. Preferred cell extracts are extracts from cells outlined above as suited host cells according to the present invention, preferably eukaryotic or prokaryotic cell extracts, preferably bacterial, more preferably cell extracts of E. coli.
The invention also relates to the use of a DNA-molecule comprising a sequence encoding a transcription enhancing RNA-polymerase binding aptamer according to the present invention for increasing the yield of an in vitro transcription or an in vitro expression assay. This may for example be performed by the method as outlined above or by including said DNA-molecule or at least the sequence into an available assay.
Two types of DNA-molecules can be used in in vitro expression or transcription, i.e. plasmids and linear DNA-molecules (linear expression templates). Plasmids are circular, and are normally made using cells. Linear DNA-molecules can be made much more effectively via PCR (see polymerase chain reaction), which replicates DNA much faster than raising cells containing the plasmid. While Linear DNA-molecules are easier and faster to make, plasmid yields are usually much higher in cell free expression. Because of this, much research today is focused on optimizing cell free expression yields to approach the yields of plasmids when using the linear DNA-molecules. The present invention provides a solution.
An “energy source” refers to a needed energy source for transcription and/or translation, is added to the reaction, e.g. in addition to the cell extract. Preferred energy sources are phosphoenol pyruvate, acetyl phosphate, and creatine phosphate.
“Amino acids” as used herein include natural and non natural amino acids, preferably one or more of following proteinogenic amino acids are are added: Alanine, Cysteine, Aspartic acid, Glutamic acid, Phenylalanine, Glycine, Histidine, Isoleucine, Lysine, Leucine, Methionine, Asparagine, Pyrrolysine, Proline, Glutamine, Arginine, Serine, Threonine, Selenocysteine, Valine, Tryptophan, and Tyrosine. In one embodiment all of the listed amino acids are added.
The invention, hence, also pertains kit comprising a DNA-molecule or vector according to the present invention. The kit may comprise further components usable for the respective assay they are intended for, e.g. components of an in vitro transcription or expression system, or reagents for transfection or transformation of a host cell, buffer or further reagents, or host cells as further described herein.
By using the transcription enhancing RNA-polymerase binding aptamers according to the present invention expression of sequences can be enhanced, e.g. sequences encoding a protein of interest. Hence, the present invention in a particular preferred embodiment relates to a method of producing a protein of interest comprising providing a vector according to the present invention, the vector comprising a sequence encoding a transcription enhancing RNA-polymerase binding aptamer according to the present invention, preferably the vector comprising an expression cassette according to the present invention, cloning the sequence encoding said protein of interest into said vector such that the sequence is operatively linked to the promoter and the sequence encoding the transcription enhancing RNA-polymerase binding aptamer to obtain the final expression vector, introducing said expression vector into a host cell, and culturing said host cell in culture medium under conditions inducing transcription from the promoter of the expression vector, and optionally recovering the protein of interest from the host cell or culture medium. It will be understood by the skilled artisan that the cloning step of the method of producing a protein of interest may be omitted if the initial vector comprises the sequence encoding said protein of interest. In this embodiment the method according to the present invention comprises the steps of providing a vector according to the present invention, preferably comprising an expression cassette according to the present invention, said vector comprising a sequence encoding the protein of interest operatively linked to the promoter and the sequence encoding a transcription enhancing RNA-polymerase binding aptamer, introducing said vector into a host cell, and culturing said host cell in culture medium under conditions inducing transcription from the promoter of the expression vector, and optionally recovering the protein of interest from the host cell or culture medium. The transcription enhancing RNA-polymerase binding aptamer may be an activating or an antiterminating RNA-polymerase binding aptamer, preferably an antiterminating according to the present invention and embodiments outlined above.
It has been found by the inventors that a subgroup of the RNA-polymerase binding aptamers according to the present invention decrease the activity of RNA polymerase when transcribed from a DNA strand encoding said aptamer. The efficiency of transcription of a DNA segment is decreased if the DNA segment comprises a sequence encoding a RNA-polymerase binding aptamer according to the invention. Hence, in one preferred embodiment of the present invention the RNA-polymerase binding aptamer decreases (reduces or inhibits) the activity of a RNA-polymerase. The RNA-polymerase binding aptamers according to this embodiment are also referred to interchangeably as “inhibiting” or “transcription inhibiting” RNA-polymerase binding aptamers. The skilled person is aware of methods, assays and systems to determine activity of RNA-polymerases and effects of sequences thereon. Such methods include the expression of reporter genes in a host cell or may apply in vitro transcription assays as outlined in detail herein above and in the enclosed Examples.
In one preferred embodiment the RNA-polymerase binding aptamer according the present invention inhibits the activity of the RNA-polymerase, preferably in a reporter assay as disclosed herein. The inhibition may be Rho-dependent or Rho-independent, preferably Rho-dependent.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer decreases or reduces expression of a sequence to be expressed, wherein the reduction of expression is at least 30% as compared to the expression of said sequence to be expressed when not comprising said inhibiting RNA-polymerase binding aptamer, preferably by at least 20%, even more preferably by at least 50%.
The inhibiting RNA-polymerase binding aptamer according to the present invention preferably comprises a transcription pausing motif, preferably with the consensus sequence of G−10Y−1G+1.
Furthermore, it has been found by the inventors that the inhibiting RNA-polymerase binding aptamers have an increased C-content median value of 33.67%, whereas G-content median value is 15.69% (
Consensus sequences have been identified which are common to different inhibiting RNA-polymerase binding aptamers. Hence, in a preferred embodiment of the inhibiting RNA-polymerase binding aptamer according to the invention, the aptamer comprises a sequence as encoded by a sequence selected from the group consisting of SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, and SEQ ID NO:87.
Different sequences fall within these consensus sequences; see Example 2. Hence, in a preferred embodiment of the inhibiting RNA-polymerase binding aptamer according to the invention, the aptamer comprises a sequence as encoded by a sequence of SEQ ID NO:84, and comprises as sequence encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:27, and SEQ ID NO:39, preferably encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:27, and SEQ ID NO:39; more preferably encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:27, and SEQ ID NO:39; yet further preferred encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:27, and SEQ ID NO:39. In a certain embodiment the inhibiting RNA-polymerase binding aptamer according to the invention comprises a sequence as encoded by a sequence of SEQ ID NO:84, and comprises a sequence encoded by a sequence selected from the group consisting of SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:27, and SEQ ID NO:39.
In a further preferred embodiment of the inhibiting RNA-polymerase binding aptamer according to the invention, the aptamer comprises a sequence as encoded by a sequence of SEQ ID NO:85, and comprises as sequence encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:31, SEQ ID NO:91, SEQ ID NO:88, SEQ ID NO:32, SEQ ID NO:26, and SEQ ID NO:36, preferably encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:31, SEQ ID NO:91, SEQ ID NO:88, SEQ ID NO:32, SEQ ID NO:26, and SEQ ID NO:36; more preferably encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:31, SEQ ID NO:91, SEQ ID NO:88, SEQ ID NO:32, SEQ ID NO:26, and SEQ ID NO:36; yet further preferred encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:31, SEQ ID NO:91, SEQ ID NO:88, SEQ ID NO:32, SEQ ID NO:26, and SEQ ID NO:36. In a certain embodiment the inhibiting RNA-polymerase binding aptamer according to the invention comprises a sequence as encoded by a sequence of SEQ ID NO:85, and comprises a sequence encoded by a sequence selected from the group consisting of SEQ ID NO:31, SEQ ID NO:91, SEQ ID NO:88, SEQ ID NO:32, SEQ ID NO:26, and SEQ ID NO:36.
In a further preferred embodiment of the inhibiting RNA-polymerase binding aptamer according to the invention, the aptamer comprises a sequence as encoded by a sequence of SEQ ID NO:86, and comprises as sequence encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:89, and SEQ ID NO:34, preferably encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:89, and SEQ ID NO:34; more preferably encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:89, and SEQ ID NO:34; yet further preferred encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:89, and SEQ ID NO:34. In a certain embodiment the inhibiting RNA-polymerase binding aptamer according to the invention comprises a sequence as encoded by a sequence of SEQ ID NO:86, and comprises a sequence encoded by a sequence selected from the group consisting of SEQ ID NO:89, and SEQ ID NO:34.
In a further preferred embodiment of the inhibiting RNA-polymerase binding aptamer according to the invention, the aptamer comprises a sequence as encoded by a sequence of SEQ ID NO:87, and comprises as sequence encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:33, SEQ ID NO:25; SEQ ID NO:92, SEQ ID NO:30, SEQ ID NO:90, and SEQ ID NO:38, preferably encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:33, SEQ ID NO:25; SEQ ID NO:92, SEQ ID NO:30, SEQ ID NO:90, and SEQ ID NO:38; more preferably encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:33, SEQ ID NO:25; SEQ ID NO:92, SEQ ID NO:30, SEQ ID NO:90, and SEQ ID NO:38; yet further preferred encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:33, SEQ ID NO:25; SEQ ID NO:92, SEQ ID NO:30, SEQ ID NO:90, and SEQ ID NO:38. In a certain embodiment the inhibiting RNA-polymerase binding aptamer according to the invention comprises a sequence as encoded by a sequence of SEQ ID NO:87, and comprises a sequence encoded by a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:40, SEQ ID NO:33, SEQ ID NO:25; SEQ ID NO:92, SEQ ID NO:30, SEQ ID NO:90, and SEQ ID NO:38.
The inventors found a plurality of inhibiting RNA-polymerase binding aptamers which decrease the activity of the RNA-polymerase they are binding to. Hence, the inhibiting RNA-polymerase binding aptamer according to the present invention is in one embodiment encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a further embodiment the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In yet a further embodiment the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a preferred embodiment the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 99% identity to a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. In a particular embodiment the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:92. The invention also relates to a nucleic acid hybridizing to any of the above sequences under stringent conditions.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:25, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:25. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:25. In a preferred embodiment of the inhibiting RNA-polymerase binding aptamer being encoded by a sequence having a certain identity to SEQ ID NO:25, the sequence encoding the aptamer is not altered at the nucleotides at positions corresponding to 10, 11, 18, 20, 21, and 24 to 27 of SEQ ID NO:25.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:26, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:26. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:26.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:27, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:27. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:27.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:28, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:28. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:28.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:29, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:29. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:29.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:30, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:30. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:30.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:31, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:31. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:31.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:32, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:32. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:32.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:33, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:33. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:33.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:34, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:34. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:34.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:35, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:35. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:35.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:36, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:36. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:36.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:37, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:37. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:37.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:38, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:38. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:38.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:39, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:39. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:39.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:40, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:40. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:40.
In a preferred embodiment of the present invention the inhibiting RNA-polymerase binding aptamer according to the present invention is encoded by a sequence having at least 80% identity to SEQ ID NO:41, preferably by a sequence having at least 90% identity, more preferably by a sequence having at least 95% identity, even more preferred by a sequence having at least 99% identity to SEQ ID NO:41. In a particular embodiment the inhibiting RNA-polymerase binding aptamer is encoded by the sequence of SEQ ID NO:41.
All embodiments as outlined above in general for the DNA-molecule, the expression cassette, the vector and the host cell apply likewise to the preferred embodiments of the inhibiting RNA-polymerase binding aptamers.
It will also be acknowledged by the skilled person that further regulation of expression of endogenous or exogenous sequence using the RNA-polymerase binding aptamers according to the present invention may be conducted by employing the dependency of the found mechanism on the Rho inhibition factor. When for example employing an inhibiting RNA-polymerase binding aptamer according to the present invention its inhibiting effect on the expression may controlled by the incubation of the expression system (in vitro or in vivo) with an inhibitor of Rho inhibitor, e.g. bicyclomycin. This allows a fine control of expression. Hence, in one embodiment of the uses and methods according to the present invention, an inhibitor of Rho transcription inhibitor is added to further regulate expression. The inhibitor or Rho transcription inhibitor according to the present invention is preferably bicyclomycin.
This is particularly useful in connection with the inhibiting RNA-polymerase binding aptamers according to the present invention. Hence, in one embodiment the invention relates to a method for controlled expression of a sequence of interest in a host cell comprising the steps of: providing a host cell comprising a DNA-molecule, said DNA molecule comprising a sequence encoding for an inhibiting RNA-polymerase binding aptamer according to the present invention operatively linked to a promoter and the sequence of interest, culturing said host cell in culture medium under conditions allowing expression of the sequence of interest at the desired level, wherein said culture medium comprises an inhibitor of Rho transcription inhibitor at a concentration allowing the expression at the desired level. The sequence of interest may be an exogenous sequence or an endogenous sequence of the host cell. Hence, the method may for example be used for controlled down-regulation (inhibition) of endogenous sequence of a host cell, e.g. for functional analysis. Any inhibiting RNA-polymerase according to the invention may be used, preferably those as defined above, as all tested inhibiting RNA-polymerase binding aptamers according to the invention have shown significant dependency on Rho inhibitor and bicyclomycin (BCM). Hence, in a preferred embodiment in this regard, the inhibiting RNA-polymerase binding aptamer is encoded by a sequence as defined herein above. However, preferred inhibiting RNA-polymerase binding aptamer is encoded by a sequence having at least 80% identity to a sequence selected from the group consisting of SEQ ID NO:25, and SEQ ID NO:92, preferably having at least 90% identity to a sequence selected from the group consisting of SEQ ID NO:25, and SEQ ID NO:92, more preferably having at least 95% identity to a sequence selected from the group consisting of SEQ ID NO:25, and SEQ ID NO:92, more the inhibiting RNA-polymerase binding aptamer is encoded by a sequence selected from the group consisting of SEQ ID NO:25, and SEQ ID NO:92.
The inhibitor of Rho transcription inhibitor according to the present invention is preferably bicyclomycin, preferably at a level of 1 μg/mL to 100 μg/mL; more preferably 1 μg/mL to 50 μg/mL, most preferably 5 μg/mL to 25 μg/mL, including 8 μg/mL, 10 μg/mL, 15 p/mL, 20 μg/ml, 24 μg/mL.
As outline herein above, the invention also relates to a DNA-molecule comprising a sequence encoding a RNA-polymerase binding aptamer according to the present invention. In a preferred embodiment the RNA-polymerase binding aptamer is an inhibiting RNA-polymerase binding aptamer, preferably selected from the group of inhibiting RNA-polymerase aptamers as listed herein above. As will be understood by the person skilled in the art, a DNA-molecule comprising a sequence encoding an inhibiting RNA-polymerase binding aptamer may preferably be used for repressing or inhibiting expression from a promoter to which it is operatively linked. Hence, in a preferred embodiment of the invention the DNA-molecule according to the present invention comprises a sequence encoding an inhibiting RNA-polymerase binding aptamer and a promoter, wherein the sequence encoding said aptamer is operatively linked to said promoter, preferably operatively linked to repress or reduce transcription from said promoter. Several approaches make use of promoters which from which transcription is only active or increased under certain conditions, like the presence of a certain compound or other environmental conditions, such as temperature. However, these approaches often suffer from the fact that the inducible promoters are not completely “tight” under non-induction conditions, i.e. basal transcription and expression occurring from the promoter even in the absence of the stimulus for inducing transcription from the promoter. This, however, may be deleterious to the host cell, particularly in the case of proteins of interest having deleterious properties. Under these circumstances it is desirable to provide for an expression system or cassette which is “tight” under non-induction conditions. The inventors now provide inhibiting RNA-polymerase binding aptamers which provide unexpected effects in this regard. By including a sequence encoding such inhibiting RNA-polymerase binding aptamer operatively linked to the promoter, it is possible to tighten the promoter. Hence, in a preferred embodiment the present invention relates to DNA-molecule comprising an expression cassette, said expression cassette comprising an inducible promoter according to the invention, and a sequence encoding an inhibiting RNA-polymerase binding aptamer according to the invention, wherein the sequence encoding the aptamer is operatively linked to said inducible promoter, preferably positioned downstream of said inducible promoter. The skilled person may choose the desired inhibiting RNA-polymerase binding aptamer, in particular regarding the strength of inhibition. He may consider certain aspects therefore, like leakiness of the promoter in the non-induced state, i.e. the degree of transcription taking place without a stimulus. Furthermore, the skilled person will consider the amount of expression desired upon induction of transcription. To this end, he may choose from the inhibiting RNA-polymerase binding aptamers provided herewith, as they have different degrees of inhibition. Furthermore, a plurality of sequences encoding inhibiting RNA-polymerase binding aptamers may be used. Hence, in one embodiment of the present invention the expression cassette comprises a sequence encoding two or more inhibiting RNA-polymerase binding aptamers, preferably three or more inhibiting RNA-polymerase binding aptamers. These aptamers may all be the same inhibiting RNA-polymerase binding aptamer or they may be different inhibiting RNA-polymerase binding aptamers according to the invention.
The present invention in one embodiment relates to the use of a sequence encoding an inhibiting RNA-polymerase binding aptamer according to the present invention for inhibiting expression of a protein, preferably for inhibiting expression of a protein in a microorganism, such as prokaryotic or eukaryotic cell lines, including bacteria and mammal cell lines. In a preferred embodiment of the present invention said sequence encoding said RNA-polymerase binding aptamer is positioned downstream of the promoter from which the expression of said protein is initiated. This means that the RNA-polymerase binding aptamer is part of the mRNA encoding the protein to be expressed. The use according to the present invention encompasses the use for regulating the expression of any protein of interest.
The invention also relates to the use, or a method of use of an inhibiting RNA-polymerase binding aptamer according to the present invention or a DNA-molecule encoding the same for inhibiting the expression of an endogenous sequence within a host cell. The host cell preferably being a non-human host cell. The DNA-molecule encoding said inhibiting RNA-polymerase binding aptamer according to the present invention is preferably inserted downstream of the promoter from which said endogenous sequence is transcribed. The invention also relates to a method for inhibiting the expression of an endogenous sequence in a host cell comprising the step of inserting a DNA-molecule comprising a sequence encoding an inhibiting RNA-polymerase binding aptamer according to the present invention into the genome of said host cell such that the sequence encoding said inhibiting RNA-polymerase binding aptamer according to the present invention is operatively linked to the promoter of said endogenous sequence, preferably inserted downstream of said promoter. Further preferred said aptamer is inserted downstream of said promoter and upstream of said endogenous sequence.
The sequence encoding the inhibiting RNA-polymerase binding aptamer is preferably operatively linked to a promoter and/or a sequence to be expressed, or a multiple cloning site. In particularly preferred embodiment of the present invention, the sequence encoding the inhibiting RNA-polymerase binding aptamer is operatively linked to a promoter, and optionally to a sequence to be expressed, or a multiple cloning site. It has been shown herein, that the inhibiting RNA-polymerase binding aptamers are able to decrease transcription/expression of a transcript comprising the aptamer. The sequence encoding the inhibiting RNA-polymerase binding aptamer according to the present invention is preferably operatively linked to the promoter sequence and the transcription starting site. It is preferably positioned downstream of the promoter and the transcription starting site on the DNA-molecule according to the present invention. The distance of the inhibiting RNA-polymerase binding aptamer encoding sequence to the transcription starting site is not crucial. However, with certain distances superior effects in terms of regulating and inhibiting the transcription may be observed. Hence, in one embodiment the sequence encoding the inhibiting RNA-polymerase binding aptamer according to the present invention is preferably positioned within a distance of 10 to 1000 nt downstream of the transcription starting site, preferably within a distance of 15 to 500 nt downstream of the transcription starting site, more preferred within a distance of 25 to 250 nt downstream of the transcription starting site. The sequence encoding the inhibiting RNA-polymerase binding aptamer according to the present invention is preferably also operatively linked to the sequence to be expressed, e.g. an open reading frame, or a multiple cloning site. The distance of the inhibiting RNA-polymerase binding aptamer encoding sequence to the sequence to be expressed is not crucial. However, with certain distances superior effects in terms of regulating and inhibiting the transcription and expression may be observed. Hence, in one embodiment the sequence encoding the inhibiting RNA-polymerase binding aptamer according to the present invention is preferably positioned at least 20 nt upstream of the sequence to be expressed, or the multiple cloning site, e.g. within a distance of 20 to 1000 nt upstream of the sequence to be expressed, or the multiple cloning site, preferably within a distance of 30 to 500 nt upstream of the sequence to be expressed, or the multiple cloning site, more preferred within a distance of 40 to 250 nt upstream of the sequence to be expressed, or the multiple cloning site. Any combination of the above mentioned preferred distances may be used, e.g. a distance of 10 to 1000 nt downstream of the transcription starting site and a distance of 20 to 1000 nt upstream of the sequence to be expressed, or distance of 15 to 500 nt downstream of the transcription starting site and a distance of 15 to 500 nt upstream of the sequence to be expressed, or a distance of 25 to 250 nt downstream of the transcription starting site and a distance of 40 to 250 nt upstream of the sequence to be expressed, or any other possible combination.
The inventors have found that one mechanism of function is temporarily uncoupling transcription from translation. Upon binding to the surface of RNA polymerase, the inhibiting RNA-binding aptamer create a physical barrier to ribosome progression (
The invention also relates to a method of producing a protein of interest comprising the steps of: providing a vector comprising an expression cassette according to the present invention, the expression cassette comprising a sequence encoding an inhibiting RNA-polymerase binding aptamer according to the present invention, introducing said vector into a host cell, and culturing said host cell in culture medium under conditions inducing transcription from the promoter of the expression cassette, and optionally recovering the protein of interest from the host cell or the the culture medium. The promoter preferably is selected from constitutive promoters or inducible promoters according to the invention, preferably being an inducible promoter according to the invention. Further, the protein to be expressed preferably having deleterious effects on said host cell. The method may comprise the step of culturing said host cell under conditions not inducing transcription from the inducible promoter of the expression cassette before the step of culturing said host cell in culture medium under conditions inducing transcription from the inducible promoter of the expression cassette.
In this specification, a number of documents including patent applications are cited. The disclosure of these documents, while not considered relevant for the patentability of this invention, is herewith incorporated by reference in its entirety. More specifically, all referenced documents are incorporated by reference to the same extent as if each individual document was specifically and individually indicated to be incorporated by reference.
The invention will now be described by reference to the following examples which are merely illustrative and are not to be construed as a limitation of the scope of the present invention.
The RNA library from SELEX cycle 7 was first reverse transcribed to cDNA and amplified by 16 cycles of PCR with Phusion polymerase (New England Biolabs) using primers with fixed part and added barcode for multiplexing according to P. Parameswaran et al., Nucleic Acids Res. 35, e130 (2007) and N. Windbichler, F. von Pelchrzim, O. Mayer, E. Csaszar, R. Schroeder, RNA Biol. 5, 30-40 (2008) (fixFW: TATAGGGGAATTCGGAGCGGG (SEQ ID NO:45, fixRV_multi: TAGCCCGGGATCCTCGGGGCTG (Sigma; SEQ ID NO:46)). Library preparation (adapter ligation step) and deep sequencing were performed at the CSF NGS unit http://csf.ac.at/. The SELEX library was multiplexed and sequenced with the Solexa technology on a GAIIx with paired-end 76 base-pair reads. Reads were trimmed using trimmomatic (A. M. Bolger, M. Lohse, B. Usadel, Bioinformatics. 30, 2114-20 (2014)) and mapped against the reference genomes of the E. coli strain K12 (substrain MG1655, GenBank ID: U00096.3) using NextGenMap (F. J. Sedlazeck, P. Rescheneder, A. von Haeseler, Bioinformatics. 29, 2790-1 (2013)). Read mapping was conducted with permissive settings that allowed up to 20% sequence divergence in order to account for the considerable amounts of sequence alterations introduced by the SELEX procedure (B. Zimmermann, I. Bilusic, C. Lorenz, R. Schroeder, Methods. 52, 125-32 (2010); and B. Zimmermann, T. Gesell, D. Chen, C. Lorenz, R. Schroeder, PLoS One, 5, e9169 (2010)). The mapped reads were then filtered for a minimum mapping quality of 20, which resulted in 915,594 paired-end reads. Corresponding ORF annotations were downloaded from EcoGene 3.0 (J. Zhou, K. E. Rudd, Nucleic Acids Res. 41, D613-24 (2013)) and 5′ and 3′ UTR annotations—from RegulonDB (J. Zhou, K. E. Rudd, Nucleic Acids Res, 41, D613-24 (2013)).
Strand-specific coverage signals from K12 alignments were extracted and a custom peak-finding method was applied to these signals in order to identify the originating genomic regions of mRNA-fragments that were found to be enriched in the respective SELEX cycle. Briefly, our peak-calling method consists of the following steps: (1) signal smoothing using a moving Gaussian kernel, (2) approximation of the first derivative at each signal position by cubic Hermite interpolation, (3) calling peaks based on derivative sign-changes and defined minimum/maximum peak dimensions (minimum width: 10 bp, maximum width: 500 bp, minimum (smoothed) peak height: T. L. Bailey et al., Nucleic Acids Res. 37, W202-8 (2009)).
Peak intervals were classified according to whether they overlap with annotated ORFs or UTRs. Intervals were annotated as being “antisense” when they mapped to the strand opposite an existing ORF or UTR annotation. Based on the genomic location they were classified into several categories: 5′UTR (corresponding sequence aligns within any annotated 5′UTR), 3′UTR (corresponding sequence aligns within any annotated 3′UTR), intragenic (sense) (corresponding sequence aligns within the annotated gene, excluding cases of 5′- and 3′UTR), antisense (corresponding sequence aligns on the strand opposite to the annotated gene) and intergenic (sense) (corresponding sequence in between the annotated genes).
In order to characterize the location of RAPs within a transcript, their relative positioning was calculated within annotated transcripts (
Motifs in the RAP sequences were identified using MEME algorithm (T. L. Bailey et al., Nucleic Acids Res. 37, W202-8 (2009)). Using default settings a (CAN)n motif was identified with a length of 29 bp in 1010 of the total identified 15,724 RAPs (6.4%). The vast majority (89%) of the RAPs containing this motif are anti-sense to an annotated ORF. Homer (S. Heinz et al., Mol. Cell. 38, 576-89 (2010)), another motif discovery tool, confirmed the (CAN)n motif when run with a motif length of 30 bp in 1909 of the 15,724 RAPs (12.14%).
Additionally, the RAPs were analyzed for the consensus RNA polymerase pausing sequence (see M. H. Larson et al., Science, 344, 1042-1047 (2014); and I. O. Vvedenskaya et al., Science, 344, 1285-1289 (2014)). The G−10Y−1G+1 motif was identified in 4656 (out of 5439) sense and in 8324 (out of 9956) anti-sense RAPs. To assess whether the motif is enriched, a Monte Carlo simulation was performed where we randomly shuffled the positions of the detected RAPs on the genome (keeping number, length and orientation of the RAPs constant) and counted the number of occurrences of the G−10Y−1G+1 motif in this random set for each trail. 10,000 trails were performed separately for sense and anti-sense RAP to get the distribution of the expected number of occurrences assuming that the RAPs are randomly distributed on the genome. The 95% quantile of these distributions is 4483 for sense and 8532 for anti-sense RAPs. By comparing these values to the actual number of RAPs containing the motif, it can be shown that there is an over representation of the G−10Y−1G+1 motif in the sense inhibitory RAPs.
Two E. coli strains were used: DH5α [F-φ80lacZΔM15 Δ(lacZYA-argF) U169 recA1 endA1 hsdR17 (rk−, mk+) phoA supE44 λ− thi-1 gyrA96 relA1] and BL21(DE3)pLysS [F−, ompT, hsdSB (rB−, mB−), dcm, gal, λ(DE3), pLysS, Cmr]. Cells were grown in LB media to logarithmic phase (OD600=0.5-0.6) or stationary (OD600=1.6-1.8) phase with aeration (shaking at 200 rpm) at 37° C. When required, e.g. for plasmid selection, cultures were supplemented with ampicillin (100 μg/ml) or kanamycin (30 μg/ml). Cells with pBAD reporters were induced with 0.2% L-arabinose (Sigma). BL21 cells with pET reporters (transcription by phage T7 RNA-polymerase) were induced with 1 mM IPTG. In experiments with bicyclomycin (BCM) treatment in liquid culture, cultures were first grown to early stationary phase (OD600≈0.4). BCM was added to a final concentration of 25 μg/mL or 75 μg/mL (like indicated) and cultivated for 20 minutes before harvesting. In experiments with bicyclomycin (BCM) treatment on plates, cells were grown on LB-agar plates supplemented with BCM added to a final concentration of 8 μg/mL (low concentration allowing cell growth). In growth experiments with stress induction, bacterial cells were cultivated for 20 minutes before harvesting (if not mentioned otherwise).
To monitor expression of RAP-lacZ fusions, E. coli cultures were grown overnight at 37° C. in LB medium with shacking, and afterwards new cultures were started from 1:100 dilution in fresh medium. Cultures were grown to logarithmic phase (OD600=0.5-0.6). To determine β-galactosidase activity, corresponding β-galactosidase assays were performed in triplicates as described in (H. Miller, Experiments in molecular genetics (1972); http://books.google.at/books/about/Experiments_in_molecular_genetics.html?id=PtVpAAAAMAAJ&pgis=1).
To monitor GFP fluorescence in transformed E. coli with RAP-GFP fusions, cells were plated by streaking on LB-agar plates supplemented with kanamycin (30 g/ml) and incubated for 16-20 hours at 37° C. Plates were photographed at GFP mode with excitation wavelength 460 nm (Fusion Fx7 Imager, PEQlab, Germany). To compare cell density, plates were imaged at visible light mode with excitation wavelength 510 nm (Fusion Fx7 Imager, PEQlab, Germany).
Total RNA isolation was performed using the hot phenol method. Bacterial cells grown to the particular OD600 (logarithmic phase) were collected and mixed with Stop Solution (95% ethanol, 5% phenol) in 8:1 ratio, followed by centrifugation at 3,000×g at 4° C. for 5 min. The obtained bacterial pellets were frozen in liquid N2. Pellets were resuspended in a lysis buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, 0.5 mg/mL lysozyme) with the addition of SDS to a final concentration of 0.1%. The obtained lysate was incubated at 64° C. for 2 min, followed by the addition of NaOAc (pH 5.2) to a final conc. of 0.1 M. Then an equal volume of phenol (water-saturated, pH ca 4.0, AppliChem GmbH) was added, gently mixed and incubated at 64° C. for 6 min with inverting 6-8 times. Afterwards, samples were cooled down on ice and centrifuged at 16,100×g (4° C., 10 min). The aqueous phase was mixed with the equal volume of phenol/chloroform/isoamyl alcohol (25:24:1, pH ca 4.0, AppliChem GmbH) and centrifuged for 5 min at 16,100×g (4° C.) in the Phase Lock Gel Heavy tube (5Prime). Then the aqueous phase was mixed with chloroform/isoamyl alcohol (24:1) followed by 5 min centrifugation at 16,100×g (4° C.). RNA was ethanol-precipitated from the aqueous phase (3 volumes ethanol, 1/10 volume of 3 M sodium acetate, 1/100 volume 0.5 M EDTA). To remove traces of DNA, total RNA was treated with Turbo DNase I (Roche) according to the manufacturer's protocol. RNA integrity was checked on agarose gel.
qRT-PCR
To measure the levels of transcripts, 1-2 μg total DNA-free RNA was reverse transcribed using random oligo 9-mers (Sigma) and SuperScript™ II Reverse Transcriptase (Life Technologies) according to the manufacturer's protocol. Depending on the target sequence, cDNA was amplified with the primers described below (see Table 4). In each case the real-time PCR amplification efficiency was estimated using the standard curve method in one color detection system as described before (L. Jolla, 87-112 (2004)). qPCR was performed using Eppendorf Mastercycler® RealPlex2 and 5× HOT FIREPol® EvaGreen® qPCR Mix Plus (no ROX) from Medibena as suggested by the manufacturers. gapA gene levels were used for internal normalization.
For Northern blotting (NB) 10 μg of total RNA was separated by gel electrophoresis using denaturing 8% polyacrylamide-TBE-Urea (8M) gels in 1×TBE. RNA was loaded onto the gel after denaturing at 70° C. (for 10 min) in 2×RNA load dye (Fermentas) followed by transferring on ice. Gel-separated RNA was transferred to HybondXL membranes (Ambion) by wet electro-blotting at 12 V for 1 hour in 0.5×TBE. The membranes were cross-linked by UV (150 mJ/cm2) and probed with 5′-end γ32P-labeled DNA oligonucleotide probes (see List of sequences) in ULTRAhyb®-Oligo Hybridization Buffer (Ambion) according to the manufacturer's instructions. The DNA labeling reaction was performed with T4 PNK (New England Biolabs) according to the manufacturer's instructions.
To precisely determine the 3′end of transcripts, 550 pmol of 5′-phosphorylated RNA adapter (adapter sequence-5′P-AAUGGACUCGUAUCACACCCGACAA-idT (SEQ ID NO:61)) was ligated to 6 μg of total RNA using Ligase 1 for ssRNA (New England Biolabs) according to the manufacturer's protocol overnight at 16° C. The reaction was mixed with the equal volume of phenol/chloroform/isoamyl alcohol (25:24:1, pH ca 4.0, AppliChem GmbH) and centrifuged for 5 min at 16,100×g (4° C.) in the Phase Lock Gel Heavy tube (5Prime). Then the aqueous phase was mixed with chloroform/isoamyl alcohol (24:1) followed by 5 min centrifugation at 16,100×g (4° C.). Nucleic acid was ethanol-precipitated from the aqueous phase (3 volumes ethanol, 1/10 volume of 3 M sodium acetate, 1/100 volume 0.5 M EDTA) and subjected to RT using SuperScript™ II Reverse Transcriptase (Life Technologies) followed by RNaseH treatment (Promega) according to the manufacturer's protocols. cDNA was amplified with Phusion High-Fidelity DNA Polymerase (NEB) using the forward primer to the reporter construct upstream RAP (5′-TTGGGCTAGAATTCGTGTTTA-3′ (SEQ ID NO:62)) and the reverse primer to the ligated adapter (5′-TTGTCGGGTGTGATACGAGTCCATT-3′ (SEQ ID NO:63)). The obtained PCR-amplified library was separated and checked by gel electrophoresis at 6% PAA/1×TBE gel. The obtained bands were gel-purified, TA-cloned using pGEM-T Easy Vector System (Promega, according to the manufacturer's instructions) and subjected to Sanger sequencing. ClustalW2 was used to create the alignments.
DNA templates with fused strong T7A1 promoter (SEQ ID NO:58) for in vitro transcription experiments were synthesized by PCR with Phusion Polymerase (New England Biolabs) using reporter plasmids with different RAPs and appropriate synthetic primers (Sigma Aldrich). Transcription reactions were performed in solution. Wild-type E. coli His6 RNAP (Evgeny Nudler Lab) and wild-type E. coli RNAP Holoenzyme (Affymetrix) were used in the assays. To assemble the transcription initiation complexes, 2-4 pmol of His6 RNAP were mixed with 1-2 pmol of DNA template in 20 μl of Transcription Buffer 100 (10 mM MgCl2, 40 mM Tris-HCl, pH 7.9, 100 mM NaCl) (in case of RNAP from Affymetrix, complexes were assembled with same amount of enzyme in 20 mM Tris.HCl pH 8.0 containing 20 mM NaCl, 20 mM MgCl2, 14 mM 2-mercaptoethanol, and 0.1 mM EDTA, as suggested by the company) and incubated for 5 min at 37° C. Then AUC RNA primer (Dharmacon) was added up to 10 μM together with GTP (25 μM) and ATP (25 μM) followed by incubation at 37° C. for 5 minutes. Next, [α-32P]-CTP (800 Ci/mmol, 2 μl, Hartmann Analytic) was added for another 2 min. To minimize RNA degradation 10 U of RNasin® Ribonuclease Inhibitor (Promega) were added to the reaction. For the reactions with Rho and NusG, the purified transcription factors were added in a concentration of up to 0.5 μM to the transcriptional mixtures where indicated, followed by additional incubation at 37° C. for 5 minutes. Chasing reaction was performed with all four NTPs added to a final concentration of 10-50 μM. To prevent re-initiation, rifampicin (Sigma) was added concurrently at this step to a final concentration of 10 μM. At the indicated time points, an aliquot of chase reaction was quenched with 3× Stop Solution, containing 100 mM EDTA, 8M UREA, 0.025% xylenthianol and 0.025% bromophenol blue. The products were separated by 6% denaturing TBE-PAGE (8M urea). RAP sequence mapping within the transcript was performed by transcribing RAP-containing template and 25 μM of NTPs mixed with one of the four 3′ dNTPs (3′dGTP, 3′dATP, 3′dUTP or 3′dCTP) at 3:1 ratio in four different transcription reactions for 10 min.
Construction of Reporter Construct pWM3110
The assays performed herein use a reporter system based on the plasmid pWM3110 as constructed by the present inventors. This construct was obtained by adapting a well-characterized GFP-containing plasmid pWM015 (W. G. Miller et al., Appl. Environ. Microbiol. 66, 5426-36 (2000), and C. H. Eggers et al., Mol. Microbiol. 43, 281-95 (2002), both incorporated herein by reference). pWM015 contains a Kanamycin resistance gene (KanR) and the gene encoding for the green fluorescence protein (GFP) under control of a bacterial constitutive promoter (SEQ ID NO:66). pWM1015 was adapted through extending 5′UTR of the GFP gene with additional 108 nt in order to introduce unique XhoI and SacI restriction enzyme recognition sites for successful cloning in the 5′UTR. For this purpose, the insert of SEQ ID NO:64 was cloned between EcoRI site in the original pWM015 plasmid. pWM015 was digested (linearized) using EcoRI (Thermo Scientific) according to manufacturer's protocol. The synthetically produced double stranded oligonucleotide of SEQ ID NO:64 (Sigma Aldrich) was digested using EcoRI and ApoI (Thermo Scientific) according to the manufacturer's protocol, resulting in overhangs compatible with EcoRI overhangs to allow proper orientation of the construct. The digestion products were ligated with T4 DNA Ligase (New England Biolabs) according to the manufacturer's protocol. Correct and single insertion was confirmed by bacterial colony PCR and subsequent Sanger sequencing. The sequence of the resulting pWM3110 is given as SEQ ID NO:65.
The different sequences encoding RNA-Polymerase binding aptamers were cloned into pWM3310, supra, using the unique XhoI and SacI restriction sites. The synthetically produced (Sigma Aldrich) double stranded SEQ ID NO:68 (containing rut Rho termination insert), SEQ ID NO:69 (containing rrnB intrinsic terminator insert), SEQ ID NO:70 (containing T7t intrinsic terminator insert), or SEQ ID NO:71 (containing the combination of three T7t-tR2-rrnB intrinsic terminators) were digested with SacI (site at the 5′ end of each construct) and BsiHKA1(ALw211) (a SacI compatible restriction site on the 3′end allowing preservation of SacI as a unique restriction site in the final construct) according the manufacturer's protocol. Thereafter, the digested and purified constructs were cloned into SacI-linearized pWM3310 by ligation with T4 DNA Ligase (New England Biolabs) according to the manufacturer's protocol. Correct and single insertion was confirmed by bacterial colony PCR and subsequent Sanger sequencing. The constructs were named according to the terminator sequence they contained as pWM3110_rut, pWM3110_rrnB, pWM3110_T7t, and pWM3110_triple terminator, respectively.
For assays employing lacZ as the reporter gene a derivative of pBAD24 was constructed. pBAD24 contains an bacterial arabinose-inducible promoter (SEQ ID NO:67) and an ampicillin resistance gene (AmpR) (L. M. Guzman et al., 177 (1995), incorporated herein by reference). The original ribosomal binding site of pBAD24 was removed from the multiple cloning site by restriction with NheI/EcoRI, followed by DNA Polymerase I Large (Klenow) Fragment treatment and blunt ligation (recirculation) to obtain pBAD13. A new +70nt RBS (SEQ ID NO:80) was cloned between the XbaI/PstI restriction sites and the lacZ gene (SEQ ID NO:79) between the PstI/HindIII restriction sites of pBAD13 to obtain pBAD13_lacZ.
The different RNA-polymerase binding aptamers were cloned into pBAD13_lacZ between the EcoRI/NcoI restriction sites to obtain pB13_RAP_lacZ. To ensure that the observed effects are not affected by the initial RAP 3′end-flanking sequence a further +99nt insert (cobC-derived, no RAP contained SEQ ID NO:80) was inserted through NcoI/Xbal restriction sites. The resulting plasmid was pB13_RAP_+99_lacZ. To ensure that the effects observed were also not affected by initial RAP 5′-flanking sequences the respective RAP sequence was exchanged by a construct containing additional 49 nt (SEQ ID NO:81) at the 5′-end of the RAP and the RAP-insert using the EcoRI/NcoI restriction sites, resulting in pB13_+49RAP_+99_lacZ.
A further construct was created to test effects on expression from T7 promoter. To this end, the orginal RBS was removed from the MCS of pET21a (Novagen) by Xba/NdeI restriction, followed by DNA Polymerase I Large (Klenow) Fragment treatment and blunt ligation (recirculation). The resulting construct was named pET21_minRBS. The RAPs were cloned into pET21_minRBS by amplification of the full length RAP_+99_lacZ including additional 5 nt from the transcription starting site from pB13_RAP_+99_lacZ as the template and pET21_Insert_FW_NheI (SEQ ID NO:82) and pET21_Insert_RV_HindII: (SEQ ID NO:83) as primers. The amplification product was cloned into pET21_minRBS using the NheI/HindIII restriction sites.
Identification of RAPs that Promote Activation of Transcription
To monitor the potential effect of identified RAPs on transcription, they were tested using a plasmid-based GFP reporter pWM3110 (see
To test for specificity, the same RAP-containing reporter constructs were expressed from the phage T7 promoter using phage T7 RNA Polymerase (
Control experiments were performed to check if the observed transcriptional activation could be explained by enhanced promoter-like RNAP recruitment to the RAP-encoding DNA sequence. To rule it out, activating RAPs were checked in a promoter activity assay. RAPs were cloned individually to the reporter construct replacing the constitutive promoter (
Further checked was whether activating RAP#5713 would activate transcription when being located upstream of the promoter, in a non-transcribed region (
For further characterization activating RAPs capacity to suppress termination was investigated. Two types of termination signals have been described in bacteria: factor-independent (intrinsic) and factor-dependent (Rho-dependent) ones (Santangelo and Artsimovitch, 2011).
First, the effect of activating RAPs on the efficiency of read-through the intrinsic termination signal was tested. Two different heterologous RNAP intrinsic terminators were cloned in-between a RAP signal and the GFP reporter (
RAPs that Suppress Rho-Dependent Termination
The activating RAPs where tested as signals able to activate read-through the Rho-dependent termination signal. Rho-dependent termination requires Rho binding to specific nascent RNA regions, named Rho utilization (rut) sites. One of the well-characterized rut sites (Krebs et al., “Lewin's GENES X”, 2009) was inserted in between RAP #5713 and the reporter gene GFP (
The antitermination potential of RAP #5713 (SEQ ID NO:2) was further analyses by inserting 3 heterologous intrinsic terminators in a row upstream of GFP (
To gain a first insight into the mechanism of RAP-mediated transcription antitermination, in vitro transcription studies were performed to test if direct RAP#5713-RNAP interaction would be sufficient for efficient antitermination and transcription enhancing, respectively. Template with intrinsic terminator rrnB was used for these studies. RAP#5713 or control Rev#5713 (sequence, reverse complement to RAP#5713) was placed upstream of the terminator (
To gain insight into the frequency of activating RAPs in the 7th cycle SELEX pool, they were cloned into the reporter construct shown in
To identify further common features of the transcription enhancing RAPs sequence and secondary structure analysis were performed.
In contrast to the known antiterminatory phage sequences, the transcription enhancing RNA-polymerase binding aptamers according to the present invention are (a) much shorter, (b) have no common particular requirements for complex secondary structure according to predictions with specialized software (like Vienna RNA Package, RNAfold—http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi; or mfold http://mfold.rna.albany.edu/?q=mfold). An analysis of the secondary structure of RAP ID #5713 (SEQ ID NO:2) is shown in
Sequence analysis has been performed with the sequences of SEQ ID NOs:2 to 14 as well as the sequences of mutated variants of RAP ID #5713 (SEQ ID NO:72-75) to identify a common motif. All sequences were combined with the RAP ID #5713 (SEQ ID NO:2). Prediction was performed with The MEME Suite 4.10.0: Motif-based sequence analysis tools (T. Bailey, M. Bodén, F. Buske, M. Frith, C. Charles E. Grant, L. Clementi, J. Ren, W. Li, W. Noble, “MEME SUITE: tools for motif discovery and searching”, Nucleic Acids Research, 37:W202-W208, 2009). Reported activating RAP sequences were submitted to MEME program with the following parameters: “site distribution”—one per sequence; “search given strand only”. The analysis revealed a consensus sequence of SEQ ID NO:1 for enhancing RAPs (see
To test for potential activity of the named activating RAPs in trans in vivo, a reporter system was designed consisting of two compatible plasmids suitable for the co-transformation in E. coli (
The presented results provide an essential advantage for the targeted use of activating RAPs as, while activating the expression of the desired sequence in cis, they do not affect the expression of the other host genes in trans.
RAPs with Transcription Enhancing Properties Derived from Eukaryotic Organisms
By the definition, RAPs are natural RNA aptamers binding RNAP with high affinity. The above RAPs originate from the bacterial (E. coli) genome. Further investigations were preformed using genomic SELEX with the RNAP II for yeast and Homo sapiens. These yeast/human RAP sequences with high affinity to the eukaryotic RNAP have been named yh1 to yh45 (SEQ ID NO:93 to 137). Taking into account the conservation of RNAP machineries between all the cellular organisms, it follows that the obtained yeast/human RAPs can activate and antiterminate transcription by prokaryotic RNAP in cis, possessing the corresponding activities as previously above bacteria derived RAPs. To prove this, a number of eukaryotic RAPs was tested for the effect on transcription activation in bacteria. The pWM3110_rrnB construct with GFP reporter (schematically presented in
The presented sequences can be described with a Regular Expression for Motif: [AC][AC][AG]CAN[AC][AT][CTG]A[CT][AT]CCNA[AC]N[AT]C[AC] (SEQ ID NO:138; prediction with GLAM2, see
Further, a motif common for all enhancing aptamers, pro- and eukaryotic has been identified using GLAM2 prediction which is reflected in SEQ DI NO:139. As will be acknowledged by the skilled person, this motif also very well reflects the essential nucleotides of the core motif as identified for the RAP encoded by SEQ ID NO:2.
Identification of RAPs that Inhibit Transcription
The enriched sequences obtained under the most stringent selection conditions (7 cycles of genomic SELEX) were deep-sequenced and ˜1.0 million reads were mapped to the E. coli K12 MG1655 genome followed by peak calling using a custom algorithm. Overall, we identified over 15,000 RAPs—RNA aptamers with high affinity to RNAP—in the E. coli genome (Table S1). The Kd of the total selected pool was shown to be below 10 nM (see N. Windbichler, F. von Pelchrzim, O. Mayer, E. Csaszar, R. Schroeder, RNA Biol. 5, 30-40). The majority of RAPs (64.3%) maps antisense to genes and approximately ⅓ of them (31.5%) are intragenic (
To elucidate a possible effect of RAPs on transcription, the activity of representative sense and antisense RAPs was tested using a plasmid-based LacZ reporter (
To test the specificity of the observed RAP effect for bacterial RNAP, the “inhibiting” RAPs were cloned in a similar reporter system with the phage T7 promoter. IPTG-inducible E. coli BL21 strain was used to ensure transcription by phage T7 RNAP. In this case, we did not observe any significant reduction of transcript levels in response to RAPs (
To gain further insight into the mechanism leading to transcription inhibition by RAPs, Northern blot analysis of RAP-containing transcripts were performed. Total RNA from logarithmic growing cells was probed with oligonucleotides annealing before and after the RAPs (probe location is indicated in
To elucidate the mechanism of RAP-triggered transcription termination, it was searched for possible terminator hairpins formed between the RAP and downstream sequences in the reporter construct. No potential intrinsic terminators could be identified. Hence, RAP-mediated termination may be Rho-dependent. To test this, bicyclomycin (BCM)—a highly specific antibiotic that inhibits Rho (14)—was utilized. Upon addition of BCM (25 μg/ml) to the cells transformed with the RAP constructs described above, transcription inhibition by RAPs #1086 and #15 was abolished (
To directly confirm Rho dependence of the RAP effects, single round in vitro transcription assays using synthetic DNA templates with selected RAPs and the strong T7A1 promoter were performed. Without Rho, the in vitro transcription with E. coli RNAP yielded full-length runoff transcripts (
To investigate whether RAPs affected elongation by E. coli RNAP, single-round time-course transcriptional assays using DNA templates with RAP #15 and its reverse complement sequence at the same position as a control were performed. The position of RAP was precisely mapped (
To address the physiological role of this phenomenon the activity of RAP #15 in its natural genomic context was studied. In E. coli RAP #15 is encoded within the ORF of the nicotinate-mononucleotide adenylyltransferase (nadD) gene. NadD is an essential enzyme required for de novo biosynthesis and salvage pathways of redox cofactors NAD+ and NADP+ (R. A. Mehl, C. Kinsland, T. P. Begley, J. Bacteriol. 182, 4372-4 (2000); and H. Zhang et al., Structure. 10, 69-79 (2002)). This led to the hypothesis that RAP #15 would trigger transcription termination within its nadD gene. In this case, the steady state level of the nadD mRNA upstream of RAP #15 sequence should be higher than just downstream of RAP #15. We designed RAP flanking primers to amplify the respective regions of the nadD mRNA and compared the amount of mRNA up- and downstream of RAP#15 (
Strikingly, RAP15-mediated Rho termination occurs within the coding region of nadD. Because transcription and translation are directly coupled in bacteria (S. Proshkin, A. R. Rahmouni, A. Mironov, E. Nudler, Science. 328, 504-8 (2010)), the leading ribosome normally follows RNAP closely to prevent Rho-dependent termination. To verify that efficient translation does indeed take place during RAP15-mediated termination, constructed nad(RAP15)-GFP translational fusion was constructed, placing RAP15 in its natural genomic context in frame with the reporter gene (
From the data it concluded that natural RNA polymerase aptamers are specific signals impacting on transcription in a controllable manner.
Subgroups of the identified inhibiting were analyzed for presence of consensus sequences. Using default parameters of ClustalW2 for nucleic acid sequences clusters of related sequences and the respective consensus sequences of SEQ ID NO:84 ([A/T][T/C][G/T/A]A[A/C][G[A/C/T][A/C][G/C][A/C/T][G/T][G/A][A/C][A/T][G/C][G/T/A][A/T][A/C]), SEQ ID NO:85 (G[A/C][G/T/A/C][G/T]AT[T/C][G/C/A][G/C][G/T/A/C]T[G/T], SEQ ID NO:86 (GCCA[G/T]GA[G/T]CG), and SEQ ID NO:87 ([G/T/C][A/C/T][A/C][G/C/A][G/T/A][G/C][G/C/A][G/A][G/C][G/A][A/T] could be identified. The consensus sequence of SEQ ID NO:84 is present in at least RAP ID NOs: 5039, 9533, 10822, 10070, and 1510; SEQ ID NO:85 is present in at least RAP ID NOs:1800, 1760, 4599, 3930, 683, 803, and 4599, SEQ ID NO:86 is present in at least RAP ID NOs: 1436, and 15003; and SEQ ID NO:87 is present in RAP ID NOs: 9505, 10243, 11599, 15, 1086, 2136, 6819, and 9559. Results are also shown in
The presented in vitro and in vivo studies show a role of Rho in RAP-mediated transcription inhibition. Rho is a ring-shaped hexameric RNA helicase acting on the elongation complex to promote transcription termination (E. Nudler, M. E. Gottesman, Genes to Cells. 7, 755-768 (2002); and J. W. Roberts, Nature. 224, 1168-74 (1969)). In the inhibitory RAPs the C-content median value is 33,67%, whereas G-content median value is 15,69% (
Taken together, the identification of widespread RAPs as a novel class of transcription-regulating RNA signals reveals that direct interaction of the nascent RNA with RNAP broadly impacts on the composition of the E. coli transcriptome. RAPs realize RNA's potential as a self-regulatory system.
E. coli gene
GRHWDSATBMGRKMRVKDBAGCA
Herein, nucleotide sequences are shown using the expanded letter code. In addition to the conventional GATC symbols, the expanded letter code to indicate a position within a sequence that may be flexible when defining sequences. The following table outlines the letter used and specifies which nucleotides may be included at the respective position of the sequence; e.g. if the letter H is used at position 1, this indicates that position 1 of the sequence may either be A, C, or T.
CTTGAGGGGTTTTTTTACCGTTAATAACAGGCCTGCTGG
TAATCGCAGGCCTTTTTATTTTGTTACACGCACAGCGGC
GCCTTTCGTTTTATCTGTTAATTTTGTTTAACTTttACT
GAATTCATCGCGTGTGCGTATTCGATCGAATTGGAT
AAATTC
CTTCCTTCTCCCCATCGCTACCTCATATCCGCACCT
CCTCAAACGCTACCTCGACCAGCCTCCCTCCCTCCC
CGTTTTATCTGTTAATTTTGTTTAACTTTAGGAGTG
GTTTTTTTTAATTTTGTTTAACTTTAGGAGTGCTC
TAAACGGGTCTTGAGGGGTTTTTTTACCGTTAATAA
CAGGCCTGCTGGTAATCGCAGGCCTTTTTATTTTGT
GCTCAGTCGGAAGACTGGGCCTTTCGTTTTATCTGT
GRHWDSATBMGRKMRVKDBAGCA
GACAACATGAGAACAGTTCAGCAGCACTA
Mutations with respect to SEQ ID NO:2 are underlined. In SEQ ID NO:1 and 2 nucleotides without alternatives in the consensus sequence of SEQ ID NO:1 are in italics. SEQ ID NO:75 is shortened by 6 nt at the 3′-end as compared to SEQ ID NO:2. It hence, only comprises the sequence corresponding to the consensus sequence of SEQ ID NO:1.
The present invention while not narrowing the above, in particular relates to the following items:
Number | Date | Country | Kind |
---|---|---|---|
15162198.4 | Apr 2015 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2016/056827 | 3/29/2016 | WO | 00 |