RNA REPLICON FOR EXPRESSING AT CELL RECEPTOR OR AN ARTIFICIAL T CELL RECEPTOR

TECHNICAL FIELD OF THE INVENTION

The present invention embraces a RNA replicon that can be replicated by a replicase of alphavirus origin and comprises an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor. Such RNA replicons are useful for expressing a T cell receptor or an artificial T cell receptor in a cell, in particular an immune effector cell such as a T cell. Cells engineered to express such T cell receptor or artificial T cell receptor are useful in the treatment of diseases characterized by expression of antigens bound by the T cell receptor or artificial T cell receptor.

BACKGROUND OF THE INVENTION

T cells play a central role in cell-mediated immunity in humans and animals. The recognition and binding of a particular antigen is mediated by the T cell receptors (TCRs) expressed on the surface of T cells. The TCR of a T cell is able to interact with immunogenic peptides (epitopes) bound to major histocompatibility complex (MHC) molecules and presented on the surface of target cells. Specific binding of the TCR triggers a signal cascade inside the T cell leading to proliferation and differentiation into a maturated effector T cell.

The TCR is a part of a complex signaling machinery, which includes the heterodimeric complex of the TCR α- and β-chains, the co-receptor CD4 or CD8 and the CD3 signal transduction module. The TCR α/β heterodimer is responsible for antigen recognition and relaying the activation signal through the cell membrane in concert with CD3, while the CD3 chains themselves transfer the incoming signal to adaptor proteins inside the cell. Thus, the transfer of the TCR α/β chains offers the opportunity to redirect T cells towards any antigen of interest.

Adoptive cell transfer (ACT) based immunotherapy can be broadly defined as a form of passive immunization with previously sensitized T cells that are transferred to non-immune recipients or to the autologous host after ex vivo expansion from low precursor frequencies to clinically relevant cell numbers. Cell types that have been used for ACT experiments include lymphokine-activated killer (LAK) cells (Mule, J. J. et al. (1984) Science 225, 1487-1489; Rosenberg, S. A. et al. (1985) N. Engl. J. Med. 313, 1485-1492), tumor-infiltrating lymphocytes (TILs) (Rosenberg, S. A. et al. (1994) J. Natl. Cancer Inst. 86, 1159-1166), donor lymphocytes after hematopoietic stem cell transplantation (HSCT) as well as tumor-specific T cell lines or clones (Dudley, M. E. et al. (2001) J. Immunother. 24, 363-373; Yee, C. et al. (2002) Proc. Natl. Acad. Sci. U. S. A 99, 16168-16173). Adoptive T cell transfer was shown to have therapeutic activity against human viral infections such as CMV. For adoptive immunotherapy of melanoma Rosenberg and co-workers established an ACT approach relying on the infusion of in vitro expanded autologous tumor-infiltrating lymphocytes (TILs) isolated from excised tumors in combination with a non-myeloablative lymphodepleting chemotherapy and high-dose IL2. A clinical study resulted in an objective response rate of ˜50% of treated patients suffering from metastatic melanoma (Dudley, M. E. et al. (2005) J. Clin. Oncol. 23: 2346-2357).

An alternative approach is the adoptive transfer of autologous T cells reprogrammed to express a tumor-reactive immunoreceptor of defined specificity during short-time ex vivo culture followed by reinfusion into the patient (Kershaw M.H. et al. (2013) Nature Reviews Cancer 13 (8):525-41). This strategy makes ACT applicable to a variety of common malignancies even if tumor-reactive T cells are absent in the patient. Since the antigenic specificity of T cells is rested entirely on the heterodimeric complex of the TCR α- and β-chain, the transfer of cloned TCR genes into T cells offers the potential to redirect them towards any antigen of interest. Therefore, TCR gene therapy provides an attractive strategy to develop antigen-specific immunotherapy with autologous lymphocytes as treatment option. Major advantages of TCR gene transfer are the creation of therapeutic quantities of antigen-specific T cells within a few days and the possibility to introduce specificities that are not present in the endogenous TCR repertoire of the patient. Several groups demonstrated, that TCR gene transfer is an attractive strategy to redirect antigen-specificity of primary T cells (Morgan, R. A. et al. (2003) J. Immunol. 171, 3287-3295; Cooper, L. J. et al. (2000) J. Virol. 74, 8207-8212; Fujio, K. et al. (2000) J. Immunol. 165, 528-532; Kessels, H. W. et al. (2001) Nat. Immunol. 2, 957-961; Dembic, Z. et al. (1986) Nature 320, 232-238). Feasibility of TCR gene therapy in humans was initially demonstrated in clinical trials for the treatment of malignant melanoma by Rosenberg and his group. The adoptive transfer of autologous lymphocytes retrovirally transduced with melanoma/melanocyte antigen-specific TCRs resulted in cancer regression in up to 30% of treated melanoma patients (Morgan, R. A. et al. (2006) Science 314, 126-129; Johnson, L. A. et al. (2009) Blood 114, 535-546). In the meantime clinical testing of TCR gene therapy was extended also to cancers other than melanoma targeting many different tumor antigens (Park, T. S. et al., (2011) Trends Biotechnol. 29, 550-557).

The use of genetic engineering approaches to insert antigen-targeted receptors of defined specificity into T cells has greatly extended the potential capabilities of ACT. Chimeric antigen receptors (CARs) are a type of antigen-targeted receptor composed of intracellular T cell signaling domains fused to extracellular antigen-binding domains, most commonly single-chain variable fragments (scFv's) from monoclonal antibodies. CARs directly recognize cell surface antigens, independent of MHC-mediated presentation, permitting the use of a single receptor construct specific for any given antigen in all patients. Initial CARs fused antigen-recognition domains to the CD3ζ activation chain of the T cell receptor (TCR) complex. Subsequent CAR iterations have included secondary costimulatory signals in tandem with CD3ζ, including intracellular domains from CD28 or a variety of TNF receptor family molecules such as 4-1BB (CD137) and OX40 (CD134). Further, third generation receptors include two costimulatory signals in addition to CD37, most commonly from CD28 and 4-1BB. Second and third generation CARs dramatically improved antitumor efficacy in vitro and in vivo (Zhao et al., (2009) J. Immunol., (183) 5563-5574), in some cases inducing complete remissions in patients with advanced cancer (Porter et al., (2011) N.Engl.J.Med., (365) 725-733).

A classical CAR consists of an antigen-specific single chain antibody (scFv) fragment, fused to a transmembrane and signaling domain such as CD3ζ. Upon introduction into T cells it is expressed as a membrane-bound protein and induces immune responses upon binding to its cognate antigen (Eshhar et al., (1993) PNAS, (90) 720-724). The induced antigen-specific immune response results in the activation of cytotoxic CD8+ T cells which in turn leads to the eradication of cells expressing the specific antigen, such as tumor cells or virus-infected cells expressing the specific antigen. These classical CAR constructs do not activate/stimulate the T cells through their endogenous CD3 complex, which is normally essential for T cell activation. Due to the fusion of the antigen binding domain to CD37, T cell activation is induced through a biochemical “short circuit” (Aggen et al., (2012) Gene Therapy, (19) 365-374).

An alternative approach, in which activation of the T cell occurs through a more physiological mechanism, was the provision of an analogous single chain-TCR (scTv)-fragment fused to the CB constant domain derived from the T cell receptor (TCR) and its co-expression with a TCR-derived Ca constant domain (Voss et al., (2010) Blood, (115) 5154-5163), the latter which recruits the essential endogenous CD3ζ homodimer (Call et al., (2002) Cell, (111) 967-79.). In order for these constructs to function as immune system activators, it was essential that their constant domains originate from murine TCRs or need to be murinized (Cohen et al., (2006) Cancer Res., (66) 8878-86; Bialer et al., (2010) J. Immunol., (184) 6232-41) to achieve chain pairing between the scTCR and C.

Alternate recombinant artificial T cell receptors, in which the receptor, upon antigen binding, is able to activate the T cell in which it is expressed have been described.

It is generally thought that the number of transferred T cells is correlated with therapeutic responses. However, the generation of T cells suitable for adoptive T cell transfer still remains a challenge.

Nucleic acid molecules comprising foreign genetic information encoding one or more polypeptides for prophylactic and therapeutic purposes have been studied in biomedical research for many years. Influenced by safety concerns associated with the use of deoxyribonucleic acid (DNA) molecules, ribonucleic acid (RNA) molecules have received growing attention in recent years. Various approaches have been proposed, including administration of single stranded or double-stranded RNA, in the form of naked RNA, or in complexed or packaged form, e.g. in non-viral or viral delivery vehicles. In viruses and in viral delivery vehicles, the genetic information is typically encapsulated by proteins and/or lipids (virus particle). For example, engineered RNA virus particles derived from RNA viruses have been proposed as delivery vehicle for treating plants (WO 2000/053780 A2) or for vaccination of mammals (Tubulekas et al., 1997, Gene, vol. 190, pp. 191-195). In general, RNA viruses are a diverse group of infectious particles with an RNA genome. RNA viruses can be sub-grouped into single-stranded RNA (ssRNA) and double-stranded RNA (dsRNA) viruses, and the ssRNA viruses can be further generally divided into positive-stranded [(+) stranded] and/or negative-stranded [(−) stranded] viruses. Positive-stranded RNA viruses are prima facie attractive as a delivery system in biomedicine because their RNA may serve directly as template for translation in the host cell.

Alphaviruses are typical representatives of positive-stranded RNA viruses. The hosts of alphaviruses include a wide range of organisms, comprising insects, fish and mammals, such as domesticated animals and humans. Alphaviruses replicate in the cytoplasm of infected cells (for review of the alphaviral life cycle see José et al., Future Microbiol., 2009, vol. 4, pp. 837-856). The total genome length of many alphaviruses typically ranges between 11,000 and 12,000 nucleotides, and the genomic RNA typically has a 5′-cap, and a 3′ poly(A) tail. The genome of alphaviruses encodes non-structural proteins (involved in transcription, modification and replication of viral RNA and in protein modification) and structural proteins (forming the virus particle). There are typically two open reading frames (ORFs) in the genome. The four non-structural proteins (nsP1-nsP4) are typically encoded together by a first ORF beginning near the 5′ terminus of the genome, while alphavirus structural proteins are encoded together by a second ORF which is found downstream of the first ORF and extends near the 3′ terminus of the genome. Typically, the first ORF is larger than the second ORF, the ratio being roughly 2:1.

In cells infected by an alphavirus, only the nucleic acid sequence encoding non-structural proteins is translated from the genomic RNA, while the genetic information encoding structural proteins is translatable from a subgenomic transcript, which is an RNA molecule that resembles eukaryotic messenger RNA (mRNA; Gould et al., 2010, Antiviral Res., vol. 87 pp. 111-124). Following infection, i.e. at early stages of the viral life cycle, the (+) stranded genomic RNA directly acts like a messenger RNA for the translation of the open reading frame encoding the non-structural poly-protein (nsP1234). In some alphaviruses, there is an opal stop codon between the coding sequences of nsP3 and nsP4: polyprotein P123, containing nsP1, nsP2, and nsP3, is produced when translation terminates at the opal stop codon, and polyprotein P1234, containing in addition nsP4, is produced upon readthrough of this opal codon (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562; Rupp et al., 2015, J. Gen. Virology, vol. 96, pp. 2483-2500). nsP1234 is autoproteolytically cleaved into the fragments nsP123 and nsP4. The polypeptides nsP123 and nsP4 associate to form the (−) strand replicase complex that transcribes (−) stranded RNA, using the (+) stranded genomic RNA as template. Typically at later stages, the nsP123 fragment is completely cleaved into individual proteins nsP1, nsP2 and nsP3 (Shirako & Strauss, 1994, J. Virol., vol. 68, pp. 1874-1885). All four proteins form the (+) strand replicase complex that synthesizes new (+) stranded genomes, using the (−) stranded complement of genomic RNA as template (Kim et al., 2004, Virology, vol. 323, pp. 153-163, Vasiljeva et al., 2003, J. Biol. Chem. vol. 278, pp. 41636-41645).

In infected cells, subgenomic RNA as well as new genomic RNA is provided with a 5′-cap by nsP1 (Pettersson et al. 1980, Eur. J. Biochem. 105, 435-443; Rozanov et al., 1992, J. Gen. Virology, vol. 73, pp. 2129-2134), and provided with a poly-adenylate [poly(A)] tail by nsP4 (Rubach et al., Virology, 2009, vol. 384, pp. 201-208). Thus, both subgenomic RNA and genomic RNA resemble messenger RNA (mRNA).

Alphavirus structural proteins (core nucleocapsid protein C, envelope protein E2 and envelope protein E1, all constituents of the virus particle) are typically encoded by one single open reading frame under control of a subgenomic promoter (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562). The subgenomic promoter is recognized by alphaviral non-structural proteins acting in cis. In particular, alphavirus replicase synthesizes a (+) stranded subgenomic transcript using the (−) stranded complement of genomic RNA as template. The (+) stranded subgenomic transcript encodes the alphavirus structural proteins (Kim et al., 2004, Virology, vol. 323, pp. 153-163, Vasiljeva et al., 2003, J. Biol. Chem. vol. 278, pp. 41636-41645). The subgenomic RNA transcript serves as template for translation of the open reading frame encoding the structural proteins as one poly-protein, and the poly-protein is cleaved to yield the structural proteins. At a late stage of alphavirus infection in a host cell, a packaging signal which is located within the coding sequence of nsP2 ensures selective packaging of genomic RNA into budding virions, packaged by structural proteins (White et al., 1998, J. Virol., vol. 72, pp. 4320-4326).

In infected cells, (−) strand RNA synthesis is typically observed only in the first 3-4 h post infection, and is undetectable at late stages, at which time the synthesis of only (+) strand RNA (both genomic and subgenomic) is observed. According to Frolov et al., 2001, RNA, vol. 7, pp. 1638-1651, the prevailing model for regulation of RNA synthesis suggests a dependence on the processing of the non-structural poly-protein: initial cleavage of the non-structural polyprotein nsP1234 yields nsP123 and nsP4; nsP4 acts as RNA-dependent RNA polymerase (RdRp) that is active for (−) strand synthesis, but inefficient for the generation of (+) strand RNAs. Further processing of the polyprotein nsP123, including cleavage at the nsP2/nsP3 junction, changes the template specificity of the replicase to increase synthesis of (+) strand RNA and to decrease or terminate synthesis of (−) strand RNA.

The synthesis of alphaviral RNA is also regulated by cis-acting RNA elements, including four conserved sequence elements (CSEs; Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562; and Frolov, 2001, RNA, vol. 7, pp. 1638-1651).

In general, the 5′ replication recognition sequence of the alphavirus genome is characterized by low overall homology between different alphaviruses, but has a conserved predicted secondary structure. The 5′ replication recognition sequence of the alphavirus genome is not only involved in translation initiation, but also comprises the 5′ replication recognition sequence comprising two conserved sequence elements involved in synthesis of viral RNA, CSE 1 and CSE 2. For the function of CSE 1 and 2, the secondary structure is believed to be more important than the linear sequence (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562).

In contrast, the 3′ terminal sequence of the alphavirus genome, i.e. the sequence immediately upstream of the poly(A) sequence, is characterized by a conserved primary structure, particularly by conserved sequence element 4 (CSE 4), also termed “19-nt conserved sequence”, which is important for initiation of (−) strand synthesis.

CSE 3, also termed “junction sequence” is a conserved sequence element on the (+) strand of alphaviral genomic RNA, and the complement of CSE 3 on the (−) strand acts as promoter for subgenomic RNA transcription (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562; Frolov et al., 2001, RNA, vol. 7, pp. 1638-1651). CSE 3 typically overlaps with the region encoding the C-terminal fragment of nsP4.

In addition to alphavirus proteins, also host cell factors, presumably proteins, may bind to conserved sequence elements (Strauss & Strauss, supra).

Alphavirus-derived vectors have been proposed for delivery of foreign genetic information into target cells or target organisms. In simple approaches, the open reading frame encoding alphaviral structural proteins is replaced by an open reading frame encoding a protein of interest. Alphavirus-based trans-replication systems rely on alphavirus nucleotide sequence elements on two separate nucleic acid molecules: one nucleic acid molecule encodes a viral replicase (typically as poly-protein nsP1234), and the other nucleic acid molecule is capable of being replicated by said replicase in trans (hence the designation trans-replication system). trans-replication requires the presence of both these nucleic acid molecules in a given host cell. The nucleic acid molecule capable of being replicated by the replicase in trans must comprise certain alphaviral sequence elements to allow recognition and RNA synthesis by the alphaviral replicase.

There is a need to provide immune effector cells such as T cells expressing a T cell receptor or an artificial T cell receptor and which are suitable for adoptive cell transfer, in a safe and efficient manner. As described herein, the aspects and embodiments of the present invention address this need.

SUMMARY OF THE INVENTION

Immunotherapeutic strategies represent promising options for the prevention and therapy of e.g. infectious diseases and cancer diseases. The identification of a growing number of pathogen- and tumor-associated antigens led to a broad collection of suitable targets for immunotherapy. The present invention embraces improved agents and methods suitable for efficient expression of T cell receptors or artificial T cell receptors in immune effector cells such as T cells, suitable for immunotherapeutic treatment for the prevention and therapy of diseases.

The present invention demonstrates that alphavirus-derived RNA vectors (RNA replicons) are useful for expressing T cell receptors or artificial T cell receptors in immune effector cells such as T cells. Such immune effector cells are functional in that they bind antigen through their receptors and exhibit effector functions of immune effector cells.

Different types of RNA replicons are useful according to the invention. In one type of RNA replicon the open reading frame of an alphavirus-derived RNA vector encoding alphavirus structural proteins is replaced by an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor. A respective replicon is illustrated as “Cis-replicon; WT-RRS” in FIG. 1. Other types of RNA replicons according to the invention relate to alphavirus-based trans-replication systems. A respective replicon is illustrated as “trans-replicon; WT-RRS” in FIG. 1. Such replicon is associated with the advantage of allowing for amplification of an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor under control of a subgenomic promoter.

The open reading frame encoding nsP1234 typically overlaps with the 5′ replication recognition sequence of the alphavirus genome (coding sequence for nsP1) and typically also with the subgenomic promoter comprising CSE 3 (coding sequence for nsP4). Accordingly, in such “trans-replicon”, the 5′ replication recognition sequence required for RNA replication comprises an AUG start codon for nsP1 and thus overlaps with the coding sequence for the N-terminal fragment of the alphavirus non-structural protein and a replicon comprising the 5′ replication recognition sequence will typically encode (at least) a part of alphavirus non-structural protein, typically the N-terminal fragment of nsP1. This is disadvantageous in several aspects: In the case of cis-replicons this overlap limits for instance adaptation of codon usage of the replicase ORF to different mammalian target cells (human, mouse, farm animals). It is conceivable that the secondary structure of the 5′ replication recognition sequence as it is found in the viruses is not optimal in every target cell. However, the secondary structure cannot be altered freely as possibly resulting amino acid changes in the replicase ORF have to be considered and tested for the effect on replicase function. It is also not possible to exchange the complete replicase ORF for replicases from heterologous origin since this can results in disruption of the 5′ replication recognition sequence structure. In the case of trans-replicons this overlap results in the synthesis of a fragment of nsP1 protein since the 5′ replication recognition sequence needs to be retained in trans replicons. A fragment of nsP1 is typically not required and not desired: the undesired translation imposes an unnecessary burden on the host cell, and RNA replicons intended for therapeutic applications that encode, in addition to a pharmaceutically active protein, a fragment of nsP1, may face regulatory concerns. For instance, it will be necessary to demonstrate that the truncated nsP1 does not create unwanted side effects. In addition, the presence of an AUG start codon for nsP1 within the 5′ replication recognition sequence has prevented the design of trans-replicons encoding a heterologous gene of interest in a fashion wherein the start codon for translation of the gene of interest is at the most 5′ position that is accessible for ribosomal translation initiation. In turn, 5′-cap-dependent translation of transgenes from prior art trans-replicon RNA is challenging, unless cloned as fusion protein in frame to the start codon of nsP1 (such fusion constructs are described e.g. by Michel et al., 2007, Virology, vol. 362, pp. 475-487). Such fusion constructs lead to the same unnecessary translation of the nsP1 fragment mentioned above, raising the same concerns as above. Moreover, fusion proteins cause additional concerns as they might alter the function or activity of the fused transgene of interest, or when used as vaccine vector, peptides spanning the fusion region could alter immunogenicity of the fused antigen.

Accordingly, the present invention provides a further type of RNA replicon which comprises sequence elements required for replication by the replicase, but these sequence elements do not encode any protein or fragment thereof, such as an alphavirus non-structural protein or fragment thereof. Thus, the sequence elements required for replication by the replicase and protein-coding regions are uncoupled. A respective replicon is illustrated as “Trans-replicon; Δ5ATG-RRSΔSGP” in FIG. 1. Uncoupling is achieved by the removal of at least one initiation codon compared to a native alphavirus genomic RNA. The replicase may be encoded by the RNA replicon or by a separate nucleic acid molecule. In one particularly preferred embodiment, such replicon does not comprise a subgenomic promotor and the start codon for translation of the open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor is at the most 5′ position that is accessible for ribosomal translation initiation.

In a first aspect, the present invention provides a RNA replicon comprising an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor.

In one embodiment, the T cell receptor comprises a T cell receptor α-chain and a T cell receptor β-chain. In one embodiment, the RNA replicon comprises an open reading frame encoding a T cell receptor chain of a T cell receptor and a further open reading frame encoding a different T cell receptor chain of the T cell receptor.

In one embodiment, the artificial T cell receptor comprises a single chain and the RNA replicon comprises an open reading frame encoding said single chain of said artificial T cell receptor.

In one embodiment, the artificial T cell receptor comprises more than one chain and the RNA replicon comprises an open reading frame encoding a chain of the artificial T cell receptor and one or more further open reading frame(s) encoding different chains of the artificial T cell receptor. In one embodiment, the artificial T cell receptor comprises two chains and the RNA replicon comprises an open reading frame encoding a chain of the artificial T cell receptor and a further open reading frame encoding a different chain of the artificial T cell receptor.

In one embodiment, the artificial T cell receptor comprises an antigen binding domain, a transmembrane domain and a T cell signaling domain.

In one embodiment, the T cell receptor or artificial T cell receptor targets a disease-specific antigen, preferably a tumor antigen.

In one embodiment, the RNA replicon is a cis-replicon or trans-replicon.

In one embodiment, in particular if the RNA replicon is a cis-replicon, the RNA replicon comprises an open reading frame encoding functional alphavirus non-structural protein.

In one embodiment, in particular if the RNA replicon is a trans-replicon, the RNA replicon does not comprise an open reading frame encoding functional alphavirus non-structural protein. In this embodiment, the functional alphavirus non-structural protein for replication of the replicon may be provided in trans as described herein.

In one embodiment, the RNA replicon comprises a first open reading frame encoding a protein of interest, e.g. functional alphavirus non-structural protein or a chain of a T cell receptor or of an artificial T cell receptor. In one embodiment, the first open reading frame does not overlap with the 5′ replication recognition sequence.

If the RNA replicon is a cis-replicon, the first open reading generally will be an open reading encoding functional alphavirus non-structural protein. In this embodiment, the RNA replicon generally comprises at least one further open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor which is under control of a subgenomic promotor. If the RNA replicon is a trans-replicon, the first open reading generally will be an open reading encoding a chain of a T cell receptor or of an artificial T cell receptor and the RNA replicon preferably comprises no open reading frame encoding functional alphavirus non-structural protein.

In one embodiment, the RNA replicon comprises a 5′ replication recognition sequence, wherein the 5′ replication recognition sequence is characterized in that it comprises the removal of at least one initiation codon compared to a native alphavirus 5′ replication recognition sequence.

In one embodiment, the RNA replicon comprises a (modified) 5′ replication recognition sequence and a first open reading frame encoding a protein of interest, e.g. functional alphavirus non-structural protein or a chain of a T cell receptor or of an artificial T cell receptor, located downstream from the 5′ replication recognition sequence, wherein the 5′ replication recognition sequence and the first open reading frame encoding a protein of interest do not overlap and preferably the 5′ replication recognition sequence does not overlap with any open reading frame of the RNA replicon, e.g. the 5′ replication recognition sequence does not contain a functional initiation codon and preferably does not contain any initiation codon. Most preferably, the initiation codon of the first open reading frame is in the 5′->3′ direction of the RNA replicon the first functional initiation codon, preferably the first initiation codon. In one embodiment, the first open reading frame and preferably the entire RNA replicon does not express non-functional alphavirus non-structural protein, such as a fragment of alphavirus non-structural protein, in particular a fragment of nsP1 and/or nsP4. In one embodiment, the functional alphavirus non-structural protein is heterologous to the 5′ replication recognition sequence. In one embodiment, the first open reading frame is not under control of a subgenomic promotor.

In one embodiment, the first open reading frame encodes functional alphavirus non-structural protein and the RNA replicon comprises at least one further open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor which is under control of a subgenomic promotor. In one embodiment, the subgenomic promotor and the first open reading frame do not overlap.

In another embodiment, the first open reading frame encodes a chain of a T cell receptor or of an artificial T cell receptor and the RNA replicon preferably comprises no open reading frame encoding functional alphavirus non-structural protein. The RNA replicon may comprise at least one further open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor (e.g., a chain of a T cell receptor or of an artificial T cell receptor which together with the chain of a T cell receptor or of an artificial T cell receptor encoded by the first open reading frame forms a functional T cell receptor or artificial T cell receptor) which is under control of a subgenomic promotor. In one embodiment, the subgenomic promotor and the first open reading frame do not overlap.

In one particularly preferred embodiment, the first open reading frame, located downstream from the 5′ replication recognition sequence, encodes a chain of a T cell receptor or of an artificial T cell receptor, the 5′ replication recognition sequence and the first open reading frame do not overlap, the 5′ replication recognition sequence does not contain a functional initiation codon and preferably does not contain any initiation codon. and the RNA replicon does not comprise an open reading frame encoding functional alphavirus non-structural protein. In this embodiment, the initiation codon of the first open reading frame is in the 5′→3′ direction of the RNA replicon the first functional initiation codon, preferably the first initiation codon such that the RNA replicon does not express non-functional alphavirus non-structural protein, such as a fragment of alphavirus non-structural protein, in particular a fragment of nsP1 and/or nsP4. The RNA replicon may comprise at least one further open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor (e.g., a chain of a T cell receptor or of an artificial T cell receptor which together with the chain of a T cell receptor or of an artificial T cell receptor encoded by the first open reading frame forms a functional T cell receptor or artificial T cell receptor) which is under control of a subgenomic promotor. In one embodiment, the subgenomic promotor and the first open reading frame do not overlap.

In one embodiment, the 5′ replication recognition sequence of the RNA replicon comprises sequences homologous to conserved sequence element 1 (CSE 1) and conserved sequence element 2 (CSE 2) of an alphavirus.

In a preferred embodiment, the RNA replicon comprises CSE 2 and is further characterized in that it comprises a fragment of an open reading frame of a non-structural protein from an alphavirus. In a more preferred embodiment, said fragment of an open reading frame of a non-structural protein does not comprise any initiation codon.

In one embodiment, the 5′ replication recognition sequence comprises a sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus, wherein the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus is characterized in that it comprises the removal of at least one initiation codon compared to the native alphavirus sequence.

In a preferred embodiment, the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus is characterized in that it comprises the removal of one or more initiation codons other than the native start codon of the open reading frame of a non-structural protein. In a more preferred embodiment, said nucleic acid sequence is additionally characterized by the removal of the native start codon of the open reading frame of a non-structural protein, preferably of nsP1.

In a preferred embodiment, the 5′ replication recognition sequence comprises one or more stem loops providing functionality of the 5′ replication recognition sequence with respect to RNA replication. In a preferred embodiment, one or more stem loops of the 5′ replication recognition sequence are not deleted or disrupted. More preferably, one or more of stem loops 1, 3 and 4, preferably all stem loops 1, 3 and 4, or stem loops 3 and 4 are not deleted or disrupted. More preferably, none of the stem loops of the 5′ replication recognition sequence is deleted or disrupted.

In a preferred embodiment, the RNA replicon comprises one or more nucleotide changes compensating for nucleotide pairing disruptions within one or more stem loops introduced by the removal of at least one initiation codon.

In one embodiment, the RNA replicon does not comprise an open reading frame encoding a truncated alphavirus non-structural protein.

In one embodiment, the RNA replicon comprises a 3′ replication recognition sequence.

In one embodiment, the RNA replicon is characterized in that the protein of interest encoded by the first open reading frame can be expressed from the RNA replicon as a template. In one embodiment, the RNA replicon comprises a subgenomic promotor controlling production of subgenomic RNA comprising the first open reading frame.

In one embodiment, the RNA replicon is characterized in that it comprises a subgenomic promoter. Typically, the subgenomic promoter controls production of subgenomic RNA comprising an open reading frame encoding a protein of interest.

In one embodiment, the protein of interest encoded by the first open reading frame can be expressed from the RNA replicon as a template. In a more preferred embodiment, the protein of interest encoded by the first open reading frame can additionally be expressed from the subgenomic RNA.

In a preferred embodiment, the RNA replicon is further characterized in that it comprises a subgenomic promoter controlling production of subgenomic RNA comprising a second open reading frame encoding a protein of interest. The protein of interest may be a second protein that is identical to or different from the protein of interest encoded by the first open reading frame.

In a more preferred embodiment, the subgenomic promoter and the second open reading frame encoding a protein of interest are located downstream from the first open reading frame encoding a protein of interest.

In one embodiment, the RNA replicon can be replicated by functional alphavirus non-structural protein.

In a second aspect, the present invention provides a system comprising:

- a RNA construct for expressing functional alphavirus non-structural protein,
- the RNA replicon according to the first aspect of the invention, which can be replicated by the functional alphavirus non-structural protein in trans. Preferably, the RNA replicon is further characterized in that it does not encode a functional alphavirus non-structural protein.

In one embodiment, the RNA replicon according to the first aspect or the system according to the second aspect is characterized in that the alphavirus is Venezuelan equine encephalitis virus.

In a third aspect, the present invention provides a DNA comprising a nucleic acid sequence encoding the RNA replicon according to the first aspect of the present invention.

In a further aspect, the present invention provides a method of producing an immunoreactive cell comprising the step of transducing a T cell or a progenitor thereof with one or more RNA replicons of the invention encoding the chains of a T cell receptor or the chain(s) of an artificial T cell receptor, or DNA encoding said RNA replicons. In one embodiment, the cell expresses the T cell receptor or artificial T cell receptor on its cell surface.

In a further aspect, the present invention provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor, the method comprising the steps of:

- (a) obtaining one or more RNA replicons of the invention, which RNA replicon(s) comprise(s) an open reading frame encoding functional alphavirus non-structural protein, can be replicated by the functional alphavirus non-structural protein and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (b) inoculating the RNA replicon(s) or the DNA into a cell.

In a further aspect, the present invention provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor, the method comprising the steps of:

- (a) obtaining a RNA construct for expressing functional alphavirus non-structural protein or DNA comprising nucleic acid sequence encoding the RNA construct,
- (b) obtaining one or more RNA replicon(s) of the invention, which RNA replicon(s) can be replicated by the functional alphavirus non-structural protein in trans and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (c) co-inoculating the RNA construct or the DNA and the RNA replicon(s) or the DNA into a cell.

In one embodiment of the invention, a T cell receptor or an artificial T cell receptor comprises more than one chain such as two chains. In one embodiment, a RNA replicon comprises one or more open reading frames encoding all chains of said T cell receptor or artificial T cell receptor. In one embodiment, different RNA replicons comprise open reading frames encoding different chains of said T cell receptor or artificial T cell receptor. In the latter embodiment, these different RNA replicons may be co-inoculated (optionally together with a RNA construct for expressing functional alphavirus non-structural protein) into a cell to provide a functional T cell receptor or artificial T cell receptor.

In one embodiment, the cell expresses the T cell receptor or artificial T cell receptor on its cell surface.

In a further aspect, the present invention provides a cell produced by the method of the invention for producing a cell. In one embodiment, the cell is a recombinant cell.

In a further aspect, the present invention provides a cell expressing a T cell receptor or an artificial T cell receptor comprising one or more RNA replicon(s) of the invention, which RNA replicon(s) comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor. The cell may have the T cell receptor or artificial T cell receptor on its cell surface.

In one embodiment, the above cell is a cell which is useful for adoptive cell transfer. The cell may be an immune effector cell or stem cell, preferably an immunoreactive cell. The immunoreactive cell may be a T cell or progenitor thereof, preferably a cytotoxic T cell or progenitor thereof. In one embodiment, the cell is a human cell. In one embodiment, the modified cell is reactive with a disease-associated antigen. In one embodiment, said antigen is present on the surface of a cell such as a diseased cell. In one embodiment, said antigen is presented on the surface of a cell such as a diseased cell in the context of MHC molecules. In one embodiment, the modified cell is reactive with a disease-associated antigen when presented in the context of MHC. In one embodiment, said cell lacks surface expression of an endogenous TCR.

The present invention generally embraces the treatment of diseases by targeting cells expressing an antigen such as diseased cells expressing a disease-specific antigen, in particular cancer cells expressing a tumor antigen. The cells may express the antigen on their surface and/or may present the antigen. The methods provide for the selective eradication of cells that express an antigen, thereby minimizing adverse effects to normal cells not expressing the antigen. In one embodiment, T cells genetically modified according to the invention to express a T cell receptor or an artificial T cell receptor targeting the cells through binding to antigen, in particular when present on the surface of a cell or when presented in the context of MHC, are administered. T cells are able to recognize diseased cells expressing antigen, resulting in the eradication of diseased cells. In one embodiment, the target cell population or target tissue is tumor cells or tumor tissue.

In a further aspect, the present invention provides a pharmaceutical composition comprising the RNA replicon of the invention, e.g., comprising a set of RNA replicons of the invention each RNA replicon encoding one of the different chains of a T cell receptor or of an artificial T cell receptor, the DNA of the invention, or the cell of the invention, and a pharmaceutically acceptable carrier.

The pharmaceutical composition of the invention may be used as a medicament, in particular in the treatment of a disease such as cancer characterized by expression of antigen which is bound by, i.e., targeted by, the T cell receptor or artificial T cell receptor such as a tumor antigen.

In a further aspect, the present invention provides the pharmaceutical composition of the invention for use as a medicament.

In a further aspect, the present invention provides the pharmaceutical composition of the invention for use in the treatment of a disease involving cells characterized by expression of an antigen which is targeted by the T cell receptor or artificial T cell receptor.

In a further aspect, the present invention provides a method for the treatment of a disease comprising administering to a subject a therapeutically effective amount of the pharmaceutical composition of the invention, wherein the disease involves cells characterized by expression of an antigen which is targeted by the T cell receptor or artificial T cell receptor.

In a further aspect, the present invention provides a method of treating a subject having a disease involving cells characterized by expression of an antigen, the method comprising administering to the subject cells produced by the method of the invention for producing a cell expressing a T cell receptor or an artificial T cell receptor targeting the antigen.

In one embodiment of the invention, an antigen is a tumor antigen. In one embodiment of the invention, the disease is cancer. In one embodiment, the cells such as T cells may be autologous, allogeneic or syngeneic to the subject.

In one embodiment of all aspects of the invention, the method of treating further comprises obtaining a sample of cells from a subject, the sample preferably comprising T cells or T cell progenitors, and transfecting the cells with one or more replicons described herein encoding a T cell receptor or an artificial T cell receptor or DNA encoding these replicons to provide cells such as T cells genetically modified to express a T cell receptor or an artificial T cell receptor. In one embodiment of all aspects of the invention, the cells genetically modified to express a T cell receptor or an artificial T cell receptor are transiently transfected with nucleic acid encoding the T cell receptor or artificial T cell receptor. Thus, the nucleic acid encoding a T cell receptor or an artificial T cell receptor is not integrated into the genome of the cells. In one embodiment of all aspects of the invention, the cells and/or the sample of cells are from the subject to which the cells genetically modified to express a T cell receptor or an artificial T cell receptor are administered. In one embodiment of all aspects of the invention, the cells and/or the sample of cells are from a mammal which is different to the mammal to which the cells genetically modified to express a T cell receptor or an artificial T cell receptor are administered.

In one embodiment of all aspects of the invention, the T cells genetically modified to express a T cell receptor or an artificial T cell receptor are inactivated for expression of an endogenous T cell receptor and/or endogenous HLA.

In one embodiment of all aspects of the invention, an antigen is expressed in a diseased cell such as a cancer cell. In one embodiment, an antigen is expressed on the surface of a diseased cell such as a cancer cell and/or is presented on the surface of a diseased cell such as a cancer cell in the context of MHC molecules.

In one embodiment, an artificial T cell receptor binds to an extracellular domain or to an epitope in an extracellular domain of an antigen. In one embodiment, an artificial T cell receptor binds to native epitopes of an antigen present on the surface of living cells.

In one embodiment, a T cell receptor binds to T cell epitopes presented in the context of MHC molecules.

In one embodiment of all aspects of the invention, the antigen is a tumor antigen. In one embodiment of all aspects of the invention, the antigen is selected from the group consisting of claudins, such as claudin 6 and claudin 18.2, CD19, CD20, CD22, CD33, CD123, mesothelin, CEA, c-Met, PSMA, GD-2, and NY-ESO-1. In one embodiment of all aspects of the invention, the antigen is a pathogen antigen. The pathogen may be a fungal, viral, or bacterial pathogen. In one embodiment of all aspects of the invention, expression of the antigen is at the cell surface. In one embodiment, binding of said artificial T cell receptor when expressed by T cells and/or present on T cells to an antigen present on cells or binding of said T cell receptor when expressed by T cells and/or present on T cells to T cell epitopes presented in the context of MHC molecules results in immune effector functions of said T cells such as the release of cytokines. In one embodiment, binding of said artificial T cell receptor when expressed by T cells and/or present on T cells to an antigen present on diseased cells such as cancer cells or binding of said T cell receptor when expressed by T cells and/or present on T cells to T cell epitopes presented in the context of MHC molecules on diseased cells such as cancer cells results in cytolysis and/or apoptosis of the diseased cells, wherein said T cells preferably release cytotoxic factors, e.g. perforins and granzymes.

In one embodiment of all aspects of the invention, the domains of an artificial T cell receptor forming antigen binding sites are comprised by an ectodomain of the artificial T cell receptor. In one embodiment of all aspects of the invention, an artificial T cell receptor comprises a transmembrane domain. In one embodiment, the transmembrane domain is a hydrophobic alpha helix that spans the membrane. In one embodiment of all aspects of the invention, an artificial T cell receptor comprises a signal peptide which directs the nascent protein into the endoplasmic reticulum. In one embodiment, the signal peptide precedes the domains forming antigen binding sites.

In one embodiment of all aspects of the invention, an artificial T cell receptor is preferably specific for the antigen to which it is targeted, in particular when present on the surface of a cell such as a diseased cell.

In one embodiment of all aspects of the invention, an artificial T cell receptor may be expressed by and/or present on the surface of an immunoreactive cell, such as a T cell, preferably a cytotoxic T cell. In one embodiment, the immunoreactive cell is reactive with the antigen to which the artificial T cell receptor is targeted.

In a further aspect, the invention provides the agents and compositions described herein for use in the methods described herein.

Other features and advantages of the instant invention will be apparent from the following detailed description and claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1: Parental viral genome and vectors cis- and trans-replicating RNA.

(A) General organization of alphaviral genomes. Two large open reading frames (ORF) are separated by a subgenomic promoter (SGP). The 5′ ORF encoded an enzyme complex for RNA amplification (replicase), the 3′ ORF encodes the viral structural genes (Capsid and envelope glycoproteins). At the 5′-end two conserved sequence elements (CSE) build up the 5′-replication recognition sequence (RRS) overlapping partially with the replicase coding region. The 3′ RRS is built by a CSE 4 (3′-terminal 19 nucleotides) and approximately 15 nucleotides of the poly-A tail (An). (B) Cis-replicon vectors keep the WT sequence of RRSs and SGP, they lack the ORF of the structural genes which is replaced by genes of interest (here: chimeric antigen receptors (CARs) or T cell receptors (TCRs). (C) Trans-replicon vector systems. The cis-replicon is split into an mRNA encoding the replicase but being unable to replicate, and short RNAs amplified in trans by the replicase. These so called trans-replicons have two different designs, one contains all viral RRSs in the WT sequence identical to the cis-replicon. The other version contains a shortened 5′CSE mutated to remove any AUG codon that could serve as translation start codon. The removal of 5′AUG ensures that translation starts exclusively with the start codon of the ORF of interest, which is inserted downstream of the mutated 5′CSE. Genes of interest in this invention are chimeric antigen receptors (CARs) and T cell receptors (TCR). Alpha and beta chains of the TCRs were inserted into separate replicating RNA vectors.

FIG. 2: IFNg release from CAR-transfected resting CD4+ cells is stimulated using replicative RNA

CD4+ T cells were electroporated with the indicated RNA species encoding a CAR reacting to human claudin-6. NTR and TR were cotransfected with mRNA encoding replicase as indicated, the CAR-coding RNA was adjusted to equimolarity based on 10 μg CAR coding mRNA. 24 h after electroporation the cells were harvested and stained with a CAR-specific antibody to monitor CAR expression. Furthermore a cocultivation of the transfected T cells with JY cells expressing or not human claudin-6 was started. 24 h later cell culture supernatants were harvested and the concentration of secreted IFNg was determined by ELISA. (A) Flow cytometric analysis of CAR-expression 24 h after electroporation. (B) Bar graph of histograms shown in A. (C) Concentration of IFNg after 24 h of cocultivation with h-claudin-6 positive and negative target cells. (D) IFN-g release from cells stimulated unspecifically using the staphylococcal enterotoxin B (SEB) superantigen and exposed to JY-hClaudin6 cells. (mRNA: non-replicative mRNA; TR: trans-replicon; RRS: replication recognition sequence; SGP: subgenomic promoter)

FIG. 3: IFN-g release from CAR-transfected CD8 T cells is stimulated and more sustained using replicative RNA

CD8+ T cells were isolated from fresh or frozen peripheral blood mononuclear cells of healthy donors using magnetic assisted cell sorting. Cells isolated from fresh cells were pre-stimulated with OKT3 and IL-2 for 48 h and expanded in the presence of IL-2 for 72 h, CD8+ cells from frozen PBMCs were used directly after MACS. Both T cell populations were electroporated with the indicated RNA species encoding a CAR reacting to human Claudin-6. NTR and TR were cotransfected with mRNA encoding replicase (+R), the CAR-coding RNA was adjusted to equimolarity based on 10 μg CAR coding mRNA. 24 h to 120 h after electroporation the cells were stained with a CAR-specific antibody to monitor CAR expression. Furthermore, a cocultivation of the transfected T cells with JY cells expressing or not human Claudin-6 was started at each time point of CAR expression analysis. 24 h later cell culture supernatants were harvested and IFN-g secretion was quantified by ELISA. (A) Flow cytometric analysis of CAR-expression in resting CD8 cells. The mean CAR expression level per cell (mean fluorescence intensity of the CAR specific staining, MFI). (B) Concentration of IFNg upon 24 h of cocultivation of the resting cells with target cells. (C, D) Same as A & B, but using OKT3/IL-2 pre-stimulated CD8 cells.

(mRNA: non-replicative mRNA; TR: trans-replicon; RRS: replication recognition sequence; SGP: subgenomic promoter; +R: replicase cotransfection).

FIG. 4. Improved neo-antigen-specific TCR-mediated recognition and IFN-g secretion in response to melanoma cells after replicative RNA transfer.

A) CD8+ T cells were transfected with different RNA formats and molar ratios of neoantigen (“Mut14”) specific TCR α/β RNAs+/− replicase RNAs, rested overnight and 3×10⁵T cells were cocultured with 5×10⁴MZ-GABA-018_PGK_hB2M_bln_C5_P9 melanoma cells. Specific IFNγ secretion was analyzed by IFNγ ELISPOT assay. B) TCR surface expression on CD8+ T cells was analyzed after staining with a fluorochrome-conjugated CD8-specific and Vβ-specific antibodies by flow cytometry. Cells were gated on single living cells.

FIG. 5. Improved tumor cell lysis mediated by autologous neo-antigen-specific TCRs after replicative RNA transfer.

Preactivated CD8+ T cells were transfected with 6.4 pmol of neoantigen specific TCR α/β RNA+/−12.8 pmol replicase RNA and cocultured 20 h later together with MZ-GaBa-18-β2m melanoma cell cell line at different E:T ratios. Specific lysis mediated by M05-TCR-transfected T cells (A) or M14-TCR-transfected T cells (B) was analyzed by luciferase-based cytotoxicity assay after 48 h coculture.

FIG. 6: Schematic representation of RNA replicons comprising an unmodified or a modified 5′ replication recognition sequence useful according to the invention

Abbreviations: AAAA=Poly(A) tail; ATG=start codon/initiation codon (ATG on DNA level; AUG on RNA level); 5× ATG=nucleic acid sequence comprising all start codons in the nucleic acid sequence encoding nsP1* (in the case of the nucleic acid sequence encoding nsP1* from Semliki Forest virus 5× ATG corresponds to five specific start codons, see Example 1); Δ5ATG=nucleic acid sequence corresponding to a nucleic acid sequence encoding nsP1*; however not comprising any start codons of the nucleic acid sequence that encodes nsP1* in alphavirus found in nature (in the case of nsP1* derived from Semliki Forest virus, “Δ5ATG” corresponds to the removal of five specific start codons compared to Semliki Forest virus found in nature, see Example 1); EcoRV=EcoRV restriction site; nsP=nucleic acid sequence encoding an alphavirus non-structural protein (e.g. nsP1, nsP2, nsP3, nsP4); nsP1*=nucleic acid sequence encoding a fragment of nsP1, wherein the fragment does not comprise the C-terminal fragment of nsP1; *nsP4=nucleic acid sequence encoding a fragment of nsP4, wherein the fragment does not comprise the N-terminal fragment of nsP4; RRS=5′ replication recognition sequence; Sall=Sal restriction site; SGP=subgenomic promoter; SL=stem loop (e.g. SL1, SL2, SL3, SL4); the positions of SL1-4 are graphically illustrated; UTR=untranslated region (e.g. 5′-UTR, 3′-UTR); WT=wild type; TransgenePreferably relates to an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor.

cisReplicon WT-RRS: RNA replicon essentially corresponding to the genome of an alphavirus, except that the nucleic acid sequence encoding alphavirus structural proteins has been replaced by an open reading frame encoding a gene of interest (“Transgene”). When “Replicon WT-RRS” is introduced into a cell, the translation product of the open reading frame encoding replicase (nsP1234 or fragment(s) thereof) can drive replication of the RNA replicon in cis and drive synthesis of a nucleic acid sequence (the subgenomic transcript) downstream of the subgenomic promoter (SGP).

Trans-replicon or template RNA WT-RRS: RNA replicon essentially corresponding to “Replicon WT-RRS”, except that most of the nucleic acid sequence encoding alphavirus non-structural proteins nsP1-4 has been removed. More specifically, the nucleic acid sequence encoding nsP2 and nsP3 has been removed completely; the nucleic acid sequence encoding nsP1 has been truncated so that the “Template RNA WT-RRS” encodes a fragment of nsP1, which fragment does not comprise the C-terminal fragment of nsP1 (but it comprises the N-terminal fragment of nsP1; nsP1*); the nucleic acid sequence encoding nsP4 has been truncated so that the “Template RNA WT-RRS” encodes a fragment of nsP4, which fragment does not comprise the N-terminal fragment of nsP4 (but it comprises the C-terminal fragment of nsP4; *nsP4). This truncated nsP4 sequence overlaps partially with the fully active subgenomic promoter. The nucleic acid sequence encoding nsP1* comprises all initiation codons of the nucleic acid sequence that encodes nsP1* in alphavirus found in nature (in the case of nsP1* from Semliki Forest virus, five specific initiation codons).

Δ5ATG-RRS: RNA replicon essentially corresponding to “Template RNA WT-RRS”, except that it does not comprise any initiation codons of the nucleic acid sequence that encodes nsP1* in alphavirus found in nature (in the case of Semliki Forest virus, “Δ5ATG-RRS” corresponds to the removal of five specific initiation codons compared to Semliki Forest virus found in nature). All nucleotide changes introduced to remove start codons were compensated by additional nucleotide changes to conserve the predicted secondary structure of the RNA.

Δ5ATG-RRSΔSGP: RNA replicon essentially corresponding to “Δ5ATG-RRS”, except that it does not comprise the subgenomic promoter (SGP) and does not comprise the nucleic acid sequence that encodes *nsP4. “Transgene 1”=a gene of interest.

Δ5ATG-RRS—bicistronic: RNA replicon essentially corresponding to “Δ5ATG-RRS”, except that it comprises a first open reading frame encoding a first gene of interest (“Transgene 1”) upstream of the subgenomic promoter, and a second open reading frame encoding a second gene of interest (“Transgene 2”) downstream of the subgenomic promoter. The localization of the second open reading frame corresponds to the localization of the gene of interest (“Transgene”) in the RNA replicon “Δ5ATG-RRS”.

cisReplicon Δ5ATG-RRS: RNA replicon essentially corresponding to “Δ5ATG-RRS—bicistronic”, except that the open reading frame encoding a first gene of interest encodes functional alphavirus non-structural protein (typically one open reading frame encoding the poly-protein nsP1-nsP2-nsP3-nsP4, i.e. nsP1234). “Transgene” in “cisReplicon Δ5ATG-RRS” corresponds to “Transgene 2” in “Δ5ATG-RRS—bicistronic”. The functional alphavirus non-structural protein is capable of recognizing the subgenomic promoter and of synthesizing subgenomic transcripts comprising the nucleic acid sequence encoding the gene of interest (“Transgene”). “cisReplicon Δ5ATG-RRS” encodes a functional alphavirus non-structural protein in cis as does “cisReplicon WT-RRS”; however, it is not required that the coding sequence for nsP1 encoded by “cisReplicon Δ5ATG-RRS” comprises the exact nucleic acid sequence of “cisReplicon WT-RRS” including all stem loops.

FIG. 7. Structures of cap dinucleotides. Top: a natural cap dinucleotide, m⁷GpppG. Bottom: Phosphorothioate cap analog beta-S-ARCA dinucleotide: There are two diastereomers of beta-S-ARCA due to the stereogenic P center, which are designated D1 and D2 according to their elution characteristics in reverse phase HPLC.

DETAILED DESCRIPTION OF THE INVENTION

Although the present invention is described in detail below, it is to be understood that this invention is not limited to the particular methodologies, protocols and reagents described herein as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art.

Preferably, the terms used herein are defined as described in “A multilingual glossary of biotechnological terms: (IUPAC Recommendations)”, H.G.W. Leuenberger, B. Nagel, and H. Kölbl, Eds., Helvetica Chimica Acta, CH-4010 Basel, Switzerland, (1995).

The practice of the present invention will employ, unless otherwise indicated, conventional methods of chemistry, biochemistry, cell biology, immunology, and recombinant DNA techniques which are explained in the literature in the field (cf., e.g., Molecular Cloning: A Laboratory Manual, 2nd Edition, J. Sambrook et al. eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor 1989).

In the following, the elements of the present invention will be described. These elements are listed with specific embodiments, however, it should be understood that they may be combined in any manner and in any number to create additional embodiments. The variously described examples and preferred embodiments should not be construed to limit the present invention to only the explicitly described embodiments. This description should be understood to disclose and encompass embodiments which combine the explicitly described embodiments with any number of the disclosed and/or preferred elements. Furthermore, any permutations and combinations of all described elements in this application should be considered disclosed by this description unless the context indicates otherwise.

The term “about” means approximately or nearly, and in the context of a numerical value or range set forth herein preferably means+/−10% of the numerical value or range recited or claimed.

The terms “a” and “an” and “the” and similar reference used in the context of describing the invention (especially in the context of the claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. Recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range. Unless otherwise indicated herein, each individual value is incorporated into the specification as if it was individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., “such as”), provided herein is intended merely to better illustrate the invention and does not pose a limitation on the scope of the invention otherwise claimed. No language in the specification should be construed as indicating any non-claimed element essential to the practice of the invention.

Unless expressly specified otherwise, the term “comprising” is used in the context of the present document to indicate that further members may optionally be present in addition to the members of the list introduced by “comprising”. It is, however, contemplated as a specific embodiment of the present invention that the term “comprising” encompasses the possibility of no further members being present, i.e. for the purpose of this embodiment “comprising” is to be understood as having the meaning of “consisting of”.

Indications of relative amounts of a component characterized by a generic term are meant to refer to the total amount of all specific variants or members covered by said generic term. If a certain component defined by a generic term is specified to be present in a certain relative amount, and if this component is further characterized to be a specific variant or member covered by the generic term, it is meant that no other variants or members covered by the generic term are additionally present such that the total relative amount of components covered by the generic term exceeds the specified relative amount; more preferably no other variants or members covered by the generic term are present at all.

Several documents are cited throughout the text of this specification. Each of the documents cited herein (including all patents, patent applications, scientific publications, manufacturer's specifications, instructions, etc.), whether supra or infra, are hereby incorporated by reference in their entirety. Nothing herein is to be construed as an admission that the present invention was not entitled to antedate such disclosure.

Terms such as “reduce” or “inhibit” as used herein means the ability to cause an overall decrease, preferably of 5% or greater, 10% or greater, 20% or greater, more preferably of 50% or greater, and most preferably 75% or greater, in the level. The term “inhibit” or similar phrases includes a complete or essentially complete inhibition, i.e. a reduction to zero or essentially to zero.

Terms such as “increase” or “enhance” preferably relate to an increase or enhancement by about at least 10%, preferably at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 80%, and most preferably at least 100%.

The term “net charge” refers to the charge on a whole object, such as a compound or particle.

An ion having an overall net positive charge is a cation, while an ion having an overall net negative charge is an anion. Thus, according to the invention, an anion is an ion with more electrons than protons, giving it a net negative charge; and a cation is an ion with fewer electrons than protons, giving it a net positive charge.

Terms as “charged”, “net charge”, “negatively charged” or “positively charged”, with reference to a given compound or particle, refer to the electric net charge of the given compound or particle when dissolved or suspended in water at pH 7.0.

According to the invention, a nucleic acid is a deoxyribonucleic acid (DNA) or a ribonucleic acid (RNA). In general, a nucleic acid molecule or a nucleic acid sequence refers to a nucleic acid which is preferably deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). According to the invention, nucleic acids comprise genomic DNA, cDNA, mRNA, viral RNA, recombinantly prepared and chemically synthesized molecules. According to the invention, a nucleic acid may be in the form of a single-stranded or double-stranded and linear or covalently closed circular molecule. The term “nucleic acid” according to the invention also comprises a chemical derivatization of a nucleic acid on a nucleotide base, on the sugar or on the phosphate, and nucleic acids containing non-natural nucleotides and nucleotide analogs.

According to the invention “nucleic acid sequence” refers to the sequence of nucleotides in a nucleic acid, e.g. a ribonucleic acid (RNA) or a deoxyribonucleic acid (DNA). The term may refer to an entire nucleic acid molecule (such as to the single strand of an entire nucleic acid molecule) or to a part (e.g. a fragment) thereof.

According to the present invention, the term “RNA” or “RNA molecule” relates to a molecule which comprises ribonucleotide residues and which is preferably entirely or substantially composed of ribonucleotide residues. The term “ribonucleotide” relates to a nucleotide with a hydroxyl group at the 2′-position of a β-D-ribofuranosyl group. The term “RNA” comprises double-stranded RNA, single stranded RNA, isolated RNA such as partially or completely purified RNA, essentially pure RNA, synthetic RNA, and recombinantly generated RNA such as modified RNA which differs from naturally occurring RNA by addition, deletion, substitution and/or alteration of one or more nucleotides. Such alterations can include addition of non-nucleotide material, such as to the end(s) of a RNA or internally, for example at one or more nucleotides of the RNA. Nucleotides in RNA molecules can also comprise non-standard nucleotides, such as non-naturally occurring nucleotides or chemically synthesized nucleotides or deoxynucleotides. These altered RNAs can be referred to as analogs, particularly analogs of naturally occurring RNAs.

According to the invention, RNA may be single-stranded or double-stranded. In some embodiments of the present invention, single-stranded RNA is preferred. The term “single-stranded RNA” generally refers to an RNA molecule to which no complementary nucleic acid molecule (typically no complementary RNA molecule) is associated. Single-stranded RNA may contain self-complementary sequences that allow parts of the RNA to fold back and to form secondary structure motifs including without limitation base pairs, stems, stem loops and bulges. Single-stranded RNA can exist as minus strand [(−) strand] or as plus strand [(+) strand]. The (+) strand is the strand that comprises or encodes genetic information. The genetic information may be for example a polynucleotide sequence encoding a protein. When the (+) strand RNA encodes a protein, the (+) strand may serve directly as template for translation (protein synthesis). The (−) strand is the complement of the (+) strand. In the case of double-stranded RNA, (+) strand and (−) strand are two separate RNA molecules, and both these RNA molecules associate with each other to form a double-stranded RNA (“duplex RNA”).

The term “stability” of RNA relates to the “half-life” of RNA. “Half-life” relates to the period of time which is needed to eliminate half of the activity, amount, or number of molecules. In the context of the present invention, the half-life of an RNA is indicative for the stability of said RNA. The half-life of RNA may influence the “duration of expression” of the RNA. It can be expected that RNA having a long half-life will be expressed for an extended time period.

The term “translation efficiency” relates to the amount of translation product provided by an RNA molecule within a particular period of time.

“Fragment”, with reference to a nucleic acid sequence, relates to a part of a nucleic acid sequence, i.e. a sequence which represents the nucleic acid sequence shortened at the 5′- and/or 3′-end(s). Preferably, a fragment of a nucleic acid sequence comprises at least 80%, preferably at least 90%, 95%, 96%, 97%, 98%, or 99% of the nucleotide residues from said nucleic acid sequence. In the present invention those fragments of RNA molecules are preferred which retain RNA stability and/or translational efficiency.

“Fragment”, with reference to an amino acid sequence (peptide or protein), relates to a part of an amino acid sequence, i.e. a sequence which represents the amino acid sequence shortened at the N-terminus and/or C-terminus. A fragment shortened at the C-terminus (N-terminal fragment) is obtainable e.g. by translation of a truncated open reading frame that lacks the 3′-end of the open reading frame. A fragment shortened at the N-terminus (C-terminal fragment) is obtainable e.g. by translation of a truncated open reading frame that lacks the 5′-end of the open reading frame, as long as the truncated open reading frame comprises a start codon that serves to initiate translation. A fragment of an amino acid sequence comprises e.g. at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 10%, at least 20%, at least 30% at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% of the amino acid residues from an amino acid sequence.

The term “variant” with respect to, for example, nucleic acid and amino acid sequences, according to the invention includes any variants, in particular mutants, viral strain variants, splice variants, conformations, isoforms, allelic variants, species variants and species homologs, in particular those which are naturally present. An allelic variant relates to an alteration in the normal sequence of a gene, the significance of which is often unclear. Complete gene sequencing often identifies numerous allelic variants for a given gene. With respect to nucleic acid molecules, the term “variant” includes degenerate nucleic acid sequences, wherein a degenerate nucleic acid according to the invention is a nucleic acid that differs from a reference nucleic acid in codon sequence due to the degeneracy of the genetic code. A species homolog is a nucleic acid or amino acid sequence with a different species of origin from that of a given nucleic acid or amino acid sequence. A virus homolog is a nucleic acid or amino acid sequence with a different virus of origin from that of a given nucleic acid or amino acid sequence.

According to the invention, nucleic acid variants include single or multiple nucleotide deletions, additions, mutations, substitutions and/or insertions in comparison with the reference nucleic acid. Deletions include removal of one or more nucleotides from the reference nucleic acid. Addition variants comprise 5′- and/or 3′-terminal fusions of one or more nucleotides, such as 1, 2, 3, 5, 10, 20, 30, 50, or more nucleotides. In the case of substitutions, at least one nucleotide in the sequence is removed and at least one other nucleotide is inserted in its place (such as transversions and transitions). Mutations include abasic sites, crosslinked sites, and chemically altered or modified bases. Insertions include the addition of at least one nucleotide into the reference nucleic acid.

According to the invention, “nucleotide change” can refer to single or multiple nucleotide deletions, additions, mutations, substitutions and/or insertions in comparison with the reference nucleic acid. In some embodiments, a “nucleotide change” is selected from the group consisting of a deletion of a single nucleotide, the addition of a single nucleotide, the mutation of a single nucleotide, the substitution of a single nucleotide and/or the insertion of a single nucleotide, in comparison with the reference nucleic acid. According to the invention, a nucleic acid variant can comprise one or more nucleotide changes in comparison with the reference nucleic acid.

Variants of specific nucleic acid sequences preferably have at least one functional property of said specific sequences and preferably are functionally equivalent to said specific sequences, e.g. nucleic acid sequences exhibiting properties identical or similar to those of the specific nucleic acid sequences.

As described below, some embodiments of the present invention are characterized inter alia by nucleic acid sequences that are homologous to nucleic acid sequences of an alphavirus, such as an alphavirus found in nature. These homologous sequences are variants of nucleic acid sequences of an alphavirus, such as an alphavirus found in nature.

Preferably the degree of identity between a given nucleic acid sequence and a nucleic acid sequence which is a variant of said given nucleic acid sequence will be at least 70%, preferably at least 75%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90% or most preferably at least 95%, 96%, 97%, 98% or 99%. The degree of identity is preferably given for a region of at least about 30, at least about 50, at least about 70, at least about 90, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, or at least about 400 nucleotides. In preferred embodiments, the degree of identity is given for the entire length of the reference nucleic acid sequence.

“Sequence similarity” indicates the percentage of amino acids that either are identical or that represent conservative amino acid substitutions. “Sequence identity” between two polypeptide or nucleic acid sequences indicates the percentage of amino acids or nucleotides that are identical between the sequences.

The term “% identical” is intended to refer, in particular, to a percentage of nucleotides which are identical in an optimal alignment between two sequences to be compared, with said percentage being purely statistical, and the differences between the two sequences may be randomly distributed over the entire length of the sequence and the sequence to be compared may comprise additions or deletions in comparison with the reference sequence, in order to obtain optimal alignment between two sequences. Comparisons of two sequences are usually carried out by comparing said sequences, after optimal alignment, with respect to a segment or “window of comparison”, in order to identify local regions of corresponding sequences. The optimal alignment for a comparison may be carried out manually or with the aid of the local homology algorithm by Smith and Waterman, 1981, Ads App. Math. 2, 482, with the aid of the local homology algorithm by Needleman and Wunsch, 1970, J. Mol. Biol. 48, 443, and with the aid of the similarity search algorithm by Pearson and Lipman, 1988, Proc. Natl Acad. Sci. USA 85, 2444 or with the aid of computer programs using said algorithms (GAP, BESTFIT, FASTA, BLAST P, BLAST N and TFASTA in Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.).

Percentage identity is obtained by determining the number of identical positions in which the sequences to be compared correspond, dividing this number by the number of positions compared and multiplying this result by 100.

For example, the BLAST program “BLAST 2 sequences” which is available on the website http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi may be used.

A nucleic acid is “capable of hybridizing” or “hybridizes” to another nucleic acid if the two sequences are complementary with one another. A nucleic acid is “complementary” to another nucleic acid if the two sequences are capable of forming a stable duplex with one another. According to the invention, hybridization is preferably carried out under conditions which allow specific hybridization between polynucleotides (stringent conditions). Stringent conditions are described, for example, in Molecular Cloning: A Laboratory Manual, J. Sambrook et al., Editors, 2nd Edition, Cold Spring Harbor Laboratory press, Cold Spring Harbor, New York, 1989 or Current Protocols in Molecular Biology, F.M. Ausubel et al., Editors, John Wiley & Sons, Inc., New York and refer, for example, to hybridization at 65° C. in hybridization buffer (3.5× SSC, 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin, 2.5 mM NaH₂PO₄(pH 7), 0.5% SDS, 2 mM EDTA). SSC is 0.15 M sodium chloride/0.15 M sodium citrate, pH 7. After hybridization, the membrane to which the DNA has been transferred is washed, for example, in 2×SSC at room temperature and then in 0.1-0.5×SSC/0.1×SDS at temperatures of up to 68° C.

A percent complementarity indicates the percentage of contiguous residues in a nucleic acid molecule that can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” or “fully complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. Preferably, the degree of complementarity according to the invention is at least 70%, preferably at least 75%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90% or most preferably at least 95%, 96%, 97%, 98% or 99%. Most preferably, the degree of complementarity according to the invention is 100%.

The term “derivative” comprises any chemical derivatization of a nucleic acid on a nucleotide base, on the sugar or on the phosphate. The term “derivative” also comprises nucleic acids which contain nucleotides and nucleotide analogs not occurring naturally. Preferably, a derivatization of a nucleic acid increases its stability.

According to the invention, a “nucleic acid sequence which is derived from a nucleic acid sequence” refers to a nucleic acid which is a variant of the nucleic acid from which it is derived. Preferably, a sequence which is a variant with respect to a specific sequence, when it replaces the specific sequence in an RNA molecule retains RNA stability and/or translational efficiency.

“nt” is an abbreviation for nucleotide; or for nucleotides, preferably consecutive nucleotides in a nucleic acid molecule.

According to the invention, the term “codon” refers to a base triplet in a coding nucleic acid that specifies which amino acid will be added next during protein synthesis at the ribosome.

The terms “transcription” and “transcribing” relate to a process during which a nucleic acid molecule with a particular nucleic acid sequence (the “nucleic acid template”) is read by an RNA polymerase so that the RNA polymerase produces a single-stranded RNA molecule. During transcription, the genetic information in a nucleic acid template is transcribed. The nucleic acid template may be DNA; however, e.g. in the case of transcription from an alphaviral nucleic acid template, the template is typically RNA. Subsequently, the transcribed RNA may be translated into protein. According to the present invention, the term “transcription” comprises “in vitro transcription”, wherein the term “in vitro transcription” relates to a process wherein RNA, in particular mRNA, is in vitro synthesized in a cell-free system. Preferably, cloning vectors are applied for the generation of transcripts. These cloning vectors are generally designated as transcription vectors and are according to the present invention encompassed by the term “vector”. The cloning vectors are preferably plasmids. According to the present invention, RNA preferably is in vitro transcribed RNA (IVT-RNA) and may be obtained by in vitro transcription of an appropriate DNA template. The promoter for controlling transcription can be any promoter for any RNA polymerase. A DNA template for in vitro transcription may be obtained by cloning of a nucleic acid, in particular cDNA, and introducing it into an appropriate vector for in vitro transcription. The cDNA may be obtained by reverse transcription of RNA.

The single-stranded nucleic acid molecule produced during transcription typically has a nucleic acid sequence that is the complementary sequence of the template.

According to the invention, the terms “template” or “nucleic acid template” or “template nucleic acid” generally refer to a nucleic acid sequence that may be replicated or transcribed.

“Nucleic acid sequence transcribed from a nucleic acid sequence” and similar terms refer to a nucleic acid sequence, where appropriate as part of a complete RNA molecule, which is a transcription product of a template nucleic acid sequence. Typically, the transcribed nucleic acid sequence is a single-stranded RNA molecule.

“3′ end of a nucleic acid” refers according to the invention to that end which has a free hydroxy group. In a diagrammatic representation of double-stranded nucleic acids, in particular DNA, the 3′ end is always on the right-hand side. “5′ end of a nucleic acid” refers according to the invention to that end which has a free phosphate group. In a diagrammatic representation of double-strand nucleic acids, in particular DNA, the 5′ end is always on the left-hand side.

5′ end 5′--P-NNNNNNN-OH-3′ 3′ end

3′-HO-NNNNNNN-P--5′

“Upstream” describes the relative positioning of a first element of a nucleic acid molecule with respect to a second element of that nucleic acid molecule, wherein both elements are comprised in the same nucleic acid molecule, and wherein the first element is located nearer to the 5′ end of the nucleic acid molecule than the second element of that nucleic acid molecule. The second element is then said to be “downstream” of the first element of that nucleic acid molecule. An element that is located “upstream” of a second element can be synonymously referred to as being located “5” of that second element. For a double-stranded nucleic acid molecule, indications like “upstream” and “downstream” are given with respect to the (+) strand. According to the invention, “functional linkage” or “functionally linked” relates to a connection within a functional relationship. A nucleic acid is “functionally linked” if it is functionally related to another nucleic acid sequence. For example, a promoter is functionally linked to a coding sequence if it influences transcription of said coding sequence. Functionally linked nucleic acids are typically adjacent to one another, where appropriate separated by further nucleic acid sequences, and, in particular embodiments, are transcribed by RNA polymerase to give a single RNA molecule (common transcript).

In particular embodiments, a nucleic acid is functionally linked according to the invention to expression control sequences which may be homologous or heterologous with respect to the nucleic acid.

The term “expression control sequence” comprises according to the invention promoters, ribosome-binding sequences and other control elements which control transcription of a gene or translation of the derived RNA. In particular embodiments of the invention, the expression control sequences can be regulated. The precise structure of expression control sequences may vary depending on the species or cell type but usually includes 5′-untranscribed and 5′- and 3′-untranslated sequences involved in initiating transcription and translation, respectively. More specifically, 5′-untranscribed expression control sequences include a promoter region which encompasses a promoter sequence for transcription control of the functionally linked gene. Expression control sequences may also include enhancer sequences or upstream activator sequences. An expression control sequence of a DNA molecule usually includes 5′-untranscribed and 5′- and 3′-untranslated sequences such as TATA box, capping sequence, CAAT sequence and the like. An expression control sequence of alphaviral RNA may include a subgenomic promoter and/or one or more conserved sequence element(s). A specific expression control sequence according to the present invention is a subgenomic promoter of an alphavirus, as described herein.

The nucleic acid sequences specified herein, in particular transcribable and coding nucleic acid sequences, may be combined with any expression control sequences, in particular promoters, which may be homologous or heterologous to said nucleic acid sequences, with the term “homologous” referring to the fact that a nucleic acid sequence is also functionally linked naturally to the expression control sequence, and the term “heterologous” referring to the fact that a nucleic acid sequence is not naturally functionally linked to the expression control sequence.

A transcribable nucleic acid sequence, in particular a nucleic acid sequence coding for a peptide or protein, and an expression control sequence are “functionally” linked to one another, if they are covalently linked to one another in such a way that transcription or expression of the transcribable and in particular coding nucleic acid sequence is under the control or under the influence of the expression control sequence. If the nucleic acid sequence is to be translated into a functional peptide or protein, induction of an expression control sequence functionally linked to the coding sequence results in transcription of said coding sequence, without causing a frame shift in the coding sequence or the coding sequence being unable to be translated into the desired peptide or protein.

The term “promoter” or “promoter region” refers to a nucleic acid sequence which controls synthesis of a transcript, e.g. a transcript comprising a coding sequence, by providing a recognition and binding site for RNA polymerase. The promoter region may include further recognition or binding sites for further factors involved in regulating transcription of said gene. A promoter may control transcription of a prokaryotic or eukaryotic gene. A promoter may be “inducible” and initiate transcription in response to an inducer, or may be “constitutive” if transcription is not controlled by an inducer. An inducible promoter is expressed only to a very small extent or not at all, if an inducer is absent. In the presence of the inducer, the gene is “switched on” or the level of transcription is increased. This is usually mediated by binding of a specific transcription factor. A specific promoter according to the present invention is a subgenomic promoter of an alphavirus, as described herein. Other specific promoters are genomic plus-strand or negative-strand promoters of an alphavirus.

The term “core promoter” refers to a nucleic acid sequence that is comprised by the promoter. The core promoter is typically the minimal portion of the promoter required to properly initiate transcription. The core promoter typically includes the transcription start site and a binding site for RNA polymerase.

A “polymerase” generally refers to a molecular entity capable of catalyzing the synthesis of a polymeric molecule from monomeric building blocks. A “RNA polymerase” is a molecular entity capable of catalyzing the synthesis of a RNA molecule from ribonucleotide building blocks. A “DNA polymerase” is a molecular entity capable of catalyzing the synthesis of a DNA molecule from deoxy ribonucleotide building blocks. For the case of DNA polymerases and RNA polymerases, the molecular entity is typically a protein or an assembly or complex of multiple proteins. Typically, a DNA polymerase synthesizes a DNA molecule based on a template nucleic acid, which is typically a DNA molecule. Typically, a RNA polymerase synthesizes a RNA molecule based on a template nucleic acid, which is either a DNA molecule (in that case the RNA polymerase is a DNA-dependent RNA polymerase, DdRP), or is a RNA molecule (in that case the RNA polymerase is a RNA-dependent RNA polymerase, RdRP).

A “RNA-dependent RNA polymerase” or “RdRP”, is an enzyme that catalyzes the transcription of RNA from an RNA template. In the case of alphaviral RNA-dependent RNA polymerase, sequential synthesis of (−) strand complement of genomic RNA and of (+) strand genomic RNA leads to RNA replication. Alphaviral RNA-dependent RNA polymerase is thus synonymously referred to as “RNA replicase”. In nature, RNA-dependent RNA polymerases are typically encoded by all RNA viruses except retroviruses. Typical representatives of viruses encoding a RNA-dependent RNA polymerase are alphaviruses.

According to the present invention, “RNA replication” generally refers to an RNA molecule synthesized based on the nucleotide sequence of a given RNA molecule (template RNA molecule). The RNA molecule that is synthesized may be e.g. identical or complementary to the template RNA molecule. In general, RNA replication may occur via synthesis of a DNA intermediate, or may occur directly by RNA-dependent RNA replication mediated by a RNA-dependent RNA polymerase (RdRP). In the case of alphaviruses, RNA replication does not occur via a DNA intermediate, but is mediated by a RNA-dependent RNA polymerase (RdRP): a template RNA strand (first RNA strand)—or a part thereof—serves as template for the synthesis of a second RNA strand that is complementary to the first RNA strand or to a part thereof. The second RNA strand—or a part thereof—may in turn optionally serve as a template for synthesis of a third RNA strand that is complementary to the second RNA strand or to a part thereof. Thereby, the third RNA strand is identical to the first RNA strand or to a part thereof. Thus, RNA-dependent RNA polymerase is capable of directly synthesizing a complementary RNA strand of a template, and of indirectly synthesizing an identical RNA strand (via a complementary intermediate strand).

According to the invention, the term “template RNA” refers to RNA that can be transcribed or replicated by an RNA-dependent RNA polymerase.

According to the invention, the term “gene” refers to a particular nucleic acid sequence which is responsible for producing one or more cellular products and/or for achieving one or more intercellular or intracellular functions. More specifically, said term relates to a nucleic acid section (typically DNA; but RNA in the case of RNA viruses) which comprises a nucleic acid coding for a specific protein or a functional or structural RNA molecule.

An “isolated molecule” as used herein, is intended to refer to a molecule which is substantially free of other molecules such as other cellular material. The term “isolated nucleic acid” means according to the invention that the nucleic acid has been (i) amplified in vitro, for example by polymerase chain reaction (PCR), (ii) recombinantly produced by cloning, (iii) purified, for example by cleavage and gel-electrophoretic fractionation, or (iv) synthesized, for example by chemical synthesis. An isolated nucleic acid is a nucleic acid available to manipulation by recombinant techniques.

The term “vector” is used here in its most general meaning and comprises any intermediate vehicles for a nucleic acid which, for example, enable said nucleic acid to be introduced into prokaryotic and/or eukaryotic host cells and, where appropriate, to be integrated into a genome. Such vectors are preferably replicated and/or expressed in the cell. Vectors comprise plasmids, phagemids, virus genomes, and fractions thereof.

The term “recombinant” in the context of the present invention means “made through genetic engineering”. Preferably, a “recombinant object” such as a recombinant cell in the context of the present invention is not occurring naturally.

The term “naturally occurring” as used herein refers to the fact that an object can be found in nature. For example, a peptide or nucleic acid that is present in an organism (including viruses) and can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally occurring. The term “found in nature” means “present in nature” and includes known objects as well as objects that have not yet been discovered and/or isolated from nature, but that may be discovered and/or isolated in the future from a natural source.

According to the invention, the term “expression” is used in its most general meaning and comprises production of RNA, or of RNA and protein. It also comprises partial expression of nucleic acids. Furthermore, expression may be transient or stable. With respect to RNA, the term “expression” or “translation” relates to the process in the ribosomes of a cell by which a strand of coding RNA (e.g. messenger RNA) directs the assembly of a sequence of amino acids to make a peptide or protein.

According to the invention, the term “mRNA” means “messenger-RNA” and relates to a transcript which is typically generated by using a DNA template and encodes a peptide or protein. Typically, mRNA comprises a 5′-UTR, a protein coding region, a 3′-UTR, and a poly(A) sequence. mRNA may be generated by in vitro transcription from a DNA template. The in vitro transcription methodology is known to the skilled person. For example, there is a variety of in vitro transcription kits commercially available. According to the invention, mRNA may be modified by stabilizing modifications and capping.

According to the invention, the terms “poly(A) sequence” or “poly(A) tail” refer to an uninterrupted or interrupted sequence of adenylate residues which is typically located at the 3′ end of an RNA molecule. An uninterrupted sequence is characterized by consecutive adenylate residues. In nature, an uninterrupted poly(A) sequence is typical. While a poly(A) sequence is normally not encoded in eukaryotic DNA, but is attached during eukaryotic transcription in the cell nucleus to the free 3′ end of the RNA by a template-independent RNA polymerase after transcription, the present invention encompasses poly(A) sequences encoded by DNA.

According to the invention, the term “primary structure”, with reference to a nucleic acid molecule, refers to the linear sequence of nucleotide monomers.

According to the invention, the term “secondary structure”, with reference to a nucleic acid molecule, refers to a two-dimensional representation of a nucleic acid molecule that reflects base pairings; e.g. in the case of a single-stranded RNA molecule particularly intramolecular base pairings. Although each RNA molecule has only a single polynucleotide chain, the molecule is typically characterized by regions of (intramolecular) base pairs. According to the invention, the term “secondary structure” comprises structural motifs including without limitation base pairs, stems, stem loops, bulges, loops such as interior loops and multi-branch loops. The secondary structure of a nucleic acid molecule can be represented by a two-dimensional drawing (planar graph), showing base pairings (for further details on secondary structure of RNA molecules, see Auber et al., J. Graph Algorithms Appl., 2006, vol. 10, pp. 329-351). As described herein, the secondary structure of certain RNA molecules is relevant in the context of the present invention.

According to the invention, secondary structure of a nucleic acid molecule, particularly of a single-stranded RNA molecule, is determined by prediction using the web server for RNA secondary structure prediction (http://rna.urmc.rochester.edu/RNAstructureWeb/Servers/Predict1/Predict1.html).

Preferably, according to the invention, “secondary structure”, with reference to a nucleic acid molecule, specifically refers to the secondary structure determined by said prediction. The prediction may also be performed or confirmed using MFOLD structure prediction (http://unafold.rna.albany.edu/?q=mfold).

According to the invention, a “base pair” is a structural motif of a secondary structure wherein two nucleotide bases associate with each other through hydrogen bonds between donor and acceptor sites on the bases. The complementary bases, A:U and G:C, form stable base pairs through hydrogen bonds between donor and acceptor sites on the bases; the A:U and G:C base pairs are called Watson-Crick base pairs. A weaker base pair (called Wobble base pair) is formed by the bases G and U (G:U).

The base pairs A:U and G:C are called canonical base pairs. Other base pairs like G:U (which occurs fairly often in RNA) and other rare base-pairs (e.g. A:C; U:U) are called non-canonical base pairs.

According to the invention, “nucleotide pairing” refers to two nucleotides that associate with each other so that their bases form a base pair (canonical or non-canonical base pair, preferably canonical base pair, most preferably Watson-Crick base pair).

According to the invention, the terms “stem loop” or “hairpin” or “hairpin loop”, with reference to a nucleic acid molecule, all interchangeably refer to a particular secondary structure of a nucleic acid molecule, typically a single-stranded nucleic acid molecule, such as single-stranded RNA. The particular secondary structure represented by the stem loop consists of a consecutive nucleic acid sequence comprising a stem and a (terminal) loop, also called hairpin loop, wherein the stem is formed by two neighbored entirely or partially complementary sequence elements which are separated by a short sequence (e.g. 3-10 nucleotides), which forms the loop of the stem-loop structure. The two neighbored entirely or partially complementary sequences may be defined as e.g. stem loop elements stem 1 and stem 2. The stem loop is formed when these two neighbored entirely or partially reverse complementary sequences, e.g. stem loop elements stem 1 and stem 2, form base-pairs with each other, leading to a double stranded nucleic acid sequence comprising an unpaired loop at its terminal ending formed by the short sequence located between stem loop elements stem 1 and stem 2. Thus, a stem loop comprises two stems (stem 1 and stem 2), which—at the level of secondary structure of the nucleic acid molecule—form base pairs with each other, and which—at the level of the primary structure of the nucleic acid molecule—are separated by a short sequence that is not part of stem 1 or stem 2. For illustration, a two-dimensional representation of the stem loop resembles a lollipop-shaped structure. The formation of a stem-loop structure requires the presence of a sequence that can fold back on itself to form a paired double strand; the paired double strand is formed by stem 1 and stem 2. The stability of paired stem loop elements is typically determined by the length, the number of nucleotides of stem 1 that are capable of forming base pairs (preferably canonical base pairs, more preferably Watson-Crick base pairs) with nucleotides of stem 2, versus the number of nucleotides of stem 1 that are not capable of forming such base pairs with nucleotides of stem 2 (mismatches or bulges). According to the present invention, the optimal loop length is 3-10 nucleotides, more preferably 4 to 7 nucleotides, such as 4 nucleotides, 5 nucleotides, 6 nucleotides or 7 nucleotides. If a given nucleic acid sequence is characterized by a stem loop, the respective complementary nucleic acid sequence is typically also characterized by a stem loop. A stem loop is typically formed by single-stranded RNA molecules. For example, several stem loops are present in the 5′ replication recognition sequence of alphaviral genomic RNA (illustrated in FIG. 6).

According to the invention, “disruption” or “disrupt”, with reference to a specific secondary structure of a nucleic acid molecule (e.g. a stem loop) means that the specific secondary structure is absent or altered. Typically, a secondary structure may be disrupted as a consequence of a change of at least one nucleotide that is part of the secondary structure. For example, a stem loop may be disrupted by change of one or more nucleotides that form the stem, so that nucleotide pairing is not possible.

According to the invention, “compensates for secondary structure disruption” or “compensating for secondary structure disruption” refers to one or more nucleotide changes in a nucleic acid sequence; more typically it refers to one or more second nucleotide changes in a nucleic acid sequence, which nucleic acid sequence also comprises one or more first nucleotide changes, characterized as follows: while the one or more first nucleotide changes, in the absence of the one or more second nucleotide changes, cause a disruption of the secondary structure of the nucleic acid sequence, the co-occurrence of the one or more first nucleotide changes and the one or more second nucleotide changes does not cause the secondary structure of the nucleic acid to be disrupted. Co-occurrence means presence of both the one or more first nucleotide changes and of the one or more second nucleotide changes. Typically, the one or more first nucleotide changes and the one or more second nucleotide changes are present together in the same nucleic acid molecule. In a specific embodiment, one or more nucleotide changes that compensate for secondary structure disruption is/are one or more nucleotide changes that compensate for one or more nucleotide pairing disruptions. Thus, in one embodiment, “compensating for secondary structure disruption” means “compensating for nucleotide pairing disruptions”, i.e. one or more nucleotide pairing disruptions, for example one or more nucleotide pairing disruptions within one or more stem loops. The one or more one or more nucleotide pairing disruptions may have been introduced by the removal of at least one initiation codon. Each of the one or more nucleotide changes that compensates for secondary structure disruption is a nucleotide change, which can each be independently selected from a deletion, an addition, a substitution and/or an insertion of one or more nucleotides. In an illustrative example, when the nucleotide pairing A:U has been disrupted by substitution of A to C (C and U are not typically suitable to form a nucleotide pair); then a nucleotide change that compensates for nucleotide pairing disruption may be substitution of U by G, thereby enabling formation of the C:G nucleotide pairing. The substitution of U by G thus compensates for the nucleotide pairing disruption. In an alternative example, when the nucleotide pairing A:U has been disrupted by substitution of A to C; then a nucleotide change that compensates for nucleotide pairing disruption may be substitution of C by A, thereby restoring formation of the original A:U nucleotide pairing. In general, in the present invention, those nucleotide changes compensating for secondary structure disruption are preferred which do neither restore the original nucleic acid sequence nor create novel AUG triplets. In the above set of examples, the U to G substitution is preferred over the C to A substitution.

According to the invention, the term “tertiary structure”, with reference to a nucleic acid molecule, refers to the three dimensional structure of a nucleic acid molecule, as defined by the atomic coordinates.

According to the invention, a nucleic acid such as RNA, e.g. mRNA, may encode a peptide or protein. Accordingly, a transcribable nucleic acid sequence or a transcript thereof may contain an open reading frame (ORF) encoding a peptide or protein.

According to the invention, the term “nucleic acid encoding a peptide or protein” means that the nucleic acid, if present in the appropriate environment, preferably within a cell, can direct the assembly of amino acids to produce the peptide or protein during the process of translation. Preferably, coding RNA according to the invention is able to interact with the cellular translation machinery allowing translation of the coding RNA to yield a peptide or protein.

According to the invention, the term “peptide” comprises oligo- and polypeptides and refers to substances which comprise two or more, preferably 3 or more, preferably 4 or more, preferably 6 or more, preferably 8 or more, preferably 10 or more, preferably 13 or more, preferably 16 or more, preferably 20 or more, and up to preferably 50, preferably 100 or preferably 150, consecutive amino acids linked to one another via peptide bonds. The term “protein” refers to large peptides, preferably peptides having at least 151 amino acids, but the terms “peptide” and “protein” are used herein usually as synonyms.

The terms “peptide” and “protein” comprise, according to the invention, substances which contain not only amino acid components but also non-amino acid components such as sugars and phosphate structures, and also comprise substances containing bonds such as ester, thioether or disulfide bonds.

According to the invention, the terms “initiation codon” and “start codon” synonymously refer to a codon (base triplet) of a RNA molecule that is potentially the first codon that is translated by a ribosome. Such codon typically encodes the amino acid methionine in eukaryotes and a modified methionine in prokaryotes. The most common initiation codon in eukaryotes and prokaryotes is AUG. Unless specifically stated herein that an initiation codon other than AUG is meant, the terms “initiation codon” and “start codon”, with reference to an RNA molecule, refer to the codon AUG. According to the invention, the terms “initiation codon” and “start codon” are also used to refer to a corresponding base triplet of a deoxyribonucleic acid, namely the base triplet encoding the initiation codon of a RNA. If the initiation codon of messenger RNA is AUG, the base triplet encoding the AUG is ATG. According to the invention, the terms “initiation codon” and “start codon” preferably refer to a functional initiation codon or start codon, i.e. to an initiation codon or start codon that is used or would be used as a codon by a ribosome to start translation. There may be AUG codons in an RNA molecule that are not used as codons by a ribosome to start translation, e.g. due to a short distance of the codons to the cap. These codons are not encompassed by the term functional initiation codon or start codon.

According to the invention, the terms “start codon of the open reading frame” or “initiation codon of the open reading frame” refer to the base triplet that serves as initiation codon for protein synthesis in a coding sequence, e.g. in the coding sequence of a nucleic acid molecule found in nature. In an RNA molecule, the start codon of the open reading frame is often preceded by a 5′ untranslated region (5′-UTR), although this is not strictly required.

According to the invention, the terms “native start codon of the open reading frame” or “native initiation codon of the open reading frame” refer to the base triplet that serves as initiation codon for protein synthesis in a native coding sequence. A native coding sequence may be e.g. the coding sequence of a nucleic acid molecule found in nature.

In some embodiments, the present invention provides variants of nucleic acid molecules found in nature, which are characterized in that the native start codon (which is present in the native coding sequence) has been removed (so that it is not present in the variant nucleic acid molecule).

According to the invention, “first AUG” means the most upstream AUG base triplet of a messenger RNA molecule, preferably the most upstream AUG base triplet of a messenger RNA molecule that is used or would be used as a codon by a ribosome to start translation. Accordingly, “first ATG” refers to the ATG base triplet of a coding DNA sequence that encodes the first AUG. In some instances, the first AUG of a mRNA molecule is the start codon of an open reading frame, i.e. the codon that is used as start codon during ribosomal protein synthesis.

According to the invention, the terms “comprises the removal” or “characterized by the removal” and similar terms, with reference to a certain element of a nucleic acid variant, mean that said certain element is not functional or not present in the nucleic acid variant, compared to a reference nucleic acid molecule. Without limitation, a removal can consist of deletion of all or part of the certain element, of substitution of all or part of the certain element, or of alteration of the functional or structural properties of the certain element. The removal of a functional element of a nucleic acid sequence requires that the function is not exhibited at the position of the nucleic acid variant comprising the removal. For example, a RNA variant characterized by the removal of a certain initiation codon requires that ribosomal protein synthesis is not initiated at the position of the RNA variant characterized by the removal. The removal of a structural element of a nucleic acid sequence requires that the structural element is not present at the position of the nucleic acid variant comprising the removal. For example, a RNA variant characterized by the removal of a certain AUG base triplet, i.e. of a AUG base triplet at a certain position, may be characterized, e.g. by deletion of part or all of the certain AUG base triplet (e.g. AAUG), or by substitution of one or more nucleotides (A, U, G) of the certain AUG base triplet by any one or more different nucleotides, so that the resulting nucleotide sequence of the variant does not comprise said AUG base triplet. Suitable substitutions of one nucleotide are those that convert the AUG base triplet into a GUG, CUG or UUG base triplet, or into a AAG, ACG or AGG base triplet, or into a AUA, AUC or AUU base triplet. Suitable substitutions of more nucleotides can be selected accordingly.

According to the invention, the term “alphavirus” is to be understood broadly and includes any virus particle that has characteristics of alphaviruses. Characteristics of alphavirus include the presence of a (+) stranded RNA which encodes genetic information suitable for replication in a host cell, including RNA polymerase activity. Further characteristics of many alphaviruses are described e.g. in Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562. The term “alphavirus” includes alphavirus found in nature, as well as any variant or derivative thereof. In some embodiments, a variant or derivative is not found in nature.

In one embodiment, the alphavirus is an alphavirus found in nature. Typically, an alphavirus found in nature is infectious to any one or more eukaryotic organisms, such as an animal (including a vertebrate such as a human, and an arthropod such as an insect).

An alphavirus found in nature is preferably selected from the group consisting of the following: Barmah Forest virus complex (comprising Barmah Forest virus); Eastern equine encephalitis complex (comprising seven antigenic types of Eastern equine encephalitis virus); Middelburg virus complex (comprising Middelburg virus); Ndumu virus complex (comprising Ndumu virus); Semliki Forest virus complex (comprising Bebaru virus, Chikungunya virus, Mayaro virus and its subtype Una virus, O'Nyong Nyong virus, and its subtype Igbo-Ora virus, Ross River virus and its subtypes Bebaru virus, Getah virus, Sagiyama virus, Semliki Forest virus and its subtype Me Tri virus); Venezuelan equine encephalitis complex (comprising Cabassou virus, Everglades virus, Mosso das Pedras virus, Mucambo virus, Paramana virus, Pixuna virus, Rio Negro virus, Trocara virus and its subtype Bijou Bridge virus, Venezuelan equine encephalitis virus); Western equine encephalitis complex (comprising Aura virus, Babanki virus, Kyzylagach virus, Sindbis virus, Ockelbo virus, Whataroa virus, Buggy Creek virus, Fort Morgan virus, Highlands J virus, Western equine encephalitis virus); and some unclassified viruses including Salmon pancreatic disease virus; Sleeping Disease virus; Southern elephant seal virus; Tonate virus. More preferably, the alphavirus is selected from the group consisting of Semliki Forest virus complex (comprising the virus types as indicated above, including Semliki Forest virus), Western equine encephalitis complex (comprising the virus types as indicated above, including Sindbis virus), Eastern equine encephalitis virus (comprising the virus types as indicated above), Venezuelan equine encephalitis complex (comprising the virus types as indicated above, including Venezuelan equine encephalitis virus).

In a further preferred embodiment, the alphavirus is Semliki Forest virus. In an alternative further preferred embodiment, the alphavirus is Sindbis virus. In an alternative further preferred embodiment, the alphavirus is Venezuelan equine encephalitis virus.

In some embodiments of the present invention, the alphavirus is not an alphavirus found in nature. Typically, an alphavirus not found in nature is a variant or derivative of an alphavirus found in nature, that is distinguished from an alphavirus found in nature by at least one mutation in the nucleotide sequence, i.e. the genomic RNA. The mutation in the nucleotide sequence may be selected from an insertion, a substitution or a deletion of one or more nucleotides, compared to an alphavirus found in nature. A mutation in the nucleotide sequence may or may not be associated with a mutation in a polypeptide or protein encoded by the nucleotide sequence. For example, an alphavirus not found in nature may be an attenuated alphavirus. An attenuated alphavirus not found in nature is an alphavirus that typically has at least one mutation in its nucleotide sequence by which it is distinguished from an alphavirus found in nature, and that is either not infectious at all, or that is infectious but has a lower disease-producing ability or no disease-producing ability at all. As an illustrative example, TC83 is an attenuated alphavirus that is distinguished from the Venezuelan equine encephalitis virus (VEEV) found in nature (Mckinney et al., 1963, Am. J. Trop. Med. Hyg., 1963, vol. 12; pp. 597-603).

Members of the alphavirus genus may also be classified based on their relative clinical features in humans: alphaviruses associated primarily with encephalitis, and alphaviruses associated primarily with fever, rash, and polyarthritis.

The term “alphaviral” means found in an alphavirus, or originating from an alphavirus or derived from an alphavirus, e.g. by genetic engineering.

According to the invention, “SFV” stands for Semliki Forest virus. According to the invention, “SIN” or “SINV” stands for Sindbis virus. According to the invention, “VEE” or “VEEV” stands for Venezuelan equine encephalitis virus.

According to the invention, the term “of an alphavirus” refers to an entity of origin from an alphavirus. For illustration, a protein of an alphavirus may refer to a protein that is found in alphavirus and/or to a protein that is encoded by alphavirus; and a nucleic acid sequence of an alphavirus may refer to a nucleic acid sequence that is found in alphavirus and/or to a nucleic acid sequence that is encoded by alphavirus. Preferably, a nucleic acid sequence “of an alphavirus” refers to a nucleic acid sequence “of the genome of an alphavirus” and/or “of genomic RNA of an alphavirus”.

According to the invention, the term “alphaviral RNA” refers to any one or more of alphaviral genomic RNA (i.e. (+) strand), complement of alphaviral genomic RNA (i.e. (−) strand), and the subgenomic transcript (i.e. (+) strand), or a fragment of any thereof.

According to the invention, “alphavirus genome” refers to genomic (+) strand RNA of an alphavirus.

According to the invention, the term “native alphavirus sequence” and similar terms typically refer to a (e.g. nucleic acid) sequence of a naturally occurring alphavirus (alphavirus found in nature). In some embodiments, the term “native alphavirus sequence” also includes a sequence of an attenuated alphavirus.

According to the invention, the term “5′ replication recognition sequence” preferably refers to a continuous nucleic acid sequence, preferably a ribonucleic acid sequence, that is identical or homologous to a 5′ fragment of the alphavirus genome. The “5′ replication recognition sequence” is a nucleic acid sequence that can be recognized by an alphaviral replicase. The term 5′ replication recognition sequence includes native 5′ replication recognition sequences as well as functional equivalents thereof, such as, e.g., functional variants of a 5′ replication recognition sequence of alphavirus found in nature. According to the invention, functional equivalents include derivatives of 5′ replication recognition sequences characterized by the removal of at least one initiation codon as described herein. The 5′ replication recognition sequence is required for synthesis of the (−) strand complement of alphavirus genomic RNA, and is required for synthesis of (+) strand viral genomic RNA based on a (−) strand template. A native 5′ replication recognition sequence typically encodes at least the N-terminal fragment of nsP1; but does not comprise the entire open reading frame encoding nsP1234. In view of the fact that a native 5′ replication recognition sequence typically encodes at least the N-terminal fragment of nsP1, a native 5′ replication recognition sequence typically comprises at least one initiation codon, typically AUG. In one embodiment, the 5′ replication recognition sequence comprises conserved sequence element 1 of an alphavirus genome (CSE 1) or a variant thereof and conserved sequence element 2 of an alphavirus genome (CSE 2) or a variant thereof. The 5′ replication recognition sequence is typically capable of forming four stem loops (SL), i.e. SL1, SL2, SL3, SL4. The numbering of these stem loops begins at the 5′ end of the 5′ replication recognition sequence.

According to the invention, the term “at the 5′ end of an alphavirus” refers to the 5′ end of the genome of an alphavirus. A nucleic acid sequence at the 5′ end of an alphavirus encompasses the nucleotide located at the 5′ terminus of alphavirus genomic RNA, plus optionally a consecutive sequence of further nucleotides. In one embodiment, a nucleic acid sequence at the 5′ end of an alphavirus is identical to the 5′ replication recognition sequence of the alphavirus genome.

The term “conserved sequence element” or “CSE” refers to a nucleotide sequence found in alphavirus RNA. These sequence elements are termed “conserved” because orthologs are present in the genome of different alphaviruses, and orthologous CSEs of different alphaviruses preferably share a high percentage of sequence identity and/or a similar secondary or tertiary structure. The term CSE includes CSE 1, CSE 2, CSE 3 and CSE 4.

According to the invention, the terms “CSE 1” or “44-nt CSE” synonymously refer to a nucleotide sequence that is required for (+) strand synthesis from a (−) strand template. The term “CSE 1” refers to a sequence on the (+) strand; and the complementary sequence of CSE 1 (on the (−) strand) functions as a promoter for (+) strand synthesis. Preferably, the term CSE 1 includes the most 5′ nucleotide of the alphavirus genome. CSE 1 typically forms a conserved stem-loop structure. Without wishing to be bound to a particular theory, it is believed that, for CSE 1, the secondary structure is more important than the primary structure, i.e. the linear sequence. In genomic RNA of the model alphavirus Sindbis virus, CSE 1 consists of a consecutive sequence of 44 nucleotides, which is formed by the most 5′ 44 nucleotides of the genomic RNA (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562).

According to the invention, the terms “CSE 2” or “51-nt CSE” synonymously refer to a nucleotide sequence that is required for (−) strand synthesis from a (+) strand template. The (+) strand template is typically alphavirus genomic RNA or an RNA replicon (note that the subgenomic RNA transcript, which does not comprise CSE 2, does not function as a template for (−) strand synthesis). In alphavirus genomic RNA, CSE 2 is typically localized within the coding sequence for nsP1. In genomic RNA of the model alphavirus Sindbis virus, the 51-nt CSE is located at nucleotide positions 155-205 of genomic RNA (Frolov et al., 2001, RNA, vol. 7, pp. 1638-1651). CSE 2 forms typically two conserved stem loop structures. These stem loop structures are designated as stem loop 3 (SL3) and stem loop 4 (SL4) because they are the third and fourth conserved stem loop, respectively, of alphavirus genomic RNA, counted from the 5′ end of alphavirus genomic RNA. Without wishing to be bound to a particular theory, it is believed that, for CSE 2, the secondary structure is more important than the primary structure, i.e. the linear sequence.

According to the invention, the terms “CSE 3” or “junction sequence” synonymously refer to a nucleotide sequence that is derived from alphaviral genomic RNA and that comprises the start site of the subgenomic RNA. The complement of this sequence in the (−) strand acts to promote subgenomic RNA transcription. In alphavirus genomic RNA, CSE 3 typically overlaps with the region encoding the C-terminal fragment of nsP4 and extends to a short non-coding region located upstream of the open reading frame encoding the structural proteins.

According to the invention, the terms “CSE 4” or “19-nt conserved sequence” or “19-nt CSE” synonymously refer to a nucleotide sequence from alphaviral genomic RNA, immediately upstream of the poly(A) sequence in the 3′ untranslated region of the alphavirus genome. CSE 4 typically consists of 19 consecutive nucleotides. Without wishing to be bound to a particular theory, CSE 4 is understood to function as a core promoter for initiation of (−) strand synthesis (José et al., Future Microbiol., 2009, vol. 4, pp. 837-856); and/or CSE 4 and the poly(A) tail of the alphavirus genomic RNA are understood to function together for efficient (−) strand synthesis (Hardy & Rice, J. Virol., 2005, vol. 79, pp. 4630-4639).

According to the invention, the term “subgenomic promoter” or “SGP” refers to a nucleic acid sequence upstream (5′) of a nucleic acid sequence (e.g. coding sequence), which controls transcription of said nucleic acid sequence by providing a recognition and binding site for RNA polymerase, typically RNA-dependent RNA polymerase, in particular functional alphavirus non-structural protein. The SGP may include further recognition or binding sites for further factors. A subgenomic promoter is typically a genetic element of a positive strand RNA virus, such as an alphavirus. A subgenomic promoter of alphavirus is a nucleic acid sequence comprised in the viral genomic RNA. The subgenomic promoter is generally characterized in that it allows initiation of the transcription (RNA synthesis) in the presence of an RNA-dependent RNA polymerase, e.g. functional alphavirus non-structural protein. A RNA (−) strand, i.e. the complement of alphaviral genomic RNA, serves as a template for synthesis of a (+) strand subgenomic transcript, and synthesis of the (+) strand subgenomic transcript is typically initiated at or near the subgenomic promoter. The term “subgenomic promoter” as used herein, is not confined to any particular localization in a nucleic acid comprising such subgenomic promoter. In some embodiments, the SGP is identical to CSE 3 or overlaps with CSE 3 or comprises CSE 3.

The terms “subgenomic transcript” or “subgenomic RNA” synonymously refer to a RNA molecule that is obtainable as a result of transcription using a RNA molecule as template (“template RNA”), wherein the template RNA comprises a subgenomic promoter that controls transcription of the subgenomic transcript. The subgenomic transcript is obtainable in the presence of an RNA-dependent RNA polymerase, in particular functional alphavirus non-structural protein. For instance, the term “subgenomic transcript” may refer to the RNA transcript that is prepared in a cell infected by an alphavirus, using the (−) strand complement of alphavirus genomic RNA as template. However, the term “subgenomic transcript”, as used herein, is not limited thereto and also includes transcripts obtainable by using heterologous RNA as template. For example, subgenomic transcripts are also obtainable by using the (−) strand complement of SGP-containing replicons according to the present invention as template. Thus, the term “subgenomic transcript” may refer to a RNA molecule that is obtainable by transcribing a fragment of alphavirus genomic RNA, as well as to a RNA molecule that is obtainable by transcribing a fragment of a replicon according to the present invention.

The term “autologous” is used to describe anything that is derived from the same subject. For example, “autologous cell” refers to a cell derived from the same subject. Introduction of autologous cells into a subject is advantageous because these cells overcome the immunological barrier which otherwise results in rejection.

The term “allogeneic” is used to describe anything that is derived from different individuals of the same species. Two or more individuals are said to be allogeneic to one another when the genes at one or more loci are not identical.

The term “syngeneic” is used to describe anything that is derived from individuals or tissues having identical genotypes, i.e., identical twins or animals of the same inbred strain, or their tissues or cells.

The term “heterologous” is used to describe something consisting of multiple different elements. As an example, the introduction of one individual's cell into a different individual constitutes a heterologous transplant. A heterologous gene is a gene derived from a source other than the subject.

The following provides specific and/or preferred variants of the individual features of the invention. The present invention also contemplates as particularly preferred embodiments those embodiments, which are generated by combining two or more of the specific and/or preferred variants described for two or more of the features of the present invention.

RNA Replicon

A nucleic acid construct that is capable of being replicated by a replicase, preferably an alphaviral replicase, is termed replicon. According to the invention, the term “replicon” defines a RNA molecule that can be replicated by RNA-dependent RNA polymerase, yielding—without DNA intermediate—one or multiple identical or essentially identical copies of the RNA replicon. “Without DNA intermediate” means that no deoxyribonucleic acid (DNA) copy or complement of the replicon is formed in the process of forming the copies of the RNA replicon, and/or that no deoxyribonucleic acid (DNA) molecule is used as a template in the process of forming the copies of the RNA replicon, or complement thereof. The replicase function is typically provided by functional alphavirus non-structural protein.

According to the invention, the terms “can be replicated” and “capable of being replicated” generally describe that one or more identical or essentially identical copies of a nucleic acid can be prepared. When used together with the term “replicase”, such as in “capable of being replicated by a replicase”, the terms “can be replicated” and “capable of being replicated” describe functional characteristics of a nucleic acid molecule, e.g. a RNA replicon, with respect to a replicase. These functional characteristics comprise at least one of (i) the replicase is capable of recognizing the replicon and (ii) the replicase is capable of acting as RNA-dependent RNA polymerase (RdRP). Preferably, the replicase is capable of both (i) recognizing the replicon and (ii) acting as RNA-dependent RNA polymerase.

The expression “capable of recognizing” describes that the replicase is capable of physically associating with the replicon, and preferably, that the replicase is capable of binding to the replicon, typically non-covalently. The term “binding” can mean that the replicase has the capacity of binding to any one or more of a conserved sequence element 1 (CSE 1) or complementary sequence thereof (if comprised by the replicon), conserved sequence element 2 (CSE 2) or complementary sequence thereof (if comprised by the replicon), conserved sequence element 3 (CSE 3) or complementary sequence thereof (if comprised by the replicon), conserved sequence element 4 (CSE 4) or complementary sequence thereof (if comprised by the replicon). Preferably, the replicase is capable of binding to CSE 2 [i.e. to the (+) strand] and/or to CSE 4 [i.e. to the (+) strand], or of binding to the complement of CSE 1 [i.e. to the (−) strand] and/or to the complement of CSE 3 [i.e. to the (−) strand].

The expression “capable of acting as RdRP” means that the replicase is capable to catalyze the synthesis of the (−) strand complement of alphaviral genomic (+) strand RNA, wherein the (+) strand RNA has template function, and/or that the replicase is capable to catalyze the synthesis of (+) strand alphaviral genomic RNA, wherein the (−) strand RNA has template function. In general, the expression “capable of acting as RdRP” can also include that the replicase is capable to catalyze the synthesis of a (+) strand subgenomic transcript wherein a (−) strand RNA has template function, and wherein synthesis of the (+) strand subgenomic transcript is typically initiated at an alphavirus subgenomic promoter.

The expressions “capable of binding” and “capable of acting as RdRP” refer to the capability at normal physiological conditions. In particular, they refer to the conditions inside a cell, which expresses functional alphavirus non-structural protein or which has been transfected with a nucleic acid that codes for functional alphavirus non-structural protein. The cell is preferably a eukaryotic cell. The capability of binding and/or the capability of acting as RdRP can be experimentally tested, e.g. in a cell-free in vitro system or in a eukaryotic cell. Optionally, said eukaryotic cell is a cell from a species to which the particular alphavirus that represents the origin of the replicase is infectious. For example, when the alphavirus replicase from a particular alphavirus is used that is infectious to humans, the normal physiological conditions are conditions in a human cell. More preferably, the eukaryotic cell (in one example human cell) is from the same tissue or organ to which the particular alphavirus that represents the origin of the replicase is infectious.

According to the invention, “compared to a native alphavirus sequence” and similar terms refer to a sequence that is a variant of a native alphavirus sequence. The variant is typically not itself a native alphavirus sequence.

The RNA replicon of the invention comprises a 5′ replication recognition sequence. A 5′ replication recognition sequence is a nucleic acid sequence that can be recognized by functional alphavirus non-structural protein. In other words, functional alphavirus non-structural protein is capable of recognizing the 5′ replication recognition sequence.

In one embodiment, the RNA replicon of the invention comprises a 5′ replication recognition sequence, wherein the 5′ replication recognition sequence is characterized in that it comprises the removal of at least one initiation codon compared to a native alphavirus 5′ replication recognition sequence.

The 5′ replication recognition sequence that is characterized in that it comprises the removal of at least one initiation codon compared to a native alphavirus 5′ replication recognition sequence, according to the present invention, can be referred to herein as “modified 5′ replication recognition sequence” or “5′ replication recognition sequence according to the invention”. As described herein below, the 5′ replication recognition sequence according to the invention may optionally be characterized by the presence of one or more additional nucleotide changes.

In one embodiment, the RNA replicon comprises a 3′ replication recognition sequence. A 3′ replication recognition sequence is a nucleic acid sequence that can be recognized by functional alphavirus non-structural protein. In other words, functional alphavirus non-structural protein is capable of recognizing the 3′ replication recognition sequence. Preferably, the 3′ replication recognition sequence is located at the 3′ end of the replicon (if the replicon does not comprise a poly(A) tail), or immediately upstream of the poly(A) tail (if the replicon comprises a poly(A) tail). In one embodiment, the 3′ replication recognition sequence consists of or comprises CSE 4.

In one embodiment, the 5′ replication recognition sequence and the 3′ replication recognition sequence are capable of directing replication of the RNA replicon according to the present invention in the presence of functional alphavirus non-structural protein. Thus, when present alone or preferably together, these recognition sequences direct replication of the RNA replicon in the presence of functional alphavirus non-structural protein.

It is preferable that a functional alphavirus non-structural protein is provided in cis (encoded as protein of interest by an open reading frame on the replicon) or in trans (encoded as protein of interest by an open reading frame on a separate replicase construct as described in the second aspect), that is capable of recognizing both the optionally modified 5′ replication recognition sequence and the 3′ replication recognition sequence of the replicon. In one embodiment, this is achieved when the 5′ replication recognition sequence and the 3′ replication recognition sequence are native to the alphavirus from which the functional alphavirus non-structural protein is derived, or when the 3′ replication recognition sequence is native to the alphavirus from which the functional alphavirus non-structural protein is derived and the modified 5′ replication recognition sequence is a variant of the 5′ replication recognition sequence that is native to the alphavirus from which the functional alphavirus non-structural protein is derived. Native means that the natural origin of these sequences is the same alphavirus. In an alternative embodiment, the (modified) 5′ replication recognition sequence and/or the 3′ replication recognition sequence are not native to the alphavirus from which the functional alphavirus non-structural protein is derived, provided that the functional alphavirus non-structural protein is capable of recognizing both the (modified) 5′ replication recognition sequence and the 3′ replication recognition sequence of the replicon. In other words, the functional alphavirus non-structural protein is compatible to the (modified) 5′ replication recognition sequence and the 3′ replication recognition sequence. When a non-native functional alphavirus non-structural protein is capable of recognizing a respective sequence or sequence element, the functional alphavirus non-structural protein is said to be compatible (cross-virus compatibility). Any combination of (3′/5′) replication recognition sequences and CSEs, respectively, with functional alphavirus non-structural protein is possible as long as cross-virus compatibility exists. Cross-virus compatibility can readily be tested by the skilled person working the present invention by incubating a functional alphavirus non-structural protein to be tested together with an RNA, wherein the RNA has 3′- and (optionally modified) 5′ replication recognition sequences to be tested, at conditions suitable for RNA replication, e.g. in a suitable host cell. If replication occurs, the (3′/5′) replication recognition sequences and the functional alphavirus non-structural protein are determined to be compatible.

The removal of at least one initiation codon provides several advantages. Absence of an initiation codon in the nucleic acid sequence encoding nsP1* will typically cause that nsP1* (N-terminal fragment of nsP1) is not translated. Further, since nsP1* is not translated, the open reading frame encoding the protein of interest (“Transgene”) is the most upstream open reading frame accessible to the ribosome; thus when the replicon is present in a cell, translation is initiated at the first AUG of the open reading frame (RNA) encoding the gene of interest. This represents an advantage over prior art trans-replicons, such as those described by Spuul et al. (J. Virol., 2011, vol. 85, pp. 4739-4751): replicons according to Spuul et al. direct the expression of the N-terminal portion of nsP1, a peptide of 74 amino acids. It is also known from the prior art that construction of RNA replicons from full-length virus genomes is not a trivial matter, as certain mutations can render RNA incapable of being replicated (WO 2000/053780 A2), and removal of some parts of the 5′ structure that is important for replication of alphavirus affects efficiency of replication (Kamrud et al., 2010, J. Gen. Virol., vol. 91, pp. 1723-1727).

The advantage over conventional cis-replicons is that removal of at least one initiation codon uncouples the coding region for the alphaviral non-structural protein from the 5′ replication recognition sequence. This enables a further engineering of cis-replicons e.g. by exchanging the native 5′ replication recognition sequence to an artificial sequence, a mutated sequence, or a heterologous sequence taken from another RNA virus. Such sequence manipulations in conventional cis-replicons are restricted by the amino acid sequence of nsP1. Any point mutation, or clusters of point mutations, would require experimental assessment whether replication is affected and small insertions or deletion leading to frame shift mutations are impossible due to their detrimental effect on the protein.

The removal of at least one initiation codon according to the present invention can be achieved by any suitable method known in the art. For example, a suitable DNA molecule encoding the replicon according to the invention, i.e. characterized by the removal of an initiation codon, can be designed in silico, and subsequently synthesized in vitro (gene synthesis); alternatively, a suitable DNA molecule may be obtained by site-directed mutagenesis of a DNA sequence encoding a replicon. In any case, the respective DNA molecule may serve as template for in vitro transcription, thereby providing the replicon according to the invention.

The removal of at least one initiation codon compared to a native alphavirus 5′ replication recognition sequence is not particularly limited and may be selected from any nucleotide modification, including substitution of one or more nucleotides (including, on DNA level, a substitution of A and/or T and/or G of the initiation codon); deletion of one or more nucleotides (including, on DNA level, a deletion of A and/or T and/or G of the initiation codon), and insertion of one or more nucleotides (including, on DNA level, an insertion of one or more nucleotides between A and T and/or between T and G of the initiation codon). Irrespective of whether the nucleotide modification is a substitution, an insertion or a deletion, the nucleotide modification must not result in the formation of a new initiation codon (as an illustrative example: an insertion, at DNA level, must not be an insertion of an ATG).

The 5′ replication recognition sequence of the RNA replicon that is characterized by the removal of at least one initiation codon (i.e. the modified 5′ replication recognition sequence according to the present invention) is preferably a variant of a 5′ replication recognition sequence of the genome of an alphavirus found in nature. In one embodiment, the modified 5′ replication recognition sequence according to the present invention is preferably characterized by a degree of sequence identity of 80% or more, preferably 85% or more, more preferably 90% or more, even more preferably 95% or more, to the 5′ replication recognition sequence of the genome of at least one alphavirus found in nature.

In one embodiment, the 5′ replication recognition sequence of the RNA replicon that is characterized by the removal of at least one initiation codon comprises a sequence homologous to about 250 nucleotides at the 5′ end of an alphavirus, i.e. at the 5′ end of the alphaviral genome. In a preferred embodiment, it comprises a sequence homologous to about 250 to 500, preferably about 300 to 500 nucleotides at the 5′ end of an alphavirus, i.e. at the 5′ end of the alphaviral genome. “At the 5′ end of the alphaviral genome” means a nucleic acid sequence beginning at, and including, the most upstream nucleotide of the alphaviral genome. In other words, the most upstream nucleotide of the alphaviral genome is designated nucleotide no. 1, and e.g. “250 nucleotides at the 5′ end of the alphaviral genome” means nucleotides 1 to 250 of the alphaviral genome. In one embodiment, the 5′ replication recognition sequence of the RNA replicon that is characterized by the removal of at least one initiation codon is characterized by a degree of sequence identity of 80% or more, preferably 85% or more, more preferably 90% or more, even more preferably 95% or more, to at least 250 nucleotides at the 5′ end of the genome of at least one alphavirus found in nature. At least 250 nucleotides includes e.g. 250 nucleotides, 300 nucleotides, 400 nucleotides, 500 nucleotides.

The 5′ replication recognition sequence of an alphavirus found in nature is typically characterized by at least one initiation codon and/or by conserved secondary structural motifs. For example, the native 5′ replication recognition sequence of Semliki Forest virus (SFV) comprises five specific AUG base triplets. According to Frolov et al. (2001, RNA, vol. 7, pp. 1638-1651) analysis by MFOLD revealed that the native 5′ replication recognition sequence of Semliki Forest virus is predicted to form four stem loops (SL), termed stem loops 1 to 4 (SL1, SL2, SL3, SL4). According to Frolov et al., analysis by MFOLD revealed that also the native 5′ replication recognition sequence of a different alphavirus, Sindbis virus, is predicted to form four stem loops: SL1, SL2, SL3, SL4.

It is known that the 5′ end of the alphaviral genome comprises sequence elements that enable replication of the alphaviral genome by functional alphavirus non-structural protein. In one embodiment of the present invention, the 5′ replication recognition sequence of the RNA replicon comprises a sequence homologous to conserved sequence element 1 (CSE 1) and/or a sequence homologous to conserved sequence element 2 (CSE 2) of an alphavirus.

Conserved sequence element 2 (CSE 2) of alphavirus genomic RNA typically is represented by SL3 and SL4 which is preceded by SL2 comprising at least the native initiation codon that encodes the first amino acid residue of alphavirus non-structural protein nsP1. In this description, however, in some embodiments, the conserved sequence element 2 (CSE 2) of alphavirus genomic RNA refers to a region spanning from SL2 to SL4 and comprising the native initiation codon that encodes the first amino acid residue of alphavirus non-structural protein nsP1. In a preferred embodiment, the RNA replicon comprises CSE 2 or a sequence homologous to CSE 2. In one embodiment, the RNA replicon comprises a sequence homologous to CSE 2 that is preferably characterized by a degree of sequence identity of 80% or more, preferably 85% or more, more preferably 90% or more, even more preferably 95% or more, to the sequence of CSE 2 of at least one alphavirus found in nature.

In a preferred embodiment, the 5′ replication recognition sequence comprises a sequence that is homologous to CSE 2 of an alphavirus. The CSE 2 of an alphavirus may comprise a fragment of an open reading frame of a non-structural protein from an alphavirus.

Thus, in a preferred embodiment, the RNA replicon is characterized in that it comprises a sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus. The sequence homologous to an open reading frame of a non-structural protein or a fragment thereof is typically a variant of an open reading frame of a non-structural protein or a fragment thereof of an alphavirus found in nature. In one embodiment, the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof is preferably characterized by a degree of sequence identity of 80% or more, preferably 85% or more, more preferably 90% or more, even more preferably 95% or more, to an open reading frame of a non-structural protein or a fragment thereof of at least one alphavirus found in nature.

In a more preferred embodiment, the sequence homologous to an open reading frame of a non-structural protein that is comprised by the replicon of the present invention does not comprise the native initiation codon of a non-structural protein, and more preferably does not comprise any initiation codon of a non-structural protein. In a preferred embodiment, the sequence homologous to CSE 2 is characterized by the removal of all initiation codons compared to a native alphavirus CSE 2 sequence. Thus, the sequence homologous to CSE 2 does preferably not comprise any initiation codon.

When the sequence homologous to an open reading frame does not comprise any initiation codon, the sequence homologous to an open reading frame is not itself an open reading frame since it does not serve as a template for translation.

In a preferred embodiment, the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus is characterized in that it comprises the removal of at least the native start codon of the open reading frame of a non-structural protein. Preferably, it is characterized in that it comprises the removal of at least the native start codon of the open reading frame encoding nsP1.

The native start codon is the AUG base triplet at which translation on ribosomes in a host cell begins when an RNA is present in a host cell. In other words, the native start codon is the first base triplet that is translated during ribosomal protein synthesis, e.g. in a host cell that has been inoculated with RNA comprising the native start codon. In one embodiment, the host cell is a cell from a eukaryotic species that is a natural host of the specific alphavirus that comprises the native alphavirus 5′ replication recognition sequence. In a preferred embodiment, the host cell is a BHK21 cell from the cell line “BHK21 [C13] (ATCC® CCL10™)”, available from American Type Culture Collection, Manassas, Virginia, USA.

The genomes of many alphaviruses have been fully sequenced and are publically accessible, and the sequences of non-structural proteins encoded by these genomes are publically accessible as well. Such sequence information allows to determine the native start codon in silico.

In one embodiment, the native start codon is comprised by a Kozak sequence or a functionally equivalent sequence. The Kozak sequence is a sequence initially described by Kozak (1987, Nucleic Acids Res., vol. 15, pp. 8125-8148). The Kozak sequence on an mRNA molecule is recognized by the ribosome as the translational start site. According to this reference, the Kozak sequence comprises an AUG start codon, immediately followed by a highly conserved G nucleotide: AUGG. In one embodiment of the present invention, the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus is characterized in that it comprises the removal of an initiation codon that is part of a Kozak sequence.

In one embodiment of the present invention, the 5′ replication recognition sequence of the replicon is characterized by the removal of at least all those initiation codons, which, at RNA level, are part of an AUGG sequence.

In a preferred embodiment, the sequence homologous to an open reading frame of a non-structural protein or a fragment thereof from an alphavirus is characterized in that it comprises the removal of one or more initiation codons other than the native start codon of the open reading frame of a non-structural protein. In a more preferred embodiment, said nucleic acid sequence is additionally characterized by the removal of the native start codon. For example, in addition to the removal of the native start codon, any one or two or three or four or more than four (e.g. five) initiation codons may be removed.

If the replicon is characterized by the removal of the native start codon, and optionally by the removal of one or more initiation codons other than the native start codon, of the open reading frame of a non-structural protein, the sequence homologous to an open reading frame is not itself an open reading frame since it does not serve as a template for translation.

The one or more initiation codon other than the native start codon that is removed, preferably in addition to removal of the native start codon, is preferably selected from an AUG base triplet that has the potential to initiate translation. An AUG base triplet that has the potential to initiate translation may be referred to as “potential initiation codon”. Whether a given AUG base triplet has the potential to initiate translation can be determined in silico or in a cell-based in vitro assay.

In one embodiment, it is determined in silico whether a given AUG base triplet has the potential to initiate translation: in that embodiment, the nucleotide sequence is examined, and an AUG base triplet is determined to have the potential to initiate translation if it is part of an AUGG sequence, preferably part of a Kozak sequence.

In one embodiment, it is determined in a cell-based in vitro assay whether a given AUG base triplet has the potential to initiate translation: a RNA replicon characterized by the removal of the native start codon and comprising the given AUG base triplet downstream of the position of the removal of the native start codon is introduced into a host cell. In one embodiment, the host cell is a cell from a eukaryotic species that is a natural host of the specific alphavirus that comprises the native alphavirus 5′ replication recognition sequence. In a preferred embodiment, the host cell is a BHK21 cell from the cell line “BHK21 [C13] (ATCC® CCL10™)”, available from American Type Culture Collection, Manassas, Virginia, USA. It is preferable that no further AUG base triplet is present between the position of the removal of the native start codon and the given AUG base triplet. If, following transfer of the RNA replicon—characterized by the removal of the native start codon and comprising the given AUG base triplet—into the host cell, translation is initiated at the given AUG base triplet, the given AUG base triplet is determined to have the potential to initiate translation. Whether translation is initiated can be determined by any suitable method known in the art. For example, the replicon may encode, downstream of the given AUG base triplet and in-frame with the given AUG base triplet, a tag that facilitates detection of the translation product (if any), e.g. a myc-tag or a HA-tag; whether or not an expression product having the encoded tag is present may be determined e.g. by Western Blot. In this embodiment, it is preferable that no further AUG base triplet is present between the given AUG base triplet and the nucleic acid sequence encoding the tag. The cell-based in vitro assay can be performed individually for more than one given AUG base triplet: in each case, it is preferable that no further AUG base triplet is present between the position of the removal of the native start codon and the given AUG base triplet. This can be achieved by removing all AUG base triplets (if any) between the position of the removal of the native start codon and the given AUG base triplet. Thereby, the given AUG base triplet is the first AUG base triplet downstream of the position of the removal of the native start codon.

Preferably, the replicon according to the present invention is characterized by the removal of all potential initiation codons that are downstream of the position of the removal of the native start codon and that are located within the open reading frame of alphavirus non-structural protein or of a fragment thereof. Thus, according to the invention, the 5′ replication recognition sequence preferably does not comprise an open reading frame that can be translated to protein.

In a preferred embodiment, the 5′ replication recognition sequence of the RNA replicon according to the invention is characterized by a secondary structure that is equivalent to the secondary structure of the 5′ replication recognition sequence of alphaviral genomic RNA. In a preferred embodiment, the 5′ replication recognition sequence of the RNA replicon according to the invention is characterized by a predicted secondary structure that is equivalent to the predicted secondary structure of the 5′ replication recognition sequence of alphaviral genomic RNA. According to the present invention, the secondary structure of an RNA molecule is preferably predicted by the web server for RNA secondary structure prediction http://rna.urmc.rochester.edu/RNAstructureWeb/Servers/Predict1/Predict1.html.

By comparing the secondary structure or predicted secondary structure of a 5′ replication recognition sequence of an RNA replicon characterized by the removal of at least one initiation codon compared to the native alphavirus 5′ replication recognition sequence, the presence or absence of a nucleotide pairing disruption can be identified. For example, at least one base pair may be absent at a given position, compared to a native alphavirus 5′ replication recognition sequence, e.g. a base pair within a stem loop, in particular the stem of the stem loop.

In a preferred embodiment, one or more stem loops of the 5′ replication recognition sequence are not deleted or disrupted. More preferably, stem loops 3 and 4 are not deleted or disrupted. More preferably, none of the stem loops of the 5′ replication recognition sequence is deleted or disrupted.

In one embodiment, the removal of at least one initiation codon does not disrupt the secondary structure of the 5′ replication recognition sequence. In an alternative embodiment, the removal of at least one initiation codon does disrupt the secondary structure of the 5′ replication recognition sequence. In this embodiment, the removal of at least one initiation codon may be causative for the absence of at least one base pair at a given position, e.g. a base pair within a stem loop, compared to a native alphavirus 5′ replication recognition sequence. If a base pair is absent within a stem loop, compared to a native alphavirus 5′ replication recognition sequence, the removal of at least one initiation codon is determined to introduce a nucleotide pairing disruption within the stem loop. A base pair within a stem loop is typically a base pair in the stem of the stem loop.

If the removal of at least one initiation codon introduces a nucleotide pairing disruption within a stem loop, compared to a native alphavirus 5′ replication recognition sequence, one or more nucleotide changes may be introduced which are expected to compensate for the nucleotide pairing disruption, and the secondary structure or predicted secondary structure obtained thereby may be compared to a native alphavirus 5′ replication recognition sequence.

Based on the common general knowledge and on the disclosure herein, certain nucleotide changes can be expected by the skilled person to compensate for nucleotide pairing disruptions. For example, if a base pair is disrupted at a given position of the secondary structure or predicted secondary structure of a given 5′ replication recognition sequence of an RNA replicon characterized by the removal of at least one initiation codon, compared to the native alphavirus 5′ replication recognition sequence, a nucleotide change that restores a base pair at that position, preferably without re-introducing an initiation codon, is expected to compensate for the nucleotide pairing disruption.

In a preferred embodiment, the 5′ replication recognition sequence of the replicon does not overlap with, or does not comprise, a translatable nucleic acid sequence, i.e. translatable into a peptide or protein, in particular a nsP, in particular nsP1, or a fragment of any thereof. For a nucleotide sequence to be “translatable”, it requires the presence of an initiation codon; the initiation codon encodes the most N-terminal amino acid residue of the peptide or protein. In one embodiment, the 5′ replication recognition sequence of the replicon does not overlap with, or does not comprise, a translatable nucleic acid sequence encoding an N-terminal fragment of nsP1.

In some scenarios, which are described in detail below, the RNA replicon comprises at least one subgenomic promoter. In a preferred embodiment, the subgenomic promoter of the replicon does not overlap with, or does not comprise, a translatable nucleic acid sequence, i.e. translatable into a peptide or protein, in particular a nsP, in particular nsP4, or a fragment of any thereof. In one embodiment, the subgenomic promoter of the replicon does not overlap with, or does not comprise, a translatable nucleic acid sequence that encodes a C-terminal fragment of nsP4. A RNA replicon having a subgenomic promoter that does not overlap with, or does not comprise, a translatable nucleic acid sequence, e.g. translatable into the C-terminal fragment of nsP4, may be generated by deleting part of the coding sequence for nsP4 (typically the part encoding the N-terminal part of nsP4), and/or by removing AUG base triplets in the part of the coding sequence for nsP4 that has not been deleted. If AUG base triplets in the coding sequence for nsP4 or a part thereof are removed, the AUG base triplets that are removed are preferably potential initiation codons. Alternatively, if the subgenomic promoter does not overlap with a nucleic acid sequence that encodes nsP4, the entire nucleic acid sequence encoding nsP4 may be deleted.

In one embodiment, the RNA replicon does not comprise an open reading frame encoding a truncated alphavirus non-structural protein. In the context of this embodiment, it is particularly preferable that the RNA replicon does not comprise an open reading frame encoding the N-terminal fragment of nsP1, and optionally does not comprise an open reading frame encoding the C-terminal fragment of nsP4. The N-terminal fragment of nsP1 is a truncated alphavirus protein; the C-terminal fragment of nsP4 is also a truncated alphavirus protein.

In some embodiments the replicon according to the present invention does not comprise stem loop 2 (SL2) of the 5′ terminus of the genome of an alphavirus. According to Frolov et al., supra, stem loop 2 is a conserved secondary structure found at the 5′ terminus of the genome of an alphavirus, upstream of CSE 2, but is dispensible for replication.

In one embodiment, the 5′ replication recognition sequence of the replicon does not overlap with a nucleic acid sequence that encodes alphavirus non-structural protein or a fragment thereof. Thus, the present invention encompasses replicons that are characterized, compared to genomic alphaviral RNA, by the removal of at least one initiation codon, as described herein, optionally combined with the deletion of the coding region for one or more alphavirus non-structural proteins, or a part thereof. For example, the coding region for nsP2 and nsP3 may be deleted, or the coding region for nsP2 and nsP3 may be deleted together with the deletion of the coding region for the C-terminal fragment of nsP1 and/or of the coding region for the N-terminal fragment of nsP4, and one or more remaining initiation codons, i.e. remaining after said removal, may be removed as described herein.

Deletion of the coding region for one or more alphavirus non-structural proteins may be achieved by standard methods, e.g., at DNA level, excision by the help of restriction enzymes, preferably restriction enzymes that recognize unique restriction sites in the open reading frame. Optionally, unique restriction sites may be introduced into an open reading frame by mutagenesis, e.g. site-directed mutagenesis. The respective DNA may be used as template for in vitro transcription.

A restriction site is a nucleic acid sequence, e.g. DNA sequence, which is necessary and sufficient to direct restriction (cleavage) of the nucleic acid molecule, e.g. DNA molecule, in which the restriction site is contained, by a specific restriction enzyme. A restriction site is unique for a given nucleic acid molecule if one copy of the restriction site is present in the nucleic acid molecule.

A restriction enzyme is an endonuclease that cuts a nucleic acid molecule, e.g. DNA molecule, at or near the restriction site.

Alternatively, a nucleic acid sequence characterized by the deletion of part or all of the open reading frame may be obtained by synthetic methods.

The RNA replicon according to the present invention is preferably a single stranded RNA molecule. The RNA replicon according to the present invention is typically a (+) stranded RNA molecule. In one embodiment, the RNA replicon of the present invention is an isolated nucleic acid molecule.

T Cell Receptors and Artificial T Cell Receptors

RNA replicons described herein comprising an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor are useful for expressing a T cell receptor or an artificial T cell receptor in a cell, in particular an immune effector cell such as a T cell. Cells engineered to express such T cell receptor or artificial T cell receptor are useful for providing an immune response in a subject and, in particular, in the treatment of diseases characterized by expression of an antigen targeted by the T cell receptor or artificial T cell receptor.

The term “immune response” refers to an integrated bodily response to an antigen and includes a cellular immune response. An immune response may be protective/preventive/prophylactic and/or therapeutic.

“Cellular immune response”, or similar terms are meant to include a cellular response directed to cells characterized by expression of an antigen, in particular characterized by presentation of an antigen with class I or class II MHC. The cellular response relates to cells called T cells or T-lymphocytes which act as either “helpers” or “killers”. The helper T cells (also termed CD4⁺ T cells) play a central role by regulating the immune response and the killer cells (also termed cytotoxic T cells, cytolytic T cells, CD8⁺ T cells or CTLs) kill diseased cells such as cancer cells, preventing the production of more diseased cells.

The term “antigen” relates to an agent comprising an epitope against which an immune response is to be generated and/or is directed. Preferably, an antigen in the context of the present invention is a molecule which, optionally after processing, is a target for an immune reaction, which is preferably specific for the antigen or cells expressing the antigen, preferably on the cell surface. The term “antigen” includes in particular proteins and peptides. An antigen is preferably a product which corresponds to or is derived from a naturally occurring antigen. Such naturally occurring antigens may include or may be derived from viruses, bacteria, fungi, parasites and other infectious agents and pathogens or an antigen may also be a tumor antigen. According to the present invention, an antigen may correspond to a naturally occurring product, for example, a viral protein, or a part thereof.

“Cell surface” is used in accordance with its normal meaning in the art, and thus includes the outside of the cell which is accessible to binding by proteins and other molecules. An antigen is expressed on the surface of cells if it is located at the surface of said cells and is accessible to binding by antigen-binding molecules such as antigen-specific antibodies added to the cells. In one embodiment, an antigen expressed on the surface of cells is an integral membrane protein having an extracellular portion. An antigen receptor (including T cell receptors and artificial T cell receptors) is expressed on the surface of cells if it is located at the surface of said cells and is available for binding to its target added to the cells. In one embodiment, an antigen receptor expressed on the surface of cells is an integral membrane protein having an extracellular portion recognizing a target.

The term “extracellular portion” or “ectodomain” in the context of the present invention refers to a part of a molecule such as a protein that is facing the extracellular space of a cell and preferably is accessible from the outside of said cell, e.g., by binding molecules such as antibodies located outside the cell. Preferably, the term refers to one or more extracellular loops or domains or a fragment thereof.

“Target” shall mean an agent such as a cell which is a target for an immune response such as a cellular immune response. Target cells include any undesirable cell such as a cancer cell or an infected cell. In preferred embodiments, the target cell is a cell expressing a target antigen, in particular a disease-specific antigen, which preferably is present on the cell surface or presented in the context of MHC molecules.

The term “epitope” refers to an antigenic determinant in a molecule such as an antigen, i.e., to a part in or fragment of the molecule that is recognized, i.e. bound, by the immune system, for example, that is recognized by an antibody or antigen receptor. For example, epitopes are the discrete, three-dimensional sites on an antigen, which are recognized by the immune system. Epitopes usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and usually have specific three dimensional structural characteristics, as well as specific charge characteristics. Conformational and non-conformational epitopes are distinguished in that the binding to the former but not the latter is lost in the presence of denaturing solvents. An epitope of a protein such as a tumor antigen preferably comprises a continuous or discontinuous portion of said protein and is preferably between 5 and 100, preferably between 5 and 50, more preferably between 8 and 30, most preferably between 10 and 25 amino acids in length, for example, the epitope may be preferably 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acids in length.

In one embodiment of the invention, an epitope is a T cell epitope. A T cell epitope is a portion of an antigen produced by antigen processing that is recognized (i.e., specifically bound) by a T cell receptor, in particular if presented in the context of MHC molecules (MHC class I or class II molecules), and thus is a MHC binding peptide.

“Antigen processing” refers to the degradation of an antigen into procession products, which are fragments of said antigen (e.g., the degradation of a protein into peptides) and the association of one or more of these fragments (e.g., via binding) with MHC molecules for presentation by cells, preferably antigen presenting cells to specific T cells.

An antigen-presenting cell (APC) is a cell that displays antigen in the context of major histocompatibility complex (MHC) on its surface. T cells may recognize this complex using their T cell receptor (TCR). Antigen-presenting cells process antigens and present them to T cells. According to the invention, the term “antigen-presenting cell” includes professional antigen-presenting cells and non-professional antigen-presenting cells.

Professional antigen-presenting cells are very efficient at internalizing antigen, either by phagocytosis or by receptor-mediated endocytosis, and then displaying a fragment of the antigen, bound to a class II MHC molecule, on their membrane. The T cell recognizes and interacts with the antigen-class II MHC molecule complex on the membrane of the antigen-presenting cell. An additional co-stimulatory signal is then produced by the antigen-presenting cell, leading to activation of the T cell. The expression of co-stimulatory molecules is a defining feature of professional antigen-presenting cells. The main types of professional antigen-presenting cells are dendritic cells, which have the broadest range of antigen presentation, and are probably the most important antigen-presenting cells, macrophages, B-cells, and certain activated epithelial cells.

Non-professional antigen-presenting cells do not constitutively express the MHC class II proteins required for interaction with naive T cells; these are expressed only upon stimulation of the non-professional antigen-presenting cells by certain cytokines such as IFNγ.

Dendritic cells (DCs) are leukocyte populations that present antigens captured in peripheral tissues to T cells via both MHC class II and I antigen presentation pathways. It is well known that dendritic cells are potent inducers of immune responses and the activation of these cells is a critical step for the induction of antitumoral immunity. Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNFα to cultures of monocytes harvested from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFα, CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, maturation and proliferation of dendritic cells. Dendritic cells are conveniently categorized as “immature” and “mature” cells, which can be used as a simple way to discriminate between two well characterized phenotypes. However, this nomenclature should not be construed to exclude all possible intermediate stages of differentiation. Immature dendritic cells are characterized as antigen presenting cells with a high capacity for antigen uptake and processing, which correlates with the high expression of Fcγ receptor and mannose receptor. The mature phenotype is typically characterized by a lower expression of these markers, but a high expression of cell surface molecules responsible for T cell activation such as class I and class II MHC, adhesion molecules (e. g. CD54 and CD11) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-1 BB). Dendritic cell maturation is referred to as the status of dendritic cell activation at which such antigen-presenting dendritic cells lead to T cell priming, while presentation by immature dendritic cells results in tolerance. Dendritic cell maturation is chiefly caused by biomolecules with microbial features detected by innate receptors (bacterial DNA, viral RNA, endotoxin, etc.), pro-inflammatory cytokines (TNF, IL-1, IFNs), ligation of CD40 on the dendritic cell surface by CD40L, and substances released from cells undergoing stressful cell death. The dendritic cells can be derived by culturing bone marrow cells in vitro with cytokines, such as granulocyte-macrophage colony-stimulating factor (GM-CSF) and tumor necrosis factor alpha.

T cells belong to a group of white blood cells known as lymphocytes, and play a central role in cell-mediated immunity. They can be distinguished from other lymphocyte types, such as B cells and natural killer cells by the presence of a special receptor on their cell surface called T cell receptors (TCR). The thymus is the principal organ responsible for the maturation of T cells. Several different subsets of T cells have been discovered, each with a distinct function.

T helper cells assist other white blood cells in immunologic processes, including maturation of B cells into plasma cells and activation of cytotoxic T cells and macrophages, among other functions. These cells are also known as CD4+ T cells because they express the CD4 protein on their surface. Helper T cells become activated when they are presented with peptide antigens by MHC class II molecules that are expressed on the surface of antigen presenting cells (APCs). Once activated, they divide rapidly and secrete small proteins called cytokines that regulate or assist in the active immune response.

Cytotoxic T cells destroy virally infected cells and tumor cells, and are also implicated in transplant rejection. These cells are also known as CD8+ T cells since they express the CD8 glycoprotein at their surface. These cells recognize their targets by binding to antigen associated with MHC class I, which is present on the surface of nearly every cell of the body.

A majority of T cells have a T cell receptor (TCR) existing as a complex of several proteins. The actual T cell receptor is composed of two separate peptide chains, which are produced from the independent T cell receptor alpha and beta (TCRα and TCRβ) genes and are called α- and β-TCR chains. γδ T cells (gamma delta T cells) represent a small subset of T cells that possess a distinct T cell receptor (TCR) on their surface. However, in γδ T cells, the TCR is made up of one γ-chain and one δ-chain. This group of T cells is much less common (2% of total T cells) than the αβ T cells.

Each chain of a T cell receptor is composed of two extracellular domains: variable (V) region and a constant (C) region. The constant region is proximal to the cell membrane, followed by a transmembrane region and a short cytoplasmic tail, while the variable region binds to the peptide/MHC complex. For the purpose of the present invention, the term “constant region of a T cell receptor chain or a portion thereof” also includes embodiments wherein the constant region of a T cell receptor chain is (from N terminus to C terminus) followed by a transmembrane region and a cytoplasmic tail, such as a transmembrane region and a cytoplasmic tail which are naturally linked to the constant region of a T cell receptor chain.

All T cells originate from hematopoietic stem cells in the bone marrow. Hematopoietic progenitors derived from hematopoietic stem cells populate the thymus and expand by cell division to generate a large population of immature thymocytes. The earliest thymocytes express neither CD4 nor CD8, and are therefore classed as double-negative (CD4−CD8−) cells. As they progress through their development they become double-positive thymocytes (CD4+CD8+), and finally mature to single-positive (CD4+CD8− or CD4−CD8+) thymocytes that are then released from the thymus to peripheral tissues.

The first signal in activation of T cells is provided by binding of the T cell receptor to a short peptide presented by the major histocompatibility complex (MHC) on another cell. This ensures that only a T cell with a TCR specific to that peptide is activated. The partner cell is usually a professional antigen presenting cell (APC), usually a dendritic cell in the case of naïve responses, although B cells and macrophages can be important APCs. The peptides presented to CD8+ T cells by MHC class I molecules are 8-10 amino acids in length; the peptides presented to CD4+ T cells by MHC class II molecules are longer, as the ends of the binding cleft of the MHC class II molecule are open.

Specific activation of CD4+ or CD8+ T cells may be detected in a variety of ways. Methods for detecting specific T cell activation include detecting the proliferation of T cells, the production of cytokines (e.g., lymphokines), or the generation of cytolytic activity. For CD4+ T cells, a preferred method for detecting specific T cell activation is the detection of the proliferation of T cells. For CD8+ T cells, a preferred method for detecting specific T cell activation is the detection of the generation of cytolytic activity.

T cells may generally be prepared in vitro or ex vivo, using standard procedures. For example, T cells may be isolated from bone marrow, peripheral blood or a fraction of bone marrow or peripheral blood of a mammal, such as a patient, using a commercially available cell separation system. Alternatively, T cells may be derived from related or unrelated humans, non-human animals, cell lines or cultures. A sample comprising T cells may, for example, be peripheral blood mononuclear cells (PBMC).

The nucleic acids encoding α- and β-chains of a T cell receptor may be contained on separate nucleic acid molecules, i.e., RNA replicons, or alternatively, on a single nucleic acid molecule. Accordingly, expression of a T cell receptor in a cell requires co-transfection of separate nucleic acid molecules encoding the different T cell receptor chains or transfection of only one type of nucleic acid molecule encoding the different T cell receptor chains.

The term “major histocompatibility complex” and the abbreviation “MHC” include MHC class I and MHC class II molecules and relate to a complex of genes which occurs in all vertebrates. MHC proteins or molecules are important for signaling between lymphocytes and antigen presenting cells or diseased cells in immune reactions, wherein the MHC proteins or molecules bind peptides and present them for recognition by T cell receptors. The proteins encoded by the MHC are expressed on the surface of cells, and display both self antigens (peptide fragments from the cell itself) and nonself antigens (e.g., fragments of invading microorganisms) to a T cell.

The MHC region is divided into three subgroups, class I, class II, and class III. MHC class I proteins contain an α-chain and β2-microglobulin (not part of the MHC encoded by chromosome 15). They present antigen fragments to cytotoxic T cells. On most immune system cells, specifically on antigen-presenting cells, MHC class II proteins contain α- and β-chains and they present antigen fragments to T-helper cells. MHC class III region encodes for other immune components, such as complement components and some that encode cytokines.

In humans, genes in the MHC region that encode antigen-presenting proteins on the cell surface are referred to as human leukocyte antigen (HLA) genes. However the abbreviation MHC is often used to refer to HLA gene products. In one preferred embodiment of all aspects of the invention an MHC molecule is an HLA molecule.

Engineered receptors have been produced, which confer an arbitrary specificity such as the specificity of a monoclonal antibody onto an immune effector cell such as a T cell. In this way, a large number of antigen-specific T cells can be generated for adoptive cell transfer. Such engineered antigen receptors according to the invention may be present on T cells, e.g. instead of or in addition to the T cell's own T cell receptor, and such T cells do not necessarily require processing and presentation of an antigen for recognition of the target cell but rather may recognize preferably with specificity any antigen present on a target cell. According to the invention, the term “antigen receptor” includes artificial receptors comprising a single molecule or a complex of molecules which recognize, i.e. bind to, a target structure (e.g. an antigen) on a target cell such as a cancer cell (e.g. by binding of an antigen binding site or antigen binding domain to an antigen expressed on the surface of the target cell) and may confer specificity onto an immune effector cell such as a T cell expressing said antigen receptor on the cell surface. Preferably, recognition of the target structure by an antigen receptor results in activation of an immune effector cell expressing said antigen receptor. An antigen receptor may comprise one or more protein units said protein units comprising one or more domains as described herein. In one embodiment, a single-chain variable fragment (scFv) derived from a monoclonal antibody is fused to CD3-zeta transmembrane and endodomain. Such molecules result in the transmission of a zeta signal in response to recognition by the scFv of its antigen target on a target cell and killing of the target cell that expresses the target antigen.

According to the invention the term “artificial receptor” or “artificial T cell receptor” is preferably synonymous with the terms “chimeric antigen receptor (CAR)” and “chimeric T cell receptor”.

According to the invention, an artificial T cell receptor may generally comprise an antigen binding domain and a T cell signaling domain.

The binding domain or antigen binding domain recognizes and binds antigen. In one embodiment, the antigen binding domain is comprised by an exodomain of an artificial T cell receptor. According to the invention, antigen can be recognized by an antigen receptor through any antigen binding domains able to form an antigen binding site such as through antigen-binding portions of antibodies and T cell receptors which may reside on different peptide chains. In one embodiment, the two domains forming an antigen binding site are derived from an immunoglobulin. In another embodiment, the two domains forming an antigen binding site are derived from a T cell receptor. Particularly preferred are antibody variable domains, such as single-chain variable fragments (scFv) derived from monoclonal antibodies and T cell receptor variable domains, in particular TCR alpha and beta single chains. In fact almost anything that binds a given target with high affinity can be used as an antigen binding domain. In one embodiment, the antigen binding domain comprises a single-chain variable fragment (scFv) of an antibody to the antigen. In one embodiment, the antigen binding domain comprises a variable region of a heavy chain of an immunoglobulin (VH) with a specificity for the antigen (VH(antigen)) and a variable region of a light chain of an immunoglobulin (VL) with a specificity for the antigen (VL(antigen)). In one embodiment, said heavy chain variable region (VH) and the corresponding light chain variable region (VL) are connected via a peptide linker, preferably a peptide linker comprising the amino acid sequence (GGGGS)3.

The activation signaling domain (or T cell signaling domain) serves to activate cytotoxic lymphocytes upon binding of the artificial T cell receptor to antigen. The identity of the activation signaling domain is limited only in that it has the ability to induce activation of the selected cytotoxic lymphocyte upon binding of the antigen by the artificial T cell receptor. Suitable activation signaling domains include the T cell CD3[zeta] chain and Fc receptor [gamma]. The skilled artisan will understand that sequence variants of these noted activation signaling domains can be used without adversely impacting the invention, where the variants have the same or similar activity as the domain on which they are modeled. Such variants will have at least about 80% sequence identity to the amino acid sequence of the domain from which they are derived. In one embodiment, the T cell signaling domain is located intracellularly. In one embodiment, the T cell signaling domain comprises CD3-zeta, preferably the endodomain of CD3-zeta, optionally in combination with CD28.

An artificial T cell receptor may further comprise a co-stimulation domain. The co-stimulation domain serves to enhance the proliferation and survival of the cytotoxic lymphocytes upon binding of the artificial T cell receptor to a targeted moiety. The identity of the co-stimulation domain is limited only in that it has the ability to enhance cellular proliferation and survival upon binding of the targeted moiety by the artificial T cell receptor. Suitable co-stimulation domains include CD28, CD137 (4-1BB), a member of the tumor necrosis factor (TNF) receptor family, CD134 (OX40), a member of the TNFR-superfamily of receptors, and CD278 (ICOS), a CD28-superfamily co-stimulatory molecule expressed on activated T cells. The skilled person will understand that sequence variants of these noted co-stimulation domains can be used without adversely impacting the invention, where the variants have the same or similar activity as the domain on which they are modeled. Such variants will have at least about 80% sequence identity to the amino acid sequence of the domain from which they are derived. In some embodiments of the invention, the artificial T cell receptor constructs comprise two co-stimulation domains. While the particular combinations include all possible variations of the four noted domains, specific examples include CD28+CD137 (4-1BB) and CD28+CD134 (OX40).

Following antigen recognition, receptors cluster and a signal is transmitted to the cell. In this respect, a “T cell signaling domain” is a domain, preferably an endodomain, which transmits an activation signal to the T cell after antigen is bound. The most commonly used endodomain component is CD3-zeta.

The artificial T cell receptors of the present invention may comprise the domains, together in the form of a fusion protein. Such fusion proteins will generally comprise a binding domain, one or more co-stimulation domains, and an activation signaling domain, linked in a N-terminal to C-terminal direction. However, the artificial T cell receptors of the present invention are not limited to this arrangement and other arrangements are acceptable and include a binding domain, an activation signaling domain, and one or more co-stimulation domains. It will be understood that because the binding domain must be free to bind antigen, the placement of the binding domain in the fusion protein will generally be such that display of the region on the exterior of the cell is achieved. In the same manner, because the co-stimulation and activation signaling domains serve to induce activity and proliferation of the cytotoxic lymphocytes, the fusion protein will generally display these two domains in the interior of the cell. The artificial T cell receptors may include additional elements, such as a signal peptide to ensure proper export of the fusion protein to the cells surface, a transmembrane domain to ensure the fusion protein is maintained as an integral membrane protein, and a hinge domain (or spacer region) that imparts flexibility to the binding domain and allows strong binding to antigen. Preferably, a signal sequence or signal peptide is a sequence or peptide that allows for sufficient passage through the secretory pathway and expression on the cell surface such that an antigen receptor, for example, may bind an antigen present in the extracellular environment. Preferably, the signal sequence or signal peptide is cleavable and is removed from the mature peptide chains. The signal sequence or signal peptide preferably is chosen with respect to the cell or organism wherein the peptide chains are produced in. In one embodiment, the signal peptide precedes the antigen binding domain. In one embodiment, the transmembrane domain is a hydrophobic alpha helix that spans the membrane. In one embodiment, the transmembrane domain comprises the CD28 transmembrane domain or a fragment thereof. In one embodiment of all aspects of the invention, an artificial T cell receptor comprises a spacer region which links the antigen binding domain to the transmembrane domain. In one embodiment, the spacer region allows the antigen binding domain to orient in different directions to facilitate antigen recognition. In one embodiment, the spacer region comprises the hinge region from IgG1.

In one embodiment of all aspects of the invention, an artificial T cell receptor comprises the structure:

embedded image

In one embodiment of all aspects of the invention, an artificial T cell receptor is preferably specific for the antigen to which it is targeted.

In one embodiment of all aspects of the invention, an artificial T cell receptor may be expressed by and/or present on the surface of a T cell, preferably a cytotoxic T cell. In one embodiment, the T cell when bound to antigen is reactive.

Adoptive cell transfer therapy with engineered T cells expressing artificial T cell receptors is a promising therapeutic as artificial T cell receptor-modified T cells can be engineered to target virtually any antigen. For example, patient's T cells may be genetically engineered (genetically modified) to express artificial T cell receptors specifically directed towards antigens on the patient's diseased cells, and then infused back into the patient.

According to the invention an artificial T cell receptor may replace the function of a T cell receptor and, in particular, may confer reactivity such as cytolytic activity to a cell such as a T cell. However, in contrast to the binding of the T cell receptor to an antigen peptide-MHC complex, an artificial T cell receptor may bind to an antigen, in particular when expressed on the cell surface.

The T cell surface glycoprotein CD3-zeta chain is a protein that in humans is encoded by the CD247 gene. CD3-zeta together with T cell receptor alpha/beta and gamma/delta heterodimers and CD3-gamma, -delta, and -epsilon, forms the T cell receptor-CD3 complex. The zeta chain plays an important role in coupling antigen recognition to several intracellular signal-transduction pathways. The term “CD3-zeta” preferably relates to human CD3-zeta.

CD28 (Cluster of Differentiation 28) is one of the molecules expressed on T cells that provide co-stimulatory signals, which are required for T cell activation. CD28 is the receptor for CD80 (B7.1) and CD86 (B7.2). Stimulation through CD28 in addition to the T cell receptor (TCR) can provide a potent co-stimulatory signal to T cells for the production of various interleukins (IL-6 in particular). The term “CD28” preferably relates to human CD28.

The term “immunoglobulin” relates to proteins of the immunoglobulin superfamily, preferably to antigen receptors such as antibodies or the B cell receptor (BCR). The immunoglobulins are characterized by a structural domain, i.e., the immunoglobulin domain, having a characteristic immunoglobulin (Ig) fold. The term encompasses membrane bound immunoglobulins as well as soluble immunoglobulins. Membrane bound immunoglobulins are also termed surface immunoglobulins or membrane immunoglobulins, which are generally part of the BCR. Soluble immunoglobulins are generally termed antibodies. Immunoglobulins generally comprise several chains, typically two identical heavy chains and two identical light chains which are linked via disulfide bonds. These chains are primarily composed of immunoglobulin domains, such as the V_L(variable light chain) domain, C_L(constant light chain) domain, and the C_H(constant heavy chain) domains C_H1, C_H2, C_H3, and C_H4. There are five types of mammalian immunoglobulin heavy chains, i.e., α, δ, ε, γ, and μ which account for the different classes of antibodies, i.e., IgA, IgD, IgE, IgG, and IgM. As opposed to the heavy chains of soluble immunoglobulins, the heavy chains of membrane or surface immunoglobulins comprise a transmembrane domain and a short cytoplasmic domain at their carboxy-terminus. In mammals there are two types of light chains, i.e., lambda and kappa. The immunoglobulin chains comprise a variable region and a constant region. The constant region is essentially conserved within the different isotypes of the immunoglobulins, wherein the variable part is highly divers and accounts for antigen recognition.

The term “antibody” refers to a glycoprotein comprising at least two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds. The term “antibody” includes monoclonal antibodies, recombinant antibodies, human antibodies, humanized antibodies and chimeric antibodies. Each heavy chain is comprised of a heavy chain variable region (abbreviated herein as VH) and a heavy chain constant region. Each light chain is comprised of a light chain variable region (abbreviated herein as VL) and a light chain constant region. The VH and VL regions can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each VH and VL is composed of three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. The variable regions of the heavy and light chains contain a binding domain that interacts with an antigen. The constant regions of the antibodies may mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g., effector cells) and the first component (Clq) of the classical complement system.

The term “monoclonal antibody” as used herein refers to a preparation of antibody molecules of single molecular composition. A monoclonal antibody displays a single binding specificity and affinity. In one embodiment, the monoclonal antibodies are produced by a hybridoma which includes a B cell obtained from a non-human animal, e.g., mouse, fused to an immortalized cell.

Antibodies may be derived from different species, including but not limited to mouse, rat, rabbit, guinea pig and human.

Antibodies described herein include IgA such as IgA1 or IgA2, IgG1, IgG2, IgG3, IgG4, IgE, IgM, and IgD antibodies. In various embodiments, the antibody is an IgG1 antibody, more particularly an IgG1, kappa or IgG1, lambda isotype (i.e. IgG1, κ, λ), an IgG2a antibody (e.g. IgG2a, κ, λ), an IgG2b antibody (e.g. IgG2b, κ, λ), an IgG3 antibody (e.g. IgG3, κ, λ) or an IgG4 antibody (e.g. IgG4, κ, λ).

The artificial T cell receptors described herein may comprise antigen-binding portions of one or more antibodies. The terms “antigen-binding portion” of an antibody (or simply “binding portion”) or “antigen-binding fragment” of an antibody (or simply “binding fragment”) or similar terms refer to one or more fragments of an antibody that retain the ability to specifically bind to an antigen. It has been shown that the antigen-binding function of an antibody can be performed by fragments of a full-length antibody. Examples of binding fragments encompassed within the term “antigen-binding portion” of an antibody include (i) Fab fragments, monovalent fragments consisting of the VL, VH, CL and CH domains; (ii) F(ab′)₂fragments, bivalent fragments comprising two Fab fragments linked by a disulfide bridge at the hinge region; (iii) Fd fragments consisting of the VH and CH domains; (iv) Fv fragments consisting of the VL and VH domains of a single arm of an antibody, (v) dAb fragments (Ward et al., (1989) Nature 341: 544-546), which consist of a VH domain; (vi) isolated complementarity determining regions (CDR), and (vii) combinations of two or more isolated CDRs which may optionally be joined by a synthetic linker. Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by separate genes, they can be joined, using recombinant methods, by a synthetic linker that enables them to be made as a single protein chain in which the VL and VH regions pair to form monovalent molecules (known as single chain Fv (scFv); see e.g., Bird et al. (1988) Science 242: 423-426; and Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85: 5879-5883). Such single chain antibodies are also intended to be encompassed within the term “antigen-binding fragment” of an antibody. A further example is binding-domain immunoglobulin fusion proteins comprising (i) a binding domain polypeptide that is fused to an immunoglobulin hinge region polypeptide, (ii) an immunoglobulin heavy chain CH2 constant region fused to the hinge region, and (iii) an immunoglobulin heavy chain CH3 constant region fused to the CH2 constant region. The binding domain polypeptide can be a heavy chain variable region or a light chain variable region. The binding-domain immunoglobulin fusion proteins are further disclosed in US 2003/0118592 and US 2003/0133939. These antibody fragments are obtained using conventional techniques known to those with skill in the art, and the fragments are screened for utility in the same manner as are intact antibodies.

A single-chain variable fragment (scFv) is a fusion protein of the variable regions of the heavy (VH) and light chains (VL) of immunoglobulins, connected with a linker peptide. The linker can either connect the N-terminus of the VH with the C-terminus of the VL, or vice versa. Divalent (or bivalent) single-chain variable fragments (di-scFvs, bi-scFvs) can be engineered by linking two scFvs. This can be done by producing a single peptide chain with two VH and two VL regions, yielding tandem scFvs.

The term “binding domain” characterizes in connection with the present invention a structure, e.g. of an antibody, which binds to/interacts with a given target structure/antigen/epitope, optionally when interacting with another domain. Thus, these domains according to the invention designate an “antigen binding site”.

Antibodies and derivatives of antibodies are useful for providing binding domains such as antibody fragments, in particular for providing VL and VH regions.

Binding domains for an antigen which may be present within an antigen receptor have the ability of binding to (targeting) an antigen, i.e. the ability of binding to (targeting) an epitope present in an antigen, preferably an epitope located within the extracellular domain of an antigen. Preferably, binding domains for an antigen are specific for the antigen. Preferably, binding domains for an antigen bind to the antigen expressed on the cell surface. In particular preferred embodiments, binding domains for an antigen bind to native epitopes of an antigen present on the surface of living cells.

Antibodies can be produced by a variety of techniques, including conventional monoclonal antibody methodology, e.g., the standard somatic cell hybridization technique of Kohler and Milstein, Nature 256: 495 (1975). Although somatic cell hybridization procedures are preferred, in principle, other techniques for producing monoclonal antibodies can be employed, e.g., viral or oncogenic transformation of B-lymphocytes or phage display techniques using libraries of antibody genes.

The preferred animal system for preparing hybridomas that secrete monoclonal antibodies is the murine system. Hybridoma production in the mouse is a very well established procedure. Immunization protocols and techniques for isolation of immunized splenocytes for fusion are known in the art. Fusion partners (e.g., murine myeloma cells) and fusion procedures are also known.

Other preferred animal systems for preparing hybridomas that secrete monoclonal antibodies are the rat and the rabbit system (e.g. described in Spieker-Polet et al., Proc. Natl. Acad. Sci. U.S.A. 92:9348 (1995), see also Rossi et al., Am. J. Clin. Pathol. 124: 295 (2005)).

To generate antibodies, mice can be immunized with carrier-conjugated peptides derived from the antigen sequence, i.e. the sequence against which the antibodies are to be directed, an enriched preparation of recombinantly expressed antigen or fragments thereof and/or cells expressing the antigen, as described. Alternatively, mice can be immunized with DNA encoding the antigen or fragments thereof. In the event that immunizations using a purified or enriched preparation of the antigen do not result in antibodies, mice can also be immunized with cells expressing the antigen, e.g., a cell line, to promote immune responses.

The immune response can be monitored over the course of the immunization protocol with plasma and serum samples being obtained by tail vein or retroorbital bleeds. Mice with sufficient titers of immunoglobulin can be used for fusions. Mice can be boosted intraperitonealy or intravenously with antigen expressing cells 3 days before sacrifice and removal of the spleen to increase the rate of specific antibody secreting hybridomas.

To generate hybridomas producing monoclonal antibodies, splenocytes and lymph node cells from immunized mice can be isolated and fused to an appropriate immortalized cell line, such as a mouse myeloma cell line. The resulting hybridomas can then be screened for the production of antigen-specific antibodies. Individual wells can then be screened by ELISA for antibody secreting hybridomas. By Immunofluorescence and FACS analysis using antigen expressing cells, antibodies with specificity for the antigen can be identified. The antibody secreting hybridomas can be replated, screened again, and if still positive for monoclonal antibodies can be subcloned by limiting dilution. The stable subclones can then be cultured in vitro to generate antibody in tissue culture medium for characterization.

The ability of antibodies and other binding agents to bind an antigen can be determined using standard binding assays (e.g., ELISA, Western Blot, Immunofluorescence and flow cytometric analysis).

The term “binding” according to the invention preferably relates to a specific binding.

According to the present invention, an agent such as an antigen receptor is capable of binding to (targeting) a predetermined target if it has a significant affinity for said predetermined target and binds to said predetermined target in standard assays. “Affinity” or “binding affinity” is often measured by equilibrium dissociation constant (K_D). Preferably, the term “significant affinity” refers to the binding to a predetermined target with a dissociation constant (K_D) of 10⁻⁵M or lower, 10⁻⁶M or lower, 10⁻⁷M or lower, 10⁻⁸M or lower, 10⁻⁹M or lower, 10⁻¹⁰M or lower, 10⁻¹¹M or lower, or 10⁻¹²M or lower.

An agent is not (substantially) capable of binding to (targeting) a target if it has no significant affinity for said target and does not bind significantly, in particular does not bind detectably, to said target in standard assays. Preferably, the agent does not detectably bind to said target if present in a concentration of up to 2, preferably 10, more preferably 20, in particular 50 or 100 μg/ml or higher. Preferably, an agent has no significant affinity for a target if it binds to said target with a K_Dthat is at least 10-fold, 100-fold, 10³-fold, 10⁴-fold, 10⁵-fold, or 10⁶-fold higher than the K_Dfor binding to the predetermined target to which the agent is capable of binding. For example, if the K_Dfor binding of an agent to the target to which the agent is capable of binding is 10⁻⁷M, the K_Dfor binding to a target for which the agent has no significant affinity would be at least 10⁻⁶M, 10⁻⁵M, 10⁻⁴M, 10⁻³M, 10⁻²M, or 10⁻¹M.

An agent is specific for a predetermined target if it is capable of binding to said predetermined target while it is not (substantially) capable of binding to other targets, i.e. has no significant affinity for other targets and does not significantly bind to other targets in standard assays. Preferably, an agent is specific for a predetermined target if the affinity for and the binding to such other targets does not significantly exceed the affinity for or binding to proteins which are unrelated to a predetermined target such as bovine serum albumin (BSA), casein or human serum albumin (HSA). Preferably, an agent is specific for a predetermined target if it binds to said target with a K_Dthat is at least 10-fold, 100-fold, 10³-fold, 10⁴-fold, 10⁵-fold, or 10⁶-fold lower than the K_Dfor binding to a target for which it is not specific. For example, if the K_Dfor binding of an agent to the target for which it is specific is 10⁻⁷M, the K_Dfor binding to a target for which it is not specific would be at least 10⁻⁶M, 10⁻⁵M, 10⁻⁴M, 10⁻³M, 10⁻²M, or 10⁻¹M.

Binding of an agent to a target can be determined experimentally using any suitable method; see, for example, Berzofsky et al., “Antibody-Antigen Interactions” In Fundamental Immunology, Paul, W. E., Ed., Raven Press New York, NY (1984), Kuby, Janis Immunology, W. H. Freeman and Company New York, NY (1992), and methods described herein. Affinities may be readily determined using conventional techniques, such as by equilibrium dialysis; by using the BIAcore 2000 instrument, using general procedures outlined by the manufacturer; by radioimmunoassay using radiolabeled target antigen; or by another method known to the skilled artisan. The affinity data may be analyzed, for example, by the method of Scatchard et al., Ann N.Y. Acad. ScL, 51:660 (1949). The measured affinity of a particular antibody-antigen interaction can vary if measured under different conditions, e.g., salt concentration, pH. Thus, measurements of affinity and other antigen-binding parameters, e.g., K_D, IC₅₀, are preferably made with standardized solutions of antibody and antigen, and a standardized buffer.

At Least One Open Reading Frame Comprised by the Replicon

The RNA replicon according to the present invention comprises as an open reading frame encoding a peptide of interest or a protein of interest an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor and may comprise one or more further open reading frames encoding a peptide of interest or a protein of interest such as a further chain of a T cell receptor or of an artificial T cell receptor forming together with the first chain of a T cell receptor or of an artificial T cell receptor a functional T cell receptor or artificial T cell receptor. Preferably, the protein of interest is encoded by a heterologous nucleic acid sequence. The gene encoding the peptide or protein of interest is synonymously termed “gene of interest” or “transgene”. In various embodiments, the peptide or protein of interest is encoded by a heterologous nucleic acid sequence. According to the present invention, the term “heterologous” refers to the fact that a nucleic acid sequence is not naturally functionally or structurally linked to an alphavirus nucleic acid sequence. The replicon according to the present invention may encode a single polypeptide, i.e., a chain of a T cell receptor or of an artificial T cell receptor, or multiple polypeptides such as multiple chains of a T cell receptor or of an artificial T cell receptor or a chain of a T cell receptor or of an artificial T cell receptor and another polypeptide. Multiple polypeptides can be encoded as a single polypeptide (fusion polypeptide) or as separate polypeptides. In some embodiments, the replicon according to the present invention may comprise more than one open reading frames, each of which may independently be selected to be under the control of a subgenomic promoter or not. Alternatively, a poly-protein or fusion polypeptide comprises individual polypeptides separated by an optionally autocatalytic protease cleavage site (e.g. foot-and-mouth disease virus 2A protein), or an intein.

Proteins of interest may e.g. be selected from the group consisting of reporter proteins, pharmaceutically active peptides or proteins, inhibitors of intracellular interferon (IFN) signaling, and functional alphavirus non-structural protein.

Functional Alphavirus Non-Structural Protein

A further suitable protein of interest encoded by an open reading frame is functional alphavirus non-structural protein. The term “alphavirus non-structural protein” includes each and every co- or post-translationally modified form, including carbohydrate-modified (such as glycosylated) and lipid-modified forms of alphavirus non-structural protein.

In some embodiments, the term “alphavirus non-structural protein” refers to any one or more of individual non-structural proteins of alphavirus origin (nsP1, nsP2, nsP3, nsP4), or to a poly-protein comprising the polypeptide sequence of more than one non-structural protein of alphavirus origin. In some embodiments, “alphavirus non-structural protein” refers to nsP123 and/or to nsP4. In other embodiments, “alphavirus non-structural protein” refers to nsP1234. In one embodiment, the protein of interest encoded by an open reading frame consists of all of nsP1, nsP2, nsP3 and nsP4 as one single, optionally cleavable poly-protein: nsP1234. In one embodiment, the protein of interest encoded by an open reading frame consists of nsP1, nsP2 and nsP3 as one single, optionally cleavable polyprotein: nsP123. In that embodiment, nsP4 may be a further protein of interest and may be encoded by a further open reading frame.

In some embodiments, alphavirus non-structural protein is capable of forming a complex or association, e.g. in a host cell. In some embodiments, “alphavirus non-structural protein” refers to a complex or association of nsP123 (synonymously P123) and nsP4. In some embodiments, “alphavirus non-structural protein” refers to a complex or association of nsP1, nsP2, and nsP3. In some embodiments, “alphavirus non-structural protein” refers to a complex or association of nsP1, nsP2, nsP3 and nsP4. In some embodiments, “alphavirus non-structural protein” refers to a complex or association of any one or more selected from the group consisting of nsP1, nsP2, nsP3 and nsP4. In some embodiments, the alphavirus non-structural protein comprises at least nsP4.

The terms “complex” or “association” refer to two or more same or different protein molecules that are in spatial proximity. Proteins of a complex are preferably in direct or indirect physical or physicochemical contact with each other. A complex or association can consist of multiple different proteins (heteromultimer) and/or of multiple copies of one particular protein (homomultimer). In the context of alphavirus non-structural protein, the term “complex or association” describes a multitude of at least two protein molecules, of which at least one is an alphavirus non-structural protein. The complex or association can consist of multiple copies of one particular protein (homomultimer) and/or of multiple different proteins (heteromultimer). In the context of a multimer, “multi” means more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more than ten.

The term “functional alphavirus non-structural protein” includes alphavirus non-structural protein that has replicase function. Thus, “functional alphavirus non-structural protein” includes alphavirus replicase. “Replicase function” comprises the function of an RNA-dependent RNA polymerase (RdRP), i.e. an enzyme which is capable to catalyze the synthesis of (−) strand RNA based on a (+) strand RNA template, and/or which is capable to catalyze the synthesis of (+) strand RNA based on a (−) strand RNA template. Thus, the term “functional alphavirus non-structural protein” can refer to a protein or complex that synthesizes (−) stranded RNA, using the (+) stranded (e.g. genomic) RNA as template, to a protein or complex that synthesizes new (+) stranded RNA, using the (−) stranded complement of genomic RNA as template, and/or to a protein or complex that synthesizes a subgenomic transcript, using a fragment of the (−) stranded complement of genomic RNA as template. The functional alphavirus non-structural protein may additionally have one or more additional functions, such as e.g. a protease (for auto-cleavage), helicase, terminal adenylyltransferase (for poly(A) tail addition), methyltransferase and guanylyltransferase (for providing a nucleic acid with a 5′-cap), nuclear localization sites, triphosphatase (Gould et al., 2010, Antiviral Res., vol. 87 pp. 111-124; Rupp et al., 2015, J. Gen. Virol., vol. 96, pp. 2483-500).

According to the invention, the term “alphavirus replicase” refers to alphaviral RNA-dependent RNA polymerase, including a RNA-dependent RNA polymerase from a naturally occurring alphavirus (alphavirus found in nature) and a RNA-dependent RNA polymerase from a variant or derivative of an alphavirus, such as from an attenuated alphavirus. In the context of the present invention, the terms “replicase” and “alphavirus replicase” are used interchangeably, unless the context dictates that any particular replicase is not an alphavirus replicase.

The term “replicase” comprises all variants, in particular post-translationally modified variants, conformations, isoforms and homologs of alphavirus replicase, which are expressed by alphavirus-infected cells or which are expressed by cells that have been transfected with a nucleic acid that codes for alphavirus replicase. Moreover, the term “replicase” comprises all forms of replicase that have been produced and can be produced by recombinant methods. For example, a replicase comprising a tag that facilitates detection and/or purification of the replicase in the laboratory, e.g. a myc-tag, a HA-tag or an oligohistidine tag (His-tag) may be produced by recombinant methods.

Optionally, the alphavirus replicase is additionally functionally defined by the capacity of binding to any one or more of alphavirus conserved sequence element 1 (CSE 1) or complementary sequence thereof, conserved sequence element 2 (CSE 2) or complementary sequence thereof, conserved sequence element 3 (CSE 3) or complementary sequence thereof, conserved sequence element 4 (CSE 4) or complementary sequence thereof. Preferably, the replicase is capable of binding to CSE 2 [i.e. to the (+) strand] and/or to CSE 4 [i.e. to the (+) strand], or of binding to the complement of CSE 1 [i.e. to the (−) strand] and/or to the complement of CSE 3 [i.e. to the (−) strand].

The origin of the replicase is not limited to any particular alphavirus. In a preferred embodiment, the alphavirus replicase comprises non-structural protein from Semliki Forest virus, including a naturally occurring Semliki Forest virus and a variant or derivative of Semliki Forest virus, such as an attenuated Semliki Forest virus. In an alternative preferred embodiment, the alphavirus replicase comprises non-structural protein from Sindbis virus, including a naturally occurring Sindbis virus and a variant or derivative of Sindbis virus, such as an attenuated Sindbis virus. In an alternative preferred embodiment, the alphavirus replicase comprises non-structural protein from Venezuelan equine encephalitis virus (VEEV), including a naturally occurring VEEV and a variant or derivative of VEEV, such as an attenuated VEEV. In an alternative preferred embodiment, the alphavirus replicase comprises non-structural protein from chikungunya virus (CHIKV), including a naturally occurring CHIKV and a variant or derivative of CHIKV, such as an attenuated CHIKV.

A replicase can also comprise non-structural proteins from more than one alphavirus. Thus, heterologous complexes or associations comprising alphavirus non-structural protein and having replicase function are equally comprised by the present invention. Merely for illustrative purposes, replicase may comprise one or more non-structural proteins (e.g. nsP1, nsP2) from a first alphavirus, and one or more non-structural proteins (nsP3, nsP4) from a second alphavirus. Non-structural proteins from more than one different alphavirus may be encoded by separate open reading frames, or may be encoded by a single open reading frame as poly-protein, e.g. nsP1234.

In some embodiments, functional alphavirus non-structural protein is capable of forming membranous replication complexes and/or vacuoles in cells in which the functional alphavirus non-structural protein is expressed.

If functional alphavirus non-structural protein, i.e. alphavirus non-structural protein with replicase function, is encoded by a nucleic acid molecule according to the present invention, it is preferable that the subgenomic promoter of the replicon, if present, is compatible with said replicase. Compatible in this context means that the alphavirus replicase is capable of recognizing the subgenomic promoter, if present. In one embodiment, this is achieved when the subgenomic promoter is native to the alphavirus from which the replicase is derived, i.e. the natural origin of these sequences is the same alphavirus. In an alternative embodiment, the subgenomic promoter is not native to the alphavirus from which the alphavirus replicase is derived, provided that the alphavirus replicase is capable of recognizing the subgenomic promoter. In other words, the replicase is compatible with the subgenomic promoter (cross-virus compatibility). Examples of cross-virus compatibility concerning subgenomic promoter and replicase originating from different alphaviruses are known in the art. Any combination of subgenomic promoter and replicase is possible as long as cross-virus compatibility exists. Cross-virus compatibility can readily be tested by the skilled person working the present invention by incubating a replicase to be tested together with an RNA, wherein the RNA has a subgenomic promoter to be tested, at conditions suitable for RNA synthesis from the a subgenomic promoter. If a subgenomic transcript is prepared, the subgenomic promoter and the replicase are determined to be compatible. Various examples of cross-virus compatibility are known (reviewed by Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562).

In one embodiment, alphavirus non-structural protein is not encoded as fusion protein with a heterologous protein, e.g. ubiquitin.

In the present invention, an open reading frame encoding functional alphavirus non-structural protein can be provided on the RNA replicon, or alternatively, can be provided as separate nucleic acid molecule, e.g. mRNA molecule. A separate mRNA molecule may optionally comprise e.g. cap, 5′-UTR, 3′-UTR, poly(A) sequence, and/or adaptation of the codon usage. The separate mRNA molecule may be provided in trans, as described herein for the system of the present invention.

When an open reading frame encoding functional alphavirus non-structural protein is provided on the RNA replicon, the replicon can preferably be replicated by the functional alphavirus non-structural protein. In particular, the RNA replicon that encodes functional alphavirus non-structural protein can be replicated by the functional alphavirus non-structural protein encoded by the replicon. This embodiment is strongly preferred when no nucleic acid molecule encoding functional alphavirus non-structural protein is provided in trans. In this embodiment, cis-replication of the replicon is aimed at. In a preferred embodiment, the RNA replicon comprises an open reading frame encoding functional alphavirus non-structural protein as well as a further open reading frame encoding a protein of interest, and can be replicated by the functional alphavirus non-structural protein. This embodiment is particularly suitable in some methods for producing a protein of interest according to the present invention. An example of a respective replicon is illustrated in FIG. 6 (“cisReplicon Δ5ATG-RRS”).

If the replicon comprises an open reading frame encoding functional alphavirus non-structural protein, it is preferable that the open reading frame encoding functional alphavirus non-structural protein does not overlap with the 5′ replication recognition sequence. In one embodiment, the open reading frame encoding functional alphavirus non-structural protein does not overlap with the subgenomic promoter, if present. An example of a respective replicon is illustrated in FIG. 6 (“cisReplicon Δ5ATG-RRS”).

If multiple open reading frames are present on the replicon, then the functional alphavirus non-structural protein may be encoded by any one of them, optionally under control of a subgenomic promoter or not, preferably not under control of a subgenomic promoter. In a preferred embodiment, the functional alphavirus non-structural protein is encoded by the most upstream open reading frame of the RNA replicon. When the functional alphavirus non-structural protein is encoded by the most upstream open reading frame of the RNA replicon, the genetic information encoding functional alphavirus non-structural protein will be translated early after introduction of the RNA replicon into a host cell, and the resulting protein can subsequently drive replication, and optionally production of a subgenomic transcript, in the host cell. An example of a respective replicon is illustrated in FIG. 6 (“cisReplicon Δ5ATG-RRS”).

Presence of an open reading frame encoding functional alphavirus non-structural protein, either comprised by the replicon or comprised by a separate nucleic acid molecule that is provided in trans, allows that the replicon is replicated, and consequently, that a gene of interest encoded by the replicon, optionally under control of a subgenomic promoter, is expressed at high levels. This is associated with a cost advantage compared to other transgene expression systems. Since the replicon of the present invention can be replicated in the presence of functional alphavirus non-structural protein, high levels of expression of a gene of interest may be achieved even if relatively low amounts replicon RNA are administered. The low amounts of replicon RNA positively influence the costs.

Position of the at Least One Open Reading Frame in the RNA Replicon

The RNA replicon is suitable for expression of one or more genes encoding a peptide of interest or a protein of interest, optionally under control of a subgenomic promoter. Various embodiments are possible. One or more open reading frames, each encoding a peptide of interest or a protein of interest, can be present on the RNA replicon. The most upstream open reading frame of the RNA replicon is referred to as “first open reading frame”. In some embodiments, the “first open reading frame” is the only open reading frame of the RNA replicon. Optionally, one or more further open reading frames can be present downstream of the first open reading frame. One or more further open reading frames downstream of the first open reading frame may be referred to as “second open reading frame”, “third open reading frame” and so on, in the order (5′ to 3′) in which they are present downstream of the first open reading frame. Preferably, each open reading frame comprises a start codon (base triplet), typically AUG (in the RNA molecule), corresponding to ATG (in a respective DNA molecule).

If the replicon comprises a 3′ replication recognition sequence, it is preferred that all open reading frames are localized upstream of the 3′ replication recognition sequence.

When the RNA replicon comprising one or more open reading frames is introduced into a host cell, translation is preferably not initiated at any position upstream of the first open reading frame, owing to the removal of at least one initiation codon from the 5′ replication recognition sequence. Therefore, the replicon may serve directly as template for translation of the first open reading frame. Preferably, the replicon comprises a 5′-cap. This is helpful for expression of the gene encoded by the first open reading frame directly from the replicon.

In some embodiments, at least one open reading frame of the replicon is under the control of a subgenomic promoter, preferably an alphavirus subgenomic promoter. The alphavirus subgenomic promoter is very efficient, and is therefore suitable for heterologous gene expression at high levels. Preferably, the subgenomic promoter is a promoter for a subgenomic transcript in an alphavirus. This means that the subgenomic promoter is one which is native to an alphavirus and which preferably controls transcription of the open reading frame encoding one or more structural proteins in said alphavirus. Alternatively, the subgenomic promoter is a variant of a subgenomic promoter of an alphavirus; any variant which functions as promoter for subgenomic RNA transcription in a host cell is suitable. If the replicon comprises a subgenomic promoter, it is preferred that the replicon comprises a conserved sequence element 3 (CSE 3) or a variant thereof.

Preferably, the at least one open reading frame under control of a subgenomic promoter is localized downstream of the subgenomic promoter. Preferably, the subgenomic promoter controls production of subgenomic RNA comprising a transcript of the open reading frame.

In some embodiments the first open reading frame is under control of a subgenomic promoter. When the first open reading frame is under control of a subgenomic promoter, its localization resembles the localization of the open reading frame encoding structural proteins in the genome of an alphavirus. When the first open reading frame is under control of the subgenomic promoter, the gene encoded by the first open reading frame can be expressed both from the replicon as well as from a subgenomic transcript thereof (the latter in the presence of functional alphavirus non-structural protein). A respective embodiment is exemplified by the replicon “Δ5ATG-RRS” in FIG. 6. Preferably “Δ5ATG-RRS” does not comprise any initiation codon in the nucleic acid sequence encoding the C-terminal fragment of nsP4 (*nsP4). One or more further open reading frames, each under control of a subgenomic promoter, may be present downstream of the first open reading frame that is under control of a subgenomic promoter (not illustrated in FIG. 6). The genes encoded by the one or more further open reading frames, e.g. by the second open reading frame, may be translated from one or more subgenomic transcripts, each under control of a subgenomic promoter. For example, the RNA replicon may comprise a subgenomic promoter controlling production of a transcript that encodes a second protein of interest.

In other embodiments the first open reading frame is not under control of a subgenomic promoter. When the first open reading frame is not under control of a subgenomic promoter, the gene encoded by the first open reading frame can be expressed from the replicon. A respective embodiment is exemplified by the replicon “Δ5ATG-RRSASGP” in FIG. 6. One or more further open reading frames, each under control of a subgenomic promoter, may be present downstream of the first open reading frame (for illustration of two exemplary embodiments, see “Δ5ATG-RRS—bicistronic” and “cisReplicon Δ5ATG-RRS” in FIG. 6). The genes encoded by the one or more further open reading frames may be expressed from subgenomic transcripts.

In a cell which comprises the replicon according to the present invention, the replicon may be amplified by functional alphavirus non-structural protein. Additionally, if the replicon comprises one or more open reading frames under control of a subgenomic promoter, one or more subgenomic transcripts are expected to be prepared by functional alphavirus non-structural protein. Functional alphavirus non-structural protein may be provided in trans, or may be encoded by an open reading frame of the replicon.

If a replicon comprises more than one open reading frame encoding a protein of interest, it is preferable that each open reading frame encodes a different protein. For example, the protein encoded by the second open reading frame is different from the protein encoded by the first open reading frame.

In some embodiments, the protein of interest encoded by the first and/or a further open reading frame, preferably by the first open reading frame, is functional alphavirus non-structural protein. In some embodiments, the protein of interest encoded by the first and/or a further open reading frame, e.g. by the second open reading frame, is a chain of a T cell receptor or of an artificial T cell receptor.

In one embodiment, the protein of interest encoded by the first open reading frame is functional alphavirus non-structural protein. In that embodiment the replicon preferably comprises a 5′-cap. Particularly when the protein of interest encoded by the first open reading frame is functional alphavirus non-structural protein, and preferably when the replicon comprises a 5′-cap, the nucleic acid sequence encoding functional alphavirus non-structural protein can be efficiently translated from the replicon, and the resulting protein can subsequently drive replication of the replicon and drive synthesis of subgenomic transcript(s). This embodiment may be preferred when no additional nucleic acid molecule encoding functional alphavirus non-structural protein is used or present together with the replicon. In this embodiment, cis-replication of the replicon is aimed at.

One embodiment wherein the first open reading frame encodes functional alphavirus non-structural protein is illustrated by “cisReplicon Δ5ATG-RRS” in FIG. 6. Following translation of the nucleic acid sequence encoding nsP1234, the translation product (nsP1234 or fragment(s) thereof) can act as replicase and drive RNA synthesis, i.e. replication of the replicon and synthesis of a subgenomic transcript comprising the second open reading frame (“Transgene” in FIG. 6).

Trans-Replication System

In a second aspect, the present invention provides a system comprising:

- a RNA construct for expressing functional alphavirus non-structural protein,
- the RNA replicon according to the first aspect of the invention, which can be replicated by the functional alphavirus non-structural protein in trans.

In the second aspect it is preferred that the RNA replicon does not comprise an open reading frame encoding functional alphavirus non-structural protein.

Thus, the present invention provides a system comprising two nucleic acid molecules: a first RNA construct for expressing functional alphavirus non-structural protein (i.e. encoding functional alphavirus non-structural protein); and a second RNA molecule, the RNA replicon. The RNA construct for expressing functional alphavirus non-structural protein is synonymously referred to herein as “RNA construct for expressing functional alphavirus non-structural protein” or as “replicase construct”.

The functional alphavirus non-structural protein is as defined above and is typically encoded by an open reading frame comprised by the replicase construct. The functional alphavirus non-structural protein encoded by the replicase construct may be any functional alphavirus non-structural protein that is capable of replicating the replicon.

When the system of the present invention is introduced into a cell, preferably a eukaryotic cell, the open reading frame encoding functional alphavirus non-structural protein can be translated. After translation, the functional alphavirus non-structural protein is capable of replicating a separate RNA molecule (RNA replicon) in trans. Thus, the present invention provides a system for replicating RNA in trans. Consequently, the system of the present invention is a trans-replication system. According to the second aspect, the replicon is a trans-replicon.

Herein, trans (e.g. in the context of trans-acting, trans-regulatory), in general, means “acting from a different molecule” (i.e., intermolecular). It is the opposite of cis (e.g. in the context of cis-acting, cis-regulatory), which, in general, means “acting from the same molecule” (i.e., intramolecular). In the context of RNA synthesis (including transcription and RNA replication), a trans-acting element includes a nucleic acid sequence that contains a gene encoding an enzyme capable of RNA synthesis (RNA polymerase). The RNA polymerase uses a second nucleic acid molecule, i.e. a nucleic acid molecule other than the one by which it is encoded, as template for the synthesis of RNA. Both the RNA polymerase and the nucleic acid sequence that contains a gene encoding the RNA polymerase are said to “act in trans” on the second nucleic acid molecule. In the context of the present invention, the RNA polymerase encoded by the trans-acting RNA is functional alphavirus non-structural protein. The functional alphavirus non-structural protein is capable of using a second nucleic acid molecule, which is an RNA replicon, as template for the synthesis or RNA, including replication of the RNA replicon. The RNA replicon that can be replicated by the replicase in trans according to the present invention is synonymously referred to herein as “trans-replicon”.

In the system of the present invention, the role of the functional alphavirus non-structural protein is to amplify the replicon, and to prepare a subgenomic transcript, if a subgenomic promoter is present on the replicon. If the replicon encodes a gene of interest for expression, the expression levels of the gene of interest and/or the duration of expression may be regulated in trans by modifying the levels of the functional alphavirus non-structural protein.

The fact that alphaviral replicase is generally able to recognize and replicate a template RNA in trans was initially discovered in the 1980s, but the potential of trans-replication for biomedical applications was not recognized, inter alia because trans-replicated RNA was considered to inhibit efficient replication: it was discovered in the case of defective interfering (DI) RNA that co-replicates with alphaviral genomes in infected cells (Barrett et al., 1984, J. Gen. Virol., vol. 65 (Pt 8), pp. 1273-1283; Lehtovaara et al., 1981, Proc. Natl. Acad. Sci. U. S. A, vol. 78, pp. 5353-5357; Pettersson, 1981, Proc. Natl. Acad. Sci. U. S. A, vol. 78, pp. 115-119). DI RNAs are trans-replicons that may occur quasi-naturally during infections of cell lines with high virus load. DI elements co-replicate so efficiently that they reduce the virulence of the parental virus and thereby act as inhibitory parasitic RNA (Barrett et al., 1984, J. Gen. Virol., vol. 65 (Pt 11), pp. 1909-1920). Although the potential for biomedical applications was not recognized, the phenomenon of trans-replication was used in several basic studies aiming to elucidate mechanisms of replication, without requiring to express the replicase from the same molecule in cis; further, the separation of replicase and replicon also allows functional studies involving mutants of viral proteins, even if respective mutants were loss-of-function mutants (Lemm et al., 1994, EMBO J., vol. 13, pp. 2925-2934). These loss-of function studies and DI RNA did not suggest that trans-activation systems based on alphaviral elements may eventually become available to suit therapeutic purposes.

The system of the present invention comprises at least two nucleic acid molecules. Thus, it may comprise two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more nucleic acid molecules, which are preferably RNA molecules. In a preferred embodiment, the system consists of exactly two RNA molecules, the replicon and the replicase construct. In alternative preferred embodiments, the system comprises more than one replicon, each preferably encoding at least one protein of interest, and also comprises the replicase construct. In these embodiments, the functional alphavirus non-structural protein encoded by the replicase construct can act on each replicon to drive replication and production of subgenomic transcripts, respectively. For example, each replicon may encode a chain of a T cell receptor or an artificial T cell receptor. This is advantageous e.g. if expression of more than one chain of a T cell receptor or an artificial T cell receptor in a cell is desired so as to form a functional T cell receptor or artificial T cell receptor consisting of more than one chain.

Preferably, the replicase construct lacks at least one conserved sequence element (CSE) that is required for (−) strand synthesis based on a (+) strand template, and/or for (+) strand synthesis based on a (−) strand template. More preferably, the replicase construct does not comprise any alphaviral conserved sequence elements (CSEs). In particular, among the four CSEs of alphavirus (Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562; José et al., Future Microbiol., 2009, vol. 4, pp. 837-856), any one or more of the following CSEs are preferably not present on the replicase construct: CSE 1; CSE 2; CSE 3; CSE 4. Particularly in the absence of any one or more alphaviral CSE, the replicase construct of the present invention resembles typical eukaryotic mRNA much more than it resembles alphaviral genomic RNA.

The replicase construct of the present invention is preferably distinguished from alphaviral genomic RNA at least in that it is not capable of self-replication and/or that it does not comprise an open reading frame under the control of a sub-genomic promoter. When unable to self-replicate, the replicase construct may also be termed “suicide construct”.

The trans-replication system is associated with the following advantages:

First and foremost, the versatility of the trans-replication system allows that replicon and replicase construct can be designed and/or prepared at different times and/or at different sites. In one embodiment, the replicase construct is prepared at a first point in time, and the replicon is prepared at a later point in time. For example, following its preparation, the replicase construct may be stored for use at a later point in time. The present invention provides increased flexibility compared to cis-replicons: the system of the present invention may be designed for treatment, by cloning into the replicon a nucleic acid encoding a new chain of a T cell receptor or an artificial T cell receptor. A previously prepared replicase construct may be recovered from storage. In other words, the replicase construct can be designed and prepared independently of any particular replicon.

Second, the trans-replicon according to the present invention is typically a shorter nucleic acid molecule than a typical cis-replicon. This enables faster cloning of a replicon encoding a protein of interest, and provides high yields of the protein of interest.

Further advantages of the system of the present invention include the independence from nuclear transcription and the presence of key genetic information on two separate RNA molecules, which provides unprecedented design freedom. In view of its versatile elements, which are combinable with each other, the present invention allows to optimize replicase expression for a desired level of RNA amplification, for a desired target organism, for a desired level of production of a protein of interest, etc. The system according to the invention allows to co-transfect varying amounts or ratios of replicon and replicase construct for any given cell type—resting or cycling, in vitro or in vivo.

The replicase construct according to the present invention is preferably a single stranded RNA molecule. The replicase construct according to the present invention is typically a (+) stranded RNA molecule. In one embodiment, the replicase construct of the present invention is an isolated nucleic acid molecule.

Preferred Features of RNA Molecules According to the Invention

RNA molecules according to the invention may optionally be characterized by further features, e.g. by a 5′-cap, a 5′-UTR, a 3′-UTR, a poly(A) sequence, and/or adaptation of the codon usage. Details are described in the following.

Cap

In some embodiments, the replicon according to the present invention comprises a 5′-cap.

In some embodiments, the replicase construct according to the present invention comprises a 5′-cap.

The terms “5′-cap”, “cap”, “5′-cap structure”, “cap structure” are used synonymously to refer to a dinucleotide that is found on the 5′ end of some eukaryotic primary transcripts such as precursor messenger RNA. A 5′-cap is a structure wherein a (optionally modified) guanosine is bonded to the first nucleotide of an mRNA molecule via a 5′ to 5′ triphosphate linkage (or modified triphosphate linkage in the case of certain cap analogs). The terms can refer to a conventional cap or to a cap analog. For illustration, some particular cap dinucleotides (including cap analog dinucleotides) are shown in FIG. 7.

“RNA which comprises a 5′-cap” or “RNA which is provided with a 5′-cap” or “RNA which is modified with a 5′-cap” or “capped RNA” refers to RNA which comprises a 5′-cap. For example, providing an RNA with a 5′-cap may be achieved by in vitro transcription of a DNA template in presence of said 5′-cap, wherein said 5′-cap is co-transcriptionally incorporated into the generated RNA strand, or the RNA may be generated, for example, by in vitro transcription, and the 5′-cap may be attached to the RNA post-transcriptionally using capping enzymes, for example, capping enzymes of vaccinia virus. In capped RNA, the 3′ position of the first base of a (capped) RNA molecule is linked to the 5′ position of the subsequent base of the RNA molecule (“second base”) via a phosphodiester bond.

Presence of a cap on an RNA molecule is strongly preferred if translation of a nucleic acid sequence encoding a protein at early stages after introduction of the respective RNA into host cells or into a host organism is desired. For example, presence of a cap allows that a gene of interest encoded by RNA replicon is translated efficiently at early stages after introduction of the respective RNA into host cells. “Early stages” typically means within the first 1 hour, or within the first two hours, or within the first three hours after introduction of the RNA.

Presence of a cap on an RNA molecule is also preferred if it is desired that translation occurs in the absence of functional replicase, or when only minor levels of replicase are present in a host cell. For example, even if a nucleic acid molecule encoding replicase is introduced into a host cell, at early stages after introduction the levels of replicase will typically be minor.

In the system according to the invention, it is preferred that the RNA construct for expressing functional alphavirus non-structural protein comprises a 5′-cap.

In particular when the RNA replicon according to the present invention is not used or provided together with a second nucleic acid molecule (e.g. mRNA) that encodes functional alphavirus non-structural protein, it is preferred that the RNA replicon comprises a 5′-cap. Independently, the RNA replicon may also comprise a 5′-cap even when it is used or provided together with a second nucleic acid molecule that encodes functional alphavirus non-structural protein.

The term “conventional 5′-cap” refers to a naturally occurring 5′-cap, preferably to the 7-methylguanosine cap. In the 7-methylguanosine cap, the guanosine of the cap is a modified guanosine wherein the modification consists of a methylation at the 7-position (top of FIG. 7).

In the context of the present invention, the term “5′-cap analog” refers to a molecular structure that resembles a conventional 5′-cap, but is modified to possess the ability to stabilize RNA if attached thereto, preferably in vivo and/or in a cell. A cap analog is not a conventional 5′-cap.

For the case of eukaryotic mRNA, the 5′-cap has been generally described to be involved in efficient translation of mRNA: in general, in eukaryotes, translation is initiated only at the 5′ end of a messenger RNA (mRNA) molecule, unless an internal ribosomal entry site (IRES) is present. Eukaryotic cells are capable of providing an RNA with a 5′-cap during transcription in the nucleus: newly synthesized mRNAs are usually modified with a 5′-cap structure, e.g. when the transcript reaches a length of 20 to 30 nucleotides. First, the 5′ terminal nucleotide pppN (ppp representing triphosphate; N representing any nucleoside) is converted in the cell to 5′ GpppN by a capping enzyme having RNA 5′-triphosphatase and guanylyltransferase activities. The GpppN may subsequently be methylated in the cell by a second enzyme with (guanine-7)-methyltransferase activity to form the mono-methylated m⁷GpppN cap. In one embodiment, the 5′-cap used in the present invention is a natural 5′-cap.

In the present invention, a natural 5′-cap dinucleotide is typically selected from the group consisting of a non-methylated cap dinucleotide (G(5′)ppp(5′)N; also termed GpppN) and a methylated cap dinucleotide ((m⁷G(5′)ppp(5′)N; also termed m⁷GpppN). m⁷GpppN (wherein N is G) is represented by the following formula:

embedded image

Capped RNA of the present invention can be prepared in vitro, and therefore, does not depend on a capping machinery in a host cell. The most frequently used method to make capped RNAs in vitro is to transcribe a DNA template with either a bacterial or bacteriophage RNA polymerase in the presence of all four ribonucleoside triphosphates and a cap dinucleotide such as m⁷G(5′)ppp(5′) G (also called m⁷GpppN). The RNA polymerase initiates transcription with a nucleophilic attack by the 3′-OH of the guanosine moiety of m⁷GpppG on the α-phosphate of the next templated nucleoside triphosphate (pppN), resulting in the intermediate m⁷GpppGpN (wherein N is the second base of the RNA molecule). The formation of the competing GTP-initiated product pppGpN is suppressed by setting the molar ratio of cap to GTP between 5 and 10 during in vitro transcription.

In preferred embodiments of the present invention, the 5′-cap (if present) is a 5′-cap analog. These embodiments are particularly suitable if the RNA is obtained by in vitro transcription, e.g. is an in vitro transcribed RNA (IVT-RNA). Cap analogs have been initially described to facilitate large scale synthesis of RNA transcripts by means of in vitro transcription.

For messenger RNA, some cap analogs (synthetic caps) have been generally described to date, and they can all be used in the context of the present invention. Ideally, a cap analog is selected that is associated with higher translation efficiency and/or increased resistance to in vivo degradation and/or increased resistance to in vitro degradation.

Preferably, a cap analog is used that can only be incorporated into an RNA chain in one orientation. Pasquinelli et al. (1995, RNA J., vol., 1, pp. 957-967) demonstrated that during in vitro transcription, bacteriophage RNA polymerases use the 7-methylguanosine unit for initiation of transcription, whereby around 40-50% of the transcripts with cap possess the cap dinucleotide in a reverse orientation (i.e., the initial reaction product is Gpppm⁷GpN). Compared to the RNAs with a correct cap, RNAs with a reverse cap are not functional with respect to translation of a nucleic acid sequence into protein. Thus, it is desirable to incorporate the cap in the correct orientation, i.e., resulting in an RNA with a structure essentially corresponding to m⁷GpppGpN etc. It has been shown that the reverse integration of the cap-dinucleotide is inhibited by the substitution of either the 2′- or the 3′—OH group of the methylated guanosine unit (Stepinski et al., 2001; RNA J., vol. 7, pp. 1486-1495; Peng et al., 2002; Org. Lett., vol. 24, pp. 161-164). RNAs which are synthesized in presence of such “anti reverse cap analogs” are translated more efficiently than RNAs which are in vitro transcribed in presence of the conventional 5′-cap m⁷GpppG. To that end, one cap analog in which the 3′ OH group of the methylated guanosine unit is replaced by OCH₃is described e.g. by Holtkamp et al., 2006, Blood, vol. 108, pp. 4009-4017 (7-methyl(3′-O-methyl)GpppG; anti-reverse cap analog (ARCA)). ARCA is a suitable cap dinucleotide according to the present invention.

embedded image

In a preferred embodiment of the present invention, the RNA of the present invention is essentially not susceptible to decapping. This is important because, in general, the amount of protein produced from synthetic mRNAs introduced into cultured mammalian cells is limited by the natural degradation of mRNA. One in vivo pathway for mRNA degradation begins with the removal of the mRNA cap. This removal is catalyzed by a heterodimeric pyrophosphatase, which contains a regulatory subunit (Dcp1) and a catalytic subunit (Dcp2). The catalytic subunit cleaves between the a and β phosphate groups of the triphosphate bridge. In the present invention, a cap analog may be selected or present that is not susceptible, or less susceptible, to that type of cleavage. A suitable cap analog for this purpose may be selected from a cap dinucleotide according to formula (I):

embedded image

- wherein R¹is selected from the group consisting of optionally substituted alkyl, optionally substituted alkenyl, optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted aryl, and optionally substituted heteroaryl,
- R²and R³are independently selected from the group consisting of H, halo, OH, and optionally substituted alkoxy, or R²and R³together form O—X—O, wherein X is selected from the group consisting of optionally substituted CH₂, CH₂CH₂, CH₂CH₂CH₂, CH₂CH(CH₃), and
- C(CH₃)₂, or R²is combined with the hydrogen atom at position 4′ of the ring to which R²is attached to form —O—CH₂— or —CH₂—O—,
- R⁵is selected from the group consisting of S, Se, and BH₃,

R⁴and R⁶are independently selected from the group consisting of O, S, Se, and BH₃. n is 1, 2, or 3.

Preferred embodiments for R¹, R², R3, R⁴, R⁵, R⁶are disclosed in WO 2011/015347 A1 and may be selected accordingly in the present invention.

For example, in a preferred embodiment of the present invention, the RNA of the present invention comprises a phosphorothioate-cap-analog. Phosphorothioate-cap-analogs are specific cap analogs in which one of the three non-bridging O atoms in the triphosphate chain is replaced with an S atom, i.e. one of R⁴, R⁵or R⁶in Formula (I) is S. Phosphorothioate-cap-analogs have been described by J. Kowalska et al., 2008, RNA, vol. 14, pp. 1119-1131, as a solution to the undesired decapping process, and thus to increase the stability of RNA in vivo. In particular, the substitution of an oxygen atom for a sulphur atom at the beta-phosphate group of the 5′-cap results in stabilization against Dcp2. In that embodiment, which is preferred in the present invention, R⁵in Formula (I) is S; and R⁴and R⁶are O.

In a further preferred embodiment of the present invention, the RNA of the present invention comprises a phosphorothioate-cap-analog wherein the phosphorothioate modification of the RNA 5′-cap is combined with an “anti-reverse cap analog” (ARCA) modification. Respective ARCA-phosphorothioate-cap-analogs are described in WO 2008/157688 A2, and they can all be used in the RNA of the present invention. In that embodiment, at least one of R²or R³in Formula (I) is not OH, preferably one among R²and R³is methoxy (OCH₃), and the other one among R²and R³is preferably OH. In a preferred embodiment, an oxygen atom is substituted for a sulphur atom at the beta-phosphate group (so that R⁵in Formula (I) is S; and R⁴and R⁶are O). It is believed that the phosphorothioate modification of the ARCA ensures that the α, β, and γ phosphorothioate groups are precisely positioned within the active sites of cap-binding proteins in both the translational and decapping machinery. At least some of these analogs are essentially resistant to pyrophosphatase Dcp1/Dcp2. Phosphorothioate-modified ARCAs were described to have a much higher affinity for eIF4E than the corresponding ARCAs lacking a phosphorothioate group.

A respective cap analog that is particularly preferred in the present invention, i.e., m₂,^7,2′-OGpp_spG, is termed beta-S-ARCA (WO 2008/157688 A2; Kuhn et al., Gene Ther., 2010, vol. 17, pp. 961-971). Thus, in one embodiment of the present invention, the RNA of the present invention is modified with beta-S-ARCA. beta-S-ARCA is represented by the following structure:

embedded image

In general, the replacement of an oxygen atom for a sulphur atom at a bridging phosphate results in phosphorothioate diastereomers which are designated D1 and D2, based on their elution pattern in HPLC. Briefly, the D1 diastereomer of beta-S-ARCA″ or “beta-S-ARCA(D1)” is the diastereomer of beta-S-ARCA which elutes first on an HPLC column compared to the D2 diastereomer of beta-S-ARCA (beta-S-ARCA(D2)) and thus exhibits a shorter retention time. Determination of the stereochemical configuration by HPLC is described in WO 2011/015347 A1.

In a first particularly preferred embodiment of the present invention, RNA of the present invention is modified with the beta-S-ARCA(D2) diastereomer. The two diastereomers of beta-S-ARCA differ in sensitivity against nucleases. It has been shown that RNA carrying the D2 diastereomer of beta-S-ARCA is almost fully resistant against Dcp2 cleavage (only 6% cleavage compared to RNA which has been synthesized in presence of the unmodified ARCA 5′-cap), whereas RNA with the beta-S-ARCA(D1) 5′-cap exhibits an intermediary sensitivity to Dcp2 cleavage (71% cleavage). It has further been shown that the increased stability against Dcp2 cleavage correlates with increased protein expression in mammalian cells. In particular, it has been shown that RNAs carrying the beta-S-ARCA(D2) cap are more efficiently translated in mammalian cells than RNAs carrying the beta-S-ARCA(D1) cap. Therefore, in one embodiment of the present invention, RNA of the present invention is modified with a cap analog according to Formula (I), characterized by a stereochemical configuration at the P atom comprising the substituent R⁵in Formula (I) that corresponds to that at the P_β atom of the D2 diastereomer of beta-S-ARCA. In that embodiment, R⁵in Formula (I) is S; and R⁴and R⁶are O. Additionally, at least one of R²or R³in Formula (I) is preferably not OH, preferably one among R²and R³is methoxy (OCH3), and the other one among R²and R³is preferably OH.

In a second particularly preferred embodiment, RNA of the present invention is modified with the beta-S-ARCA(D1) diastereomer. It has been demonstrated that the beta-S-ARCA(D1) diastereomer, upon transfer of respectively capped RNA into immature antigen presenting cells, is particularly suitable for increasing the stability of the RNA, increasing translation efficiency of the RNA, prolonging translation of the RNA, increasing total protein expression of the RNA, and/or increasing the immune response against an antigen or antigen peptide encoded by said RNA (Kuhn et al., 2010, Gene Ther., vol. 17, pp. 961-971). Therefore, in an alternative embodiment of the present invention, RNA of the present invention is modified with a cap analog according to Formula (I), characterized by a stereochemical configuration at the P atom comprising the substituent R⁵in Formula (I) that corresponds to that at the P_β atom of the D1 diastereomer of beta-S-ARCA. Respective cap analogs and embodiments thereof are described in WO 2011/015347 A1 and Kuhn et al., 2010, Gene Ther., vol. 17, pp. 961-971. Any cap analog described in WO 2011/015347 A1, wherein the stereochemical configuration at the P atom comprising the substituent R⁵corresponds to that at the P_β atom of the D1 diastereomer of beta-S-ARCA, may be used in the present invention. Preferably, R⁵in Formula (I) is S; and R⁴and R⁶are O. Additionally, at least one of R²or R³in Formula (I) is preferably not OH, preferably one among R²and R³is methoxy (OCH3), and the other one among R²and R³is preferably OH.

In one embodiment, RNA of the present invention is modified with a 5′-cap structure according to Formula (I) wherein any one phosphate group is replaced by a boranophosphate group or a phosphoroselenoate group. Such caps have increased stability both in vitro and in vivo. Optionally, the respective compound has a 2′—O— or 3′-O-alkyl group (wherein alkyl is preferably methyl); respective cap analogs are termed BH₃-ARCAs or Se-ARCAs. Compounds that are particularly suitable for capping of mRNA include the β-BH₃-ARCAs and β-Se-ARCAs, as described in WO 2009/149253 A2. For these compounds, a stereochemical configuration at the P atom comprising the substituent R₅in Formula (I) that corresponds to that at the PB atom of the D1 diastereomer of beta-S-ARCA is preferred.

UTR

The term “untranslated region” or “UTR” relates to a region in a DNA molecule which is transcribed but is not translated into an amino acid sequence, or to the corresponding region in an RNA molecule, such as an mRNA molecule. An untranslated region (UTR) can be present 5′ (upstream) of an open reading frame (5′-UTR) and/or 3′ (downstream) of an open reading frame (3′-UTR).

A 3′-UTR, if present, is located at the 3′ end of a gene, downstream of the termination codon of a protein-encoding region, but the term “3′-UTR” does preferably not include the poly(A) tail. Thus, the 3′-UTR is upstream of the poly(A) tail (if present), e.g. directly adjacent to the poly(A) tail.

A 5′-UTR, if present, is located at the 5′ end of a gene, upstream of the start codon of a protein-encoding region. A 5′-UTR is downstream of the 5′-cap (if present), e.g. directly adjacent to the 5′-cap.

5′- and/or 3′-untranslated regions may, according to the invention, be functionally linked to an open reading frame, so as for these regions to be associated with the open reading frame in such a way that the stability and/or translation efficiency of the RNA comprising said open reading frame are increased.

In some embodiments, the replicase construct according to the present invention comprises a 5′-UTR and/or a 3′-UTR.

In a preferred embodiment, the replicase construct according to the present invention comprises

- (1) a 5′-UTR,
- (2) an open reading frame, and
- (3) a 3′-UTR.

UTRs are implicated in stability and translation efficiency of RNA. Both can be improved, besides structural modifications concerning the 5′-cap and/or the 3′ poly(A)-tail as described herein, by selecting specific 5′ and/or 3′ untranslated regions (UTRs). Sequence elements within the UTRs are generally understood to influence translational efficiency (mainly 5′-UTR) and RNA stability (mainly 3′-UTR). It is preferable that a 5′-UTR is present that is active in order to increase the translation efficiency and/or stability of the replicase construct. Independently or additionally, it is preferable that a 3′-UTR is present that is active in order to increase the translation efficiency and/or stability of the replicase construct.

The terms “active in order to increase the translation efficiency” and/or “active in order to increase the stability”, with reference to a first nucleic acid sequence (e.g. a UTR), means that the first nucleic acid sequence is capable of modifying, in a common transcript with a second nucleic acid sequence, the translation efficiency and/or stability of said second nucleic acid sequence in such a way that said translation efficiency and/or stability is increased in comparison with the translation efficiency and/or stability of said second nucleic acid sequence in the absence of said first nucleic acid sequence.

In one embodiment, the replicase construct according to the present invention comprises a 5′-UTR and/or a 3′-UTR which is heterologous or non-native to the alphavirus from which the functional alphavirus non-structural protein is derived. This allows the untranslated regions to be designed according to the desired translation efficiency and RNA stability. Thus, heterologous or non-native UTRs allow for a high degree of flexibility, and this flexibility is advantageous compared to native alphaviral UTRs. In particular, while it is known that alphaviral (native) RNA also comprises a 5′-UTR and/or a 3′-UTR, alphaviral UTRs fulfil a dual function, i.e. (i) to drive RNA replication as well as (ii) to drive translation. While alphaviral UTRs were reported to be inefficient for translation (Berben-Bloemheuvel et al., 1992, Eur. J. Biochem., vol. 208, pp. 581-587), they can typically not readily be replaced by more efficient UTRs because of their dual function. In the present invention, however, a 5′-UTR and/or a 3′-UTR comprised in a replicase construct for replication in trans can be selected independent of their potential influence on RNA replication.

Preferably, the replicase construct according to the present invention comprises a 5′-UTR and/or a 3′-UTR that is not of virus origin; particularly not of alphavirus origin. In one embodiment, the replicase construct comprises a 5′-UTR derived from a eukaryotic 5′-UTR and/or a 3′-UTR derived from a eukaryotic 3′-UTR.

A 5′-UTR according to the present invention can comprise any combination of more than one nucleic acid sequence, optionally separated by a linker. A 3′-UTR according to the present invention can comprise any combination of more than one nucleic acid sequence, optionally separated by a linker.

The term “linker” according to the invention relates to a nucleic acid sequence added between two nucleic acid sequences to connect said two nucleic acid sequences. There is no particular limitation regarding the linker sequence.

A 3′-UTR typically has a length of 200 to 2000 nucleotides, e.g. 500 to 1500 nucleotides. The 3′-untranslated regions of immunoglobulin mRNAs are relatively short (fewer than about 300 nucleotides), while the 3′-untranslated regions of other genes are relatively long. For example, the 3′-untranslated region of tPA is about 800 nucleotides in length, that of factor VIII is about 1800 nucleotides in length and that of erythropoietin is about 560 nucleotides in length. The 3′-untranslated regions of mammalian mRNA typically have a homology region known as the AAUAAA hexanucleotide sequence. This sequence is presumably the poly(A) attachment signal and is frequently located from 10 to 30 bases upstream of the poly(A) attachment site. 3′-untranslated regions may contain one or more inverted repeats which can fold to give stem-loop structures which act as barriers for exoribonucleases or interact with proteins known to increase RNA stability (e.g. RNA-binding proteins).

The human beta-globin 3′-UTR, particularly two consecutive identical copies of the human beta-globin 3′-UTR, contributes to high transcript stability and translational efficiency (Holtkamp et al., 2006, Blood, vol. 108, pp. 4009-4017). Thus, in one embodiment, the replicase construct according to the present invention comprises two consecutive identical copies of the human beta-globin 3′-UTR. Thus, it comprises in the 5′->3′ direction: (a) optionally a 5′-UTR; (b) an open reading frame; (c) a 3′-UTR; said 3′-UTR comprising two consecutive identical copies of the human beta-globin 3′-UTR, a fragment thereof, or a variant of the human beta-globin 3′-UTR or fragment thereof.

In one embodiment, the replicase construct according to the present invention comprises a 3′-UTR which is active in order to increase translation efficiency and/or stability, but which is not the human beta-globin 3′-UTR, a fragment thereof, or a variant of the human beta-globin 3′-UTR or fragment thereof.

In one embodiment, the replicase construct according to the present invention comprises a 5′-UTR which is active in order to increase translation efficiency and/or stability.

A UTR-containing replicase construct according to the invention can be prepared e.g. by in vitro transcription. This may be achieved by genetically modifying a template nucleic acid molecule (e.g. DNA) in such a way that it allows transcription of RNA with 5′-UTRs and/or 3′-UTRs.

As illustrated in FIG. 6, also the replicon can be characterized by a 5′-UTR and/or a 3′-UTR. The UTRs of the replicon are typically alphaviral UTRs or variants thereof.

Poly(A) sequence

In some embodiments, the replicon according to the present invention comprises a 3′-poly(A) sequence. If the replicon comprises conserved sequence element 4 (CSE 4), the 3′-poly(A) sequence of the replicon is preferably present downstream of CSE 4, most preferably directly adjacent to CSE 4.

In some embodiments, the replicase construct according to the present invention comprises a 3′-poly(A) sequence.

According to the invention, in one embodiment, a poly(A) sequence comprises or essentially consists of or consists of at least 20, preferably at least 26, preferably at least 40, preferably at least 80, preferably at least 100 and preferably up to 500, preferably up to 400, preferably up to 300, preferably up to 200, and in particular up to 150, A nucleotides, and in particular about 120 A nucleotides. In this context “essentially consists of” means that most nucleotides in the poly(A) sequence, typically at least 50%, and preferably at least 75% by number of nucleotides in the “poly(A) sequence”, are A nucleotides (adenylate), but permits that remaining nucleotides are nucleotides other than A nucleotides, such as U nucleotides (uridylate), G nucleotides (guanylate), C nucleotides (cytidylate). In this context “consists of” means that all nucleotides in the poly(A) sequence, i.e. 100% by number of nucleotides in the poly(A) sequence, are A nucleotides. The term “A nucleotide” or “A” refers to adenylate.

Indeed, it has been demonstrated that a 3′ poly(A) sequence of about 120 A nucleotides has a beneficial influence on the levels of RNA in transfected eukaryotic cells, as well as on the levels of protein that is translated from an open reading frame that is present upstream (5′) of the 3′ poly(A) sequence (Holtkamp et al., 2006, Blood, vol. 108, pp. 4009-4017).

In alphaviruses, a 3′ poly(A) sequence of at least 11 consecutive adenylate residues, or at least 25 consecutive adenylate residues, is thought to be important for efficient synthesis of the minus strand. In particular, in alphaviruses, a 3′ poly(A) sequence of at least 25 consecutive adenylate residues is understood to function together with conserved sequence element 4 (CSE 4) to promote synthesis of the (−) strand (Hardy & Rice, J. Virol., 2005, vol. 79, pp. 4630-4639).

The present invention provides for a 3′ poly(A) sequence to be attached during RNA transcription, i.e. during preparation of in vitro transcribed RNA, based on a DNA template comprising repeated dT nucleotides (deoxythymidylate) in the strand complementary to the coding strand. The DNA sequence encoding a poly(A) sequence (coding strand) is referred to as poly(A) cassette.

In a preferred embodiment of the present invention, the 3′ poly(A) cassette present in the coding strand of DNA essentially consists of dA nucleotides, but is interrupted by a random sequence having an equal distribution of the four nucleotides (dA, dC, dG, dT). Such random sequence may be 5 to 50, preferably 10 to 30, more preferably 10 to 20 nucleotides in length. Such a cassette is disclosed in WO 2016/005004 A1. Any poly(A) cassette disclosed in WO 2016/005004 A1 may be used in the present invention. A poly(A) cassette that essentially consists of dA nucleotides, but is interrupted by a random sequence having an equal distribution of the four nucleotides (dA, dC, dG, dT) and having a length of e.g. 5 to 50 nucleotides shows, on DNA level, constant propagation of plasmid DNA in E. coli and is still associated, on RNA level, with the beneficial properties with respect to supporting RNA stability and translational efficiency.

Consequently, in a preferred embodiment of the present invention, the 3′ poly(A) sequence contained in an RNA molecule described herein essentially consists of A nucleotides, but is interrupted by a random sequence having an equal distribution of the four nucleotides (A, C, G, U). Such random sequence may be 5 to 50, preferably 10 to 30, more preferably 10 to 20 nucleotides in length.

Codon Usage

In general, the degeneracy of the genetic code will allow the substitution of certain codons (base triplets coding for an amino acid) that are present in an RNA sequence by other codons (base triplets), while maintaining the same coding capacity (so that the replacing codon encodes the same amino acid as the replaced codon). In some embodiments of the present invention, at least one codon of an open reading frame comprised by a RNA molecule differs from the respective codon in the respective open reading frame in the species from which the open reading frame originates. In that embodiment, the coding sequence of the open reading frame is said to be “adapted” or “modified”. The coding sequence of an open reading frame comprised by the replicon may be adapted. Alternatively or additionally, the coding sequence for functional alphavirus non-structural protein comprised by the replicase construct may be adapted.

For example, when the coding sequence of an open reading frame is adapted, frequently used codons may be selected: WO 2009/024567 A1 describes the adaptation of a coding sequence of a nucleic acid molecule, involving the substitution of rare codons by more frequently used codons. Since the frequency of codon usage depends on the host cell or host organism, that type of adaptation is suitable to fit a nucleic acid sequence to expression in a particular host cell or host organism. Generally, speaking, more frequently used codons are typically translated more efficiently in a host cell or host organism, although adaptation of all codons of an open reading frame is not always required.

For example, when the coding sequence of an open reading frame is adapted, the content of G (guanylate) residues and C (cytidylate) residues may be altered by selecting codons with the highest GC-rich content for each amino acid. RNA molecules with GC-rich open reading frames were reported to have the potential to reduce immune activation and to improve translation and half-life of RNA (Thess et al., 2015, Mol. Ther. 23, 1457-1465).

When the replicon according to the present invention encodes alphavirus non-structural protein, the coding sequence for alphavirus non-structural protein can be adapted as desired. This freedom is possible because the open reading frame encoding alphavirus non-structural protein does not overlap with the 5′ replication recognition sequence of the replicon.

Safety features of embodiments of the present invention

The following features are preferred in the present invention, alone or in any suitable combination:

Preferably, the replicon or the system of the present invention is not particle-forming. This means that, following inoculation of a host cell by the replicon or the system of the present invention, the host cell does not produce virus particles, such as next generation virus particles. In one embodiment, all RNA molecules according to the invention are completely free of genetic information encoding any alphavirus structural protein, such as core nucleocapsid protein C, envelope protein P62, and/or envelope protein E1. This aspect of the present invention provides an added value in terms of safety over prior art systems wherein structural proteins are encoded on trans-replicating helper RNA (e.g. Bredenbeek et al., J. Virol, 1993, vol. 67, pp. 6439-6446).

Preferably, the system of the present invention does not comprise any alphavirus structural protein, such as core nucleocapsid protein C, envelope protein P62, and/or envelope protein E1.

Preferably, the replicon and the replicase construct of the system of the present invention are non-identical to each other. In one embodiment, the replicon does not encode functional alphavirus non-structural protein. In one embodiment, the replicase construct lacks at least one sequence element (preferably at least one CSE) that is required for (−) strand synthesis based on a (+) strand template, and/or for (+) strand synthesis based on a (−) strand template. In one embodiment, the replicase construct does not comprise CSE 1 and/or CSE 4.

Preferably, neither the replicon according to the present invention nor the replicase construct according to the present invention comprises an alphavirus packaging signal. For example, the alphavirus packaging signal comprised in the coding region of nsP2 of SFV (White et al. 1998, J. Virol., vol. 72, pp. 4320-4326) may be removed, e.g. by deletion or mutation. A suitable way of removing the alphavirus packaging signal includes adaptation of the codon usage of the coding region of nsP2. The degeneration of the genetic code may allow to delete the function of the packaging signal without affecting the amino acid sequence of the encoded nsP2.

In one embodiment, the system of the present invention is an isolated system. In that embodiment, the system is not present inside a cell, such as inside a mammalian cell, or is not present inside a virus capsid, such as inside a coat comprising alphavirus structural proteins. In one embodiment, the system of the present invention is present in vitro.

DNA

In a third aspect, the present invention provides a DNA comprising a nucleic acid sequence encoding the RNA replicon according to the first aspect of the present invention.

Preferably, the DNA is double-stranded.

In a preferred embodiment, the DNA according to the third aspect of the invention is a plasmid. The term “plasmid”, as used herein, generally relates to a construct of extrachromosomal genetic material, usually a circular DNA duplex, which can replicate independently of chromosomal DNA.

The DNA of the present invention may comprise a promoter that can be recognized by a DNA-dependent RNA-polymerase. This allows for transcription of the encoded RNA in vivo or in vitro, e.g. of the RNA of the present invention. IVT vectors may be used in a standardized manner as template for in vitro transcription. Examples of promoters preferred according to the invention are promoters for SP6, T3 or T7 polymerase.

In one embodiment, the DNA of the present invention is an isolated nucleic acid molecule.

Methods of Preparing RNA

Any RNA molecule according to the present invention, be it part of the system of the present invention or not, may be obtainable by in vitro transcription. In vitro-transcribed RNA (IVT-RNA) is of particular interest in the present invention. IVT-RNA is obtainable by transcription from a nucleic acid molecule (particularly a DNA molecule). The DNA molecule(s) of the third aspect of the present invention are suitable for such purposes, particularly if comprising a promoter that can be recognized by a DNA-dependent RNA-polymerase.

RNA according to the present invention can be synthesized in vitro. This allows to add cap-analogs to the in vitro transcription reaction. Typically, the poly(A) tail is encoded by a poly-(dT) sequence on the DNA template. Alternatively, capping and poly(A) tail addition can be achieved enzymatically after transcription.

The in vitro transcription methodology is known to the skilled person. For example, as mentioned in WO 2011/015347 A1, a variety of in vitro transcription kits is commercially available.

Methods for Producing Cells and Cells Produced Thereby

In further aspects, the present invention provides a method for producing cells such as immunoreactive cells comprising transducing a cell such as a T cell or a progenitor thereof with one or more RNA replicons of the invention and optionally a RNA construct for expressing functional alphavirus non-structural protein, or DNA encoding said RNA.

In one embodiment, the present invention provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor, the method comprising the steps of:

- (a) obtaining one or more RNA replicons according to the invention, which RNA replicon(s) comprise(s) an open reading frame encoding functional alphavirus non-structural protein, can be replicated by the functional alphavirus non-structural protein and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (b) inoculating the RNA replicon(s) or the DNA into a cell.

In a further embodiment, the present invention provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor, the method comprising the steps of:

- (a) obtaining a RNA construct for expressing functional alphavirus non-structural protein or DNA comprising nucleic acid sequence encoding the RNA construct,
- (b) obtaining one or more RNA replicon(s) according to any one of claims 1 to 13, and 15, which RNA replicon(s) can be replicated by the functional alphavirus non-structural protein in trans and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (c) co-inoculating the RNA construct or the DNA and the RNA replicon(s) or the DNA into a cell.

In one embodiment, the transfected cell expresses the functional alphavirus non-structural protein encoded by one or more transfected replicons and/or by a transfected replicase construct and/or the chain(s) of a T cell receptor or an artificial T cell receptor encoded by one or more transfected replicons. In one embodiment, the cell expresses a T cell receptor or an artificial T cell receptor, preferably on its cell surface. The different chains of a T cell receptor or an artificial T cell receptor may be encoded by different open reading frames residing on the same replicon or on different RNA replicons. In the latter embodiment, the different replicons are preferably co-transfected into a cell.

The cell into which one or more nucleic molecules can be inoculated or transfected can be referred to as “host cell”. According to the invention, the term “host cell” refers to any cell which can be transformed or transfected with an exogenous nucleic acid molecule. The term “cell” preferably is an intact cell, i.e. a cell with an intact membrane that has not released its normal intracellular components such as enzymes, organelles, or genetic material. An intact cell preferably is a viable cell, i.e. a living cell capable of carrying out its normal metabolic functions. The term “host cell” comprises, according to the invention, prokaryotic (e.g. E. coli) or eukaryotic cells (e.g. human and animal cells, plant cells, yeast cells and insect cells). Particular preference is given to mammalian cells such as cells from humans, mice, hamsters, pigs, domesticated animals including horses, cows, sheep and goats, as well as primates. The cells may be derived from a multiplicity of tissue types and comprise primary cells and cell lines. Specific examples include keratinocytes, peripheral blood leukocytes, bone marrow stem cells and embryonic stem cells. In other embodiments, the host cell is an antigen-presenting cell, in particular a dendritic cell, a monocyte or a macrophage. A nucleic acid may be present in the host cell in a single or in several copies and, in one embodiment is expressed in the host cell.

The cell may be a prokaryotic cell or a eukaryotic cell. Prokaryotic cells are suitable herein e.g. for propagation of DNA according to the invention, and eukaryotic cells are suitable herein e.g. for expression of the open reading frame of the replicon.

For purposes of the present invention, the terms such as “transduction” or “transfection” refer to the introduction of a nucleic acid into a cell or the uptake of a nucleic acid by a cell in vitro or in vivo. According to the present invention, a cell for transfection of a nucleic acid described herein can be present in vitro or in vivo, e.g. the cell can form part of an organ, a tissue and/or an organism of a patient. According to the invention, transfection can be transient or stable. For some applications of transfection, it is sufficient if the transfected genetic material is only transiently expressed. Since the nucleic acid introduced in the transfection process is usually not integrated into the nuclear genome, the foreign nucleic acid will be diluted through mitosis or degraded. Cells allowing episomal amplification of nucleic acids greatly reduce the rate of dilution. If it is desired that the transfected nucleic acid actually remains in the genome of the cell and its daughter cells, a stable transfection must occur. RNA can be transfected into cells to transiently express its coded protein.

According to the present invention, any technique useful for introducing, i.e. transferring or transfecting, nucleic acids into cells may be used. Preferably, nucleic acid such as RNA is transfected into cells by standard techniques. Such techniques include electroporation, lipofection and microinjection. In one particularly preferred embodiment of the present invention, RNA is introduced into cells by electroporation. Electroporation or electropermeabilization relates to a significant increase in the electrical conductivity and permeability of the cell plasma membrane caused by an externally applied electrical field. It is usually used in molecular biology as a way of introducing some substance into a cell. According to the invention it is preferred that introduction of nucleic acid encoding a protein or peptide into cells results in expression of said protein or peptide.

For transfection of cells in vivo a pharmaceutical composition comprising nucleic acid may be used. A delivery vehicle that targets the nucleic acid to a specific cell such as a T cell may be administered to a patient, resulting in transfection that occurs in vivo.

In one embodiment, a method for producing a cell is an in vitro method. In one embodiment, a method for producing a cell comprises or does not comprise the removal of a cell from a human or animal subject by surgery or therapy.

In this embodiment, the cell produced according to the invention may be administered to a subject. The cell may be autologous, syngenic, allogenic or heterologous with respect to the subject. Transfected cells may be (re)introduced into a subject using any means known in the art.

In other embodiments, the cell may be present in a subject, such as a patient. In these embodiments, the method for producing a cell is an in vivo method which comprises administration of RNA and/or DNA molecules to the subject.

In this respect, the invention also provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor in a subject, the method comprising the steps of:

- (a) obtaining one or more RNA replicons of the invention, which RNA replicon(s) comprise(s) an open reading frame encoding functional alphavirus non-structural protein, can be replicated by the functional alphavirus non-structural protein and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (b) administering the RNA replicon(s) or the DNA to the subject.

In various embodiments of the method, the RNA replicon is as defined above for the RNA replicon of the invention, as long as the RNA replicon comprises an open reading frame encoding functional alphavirus non-structural protein and an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor, and can be replicated by the functional alphavirus non-structural protein.

The invention further provides a method for producing a cell expressing a T cell receptor or an artificial T cell receptor in a subject, the method comprising the steps of:

- (a) obtaining a RNA construct for expressing functional alphavirus non-structural protein or DNA comprising nucleic acid sequence encoding the RNA construct,
- (b) obtaining one or more RNA replicon(s) of the invention, which RNA replicon(s) can be replicated by the functional alphavirus non-structural protein in trans and comprise(s) (an) open reading frame(s) encoding the chain(s) of the T cell receptor or artificial T cell receptor, or DNA comprising nucleic acid sequence encoding said RNA replicon(s), and
- (c) administering the RNA construct or the DNA and the RNA replicon(s) or the DNA to the subject.

In various embodiments of the method, the RNA construct for expressing functional alphavirus non-structural protein and/or the RNA replicon are as defined above for the system of the invention, as long as the RNA replicon can be replicated by the functional alphavirus non-structural protein in trans and comprises an open reading frame encoding a chain of a T cell receptor or of an artificial T cell receptor. The RNA construct for expressing functional alphavirus non-structural protein and the RNA replicon may either be administered at the same point in time, or may alternatively be administered at different points in time. In the second case, the RNA construct for expressing functional alphavirus non-structural protein is typically administered at a first point in time, and the RNA replicon is typically administered at a second, later, point in time. In that case, it is envisaged that the replicon will be immediately replicated since replicase will already have been synthesized in the cell. The second point in time is typically shortly after the first point in time, e.g. 1 minute to 24 hours after the first point in time. Preferably the administration of the RNA replicon is performed at the same site and via the same route of administration as the administration of the RNA construct for expressing functional alphavirus non-structural protein, in order to increase the prospects that the RNA replicon and the RNA construct for expressing functional alphavirus non-structural protein reach the same target tissue or cell. “Site” refers to the position of a subject's body. Suitable sites are for example, the left arm, right arm, etc.

In one embodiment, an additional RNA molecule, preferably an mRNA molecule, may be administered to the subject. Optionally, the additional RNA molecule encodes a protein suitable for inhibiting IFN, such as E3. Optionally, the additional RNA molecule may be administered prior to administration of the replicon or of the replicase construct or of the system according to the invention.

Any of the RNA replicon according to the invention, the system according to the invention, or the kit according to the invention, or the pharmaceutical composition according to the invention can be used in the method for producing a cell in a subject according to the invention. For example, in the method of the invention, RNA can be used in the format of a pharmaceutical composition, e.g. as described herein, or as naked RNA.

In one embodiment of the invention, a T cell receptor or artificial T cell receptor may comprise more than one polypeptide chain such as two polypeptide chains which have to form a complex within a cell so as to generate a functional T cell receptor or artificial T cell receptor. Accordingly, the present invention involves embodiment wherein a set of RNA replicons encodes the different chains of a T cell receptor or artificial T cell receptor. For example, different RNA replicons may comprise different open reading frames encoding different chains of said T cell receptor or artificial T cell receptor. These different RNA replicons, i.e., a set of RNA replicons, may be co-inoculated (optionally together with a RNA construct for expressing functional alphavirus non-structural protein) into a cell to provide a functional T cell receptor or artificial T cell receptor.

Cells

It is particularly preferred according to the invention to introduce nucleic acids encoding an antigen receptor into immune effector cells such as T cells or other cells with lytic potential, in particular lymphoid cells.

The term “immunoreactive cell” or “immune effector cell” in the context of the present invention relates to a cell which exerts effector functions during an immune reaction. An “immunoreactive cell” preferably is capable of binding an antigen such as an antigen expressed on the surface of a cell and mediating an immune response. For example, such cells secrete cytokines and/or chemokines, kill microbes, secrete antibodies, recognize infected or cancerous cells, and optionally eliminate such cells. For example, immunoreactive cells comprise T cells (cytotoxic T cells, helper T cells, tumor infiltrating T cells), B cells, natural killer cells, neutrophils, macrophages, and dendritic cells. Preferably, in the context of the present invention, “immunoreactive cells” are T cells, preferably CD4⁺ and/or CD8⁺ T cells. According to the invention, the term “immunoreactive cell” also includes a cell which can mature into an immune cell (such as T cell, in particular T helper cell, or cytolytic T cell) with suitable stimulation. Immunoreactive cells comprise CD34⁺ hematopoietic stem cells, immature and mature T cells and immature and mature B cells. The differentiation of T cell precursors into a cytolytic T cell, when exposed to an antigen, is similar to clonal selection of the immune system.

Preferably, an “immunoreactive cell” or “immune effector cell” recognizes an antigen with some degree of specificity, in particular if present on the surface of antigen presenting cells or diseased cells such as cancer cells. Preferably, said recognition enables the cell that recognizes an antigen to be responsive or reactive. If the cell is a helper T cell (CD4⁺ T cell) such responsiveness or reactivity may involve the release of cytokines and/or the activation of CD8⁺ lymphocytes (CTLs) and/or B-cells. If the cell is a CTL such responsiveness or reactivity may involve the elimination of cells, i.e., cells characterized by expression of an antigen, for example, via apoptosis or perforin-mediated cell lysis. According to the invention, CTL responsiveness may include sustained calcium flux, cell division, production of cytokines such as IFN-γ and TNF-α, up-regulation of activation markers such as CD44 and CD69, and specific cytolytic killing of antigen expressing target cells. CTL responsiveness may also be determined using an artificial reporter that accurately indicates CTL responsiveness. Such CTL that recognizes an antigen and are responsive or reactive are also termed “antigen-responsive CTL” herein.

The term “immune effector functions” or “effector functions” in the context of the present invention includes any functions mediated by components of the immune system that result, for example, in the killing of diseased cells such as tumor cells, or in the inhibition of tumor growth and/or inhibition of tumor development, including inhibition of tumor dissemination and metastasis. Preferably, the immune effector functions in the context of the present invention are T cell mediated effector functions. Such functions comprise in the case of a helper T cell (CD4⁺ T cell) the release of cytokines such as Interleukin-2 and/or the activation of CD8⁺ lymphocytes (CTLs) and/or B-cells, and in the case of CTL the elimination of cells, i.e., cells characterized by expression of an antigen, for example, via apoptosis or perforin-mediated cell lysis, production of cytokines such as IFN-γ and TNF-α, and specific cytolytic killing of antigen expressing target cells.

The cells used in connection with the present invention are preferably immune effector cells and the immune effector cells are preferably T cells. In particular, the cells used herein are cytotoxic lymphocytes, preferably selected from cytotoxic T cells, natural killer (NK) cells, and lymphokine-activated killer (LAK) cells. Upon activation/stimulation, each of these cytotoxic lymphocytes triggers the destruction of target cells. For example, cytotoxic T cells trigger the destruction of target cells by either or both of the following means. First, upon activation, the T cells release cytotoxins such as perforin, granzymes, and granulysin. Perforin and granulysin create pores in the target cell, and granzymes enter the cell and trigger a caspase cascade in the cytoplasm that induces apoptosis (programmed cell death) of the cell. Second, apoptosis can be induced via Fas-Fas ligand interaction between the T cells and target cells. The T cells and other cytotoxic lymphocytes will preferably be autologous cells, although heterologous cells or allogenic cells can be used.

The terms “T cell” and “T lymphocyte” are used interchangeably herein and include T helper cells (CD4+ T cells) and cytotoxic T cells (CTLs, CD8+ T cells) which comprise cytolytic T cells.

The T cells to be used according to the invention may express an endogenous T cell receptor or may lack expression of an endogenous T cell receptor.

Kit

The present invention also provides a kit comprising an RNA replicon according to the first aspect of the invention or a system according to the second aspect of the invention.

In one embodiment, the constituents of the kit are present as separate entities. For example, one nucleic acid molecule of the kit may be present in one entity, and the another nucleic acid of the kit may be present in a separate entity. For example, an open or closed container is a suitable entity. A closed container is preferred. The container used should preferably be RNAse-free or essentially RNAse-free.

In one embodiment, the kit of the present invention comprises RNA for inoculation with a cell and/or for administration to a human or animal subject.

The kit according to the present invention optionally comprises a label or other form of information element, e.g. an electronic data carrier. The label or information element preferably comprises instructions, e.g. printed written instructions or instructions in electronic form that are optionally printable. The instructions may refer to at least one suitable possible use of the kit.

Pharmaceutical Composition

The agents and compositions such as nucleic acids and cells described herein may be administered in the form of any suitable pharmaceutical composition.

The pharmaceutical compositions of the invention are preferably sterile and contain an effective amount of the agents described herein and optionally of further agents as discussed herein to generate the desired reaction or the desired effect.

Pharmaceutical compositions are usually provided in a uniform dosage form and may be prepared in a manner known per se. A pharmaceutical composition may e.g. be in the form of a solution or suspension.

A pharmaceutical composition may comprise salts, buffer substances, preservatives, carriers, diluents and/or excipients all of which are preferably pharmaceutically acceptable. The term “pharmaceutically acceptable” refers to the non-toxicity of a material which does not interact with the action of the active component of the pharmaceutical composition.

Salts which are not pharmaceutically acceptable may be used for preparing pharmaceutically acceptable salts and are included in the invention. Pharmaceutically acceptable salts of this kind comprise in a non limiting way those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, nitric, phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic acids, and the like. Pharmaceutically acceptable salts may also be prepared as alkali metal salts or alkaline earth metal salts, such as sodium salts, potassium salts or calcium salts.

Suitable buffer substances for use in a pharmaceutical composition include acetic acid in a salt, citric acid in a salt, boric acid in a salt and phosphoric acid in a salt.

Suitable preservatives for use in a pharmaceutical composition include benzalkonium chloride, chlorobutanol, paraben and thimerosal.

An injectible formulation may comprise a pharmaceutically acceptable excipient such as Ringer Lactate.

The term “carrier” refers to an organic or inorganic component, of a natural or synthetic nature, in which the active component is combined in order to facilitate, enhance or enable application. According to the invention, the term “carrier” also includes one or more compatible solid or liquid fillers, diluents or encapsulating substances, which are suitable for administration to a patient.

Possible carrier substances for parenteral administration are e.g. sterile water, Ringer, Ringer lactate, sterile sodium chloride solution, polyalkylene glycols, hydrogenated naphthalenes and, in particular, biocompatible lactide polymers, lactide/glycolide copolymers or polyoxyethylene/polyoxy-propylene copolymers.

The term “excipient” when used herein is intended to indicate all substances which may be present in a pharmaceutical composition and which are not active ingredients such as, e.g., carriers, binders, lubricants, thickeners, surface active agents, preservatives, emulsifiers, buffers, flavoring agents, or colorants.

In one embodiment, if the pharmaceutical composition comprises nucleic acids, it comprises at least one cationic entity. In general, cationic lipids, cationic polymers and other substances with positive charges may form complexes with negatively charged nucleic acids. It is possible to stabilize the RNA according to the invention by complexation with cationic compounds, preferably polycationic compounds such as for example a cationic or polycationic peptide or protein. In one embodiment, the pharmaceutical composition according to the present invention comprises at least one cationic molecule selected from the group consisting protamine, polyethylene imine, a poly-L-lysine, a poly-L-arginine, a histone or a cationic lipid.

According to the present invention, a cationic lipid is a cationic amphiphilic molecule, e.g., a molecule which comprises at least one hydrophilic and lipophilic moiety. The cationic lipid can be monocationic or polycationic. Cationic lipids typically have a lipophilic moiety, such as a sterol, an acyl or diacyl chain, and have an overall net positive charge. The head group of the lipid typically carries the positive charge. The cationic lipid preferably has a positive charge of 1 to 10 valences, more preferably a positive charge of 1 to 3 valences, and more preferably a positive charge of 1 valence. Examples of cationic lipids include, but are not limited to 1,2-di-O-octadecenyl-3-trimethylammonium propane (DOTMA); dimethyldioctadecylammonium (DDAB); 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP); 1,2-dioleoyl-3-dimethylammonium-propane (DODAP); 1,2-diacyloxy-3-dimethylammonium propanes; 1,2-dialkyloxy-3-dimethylammonium propanes; dioctadecyldimethyl ammonium chloride (DODAC), 1,2-dimyristoyloxypropyl-1,3-dimethylhydroxyethyl ammonium (DMRIE), and 2,3-dioleoyloxy-N-[2(spermine carboxamide)ethyl]-N,N-dimethyl-1-propanamium trifluoroacetate (DOSPA). Cationic lipids also include lipids with a tertiary amine group, including 1,2-dilinoleyloxy-N,N-dimethyl-3-aminopropane (DLinDMA). Cationic lipids are suitable for formulating RNA in lipid formulations as described herein, such as liposomes, emulsions and lipoplexes. Typically positive charges are contributed by at least one cationic lipid and negative charges are contributed by the RNA. In one embodiment, the pharmaceutical composition comprises at least one helper lipid, in addition to a cationic lipid. The helper lipid may be a neutral or an anionic lipid. The helper lipid may be a natural lipid, such as a phospholipid, or an analogue of a natural lipid, or a fully synthetic lipid, or lipid-like molecule, with no similarities with natural lipids. In the case where a pharmaceutical composition includes both a cationic lipid and a helper lipid, the molar ratio of the cationic lipid to the neutral lipid can be appropriately determined in view of stability of the formulation and the like.

In one embodiment, the pharmaceutical composition according to the present invention comprises protamine. According to the invention, protamine is useful as cationic carrier agent. The term “protamine” refers to any of various strongly basic proteins of relatively low molecular weight that are rich in arginine and are found associated especially with DNA in place of somatic histones in the sperm cells of animals such as fish. In particular, the term “protamine” refers to proteins found in fish sperm that are strongly basic, are soluble in water, are not coagulated by heat, and comprise multiple arginine monomers. According to the invention, the term “protamine” as used herein is meant to comprise any protamine amino acid sequence obtained or derived from native or biological sources including fragments thereof and multimeric forms of said amino acid sequence or fragment thereof. Furthermore, the term encompasses (synthesized) polypeptides which are artificial and specifically designed for specific purposes and cannot be isolated from native or biological sources.

The pharmaceutical composition according to the invention can be buffered, (e.g., with an acetate buffer, a citrate buffer, a succinate buffer, a Tris buffer, a phosphate buffer).

RNA-Containing Particles

In some embodiments, owing to the instability of non-protected RNA, it is advantageous to provide the RNA molecules of the present invention in complexed or encapsulated form. Respective pharmaceutical compositions are provided in the present invention. In particular, in some embodiments, the pharmaceutical composition of the present invention comprises nucleic acid-containing particles, preferably RNA-containing particles. Respective pharmaceutical compositions are referred to as particulate formulations. In particulate formulations according to the present invention, a particle comprises nucleic acid according to the invention and a pharmaceutically acceptable carrier or a pharmaceutically acceptable vehicle that is suitable for delivery of the nucleic acid. The nucleic acid-containing particles may be, for example, in the form of proteinaceous particles or in the form of lipid-containing particles. Suitable proteins or lipids are referred to as particle forming agents. Proteinaceous particles and lipid-containing particles have been described previously to be suitable for delivery of alphaviral RNA in particulate form (e.g. Strauss & Strauss, Microbiol. Rev., 1994, vol. 58, pp. 491-562). In particular, alphavirus structural proteins (provided e.g. by a helper virus) are a suitable carrier for delivery of RNA in the form of proteinaceous particles.

When the system according to the present invention is formulated as a particulate formulation, it is possible that each RNA species (e.g. replicon, replicase construct, and optional additional RNA species such as an RNA encoding a protein suitable for inhibiting IFN) is separately formulated as an individual particulate formulation. In that case, each individual particulate formulation will comprise one RNA species. The individual particulate formulations may be present as separate entities, e.g. in separate containers. Such formulations are obtainable by providing each RNA species separately (typically each in the form of an RNA-containing solution) together with a particle-forming agent, thereby allowing the formation of particles. Respective particles will contain exclusively the specific RNA species that is being provided when the particles are formed (individual particulate formulations).

In one embodiment, a pharmaceutical composition according to the invention comprises more than one individual particle formulation. Respective pharmaceutical compositions are referred to as mixed particulate formulations. Mixed particulate formulations according to the invention are obtainable by forming, separately, individual particulate formulations, as described above, followed by a step of mixing of the individual particulate formulations. By the step of mixing, one formulation comprising a mixed population of RNA-containing particles is obtainable (for illustration: e.g. a first population of particles may contain replicon according to the invention, and a second formulation of particles may contain replicase construct according to the invention). Individual particulate populations may be together in one container, comprising a mixed population of individual particulate formulations.

Alternatively, it is possible that all RNA species of the pharmaceutical composition (e.g. replicon, replicase construct, and optional additional species such as RNA encoding a protein suitable for inhibiting IFN) are formulated together as a combined particulate formulation. Such formulations are obtainable by providing a combined formulation (typically combined solution) of all RNA species together with a particle-forming agent, thereby allowing the formation of particles. As opposed to a mixed particulate formulation, a combined particulate formulation will typically comprise particles which comprise more than one RNA species. In a combined particulate composition different RNA species are typically present together in a single particle.

In one embodiment, the particulate formulation of the present invention is a nanoparticulate formulation. In that embodiment, the composition according to the present invention comprises nucleic acid according to the invention in the form of nanoparticles. Nanoparticulate formulations can be obtained by various protocols and with various complexing compounds. Lipids, polymers, oligomers, or amphipiles are typical constituents of nanoparticulate formulations.

As used herein, the term “nanoparticle” refers to any particle having a diameter making the particle suitable for systemic, in particular parenteral, administration, of, in particular, nucleic acids, typically a diameter of 1000 nanometers (nm) or less. In one embodiment, the nanoparticles have an average diameter in the range of from about 50 nm to about 1000 nm, preferably from about 50 nm to about 400 nm, preferably about 100 nm to about 300 nm such as about 150 nm to about 200 nm. In one embodiment, the nanoparticles have a diameter in the range of about 200 to about 700 nm, about 200 to about 600 nm, preferably about 250 to about 550 nm, in particular about 300 to about 500 nm or about 200 to about 400 nm.

In one embodiment, the polydispersity index (PI) of the nanoparticles described herein, as measured by dynamic light scattering, is 0.5 or less, preferably 0.4 or less or even more preferably 0.3 or less. The “polydispersity index” (PI) is a measurement of homogeneous or heterogeneous size distribution of the individual particles (such as liposomes) in a particle mixture and indicates the breadth of the particle distribution in a mixture. The PI can be determined, for example, as described in WO 2013/143555 A1.

As used herein, the term “nanoparticulate formulation” or similar terms refer to any particulate formulation that contains at least one nanoparticle. In some embodiments, a nanoparticulate composition is a uniform collection of nanoparticles. In some embodiments, a nanoparticulate composition is a lipid-containing pharmaceutical formulation, such as a liposome formulation or an emulsion.

Lipid-Containing Pharmaceutical Compositions

In one embodiment, the pharmaceutical composition of the present invention comprises at least one lipid. Preferably, at least one lipid is a cationic lipid. Said lipid-containing pharmaceutical composition comprises nucleic acid according to the present invention. In one embodiment, the pharmaceutical composition according to the invention comprises RNA encapsulated in a vesicle, e.g. in a liposome. In one embodiment, the pharmaceutical composition according to the invention comprises RNA in the form of an emulsion. In one embodiment, the pharmaceutical composition according to the invention comprises RNA in a complex with a cationic compound, thereby forming e.g. so-called lipoplexes or polyplexes. Encapsulation of RNA within vesicles such as liposomes is distinct from, for instance, lipid/RNA complexes. Lipid/RNA complexes are obtainable e.g. when RNA is e.g. mixed with pre-formed liposomes.

In one embodiment, the pharmaceutical composition according to the invention comprises RNA encapsulated in a vesicle. Such formulation is a particular particulate formulation according to the invention. A vesicle is a lipid bilayer rolled up into a spherical shell, enclosing a small space and separating that space from the space outside the vesicle. Typically, the space inside the vesicle is an aqueous space, i.e. comprises water. Typically, the space outside the vesicle is an aqueous space, i.e. comprises water. The lipid bilayer is formed by one or more lipids (vesicle-forming lipids). The membrane enclosing the vesicle is a lamellar phase, similar to that of the plasma membrane. The vesicle according to the present invention may be a multilamellar vesicle, a unilamellar vesicle, or a mixture thereof. When encapsulated in a vesicle, the RNA is typically separated from any external medium. Thus it is present in protected form, functionally equivalent to the protected form in a natural alphavirus. Suitable vesicles are particles, particularly nanoparticles, as described herein.

For example, RNA may be encapsulated in a liposome. In that embodiment, the pharmaceutical composition is or comprises a liposome formulation. Encapsulation within a liposome will typically protect RNA from RNase digestion. It is possible that the liposomes include some external RNA (e.g. on their surface), but at least half of the RNA (and ideally all of it) is encapsulated within the core of the liposome.

Liposomes are microscopic lipidic vesicles often having one or more bilayers of a vesicle-forming lipid, such as a phospholipid, and are capable of encapsulating a drug, e.g. RNA. Different types of liposomes may be employed in the context of the present invention, including, without being limited thereto, multilamellar vesicles (MLV), small unilamellar vesicles (SUV), large unilamellar vesicles (LUV), sterically stabilized liposomes (SSL), multivesicular vesicles (MV), and large multivesicular vesicles (LMV) as well as other bilayered forms known in the art. The size and lamellarity of the liposome will depend on the manner of preparation. There are several other forms of supramolecular organization in which lipids may be present in an aqueous medium, comprising lamellar phases, hexagonal and inverse hexagonal phases, cubic phases, micelles, reverse micelles composed of monolayers. These phases may also be obtained in the combination with DNA or RNA, and the interaction with RNA and DNA may substantially affect the phase state. Such phases may be present in nanoparticulate RNA formulations of the present invention.

Liposomes may be formed using standard methods known to the skilled person. Respective methods include the reverse evaporation method, the ethanol injection method, the dehydration-rehydration method, sonication or other suitable methods. Following liposome formation, the liposomes can be sized to obtain a population of liposomes having a substantially homogeneous size range.

In a preferred embodiment of the present invention, the RNA is present in a liposome which includes at least one cationic lipid. Respective liposomes can be formed from a single lipid or from a mixture of lipids, provided that at least one cationic lipid is used. Preferred cationic lipids have a nitrogen atom which is capable of being protonated; preferably, such cationic lipids are lipids with a tertiary amine group. A particularly suitable lipid with a tertiary amine group is 1,2-dilinoleyloxy-N,N-dimethyl-3-aminopropane (DLinDMA). In one embodiment, the RNA according to the present invention is present in a liposome formulation as described in WO 2012/006378 A1: a liposome having a lipid bilayer encapsulating an aqueous core including RNA, wherein the lipid bilayer comprises a lipid with a pKa in the range of 5.0 to 7.6, which preferably has a tertiary amine group. Preferred cationic lipids with a tertiary amine group include DLinDMA (pKa 5.8) and are generally described in WO 2012/031046 A2. According to WO 2012/031046 A2, liposomes comprising a respective compound are particularly suitable for encapsulation of RNA and thus liposomal delivery of RNA. In one embodiment, the RNA according to the present invention is present in a liposome formulation, wherein the liposome includes at least one cationic lipid whose head group includes at least one nitrogen atom (N) which is capable of being protonated, wherein the liposome and the RNA have a N:P ratio of between 1:1 and 20:1. According to the present invention, “N:P ratio” refers to the molar ratio of nitrogen atoms (N) in the cationic lipid to phosphate atoms (P) in the RNA comprised in a lipid containing particle (e.g. liposome), as described in WO 2013/006825 A1. The N:P ratio of between 1:1 and 20:1 is implicated in the net charge of the liposome and in efficiency of delivery of RNA to a vertebrate cell.

In one embodiment, the RNA according to the present invention is present in a liposome formulation that comprises at least one lipid which includes a polyethylene glycol (PEG) moiety, wherein RNA is encapsulated within a PEGylated liposome such that the PEG moiety is present on the liposome's exterior, as described in WO 2012/031043 A1 and WO 2013/033563 A1.

In one embodiment, the RNA according to the present invention is present in a liposome formulation, wherein the liposome has a diameter in the range of 60-180 nm, as described in WO 2012/030901 A1.

In one embodiment, the RNA according to the present invention is present in a liposome formulation, wherein the RNA-containing liposomes have a net charge close to zero or negative, as disclosed in WO 2013/143555 A1.

In other embodiments, the RNA according to the present invention is present in the form of an emulsion. Emulsions have been previously described to be used for delivery of nucleic acid molecules, such as RNA molecules, to cells. Preferred herein are oil-in-water emulsions. The respective emulsion particles comprise an oil core and a cationic lipid. More preferred are cationic oil-in-water emulsions in which the RNA according to the present invention is complexed to the emulsion particles. The emulsion particles comprise an oil core and a cationic lipid. The cationic lipid can interact with the negatively charged RNA, thereby anchoring the RNA to the emulsion particles. In an oil-in-water emulsion, emulsion particles are dispersed in an aqueous continuous phase. For example, the average diameter of the emulsion particles may typically be from about 80 nm to 180 nm. In one embodiment, the pharmaceutical composition of the present invention is a cationic oil-in-water emulsion, wherein the emulsion particles comprise an oil core and a cationic lipid, as described in WO 2012/006380 A2. The RNA according to the present invention may be present in the form of an emulsion comprising a cationic lipid wherein the N:P ratio of the emulsion is at least 4:1, as described in WO 2013/006834 A1. The RNA according to the present invention may be present in the form of a cationic lipid emulsion, as described in WO 2013/006837 A1. In particular, the composition may comprise RNA complexed with a particle of a cationic oil-in-water emulsion, wherein the ratio of oil/lipid is at least about 8:1 (mole:mole).

In other embodiments, the pharmaceutical composition according to the invention comprises RNA in the format of a lipoplex. The term, “lipoplex” or “RNA lipoplex” refers to a complex of lipids and nucleic acids such as RNA. Lipoplexes can be formed of cationic (positively charged) liposomes and the anionic (negatively charged) nucleic acid. The cationic liposomes can also include a neutral “helper” lipid. In the simplest case, the lipoplexes form spontaneously by mixing the nucleic acid with the liposomes with a certain mixing protocol, however various other protocols may be applied. It is understood that electrostatic interactions between positively charged liposomes and negatively charged nucleic acid are the driving force for the lipoplex formation (WO 2013/143555 A1). In one embodiment of the present invention, the net charge of the RNA lipoplex particles is close to zero or negative. It is known that electro-neutral or negatively charged lipoplexes of RNA and liposomes lead to substantial RNA expression in spleen dendritic cells (DCs) after systemic administration and are not associated with the elevated toxicity that has been reported for positively charged liposomes and lipoplexes (cf. WO 2013/143555 A1). Therefore, in one embodiment of the present invention, the pharmaceutical composition according to the invention comprises RNA in the format of nanoparticles, preferably lipoplex nanoparticles, in which (i) the number of positive charges in the nanoparticles does not exceed the number of negative charges in the nanoparticles and/or (ii) the nanoparticles have a neutral or net negative charge and/or (iii) the charge ratio of positive charges to negative charges in the nanoparticles is 1.4:1 or less and/or (iv) the zeta potential of the nanoparticles is 0 or less. As described in WO 2013/143555 A1, zeta potential is a scientific term for electrokinetic potential in colloidal systems. In the present invention, (a) the zeta potential and (b) the charge ratio of the cationic lipid to the RNA in the nanoparticles can both be calculated as disclosed in WO 2013/143555 A1. In summary, pharmaceutical compositions which are nanoparticulate lipoplex formulations with a defined particle size, wherein the net charge of the particles is close to zero or negative, as disclosed in WO 2013/143555 A1, are preferred pharmaceutical compositions in the context of the present invention.

Therapeutic Treatments

In view of the capacity to be administered to a subject, each of the RNA replicon according to the invention, the system according to the invention, the DNA according to the invention, the cell according to the invention, the kit according to the invention, or the pharmaceutical composition according to the invention, may be referred to as “medicament”, or the like. The present invention foresees that the RNA replicon, the system, the DNA, the cell, the kit, or the pharmaceutical composition of the present invention are provided for use as a medicament.

The medicament can be used to treat a subject. By “treat” is meant to administer a compound or composition or other entity as described herein to a subject. The term includes methods for treatment of the human or animal body by therapy.

The term “treatment” or “therapeutic treatment” preferably relates to any treatment which improves the health status and/or prolongs (increases) the lifespan of an individual. Said treatment may eliminate the disease in an individual, arrest or slow the development of a disease in an individual, inhibit or slow the development of a disease in an individual, decrease the frequency or severity of symptoms in an individual, and/or decrease the recurrence in an individual who currently has or who previously has had a disease.

The terms “prophylactic treatment” or “preventive treatment” relate to any treatment that is intended to prevent a disease from occurring in an individual. The terms “prophylactic treatment” or “preventive treatment” are used herein interchangeably.

In particular, cells, in particular immune effector cells such as T cells engineered to express a T cell receptor or an artificial T cell receptor described herein are useful for providing an immune response in a subject and, in particular, in the treatment of diseases characterized by expression of antigens targeted by the T cell receptor or artificial T cell receptor.

“Providing an immune response” may mean that there was no immune response against a particular target antigen, target cell and/or target tissue before providing an immune response, but it may also mean that there was a certain level of immune response against a particular target antigen, target cell and/or target tissue before providing an immune response and after providing an immune response said immune response is enhanced. Thus, “providing an immune response” includes “inducing an immune response” and “enhancing an immune response”. Preferably, after providing an immune response in a subject, said subject is protected from developing a disease such as a cancer disease or the disease condition is ameliorated by providing an immune response. For example, an immune response against a tumor antigen may be provided in a patient having a cancer disease or in a subject being at risk of developing a cancer disease. Providing an immune response in this case may mean that the disease condition of the subject is ameliorated, that the subject does not develop metastases, or that the subject being at risk of developing a cancer disease does not develop a cancer disease.

According to the various aspects of the invention, the aim is preferably to provide an immune response against diseased cells expressing an antigen such as cancer cells expressing a tumor antigen, and to treat a disease such as a cancer disease involving cells expressing an antigen such as a tumor antigen.

Antigen-specific immune cells described herein can be administered to a patient for preventing or treating a disease, which disease is characterized by expression of an antigen that can be bound by an antigen receptor expressed in the immune cells. Such immune cells can be used for the selective eradication of cells expressing an antigen, as well as for immunization or vaccination against a disease wherein an antigen is expressed, which antigen can be bound by an antigen receptor expressed in the immune cells.

In one embodiment, a method of treating or preventing a disease comprises administering to a patient an effective amount of a nucleic acid encoding an antigen receptor of the invention, in which the antigen receptor is able to bind an antigen that is associated with the disease (e.g., a viral or tumor antigen) to be treated or prevented. In another embodiment, a method of treating or preventing a disease comprises administering to a patient an effective amount of recombinant immune effector cells or an expanded population of said immune effector cells, which immune effector cells or population of cells recombinantly express an antigen receptor, in which the antigen receptor is able to bind an antigen that is associated with the disease to be treated or prevented. In preferred embodiments, the disease is cancer and the antigen is a tumor associated antigen.

In another embodiment, the present invention provides for a method of immunizing or vaccinating against a disease associated with a specific antigen or against a disease-causing organism expressing a specific antigen, which method comprises administering to a patient an effective amount of a nucleic acid encoding an antigen receptor of the invention, in which the antigen receptor is able to bind the specific antigen. In another embodiment, the present invention provides for a method of immunizing or vaccinating against a disease associated with a specific antigen or against a disease-causing organism expressing a specific antigen, which method comprises administering to a patient an effective amount of recombinant immune effector cells or an expanded population of said immune effector cells, which immune effector cells or population of cells recombinantly express an antigen receptor, in which the antigen receptor is able to bind to the specific antigen.

In certain embodiments, the population of immune effector cells can be a clonally expanded population. The recombinant immune effector cells or populations thereof provide for therapeutic or prophylactic immune effector function in an antigen-specific manner. Preferably, an antigen receptor is expressed on the cell surface of the immune effector cells.

Accordingly, the agents, compositions and methods described herein can be used to treat a subject with a disease, e.g., a disease characterized by the presence of diseased cells expressing an antigen. Particularly preferred diseases are cancer diseases. The agents, compositions and methods described herein may also be used for immunization or vaccination to prevent a disease described herein.

The term “disease” refers to an abnormal condition that affects the body of an individual. A disease is often construed as a medical condition associated with specific symptoms and signs. A disease may be caused by factors originally from an external source, such as infectious disease, or it may be caused by internal dysfunctions, such as autoimmune diseases. In humans, “disease” is often used more broadly to refer to any condition that causes pain, dysfunction, distress, social problems, or death to the individual afflicted, or similar problems for those in contact with the individual. In this broader sense, it sometimes includes injuries, disabilities, disorders, syndromes, infections, isolated symptoms, deviant behaviors, and atypical variations of structure and function, while in other contexts and for other purposes these may be considered distinguishable categories. Diseases usually affect individuals not only physically, but also emotionally, as contracting and living with many diseases can alter one's perspective on life, and one's personality. According to the invention, the term “disease” includes infectious diseases and cancer diseases, in particular those forms of cancer described herein. Any reference herein to cancer or particular forms of cancer also includes cancer metastasis thereof.

A disease to be treated according to the invention is preferably a disease involving cells characterized by expression of an antigen. “Disease involving cells characterized by expression of an antigen” or similar expressions means according to the invention that the antigen is expressed in cells of a diseased tissue or organ. Expression in cells of a diseased tissue or organ may be increased compared to the state in a healthy tissue or organ. In one embodiment, expression is only found in a diseased tissue, while expression in a healthy tissue is not found, e.g. expression is repressed. According to the invention, diseases involving cells characterized by expression of an antigen include infectious diseases and cancer diseases, wherein the disease-associated antigen is preferably an antigen of the infectious agent and a tumor antigen, respectively.

The term “healthy” or “normal” refer to non-pathological conditions, and preferably means non-infected or non-cancerous.

In embodiments of the invention, “T cell receptor or artificial T cell receptor targeting an antigen” or similar terms include a T cell receptor binding to processed antigen, i.e. a T cell epitope presented in the context of MHC, an artificial T cell receptor binding to antigen expressed on the cell surface and an artificial T cell receptor binding to processed antigen, i.e. a T cell epitope presented in the context of HMC. Binding of the T cell receptor or artificial T cell receptor, when present on an immune effector cell such as a T cell, preferably results in the stimulation, priming and/or expansion of the immune effector cell and in the immune effector cell exerting effector functions as described herein.

Embodiments of the invention involving the use of T cell receptors generally aim at targeting antigen expressing cells through the recognition of antigen processing products, i.e. antigen epitopes or T cell epitopes, presented on the cell surface in the context of MHC molecules. Embodiments of the invention involving the use of artificial T cell receptors generally aim at targeting antigen expressing cells through the recognition of antigen expressed on the cell surface (e.g., if the artificial T cell receptor comprises an antigen binding domain from an antibody) or antigen processing products, i.e. antigen epitopes or T cell epitopes, presented on the cell surface in the context of MHC molecules (e.g., if the artificial T cell receptor comprises a binding domain from a T cell receptor). Preferably, the antigen or epitope if recognized by a T cell receptor or an artificial T cell receptor is able to induce in the presence of appropriate co-stimulatory signals, clonal expansion of the T cell carrying the T cell receptor or artificial T cell receptor recognizing the antigen or epitope.

Antigens targeted according to the invention, may be antigens derived from pathogens or tumor antigens. Accordingly, diseases which may be treated according to the invention are those caused by pathogens or cancer.

In a preferred embodiment, an antigen is a disease-specific antigen or disease-associated antigen. The term “disease-specific antigen” or “disease-associated antigen” refers to all antigens that are of pathological significance. In one particularly preferred embodiment, the antigen is present in diseased cells, tissues and/or organs while it is not present or present in a reduced amount in healthy cells, tissues and/or organs and, thus, can be used for targeting diseased cells, tissues and/or organs, e.g. by T cells carrying an antigen receptor (T cell receptor or artificial T cell receptor) targeting the antigen. In one embodiment, a disease-specific antigen or disease-associated antigen is present on the surface of a diseased cell.

The term “pathogen” refers to pathogenic biological material capable of causing disease in an organism, preferably a vertebrate organism. Pathogens include microorganisms such as bacteria, unicellular eukaryotic organisms (protozoa), fungi, as well as viruses.

Examples for pathogenic viruses are human immunodeficiency virus (HIV), cytomegalovirus (CMV), herpes virus (HSV), hepatitis A-virus (HAV), HBV, HCV, papilloma virus, and human T-lymphotrophic virus (HTLV). Unicellular organisms comprise plasmodia, trypanosomes, amoeba, etc.

Pathogenic unicellular eukaryotic parasites may be e.g. from the genus Plasmodium, e.g. P. falciparum, P. vivax, P. malariae or P. ovale, from the genus Leishmania, or from the genus Trypanosoma, e.g. T. cruzi or T. brucei.

In a preferred embodiment, an antigen is a tumor antigen or tumor-associated antigen, i.e., a constituent of cancer cells which may be derived from the cytoplasm, the cell surface and the cell nucleus, in particular those antigens which are produced, preferably in large quantity, as surface antigens on cancer cells.

In the context of the present invention, the term “tumor antigen” or “tumor-associated antigen” relates to proteins that are under normal conditions specifically expressed in a limited number of tissues and/or organs or in specific developmental stages, for example, the tumor antigen may be under normal conditions specifically expressed in stomach tissue, preferably in the gastric mucosa, in reproductive organs, e.g., in testis, in trophoblastic tissue, e.g., in placenta, or in germ line cells, and are expressed or aberrantly expressed in one or more tumor or cancer tissues. In this context, “a limited number” preferably means not more than 3, more preferably not more than 2. The tumor antigens in the context of the present invention include, for example, differentiation antigens, preferably cell type specific differentiation antigens, i.e., proteins that are under normal conditions specifically expressed in a certain cell type at a certain differentiation stage, cancer/testis antigens, i.e., proteins that are under normal conditions specifically expressed in testis and sometimes in placenta, and germ line specific antigens. In the context of the present invention, the tumor antigen is preferably associated with the cell surface of a cancer cell and is preferably not or only rarely expressed in normal tissues. Preferably, the tumor antigen or the aberrant expression of the tumor antigen identifies cancer cells. In the context of the present invention, the tumor antigen that is expressed by a cancer cell in a subject, e.g., a patient suffering from a cancer disease, is preferably a self-protein in said subject. In preferred embodiments, the tumor antigen in the context of the present invention is expressed under normal conditions specifically in a tissue or organ that is non-essential, i.e., tissues or organs which when damaged by the immune system do not lead to death of the subject, or in organs or structures of the body which are not or only hardly accessible by the immune system. Preferably, the amino acid sequence of the tumor antigen is identical between the tumor antigen which is expressed in normal tissues and the tumor antigen which is expressed in cancer tissues.

Examples for tumor antigens that may be useful in the present invention are p53, ART-4, BAGE, beta-catenin/m, Bcr-abL CAMEL, CAP-1, CASP-8, CDC27/m, CDK4/m, CEA, the cell surface proteins of the claudin family, such as CLAUDIN-6, CLAUDIN-18.2 and CLAUDIN-12, c-MYC, CT, Cyp-B, DAM, ELF2M, ETV6-AML1, G250, GAGE, GnT-V, Gap100, HAGE, HER-2/neu, HPV-E7, HPV-E6, HAST-2, hTERT (or hTRT), LAGE, LDLR/FUT, MAGE-A, preferably MAGE-A1, MAGE-A2, MAGE-A3, MAGE-A4, MAGE-A5, MAGE-A6, MAGE-A7, MAGE-A8, MAGE-A9, MAGE-A10, MAGE-A11, or MAGE-A12, MAGE-B, MAGE-C, MART-1/Melan-A, MC1R, Myosin/m, MUC1, MUM-1, -2, -3, NA88-A, NF1, NY-ESO-1, NY-BR-1, p190 minor BCR-abL, Pm1/RARa, PRAME, proteinase 3, PSA, PSM, RAGE, RU1 or RU2, SAGE, SART-1 or SART-3, SCGB3A2, SCP1, SCP2, SCP3, SSX, SURVIVIN, TEL/AML1, TPI/m, TRP-1, TRP-2, TRP-2/INT2, TPTE and WT. Particularly preferred tumor antigens include CLAUDIN-18.2 (CLDN18.2) and CLAUDIN-6 (CLDN6).

The term “CLDN” or simply “CI” as used herein means claudin and includes CLDN6 and CLDN18.2. Preferably, a claudin is a human claudin. Claudins are a family of proteins that are the most important components of tight junctions, where they establish the paracellular barrier that controls the flow of molecules in the intercellular space between cells of an epithelium. Claudins are transmembrane proteins spanning the membrane 4 times with the N-terminal and the C-terminal end both located in the cytoplasm. The first extracellular loop, termed EC1 or ECL1, consists on average of 53 amino acids, and the second extracellular loop, termed EC2 or ECL2, consists of around 24 amino acids. Cell surface proteins of the claudin family are expressed in tumors of various origins, and are particularly suited as target structures in connection with targeted cancer immunotherapy due to their selective expression (no expression in a toxicity relevant normal tissue) and localization to the plasma membrane.

CLDN6 and CLDN18.2 have been identified as differentially expressed in tumor tissues, with the only normal tissue expressing CLDN18.2 being stomach (differentiated epithelial cells of the gastric mucosa) and the only normal tissue expressing CLDN6 being placenta.

CLDN18.2 is expressed in cancers of various origins such as pancreatic carcinoma, esophageal carcinoma, gastric carcinoma, bronchial carcinoma, breast carcinoma, and ENT tumors. CLDN18.2 is a valuable target for the prevention and/or treatment of primary tumors, such as gastric cancer, esophageal cancer, pancreatic cancer, lung cancer such as non small cell lung cancer (NSCLC), ovarian cancer, colon cancer, hepatic cancer, head-neck cancer, and cancers of the gallbladder, and metastases thereof, in particular gastric cancer metastasis such as Krukenberg tumors, peritoneal metastasis, and lymph node metastasis. Antigen receptors targeting at least CLDN18.2 are useful in treating such cancer diseases.

CLDN6 has been found to be expressed, for example, in ovarian cancer, lung cancer, gastric cancer, breast cancer, hepatic cancer, pancreatic cancer, skin cancer, melanomas, head neck cancer, sarcomas, bile duct cancer, renal cell cancer, and urinary bladder cancer. CLDN6 is a particularly preferred target for the prevention and/or treatment of ovarian cancer, in particular ovarian adenocarcinoma and ovarian teratocarcinoma, lung cancer, including small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC), in particular squamous cell lung carcinoma and adenocarcinoma, gastric cancer, breast cancer, hepatic cancer, pancreatic cancer, skin cancer, in particular basal cell carcinoma and squamous cell carcinoma, malignant melanoma, head and neck cancer, in particular malignant pleomorphic adenoma, sarcoma, in particular synovial sarcoma and carcinosarcoma, bile duct cancer, cancer of the urinary bladder, in particular transitional cell carcinoma and papillary carcinoma, kidney cancer, in particular renal cell carcinoma including clear cell renal cell carcinoma and papillary renal cell carcinoma, colon cancer, small bowel cancer, including cancer of the ileum, in particular small bowel adenocarcinoma and adenocarcinoma of the ileum, testicular embryonal carcinoma, placental choriocarcinoma, cervical cancer, testicular cancer, in particular testicular seminoma, testicular teratoma and embryonic testicular cancer, uterine cancer, germ cell tumors such as a teratocarcinoma or an embryonal carcinoma, in particular germ cell tumors of the testis, and the metastatic forms thereof. Antigen receptors targeting at least CLDN6 are useful in treating such cancer diseases.

The terms “cancer disease” or “cancer” refer to or describe the physiological condition in an individual that is typically characterized by unregulated cell growth. Examples of cancers include, but are not limited to, carcinoma, lymphoma, blastoma, sarcoma, and leukemia. More particularly, examples of such cancers include bone cancer, blood cancer, lung cancer, liver cancer, pancreatic cancer, skin cancer, cancer of the head or neck, cutaneous or intraocular melanoma, uterine cancer, ovarian cancer, rectal cancer, cancer of the anal region, stomach cancer, colon cancer, breast cancer, prostate cancer, uterine cancer, carcinoma of the sexual and reproductive organs, Hodgkin's Disease, cancer of the esophagus, cancer of the small intestine, cancer of the endocrine system, cancer of the thyroid gland, cancer of the parathyroid gland, cancer of the adrenal gland, sarcoma of soft tissue, cancer of the bladder, cancer of the kidney, renal cell carcinoma, carcinoma of the renal pelvis, neoplasms of the central nervous system (CNS), neuroectodermal cancer, spinal axis tumors, glioma, meningioma, and pituitary adenoma. The term “cancer” according to the invention also comprises cancer metastases. Preferably, a “cancer disease” is characterized by cells expressing a tumor antigen and a cancer cell expresses a tumor antigen.

In one embodiment, a cancer disease is a malignant disease which is characterized by the properties of anaplasia, invasiveness, and metastasis. A malignant tumor may be contrasted with a non-cancerous benign tumor in that a malignancy is not self-limited in its growth, is capable of invading into adjacent tissues, and may be capable of spreading to distant tissues (metastasizing), while a benign tumor has none of those properties.

According to the invention, the term “tumor” or “tumor disease” refers to a swelling or lesion formed by an abnormal growth of cells (called neoplastic cells or tumor cells). By “tumor cell” is meant an abnormal cell that grows by a rapid, uncontrolled cellular proliferation and continues to grow after the stimuli that initiated the new growth cease. Tumors show partial or complete lack of structural organization and functional coordination with the normal tissue, and usually form a distinct mass of tissue, which may be either benign, pre-malignant or malignant.

By “metastasis” is meant the spread of cancer cells from its original site to another part of the body. The formation of metastasis is a very complex process and depends on detachment of malignant cells from the primary tumor, invasion of the extracellular matrix, penetration of the endothelial basement membranes to enter the body cavity and vessels, and then, after being transported by the blood, infiltration of target organs. Finally, the growth of a new tumor at the target site depends on angiogenesis. Tumor metastasis often occurs even after the removal of the primary tumor because tumor cells or components may remain and develop metastatic potential. In one embodiment, the term “metastasis” according to the invention relates to “distant metastasis” which relates to a metastasis which is remote from the primary tumor and the regional lymph node system. In one embodiment, the term “metastasis” according to the invention relates to lymph node metastasis.

A relapse or recurrence occurs when a person is affected again by a condition that affected them in the past. For example, if a patient has suffered from a tumor disease, has received a successful treatment of said disease and again develops said disease said newly developed disease may be considered as relapse or recurrence. However, according to the invention, a relapse or recurrence of a tumor disease may but does not necessarily occur at the site of the original tumor disease. Thus, for example, if a patient has suffered from ovarian tumor and has received a successful treatment a relapse or recurrence may be the occurrence of an ovarian tumor or the occurrence of a tumor at a site different to ovary. A relapse or recurrence of a tumor also includes situations wherein a tumor occurs at a site different to the site of the original tumor as well as at the site of the original tumor. Preferably, the original tumor for which the patient has received a treatment is a primary tumor and the tumor at a site different to the site of the original tumor is a secondary or metastatic tumor.

Infectious diseases that can be treated or prevented by the present invention are caused by infectious agents including, but not limited to, viruses, bacteria, fungi, protozoa, helminths, and parasites.

Infectious viruses of both human and non-human vertebrates, include retroviruses, RNA viruses and DNA viruses. Examples of virus that have been found in humans include but are not limited to: Retroviridae (e.g., human immunodeficiency viruses, such as HIV-1 (also referred to as HTLV-III, LAV or HTLV-III/LAV, or HIV-III; and other isolates, such as HIV-LP; Picornaviridae (e.g., polio viruses, hepatitis A virus; enteroviruses, human Coxsackie viruses, rhinoviruses, echoviruses); Calciviridae (e.g., strains that cause gastroenteritis); Togaviridae (e.g., equine encephalitis viruses, rubella viruses); Flaviridae (e.g., dengue viruses, encephalitis viruses, yellow fever viruses); Coronaviridae (e.g., coronaviruses); Rhabdoviridae (e.g., vesicular stomatitis viruses, rabies viruses); Filoviridae (e.g., ebola viruses); Paramyxoviridae (e.g., parainfluenza viruses, mumps virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (e.g., influenza viruses); Bungaviridae (e.g., Hanta viruses, bunga viruses, phleboviruses and Nairo viruses); Arena viridae (hemorrhagic fever viruses); Reoviridae (e.g., reoviruses, orbiviurses and rotaviruses); Birnaviridae; Hepadnaviridae (Hepatitis B virus); Parvovirida (parvoviruses); Papovaviridae (papilloma viruses, polyoma viruses); Adenoviridae (most adenoviruses); Herpesviridae (herpes simplex virus (HSV) 1 and 2, varicella zoster virus, cytomegalovirus (CMV), herpes virus; Poxyiridae (variola viruses, vaccinia viruses, pox viruses); and Iridoviridae (e.g., African swine fever virus); and unclassified viruses (e.g., the etiological agents of Spongiform encephalopathies, the agent of delta hepatitis (thought to be a defective satellite of hepatitis B virus), the agents of non-A, non-B hepatitis (class 1=internally transmitted; class 2-parenterally transmitted (i.e., Hepatitis C); Norwalk and related viruses, and astroviruses).

Retroviruses that are contemplated include both simple retroviruses and complex retroviruses. The complex retroviruses include the subgroups of lentiviruses, T cell leukemia viruses and the foamy viruses. Lentiviruses include HIV-1, but also include HIV-2, SIV, Visna virus, feline immunodeficiency virus (FIV), and equine infectious anemia virus (EIAV). The T cell leukemia viruses include HTLV-1, HTLV-II, simian T cell leukemia virus (STLV), and bovine leukemia virus (BLV). The foamy viruses include human foamy virus (HFV), simian foamy virus (SFV) and bovine foamy virus (BFV).

Bacterial infections or diseases that can be treated or prevented by the present invention are caused by bacteria including, but not limited to, bacteria that have an intracellular stage in its life cycle, such as mycobacteria (e.g., Mycobacteria tuberculosis, M. bovis, M. avium, M leprae, or M. africanum), rickettsia, mycoplasma, chlamydia, and legionella. Other examples of bacterial infections contemplated include but are not limited to infections caused by Gram positive bacillus (e.g., Listeria, Bacillus such as Bacillus anthracis, Erysipelothrix species), Gram negative bacillus (e.g., Bartonella, Brucella, Campylobacter, Enterobacter, Escherichia, Francisella, Hemophilus, Klebsiella, Morganella, Proteus, Providencia, Pseudomonas, Salmonella, Serratia, Shigella, Vibrio, and Yersinia species), spirochete bacteria (e.g., Borrelia species including Borrelia burgdorferi that causes Lyme disease), anaerobic bacteria (e.g., Actinomyces and Clostridium species), Gram positive and negative coccal bacteria, Enterococcus species, Streptococcus species, Pneumococcus species, Staphylococcus species, Neisseria species. Specific examples of infectious bacteria include but are not limited to: Helicobacter pyloris, Borelia burgdorferi, Legionella pneumophilia, Mycobacteria tuberculosis, M. avium, M. intracellulare, M. kansaii, M. gordonae, Staphylococcus aureus, Neisseria gonorrhoeae, Neisseria meningitidis, Listeria monocytogenes, Streptococcus pyogenes (Group A Streptococcus), Streptococcus agalactiae (Group B Streptococcus), Streptococcus viridans, Streptococcus aecalis, Streptococcus bovis, Streptococcus pneumoniae, Haemophilus influenzae, Bacillus antracis, Corynebacterium diphtheriae, Erysipelothrix rhusiopathiae, Clostridium perfringers, Clostridium tetani, Enterobacter aerogenes, Klebsiella pneumoniae, Pasteurella multocida, Fusobacterium nucleatuin, Streptobacillus moniliformis, Treponema pallidium, Treponema pertenue, Leptospira, Rickettsia, and Actinoyyces israelli.

Fungal diseases that can be treated or prevented by the present invention include but are not limited to aspergilliosis, crytococcosis, sporotrichosis, coccidioidomycosis, paracoccidioidomycosis, histoplasmosis, blastomycosis, zygomycosis, and candidiasis.

Parasitic diseases that can be treated or prevented by the present invention include, but are not limited to, amebiasis, malaria, leishmania, coccidia, giardiasis, cryptosporidiosis, toxoplasmosis, and trypanosomiasis. Also encompassed are infections by various worms, such as but not limited to ascariasis, ancylostomiasis, trichuriasis, strongyloidiasis, toxoccariasis, trichinosis, onchocerciasis, filaria, and dirofilariasis. Also encompassed are infections by various flukes, such as but not limited to schistosomiasis, paragonimiasis, and clonorchiasis.

The terms “individual” and “subject” are used herein interchangeably. They refer to human beings, non-human primates or other mammals (e.g. mouse, rat, rabbit, dog, cat, cattle, swine, sheep, horse or primate) that can be afflicted with or are susceptible to a disease or disorder (e.g., cancer) but may or may not have the disease or disorder. In many embodiments, the individual is a human being. Unless otherwise stated, the terms “individual” and “subject” do not denote a particular age, and thus encompass adults, elderlies, children, and newborns. In preferred embodiments of the present invention, the “individual” or “subject” is a “patient”. The term “patient” means according to the invention a subject for treatment, in particular a diseased subject.

By “being at risk” is meant a subject, i.e. a patient, that is identified as having a higher than normal chance of developing a disease, in particular cancer, compared to the general population. In addition, a subject who has had, or who currently has, a disease, in particular cancer is a subject who has an increased risk for developing a disease, as such a subject may continue to develop a disease. Subjects who currently have, or who have had, a cancer also have an increased risk for cancer metastases.

A prophylactic administration of an agent or composition of the invention, preferably protects the recipient from the development of a disease. A therapeutic administration of an agent or composition of the invention, may lead to the inhibition of the progression of the disease. This comprises the deceleration of the progression of the disease, in particular a disruption of the progression of the disease, which preferably leads to elimination of the disease.

The agents and compositions described herein may be administered via any conventional route, such as by parenteral administration including by injection or infusion. Administration is preferably parenterally, e.g. intravenously, intraarterially, subcutaneously, intradermally or intramuscularly.

Compositions suitable for parenteral administration usually comprise a sterile aqueous or nonaqueous preparation of the active compound, which is preferably isotonic to the blood of the recipient. Examples of compatible carriers and solvents are Ringer solution and isotonic sodium chloride solution. In addition, usually sterile, fixed oils are used as solution or suspension medium.

The agents and compositions described herein are administered in effective amounts. An “effective amount” refers to the amount which achieves a desired reaction or a desired effect alone or together with further doses. In the case of treatment of a particular disease or of a particular condition, the desired reaction preferably relates to inhibition of the course of the disease. This comprises slowing down the progress of the disease and, in particular, interrupting or reversing the progress of the disease.

The desired reaction in a treatment of a disease or of a condition may also be delay of the onset or a prevention of the onset of said disease or said condition.

An effective amount of an agent or composition described herein will depend on the condition to be treated, the severeness of the disease, the individual parameters of the patient, including age, physiological condition, size and weight, the duration of treatment, the type of an accompanying therapy (if present), the specific route of administration and similar factors. Accordingly, the doses administered of the agents described herein may depend on various of such parameters. In the case that a reaction in a patient is insufficient with an initial dose, higher doses (or effectively higher doses achieved by a different, more localized route of administration) may be used.

The agents and compositions described herein can be administered to patients, e.g., in vivo, to treat or prevent a variety of disorders such as those described herein.

Preferred patients include human patients having disorders that can be corrected or ameliorated by administering the agents and compositions described herein. This includes disorders involving cells characterized by expression of an antigen.

For example, in one embodiment, agents described herein can be used to treat a patient with a cancer disease, e.g., a cancer disease such as described herein characterized by the presence of cancer cells expressing an antigen.

The pharmaceutical compositions and methods of treatment described according to the invention may also be used for immunization or vaccination to prevent a disease described herein.

The pharmaceutical composition can be administered locally or systemically, preferably systemically.

The term “systemic administration” refers to the administration of an agent such that the agent becomes widely distributed in the body of an individual in significant amounts and develops a desired effect. For example, the agent may develop its desired effect in the blood and/or reaches its desired site of action via the vascular system. Typical systemic routes of administration include administration by introducing the agent directly into the vascular system or oral, pulmonary, or intramuscular administration wherein the agent is adsorbed, enters the vascular system, and is carried to one or more desired site(s) of action via the blood.

According to the present invention, it is preferred that the systemic administration is by parenteral administration. The term “parenteral administration” refers to administration of an agent such that the agent does not pass the intestine. The term “parenteral administration” includes intravenous administration, subcutaneous administration, intradermal administration or intraarterial administration but is not limited thereto.

Administration may also be carried out, for example, orally, intraperitoneally or intramuscularly.

The agents and compositions provided herein may be used alone or in combination with conventional therapeutic regimens such as surgery, irradiation, chemotherapy and/or bone marrow transplantation (autologous, syngeneic, allogeneic or unrelated).

EXAMPLES
Material and Methods

The following materials and methods were used in the examples that are described below.

DNA Encoding Replicon and Trans-Replicon Constructs

Vectors systems used herein were engineered from Venezuelan Equine Encephalitis virus (VEEV; accession no. L01442), the overall vector design resembles a Semliki Forest virus vector system generated before (FIG. 1). In a first step, a plasmid encoding a self-replicating RNA (cis-replicon) based on VEEV was obtained by gene synthesis from a commercial provider. This construct lacks VEEV structural genes, but contains all conserved sequence elements (CSE) of VEEV that serve as replication-recognition sequence (RRS) and control viral replication into pST1 plasmid backbone (Holtkamp et al., 2006, Blood, vol. 108, pp. 4009-4017) under the transcriptional control of a T7 phage RNA-polymerase promoter. A plasmid-encoded poly(A) cassette consisting of 30 and 70 adenylate residues (polyA30-70), separated by a 10 nucleotide random sequence (WO 2016/005004 A1), was added immediately downstream of the very last nucleotide of the VEEV 3′CSE. A Sapl restriction site for plasmid linearization was placed immediately downstream of the poly(A) cassette. The insertion of genes of interest into cis-replicons is done downstream of the subgenomic promoter (FIG. 1A). Using further gene synthesis and PCR-based seamless cloning/recombination techniques we generated a plasmid serving as template for in vitro transcription of an mRNA encoding the complete open reading frame of the VEEV replicase into the pST1 plasmid backbone (Holtkamp et al., 2006, Blood, vol. 108, pp. 4009-4017). This vector contains the human alpha-globin 5′UTR upstream, and a plasmid-encoded poly(A30-70) cassette downstream of the replicase ORF. Again a Sapl restriction site was placed immediately downstream of the poly(A) cassette for plasmid linearization. Upon in vitro transcription the resulting mRNA lacks functional RRS of VEEV and is unable to replicate. For in vitro transcription of RNA replicating in trans (trans-replicon) two different variants of template plasmids were generated. For the first variant a plasmid encoding a trans-replicon with WT-RRSs (non-modified 5′ CSE, subgenomic promoter, 3′CSE) was obtained by removing the majority of VEEV-replicase coding sequences from the cis-replicon, keeping only those comprising functional RRSs (5′-CSE and the subgenomic promoter). Owing to the removal of the major part of the replicase ORF, the RNA encoded by the respective plasmid, when present in a host cell, is not capable to drive replication in cis, but requires for replication the presence of functional alphavirus non-structural protein in trans. Genes of interests are inserted downstream of the subgenomic promoter.

For the second trans-replicon version, the sequences comprising the subgenomic promoter were removed, and the 5′RRS was shortened to the first ˜270nts of the VEEV genome. Furthermore, the 5′RRS was mutated to remove any AUG codon that could serve as translation start codon. Compensation nucleotide changes were introduced to ensure proper folding and function of the 5′RRS, this trans-replicon was Δ5ATG-RRSΔSGP. The removal of 5′AUG ensures that translation starts exclusively with the start codon of the ORF of interest, which is inserted downstream of the mutated 5′CSE.

The major biological difference between both variant of trans-replicating RNA is that trans-replicons with WT-RRS and subgenomic promoter encode transgenes on the subgenomic transcript which is generated only upon replication. This means that no protein of interest is translated in absence of a replicase expressed in trans. In contrast, the Δ5ATG-RRSΔSGP trans-replicon can be translated to proteins even without replicase provided that in vitro transcriptions were performed with synthetic cap analoga.

Genes of interest herein are chimeric antigen receptors (CARs) and T cell receptors (TCR). Alpha and beta chains of the TCRs were inserted into separate replicating RNA vectors and cotransferred into T cells.

In vitro transcription

In vitro transcription from plasmids described above and purification of RNA was performed as previously described with the exception that beta-S-ARCA(D2) cap analog was used instead of ARCA (Holtkamp et al., supra; Kuhn et al., 2010, Gene Ther., vol. 17, pp. 961-971). Quality of purified RNA was assessed by spectrophotometry, and capillary electrophoresis (2100 BioAnalyzer, Agilent, Santa Clara, USA). The RNA used in the examples is purified IVT-RNA.

RNA transfer into cells:

For electroporation, RNA was resuspended in X-vivo serum-free medium in a final volume of 62.5 μl/mm cuvette gap size. The following electroporation settings were applied using a square-wave electroporation device (BTX ECM 830, Harvard Apparatus, Holliston, MA, USA): T cells: 1250 V/cm, 1 pulse of 3 milliseconds (ms), MZ-GaBa-18-β2m: 562.5, 3 ms, 2 pulses).

Cell lines and reagents

The human melanoma cell line MZ-GaBa-018 had been established from the post-vaccine melanoma lesion of a melanoma patient (Sahin U. et al., Nature, 2017). As MZ-GaBa-018 cells completely lacked HLA class I surface expression due to a deletion microglobulin (β2m) of the β2 gene the subclone MZ-GABA-018_PGK_hB2M_bln_C5_P9 (MZ-GaBa-18-β2m) was established by transduction with β2m as a tool for T cell assays and cultured in RPMI1640 medium (Life Technologies) supplemented with 15% FCS (Biochrome AG) and 7 μg/ml blasticidin. The human Epstein-Barr virus (EBV)-immortalised B cell lymphoblastoid line JY was cultured in RPMI1640+Glutamax medium supplemented with 10% FCS. T cells were grown in RPMI medium supplemented with 5% human AB serum (One Lamda Inc., Los Angeles, CA, USA), 1% non-essential aminoacids and 1% sodium pyruvate (both Life Technologies). Cells were grown at 37° C. in humidified atmosphere equilibrated to 5% CO₂.

Peripheral blood mononuclear cells (PBMCs) and T cells

PBMCs were isolated by Ficoll-Hypaque (Amersham Biosciences, Uppsala, Sweden) density gradient centrifugation from buffy coats or from blood samples. HLA allelotypes were determined by PCR standard methods. CD4+ and CD8+ T cells were enriched from PBMCs using anti-CD4 and anti-CD8 microbeads (Miltenyi Biotech, Bergisch-Gladbach, Germany).

Flow cytometry: Cell surface expression of transfected TCR genes was analyzed by flow cytometry using PE-conjugated anti-TCR antibody against the appropriate variable region family of the TCR β chain (Beckman Coulter Inc., Fullerton, USA) and APC-labeled anti-CD8/-CD4 antibodies (BD Biosciences). Cell surface expression of transfected CARs was analyzed using a Alexa-647-conjugated idiotype-specific antibody (Ganymed pharmaceuticals) recognizing the scFv fragment contained in all CAR constructs. HLA antigens were detected by staining with a PE-labeled HLA class I-specific antibody (BD Biosciences). CLDN6 and CLDN18.2 surface expression on target cells was analyzed by staining with an Alexa-Fluor647-conjugated CLDN6- or CLDN18.2-specific antibody (Ganymed Pharmaceuticals). Flow cytometric analysis was performed on a BD FACSCanto™ II analytical flow cytometer (BD Biosciences). Acquired data were analyzed using version ten of the FlowJo software (Tree Star).

Luciferase cytotoxicity assay: A luciferase based cytotoxicity assay was performed as previously described (Omokoko et al., J.Immunol.Res., 2016). 1×10⁴target cells were transfected with luciferase RNA and co-cultured with OKT3-preactivated TCR-transfected CD8⁺ T cells for 47 hours. A reaction mixture containing D-Luciferin (BD Biosciences; final concentration 1.2 mg/mL) was added. One hour later, luminescence was measured using a Tecan Infinite M200 reader (Tecan). Cell killing was calculated by measuring the reduction of total luciferase activity. Viable cells were measured by the luciferase-mediated oxidation of luciferin. Specific killing was calculated according to the following equation: (1−(CPSexp−CPSmin)/(CPSmax−CPSmin)))*100. Maximum luminescence (maximum counts per second, CPSmax) was assessed after incubating target cells with mock transfected effector T cells and minimal luminescences (CPSmin) was assessed after treatment of targets with detergent Triton-X-100 for complete lysis.

ELISPOT (Enzyme-Linked ImmunoSPOT Assay): Microtiter plates (Millipore, Bedford, MA, USA) were coated overnight at room temperature with an anti-IFNγ antibody 1-D1k (Mabtech, Stockholm, Sweden) and blocked with 2% human albumin (CSL Behring, Marburg, Germany). 5×10⁴/well antigen presenting stimulator cells were plated in duplicates together with 3×10⁵/well TCR-transfected CD8+ effector cells 20-24 h after electroporation. The plates were incubated overnight (37° C., 5% CO2), washed with PBS 0.05% Tween 20, and incubated for 2 hours with the anti-IFNγ biotinylated mAB 7-B6-1 (Mabtech) at a final concentration of 1 μg/ml at 37° ° C. Avidin-bound horseradish peroxidase H (Vectastain Elite Kit; Vector Laboratories, Burlingame, USA) was added to the wells, incubated for 1 hour at room temperature and developed with 3-amino-9-ethyl carbazole (Sigma, Deisenhofen, Germany).

ELISA (Enzyme-Linked ImmunoSorbent Assay): The amount of IFNγ secreted by target-reative T cells was quantified in culture supernatants using the human IFNγ ELISA Ready-SET-Go! Kit (eBioscience) and following the manufacturer's instructions.

Objective

Redirecting T lymphocyte antigen specificity by gene transfer can provide large numbers of tumor reactive T lymphocytes for adoptive immunotherapy. However, safety concerns associated with viral vector production have limited clinical application of T cells expressing chimeric antigen receptors (CARs) or T cell receptors (TCRs). T lymphocytes can be gene modified by RNA electroporation without integration-associated safety concerns. To establish a safe platform for adoptive immunotherapy, we developed a novel replicative RNA format to achieve high level and prolonged expression of therapeutic receptors. We tested applicability of our system for CARs as well as TCRs and analyzed effector function of RNA-transfected T cells.

Example 1: IFNγ Release from CAR-Transfected Resting CD4+ Cells is Stimulated Using Replicative RNA

To assess the efficiency of CAR expression in T cells using replicative RNA, we transfected different CAR-encoding replicative RNA species and compared CAR surface expression as well as IFNγ secretion of the T cells in response to target cells. CD4⁺ T cells were isolated from peripheral blood mononuclear cells (PBMCs) of healthy donors using magnetic assisted cell sorting (MACS). Immediately after MACS CD4⁺ T cells were electroporated with equimolar amounts of replicative and non-replicative RNA encoding a human CLAUDIN-6 (CLDN6) reactive CAR. Trans-replicating RNA was cotransfected with mRNA encoding replicase as indicated. Staining of the electroporated cells with a CAR-specific antibody revealed that 40% of the cells or more expressed high levels of CAR 24 h after electroporation (FIG. 2A, B). Replicative RNA resulted in higher CAR expression levels per cells as reflected by higher mean fluorescence intensities (MFI) of the CAR specific staining, but not necessarily in a greater CAR positive population. At the same time as we performed the CAR staining we started a cocultivation of the transfected T cells with JY cells lacking human CLDN6, or JY-cells stably transfected with human CLDN6. The next day we quantified the release of IFNγ into the culture supernatants by ELISA and found that IFNγ release was increased by approximately one order of magnitude using replicative RNA (FIG. 2C).

We concluded that replicative RNA leads to higher CAR expression levels and stimulates IFNγ release compared to mRNA.

Example 2: IFNγ Release from CAR-Transfected CD8 T Cells is Stimulated and More Sustained Using Replicative RNA

Next we assessed the duration of CAR expression and IFNγ release using CD8+ cytotoxic T cells. To this aim CD8+ T cells were isolated from fresh or frozen PBMCs from different donors. Those isolated from fresh PBMCs were pre-stimulated with OKT3 and IL2 for 48 h and expanded in presence of IL-2 for 72 h. CD8 cells from frozen PBMCs were electroporated directly after MACS, at the same time as the pre-stimulated cells. Both CD8 isolates were electroporated with equimolar amounts of replicative and non-replicative RNA encoding a human CLAUDIN-6 reactive CAR. Trans-replicating RNA was cotransfected with mRNA encoding replicase.

To assess the duration of CAR surface expression we stained the cells with a CAR-specific antibody in intervals of 24 h after electroporation. We observed that CAR expression declined rapidly in all samples (FIG. 3A, C). Compared to mRNA the CAR expression in resting cells was much higher using replicative RNA (FIG. 3A), whereas mRNA was leading to comparable CAR-levels in stimulated cells (FIG. 3C). We also asked the question how the decline of CAR-expression after electroporation correlated to the ability of the cells to release IFNγ. We therefore started cocultivation of the transfected T cells and CLAUDIN6 positive and negative JY cells at the same time points we analyzed CAR expression, and collected culture supernatants 24 h. Using ELISA we found that IFNγ release from resting cells was stimulated using replicative RNA, and was more sustained compared to mRNA (FIG. 3B). IFNγ release from pre-stimulated cells was overall much higher, and at early time points similarly strong with all RNA species. At later time points trans-replicating RNA led to a more sustained IFNγ release (FIG. 3D).

Example 3: Improved Neo-Antigen-Specific TCR-Mediated Recognition and Function of Autologous Melanoma Cells after Replicative RNA Transfer

Multiple publications indicated that favourable clinical outcomes of clinical immunotherapies such as checkpoint blockade (Rizvi, Science, 2015; Snyder, N. Engl. J. Med. 2014; Mcgranahan, N, Science, 2016) and adoptive T cell therapy (Tran, E., Science, 2014; Robbins, P. F., Nat. Med., 2013; Tran, E. N. Engl. J. Med. 2016) are associated with neo-epitope immune recognition. These data not only support novel individualized vaccination strategies (Sahin, Nature, 2017) but also the development of autologous TCR gene therapies targeting neo-antigens to treat patients with advanced epithelial cancers (Klebanoff A., Nature Medicine, 2016). Neo-antigens represent ideal targets for T cell based immunotherapy because somatic mutations are central to the formation of cancers, they are exclusive to tumor cells (minimizing risk of on-target, off-tumor toxicity) and high affinity TCRs are not deleted during negative selection. However, realization of this concept will require major technical, manufacturing and regulatory innovations to treat the majority of patients with solid cancers. Electroporation of T cells with optimized RNA encoding neo-antigen-specific TCRs would provide a cost-efficient and flexible platform. Therefore, we applied the concept of replicative RNA transfer to the expression of neo-antigen-specific TCRs and subsequently analysed functional recognition of autologous melanoma cells.

We used two TCRs that were cloned from tumor infiltrating lymphocytes (TILs) of a melanoma patient recognizing two individual neo-antigens (M05 and M14) expressed by the tumor of this patient. The human melanoma cell line MZ-GaBa-018 has been established from the same lesion of the melanoma patient and has been confirmed to express M05 and M14 (Sahin U. et al., Nature, 2017). We isolated CD8+ T cells from a healthy donor, transfected them with equimolar amounts of replicative and non-replicative RNAs encoding the M14-TCR. Trans-replicating RNA was cotransfected with titrated amounts of mRNA encoding replicase. Transfected T cells were rested overnight before they were cocultured with MZ-GaBa-18-β2m cells and specific recognition of melanoma cells was analysed by IFNγ-ELISPOT assay (FIG. 4A). Melanoma cells were not recognized by T cells transfected with standard mRNA or NTR RNA (without replicase RNA) encoding the M014-TCR, but recognition could be induced by transfer of NTR in combination with replicase RNA. Notably, recognition increased dose-dependently when higher amounts of replicase RNA were cotransfected. TCR surface expression was verified by flow cytometry staining using a Vβ-specific antibody detecting the Vβ subfamily of the M14-TCR (FIG. 4B). TCR surface expression also increased in a dose-dependent manner after cotransfection of titrated amounts of replicase RNA in combination with NTR RNA.

Next, we wanted to know, if replicative RNA transfer of neo-antigen-specific TCRs, can also lead to improved lysis of melanoma cells. We transfected OKT3-preactivated CD8+ T cells with equimolar amounts of replicative and non-replicative RNAs encoding both, the M05- and the M14-TCR. TCR-tranfected T cells were rested overnight and cocultured with luciferase-transfected MZ-GaBa-18-β2m cells using different effector-to-target (E:T) ratios. Specific lysis of melanoma cells was analysed after 48 h of coculture using a luciferase-based killing assay (FIG. 5). For both TCRs significantly improved lysis of melanoma cells endogenously expressing the respective mutated epitopes was mediated my T cells after TCR transfection using replicative RNA at all 10 tested E:T ratios indicating that replicative RNA could indeed have the potential to increase the therapeutic window of T cells transfected with therapeutic TCRs.

	Number	Date	Country
Parent	16645757	Mar 2020	US
Child	18538898		US

RNA REPLICON FOR EXPRESSING AT CELL RECEPTOR OR AN ARTIFICIAL T CELL RECEPTOR

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

Continuations (1)