LASSA VIRUS VACCINE

Abstract
The present invention provides mRNAs usable as vaccines against lassa virus (LASV) infections. Further, the invention relates to (pharmaceutical) compositions and vaccines comprising said mRNAs and their use for treatment or prophylaxis of a lassa virus infection. The present invention further features a kit comprising the mRNAs, (pharmaceutical) compositions or vaccines and a method for treatment or prophylaxis of lassa virus infections using said mRNAs, pharmaceutical) compositions or vaccines.
Description

Lassa virus (LASV) or Lassa mammarenavirus belongs to the Arenaviridae, a family of enveloped viruses with bi-segmented, ambisense single-stranded RNA genomes. The large (L) genomic segment encodes an RNA-dependent RNA polymerase (RdRp) and a zinc-binding matrix protein (Z) (or small RNIG finger z protein, analogous of matrix protein, has been identified as a major budding factor). The small (S) genomic segment encodes the nucleoprotein (NP), and the glycoprotein precursor (GPC). The LASV GPC is synthesized as a single polypeptide and undergoes processing by signal peptidases and cellular pro-protein convertases yielding the stable signal peptide (SSP), the N-terminal GP1, and the transmembrane GP2.


Genetic diversity among LASV strains is the highest among the Arenaviridae, and causes a great challenge for vaccine development. Recent studies using next-generation sequencing (NGS) for confirmed LASV clustering into four major clades and provided further evidence for high LASV genome diversity: overall strain variation among the 54 strains is as high as 27% and 15% at the nucleotide and amino acid levels, respectively (Lukashevich I S. Viruses. 2012 November; 4(11): 2514-2557). LASV in vivo reassortment has been documented suggesting infection of individual hosts with at least two LASV strains from distinguished clades.


People of all ages and sex are susceptible to the LASV infection. Lassa virus is endemic in rural West Africa. LASV has the highest human impact of any of the hemorrhagic fever viruses (with the exception of Dengue fever) with an estimated 100,000-300,000 infections and 5,000-10,000 deaths annually in western Africa. Based on prospective studies performed in four of the most affected countries, Guinea, Sierra-Leone, Liberia, and Nigeria, it is estimated that 59 to 200 million people are at risk of primary LASV infections with an annual incidence of disease as high as 3 million and as many as 67,000 deaths per year. The current areas at risk cover approximately 80% of Sierra-Leone and Liberia, 50% of Guinea, 40% of Nigeria, 30% of each of Cote d'Ivoire, Togo and Benin, and 10% of Ghana (Lukashevich I S. Viruses. 2012 November; 4(11): 2514-2557).


The fatality rate for hospitalized patients is about 17%, but in certain groups of patients, such as pregnant women in their third trimester, more than 30% may die, and fetal or neonatal loss is about 88% (Fisher-Hoch S P et al. J Virol. 2000 August; 74(15):6777-83).


The early stage of human LASV infection is characterized by unspecific signs and symptoms. After an incubation period of 3-21 days, early symptoms include fever, sore throat, retrosternal pain, and myalgia. In progressed LF patients, elevated liver enzymes and high viral load in plasma (viremia) combined with vascular manifestations are indicators of poor prognosis and a fatal outcome.


The pathogenesis of Lassa Fever is still not clearly understood. After crossing the epithelium barrier at the respiratory and gastrointestinal tracts, LASV infects primarily macrophages and dendritic cells (DCs). Efficient virus replication in these cells results in virus dissemination and systemic infection.


Importantly, adaptive cell-mediated responses seem to play a key role in outcome of LASV infection in humans. Growing body of evidence indicates that very early events during the natural infection, probably, even in previremic stage can affect the balance between effective adaptive immune responses and virus replication and determine the clinical outcome (Lukashevich I S and Pushko P Expert Rev Vaccines. 2016 September; 15(9):1135-50). T cell responses seem to be central for immunity to LASV. In humans, strong CD4+ T cell memory responses against LASV NP can be recalled in PBMCs for up to six years after the initial infection, and T cell memory responses to GPC are similarly long-lived. Ub nacaqzes, LASV infection that results in a fatal outcome is associated with a lack of demonstrable T cell activation (Prescott et al. Nat Rev Immunol. 2017 March; 17(3): 195-207).


LASV establishes persistent, asymptomatic infection, with profuse urinary virus excretion in Mastomys natalensis, a ubiquitous and highly commensal rodent host. Human infection occurs via contact with rodents, inhalation of contaminated droplets/dust, ingestion of contaminated food, or contact with infected patients. Widespread prevention of such contact is presently impractical, and provision of a vaccine for community and hospital use is an imperative public health need.


Currently, there are no licensed treatments specific for Lassa fever. The broad-spectrum antiviral ribavirin seems to benefit patients with Lassa fever if the drug is administered early in the course of disease. The most beneficial current treatment is aggressive supportive care. Therefor, rapid and reliable diagnosis of Lassa virus infection is essential to reduce morbidity and mortality and to support public health measures, such as patient isolation and contact tracing.


There are no licensed vaccines to prevent Lassa virus infections. (Prescott et al. Nat Rev Immunol. 2017 March; 17(3):195-207).


Platforms currently being developed for LASV vaccines can be grouped into non-replication competent vaccine platforms and replication-competent vaccine platforms (cf. Lukashevich I S. Viruses. 2012 November; 4(11): 2514-2557 and Lukashevich I S and Pushko P Expert Rev Vaccines. 2016 September; 15(9):1135-50).


A favorable safety profile is the most attractive feature of non-replication competent vaccine platforms such as inactivated (“killed”) vaccines or virus-like particles. However, these approaches in general have low immunogenicity and efficacy. To achieve desirable levels of protection these vaccines therefore require multiple prime-boost injections.


Another potential approach includes epitope-based vaccines comprising NIA-binding LASV peptides. However, there are serious safety concerns regarding peptide-based vaccination of individuals recently infected with the virus or in immune individuals previously exposed (perhaps unknowingly/asymptomatically) to the pathogen due to the potential reactivation of CD8+ memory cell progeny leading to potentially fatal immune reactions. Also, since in West Africa the variability of the human HLA genes is very high, a large number of peptides would potentially have to be included in the vaccine, raising a concern regarding the feasibility of this approach.


Virus-based vaccine platforms currently being developed for LASV include alphavirus vector-based vaccines, “live-attenuated” LASV and recombinant vaccinia virus, recombinant Vesicular Stomatitis Virus (VSV), reassortant vaccine platform, MOP/LAS (clone ML29) or recombinant Yellow Fever 17D (YF17D) expressing LASV antigens. However, the development of virus-based vaccine platforms is challenging: while on the one hand, recombinant replication-defective viruses may exhibit a low immunogenicity and thus offer an insufficient protection, on the other hand, viruses exhibiting an immunosuppressive phenotype such as vaccinia virus, or other viruses harboring the potential of reversion to virulence raise safety concerns and are particularly unfeasible for countries with a high prevalence of HIV-1 (cf. Lukashevich I S. Viruses. 2012 November; 4(11): 2514-2557 and Lukashevich I S and Pushko P Expert Rev Vaccines. 2016 September; 15(9):1135-50).


WHO placed Lassa Fever on list of top emerging diseases to accelerate R&D on dangerous pathogens which are most prone to cause epidemics. There is currently no licensed LASV vaccine and vaccine development is hampered by high cost of biocontainment requirement, the absence of appropriate small animal models and genetic diversity of LASV species. LASV vaccine platforms currently under development have been falling short of expectations in terms of safety and efficacy.


There is an urgent need in the art for the provision of a safe and effective LASV vaccine. It is the object of the present invention to comply with this need.


Further needs regarding LASV vaccine properties, characteristics or performances comprise e.g.

    • Induction of a strong humoral immune response
    • Induction of B-cell memory
    • Fast onset of immune protection
    • Longevity of the induced immune responses
    • Induction of broad cellular T-cell responses
    • Induction of a (local and transient) pro-inflammatory environment
    • No induction of systemic cytokine or chemokine response
    • Well tolarability, no side-effects, non toxic
    • Advantageous stability characteristics
    • Formulation(s) compatible with many different antigens: larger antigen cocktails feasible based on the same (production) technology
    • No vector immunity, i.e. technology can be used to vaccinate the same subject multiple times against multiple (different) antigens
    • Speed, adaptability, simplicity and scalability of production


Although the present invention is described in detail below, it is to be understood that this invention is not limited to the particular methodologies, protocols and reagents described herein as these may vary. It is also to be understood that the terminology used herein is not intended to limit the scope of the present invention which will be limited only by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art.


In the following, the elements of the present invention will be described. These elements are listed with specific embodiments, however, it should be understood that they may be combined in any manner and in any number to create additional embodiments. The variously described examples and preferred embodiments should not be construed to limit the present invention to only the explicitly described embodiments. This description should be understood to support and encompass embodiments which combine the explicitly described embodiments with any number of the disclosed and/or preferred elements. Furthermore, any permutations and combinations of all described elements in this application should be considered disclosed by the description of the present application unless the context indicates otherwise.


Throughout this specification and the claims which follow, unless the context requires otherwise, the term “comprise”, and variations such as “comprises” and “comprising”, will be understood to imply the inclusion of a stated member, integer or step but not the exclusion of any other non-stated member, integer or step. The term “consist of” is a particular embodiment of the term “comprise”, wherein any other non-stated member, integer or step is excluded. In the context of the present invention, the term “comprise” encompasses the term “consist of”. The term “comprising” thus encompasses “including” as well as “consisting” e.g., a composition “comprising” X may consist exclusively of X or may include something additional e.g., X+Y.


The terms “a” and “an” and “the” and similar reference used in the context of describing the invention (especially in the context of the claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. Recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range. Unless otherwise indicated herein, each individual value is incorporated into the specification as if it were individually recited herein. No language in the specification should be construed as indicating any non-claimed element essential to the practice of the invention.


The word “substantially” does not exclude “completely” e.g., a composition which is “substantially free” from Y may be completely free from Y. Where necessary, the word “substantially” may be omitted from the definition of the invention.


The term “about” in relation to a numerical value x means x±10%.


In the present invention, if not otherwise indicated, different features of alternatives and embodiments may be combined with each other.


For the sake of clarity and readability the following definitions are provided. Any technical feature mentioned for these definitions may be read on each and every embodiment of the invention. Additional definitions and explanations may be specifically provided in the context of these embodiments.


Definitions

Adaptive immune response: The adaptive immune response is typically understood to be an antigen-specific response of the immune system. Antigen specificity allows for the generation of responses that are tailored, for example, to specific pathogens or pathogen-infected cells. The ability to mount these tailored responses is usually maintained in the body by “memory cells”. Should a pathogen infect the body more than once, these specific memory cells are used to quickly eliminate it. In this context, the first step of an adaptive immune response is the activation of naïve antigen-specific T cells or different immune cells able to induce an antigen-specific immune response by antigen-presenting cells. This occurs in the lymphoid tissues and organs through which naïve T cells are constantly passing. The three cell types that may serve as antigen-presenting cells are dendritic cells, macrophages, and B cells. Each of these cells has a distinct function in eliciting immune responses. Dendritic cells may take up antigens by phagocytosis and macropinocytosis and may become stimulated by contact with e.g. a foreign antigen to migrate to the local lymphoid tissue, where they differentiate into mature dendritic cells. Macrophages ingest particulate antigens such as bacteria and are induced by infectious agents or other appropriate stimuli to express MHC molecules. The unique ability of B cells to bind and internalize soluble protein antigens via their receptors may also be important to induce T cells. MHC-molecules are, typically, responsible for presentation of an antigen to T-cells. Therein, presenting the antigen on MHC molecules leads to activation of T cells, which induces their proliferation and differentiation into armed effector T cells. The most important function of effector T cells is the killing of infected cells by CD8+ cytotoxic T cells and the activation of macrophages by Th1 cells, which together make up cell-mediated immunity, and the activation of B cells by both Th2 and Th1 cells to produce different classes of antibody, thus driving the humoral immune response. T cells recognize an antigen by their T cell receptors which do not recognize and bind the antigen directly, but instead recognize short peptide fragments e.g. of pathogen-derived protein antigens, e.g. so-called epitopes, which are bound to MHC molecules on the surfaces of other cells.


Adaptive immune system: The adaptive immune system is essentially dedicated to eliminate or prevent pathogenic growth. It typically regulates the adaptive immune response by providing the vertebrate immune system with the ability to recognize and remember specific pathogens (to generate immunity), and to mount stronger attacks each time the pathogen is encountered. The system is highly adaptable because of somatic hypermutation (a process of accelerated somatic mutations), and V(D)J recombination (an irreversible genetic recombination of antigen receptor gene segments). This mechanism allows a small number of genes to generate a vast number of different antigen receptors, which are then uniquely expressed on each individual lymphocyte. Because the gene rearrangement leads to an irreversible change in the DNA of each cell, all of the progeny (offspring) of such a cell will then inherit genes encoding the same receptor specificity, including the Memory B cells and Memory T cells that are the keys to long-lived specific immunity.


Artificial nucleic acid molecule: An artificial nucleic acid molecule may typically be understood to be a nucleic acid molecule, e.g. a DNA or an RNA, that does not occur naturally. In other words, an artificial nucleic acid molecule may be understood as a non-natural nucleic acid molecule. Such nucleic acid molecule may be non-natural due to its individual sequence (which does not occur naturally) and/or due to other modifications, e.g. structural modifications of nucleotides, which do not occur naturally. An artificial nucleic acid molecule may be a DNA molecule, an RNA molecule or a hybrid-molecule comprising DNA and RNA portions. Typically, artificial nucleic acid molecules may be designed and/or generated by genetic engineering methods to correspond to a desired artificial sequence of nucleotides (heterologous sequence). In this context an artificial sequence is usually a sequence that may not occur naturally, i.e. it differs from the wild type sequence by at least one nucleotide. The term “wild type” may be understood as a sequence occurring in nature. Further, the term “artificial nucleic acid molecule” is not restricted to mean “one single molecule” but is, typically, understood to comprise an ensemble of identical molecules. Accordingly, it may relate to a plurality of identical molecules contained in an aliquot.


Cellular immunity/cellular immune response: Cellular immunity relates typically to the activation of macrophages, natural killer cells (NK), antigen-specific cytotoxic T-lymphocytes, and the release of various cytokines in response to an antigen. In more general terms, cellular immunity is not based on antibodies, but on the activation of cells of the immune system. Typically, a cellular immune response may be characterized e.g. by activating antigen-specific cytotoxic T-lymphocytes that are able to induce apoptosis in cells, e.g. specific immune cells like dendritic cells or other cells, displaying epitopes of foreign antigens on their surface. Such cells may be virus-infected or infected with intracellular bacteria, or cancer cells displaying tumor antigens. Further characteristics may be activation of macrophages and natural killer cells, enabling them to destroy pathogens and stimulation of cells to secrete a variety of cytokines that influence the function of other cells involved in adaptive immune responses and innate immune responses.


DNA: DNA is the usual abbreviation for deoxy-ribonucleic acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually deoxy-adenosine-monophosphate, deoxy-thymidine-monophosphate, deoxy-guanosine-monophosphate and deoxy-cytidine-monophosphate monomers which are—by themselves—composed of a sugar moiety (deoxyribose), a base moiety and a phosphate moiety, and polymerise by a characteristic backbone structure. The backbone structure is, typically, formed by phosphodiester bonds between the sugar moiety of the nucleotide, i.e. deoxyribose, of a first and a phosphate moiety of a second, adjacent monomer. The specific order of the monomers, i.e. the order of the bases linked to the sugar/phosphate-backbone, is called the DNA sequence. DNA may be single stranded or double stranded. In the double stranded form, the nucleotides of the first strand typically hybridize with the nucleotides of the second strand, e.g. by A/T-base-pairing and G/C-base-pairing.


Fragment of a sequence: A fragment of a sequence may typically be a shorter portion of a full-length sequence of e.g. a nucleic acid molecule or an amino acid sequence. Accordingly, a fragment, typically, consists of a sequence that is identical to the corresponding stretch within the full-length sequence. A preferred fragment of a sequence in the context of the present invention, consists of a continuous stretch of entities, such as nucleotides or amino acids corresponding to a continuous stretch of entities in the molecule the fragment is derived from, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, and most preferably at least 80% of the total (i.e. full-length) molecule from which the fragment is derived.


Gene therapy: Gene therapy may typically be understood to mean a treatment of a patient's body or isolated elements of a patient's body, for example isolated tissues/cells, by nucleic acids encoding a peptide or protein. It typically may comprise at least one of the steps of a) administration of a nucleic acid, preferably an RNA as defined herein, directly to the patient—by whatever administration route—or in vitro to isolated cells/tissues of the patient, which results in transfection of the patient's cells either in vivo/ex vivo or in vitro; b) transcription and/or translation of the introduced nucleic acid molecule; and optionally c) re-administration of isolated, transfected cells to the patient, if the nucleic acid has not been administered directly to the patient.


Genetic vaccination: Genetic vaccination may typically be understood to be vaccination by administration of a nucleic acid molecule encoding an antigen or an immunogen or fragments thereof. The nucleic acid molecule may be administered to a subject's body or to isolated cells of a subject. Upon transfection of certain cells of the body or upon transfection of the isolated cells, the antigen or immunogen may be expressed by those cells and subsequently presented to the immune system, eliciting an adaptive, i.e. antigen-specific immune response. Accordingly, genetic vaccination typically comprises at least one of the steps of a) administration of a nucleic acid, preferably an artificial nucleic acid molecule as defined herein, to a subject, preferably a patient, or to isolated cells of a subject, preferably a patient, which usually results in transfection of the subject's cells either in vivo or in vitro; b) transcription and/or translation of the introduced nucleic acid molecule; and optionally c) re-administration of isolated, transfected cells to the subject, preferably the patient, if the nucleic acid has not been administered directly to the patient.


Heterologous sequence: Two sequences are typically understood to be “heterologous” if they are not derivable from the same gene. I.e., although heterologous sequences may be derivable from the same organism, they naturally (in nature) do not occur in the same nucleic acid molecule, such as in the same mRNA.


Humoral immunity/humoral immune response: Humoral immunity refers typically to antibody production and optionally to accessory processes accompanying antibody production. A humoral immune response may be typically characterized, e.g., by Th2 activation and cytokine production, germinal center formation and isotype switching, affinity maturation and memory cell generation. Humoral immunity also typically may refer to the effector functions of antibodies, which include pathogen and toxin neutralization, classical complement activation, and opsonin promotion of phagocytosis and pathogen elimination.


Immunogen: In the context of the present invention, an immunogen may be typically understood to be a compound that is able to stimulate an immune response. Preferably, an immunogen is a peptide, polypeptide, or protein. In a particularly preferred embodiment, an immunogen in the sense of the present invention is the product of translation of a provided nucleic acid molecule, preferably an artificial nucleic acid molecule as defined herein. Typically, an immunogen elicits at least an adaptive immune response.


Immunostimulatory composition: In the context of the invention, an immunostimulatory composition may be typically understood to be a composition containing at least one component which is able to induce an immune response or from which a component, which is able to induce an immune response, is derivable. Such immune response may be preferably an innate immune response or a combination of an adaptive and an innate immune response. Preferably, an immunostimulatory composition in the context of the invention contains at least one artificial nucleic acid molecule, more preferably an RNA, for example an mRNA molecule. The immunostimulatory component, such as the mRNA may be complexed with a suitable carrier. Thus, the immunostimulatory composition may comprise an mRNA/carrier-complex. Furthermore, the immunostimulatory composition may comprise an adjuvant and/or a suitable vehicle for the immunostimulatory component, such as the mRNA.


Immune response: An immune response may typically be a specific reaction of the adaptive immune system to a particular antigen (so called specific or adaptive immune response) or an unspecific reaction of the innate immune system (so called unspecific or innate immune response), or a combination thereof.


Immune system: The immune system may protect organisms from infection. If a pathogen succeeds in passing a physical barrier of an organism and enters this organism, the innate immune system provides an immediate, but non-specific response. If pathogens evade this innate response, vertebrates possess a second layer of protection, the adaptive immune system. Here, the immune system adapts its response during an infection to improve its recognition of the pathogen. This improved response is then retained after the pathogen has been eliminated, in the form of an immunological memory, and allows the adaptive immune system to mount faster and stronger attacks each time this pathogen is encountered. According to this, the immune system comprises the innate and the adaptive immune system. Each of these two parts typically contains so called humoral and cellular components.


Immunostimulatory RNA: An immunostimulatory RNA (isRNA) in the context of the invention may typically be an RNA that is able to induce an innate immune response. It usually does not have an open reading frame and thus does not provide a peptide-antigen or immunogen but elicits an immune response e.g. by binding to a specific kind of Toll-like-receptor (TLR) or other suitable receptors. However, of course also mRNAs having an open reading frame and coding for a peptide/protein may induce an innate immune response and, thus, may be immunostimulatory RNAs.


Innate immune system: The innate immune system, also known as non-specific (or unspecific) immune system, typically comprises the cells and mechanisms that defend the host from infection by other organisms in a non-specific manner. This means that the cells of the innate system may recognize and respond to pathogens in a generic way, but unlike the adaptive immune system, it does not confer long-lasting or protective immunity to the host. The innate immune system may be, e.g., activated by ligands of Toll-like receptors (TLRs) or other auxiliary substances such as lipopolysaccharides, TNF-alpha, CD40 ligand, or cytokines, monokines, lymphokines, interleukins or chemokines, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL-20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32, IL-33, IFN-alpha, IFN-beta, IFN-gamma, GM-CSF, G-CSF, M-CSF, LT-beta, TNF-alpha, growth factors, and hGH, a ligand of human Toll-like receptor TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, a ligand of murine Toll-like receptor TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, TLR11, TLR12 or TLR13, a ligand of a NOD-like receptor, a ligand of a RIG-I like receptor, an immunostimulatory nucleic acid, an immunostimulatory RNA (isRNA), a CpG-DNA, an antibacterial agent, or an anti-viral agent. The pharmaceutical composition according to the present invention may comprise one or more such substances. Typically, a response of the innate immune system includes recruiting immune cells to sites of infection, through the production of chemical factors, including specialized chemical mediators, called cytokines; activation of the complement cascade; identification and removal of foreign substances present in organs, tissues, the blood and lymph, by specialized white blood cells; activation of the adaptive immune system; and/or acting as a physical and chemical barrier to infectious agents.


Cloning site: A cloning site is typically understood to be a segment of a nucleic acid molecule, which is suitable for insertion of a nucleic acid sequence, e.g., a nucleic acid sequence comprising an open reading frame. Insertion may be performed by any molecular biological method known to the one skilled in the art, e.g. by restriction and ligation. A cloning site typically comprises one or more restriction enzyme recognition sites (restriction sites). These one or more restrictions sites may be recognized by restriction enzymes which cleave the DNA at these sites. A cloning site which comprises more than one restriction site may also be termed a multiple cloning site (MCS) or a polylinker.


Nucleic acid molecule: A nucleic acid molecule is a molecule comprising, preferably consisting of nucleic acid components. The term nucleic acid molecule preferably refers to DNA or RNA molecules. It is preferably used synonymous with the term “polynucleotide”. Preferably, a nucleic acid molecule is a polymer comprising or consisting of nucleotide monomers, which are covalently linked to each other by phosphodiester-bonds of a sugar/phosphate-backbone. The term “nucleic acid molecule” also encompasses modified nucleic acid molecules, such as base-modified, sugar-modified or backbone-modified etc. DNA or RNA molecules.


Open reading frame: An open reading frame (ORF) in the context of the invention may typically be a sequence of several nucleotide triplets, which may be translated into a peptide or protein. An open reading frame preferably contains a start codon, i.e. a combination of three subsequent nucleotides coding usually for the amino acid methionine (ATG), at its 5′-end and a subsequent region, which usually exhibits a length which is a multiple of 3 nucleotides. An ORF is preferably terminated by a stop-codon (e.g., TAA, TAG, TGA). Typically, this is the only stop-codon of the open reading frame. Thus, an open reading frame in the context of the present invention is preferably a nucleotide sequence, consisting of a number of nucleotides that may be divided by three, which starts with a start codon (e.g. ATG) and which preferably terminates with a stop codon (e.g., TAA, TGA, or TAG). The open reading frame may be isolated or it may be incorporated in a longer nucleic acid sequence, for example in a vector or an mRNA. An open reading frame may also be termed “(protein) coding sequence” or, preferably, “coding sequence”.


Peptide: A peptide or polypeptide is typically a polymer of amino acid monomers, linked by peptide bonds. It typically contains less than 50 monomer units. Nevertheless, the term peptide is not a disclaimer for molecules having more than 50 monomer units. Long peptides are also called polypeptides, typically having between 50 and 600 monomeric units.


Protein: A protein typically comprises one or more peptides or polypeptides. A protein is typically folded into 3-dimensional form, which may be required for the protein to exert its biological function.


Restriction site: A restriction site, also termed restriction enzyme recognition site, is a nucleotide sequence recognized by a restriction enzyme. A restriction site is typically a short, preferably palindromic nucleotide sequence, e.g. a sequence comprising 4 to 8 nucleotides. A restriction site is preferably specifically recognized by a restriction enzyme. The restriction enzyme typically cleaves a nucleotide sequence comprising a restriction site at this site. In a double-stranded nucleotide sequence, such as a double-stranded DNA sequence, the restriction enzyme typically cuts both strands of the nucleotide sequence.


RNA, mRNA: RNA is the usual abbreviation for ribonucleic-acid. It is a nucleic acid molecule, i.e. a polymer consisting of nucleotides. These nucleotides are usually adenosine-monophosphate, uridine-monophosphate, guanosine-monophosphate and cytidine-monophosphate monomers which are connected to each other along a so-called backbone. The backbone is formed by phosphodiester bonds between the sugar, i.e. ribose, of a first and a phosphate moiety of a second, adjacent monomer. The specific succession of the monomers is called the RNA-sequence. Usually RNA may be obtainable by transcription of a DNA-sequence, e.g., inside a cell. In eukaryotic cells, transcription is typically performed inside the nucleus or the mitochondria. In vivo, transcription of DNA usually results in the so-called premature RNA which has to be processed into so-called messenger-RNA, usually abbreviated as mRNA. Processing of the premature RNA, e.g. in eukaryotic organisms, comprises a variety of different posttranscriptional-modifications such as splicing, 5′-capping, polyadenylation, export from the nucleus or the mitochondria and the like. The sum of these processes is also called maturation of RNA. The mature messenger RNA usually provides the nucleotide sequence that may be translated into an amino-acid sequence of a particular peptide or protein. Typically, a mature mRNA comprises a 5′-cap, a 5′-UTR, an open reading frame, a 3′-UTR and a poly(A) sequence. Aside from messenger RNA, several non-coding types of RNA exist which may be involved in regulation of transcription and/or translation.


Sequence of a nucleic acid molecule: The sequence of a nucleic acid molecule is typically understood to be the particular and individual order, i.e. the succession of its nucleotides. The sequence of a protein or peptide is typically understood to be the order, i.e. the succession of its amino acids.


Sequence identity: Two or more sequences are identical if they exhibit the same length and order of nucleotides or amino acids. The percentage of identity typically describes the extent to which two sequences are identical, i.e. it typically describes the percentage of nucleotides that correspond in their sequence position with identical nucleotides of a reference-sequence. For determination of the degree of identity, the sequences to be compared are considered to exhibit the same length, i.e. the length of the longest sequence of the sequences to be compared. This means that a first sequence consisting of 8 nucleotides is 80% identical to a second sequence consisting of 10 nucleotides comprising the first sequence. In other words, in the context of the present invention, identity of sequences preferably relates to the percentage of nucleotides of a sequence which have the same position in two or more sequences having the same length. Gaps are usually regarded as non-identical positions, irrespective of their actual position in an alignment.


Transfection: The term “transfection” refers to the introduction of nucleic acid molecules, such as DNA or RNA (e.g. mRNA) molecules, into cells, preferably into eukaryotic cells. In the context of the present invention, the term “transfection” encompasses any method known to the skilled person for introducing nucleic acid molecules into cells, preferably into eukaryotic cells, such as into mammalian cells. Such methods encompass, for example, electroporation, lipofection, e.g. based on cationic lipids and/or liposomes, calcium phosphate precipitation, nanoparticle based transfection, virus based transfection, or transfection based on cationic polymers, such as DEAE-dextran or polyethylenimine etc. Preferably, the introduction is non-viral.


Vector: The term “vector” refers to a nucleic acid molecule, preferably to an artificial nucleic acid molecule. A vector in the context of the present invention is suitable for incorporating or harboring a desired nucleic acid sequence, such as a nucleic acid sequence comprising an open reading frame. Such vectors may be storage vectors, expression vectors, cloning vectors, transfer vectors etc. A storage vector is a vector, which allows the convenient storage of a nucleic acid molecule, for example, of an mRNA molecule. Thus, the vector may comprise a sequence corresponding, e.g., to a desired mRNA sequence or a part thereof, such as a sequence corresponding to the coding sequence and the 3′-UTR of an mRNA. An expression vector may be used for production of expression products such as RNA, e.g. mRNA, or peptides, polypeptides or proteins. For example, an expression vector may comprise sequences needed for transcription of a sequence stretch of the vector, such as a promoter sequence, e.g. an RNA polymerase promoter sequence. A cloning vector is typically a vector that contains a cloning site, which may be used to incorporate nucleic acid sequences into the vector. A cloning vector may be, e.g., a plasmid vector or a bacteriophage vector. A transfer vector may be a vector, which is suitable for transferring nucleic acid molecules into cells or organisms, for example, viral vectors. A vector in the context of the present invention may be, e.g., an RNA vector or a DNA vector. Preferably, a vector is a DNA molecule. Preferably, a vector in the sense of the present application comprises a cloning site, a selection marker, such as an antibiotic resistance factor, and a sequence suitable for multiplication of the vector, such as an origin of replication. Preferably, a vector in the context of the present application is a plasmid vector.


Vehicle: A vehicle is typically understood to be a material that is suitable for storing, transporting, and/or administering a compound, such as a pharmaceutically active compound. For example, it may be a physiologically acceptable liquid, which is suitable for storing, transporting, and/or administering a pharmaceutically active compound.


SUMMARY OF THE INVENTION

The present invention is based on the inventors' surprising finding that an mRNA encoding at least one antigenic peptide or protein derived from a lassa virus (LASV) efficiently induces antigen-specific immune responses against lassa virus. Furthermore, the inventors surprisingly found that mRNA-based vaccines comprising mRNAs encoding different antigens of a lassa virus (particularly nucleoprotein (NP) and/or glycoprotein precursor (GPC) and/or zinc-binding matrix protein (Z)) can be extremely effective in inducing an antigen-specific immune response against lassa virus, particularly in elicting effective and long-lasting T cell responses. The inventive mRNA is able to efficiently induce antigen-specific immune responses at a very low dosage and dosing regimen. Furthermore, mRNAs encoding different antigens (optionally of different lassa viruses) can be effectively combined in one mRNA-based vaccine. Additionally, the mRNAs according to the invention enable rapid and rational vaccine design with flexibility, speed and scalability of production probably exceeding those of current virus-based technologies.


Antigenic Peptides or Proteins


In a first aspect, the present invention relates to an mRNA comprising a coding region (or “coding sequence”), encoding at least one antigenic peptide or protein derived from a lassa virus, or a variant or fragment of said antigenic peptide or protein.


As used herein, the term “mRNA” refers to “messenger RNAs”, i.e. typically single-strand ribonucleic acid molecules transcribed from DNA that carry genetic information required for protein synthesis from DNA to the ribosomes, that are inter alia characterized by the specific succession of their nucleotides (i.e. their (m)RNA sequence). The term “mRNA” may thus be used to refer to mRNA molecules or mRNA sequences as will be readily understood by the skilled person in the respective context.


The term “antigenic peptide or protein” refers to (poly-)peptides capable of eliciting an (adaptive) immune response. Such “antigenic peptides or proteins” preferably comprise or provide at least one functional epitope. The term “epitope” or “antigenic determinant” typically refers to the part of an antigen which is recognized by adaptive immune system. An “antigen” is a substance, which is capable of being recognized (typically via its epitope(s)) by the immune system, preferably by the adaptive immune system, and which is capable of eliciting an antigen-specific immune response, e.g. by formation of antibodies and/or antigen-specific T cells as part of an adaptive immune response. In the context of the present invention, “antigens” are preferably peptides or proteins (“antigenic peptides or proteins”). Typically, an antigen comprise or consist a peptide or protein, which may be presented to (antigen-specific) T-cells on MHC surface molecules by the MHC complex. In the sense of the present invention an antigenic peptide or protein is typically the product of translation of a provided nucleic acid molecule, preferably an mRNA as defined herein. As used herein, the term “epitope” in particular refers to a part or fragment of an antigen presented on a MHC surface molecule. Such a fragment comprising or consisting of a (functional) epitope may typically comprise from about 5 to about 20 amino acids. Epitopes can be distinguished in T cell epitopes and B cell epitopes. T cell epitopes or parts of the proteins in the context of the present invention may comprise fragments preferably having a length of about 6 to about 20 or even more amino acids, e.g. fragments as processed and presented by MHC class I molecules, preferably having a length of about 8 to about 10 amino acids, e.g. 8, 9, or 10, (or even 11, or 12 amino acids), or fragments as processed and presented by MHC class II molecules, preferably having a length of about 13 or more amino acids, e.g. 13, 14, 15, 16, 17, 18, 19, 20 or even more amino acids, wherein these fragments may be selected from any part of the amino acid sequence. These fragments are typically recognized by T cells in form of a complex consisting of the peptide fragment and an MHC surface molecule, i.e. the fragments are typically not recognized in their native form. B cell epitopes are typically fragments located on the outer surface of (native) protein or peptide antigens as defined herein, preferably having 5 to 15 amino acids, more preferably having 5 to 12 amino acids, even more preferably having 6 to 9 amino acids, which may be recognized by antibodies, i.e. in their native form. The term “epitope” includes “conformational” (or “discontinuous”) epitopes, which are composed of discontinuous sequences of the amino acids of the antigen but are brought together in the three-dimensional structure, and “linear” epitopes, which are formed by a continuous sequence of amino acids from the antigen.


The expression “mRNA encoding a . . . protein derived from a lassa virus” may preferably refer to and may be used interchangeably with the expression “mRNA encoding a . . . protein corresponding to a . . . protein of lassa virus” or simply “mRNA encoding a lassa virus . . . protein”.


The inventive mRNA thus preferably comprises a coding region encoding a full-length antigenic peptide or protein derived from a lassa virus, or a variant or fragment thereof, wherein said antigenic peptide or protein preferably comprises or provides at least one (functional) epitope, i.e. said antigenic peptide or protein preferably either comprise a native epitope (preferably recognized by B cells) or is processed to yield a MHC-bound epitope (preferably recognized by T cells), said epitope preferably being capable of inducing the desired adaptive immune response in a subject.


In preferred embodiments, the coding region of the inventive mRNA encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, and/or at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus, and/or at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof.


Thus, the inventive mRNA may encode (i) at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus (e.g. full-length glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof) (ii) at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus, or a variant or fragment thereof (e.g. full-length nucleoprotein (NP) of a lassa virus, or a variant or fragment thereof), (iiI) at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof (e.g. full-length zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof), (iv) a combination thereof, wherein a “combination” in this context includes combinations of separate antigenic proteins or peptides (preferably encoded by different ORFs in the coding region) or fusion proteins comprising any combination of the aforementioned antigenic proteins or peptides joined together optionally via a suitable peptide linker.


In preferred embodiments, the mRNA according to the invention is provided as (or usable as) a vaccine.


In a preferred embodiment, the inventive mRNA comprises a coding region, encoding at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus or a variant or fragment of said NP and/or at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus or a variant or fragment of said GPC and/or at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus or a variant or fragment of said Z. Accordingly, the amino acid sequence of the at least one antigenic peptide or protein encoded by the inventive mRNA may be selected from any peptide or protein derived from nucleoprotein (NP) of a lassa virus or a variant or fragment of said NP or a glycoprotein precursor (GPC) of a lassa virus or a variant or fragment of said GPC or zinc-binding matrix protein (Z) of a lassa virus or a variant or fragment of said Z or from any synthetically engineered lassa virus peptide or protein.


Clades or Lineages


The antigenic peptide(s) or protein(s) encoded by the coding region of the inventive mRNA may be derived from the same lassa virus or from different lassa viruses. In specific embodiments the lassa virus peptide(s) or protein(s) encoded by the at least one mRNA(s) are derived from a lassa virus of clade I, II, III or IV or from a lassa virus of lineage I, II, III or IV.


In a further specific embodiments the lassa virus peptide(s) or protein(s) encoded by the at least one mRNA(s) are derived from a lassa virus of clade I, II, III and/or IV or from a lassa virus of lineage I, II, III and/or IV.


In a further embodiments the lassa virus peptide(s) or protein(s) encoded by the at least one mRNA(s) are derived from a lassa virus of clade I, II, III and IV or from a lassa virus of lineage I, II, III and IV.


Per definition, a lineage is a single line of descent or linear chain within the phylogenetic tree, while a clade is a (usually branched) monophyletic group, containing a single ancestor and all its descendants.


Figure A shows a phylogenetic tree that shows relationships among Lassa virus strains (Bowen, J. Virol. 2000; 6992-7004). The four Lassa virus lineages are labeled from I through IV. Representative strains from each of the four lineages are shown in outline (Lineage I: LP strain, Lineage II: strain 803213, Lineage III: GA391 and Lineage IV: Josiah).


The overall topology of the most parsimonious tree suggests that the LP strain (lineage I) occupies the most basal position within the Lassa clade, followed by lineage II (southern central Nigeria). Lineages III (northern central Nigeria) and IV (Guinea, Liberia, and Sierra Leone) exhibit a sister relationship. Lineage IV is the largest group of Lassa virus strains and contains all strains from Guinea, Liberia, and Sierra Leone.


Sequence differences within and between lineages can be determined by comparing the sequences of different strains, generally in the region of the NP gene, e.g. by means of the specified primers for RT-PCR described in Bowen J. Virol. 2000; 6992-7004).


The overall genetic diversity within Lassa virus strains is great, approaching a maximum of 26.8% nucleotide and 14.8% deduced amino acid divergence. Within lineages, the diversity is generally less than 20% at the nucleotide level and 11% at the amino acid level except for lineage III when the outlier strain 9608911 is included. Within lineage IV, less variation is observed within strains from Sierra Leone than in Guinean and Liberian strains. Variation between lineages ranges from 19% to almost 27% at the nucleotide level and 6.7% to almost 15% at the amino acid level.


A further method to determine the sequence variability or the genetic diversity on the basis of e.g. the GPC gene is described in Leski et al (Emerg Infect Dis. 2015 April; 21(4): 609-618).


In a specific embodiments the lassa virus peptide(s) or protein(s) encoded by the at least one mRNA(s) are derived from at least two different lassa virus strains of clade I, II, III and/or IV or from a lassa virus of lineage I, II, III and/or IV, wherein the at least two lassa virus proteins exhibit considerable sequence dissimilarities. In a preferred embodiment, the dissimilarities are greater than 11% amino acid sequence divergence in the region of the NP gene. In a second embodiment, the sequence divergence of the mRNA encoding the lassa virus protein is greater than 22% nucleotide sequence divergence.


Full-Length Antigenic Proteins and Variants Thereof


In preferred embodiments, the at least one coding region of the inventive mRNA encodes at least one full-length protein of nucleoprotein (NP) (or a variant thereof) of a lassa virus and/or at least one full-length glycoprotein precursor (GPC) (or a variant thereof) of a lassa virus and/or at least one full-length zinc-binding matrix protein (Z) (or a variant thereof) of a lassa virus.


The term “full-length protein” or as used herein typically refers to a protein that substantially comprises the entire amino acid sequence of the naturally occurring (wild type) protein. Accordingly, in a preferred embodiment, the at least one coding sequence of the inventive mRNA encodes a full-length antigenic protein derived from a lassa virus or a variant of said protein, wherein said protein is preferably selected from the proteins disclosed in Tables 1-3.









TABLE 1







Lassa virus Glycoprotein (GPC)















column 1









NCBI



Accession
column 2
column 3
column 4
column 5
column 6
column 7


Row
No.
A
B
C
D
E
F

















1
AAF86703
1
376
751, 1126, 1501, 1876,
4828
5356
5884






2251, 2626, 3001


2
AAG41802
2
377
752, 1127, 1502, 1877,
4829
5357
5885






2252, 2627, 3002


3
AAL13212
3
378
753, 1128, 1503, 1878,
4830
5358
5886






2253, 2628, 3003


4
AAO59512
4
379
754, 1129, 1504, 1879,
4831
5359
5887






2254, 2629, 3004


5
AAT49000
5
380
755, 1130, 1505, 1880,
4832
5360
5888






2255, 2630, 3005


6
AAT49004
6
381
756, 1131, 1506, 1881,
4833
5361
5889






2256, 2631, 3006


7
AAT49008
7
382
757, 1132, 1507, 1882,
4834
5362
5890






2257, 2632, 3007


8
AAT49010
8
383
758, 1133, 1508, 1883,
4835
5363
5891






2258, 2633, 3008


9
AAT49014
9
384
759, 1134, 1509, 1884,
4836
5364
5892






2259, 2634, 3009


10
AAV54104
10
385
760, 1135, 1510, 1885,
4837
5365
5893






2260, 2635, 3010


11
ADI39451
11
386
761, 1136, 1511, 1886,
4838
5366
5894






2261, 2636, 3011


12
ADU56610
12
387
762, 1137, 1512, 1887,
4839
5367
5895






2262, 2637, 3012


13
ADU56618
13
388
763, 1138, 1513, 1888,
4840
5368
5896






2263, 2638, 3013


14
ADU56622
14
389
764, 1139, 1514, 1889,
4841
5369
5897






2264, 2639, 3014


15
ADU56626
15
390
765, 1140, 1515, 1890,
4842
5370
5898






2265, 2640, 3015


16
ADU56630
16
391
766, 1141, 1516, 1891,
4843
5371
5899






2266, 2641, 3016


17
AFY05588
17
392
767, 1142, 1517, 1892,
4844
5372
5900






2267, 2642, 3017


18
AFY05590
18
393
768, 1143, 1518, 1893,
4845
5373
5901






2268, 2643, 3018


19
AHC95553
19
394
769, 1144, 1519, 1894,
4846
5374
5902






2269, 2644, 3019


20
AHC95555
20
395
770, 1145, 1520, 1895,
4847
5375
5903






2270, 2645, 3020


21
AHC95557
21
396
771, 1146, 1521, 1896,
4848
5376
5904






2271, 2646, 3021


22
AHC95559
22
397
772, 1147, 1522, 1897,
4849
5377
5905






2272, 2647, 3022


23
AIT17126
23
398
773, 1148, 1523, 1898,
4850
5378
5906






2273, 2648, 3023


24
AIT17130
24
399
774, 1149, 1524, 1899,
4851
5379
5907






2274, 2649, 3024


25
AIT17134
25
400
775, 1150, 1525, 1900,
4852
5380
5908






2275, 2650, 3025


26
AIT17138
26
401
776, 1151, 1526, 1901,
4853
5381
5909






2276, 2651, 3026


27
AIT17142
27
402
777, 1152, 1527, 1902,
4854
5382
5910






2277, 2652, 3027


28
AIT17146
28
403
778, 1153, 1528, 1903,
4855
5383
5911






2278, 2653, 3028


29
AIT17154
29
404
779, 1154, 1529, 1904,
4856
5384
5912






2279, 2654, 3029


30
AIT17158
30
405
780, 1155, 1530, 1905,
4857
5385
5913






2280, 2655, 3030


31
AIT17164
31
406
781, 1156, 1531, 1906,
4858
5386
5914






2281, 2656, 3031


32
AIT17168
32
407
782, 1157, 1532, 1907,
4859
5387
5915






2282, 2657, 3032


33
AIT17172
33
408
783, 1158, 1533, 1908,
4860
5388
5916






2283, 2658, 3033


34
AIT17176
34
409
784, 1159, 1534, 1909,
4861
5389
5917






2284, 2659, 3034


35
AIT17180
35
410
785, 1160, 1535, 1910,
4862
5390
5918






2285, 2660, 3035


36
AIT17184
36
411
786, 1161, 1536, 1911,
4863
5391
5919






2286, 2661, 3036


37
AIT17188
37
412
787, 1162, 1537, 1912,
4864
5392
5920






2287, 2662, 3037


38
AIT17192
38
413
788, 1163, 1538, 1913,
4865
5393
5921






2288, 2663, 3038


39
AIT17196
39
414
789, 1164, 1539, 1914,
4866
5394
5922






2289, 2664, 3039


40
AIT17204
40
415
790, 1165, 1540, 1915,
4867
5395
5923






2290, 2665, 3040


41
AIT17208
41
416
791, 1166, 1541, 1916,
4868
5396
5924






2291, 2666, 3041


42
AIT17210
42
417
792, 1167, 1542, 1917,
4869
5397
5925






2292, 2667, 3042


43
AIT17212
43
418
793, 1168, 1543, 1918,
4870
5398
5926






2293, 2668, 3043


44
AIT17216
44
419
794, 1169, 1544, 1919,
4871
5399
5927






2294, 2669, 3044


45
AIT17224
45
420
795, 1170, 1545, 1920,
4872
5400
5928






2295, 2670, 3045


46
AIT17228
46
421
796, 1171, 1546, 1921,
4873
5401
5929






2296, 2671, 3046


47
AIT17232
47
422
797, 1172, 1547, 1922,
4874
5402
5930






2297, 2672, 3047


48
AIT17236
48
423
798, 1173, 1548, 1923,
4875
5403
5931






2298, 2673, 3048


49
AIT17240
49
424
799, 1174, 1549, 1924,
4876
5404
5932






2299, 2674, 3049


50
AIT17244
50
425
800, 1175, 1550, 1925,
4877
5405
5933






2300, 2675, 3050


51
AIT17246
51
426
801, 1176, 1551, 1926,
4878
5406
5934






2301, 2676, 3051


52
AIT17250
52
427
802, 1177, 1552, 1927,
4879
5407
5935






2302, 2677, 3052


53
AIT17254
53
428
803, 1178, 1553, 1928,
4880
5408
5936






2303, 2678, 3053


54
AIT17258
54
429
804, 1179, 1554, 1929,
4881
5409
5937






2304, 2679, 3054


55
AIT17262
55
430
805, 1180, 1555, 1930,
4882
5410
5938






2305, 2680, 3055


56
AIT17266
56
431
806, 1181, 1556, 1931,
4883
5411
5939






2306, 2681, 3056


57
AIT17270
57
432
807, 1182, 1557, 1932,
4884
5412
5940






2307, 2682, 3057


58
AIT17274
58
433
808, 1183, 1558, 1933,
4885
5413
5941






2308, 2683, 3058


59
AIT17276
59
434
809, 1184, 1559, 1934,
4886
5414
5942






2309, 2684, 3059


60
AIT17280
60
435
810, 1185, 1560, 1935,
4887
5415
5943






2310, 2685, 3060


61
AIT17288
61
436
811, 1186, 1561, 1936,
4888
5416
5944






2311, 2686, 3061


62
AIT17292
62
437
812, 1187, 1562, 1937,
4889
5417
5945






2312, 2687, 3062


63
AIT17294
63
438
813, 1188, 1563, 1938,
4890
5418
5946






2313, 2688, 3063


64
AIT17298
64
439
814, 1189, 1564, 1939,
4891
5419
5947






2314, 2689, 3064


65
AIT17302
65
440
815, 1190, 1565, 1940,
4892
5420
5948






2315, 2690, 3065


66
AIT17310
66
441
816, 1191, 1566, 1941,
4893
5421
5949






2316, 2691, 3066


67
AIT17314
67
442
817, 1192, 1567, 1942,
4894
5422
5950






2317, 2692, 3067


68
AIT17318
68
443
818, 1193, 1568, 1943,
4895
5423
5951






2318, 2693, 3068


69
AIT17322
69
444
819, 1194, 1569, 1944,
4896
5424
5952






2319, 2694, 3069


70
AIT17328
70
445
820, 1195, 1570, 1945,
4897
5425
5953






2320, 2695, 3070


71
AIT17332
71
446
821, 1196, 1571, 1946,
4898
5426
5954






2321, 2696, 3071


72
AIT17340
72
447
822, 1197, 1572, 1947,
4899
5427
5955






2322, 2697, 3072


73
AIT17344
73
448
823, 1198, 1573, 1948,
4900
5428
5956






2323, 2698, 3073


74
AIT17348
74
449
824, 1199, 1574, 1949,
4901
5429
5957






2324, 2699, 3074


75
AIT17350
75
450
825, 1200, 1575, 1950,
4902
5430
5958






2325, 2700, 3075


76
AIT17354
76
451
826, 1201, 1576, 1951,
4903
5431
5959






2326, 2701, 3076


77
AIT17360
77
452
827, 1202, 1577, 1952,
4904
5432
5960






2327, 2702, 3077


78
AIT17364
78
453
828, 1203, 1578, 1953,
4905
5433
5961






2328, 2703, 3078


79
AIT17368
79
454
829, 1204, 1579, 1954,
4906
5434
5962






2329, 2704, 3079


80
AIT17370
80
455
830, 1205, 1580, 1955,
4907
5435
5963






2330, 2705, 3080


81
AIT17378
81
456
831, 1206, 1581, 1956,
4908
5436
5964






2331, 2706, 3081


82
AIT17388
82
457
832, 1207, 1582, 1957,
4909
5437
5965






2332, 2707, 3082


83
AIT17404
83
458
833, 1208, 1583, 1958,
4910
5438
5966






2333, 2708, 3083


84
AIT17408
84
459
834, 1209, 1584, 1959,
4911
5439
5967






2334, 2709, 3084


85
AIT17412
85
460
835, 1210, 1585, 1960,
4912
5440
5968






2335, 2710, 3085


86
AIT17416
86
461
836, 1211, 1586, 1961,
4913
5441
5969






2336, 2711, 3086


87
AIT17420
87
462
837, 1212, 1587, 1962,
4914
5442
5970






2337, 2712, 3087


88
AIT17424
88
463
838, 1213, 1588, 1963,
4915
5443
5971






2338, 2713, 3088


89
AIT17428
89
464
839, 1214, 1589, 1964,
4916
5444
5972






2339, 2714, 3089


90
AIT17430
90
465
840, 1215, 1590, 1965,
4917
5445
5973






2340, 2715, 3090


91
AIT17438
91
466
841, 1216, 1591, 1966,
4918
5446
5974






2341, 2716, 3091


92
AIT17442
92
467
842, 1217, 1592, 1967,
4919
5447
5975






2342, 2717, 3092


93
AIT17446
93
468
843, 1218, 1593, 1968,
4920
5448
5976






2343, 2718, 3093


94
AIT17450
94
469
844, 1219, 1594, 1969,
4921
5449
5977






2344, 2719, 3094


95
AIT17454
95
470
845, 1220, 1595, 1970,
4922
5450
5978






2345, 2720, 3095


96
AIT17458
96
471
846, 1221, 1596, 1971,
4923
5451
5979






2346, 2721, 3096


97
AIT17462
97
472
847, 1222, 1597, 1972,
4924
5452
5980






2347, 2722, 3097


98
AIT17466
98
473
848, 1223, 1598, 1973,
4925
5453
5981






2348, 2723, 3098


99
AIT17470
99
474
849, 1224, 1599, 1974,
4926
5454
5982






2349, 2724, 3099


100
AIT17474
100
475
850, 1225, 1600, 1975,
4927
5455
5983






2350, 2725, 3100


101
AIT17478
101
476
851, 1226, 1601, 1976,
4928
5456
5984






2351, 2726, 3101


102
AIT17482
102
477
852, 1227, 1602, 1977,
4929
5457
5985






2352, 2727, 3102


103
AIT17486
103
478
853, 1228, 1603, 1978,
4930
5458
5986






2353, 2728, 3103


104
AIT17490
104
479
854, 1229, 1604, 1979,
4931
5459
5987






2354, 2729, 3104


105
AIT17494
105
480
855, 1230, 1605, 1980,
4932
5460
5988






2355, 2730, 3105


106
AIT17498
106
481
856, 1231, 1606, 1981,
4933
5461
5989






2356, 2731, 3106


107
AIT17502
107
482
857, 1232, 1607, 1982,
4934
5462
5990






2357, 2732, 3107


108
AIT17506
108
483
858, 1233, 1608, 1983,
4935
5463
5991






2358, 2733, 3108


109
AIT17510
109
484
859, 1234, 1609, 1984,
4936
5464
5992






2359, 2734, 3109


110
AIT17514
110
485
860, 1235, 1610, 1985,
4937
5465
5993






2360, 2735, 3110


111
AIT17518
111
486
861, 1236, 1611, 1986,
4938
5466
5994






2361, 2736, 3111


112
AIT17524
112
487
862, 1237, 1612, 1987,
4939
5467
5995






2362, 2737, 3112


113
AIT17528
113
488
863, 1238, 1613, 1988,
4940
5468
5996






2363, 2738, 3113


114
AIT17538
114
489
864, 1239, 1614, 1989,
4941
5469
5997






2364, 2739, 3114


115
AIT17540
115
490
865, 1240, 1615, 1990,
4942
5470
5998






2365, 2740, 3115


116
AIT17544
116
491
866, 1241, 1616, 1991,
4943
5471
5999






2366, 2741, 3116


117
AIT17548
117
492
867, 1242, 1617, 1992,
4944
5472
6000






2367, 2742, 3117


118
AIT17552
118
493
868, 1243, 1618, 1993,
4945
5473
6001






2368, 2743, 3118


119
AIT17556
119
494
869, 1244, 1619, 1994,
4946
5474
6002






2369, 2744, 3119


120
AIT17560
120
495
870, 1245, 1620, 1995,
4947
5475
6003






2370, 2745, 3120


121
AIT17564
121
496
871, 1246, 1621, 1996,
4948
5476
6004






2371, 2746, 3121


122
AIT17572
122
497
872, 1247, 1622, 1997,
4949
5477
6005






2372, 2747, 3122


123
AIT17576
123
498
873, 1248, 1623, 1998,
4950
5478
6006






2373, 2748, 3123


124
AIT17580
124
499
874, 1249, 1624, 1999,
4951
5479
6007






2374, 2749, 3124


125
AIT17584
125
500
875, 1250, 1625, 2000,
4952
5480
6008






2375, 2750, 3125


126
AIT17588
126
501
876, 1251, 1626, 2001,
4953
5481
6009






2376, 2751, 3126


127
AIT17592
127
502
877, 1252, 1627, 2002,
4954
5482
6010






2377, 2752, 3127


128
AIT17596
128
503
878, 1253, 1628, 2003,
4955
5483
6011






2378, 2753, 3128


129
AIT17600
129
504
879, 1254, 1629, 2004,
4956
5484
6012






2379, 2754, 3129


130
AIT17604
130
505
880, 1255, 1630, 2005,
4957
5485
6013






2380, 2755, 3130


131
AIT17608
131
506
881, 1256, 1631, 2006,
4958
5486
6014






2381, 2756, 3131


132
AIT17612
132
507
882, 1257, 1632, 2007,
4959
5487
6015






2382, 2757, 3132


133
AIT17616
133
508
883, 1258, 1633, 2008,
4960
5488
6016






2383, 2758, 3133


134
AIT17620
134
509
884, 1259, 1634, 2009,
4961
5489
6017






2384, 2759, 3134


135
AIT17626
135
510
885, 1260, 1635, 2010,
4962
5490
6018






2385, 2760, 3135


136
AIT17630
136
511
886, 1261, 1636, 2011,
4963
5491
6019






2386, 2761, 3136


137
AIT17634
137
512
887, 1262, 1637, 2012,
4964
5492
6020






2387, 2762, 3137


138
AIT17638
138
513
888, 1263, 1638, 2013,
4965
5493
6021






2388, 2763, 3138


139
AIT17642
139
514
889, 1264, 1639, 2014,
4966
5494
6022






2389, 2764, 3139


140
AIT17646
140
515
890, 1265, 1640, 2015,
4967
5495
6023






2390, 2765, 3140


141
AIT17650
141
516
891, 1266, 1641, 2016,
4968
5496
6024






2391, 2766, 3141


142
AIT17658
142
517
892, 1267, 1642, 2017,
4969
5497
6025






2392, 2767, 3142


143
AIT17666
143
518
893, 1268, 1643, 2018,
4970
5498
6026






2393, 2768, 3143


144
AIT17670
144
519
894, 1269, 1644, 2019,
4971
5499
6027






2394, 2769, 3144


145
AIT17674
145
520
895, 1270, 1645, 2020,
4972
5500
6028






2395, 2770, 3145


146
AIT17678
146
521
896, 1271, 1646, 2021,
4973
5501
6029






2396, 2771, 3146


147
AIT17686
147
522
897, 1272, 1647, 2022,
4974
5502
6030






2397, 2772, 3147


148
AIT17690
148
523
898, 1273, 1648, 2023,
4975
5503
6031






2398, 2773, 3148


149
AIT17694
149
524
899, 1274, 1649, 2024,
4976
5504
6032






2399, 2774, 3149


150
AIT17712
150
525
900, 1275, 1650, 2025,
4977
5505
6033






2400, 2775, 3150


151
AIT17718
151
526
901, 1276, 1651, 2026,
4978
5506
6034






2401, 2776, 3151


152
AIT17724
152
527
902, 1277, 1652, 2027,
4979
5507
6035






2402, 2777, 3152


153
AIT17728
153
528
903, 1278, 1653, 2028,
4980
5508
6036






2403, 2778, 3153


154
AIT17732
154
529
904, 1279, 1654, 2029,
4981
5509
6037






2404, 2779, 3154


155
AIT17736
155
530
905, 1280, 1655, 2030,
4982
5510
6038






2405, 2780, 3155


156
AIT17744
156
531
906, 1281, 1656, 2031,
4983
5511
6039






2406, 2781, 3156


157
AIT17752
157
532
907, 1282, 1657, 2032,
4984
5512
6040






2407, 2782, 3157


158
AIT17756
158
533
908, 1283, 1658, 2033,
4985
5513
6041






2408, 2783, 3158


159
AIT17758
159
534
909, 1284, 1659, 2034,
4986
5514
6042






2409, 2784, 3159


160
AIT17762
160
535
910, 1285, 1660, 2035,
4987
5515
6043






2410, 2785, 3160


161
AIT17770
161
536
911, 1286, 1661, 2036,
4988
5516
6044






2411, 2786, 3161


162
AIT17778
162
537
912, 1287, 1662, 2037,
4989
5517
6045






2412, 2787, 3162


163
AIT17782
163
538
913, 1288, 1663, 2038,
4990
5518
6046






2413, 2788, 3163


164
AIT17786
164
539
914, 1289, 1664, 2039,
4991
5519
6047






2414, 2789, 3164


165
AIT17790
165
540
915, 1290, 1665, 2040,
4992
5520
6048






2415, 2790, 3165


166
AIT17798
166
541
916, 1291, 1666, 2041,
4993
5521
6049






2416, 2791, 3166


167
AIT17802
167
542
917, 1292, 1667, 2042,
4994
5522
6050






2417, 2792, 3167


168
AIT17806
168
543
918, 1293, 1668, 2043,
4995
5523
6051






2418, 2793, 3168


169
AIT17810
169
544
919, 1294, 1669, 2044,
4996
5524
6052






2419, 2794, 3169


170
AIT17814
170
545
920, 1295, 1670, 2045,
4997
5525
6053






2420, 2795, 3170


171
AIT17820
171
546
921, 1296, 1671, 2046,
4998
5526
6054






2421, 2796, 3171


172
AIT17824
172
547
922, 1297, 1672, 2047,
4999
5527
6055






2422, 2797, 3172


173
AIT17828
173
548
923, 1298, 1673, 2048,
5000
5528
6056






2423, 2798, 3173


174
AIT17836
174
549
924, 1299, 1674, 2049,
5001
5529
6057






2424, 2799, 3174


175
AIT17840
175
550
925, 1300, 1675, 2050,
5002
5530
6058






2425, 2800, 3175


176
AIT17844
176
551
926, 1301, 1676, 2051,
5003
5531
6059






2426, 2801, 3176


177
AMR44577
177
552
927, 1302, 1677, 2052,
5004
5532
6060






2427, 2802, 3177


178
ANH09722
178
553
928, 1303, 1678, 2053,
5005
5533
6061






2428, 2803, 3178


179
ANH09724
179
554
929, 1304, 1679, 2054,
5006
5534
6062






2429, 2804, 3179


180
ANH09728
180
555
930, 1305, 1680, 2055,
5007
5535
6063






2430, 2805, 3180


181
ANH09730
181
556
931, 1306, 1681, 2056,
5008
5536
6064






2431, 2806, 3181


182
ANH09732
182
557
932, 1307, 1682, 2057,
5009
5537
6065






2432, 2807, 3182


183
ANH09738
183
558
933, 1308, 1683, 2058,
5010
5538
6066






2433, 2808, 3183


184
ANH09740
184
559
934, 1309, 1684, 2059,
5011
5539
6067






2434, 2809, 3184


185
CAA36645
185
560
935, 1310, 1685, 2060,
5012
5540
6068






2435, 2810, 3185


186
NP_694870
186
561
936, 1311, 1686, 2061,
5013
5541
6069






2436, 2811, 3186
















TABLE 2







Lassa virus Nucleoprotein (NP)















column 1









NCBI



Accession
column 2
column 3
column 4
column 5
column 6
column 7


Row
No.
A
B
C
D
E
F

















1
AAA46284
187
562
937, 1312, 1687, 2062,
5014
5542
6070






2437, 2812, 3187


2
AAF86704
188
563
938, 1313, 1688, 2063,
5015
5543
6071






2438, 2813, 3188


3
AAO59513
189
564
939, 1314, 1689, 2064,
5016
5544
6072






2439, 2814, 3189


4
AAT48999
190
565
940, 1315, 1690, 2065,
5017
5545
6073






2440, 2815, 3190


5
AAT49007
191
566
941, 1316, 1691, 2066,
5018
5546
6074






2441, 2816, 3191


6
AAT49009
192
567
942, 1317, 1692, 2067,
5019
5547
6075






2442, 2817, 3192


7
ADI39452
193
568
943, 1318, 1693, 2068,
5020
5548
6076






2443, 2818, 3193


8
ADU56611
194
569
944, 1319, 1694, 2069,
5021
5549
6077






2444, 2819, 3194


9
ADU56615
195
570
945, 1320, 1695, 2070,
5022
5550
6078






2445, 2820, 3195


10
ADU56619
196
571
946, 1321, 1696, 2071,
5023
5551
6079






2446, 2821, 3196


11
ADU56623
197
572
947, 1322, 1697, 2072,
5024
5552
6080






2447, 2822, 3197


12
ADU56627
198
573
948, 1323, 1698, 2073,
5025
5553
6081






2448, 2823, 3198


13
ADU56631
199
574
949, 1324, 1699, 2074,
5026
5554
6082






2449, 2824, 3199


14
AFY05580
200
575
950, 1325, 1700, 2075,
5027
5555
6083






2450, 2825, 3200


15
AFY05581
201
576
951, 1326, 1701, 2076,
5028
5556
6084






2451, 2826, 3201


16
AFY05587
202
577
952, 1327, 1702, 2077,
5029
5557
6085






2452, 2827, 3202


17
AFY05591
203
578
953, 1328, 1703, 2078,
5030
5558
6086






2453, 2828, 3203


18
AFY05597
204
579
954, 1329, 1704, 2079,
5031
5559
6087






2454, 2829, 3204


19
AFY05610
205
580
955, 1330, 1705, 2080,
5032
5560
6088






2455, 2830, 3205


20
AFY05616
206
581
956, 1331, 1706, 2081,
5033
5561
6089






2456, 2831, 3206


21
AFY05618
207
582
957, 1332, 1707, 2082,
5034
5562
6090






2457, 2832, 3207


22
AHC95552
208
583
958, 1333, 1708, 2083,
5035
5563
6091






2458, 2833, 3208


23
AHC95554
209
584
959, 1334, 1709, 2084,
5036
5564
6092






2459, 2834, 3209


24
AHC95556
210
585
960, 1335, 1710, 2085,
5037
5565
6093






2460, 2835, 3210


25
AHC95558
211
586
961, 1336, 1711, 2086,
5038
5566
6094






2461, 2836, 3211


26
AHC95560
212
587
962, 1337, 1712, 2087,
5039
5567
6095






2462, 2837, 3212


27
AIT17127
213
588
963, 1338, 1713, 2088,
5040
5568
6096






2463, 2838, 3213


28
AIT17131
214
589
964, 1339, 1714, 2089,
5041
5569
6097






2464, 2839, 3214


29
AIT17139
215
590
965, 1340, 1715, 2090,
5042
5570
6098






2465, 2840, 3215


30
AIT17143
216
591
966, 1341, 1716, 2091,
5043
5571
6099






2466, 2841, 3216


31
AIT17147
217
592
967, 1342, 1717, 2092,
5044
5572
6100






2467, 2842, 3217


32
AIT17151
218
593
968, 1343, 1718, 2093,
5045
5573
6101






2468, 2843, 3218


33
AIT17155
219
594
969, 1344, 1719, 2094,
5046
5574
6102






2469, 2844, 3219


34
AIT17161
220
595
970, 1345, 1720, 2095,
5047
5575
6103






2470, 2845, 3220


35
AIT17165
221
596
971, 1346, 1721, 2096,
5048
5576
6104






2471, 2846, 3221


36
AIT17169
222
597
972, 1347, 1722, 2097,
5049
5577
6105






2472, 2847, 3222


37
AIT17173
223
598
973, 1348, 1723, 2098,
5050
5578
6106






2473, 2848, 3223


38
AIT17177
224
599
974, 1349, 1724, 2099,
5051
5579
6107






2474, 2849, 3224


39
AIT17181
225
600
975, 1350, 1725, 2100,
5052
5580
6108






2475, 2850, 3225


40
AIT17185
226
601
976, 1351, 1726, 2101,
5053
5581
6109






2476, 2851, 3226


41
AIT17189
227
602
977, 1352, 1727, 2102,
5054
5582
6110






2477, 2852, 3227


42
AIT17193
228
603
978, 1353, 1728, 2103,
5055
5583
6111






2478, 2853, 3228


43
AIT17197
229
604
979, 1354, 1729, 2104,
5056
5584
6112






2479, 2854, 3229


44
AIT17201
230
605
980, 1355, 1730, 2105,
5057
5585
6113






2480, 2855, 3230


45
AIT17205
231
606
981, 1356, 1731, 2106,
5058
5586
6114






2481, 2856, 3231


46
AIT17209
232
607
982, 1357, 1732, 2107,
5059
5587
6115






2482, 2857, 3232


47
AIT17211
233
608
983, 1358, 1733, 2108,
5060
5588
6116






2483, 2858, 3233


48
AIT17213
234
609
984, 1359, 1734, 2109,
5061
5589
6117






2484, 2859, 3234


49
AIT17217
235
610
985, 1360, 1735, 2110,
5062
5590
6118






2485, 2860, 3235


50
AIT17225
236
611
986, 1361, 1736, 2111,
5063
5591
6119






2486, 2861, 3236


51
AIT17229
237
612
987, 1362, 1737, 2112,
5064
5592
6120






2487, 2862, 3237


52
AIT17233
238
613
988, 1363, 1738, 2113,
5065
5593
6121






2488, 2863, 3238


53
AIT17237
239
614
989, 1364, 1739, 2114,
5066
5594
6122






2489, 2864, 3239


54
AIT17241
240
615
990, 1365, 1740, 2115,
5067
5595
6123






2490, 2865, 3240


55
AIT17245
241
616
991, 1366, 1741, 2116,
5068
5596
6124






2491, 2866, 3241


56
AIT17247
242
617
992, 1367, 1742, 2117,
5069
5597
6125






2492, 2867, 3242


57
AIT17251
243
618
993, 1368, 1743, 2118,
5070
5598
6126






2493, 2868, 3243


58
AIT17255
244
619
994, 1369, 1744, 2119,
5071
5599
6127






2494, 2869, 3244


59
AIT17259
245
620
995, 1370, 1745, 2120,
5072
5600
6128






2495, 2870, 3245


60
AIT17263
246
621
996, 1371, 1746, 2121,
5073
5601
6129






2496, 2871, 3246


61
AIT17267
247
622
997, 1372, 1747, 2122,
5074
5602
6130






2497, 2872, 3247


62
AIT17271
248
623
998, 1373, 1748, 2123,
5075
5603
6131






2498, 2873, 3248


63
AIT17275
249
624
999, 1374, 1749, 2124,
5076
5604
6132






2499, 2874, 3249


64
AIT17277
250
625
1000, 1375, 1750, 2125,
5077
5605
6133






2500, 2875, 3250


65
AIT17281
251
626
1001, 1376, 1751, 2126,
5078
5606
6134






2501, 2876, 3251


66
AIT17285
252
627
1002, 1377, 1752, 2127,
5079
5607
6135






2502, 2877, 3252


67
AIT17289
253
628
1003, 1378, 1753, 2128,
5080
5608
6136






2503, 2878, 3253


68
AIT17293
254
629
1004, 1379, 1754, 2129,
5081
5609
6137






2504, 2879, 3254


69
AIT17295
255
630
1005, 1380, 1755, 2130,
5082
5610
6138






2505, 2880, 3255


70
AIT17299
256
631
1006, 1381, 1756, 2131,
5083
5611
6139






2506, 2881, 3256


71
AIT17303
257
632
1007, 1382, 1757, 2132,
5084
5612
6140






2507, 2882, 3257


72
AIT17307
258
633
1008, 1383, 1758, 2133,
5085
5613
6141






2508, 2883, 3258


73
AIT17311
259
634
1009, 1384, 1759, 2134,
5086
5614
6142






2509, 2884, 3259


74
AIT17315
260
635
1010, 1385, 1760, 2135,
5087
5615
6143






2510, 2885, 3260


75
AIT17319
261
636
1011, 1386, 1761, 2136,
5088
5616
6144






2511, 2886, 3261


76
AIT17323
262
637
1012, 1387, 1762, 2137,
5089
5617
6145






2512, 2887, 3262


77
AIT17325
263
638
1013, 1388, 1763, 2138,
5090
5618
6146






2513, 2888, 3263


78
AIT17329
264
639
1014, 1389, 1764, 2139,
5091
5619
6147






2514, 2889, 3264


79
AIT17333
265
640
1015, 1390, 1765, 2140,
5092
5620
6148






2515, 2890, 3265


80
AIT17335
266
641
1016, 1391, 1766, 2141,
5093
5621
6149






2516, 2891, 3266


81
AIT17341
267
642
1017, 1392, 1767, 2142,
5094
5622
6150






2517, 2892, 3267


82
AIT17345
268
643
1018, 1393, 1768, 2143,
5095
5623
6151






2518, 2893, 3268


83
AIT17351
269
644
1019, 1394, 1769, 2144,
5096
5624
6152






2519, 2894, 3269


84
AIT17359
270
645
1020, 1395, 1770, 2145,
5097
5625
6153






2520, 2895, 3270


85
AIT17361
271
646
1021, 1396, 1771, 2146,
5098
5626
6154






2521, 2896, 3271


86
AIT17365
272
647
1022, 1397, 1772, 2147,
5099
5627
6155






2522, 2897, 3272


87
AIT17371
273
648
1023, 1398, 1773, 2148,
5100
5628
6156






2523, 2898, 3273


88
AIT17379
274
649
1024, 1399, 1774, 2149,
5101
5629
6157






2524, 2899, 3274


89
AIT17381
275
650
1025, 1400, 1775, 2150,
5102
5630
6158






2525, 2900, 3275


90
AIT17389
276
651
1026, 1401, 1776, 2151,
5103
5631
6159






2526, 2901, 3276


91
AIT17399
277
652
1027, 1402, 1777, 2152,
5104
5632
6160






2527, 2902, 3277


92
AIT17405
278
653
1028, 1403, 1778, 2153,
5105
5633
6161






2528, 2903, 3278


93
AIT17409
279
654
1029, 1404, 1779, 2154,
5106
5634
6162






2529, 2904, 3279


94
AIT17413
280
655
1030, 1405, 1780, 2155,
5107
5635
6163






2530, 2905, 3280


95
AIT17421
281
656
1031, 1406, 1781, 2156,
5108
5636
6164






2531, 2906, 3281


96
AIT17425
282
657
1032, 1407, 1782, 2157,
5109
5637
6165






2532, 2907, 3282


97
AIT17429
283
658
1033, 1408, 1783, 2158,
5110
5638
6166






2533, 2908, 3283


98
AIT17431
284
659
1034, 1409, 1784, 2159,
5111
5639
6167






2534, 2909, 3284


99
AIT17439
285
660
1035, 1410, 1785, 2160,
5112
5640
6168






2535, 2910, 3285


100
AIT17443
286
661
1036, 1411, 1786, 2161,
5113
5641
6169






2536, 2911, 3286


101
AIT17447
287
662
1037, 1412, 1787, 2162,
5114
5642
6170






2537, 2912, 3287


102
AIT17451
288
663
1038, 1413, 1788, 2163,
5115
5643
6171






2538, 2913, 3288


103
AIT17455
289
664
1039, 1414, 1789, 2164,
5116
5644
6172






2539, 2914, 3289


104
AIT17459
290
665
1040, 1415, 1790, 2165,
5117
5645
6173






2540, 2915, 3290


105
AIT17463
291
666
1041, 1416, 1791, 2166,
5118
5646
6174






2541, 2916, 3291


106
AIT17467
292
667
1042, 1417, 1792, 2167,
5119
5647
6175






2542, 2917, 3292


107
AIT17471
293
668
1043, 1418, 1793, 2168,
5120
5648
6176






2543, 2918, 3293


108
AIT17475
294
669
1044, 1419, 1794, 2169,
5121
5649
6177






2544, 2919, 3294


109
AIT17479
295
670
1045, 1420, 1795, 2170,
5122
5650
6178






2545, 2920, 3295


110
AIT17491
296
671
1046, 1421, 1796, 2171,
5123
5651
6179






2546, 2921, 3296


111
AIT17495
297
672
1047, 1422, 1797, 2172,
5124
5652
6180






2547, 2922, 3297


112
AIT17499
298
673
1048, 1423, 1798, 2173,
5125
5653
6181






2548, 2923, 3298


113
AIT17503
299
674
1049, 1424, 1799, 2174,
5126
5654
6182






2549, 2924, 3299


114
AIT17507
300
675
1050, 1425, 1800, 2175,
5127
5655
6183






2550, 2925, 3300


115
AIT17511
301
676
1051, 1426, 1801, 2176,
5128
5656
6184






2551, 2926, 3301


116
AIT17515
302
677
1052, 1427, 1802, 2177,
5129
5657
6185






2552, 2927, 3302


117
AIT17519
303
678
1053, 1428, 1803, 2178,
5130
5658
6186






2553, 2928, 3303


118
AIT17525
304
679
1054, 1429, 1804, 2179,
5131
5659
6187






2554, 2929, 3304


119
AIT17533
305
680
1055, 1430, 1805, 2180,
5132
5660
6188






2555, 2930, 3305


120
AIT17539
306
681
1056, 1431, 1806, 2181,
5133
5661
6189






2556, 2931, 3306


121
AIT17541
307
682
1057, 1432, 1807, 2182,
5134
5662
6190






2557, 2932, 3307


122
AIT17545
308
683
1058, 1433, 1808, 2183,
5135
5663
6191






2558, 2933, 3308


123
AIT17549
309
684
1059, 1434, 1809, 2184,
5136
5664
6192






2559, 2934, 3309


124
AIT17553
310
685
1060, 1435, 1810, 2185,
5137
5665
6193






2560, 2935, 3310


125
AIT17561
311
686
1061, 1436, 1811, 2186,
5138
5666
6194






2561, 2936, 3311


126
AIT17565
312
687
1062, 1437, 1812, 2187,
5139
5667
6195






2562, 2937, 3312


127
AIT17569
313
688
1063, 1438, 1813, 2188,
5140
5668
6196






2563, 2938, 3313


128
AIT17573
314
689
1064, 1439, 1814, 2189,
5141
5669
6197






2564, 2939, 3314


129
AIT17577
315
690
1065, 1440, 1815, 2190,
5142
5670
6198






2565, 2940, 3315


130
AIT17581
316
691
1066, 1441, 1816, 2191,
5143
5671
6199






2566, 2941, 3316


131
AIT17589
317
692
1067, 1442, 1817, 2192,
5144
5672
6200






2567, 2942, 3317


132
AIT17593
318
693
1068, 1443, 1818, 2193,
5145
5673
6201






2568, 2943, 3318


133
AIT17605
319
694
1069, 1444, 1819, 2194,
5146
5674
6202






2569, 2944, 3319


134
AIT17609
320
695
1070, 1445, 1820, 2195,
5147
5675
6203






2570, 2945, 3320


135
AIT17613
321
696
1071, 1446, 1821, 2196,
5148
5676
6204






2571, 2946, 3321


136
AIT17617
322
697
1072, 1447, 1822, 2197,
5149
5677
6205






2572, 2947, 3322


137
AIT17621
323
698
1073, 1448, 1823, 2198,
5150
5678
6206






2573, 2948, 3323


138
AIT17625
324
699
1074, 1449, 1824, 2199,
5151
5679
6207






2574, 2949, 3324


139
AIT17627
325
700
1075, 1450, 1825, 2200,
5152
5680
6208






2575, 2950, 3325


140
AIT17631
326
701
1076, 1451, 1826, 2201,
5153
5681
6209






2576, 2951, 3326


141
AIT17635
327
702
1077, 1452, 1827, 2202,
5154
5682
6210






2577, 2952, 3327


142
AIT17639
328
703
1078, 1453, 1828, 2203,
5155
5683
6211






2578, 2953, 3328


143
AIT17643
329
704
1079, 1454, 1829, 2204,
5156
5684
6212






2579, 2954, 3329


144
AIT17651
330
705
1080, 1455, 1830, 2205,
5157
5685
6213






2580, 2955, 3330


145
AIT17655
331
706
1081, 1456, 1831, 2206,
5158
5686
6214






2581, 2956, 3331


146
AIT17659
332
707
1082, 1457, 1832, 2207,
5159
5687
6215






2582, 2957, 3332


147
AIT17663
333
708
1083, 1458, 1833, 2208,
5160
5688
6216






2583, 2958, 3333


148
AIT17667
334
709
1084, 1459, 1834, 2209,
5161
5689
6217






2584, 2959, 3334


149
AIT17671
335
710
1085, 1460, 1835, 2210,
5162
5690
6218






2585, 2960, 3335


150
AIT17675
336
711
1086, 1461, 1836, 2211,
5163
5691
6219






2586, 2961, 3336


151
AIT17679
337
712
1087, 1462, 1837, 2212,
5164
5692
6220






2587, 2962, 3337


152
AIT17683
338
713
1088, 1463, 1838, 2213,
5165
5693
6221






2588, 2963, 3338


153
AIT17687
339
714
1089, 1464, 1839, 2214,
5166
5694
6222






2589, 2964, 3339


154
AIT17695
340
715
1090, 1465, 1840, 2215,
5167
5695
6223






2590, 2965, 3340


155
AIT17705
341
716
1091, 1466, 1841, 2216,
5168
5696
6224






2591, 2966, 3341


156
AIT17711
342
717
1092, 1467, 1842, 2217,
5169
5697
6225






2592, 2967, 3342


157
AIT17713
343
718
1093, 1468, 1843, 2218,
5170
5698
6226






2593, 2968, 3343


158
AIT17719
344
719
1094, 1469, 1844, 2219,
5171
5699
6227






2594, 2969, 3344


159
AIT17725
345
720
1095, 1470, 1845, 2220,
5172
5700
6228






2595, 2970, 3345


160
AIT17729
346
721
1096, 1471, 1846, 2221,
5173
5701
6229






2596, 2971, 3346


161
AIT17733
347
722
1097, 1472, 1847, 2222,
5174
5702
6230






2597, 2972, 3347


162
AIT17745
348
723
1098, 1473, 1848, 2223,
5175
5703
6231






2598, 2973, 3348


163
AIT17753
349
724
1099, 1474, 1849, 2224,
5176
5704
6232






2599, 2974, 3349


164
AIT17763
350
725
1100, 1475, 1850, 2225,
5177
5705
6233






2600, 2975, 3350


165
AIT17771
351
726
1101, 1476, 1851, 2226,
5178
5706
6234






2601, 2976, 3351


166
AIT17779
352
727
1102, 1477, 1852, 2227,
5179
5707
6235






2602, 2977, 3352


167
AIT17783
353
728
1103, 1478, 1853, 2228,
5180
5708
6236






2603, 2978, 3353


168
AIT17787
354
729
1104, 1479, 1854, 2229,
5181
5709
6237






2604, 2979, 3354


169
AIT17803
355
730
1105, 1480, 1855, 2230,
5182
5710
6238






2605, 2980, 3355


170
AIT17811
356
731
1106, 1481, 1856, 2231,
5183
5711
6239






2606, 2981, 3356


171
AIT17815
357
732
1107, 1482, 1857, 2232,
5184
5712
6240






2607, 2982, 3357


172
AIT17821
358
733
1108, 1483, 1858, 2233,
5185
5713
6241






2608, 2983, 3358


173
AIT17837
359
734
1109, 1484, 1859, 2234,
5186
5714
6242






2609, 2984, 3359


174
AIT17841
360
735
1110, 1485, 1860, 2235,
5187
5715
6243






2610, 2985, 3360


175
AIT17845
361
736
1111, 1486, 1861, 2236,
5188
5716
6244






2611, 2986, 3361


176
ANH09723
362
737
1112, 1487, 1862, 2237,
5189
5717
6245






2612, 2987, 3362


177
ANH09725
363
738
1113, 1488, 1863, 2238,
5190
5718
6246






2613, 2988, 3363


178
ANH09729
364
739
1114, 1489, 1864, 2239,
5191
5719
6247






2614, 2989, 3364


179
ANH09731
365
740
1115, 1490, 1865, 2240,
5192
5720
6248






2615, 2990, 3365


180
ANH09733
366
741
1116, 1491, 1866, 2241,
5193
5721
6249






2616, 2991, 3366


181
ANH09735
367
742
1117, 1492, 1867, 2242,
5194
5722
6250






2617, 2992, 3367


182
ANH09737
368
743
1118, 1493, 1868, 2243,
5195
5723
6251






2618, 2993, 3368


183
ANH09739
369
744
1119, 1494, 1869, 2244,
5196
5724
6252






2619, 2994, 3369


184
ANH09741
370
745
1120, 1495, 1870, 2245,
5197
5725
6253






2620, 2995, 3370


185
CAA36646
371
746
1121, 1496, 1871, 2246,
5198
5726
6254






2621, 2996, 3371


186
CCA30315
372
747
1122, 1497, 1872, 2247,
5199
5727
6255






2622, 2997, 3372


187
JX985061
373
748
1123, 1498, 1873, 2248,
5200
5728
6256






2623, 2998, 3373


188
NP_694869
374
749
1124, 1499, 1874, 2249,
5201
5729
6257






2624, 2999, 3374


189
SCA79106
375
750
1125, 1500, 1875, 2250,
5202
5730
6258






2625, 3000, 3375
















TABLE 3







Zinc-binding matrix protein (Z)















column 1









NCBI



Accession
column 2
column 3
column 4
column 5
column 6
column 7


Row
No.
A
B
C
D
E
F

















1
ADY11072
3451
3604
3757, 3910, 4063, 4216,
5203
5731
6259






4369, 4522, 4675


2
AAO59514
3452
3605
3758, 3911, 4064, 4217,
5204
5732
6260






4370, 4523, 4676


3
AAO59508
3453
3606
3759, 3912, 4065, 4218,
5205
5733
6261






4371, 4524, 4677


4
AAO59510
3454
3607
3760, 3913, 4066, 4219,
5206
5734
6262






4372, 4525, 4678


5
AAT48997
3455
3608
3761, 3914, 4067, 4220,
5207
5735
6263






4373, 4526, 4679


6
AAT49005
3456
3609
3762, 3915, 4068, 4221,
5208
5736
6264






4374, 4527, 4680


7
ADU56612
3457
3610
3763, 3916, 4069, 4222,
5209
5737
6265






4375, 4528, 4681


8
ADU56616
3458
3611
3764, 3917, 4070, 4223,
5210
5738
6266






4376, 4529, 4682


9
ADU56620
3459
3612
3765, 3918, 4071, 4224,
5211
5739
6267






4377, 4530, 4683


10
ADU56624
3460
3613
3766, 3919, 4072, 4225,
5212
5740
6268






4378, 4531, 4684


11
ADU56628
3461
3614
3767, 3920, 4073, 4226,
5213
5741
6269






4379, 4532, 4685


12
ADU56632
3462
3615
3768, 3921, 4074, 4227,
5214
5742
6270






4380, 4533, 4686


13
ADU56644
3463
3616
3769, 3922, 4075, 4228,
5215
5743
6271






4381, 4534, 4687


14
AHC95545
3464
3617
3770, 3923, 4076, 4229,
5216
5744
6272






4382, 4535, 4688


15
AHC95547
3465
3618
3771, 3924, 4077, 4230,
5217
5745
6273






4383, 4536, 4689


16
AIT17124
3466
3619
3772, 3925, 4078, 4231,
5218
5746
6274






4384, 4537, 4690


17
AIT17128
3467
3620
3773, 3926, 4079, 4232,
5219
5747
6275






4385, 4538, 4691


18
AIT17132
3468
3621
3774, 3927, 4080, 4233,
5220
5748
6276






4386, 4539, 4692


19
AIT17136
3469
3622
3775, 3928, 4081, 4234,
5221
5749
6277






4387, 4540, 4693


20
AIT17140
3470
3623
3776, 3929, 4082, 4235,
5222
5750
6278






4388, 4541, 4694


21
AIT17144
3471
3624
3777, 3930, 4083, 4236,
5223
5751
6279






4389, 4542, 4695


22
AIT17152
3472
3625
3778, 3931, 4084, 4237,
5224
5752
6280






4390, 4543, 4696


23
AIT17162
3473
3626
3779, 3932, 4085, 4238,
5225
5753
6281






4391, 4544, 4697


24
AIT17166
3474
3627
3780, 3933, 4086, 4239,
5226
5754
6282






4392, 4545, 4698


25
AIT17170
3475
3628
3781, 3934, 4087, 4240,
5227
5755
6283






4393, 4546, 4699


26
AIT17174
3476
3629
3782, 3935, 4088, 4241,
5228
5756
6284






4394, 4547, 4700


27
AIT17178
3477
3630
3783, 3936, 4089, 4242,
5229
5757
6285






4395, 4548, 4701


28
AIT17182
3478
3631
3784, 3937, 4090, 4243,
5230
5758
6286






4396, 4549, 4702


29
AIT17186
3479
3632
3785, 3938, 4091, 4244,
5231
5759
6287






4397, 4550, 4703


30
AIT17194
3480
3633
3786, 3939, 4092, 4245,
5232
5760
6288






4398, 4551, 4704


31
AIT17198
3481
3634
3787, 3940, 4093, 4246,
5233
5761
6289






4399, 4552, 4705


32
AIT17202
3482
3635
3788, 3941, 4094, 4247,
5234
5762
6290






4400, 4553, 4706


33
AIT17206
3483
3636
3789, 3942, 4095, 4248,
5235
5763
6291






4401, 4554, 4707


34
AIT17214
3484
3637
3790, 3943, 4096, 4249,
5236
5764
6292






4402, 4555, 4708


35
AIT17222
3485
3638
3791, 3944, 4097, 4250,
5237
5765
6293






4403, 4556, 4709


36
AIT17226
3486
3639
3792, 3945, 4098, 4251,
5238
5766
6294






4404, 4557, 4710


37
AIT17230
3487
3640
3793, 3946, 4099, 4252,
5239
5767
6295






4405, 4558, 4711


38
AIT17234
3488
3641
3794, 3947, 4100, 4253,
5240
5768
6296






4406, 4559, 4712


39
AIT17238
3489
3642
3795, 3948, 4101, 4254,
5241
5769
6297






4407, 4560, 4713


40
AIT17242
3490
3643
3796, 3949, 4102, 4255,
5242
5770
6298






4408, 4561, 4714


41
AIT17248
3491
3644
3797, 3950, 4103, 4256,
5243
5771
6299






4409, 4562, 4715


42
AIT17252
3492
3645
3798, 3951, 4104, 4257,
5244
5772
6300






4410, 4563, 4716


43
AIT17256
3493
3646
3799, 3952, 4105, 4258,
5245
5773
6301






4411, 4564, 4717


44
AIT17260
3494
3647
3800, 3953, 4106, 4259,
5246
5774
6302






4412, 4565, 4718


45
AIT17268
3495
3648
3801, 3954, 4107, 4260,
5247
5775
6303






4413, 4566, 4719


46
AIT17272
3496
3649
3802, 3955, 4108, 4261,
5248
5776
6304






4414, 4567, 4720


47
AIT17278
3497
3650
3803, 3956, 4109, 4262,
5249
5777
6305






4415, 4568, 4721


48
AIT17282
3498
3651
3804, 3957, 4110, 4263,
5250
5778
6306






4416, 4569, 4722


49
AIT17286
3499
3652
3805, 3958, 4111, 4264,
5251
5779
6307






4417, 4570, 4723


50
AIT17290
3500
3653
3806, 3959, 4112, 4265,
5252
5780
6308






4418, 4571, 4724


51
AIT17296
3501
3654
3807, 3960, 4113, 4266,
5253
5781
6309






4419, 4572, 4725


52
AIT17300
3502
3655
3808, 3961, 4114, 4267,
5254
5782
6310






4420, 4573, 4726


53
AIT17312
3503
3656
3809, 3962, 4115, 4268,
5255
5783
6311






4421, 4574, 4727


54
AIT17316
3504
3657
3810, 3963, 4116, 4269,
5256
5784
6312






4422, 4575, 4728


55
AIT17320
3505
3658
3811, 3964, 4117, 4270,
5257
5785
6313






4423, 4576, 4729


56
AIT17326
3506
3659
3812, 3965, 4118, 4271,
5258
5786
6314






4424, 4577, 4730


57
AIT17330
3507
3660
3813, 3966, 4119, 4272,
5259
5787
6315






4425, 4578, 4731


58
AIT17342
3508
3661
3814, 3967, 4120, 4273,
5260
5788
6316






4426, 4579, 4732


59
AIT17346
3509
3662
3815, 3968, 4121, 4274,
5261
5789
6317






4427, 4580, 4733


60
AIT17362
3510
3663
3816, 3969, 4122, 4275,
5262
5790
6318






4428, 4581, 4734


61
AIT17366
3511
3664
3817, 3970, 4123, 4276,
5263
5791
6319






4429, 4582, 4735


62
AIT17376
3512
3665
3818, 3971, 4124, 4277,
5264
5792
6320






4430, 4583, 4736


63
AIT17382
3513
3666
3819, 3972, 4125, 4278,
5265
5793
6321






4431, 4584, 4737


64
AIT17386
3514
3667
3820, 3973, 4126, 4279,
5266
5794
6322






4432, 4585, 4738


65
AIT17396
3515
3668
3821, 3974, 4127, 4280,
5267
5795
6323






4433, 4586, 4739


66
AIT17402
3516
3669
3822, 3975, 4128, 4281,
5268
5796
6324






4434, 4587, 4740


67
AIT17406
3517
3670
3823, 3976, 4129, 4282,
5269
5797
6325






4435, 4588, 4741


68
AIT17410
3518
3671
3824, 3977, 4130, 4283,
5270
5798
6326






4436, 4589, 4742


69
AIT17418
3519
3672
3825, 3978, 4131, 4284,
5271
5799
6327






4437, 4590, 4743


70
AIT17422
3520
3673
3826, 3979, 4132, 4285,
5272
5800
6328






4438, 4591, 4744


71
AIT17426
3521
3674
3827, 3980, 4133, 4286,
5273
5801
6329






4439, 4592, 4745


72
AIT17432
3522
3675
3828, 3981, 4134, 4287,
5274
5802
6330






4440, 4593, 4746


73
AIT17436
3523
3676
3829, 3982, 4135, 4288,
5275
5803
6331






4441, 4594, 4747


74
AIT17444
3524
3677
3830, 3983, 4136, 4289,
5276
5804
6332






4442, 4595, 4748


75
AIT17448
3525
3678
3831, 3984, 4137, 4290,
5277
5805
6333






4443, 4596, 4749


76
AIT17452
3526
3679
3832, 3985, 4138, 4291,
5278
5806
6334






4444, 4597, 4750


77
AIT17456
3527
3680
3833, 3986, 4139, 4292,
5279
5807
6335






4445, 4598, 4751


78
AIT17460
3528
3681
3834, 3987, 4140, 4293,
5280
5808
6336






4446, 4599, 4752


79
AIT17468
3529
3682
3835, 3988, 4141, 4294,
5281
5809
6337






4447, 4600, 4753


80
AIT17472
3530
3683
3836, 3989, 4142, 4295,
5282
5810
6338






4448, 4601, 4754


81
AIT17476
3531
3684
3837, 3990, 4143, 4296,
5283
5811
6339






4449, 4602, 4755


82
AIT17480
3532
3685
3838, 3991, 4144, 4297,
5284
5812
6340






4450, 4603, 4756


83
AIT17484
3533
3686
3839, 3992, 4145, 4298,
5285
5813
6341






4451, 4604, 4757


84
AIT17488
3534
3687
3840, 3993, 4146, 4299,
5286
5814
6342






4452, 4605, 4758


85
AIT17496
3535
3688
3841, 3994, 4147, 4300,
5287
5815
6343






4453, 4606, 4759


86
AIT17500
3536
3689
3842, 3995, 4148, 4301,
5288
5816
6344






4454, 4607, 4760


87
AIT17508
3537
3690
3843, 3996, 4149, 4302,
5289
5817
6345






4455, 4608, 4761


88
AIT17512
3538
3691
3844, 3997, 4150, 4303,
5290
5818
6346






4456, 4609, 4762


89
AIT17516
3539
3692
3845, 3998, 4151, 4304,
5291
5819
6347






4457, 4610, 4763


90
AIT17520
3540
3693
3846, 3999, 4152, 4305,
5292
5820
6348






4458, 4611, 4764


91
AIT17526
3541
3694
3847, 4000, 4153, 4306,
5293
5821
6349






4459, 4612, 4765


92
AIT17536
3542
3695
3848, 4001, 4154, 4307,
5294
5822
6350






4460, 4613, 4766


93
AIT17542
3543
3696
3849, 4002, 4155, 4308,
5295
5823
6351






4461, 4614, 4767


94
AIT17546
3544
3697
3850, 4003, 4156, 4309,
5296
5824
6352






4462, 4615, 4768


95
AIT17550
3545
3698
3851, 4004, 4157, 4310,
5297
5825
6353






4463, 4616, 4769


96
AIT17554
3546
3699
3852, 4005, 4158, 4311,
5298
5826
6354






4464, 4617, 4770


97
AIT17558
3547
3700
3853, 4006, 4159, 4312,
5299
5827
6355






4465, 4618, 4771


98
AIT17570
3548
3701
3854, 4007, 4160, 4313,
5300
5828
6356






4466, 4619, 4772


99
AIT17578
3549
3702
3855, 4008, 4161, 4314,
5301
5829
6357






4467, 4620, 4773


100
AIT17586
3550
3703
3856, 4009, 4162, 4315,
5302
5830
6358






4468, 4621, 4774


101
AIT17590
3551
3704
3857, 4010, 4163, 4316,
5303
5831
6359






4469, 4622, 4775


102
AIT17602
3552
3705
3858, 4011, 4164, 4317,
5304
5832
6360






4470, 4623, 4776


103
AIT17610
3553
3706
3859, 4012, 4165, 4318,
5305
5833
6361






4471, 4624, 4777


104
AIT17614
3554
3707
3860, 4013, 4166, 4319,
5306
5834
6362






4472, 4625, 4778


105
AIT17618
3555
3708
3861, 4014, 4167, 4320,
5307
5835
6363






4473, 4626, 4779


106
AIT17622
3556
3709
3862, 4015, 4168, 4321,
5308
5836
6364






4474, 4627, 4780


107
AIT17628
3557
3710
3863, 4016, 4169, 4322,
5309
5837
6365






4475, 4628, 4781


108
AIT17632
3558
3711
3864, 4017, 4170, 4323,
5310
5838
6366






4476, 4629, 4782


109
AIT17636
3559
3712
3865, 4018, 4171, 4324,
5311
5839
6367






4477, 4630, 4783


110
AIT17640
3560
3713
3866, 4019, 4172, 4325,
5312
5840
6368






4478, 4631, 4784


111
AIT17648
3561
3714
3867, 4020, 4173, 4326,
5313
5841
6369






4479, 4632, 4785


112
AIT17652
3562
3715
3868, 4021, 4174, 4327,
5314
5842
6370






4480, 4633, 4786


113
AIT17656
3563
3716
3869, 4022, 4175, 4328,
5315
5843
6371






4481, 4634, 4787


114
AIT17660
3564
3717
3870, 4023, 4176, 4329,
5316
5844
6372






4482, 4635, 4788


115
AIT17664
3565
3718
3871, 4024, 4177, 4330,
5317
5845
6373






4483, 4636, 4789


116
AIT17668
3566
3719
3872, 4025, 4178, 4331,
5318
5846
6374






4484, 4637, 4790


117
AIT17676
3567
3720
3873, 4026, 4179, 4332,
5319
5847
6375






4485, 4638, 4791


118
AIT17680
3568
3721
3874, 4027, 4180, 4333,
5320
5848
6376






4486, 4639, 4792


119
A1117684
3569
3722
3875, 4028, 4181, 4334,
5321
5849
6377






4487, 4640, 4793


120
AIT17692
3570
3723
3876, 4029, 4182, 4335,
5322
5850
6378






4488, 4641, 4794


121
AIT17702
3571
3724
3877, 4030, 4183, 4336,
5323
5851
6379






4489, 4642, 4795


122
AIT17708
3572
3725
3878, 4031, 4184, 4337,
5324
5852
6380






4490, 4643, 4796


123
AIT17716
3573
3726
3879, 4032, 4185, 4338,
5325
5853
6381






4491, 4644, 4797


124
AIT17720
3574
3727
3880, 4033, 4186, 4339,
5326
5854
6382






4492, 4645, 4798


125
AIT17730
3575
3728
3881, 4034, 4187, 4340,
5327
5855
6383






4493, 4646, 4799


126
AIT17734
3576
3729
3882, 4035, 4188, 4341,
5328
5856
6384






4494, 4647, 4800


127
AIT17738
3577
3730
3883, 4036, 4189, 4342,
5329
5857
6385






4495, 4648, 4801


128
AIT17750
3578
3731
3884, 4037, 4190, 4343,
5330
5858
6386






4496, 4649, 4802


129
AIT17754
3579
3732
3885, 4038, 4191, 4344,
5331
5859
6387






4497, 4650, 4803


130
AIT17760
3580
3733
3886, 4039, 4192, 4345,
5332
5860
6388






4498, 4651, 4804


131
AIT17768
3581
3734
3887, 4040, 4193, 4346,
5333
5861
6389






4499, 4652, 4805


132
AIT17776
3582
3735
3888, 4041, 4194, 4347,
5334
5862
6390






4500, 4653, 4806


133
AIT17780
3583
3736
3889, 4042, 4195, 4348,
5335
5863
6391






4501, 4654, 4807


134
AIT17784
3584
3737
3890, 4043, 4196, 4349,
5336
5864
6392






4502, 4655, 4808


135
AIT17788
3585
3738
3891, 4044, 4197, 4350,
5337
5865
6393






4503, 4656, 4809


136
AIT17792
3586
3739
3892, 4045, 4198, 4351,
5338
5866
6394






4504, 4657, 4810


137
AIT17796
3587
3740
3893, 4046, 4199, 4352,
5339
5867
6395






4505, 4658, 4811


138
AIT17800
3588
3741
3894, 4047, 4200, 4353,
5340
5868
6396






4506, 4659, 4812


139
AIT17804
3589
3742
3895, 4048, 4201, 4354,
5341
5869
6397






4507, 4660, 4813


140
AIT17808
3590
3743
3896, 4049, 4202, 4355,
5342
5870
6398






4508, 4661, 4814


141
AIT17834
3591
3744
3897, 4050, 4203, 4356,
5343
5871
6399






4509, 4662, 4815


142
AIT17838
3592
3745
3898, 4051, 4204, 4357,
5344
5872
6400






4510, 4663, 4816


143
AIT17842
3593
3746
3899, 4052, 4205, 4358,
5345
5873
6401






4511, 4664, 4817


144
ANH09743
3594
3747
3900, 4053, 4206, 4359,
5346
5874
6402






4512, 4665, 4818


145
ANH09745
3595
3748
3901, 4054, 4207, 4360,
5347
5875
6403






4513, 4666, 4819


146
ANH09749
3596
3749
3902, 4055, 4208, 4361,
5348
5876
6404






4514, 4667, 4820


147
ANH09751
3597
3750
3903, 4056, 4209, 4362,
5349
5877
6405






4515, 4668, 4821


148
ANH09753
3598
3751
3904, 4057, 4210, 4363,
5350
5878
6406






4516, 4669, 4822


149
ANH09755
3599
3752
3905, 4058, 4211, 4364,
5351
5879
6407






4517, 4670, 4823


150
ANH09759
3600
3753
3906, 4059, 4212, 4365,
5352
5880
6408






4518, 4671, 4824


151
ANH09761
3601
3754
3907, 4060, 4213, 4366,
5353
5881
6409






4519, 4672, 4825


152
AMR44579
3602
3755
3908, 4061, 4214, 4367,
5354
5882
6410






4520, 4673, 4826


153
AMZ00378
3603
3756
3909, 4062, 4215, 4368,
5355
5883
6411






4521, 4674, 4827









Preferred Lassa virus polypeptide, nucleic acid and mRNA sequences are provided in Table 1, 2 and 3. Therein, each row represents a specific suitable Lassa virus construct of the invention. The protein design/name is indicated for each row (column “Name”). Therein, each row corresponds to a glycoprotein precursor (GPC) (Table 1) or nucleoprotein (NP) (Table 2) or nucleoprotein (NP) zinc-binding matrix protein (Z) (Table 3) as identified by the database accession number of the corresponding protein (first column “NCBI Accession No.”). The second column in Table 1, 2 or 3 (“A”) indicates the SEQ ID NO: corresponding to the respective amino acid sequence as provided herein. The SEQ ID NO: corresponding to the nucleic acid sequence of the wild type mRNA encoding the protein is indicated in the third column (“B”). The fourth column (“C”) provides the SEQ ID NO:'s corresponding to modified/optimized nucleic acid sequences of the mRNAs as described herein that encode the protein preferably having the amino acid sequence as defined by the SEQ ID NO: indicated in the second column (“A”) or by the database entry indicated in the first column. mRNA constructs comprising coding sequences encoding said proteins are provided in column 5 “D” “SEQ ID NO: mRNA design 1” column 6 “E” “SEQ ID NO: mRNA design 2” and column 7 “E” “SEQ ID NO: mRNA design 3”. Additional information regarding each of the sequences provided in Table 1, 2 and 3 may also be derived from the sequence listing, in particular from the details provided therein under identifier <223>.


Wild Type Antigenic Proteins


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one full-length, wild type protein derived from a lassa virus, preferably a full-length, wild type glycoprotein precursor (GPC) derived from a lassa virus, said glycoprotein precursor (GPC) preferably comprising or consisting of an amino acid sequence as defined by the database accession number listed in Table 1, or preferably by any one of SEQ ID NO: 1-186 (see Table 1, column “A”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one full-length, wild type protein derived from a lassa virus, preferably a full-length, wild type nucleoprotein (NP) derived from a lassa virus, said nucleoprotein (NP) preferably comprising or consisting of an amino acid sequence as defined by the database accession number listed in Table 2, or preferably by any one of SEQ ID NO: 187-375 (see Table 2, column “A”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one full-length, wild type protein derived from a lassa virus, preferably a full-length, wild type zinc-binding matrix protein (Z) derived from a lassa virus, said zinc-binding matrix protein (Z) preferably comprising or consisting of an amino acid sequence as defined by the database accession number listed in Table 3, or preferably by any one of SEQ ID NO: 3451-3603 (see Table 3, column “A”).


A full-length “naturally occurring (wild type)” protein may be encoded by a naturally occurring (wild type) nucleic acid sequence, in particular RNA sequence, or (due to the degeneracy of the genetic code) by a nucleic acid sequence “variant”. Thus, in preferred embodiments, the inventive mRNA, and in particular its at least one coding region, comprises a wild type RNA sequence or an RNA sequence variant encoding a full-length, wild type protein as defined herein.


Accordingly, the at least one coding sequence of the inventive mRNA may preferably comprise or consist of a wild type RNA sequence encoding a full-length, wild type protein as defined herein.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a full-length, wild type glycoprotein precursor (GPC) derived from a lassa virus, said mRNA (in particular its at least one coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 376-561 (see Table 1, column “B”) or a variant of any of these sequences.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a full-length, wild type nucleoprotein (NP) derived from a lassa virus, said mRNA (in particular its at least one coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 562-750 (see Table 2, column “B”) or a variant of any of these sequences.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a full-length, wild type zinc-binding matrix protein (Z) derived from a lassa virus, said mRNA (in particular its at least one coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 3604-3756 (see Table 3, column “B”) or a variant of any of these sequences.


In this context, a “variant” RNA sequence differs from the respective wild type RNA sequence in at least one nucleic acid residue, preferably without resulting (due to the degenerated genetic code) in an alteration of the encoded amino acid sequence. According to a preferred embodiment, said “variant” RNA sequence encoding a full-length, wild type protein comprises or consists of an RNA sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% with an RNA sequence encoding said full-length, wild type protein, preferably an RNA sequence according to SEQ ID NO: 376-561 (encoding a GPC or a variant thereof) or an RNA sequence according to SEQ ID NO: 562-750 (encoding an NP or a variant thereof) or an RNA sequence according to SEQ ID NO: 3604-3756 (encoding an Z or a variant thereof).


Variant Antigenic Proteins


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one full-length wild type protein derived from a lassa virus, preferably a full-length, “variant” glycoprotein precursor (GPC) and/or a full-length, “variant” nucleoprotein (NP) and/or a full-length, “variant” zinc-binding matrix protein (Z) comprising or consisting of a “variant” amino acid sequence of the sequences as defined by the database accession number provided under the respective column in Table 1, 2 or 3 (column “A”), and more preferably comprising or consisting of a “variant” amino acid sequence of the sequences as defined by the SEQ ID NOs: 1-186 (GPC) or SEQ ID NOs: 187-375 (NP) or SEQ ID NOs: 3451-3603 (Z).


Preferably, the sequence of a protein “variant” or “sequence variant” differs in at least one amino acid residue from the amino acid sequence of the naturally occurring (wild type) protein serving as a reference (or “parent”) sequence. Variant proteins thus preferably comprise at least one amino acid mutation, substitution, insertion or deletion as compared to their respective reference sequence. Preferably, the term “variant” as used herein comprises any homolog, isoform or transcript variant of a protein as defined herein, wherein the homolog, isoform or transcript variant is preferably characterized by a degree of identity or homology, respectively, as defined herein.


A protein “variant” encoded by the at least one coding sequence of the inventive mRNA may comprise at least one amino acid substitution as compared to the wild type (naturally occurring) antigen amino acid sequence. Said substitution may be selected from a conservative or non-conservative substitution. In some embodiments, it is preferred that a protein “variant” encoded by the at least one coding sequence of the inventive mRNA comprises at least one conservative amino acid substitution, wherein amino acids, which originate from the same class, are exchanged for one another. In particular, these are amino acids having aliphatic side chains, positively or negatively charged side chains, aromatic groups in the side chains or amino acids, the side chains of which can form hydrogen bridges, e.g. side chains which have a hydroxyl function. By conservative constitution, e.g. an amino acid having a polar side chain may be replaced by another amino acid having a corresponding polar side chain, or, for example, an amino acid characterized by a hydrophobic side chain may be substituted by another amino acid having a corresponding hydrophobic side chain (e.g. serine (threonine) by threonine (serine) or leucine (isoleucine) by isoleucine (leucine)). The protein “variant” may also comprise amino acid mutations, insertions, deletions and/or non-conservative substitutions, in particular, at those sequence positions, which do not impair the functionality of the epitope(s) of the encoded antigenic protein(s).


Accordingly, in preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one “variant” protein derived from a lassa virus, preferably a variant glycoprotein precursor (GPC) derived from a lassa virus, said glycoprotein precursor (GPC) preferably comprising or consisting of a “variant” amino acid sequence of the sequences as defined by any one of SEQ ID NOs: 1-186, wherein said “variant” preferably comprises or consists of an amino acid sequence having sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 1-186.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one “variant” protein derived from a lassa virus, preferably a variant nucleoprotein (NP) derived from a lassa virus, said nucleoprotein (NP) preferably comprising or consisting of a “variant” amino acid sequence of the sequences as defined by any one of SEQ ID NO: 187-375, wherein said “variant” preferably comprises or consists of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 187-375.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one “variant” protein derived from a lassa virus, preferably a variant zinc-binding matrix protein (Z) derived from a lassa virus, said zinc-binding matrix protein (Z) preferably comprising or consisting of a “variant” amino acid sequence of the sequences as defined by any one of SEQ ID NO: 3451-3603, wherein said “variant” preferably comprises or consists of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 3451-3603.


Such “variant” antigenic proteins or peptides may be encoded by “variant” RNA sequence as defined above.


Accordingly, in preferred embodiments, the inventive mRNA comprises at least one coding region encoding a “variant” glycoprotein precursor (GPC) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 376-561 or a variant thereof, wherein said “variant” RNA sequence has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 376-561 or a variant or fragment of any of said sequences.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a variant nucleoprotein (NP) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 562-750 or a variant thereof, wherein said variant RNA sequence has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 562-750 or a variant or fragment of any of said sequences.


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a variant zinc-binding matrix protein (Z) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 3604-3756 or a variant thereof, wherein said variant RNA sequence has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 3604-3756 or a variant or fragment of any of said sequences.


Fragments of Antigenic Proteins or Variants Thereof


According to further preferred embodiment, the at least one coding sequence of the inventive mRNA encodes a fragment of a protein or a variant thereof as defined herein. Said fragment can be of any length, provided that it preferably comprises or yields at least one functional epitope.


In the context of the present invention, a “fragment” of a protein (or a variant thereof) may comprise a sequence of a protein (or a variant thereof) as defined above, which is, with regard to its amino acid sequence (or its encoded nucleic acid sequence), N-terminally, C-terminally and/or intrasequentially truncated compared to the amino acid sequence of the naturally occurring protein or variant thereof (or its encoded nucleic acid sequence). Such truncation may thus occur either on the amino acid level or on the nucleic acid level, respectively. A sequence identity with respect to such a fragment as defined herein therefore preferably refers to the entire protein (or a variant thereof) as defined herein or to the entire (coding) nucleic acid sequence of such a protein (or a variant thereof).


A preferred “fragment” in the context of the present invention, consists of a continuous stretch of entities, such as nucleotides or amino acids corresponding to a continuous stretch of entities in the molecule the fragment is derived from, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, and most preferably at least 80% of the total (i.e. full-length) molecule from which the fragment is derived.


A “fragment” of a protein (or a variant thereof) may particularly comprise or consist of an amino acid sequence of said protein (or a variant thereof) as defined herein, having a length of about 5 to about 20 or even more amino acids and which is preferably processed and presented by an MHC complex. Preferably, a fragment of a protein (or a variant thereof) may comprise or consist of an amino acid sequence of said protein (or a variant thereof) as defined herein, which has a length of about 6 to about 20 or even more amino acids, e.g. a fragment as processed and presented by MHC class I molecules, preferably having a length of about 8 to about 10 amino acids, e.g. 8, 9, or 10, (or even 6, 7, 11, or 12 amino acids), or a fragment as processed and presented by MHC class II molecules, preferably having a length of about 13 or more amino acids, e.g. 13, 14, 15, 16, 17, 18, 19, 20 or even more amino acids, wherein the fragment may be selected from any part of the amino acid sequence. These fragments are typically recognized by T-cells in the form of a complex consisting of the peptide fragment and an MHC molecule, i.e. the fragments are typically not recognized in their “native” or “free” form, but rather in MHC-bound form.


Preferably, a “fragment” of a protein (or a variant thereof) encoded by the at least one coding sequence of the inventive mRNA may typically comprise or consist of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an amino acid sequence of an antigenic protein or peptide as defined herein, preferably as defined by the database accession number provided under the respective column in Table 1-3, and more preferably as defined by any one of the SEQ ID NO:'s listed in Tables 1-3 (SEQ ID NOs. 1-375 and 3451-3603) or a variant thereof.


Accordingly, in preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one fragment of a protein or variant thereof derived from a lassa virus, preferably a glycoprotein precursor (GPC) derived from a lassa virus, said glycoprotein precursor (GPC) preferably comprising or consisting of an amino acid sequence of the sequences as defined by any one of SEQ ID NOs: 1-186, wherein said fragment preferably comprises or consists of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 1-186 (or Table 1 column “A”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one fragment of a protein or variant thereof derived from a lassa virus, preferably a variant nucleoprotein (NP) derived from a lassa virus, said nucleoprotein (NP) preferably comprising or consisting of an amino acid sequence of the sequences as defined by any one of SEQ ID NO: 187-375, wherein said fragment preferably comprises or consists of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 187-375 (or Table 2 column “A”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one fragment of a protein or variant thereof derived from a lassa virus, preferably a variant zinc-binding matrix protein (Z) derived from a lassa virus, said zinc-binding matrix protein (Z) preferably comprising or consisting of an amino acid sequence of the sequences as defined by any one of SEQ ID NO: 3451-3603, wherein said fragment preferably comprises or consists of an amino acid sequence having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97% to any one of the sequences as defined by SEQ ID NOs: 3451-3603 (or Table 3 column “A”).


Accordingly, in preferred embodiments, the inventive mRNA comprises at least one coding region encoding a fragment of a glycoprotein precursor (GPC) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 376-561 or a fragment thereof, wherein said RNA sequence fragment has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 376-561 (or Table 1 column “B”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a variant nucleoprotein (NP) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 562-750 or a fragment thereof, wherein said RNA sequence fragment has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 562-750 (or Table 2 column “B”).


In preferred embodiments, the inventive mRNA comprises at least one coding region encoding a variant zinc-binding matrix protein (Z) derived from a lassa virus, said mRNA (in particular its coding region) comprising or consisting of an RNA sequence as defined by SEQ ID NOs: 3604-3756 or a fragment thereof, wherein said RNA sequence fragment has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 3604-3756 (or Table 3 column “B”).


In accordance with the above, in preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus according to SEQ ID NOs: 1-186 or a variant or fragment thereof. The coding region of said mRNA preferably comprises an RNA sequence selected from RNA sequences having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 376-561. Specifically, said coding region may comprise an RNA sequence selected from RNA sequences as defined by SEQ ID NOs: 751-936, 1126-1311, 1501-1686, 1876-2061, 2251-2436, 2626-2811 or 3001-3186 (or Table 1 column “C”).


In further preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus according to SEQ ID NOs. 187-375 or a variant or fragment thereof. Said coding region preferably comprises an RNA sequence selected from RNA sequences has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 562-750. Specifically, said coding region may comprise an RNA sequence selected from RNA sequences as defined by SEQ ID NOs: 937-1125, 1312-1500, 1687-1875, 2062-2250, 2437-2625, 2812-3000 or 3187-3375 (or Table 2 column “C”).


In further preferred embodiments, the inventive mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus according to SEQ ID NOs. 3451-3603 or a variant or fragment thereof. Said coding region preferably comprises an RNA sequence selected from RNA sequences has a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with an RNA sequence as defined by SEQ ID NOs: 3604-3756. Specifically, said coding region may comprise an RNA sequence selected from RNA sequences as defined by SEQ ID NOs: 3757-3909, 3910-4062, 4063-4215, 4216-4368, 4369-4521, 4522-4674, or 4675-4827 (or Table 3 column “C”).


Mono-, Bi- and Multicistronic RNA


According to certain embodiments of the present invention, the mRNA is mono-, bi-, or multicistronic, preferably as defined herein. The coding sequences in a bi- or multicistronic mRNA preferably encode distinct antigenic peptides or proteins as defined herein (or variants or fragments of said peptides/proteins). Preferably, the coding sequences encoding two or more antigenic peptides or proteins may be separated in the bi- or multicistronic mRNA by at least one IRES sequence.


An “IRES” (internal ribosomal entry site) is an RNA element that allows for translation initiation in a cap-independent manner.


Thus, the term “encoding two or more peptides or proteins” may mean, without being limited thereto, that the bi- or even multicistronic mRNA, may encode e.g. at least two, three, four, five, six or more (preferably different) antigenic peptides or proteins or their variants or fragments within the definitions provided herein. More preferably, without being limited thereto, the bi- or even multicistronic mRNA, may encode, for example, at least two, three, four, five, six or more (preferably different) antigenic peptides or proteins as defined herein or their variants or fragments as defined herein.


In this context, a so-called IRES (internal ribosomal entry site) sequence as defined above can function as a sole ribosome binding site, but it can also serve to provide a bi- or even multicistronic mRNA as defined above, which encodes several antigenic peptides or proteins which are to be translated by the ribosomes independently of one another. Examples of IRES sequences, which can be used according to the invention, are those from picornaviruses (e.g. FMDV), pestiviruses (CFFV), polioviruses (PV), encephalomyocarditis viruses (ECMV), foot and mouth disease viruses (FMDV), hepatitis C viruses (HCV), classical swine fever viruses (CSFV), mouse leukoma virus (MLV), simian immunodeficiency viruses (SIV) or cricket paralysis viruses (CrPV).


According to further embodiments, the at least one coding region of the mRNA according to the invention may encode at least two, three, four, five, six, seven, eight and more antigenic peptides or proteins (or variants or fragments thereof) as defined herein linked with or without an amino acid linker sequence, wherein said linker sequence may comprise rigid linkers, flexible linkers, cleavable linkers (e.g., self-cleaving peptides) or a combination thereof. Therein, the antigenic peptides or proteins may be identical or different or a combination thereof.


In some embodiments, the at least one coding region of the mRNA according to the invention comprises at least two, three, four, five, six, seven, eight or more nucleic acid sequences identical to or having a sequence identity of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, preferably of at least 70%, more preferably of at least 80%, even more preferably at least 85%, even more preferably of at least 90% and most preferably of at least 95% or even 97%, with any one of the nucleic acid sequences disclosed in Tables 1-3 herein, or a variant or fragment of any one of said nucleic acid sequences.


Preferably, the mRNA comprising at least one coding region as defined herein typically comprises a length of about 50 to about 20000, or 100 to about 20000 nucleotides, preferably of about 250 to about 20000 nucleotides, more preferably of about 500 to about 10000, even more preferably of about 500 to about 5000.


Circular RNA:


In embodiments, the nucleic acid is an RNA, in particular a circular RNA. As used herein, “circular RNA” has to be understood as a circular polynucleotide that can encode at least one antigenic peptide or protein as defined herein. Accordingly, in preferred embodiments, said circular RNA comprises at least one coding sequence encoding at least one antigenic peptide or protein derived from a Lassa virus or a fragment or variant thereof as defined herein.


The production of circRNAs can be performed using various methods provided in the art. For example, US6210931 teaches a method of synthesizing circRNAs by inserting DNA fragments into a plasmid containing sequences having the capability of spontaneous cleavage and self-circularization. U.S. Pat. No. 5,773,244 teaches producing circRNAs by making a DNA construct encoding an RNA cyclase ribozyme, expressing the DNA construct as an RNA, and then allowing the RNA to self-splice, which produces a circRNA free from intron in vitro. WO1992/001813 teaches a process of making single strand circular nucleic acids by synthesizing a linear polynucleotide, combining the linear nucleotide with a complementary linking oligonucleotide under hybridization conditions, and ligating the linear polynucleotide. The person skilled in the art may also use methods provided in WO2015/034925 or WO2016/011222 to produce circular RNA. Accordingly, methods for producing circular RNA as provided in U.S. Pat. Nos. 6,210,931, 5,773,244, WO1992/001813, WO2015/034925 and WO2016/011222 are incorporated herewith by reference.


RNA In Vitro Transcription and RNA Purification:


In a preferred embodiment, the artificial nucleic acid is an RNA, preferably an mRNA.


The artificial RNA according to the present invention may be prepared using any method known in the art, including chemical synthesis such as e.g. solid phase RNA synthesis, as well as in vitro methods, such as RNA in vitro transcription reactions.


In a preferred embodiment, the artificial nucleic acid as defined herein, preferably the RNA as defined herein, is obtained by RNA in vitro transcription. Accordingly, the RNA of the invention is preferably an in vitro transcribed RNA.


The terms “RNA in vitro transcription” or “in vitro transcription” relate to a process wherein RNA is synthesized in a cell-free system (in vitro) as defined above. DNA, particularly plasmid DNA (or PCR product), is typically used as template for the generation of RNA transcripts.


In embodiments, the nucleotide mixture used in RNA in vitro transcription may additionally contain modified nucleotides as defined herein. In embodiments, the nucleotide mixture (i.e. the fraction of each nucleotide in the mixture) may be optimized for the given RNA sequence, preferably as described WO2015/188933.


In embodiment where more than one different artificial nucleic acid as defined herein has to be produced, e.g. where 2, 3, 4, 5, 6, 7, 8, 9, 10 or even more different artificial nucleic acids have to be produced (e.g. encoding different antigenic peptides, proteins of Lassa virus), procedures as described in WO2017/109134 may be suitably used.


In the context of nucleic acid vaccine production, it may be required to provide GMP-grade RNA. GMP-grade RNA may be suitably produced using a manufacturing process approved by regulatory authorities. Accordingly, in a particularly preferred embodiment, RNA production is performed under current good manufacturing practice (GMP), implementing various quality control steps on DNA and RNA level, according to WO2016/180430. Accordingly, the RNA of the invention is a GMP-grade RNA, particularly a GMP-grade mRNA.


The obtained RNA products are preferably purified using PureMessenger® (CureVac, Tubingen, Germany; RP-HPLC according to WO2008/077592) and/or tangential flow filtration (as described in WO2016/193206).


In a preferred embodiment, the RNA, particularly the purified RNA, is lyophilized according to WO2016/165831 or WO2011/069586 to yield a temperature stable dried artificial nucleic acid (powder) as defined herein. The RNA of the invention, particularly the purified RNA may also be dried using spray-drying or spray-freeze drying according to WO2016/184575 or WO2016184576 to yield a temperature stable artificial nucleic acid (powder) as defined herein. Accordingly, in the context of manufacturing and purifying nucleic acids, particularly RNA, the disclosures of WO2017/109161, WO2015/188933, WO2016/180430, WO2008/077592, WO2016/193206, WO2016/165831, WO2011/069586, WO2016/184575, and WO2016/184576 are incorporated herewith by reference.


Accordingly, in preferred embodiments the RNA is a dried RNA, particularly a dried mRNA.


The term “dried RNA” as used herein has to be understood as RNA that has been lyophilized, or spray-dried, or spray-freeze dried as defined above to obtain a temperature stable dried RNA (powder).


Accordingly, in preferred embodiments the RNA is a purified RNA, particularly purified mRNA.


The term “purified RNA” as used herein has to be understood as RNA which has a higher purity after certain purification steps (e.g. HPLC, TFF, precipitation steps) than the starting material (e.g. in vitro transcribed RNA). Typical impurities that are essentially not present in purified RNA comprise peptides or proteins (e.g. enzymes derived from DNA dependent RNA in vitro transcription, e.g. RNA polymerases, RNases, BSA, pyrophosphatase, restriction endonuclease, DNase), spermidine, abortive RNA sequences, RNA fragments, free nucleotides (modified nucleotides, conventional NTPs, cap analogue), plasmid DNA fragments, buffer components (HEPES, TRIS, MgCl2) etc. Other impurities that may be derived from e.g. fermentation procedures comprise bacterial impurities (bioburden, bacterial DNA) or impurities derived from purification procedures (organic solvents etc.). Accordingly, it is desirable in this regard for the “degree of RNA purity” to be as close as possible to 100%. It is also desirable for the degree of RNA purity that the amount of full length RNA transcripts is as close as possible to 100%. Accordingly “purified RNA” as used herein has a degree of purity of more than 70%, 75%, 80%, 85%, very particularly 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% and most favorably 99% or more. The degree of purity may for example be determined by an analytical HPLC, wherein the percentages provided above correspond to the ratio between the area of the peak for the target RNA and the total area of all peaks representing the by-products. Alternatively, the degree of purity may for example be determined by an analytical agarose gel electrophoresis or capillary gel electrophoresis.


It has to be understood that “dried RNA” as defined herein and “purified RNA” as defined herein or “GMP-grade mRNA” as defined herein may have superior stability characteristics and improved efficiency (e.g. better translatability of the mRNA in vivo).


RNA Modifications


According to preferred embodiments, the inventive mRNA is an artificial mRNA. According to further preferred embodiments, the inventive mRNA is a modified mRNA. Suitable modifications envisaged in the context of the present invention are described below.


In this context, a modification as defined herein preferably leads to a stabilization of said mRNA. More preferably, the invention thus provides a stabilized mRNA According to a preferred embodiment, the inventive mRNA is thus be provided as a “stabilized” mRNA, i.e. which is essentially resistant to in vivo degradation (e.g. by an exo- or endo-nuclease).


Preferably, a stabilized mRNA in the context of the present invention is stabilized in a cell, such as a prokaryotic or eukaryotic cell, preferably in a mammalian cell, such as a human cell. Stabilization may also occur outside of cells, e.g. in a buffer solution etc., for example, in a manufacturing process for a pharmaceutical composition comprising the stabilized mRNA.


Such stabilization can be effected, for example, by a modified phosphate backbone of the inventive mRNA. A backbone modification in connection with the present invention is a modification, wherein phosphates of the backbone of the nucleotides contained in said mRNA are chemically modified. Nucleotides that may be preferably used in this connection contain e.g. a phosphorothioate-modified phosphate backbone, preferably at least one of the phosphate oxygens contained in the phosphate backbone being replaced by a sulfur atom. Stabilized mRNAs may further include, for example: non-ionic phosphate analogues, such as, for example, alkyl and aryl phosphonates, in which the charged phosphonate oxygen is replaced by an alkyl or aryl group, or phosphodiesters and alkylphosphotriesters, in which the charged oxygen residue is present in alkylated form. Such backbone modifications typically include, without implying any limitation, modifications from the group consisting of methylphosphonates, phosphoramidates and phosphorothioates (e.g. cytidine-5′-O-(1-thiophosphate)).


In the following, specific modifications are described, which are preferably capable of “stabilizing” the mRNA according to the invention.


Chemical Modifications


The term “modification” as used herein may refer to chemical modifications comprising backbone modifications as well as sugar modifications or base modifications.


In this context, a “modified” mRNA may contain nucleotide analogues/modifications, e.g. backbone modifications, sugar modifications or base modifications. A backbone modification in connection with the present invention is a modification, in which phosphates of the backbone of the nucleotides contained in said mRNA herein are chemically modified. A sugar modification in connection with the present invention is a chemical modification of the sugar of the nucleotides of the mRNA. Furthermore, a base modification in connection with the present invention is a chemical modification of the base moiety of the nucleotides of the mRNA. In this context, nucleotide analogues or modifications are preferably selected from nucleotide analogues, which are applicable for transcription and/or translation.


Sugar Modifications:


The modified nucleosides and nucleotides, which may be incorporated into a “modified” mRNA as defined herein, can be modified in the sugar moiety.


For example, the 2′ hydroxyl group (OH) can be modified or replaced with a number of different “oxy” or “deoxy” substituents. Examples of “oxy”-2′ hydroxyl group modifications include, but are not limited to, alkoxy or aryloxy (—OR, e.g., R═H, alkyl, cycloalkyl, aryl, aralkyl, heteroaryl or sugar); polyethyleneglycols (PEG), —O(CH2CH2O)nCH2CH2OR; “locked” nucleic acids (LNA) in which the 2′ hydroxyl is connected, e.g., by a methylene bridge, to the 4′ carbon of the same ribose sugar; and amino groups (—O-amino, wherein the amino group, e.g., NRR, can be alkylamino, dialkylamino, heterocyclyl, arylamino, diarylamino, heteroarylamino, or diheteroaryl amino, ethylene diamine, polyamino) or aminoalkoxy.


“Deoxy” modifications include hydrogen, amino (e.g. NH2; alkylamino, dialkylamino, heterocyclyl, arylamino, diaryl amino, heteroaryl amino, diheteroaryl amino, or amino acid); or the amino group can be attached to the sugar through a linker, wherein the linker comprises one or more of the atoms C, N, and O.


The sugar group can also contain one or more carbons that possess the opposite stereochemical configuration than that of the corresponding carbon in ribose. Thus, a modified mRNA can include nucleotides containing, for instance, arabinose as the sugar.


Backbone Modifications:


The phosphate backbone may further be modified in the modified nucleosides and nucleotides, which may be incorporated into a “modified” mRNA as defined herein.


The phosphate groups of the backbone can be modified by replacing one or more of the oxygen atoms with a different substituent. Further, the modified nucleosides and nucleotides can include the full replacement of an unmodified phosphate moiety with a modified phosphate as described herein.


Examples of modified phosphate groups include, but are not limited to, phosphorothioate, phosphoroselenates, borano phosphates, borano phosphate esters, hydrogen phosphonates, phosphoroamidates, alkyl or aryl phosphonates and phosphotriesters. Phosphorodithioates have both non-linking oxygens replaced by sulfur. The phosphate linker can also be modified by the replacement of a linking oxygen with nitrogen (bridged phosphoroamidates), sulfur (bridged phosphorothioates) and carbon (bridged methylene-phosphonates).


Base Modifications:


The modified nucleosides and nucleotides, which may be incorporated into a “modified” mRNA as defined herein can further be modified in the nucleobase moiety.


Examples of nucleobases found in RNA include, but are not limited to, adenine, guanine, cytosine and uracil. For example, the nucleosides and nucleotides described herein can be chemically modified on the major groove face. In some embodiments, the major groove chemical modifications can include an amino group, a thiol group, an alkyl group, or a halo group.


In some embodiments of the present invention, the nucleotide analogues/modifications are selected from base modifications, which are preferably selected from 2-amino-6-chloropurineriboside-5′-triphosphate, 2-Aminopurine-riboside-5′-triphosphate; 2-aminoadenosine-5-triphosphate, 2′-Amino-2′-deoxycytidine-triphosphate, 2-thiocytidine-5-triphosphate, 2-thiouridine-5-triphosphate, 2′-Fluorothymidine-5′-triphosphate, 2′-O-Methyl-inosine-5-triphosphate 4-thiouridine-5′-triphosphate, 5-aminoallylcytidine-5′-triphosphate, 5-aminoallyluridine-5′-triphosphate, 5-bromocytidine-5′-triphosphate, 5-bromouridine-5′-triphosphate, 5-Bromo-2′-deoxycytidine-5′-triphosphate, 5-Bromo-2′-deoxyuridine-5′-triphosphate, 5-iodocytidine-5′-triphosphate, 5-Iodo-2′-deoxycytidine-5′-triphosphate, 5-iodouridine-5′-triphosphate, 5-Iodo-2′-deoxyuridine-5′-triphosphate, 5-methylcytidine-5′-triphosphate, 5-methyluridine-5′-triphosphate, 5-Propynyl-2′-deoxycytidine-5′-triphosphate, 5-Propynyl-2′-deoxyuridine-5′-triphosphate, 6-azacytidine-5′-triphosphate, 6-azauridine-5′-triphosphate, 6-chloropurineriboside-5′-triphosphate, 7-deazaadenosine-5′-triphosphate, 7-deazaguanosine-5′-triphosphate, 8-azaadenosine-5′-triphosphate, 8-azidoadenosine-5′-triphosphate, benzimidazole-riboside-5′-triphosphate, N1-methyladenosine-5′-triphosphate, N1-methylguanosine-5′-triphosphate, N6-methyladenosine-5′-triphosphate, O6-methylguanosine-5′-triphosphate, pseudouridine-5′-triphosphate, or puromycin-5′-triphosphate, xanthosine-5′-triphosphate. Particular preference is given to nucleotides for base modifications selected from the group of base-modified nucleotides consisting of 5-methylcytidine-5′-triphosphate, 7-deazaguanosine-5′-triphosphate, 5-bromocytidine-5′-triphosphate, and pseudouridine-5′-triphosphate.


In some embodiments, modified nucleosides include pyridin-4-one ribonucleoside, 5-aza-uridine, 2-thio-5-aza-uridine, 2-thiouridine, 4-thio-pseudouridine, 2-thio-pseudouridine, 5-hydroxyuridine, 3-methyluridine, 5-carboxymethyl-uridine, 1-carboxymethyl-pseudouridine, 5-propynyl-uridine, 1-propynyl-pseudouridine, 5-taurinomethyluridine, 1-taurinomethyl-pseudouridine, 5-taurinomethyl-2-thio-uridine, 1-taurinomethyl-4-thio-uridine, 5-methyl-uridine, 1-methyl-pseudouridine, 4-thio-1-methyl-pseudouridine, 2-thio-1-methyl-pseudouridine, 1-methyl-1-deaza-pseudouridine, 2-thio-1-methyl-1-deaza-pseudouridine, dihydrouridine, dihydropseudouridine, 2-thio-dihydrouridine, 2-thio-dihydropseudouridine, 2-methoxyuridine, 2-methoxy-4-thio-uridine, 4-methoxy-pseudouridine, and 4-methoxy-2-thio-pseudouridine.


In some embodiments, modified nucleosides include 5-aza-cytidine, pseudoisocytidine, 3-methyl-cytidine, N4-acetylcytidine, 5-formylcytidine, N4-methylcytidine, 5-hydroxymethylcytidine, 1-methyl-pseudoisocytidine, pyrrolo-cytidine, pyrrolo-pseudoisocytidine, 2-thio-cytidine, 2-thio-5-methyl-cytidine, 4-thio-pseudoisocytidine, 4-thio-1-methyl-pseudoisocytidine, 4-thio-1-methyl-1-deaza-pseudoisocytidine, 1-methyl-1-deaza-pseudoisocytidine, zebularine, 5-aza-zebularine, 5-methyl-zebularine, 5-aza-2-thio-zebularine, 2-thio-zebularine, 2-methoxy-cytidine, 2-methoxy-5-methyl-cytidine, 4-methoxy-pseudoisocytidine, and 4-methoxy-1-methyl-pseudoisocytidine.


In other embodiments, modified nucleosides include 2-aminopurine, 2, 6-diaminopurine, 7-deaza-adenine, 7-deaza-8-aza-adenine, 7-deaza-2-aminopurine, 7-deaza-8-aza-2-aminopurine, 7-deaza-2,6-diaminopurine, 7-deaza-8-aza-2,6-diaminopurine, 1-methyladenosine, N6-methyladenosine, N6-isopentenyladenosine, N6-(cis-hydroxyisopentenyl)adenosine, 2-methylthio-N6-(cis-hydroxyisopentenyl) adenosine, N6-glycinylcarbamoyladenosine, N6-threonylcarbamoyladenosine, 2-methylthio-N6-threonyl carbamoyladenosine, N6,N6-dimethyladenosine, 7-methyladenine, 2-methylthio-adenine, and 2-methoxy-adenine.


In other embodiments, modified nucleosides include inosine, 1-methyl-inosine, wyosine, wybutosine, 7-deaza-guanosine, 7-deaza-8-aza-guanosine, 6-thio-guanosine, 6-thio-7-deaza-guanosine, 6-thio-7-deaza-8-aza-guanosine, 7-methyl-guanosine, 6-thio-7-methyl-guanosine, 7-methylinosine, 6-methoxy-guanosine, 1-methylguanosine, N2-methylguanosine, N2,N2-dimethylguanosine, 8-oxo-guanosine, 7-methyl-8-oxo-guanosine, 1-methyl-6-thio-guanosine, N2-methyl-6-thio-guanosine, and N2,N2-dimethyl-6-thio-guanosine.


In some embodiments, the nucleotide can be modified on the major groove face and can include replacing hydrogen on C-5 of uracil with a methyl group or a halo group. In specific embodiments, a modified nucleoside is 5′-O-(1-thiophosphate)-adenosine, 5′-O-(1-thiophosphate)-cytidine, 5′-O-(1-thiophosphate)-guanosine, 5′-O-(1-thiophosphate)-uridine or 5′-O-(1-thiophosphate)-pseudouridine.


In some embodiments, a “modified” mRNA comprises nucleoside modifications selected from 6-aza-cytidine, 2-thio-cytidine, α-thio-cytidine, Pseudo-iso-cytidine, 5-aminoallyl-uridine, 5-iodo-uridine, N1-methyl-pseudouridine, 5,6-dihydrouridine, α-thio-uridine, 4-thio-uridine, 6-aza-uridine, 5-hydroxy-uridine, deoxy-thymidine, 5-methyl-uridine, Pyrrolo-cytidine, inosine, α-thio-guanosine, 6-methyl-guanosine, 5-methyl-cytdine, 8-oxo-guanosine, 7-deaza-guanosine, N1-methyl-adenosine, 2-amino-6-Chloro-purine, N6-methyl-2-amino-purine, Pseudo-iso-cytidine, 6-Chloro-purine, N6-methyl-adenosine, α-thio-adenosine, 8-azido-adenosine, 7-deaza-adenosine.


In some embodiments, a “modified” mRNA does not comprise any of the chemical modifications as described herein. Said modified mRNA may comprise a lipid modification or a sequence modification as described below.


Lipid Modifications


According to further embodiments, a “modified” mRNA as defined herein contains at least one lipid modification.


Such a lipid-modified mRNA may comprise (i) an mRNA as defined herein, (ii) at least one linker covalently linked with said mRNA, and (iii) at least one lipid covalently linked with the respective linker.


Alternatively, the lipid-modified mRNA may comprise at least one mRNA as defined herein, and at least one (bifunctional) lipid covalently linked (without a linker) with said mRNA.


Alternatively, the lipid-modified mRNA may comprise (i) an mRNA as defined herein, (ii) at least one linker covalently linked with said mRNA, and (iii) at least one lipid covalently linked with the respective linker, and (iv) at least one (bifunctional) lipid covalently linked (without a linker) with said mRNA.


In this context, it is particularly preferred that the lipid modification is present at the terminal ends of a linear mRNA.


Sequence Modifications


According to preferred embodiments, the inventive mRNA comprises at least one sequence modification as described below.


G/C Content Modification:


According to preferred embodiments, the mRNA may be modified, and thus stabilized, by modifying the guanosine/cytosine (G/C) content of said mRNA, preferably of the at least one coding sequence of said mRNA. In other words, the inventive mRNA may be G/C modified.


A “G/C-modified” mRNA typically comprises an RNA sequence that is based on a modified wild type RNA sequence and comprises an altered number of guanosine and/or cytosine nucleotides as compared to said wild type RNA sequence. Such an altered number of G/C nucleotides may be generated by substituting codons containing adenosine or thymidine nucleotides by “synonymous” codons containing guanosine or cytosine nucleotides. Accordingly, the codon substitutions preferably do not alter the encoded amino acid residues, but exclusively alter the G/C content of the nucleic acid molecule.


In a particularly preferred embodiment of the present invention, the G/C content of the coding sequence of the mRNA is modified, particularly increased, compared to the G/C content of the coding sequence of the respective wild type (i.e. unmodified) mRNA. The amino acid sequence encoded by the inventive mRNA is preferably not modified as compared to the amino acid sequence encoded by the respective wild type mRNA.


Such modifications of the mRNA are based on the fact that the sequence of any mRNA region to be translated is important for efficient translation of said mRNA. Thus, the composition of the mRNA and the sequence of various nucleotides are important. In particular, sequences having an increased G (guanosine)/C (cytosine) content are more stable than sequences having an increased A (adenosine)/U (uracil) content.


According to the invention, the codons of the inventive mRNA may therefore be varied compared to the respective wild type mRNA, while retaining the translated amino acid sequence, such that they include an increased amount of G/C nucleotides.


In respect to the fact that several codons code for one and the same amino acid (so-called degeneration of the genetic code), the most favourable codons for the stability can be determined (so-called alternative codon usage). Depending on the amino acid to be encoded by the inventive mRNA, there are various possibilities for modification of the mRNA sequence, compared to its wild type sequence. In the case of amino acids, which are encoded by codons, which contain exclusively G or C nucleotides, no modification of the codon is necessary.


Thus, the codons for Pro (CCC or CCG), Arg (CGC or CGG), Ala (GCC or GCG) and Gly (GGC or GGG) require no modification, since no A or U is present. In contrast, codons which contain A and/or U nucleotides can be modified by substitution of other codons, which code for the same amino acids but contain no A and/or U. Examples of these are: the codons for Pro can be modified from CCU or CCA to CCC or CCG; the codons for Arg can be modified from CGU or CGA or AGA or AGG to CGC or CGG; the codons for Ala can be modified from GCU or GCA to GCC or GCG; the codons for Gly can be modified from GGU or GGA to GGC or GGG. In other cases, although A or U nucleotides cannot be eliminated from the codons, it is however possible to decrease the A and U content by using codons which contain a lower content of A and/or U nucleotides. Examples of these are: the codons for Phe can be modified from UUU to UUC; the codons for Leu can be modified from UUA, UUG, CUU or CUA to CUC or CUG; the codons for Ser can be modified from UCU or UCA or AGU to UCC, UCG or AGC; the codon for Tyr can be modified from UAU to UAC; the codon for Cys can be modified from UGU to UGC; the codon for His can be modified from CAU to CAC; the codon for Gln can be modified from CM to CAG; the codons for Ile can be modified from AUU or AUA to AUC; the codons for Thr can be modified from ACU or ACA to ACC or ACG; the codon for Asn can be modified from MU to MC; the codon for Lys can be modified from AAA to MG; the codons for Val can be modified from GUU or GUA to GUC or GUG; the codon for Asp can be modified from GAU to GAC; the codon for Glu can be modified from GM to GAG; the stop codon UM can be modified to UAG or UGA. In the case of the codons for Met (AUG) and Trp (UGG), on the other hand, there is no possibility of sequence modification. The substitutions listed above can be used either individually or in all possible combinations to increase the G/C content of the inventive mRNA compared to its particular wild type mRNA sequence (i.e. the original sequence). Thus, for example, all codons for Thr occurring in the wild type sequence can be modified to ACC (or ACG). Preferably, however, for example, combinations of the above substitution possibilities are used:


substitution of all codons coding for Thr in the original sequence (wild type RNA) to ACC (or ACG) and


substitution of all codons originally coding for Ser to UCC (or UCG or AGC); substitution of all codons coding for Ile in the original sequence to AUC and


substitution of all codons originally coding for Lys to MG and


substitution of all codons originally coding for Tyr to UAC; substitution of all codons coding for Val in the original sequence to GUC (or GUG) and


substitution of all codons originally coding for Glu to GAG and


substitution of all codons originally coding for Ala to GCC (or GCG) and


substitution of all codons originally coding for Arg to CGC (or CGG); substitution of all codons coding for Val in the original sequence to GUC (or GUG) and


substitution of all codons originally coding for Glu to GAG and


substitution of all codons originally coding for Ala to GCC (or GCG) and


substitution of all codons originally coding for Gly to GGC (or GGG) and


substitution of all codons originally coding for Asn to MC; substitution of all codons coding for Val in the original sequence to GUC (or GUG) and


substitution of all codons originally coding for Phe to UUC and


substitution of all codons originally coding for Cys to UGC and


substitution of all codons originally coding for Leu to CUG (or CUC) and


substitution of all codons originally coding for Gln to CAG and


substitution of all codons originally coding for Pro to CCC (or CCG); etc.


Preferably, the G/C content of the coding sequence of the inventive mRNA is increased by at least 7%, more preferably by at least 15%, particularly preferably by at least 20%, compared to the G/C content of the coding sequence of the wild type mRNA, which codes for an antigenic peptide or protein as defined herein or a variant or fragment thereof.


According to preferred embodiments, at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, more preferably at least 70%, even more preferably at least 80% and most preferably at least 90%, 95% or even 100% of the substitutable codons in the coding region (encoding an antigenic protein or peptide as defined herein) or the whole sequence of the wild type mRNA sequence are substituted, thereby increasing the G/C content of said mRNA sequence.


In this context, it is particularly preferable to increase the G/C content of the inventive mRNA, preferably of its at least one coding sequence, to the maximum (i.e. 100% of the substitutable codons) as compared to the wild type sequence.


A further preferred modification of the mRNA is based on the finding that the translation efficiency is also determined by a different frequency in the occurrence of tRNAs in cells. Thus, in case an mRNA comprises an increased number of so-called “rare codons”, the corresponding mRNA is translated to a significantly poorer degree than mRNAs comprising codons coding for relatively “frequent” tRNAs.


In some preferred embodiments, in modified mRNAs of the invention, the coding region encoding an antigenic peptide or protein is modified compared to the corresponding region of the wild type mRNA such that at least one codon of the wild type sequence, which codes for a tRNA which is relatively rare in the cell, is exchanged for a codon, which codes for a tRNA which is relatively frequent in the cell and carries the same amino acid as the relatively rare tRNA.


Thereby, the sequence of the mRNA is modified by inserting that codons which code for frequently occurring tRNAs. In other words, according to the invention, by this modification all codons of the wild type sequence, which code for a tRNA which is relatively rare in the cell, can in each case be exchanged for a codon, which codes for a tRNA which is relatively frequent in the cell and which, in each case, carries the same amino acid as the relatively rare tRNA. Which tRNAs occur relatively frequently in the cell and which, in contrast, occur relatively rarely is known to a person skilled in the art; cf. e.g. Akashi, Curr. Opin. Genet. Dev. 2001, 11(6): 660-666. The codons, which use for the particular amino acid the tRNA which occurs the most frequently, e.g. the Gly codon, which uses the tRNA, which occurs the most frequently in the (human) cell, are particularly preferred.


According to the invention, it is particularly preferable to link the sequential G/C content which is increased, in particular maximized, in the modified mRNA, with the “frequent” codons without modifying the encoded amino acid sequence, e.g. of antigenic protein or peptide encoded by the coding sequence of said mRNA. Such preferred embodiments are envisaged to allow the provision of a particularly efficiently translated and stabilized (modified) mRNA.


The determination of a modified mRNA as described above (increased G/C content; exchange of tRNAs) can be carried out using the computer program explained in WO02/098443, the disclosure content of which is included in its full scope in the present invention. Using this computer program, the nucleotide sequence of any desired nucleic acid, in particular mRNA, can be modified with the aid of the genetic code or the degenerative nature thereof such that a maximum G/C content results, in combination with the use of codons which code for tRNAs occurring as frequently as possible in the cell, the amino acid sequence coded by the modified nucleic acid, in particular mRNA, preferably not being modified compared to the non-modified sequence.


Alternatively, it is also possible to modify only the G/C content or only the codon usage compared to the original sequence. The source code in Visual Basic 6.0 (development environment used: Microsoft Visual Studio Enterprise 6.0 with Servicepack 3) is also described in WO02/098443.


In further preferred embodiments of the present invention, the A/U content in the environment of the ribosome binding site of the mRNA is increased compared to the A/U content in the environment of the ribosome binding site of its respective wild type mRNA.


This modification (an increased A/U content around the ribosome binding site) increases the efficiency of ribosome binding to said mRNA. An effective binding of the ribosomes to the ribosome binding site (Kozak sequence: SEQ ID NO: 3397; the AUG forms the start codon) in turn has the effect of an efficient translation of the inventive mRNA.


According to further embodiments of the present invention, the mRNA may be modified with respect to potentially destabilizing sequence elements. Particularly, the coding sequence and/or the 5′ and/or 3 untranslated region of said mRNA may be modified compared to the respective wild type mRNA such that it contains no destabilizing sequence elements, the encoded amino acid sequence of the modified mRNA preferably not being modified compared to its respective wild type mRNA.


It is known that, for example in sequences of eukaryotic RNAs, destabilizing sequence elements (DSE) occur, to which signal proteins bind and regulate enzymatic degradation of RNA in vivo. For further stabilization of the modified mRNA, optionally in coding region encoding the antigenic peptide or protein as defined herein, one or more such modifications compared to the corresponding region of the wild type mRNA can therefore be carried out, so that no or substantially no destabilizing sequence elements are contained there.


According to the invention, DSE present in the untranslated regions (3′- and/or 5′-UTR) can also be eliminated from the inventive mRNA by such modifications. Such destabilizing sequences are e.g. AU-rich sequences (AURES), which occur in 3′-UTR sections of numerous unstable RNAs (Caput et al., Proc. Natl. Acad. Sci. USA 1986, 83: 1670 to 1674). The inventive mRNA is therefore preferably modified compared to the respective wild type RNA such that said mRNA contains no such destabilizing sequences. This also applies to those sequence motifs which are recognized by possible endonucleases, e.g. the sequence GAACAAG, which is contained in the 3′-UTR segment of the gene encoding the transferrin receptor (Binder et al., EMBO J. 1994, 13: 1969 to 1980). These sequence motifs are also preferably removed from the inventive mRNA.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a glycoprotein precursor (GPC) derived from a lassa virus, wherein said coding region comprises or consists of a G/C optimized RNA sequence as defined by SEQ ID NOs: 751-936, 2251-2436, 2626-2811 or 3001-3186, or a variant or fragment of the sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a nucleoprotein (NP) derived from a lassa virus, wherein said coding region comprises or consists of a G/C optimized RNA sequence as defined by SEQ ID NOs: 937-1125, 2437-2625, 2812-3000 or 3187-3375 or a variant or fragment of the sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a zinc-binding matrix protein (Z) derived from a lassa virus, wherein said coding region comprises or consists of a G/C optimized RNA sequence as defined by SEQ ID NOs: 3757-3909, 4369-4521, 4522-4674 or 4675-4827 or a variant or fragment of the sequences.


Sequences Adapted to Human Codon Usage:


A further preferred modification of the inventive mRNA is based on the finding that codons encoding the same amino acid typically occur at different frequencies. According to further preferred embodiments, in the modified mRNA, the coding sequence is modified compared to the corresponding region of the respective wild type RNA (or said other wild type nucleic acid) such that the frequency of the codons encoding the same amino acid corresponds to the naturally occurring frequency of that codon according to the human codon usage as e.g. shown in Table 4.


For example, in the case of the amino acid alanine (Ala) present in an amino acid sequence encoded by the at least one coding sequence of the mRNA, the wild type coding sequence is preferably adapted in a way that the codon “GCC” is used with a frequency of 0.40, the codon “GCT” is used with a frequency of 0.28, the codon “GCA” is used with a frequency of 0.22 and the codon “GCG” is used with a frequency of 0.10 etc. (see Table 4).









TABLE 4







Human codon usage table














Amino



Amino





acid
codon
fraction
/1000
acid
codon
fraction
/1000

















Ala
GCG
0.10
7.4
Pro
CCG
0.11
6.9


Ala
GCA
0.22
15.8
Pro
CCA
0.27
16.9


Ala
GCT
0.28
18.5
Pro
CCT
0.29
17.5


Ala
GCC*
0.40
27.7
Pro
CCC*
0.33
19.8


Cys
TGT
0.42
10.6
Gln
CAG*
0.73
34.2


Cys
TGC*
0.58
12.6
Gln
CAA
0.27
12.3


Asp
GAT
0.44
21.8
Arg
AGG
0.22
12.0


Asp
GAC*
0.56
25.1
Arg
AGA*
0.21
12.1


Glu
GAG*
0.59
39.6
Arg
CGG
0.19
11.4


Glu
GAA
0.41
29.0
Arg
CGA
0.10
6.2


Phe
TTT
0.43
17.6
Arg
CGT
0.09
4.5


Phe
TTC*
0.57
20.3
Arg
CGC
0.19
10.4


Gly
GGG
0.23
16.5
Ser
AGT
0.14
12.1


Gly
GGA
0.26
16.5
Ser
AGC*
0.25
19.5


Gly
GGT
0.18
10.8
Ser
TCG
0.06
4.4


Gly
GGC*
0.33
22.2
Ser
TCA
0.15
12.2


His
CAT
0.41
10.9
Ser
TCT
0.18
15.2


His
CAC*
0.59
15.1
Ser
TCC
0.23
17.7


Ile
ATA
0.14
7.5
Thr
ACG
0.12
6.1


Ile
ATT
0.35
16.0
Thr
ACA
0.27
15.1


Ile
ATC*
0.52
20.8
Thr
ACT
0.23
13.1


Lys
AAG*
0.60
31.9
Thr
ACC*
0.38
18.9


Lys
AAA
0.40
24.4
Val
GTG*
0.48
28.1


Leu
TTG
0.12
12.9
Val
GTA
0.10
7.1


Leu
TTA
0.06
7.7
Val
GTT
0.17
11.0


Leu
CTG*
0.43
39.6
Val
GTC
0.25
14.5


Leu
CTA
0.07
7.2
Trp
TGG*
1
13.2


Leu
CTT
0.12
13.2
Tyr
TAT
0.42
12.2


Leu
CTC
0.20
19.6
Tyr
TAC*
0.58
15.3


Met
ATG*
1
22.0
Stop
TGA*
0.61
1.6


Asn
AAT
0.44
17.0
Stop
TAG
0.17
0.8


Asn
AAC*
0.56
19.1
Stop
TAA
0.22
1.0





*most frequent codon






Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a glycoprotein precursor (GPC) derived from a lassa virus, wherein said coding region comprises or consists of an RNA sequence adapted to human codon usage as defined by SEQ ID NOs: 1501-1686 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a nucleoprotein (NP) derived from a lassa virus, wherein said coding region comprises or consists of an RNA sequence adapted to human codon usage as defined by SEQ ID NOs: 1687-1875 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a zinc-binding matrix protein (Z) derived from a lassa virus, wherein said coding region comprises or consists of an RNA sequence adapted to human codon usage as defined by SEQ ID NOs: 4063-4215 or a variant or fragment of these sequences.


Codon-Optimized Sequences:


As described above it is preferred according to the invention, that all codons of the wild type sequence which code for a tRNA, which is relatively rare in the cell, are exchanged for a codon which codes for a tRNA, which is relatively frequent in the cell and which, in each case, carries the same amino acid as the relatively rare tRNA.


Therefore, it is particularly preferred that the most frequent codons are used for each encoded amino acid (see Table 4, most frequent codons are marked with asterisks). Such an optimization procedure increases the codon adaptation index (CAI) and ultimately maximises the CAI. In the context of the invention, sequences with increased or maximized CAI are typically referred to as “codon-optimized” sequences and/or CAI increased and/or maximized sequences.


According to preferred embodiments, the inventive mRNA comprises at least one coding sequence, wherein the coding sequence is codon-optimized as described herein. More preferably, the codon adaptation index (CAI) of the at least one coding sequence is at least 0.5, at least 0.8, at least 0.9 or at least 0.95. Most preferably, the codon adaptation index (CAI) of the at least one coding sequence is 1.


For example, in the case of the amino acid alanine (Ala) present in the amino acid sequence encoded by the at least one coding sequence of the mRNA, the wild type coding sequence is adapted in a way that the most frequent human codon “GCC” is always used for said amino acid, or for the amino acid Cysteine (Cys), the wild type sequence is adapted in a way that the most frequent human codon “TGC” is always used for said amino acid etc.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a glycoprotein precursor (GPC) derived from a lassa virus, wherein said coding region comprises or consists of a codon optimized RNA sequence as defined by SEQ ID NOs: 1876-2061 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a nucleoprotein (NP) derived from a lassa virus, wherein said coding region comprises or consists of codon optimized RNA sequence as defined by SEQ ID NOs: 2062-2250 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a zinc-binding matrix protein (Z) derived from a lassa virus, wherein said coding region comprises or consists of codon optimized RNA sequence as defined by SEQ ID NOs: 4216-4368 or a variant or fragment of these sequences.


C-Optimized Sequences:


According to preferred embodiments, the inventive mRNA is modified by modifying, preferably increasing, the cytosine (C) content of said mRNA, in particular in its at least one coding sequence.


In preferred embodiments, the C content of the coding sequence of the inventive mRNA is modified, preferably increased, compared to the C content of the coding sequence of the respective wild type (unmodified) RNA. The amino acid sequence encoded by the at least one coding sequence of the inventive mRNA is preferably not modified as compared to the amino acid sequence encoded by the respective wild type mRNA.


In preferred embodiments, said modified mRNA according to the invention is modified such that at least 10%, 20%, 30%, 40%, 50%, 60%, 70% or 80%, or at least 90% of the theoretically possible maximum cytosine-content or even a maximum cytosine-content is achieved.


In further preferred embodiments, at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or even 100% of the codons of the wild type mRNA sequence, which are “cytosine content optimizable” are replaced by codons having a higher cytosine-content than the ones present in the wild type sequence.


In further preferred embodiments, some of the codons of the wild type coding sequence may additionally be modified such that a codon for a relatively rare tRNA in the cell is exchanged by a codon for a relatively frequent tRNA in the cell, provided that the substituted codon for a relatively frequent tRNA carries the same amino acid as the relatively rare tRNA of the original wild type codon. Preferably, all of the codons for a relatively rare tRNA are replaced by a codon for a relatively frequent tRNA in the cell, except codons encoding amino acids, which are exclusively encoded by codons not containing any cytosine, or except for glutamine (Gln), which is encoded by two codons each containing the same number of cytosines.


In further preferred embodiments of the present invention, the modified mRNA is modified such that at least 80%, or at least 90% of the theoretically possible maximum cytosine-content or even a maximum cytosine-content is achieved by means of codons, which code for relatively frequent tRNAs in the cell, wherein the amino acid sequence remains unchanged.


Due to the naturally occurring degeneracy of the genetic code, more than one codon may encode a particular amino acid. Accordingly, 18 out of 20 naturally occurring amino acids are encoded by more than one codon (with Tryp and Met being an exception), e.g. by 2 codons (e.g. Cys, Asp, Glu), by three codons (e.g. Ile), by 4 codons (e.g. Al, Gly, Pro) or by 6 codons (e.g. Leu, Arg, Ser). However, not all codons encoding the same amino acid are utilized with the same frequency under in vivo conditions. Depending on each single organism, a typical codon usage profile is established.


The term “cytosine content-optimizable codon” as used within the context of the present invention refers to codons, which exhibit a lower content of cytosines than other codons encoding the same amino acid. Accordingly, any wild type codon, which may be replaced by another codon encoding the same amino acid and exhibiting a higher number of cytosines within that codon, is considered to be cytosine-optimizable (C-optimizable). Any such substitution of a C-optimizable wild type codon by the specific C-optimized codon within a wild type coding sequence increases its overall C-content and reflects a C-enriched modified RNA sequence.


According to some preferred embodiments, the inventive mRNA, and in particular its at least one coding sequence, comprises or consists of a C-maximized sequence containing C-optimized codons for all potentially C-optimizable codons. Accordingly, 100% or all of the theoretically replaceable C-optimizable codons are preferably replaced by C-optimized codons over the entire length of the coding sequence.


In this context, cytosine-content optimizable codons are codons, which contain a lower number of cytosines than other codons coding for the same amino acid.


Any of the codons GCG, GCA, GCU codes for the amino acid Ala, which may be exchanged by the codon GCC encoding the same amino acid, and/or


the codon UGU that codes for Cys may be exchanged by the codon UGC encoding the same amino acid, and/or


the codon GAU which codes for Asp may be exchanged by the codon GAC encoding the same amino acid, and/or


the codon that UUU that codes for Phe may be exchanged for the codon UUC encoding the same amino acid, and/or


any of the codons GGG, GGA, GGU that code Gly may be exchanged by the codon GGC encoding the same amino acid, and/or


the codon CAU that codes for His may be exchanged by the codon CAC encoding the same amino acid, and/or any of the codons AUA, AUU that code for Ile may be exchanged by the codon AUC, and/or


any of the codons UUG, UUA, CUG, CUA, CUU coding for Leu may be exchanged by the codon CUC encoding the same amino acid, and/or


the codon MU that codes for Asn may be exchanged by the codon MC encoding the same amino acid, and/or


any of the codons CCG, CCA, CCU coding for Pro may be exchanged by the codon CCC encoding the same amino acid, and/or


any of the codons AGG, AGA, CGG, CGA, CGU coding for Arg may be exchanged by the codon CGC encoding the same amino acid, and/or


any of the codons AGU, AGC, UCG, UCA, UCU coding for Ser may be exchanged by the codon UCC encoding the same amino acid, and/or


any of the codons ACG, ACA, ACU coding for Thr may be exchanged by the codon ACC encoding the same amino acid, and/or


any of the codons GUG, GUA, GUU coding for Val may be exchanged by the codon GUC encoding the same amino acid, and/or


the codon UAU coding for Tyr may be exchanged by the codon UAC encoding the same amino acid.


In any of the above instances, the number of cytosines is increased by 1 per exchanged codon. Exchange of all non C-optimized codons (corresponding to C-optimizable codons) of the coding sequence results in a C-maximized coding sequence. In the context of the invention, at least 70%, preferably at least 80%, more preferably at least 90%, of the non C-optimized codons within the at least one coding sequence of the inventive mRNA are replaced by C-optimized codons.


It may be preferred that for some amino acids the percentage of C-optimizable codons replaced by C-optimized codons is less than 70%, while for other amino acids the percentage of replaced codons is higher than 70% to meet the overall percentage of C-optimization of at least 70% of all C-optimizable wild type codons of the coding sequence.


Preferably, in a C-optimized mRNA according to the invention, at least 50% of the C-optimizable wild type codons for any given amino acid are replaced by C-optimized codons, e.g. any modified C-enriched mRNA preferably contains at least 50% C-optimized codons at C-optimizable wild type codon positions encoding any one of the above mentioned amino acids Ala, Cys, Asp, Phe, Gly, His, Ile, Leu, Asn, Pro, Arg, Ser, Thr, Val and Tyr, preferably at least 60%.


In this context codons encoding amino acids, which are not cytosine content-optimizable and which are, however, encoded by at least two codons, may be used without any further selection process. However, the codon of the wild type sequence that codes for a relatively rare tRNA in the cell, e.g. a human cell, may be exchanged for a codon that codes for a relatively frequent tRNA in the cell, wherein both code for the same amino acid. Accordingly, the relatively rare codon GM coding for Glu may be exchanged by the relative frequent codon GAG coding for the same amino acid, and/or


the relatively rare codon AAA coding for Lys may be exchanged by the relative frequent codon MG coding for the same amino acid, and/or


the relatively rare codon CAA coding for Gln may be exchanged for the relative frequent codon CAG encoding the same amino acid.


In this context, the amino acids Met (AUG) and Trp (UGG), which are encoded by only one codon each, remain unchanged. Stop codons are not cytosine-content optimized, however, the relatively rare stop codons amber, ochre (UM, UAG) may be exchanged by the relatively frequent stop codon opal (UGA).


The single substitutions listed above may be used individually as well as in all possible combinations in order to optimize the cytosine-content of the modified mRNA according to the invention compared to the respective wild type mRNA.


Accordingly, the coding sequence as defined herein may be changed compared to the coding sequence of the respective wild type RNA in such a way that an amino acid encoded by at least two or more codons, of which one comprises one additional cytosine, such a codon may be exchanged by the C-optimized codon comprising one additional cytosine, wherein the amino acid is preferably unaltered compared to the wild type sequence.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a glycoprotein precursor (GPC) derived from a lassa virus, wherein said coding region comprises or consists of a C-optimized RNA sequence as defined by SEQ ID NOs: 1126-1311 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a nucleoprotein (NP) derived from a lassa virus, wherein said coding region comprises or consists of C-optimized RNA sequence as defined by SEQ ID NOs: 1312-1500 or a variant or fragment of these sequences.


Accordingly, in preferred embodiments, inventive mRNAs comprise at least one coding region encoding a zinc-binding matrix protein (Z) derived from a lassa virus, wherein said coding region comprises or consists of C-optimized RNA sequence as defined by SEQ ID NOs: 3910-4062 or a variant or fragment of these sequences.


According to particularly preferred embodiments, the (modified) mRNA according to the invention comprises at least one coding sequence as defined herein, wherein (a) the G/C content of the at least one coding sequence of said mRNA is increased compared to the G/C content of the corresponding coding sequence of the corresponding wild type mRNA, and/or (b) wherein the C content of the at least one coding sequence of said mRNA is increased compared to the C content of the corresponding coding sequence of the corresponding wild type mRNA, and/or (c) wherein the codons in the at least one coding sequence of said mRNA are adapted to human codon usage, wherein the codon adaptation index (CAI) is preferably increased or maximized in the at least one coding sequence of said mRNA, and wherein the amino acid sequence encoded by said mRNA is preferably not being modified compared to the amino acid sequence encoded by the corresponding wild type RNA.


5′-Cap


According to further preferred embodiments, the inventive mRNA comprises a “5′-cap” structure, which preferably stabilizes said mRNA.


A 5′-cap is an entity, typically a modified nucleotide entity, which generally “caps” the 5′-end of a mature mRNA. A 5′-cap may typically be formed by a modified nucleotide, particularly by a derivative of a guanine nucleotide. Preferably, the 5′-cap is linked to the 5′-terminus via a 5′-5′-triphosphate linkage. A 5′-cap may be methylated, e.g. m7GpppN, wherein N is the terminal 5′ nucleotide of the nucleic acid carrying the 5′-cap, typically the 5′-end of an mRNA. m7GpppN is the 5′-cap structure, which naturally occurs in mRNA transcribed by polymerase II and is therefore preferably not considered as modification comprised in a modified mRNA in this context. Accordingly, a “modified” mRNA may comprise an m7GpppN as 5′-cap, but additionally the modified RNA typically comprises at least one further modification as defined herein.


Further examples of 5′-cap structures include glyceryl, inverted deoxy abasic residue (moiety), 4′,5′ methylene nucleotide, 1-(beta-D-erythrofuranosyl) nucleotide, 4′-thio nucleotide, carbocyclic nucleotide, 1,5-anhydrohexitol nucleotide, L-nucleotides, alpha-nucleotide, modified base nucleotide, threo-pentofuranosyl nucleotide, acyclic 3′,4′-seco nucleotide, acyclic 3,4-dihydroxybutyl nucleotide, acyclic 3,5 dihydroxypentyl nucleotide, 3′-3′-inverted nucleotide moiety, 3′-3′-inverted abasic moiety, 3′-2′-inverted nucleotide moiety, 3′-2′-inverted abasic moiety, 1,4-butanediol phosphate, 3′-phosphoramidate, hexylphosphate, aminohexyl phosphate, 3′-phosphate, 3′ phosphorothioate, phosphorodithioate, or bridging or non-bridging methylphosphonate moiety. These modified 5′-cap structures are regarded as at least one modification in this context.


Particularly preferred modified 5′-cap structures are cap1 (methylation of the ribose of the adjacent nucleotide of m7G), cap2 (additional methylation of the ribose of the 2nd nucleotide downstream of the m7G), cap3 (additional methylation of the ribose of the 3rd nucleotide downstream of the m7G), cap4 (methylation of the ribose of the 4th nucleotide downstream of the m7G), ARCA (anti-reverse cap analogue, modified ARCA (e.g. phosphothioate modified ARCA), inosine, N1-methyl-guanosine, 2′-fluoro-guanosine, 7-deaza-guanosine, 8-oxo-guanosine, 2-amino-guanosine, LNA-guanosine, and 2-azido-guanosine.


According to preferred embodiments, the inventive mRNA comprises a 5′-cap structure and/or at least one 3′-untranslated region element (3′-UTR element), preferably as defined herein. More preferably, said mRNA further comprises a 5′-UTR element as defined herein.


Poly(A)


According to further preferred embodiments, the inventive mRNA contains a poly-A tail. A poly(A) sequence, also called poly(A) tail or 3′-poly(A) tail, is typically understood to be a sequence of adenosine nucleotides, e.g., of up to about 400 adenosine nucleotides, e.g. from about 20 to about 400, preferably from about 50 to about 400, more preferably from about 50 to about 300, even more preferably from about 50 to about 250, most preferably from about 60 to about 250 adenosine nucleotides. As used herein, a poly(A) sequence may also comprise about 10 to 200 adenosine nucleotides, preferably about 10 to 100 adenosine nucleotides, more preferably about 40 to 80 adenosine nucleotides or even more preferably about 50 to 70 adenosine nucleotides. A poly(A) sequence is typically located at the 3′-end of an mRNA.


Accordingly, in further preferred embodiments, the inventive mRNA comprises at its 3′-terminus a poly(A) tail of typically about 10 to 200 adenosine nucleotides, preferably about 10 to 100 adenosine nucleotides, more preferably about 40 to 80 adenosine nucleotides or even more preferably about 50 to 70 adenosine nucleotides.


Preferably, the poly(A) sequence in the mRNA is derived from a DNA template by RNA in vitro transcription. Alternatively, the poly(A) sequence may also be obtained in vitro by common methods of chemical-synthesis without being necessarily transcribed from a DNA-progenitor. Moreover, poly(A) sequences, or poly(A) tails may be generated by enzymatic polyadenylation of the mRNA using commercially available polyadenylation kits and corresponding protocols known in the art.


Polyadenylation is typically understood to be the addition of a poly(A) sequence to a nucleic acid molecule, such as an RNA molecule, e.g. to a premature mRNA. Polyadenylation may be induced by a so-called polyadenylation signal. This signal is preferably located within a stretch of nucleotides at the 3′-end of the mRNA to be polyadenylated. A polyadenylation signal typically comprises a hexamer consisting of adenine and uracil/thymine nucleotides, preferably the hexamer sequence AAUAAA. Other sequences, preferably hexamer sequences, are also conceivable. Polyadenylation typically occurs during processing of a pre-mRNA (also called premature-mRNA). Typically, RNA maturation (from pre-mRNA to mature mRNA) comprises a step of polyadenylation.


Alternatively, the inventive mRNA optionally comprises a polyadenylation signal. Accordingly, the inventive mRNA may comprise a polyadenylation signal which conveys polyadenylation to a (transcribed) RNA by specific protein factors (e.g. cleavage and polyadenylation specificity factor (CPSF), cleavage stimulation factor (CstF), cleavage factors I and II (CF I and CF II), poly(A) polymerase (PAP)).


In this context, a consensus polyadenylation signal is preferred comprising the NN(U/T)ANA consensus sequence. In a particularly preferred aspect, the polyadenylation signal comprises one of the following sequences: AA(U/T)AAA or A(U/T)(U/T)AAA (wherein uridine is usually present in RNA and thymidine is usually present in DNA).


Poly(C)


According to further preferred embodiments, the inventive mRNA contains a poly(C) tail on the 3′-terminus of typically about 10 to 200 cytosine nucleotides, preferably about 10 to 100 cytosine nucleotides, more preferably about 20 to 70 cytosine nucleotides or even more preferably about 20 to 60 or even 10 to 40 cytosine nucleotides.


UTRs


According to preferred embodiments, the inventive mRNA comprises at least one 5′- or 3′-UTR element. In this context, an UTR element comprises or consists of a nucleic acid sequence, which is derived from the 5′- or 3′-UTR of any naturally occurring gene or which is derived from a fragment, a homolog or a variant of the 5′- or 3′-UTR of a gene. Preferably, the 5′- or 3′-UTR element used according to the present invention is heterologous to the at least one coding sequence of the mRNA. Even if 5′- or 3′-UTR elements derived from naturally occurring genes are preferred, also synthetically engineered UTR elements may be used in the context of the present invention.


3′-UTR


According to preferred embodiments, the inventive mRNA further comprises at least one 3′-UTR element.


The term “3′-UTR element” typically refers to a nucleic acid sequence, which comprises or consists of a nucleic acid sequence that is derived from a 3′-UTR or from a variant of a 3′-UTR.


Generally, the term “3′-UTR” refers to a part of a nucleic acid molecule, which is located 3′ (i.e. “downstream”) of an open reading frame and which is not translated into protein. In the context of the present invention, a 3′-UTR corresponds to the sequence of a mature mRNA, which is located between the stop codon of the protein coding sequence, preferably immediately 3′ to the stop codon of the protein coding sequence, and the poly(A) sequence of the mRNA. The term “corresponds to” means that the 3′-UTR sequence may be an RNA sequence, such as in the mRNA sequence used for defining the 3′-UTR sequence, or a DNA sequence, which corresponds to such RNA sequence. In the context of the present invention, the term “a 3′-UTR of a gene”, such as “a 3′-UTR of a ribosomal protein gene”, is the sequence, which corresponds to the 3′-UTR of the mature mRNA derived from this gene, i.e. the mRNA obtained by transcription of the gene and maturation of the pre-mature mRNA. The term “3′-UTR of a gene” encompasses the DNA sequence and the RNA sequence (both sense and antisense strand and both mature and immature) of the 3′-UTR.


A 3′-UTR element in the sense of the present invention may represent the 3′-UTR of an RNA, preferably an mRNA. Thus, in the sense of the present invention, preferably, a 3′-UTR element may be the 3′-UTR of an RNA, preferably of an mRNA, or it may be the transcription template for a 3′-UTR of an RNA. Thus, a 3′-UTR element preferably is a nucleic acid sequence which corresponds to the 3′-UTR of an RNA, preferably to the 3′-UTR of an mRNA, such as an mRNA obtained by transcription of a genetically engineered vector construct. Preferably, the 3′-UTR element fulfils the function of a 3′-UTR or encodes a sequence which fulfils the function of a 3′-UTR.


Preferably, the at least one 3′-UTR element of the inventive mRNA comprises or consists of a nucleic acid sequence derived from the 3′-UTR of a chordate gene, preferably a vertebrate gene, more preferably a mammalian gene, most preferably a human gene, or from a variant of the 3′-UTR of a chordate gene, preferably a vertebrate gene, more preferably a mammalian gene, most preferably a human gene.


Preferably, the inventive mRNA comprises a 3′-UTR element, which may be derivable from a gene that relates to an mRNA with an enhanced half-life (that provides a stable mRNA), for example a 3′-UTR element as defined and described below. Preferably, the 3′-UTR element comprises or consists of a nucleic acid sequence derived from a 3′-UTR of a gene, which preferably encodes a stable mRNA, or from a homolog, a fragment or a variant of said gene.


In particularly preferred embodiments, the 3′-UTR element comprises or consists of a nucleic acid sequence, which is derived from a 3′-UTR of a gene selected from the group consisting of an albumin gene, an alpha-globin gene, a beta-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen alpha gene, such as a collagen alpha 1(I) gene, or from a variant of a 3′-UTR of a gene selected from the group consisting of an albumin gene, an alpha-globin gene, a beta-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen alpha gene, such as a collagen alpha 1(I) gene according to SEQ ID NO: 1369-1390 of the patent application WO2013/143700, whose disclosure is incorporated herein by reference, or from a homolog, a fragment or a variant thereof.


The term “a nucleic acid sequence which is derived from the 3′-UTR of a [ . . . ] gene” preferably refers to a nucleic acid sequence which is based on the 3′-UTR sequence of a [ . . . ] gene or on a part thereof, such as on the 3′-UTR of an albumin gene, an alpha-globin gene, a beta-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, or a collagen alpha gene, such as a collagen alpha 1(I) gene, preferably of an albumin gene or on a part thereof. This term includes sequences corresponding to the entire 3′-UTR sequence, i.e. the full length 3′-UTR sequence of a gene, and sequences corresponding to a fragment of the 3′-UTR sequence of a gene, such as an albumin gene, alpha-globin gene, beta-globin gene, tyrosine hydroxylase gene, lipoxygenase gene, or collagen alpha gene, such as a collagen alpha 1(I) gene, preferably of an albumin gene.


The term “a nucleic acid sequence which is derived from a variant of the 3′-UTR of a [ . . . ] gene” preferably refers to a nucleic acid sequence, which is based on a variant of the 3′-UTR sequence of a gene, such as on a variant of the 3′-UTR of an albumin gene, an alpha-globin gene, a beta-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, or a collagen alpha gene, such as a collagen alpha 1(I) gene, or on a part thereof as described above. This term includes sequences corresponding to the entire sequence of the variant of the 3′-UTR of a gene, i.e. the full length variant 3′-UTR sequence of a gene, and sequences corresponding to a fragment of the variant 3′-UTR sequence of a gene. A “fragment” in this context preferably consists of a continuous stretch of nucleotides corresponding to a continuous stretch of nucleotides in the full-length variant 3′-UTR, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, even more preferably at least 80%, and most preferably at least 90% of the full-length variant 3′-UTR. Such a fragment of a variant, in the sense of the present invention, is preferably a functional fragment of a variant as described herein.


Albumin-Derived 3′-UTRs:


In a particularly preferred embodiment, the 3′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of an albumin gene, preferably a vertebrate albumin gene, more preferably a mammalian albumin gene, most preferably a human albumin gene according to SEQ ID NO: 3388 or the corresponding RNA sequence SEQ ID NO: 3389.


Human Albumin 3′-UTR SEQ ID NO: 3388:









CATCACATTTAAAAGCATCTCAGCCTACCATGAGAATAAGAGAAAGAAAA





TGAAGATCAAAAGCTTATTCATCTGTTTTTCTTTTTCGTTGGTGTAAAG





CCAACACCCTGTCTAAAAAACATAAATTTCTTTAATCATTTTGCCTCT





TTTCTCTGTGCTTCAATTAATAAAAAATGGAAAGAATCT





(corresponding to SEQ ID NO: 1369 of the patent





application WO2013/143700).






In this context it is particularly preferred that the mRNA comprises a 3′-UTR element comprising a corresponding RNA sequence derived from the nucleic acids according to SEQ ID NO: 1369-1390 of the patent application WO2013/143700 or a fragment, homolog or variant thereof.


Most preferably the 3′-UTR element comprises the nucleic acid sequence derived from a fragment of the human albumin gene according to SEQ ID NO: 3390 or 3392:


Albumin7 3′ UTR:









CATCACATTTAAAAGCATCTCAGCCTACCATGAGAATAAGAGAAAGAAA





ATGAAGATCAATAGCTTATTCATCTCTTTTTCTTTTCGTTGGTGTAAAG





CCAACACCCTGTCTAAAAAACATAAATTTCTTTAATCATTTTGCCTC





TTTTCTCTGTGCTTCAATTAATAAAAAATGGAAAGAACCT (SEQ ID





NO: 3390 corresponding to SEQ ID NO: 1376 of





the patent application WO2013/143700)






In this context, it is particularly preferred that the 3′-UTR element of the inventive mRNA comprises or consists of a corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3390 or 3392 as shown in SEQ ID NO: 3391 or 3393.


Globin-Derived 3′-UTRs


In another particularly preferred embodiment, the 3′-UTR element of the inventive mRNA comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of an alpha-globin gene, preferably a vertebrate alpha- or beta-globin gene, more preferably a mammalian alpha- or beta-globin gene, most preferably a human alpha- or beta-globin gene according to SEQ ID NO: 3380, 3382 or 3384 or the corresponding RNA sequences SEQ ID NO: 3381, 3383 or 3385:


3′-UTR of Homo sapiens Hemoglobin, Alpha 1 (HBA1):









GCTGGAGCCTCGGTGGCCATGCTTCTTGCCCCTTGGGCCTCCCCCCAGCC





CCTCCTCCCCTTCCTGCACCCGTACCCCCGTGGTCTTTGAATAAAGTCTG





AGTGGGCGGC (SEQ ID NO: 3380 corresponding to SEQ





ID NO: 1370 of the patent application





WO2013/143700)






3′-UTR of Homo sapiens Hemoglobin, Alpha 2 (HBA2):









GCTGGAGCCTCGGTAGCCGTTCCTCCTGCCCGCTGGGCCTCCCAAC





GGGCCCTCCTCCCCTCCTTGCACCGGCCCTTCCTGGTCTTTGAATA





AAGTCTGAGTGGGCAG (SEQ ID NO: 3382 corresponding





to SEQ ID NO: 1371 of the patent application





WO2013/143700)






3′-UTR of Homo sapiens Hemoglobin, Beta (HBB):









GCTCGCTTTCTTGCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAA





GTCCAACTACTAAACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGAT





TCTGCCTAATAAAAAACATTTATTTTCATTGC (SEQ ID NO: 3384





corresponding to SEQ ID NO: 1372 of the patent





application WO2013/143700)






For example, the 3′-UTR element may comprise or consist of the centre, alpha-complex-binding portion of the 3′-UTR of an alpha-globin gene, such as of a human alpha-globin gene, or a homolog, a fragment, or a variant of an alpha-globin gene, preferably according to SEQ ID NO: 3386:


Centre, Alpha-Complex-Binding Portion of the 3′-UTR of an Alpha-Globin Gene (Also Named Herein as “Muag”):









GCCCGATGGGCCTCCCAACGGGCCCTCCTCCCCTCCTTGCACCG (SEQ





ID NO: 3386 corresponding to SEQ ID NO: 1393 of





the patent application WO2013/143700).






In this context it is particularly preferred that the 3′-UTR element of the inventive mRNA comprises or consists of a corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386 as shown in SEQ ID NO: 3387, or a homolog, a variant or fragment thereof.


5′-UTR


In preferred embodiments, the inventive mRNA further comprises a 5′-UTR.


A 5′-UTR is typically understood to be a particular section of messenger RNA (mRNA). It is located 5′ of the open reading frame of the mRNA. Typically, the 5′-UTR starts with the transcriptional start site and ends one nucleotide before the start codon of the open reading frame. The 5′-UTR may comprise elements for controlling gene expression, also called regulatory elements. Such regulatory elements may be, for example, ribosomal binding sites. The 5′-UTR may be post-transcriptionally modified, for example by addition of a 5′-cap. In the context of the present invention, a 5′-UTR corresponds to the sequence of a mature mRNA, which is located between the 5′-cap and the start codon. Preferably, the 5′-UTR corresponds to the sequence, which extends from a nucleotide located 3′ to the 5′-cap, preferably from the nucleotide located immediately 3′ to the 5′-cap, to a nucleotide located 5′ to the start codon of the protein coding sequence, preferably to the nucleotide located immediately 5′ to the start codon of the protein coding sequence. The nucleotide located immediately 3′ to the 5′-cap of a mature mRNA typically corresponds to the transcriptional start site. The term “corresponds to” means that the 5′-UTR sequence may be an RNA sequence, such as in the mRNA sequence used for defining the 5′-UTR sequence, or a DNA sequence, which corresponds to such RNA sequence. In the context of the present invention, the term “a 5′-UTR of a gene” is the sequence, which corresponds to the 5′-UTR of the mature mRNA derived from this gene, i.e. the mRNA obtained by transcription of the gene and maturation of the pre-mature mRNA. The term “5′-UTR of a gene” encompasses the DNA sequence and the RNA sequence of the 5′-UTR. By the inventive embodiments such a 5′-UTR may be provided 5′-terminal to the coding sequence. Its length is typically less than 500, 400, 300, 250 or less than 200 nucleotides. In other embodiments its length may be in the range of at least 10, 20, 30 or 40, preferably up to 100 or 150, nucleotides.


According to particularly preferred embodiments, the inventive mRNA comprises at least one 5′-untranslated region element (5′-UTR element) which comprises or consists of a nucleic acid sequence, which is derived from the 5′-UTR of a TOP gene or which is derived from a fragment, homolog or variant of the 5′-UTR of a TOP gene.


The 5′ terminal oligopyrimidine tract (TOP) is typically a stretch of pyrimidine nucleotides located in the 5′ terminal region of a nucleic acid molecule, such as the 5′ terminal region of certain mRNA molecules or the 5′ terminal region of a functional entity, e.g. the transcribed region, of certain genes. The sequence starts with a cytidine, which usually corresponds to the transcriptional start site, and is followed by a stretch of usually about 3 to 30 pyrimidine nucleotides. For example, the TOP may comprise 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or even more nucleotides. The pyrimidine stretch and thus the 5′ TOP ends one nucleotide 5′ to the first purine nucleotide located downstream of the TOP. An mRNA that contains a 5′ terminal oligopyrimidine tract is often referred to as TOP mRNA. Accordingly, genes that provide such messenger RNAs are referred to as TOP genes. TOP sequences have, for example, been found in genes and mRNAs encoding peptide elongation factors and ribosomal proteins.


TOP genes are typically characterized by the presence of a 5′ terminal oligopyrimidine tract (TOP). Furthermore, most TOP genes are characterized by a growth-associated translational regulation. However, also TOP genes with a tissue specific translational regulation are known. As defined above, the 5′-UTR of a TOP gene corresponds to the sequence of a 5′-UTR of a mature mRNA derived from a TOP gene, which preferably extends from the nucleotide located 3′ to the 5′-cap to the nucleotide located 5′ to the start codon. A 5′-UTR of a TOP gene typically does not comprise any start codons, preferably no upstream AUGs (uAUGs) or upstream open reading frames (uORFs). Therein, upstream AUGs and upstream open reading frames are typically understood to be AUGs and open reading frames that occur 5′ of the start codon (AUG) of the open reading frame that should be translated. The 5′-UTRs of TOP genes are generally rather short. The lengths of 5′-UTRs of TOP genes may vary between 20 nucleotides up to 500 nucleotides, and are typically less than about 200 nucleotides, preferably less than about 150 nucleotides, more preferably less than about 100 nucleotides. Exemplary 5′-UTRs of TOP genes in the sense of the present invention are the nucleic acid sequences extending from the nucleotide at position 5 to the nucleotide located immediately 5′ to the start codon (e.g. the ATG) in the sequences according to SEQ ID Nos. 1-1363 of the patent application WO2013/143700, whose disclosure is incorporated herewith by reference. In this context, a particularly preferred fragment of a 5′-UTR of a TOP gene is a 5′-UTR of a TOP gene lacking the 5′ TOP motif. The terms “5′-UTR of a TOP gene” or “5′-TOP UTR” preferably refer to the 5′-UTR of a naturally occurring TOP gene.


In the context of the present invention, a “TOP motif” is a nucleic acid sequence which corresponds to a 5′ TOP as defined above. Thus, a TOP motif in the context of the present invention is preferably a stretch of pyrimidine nucleotides having a length of 3-30 nucleotides. Preferably, the TOP-motif consists of at least 3 pyrimidine nucleotides, preferably at least 4 pyrimidine nucleotides, preferably at least 5 pyrimidine nucleotides, more preferably at least 6 nucleotides, more preferably at least 7 nucleotides, most preferably at least 8 pyrimidine nucleotides, wherein the stretch of pyrimidine nucleotides preferably starts at its 5′-end with a cytosine nucleotide. In TOP genes and TOP mRNAs, the TOP-motif preferably starts at its 5′-end with the transcriptional start site and ends one nucleotide 5′ to the first purin residue in said gene or mRNA. A TOP motif in the sense of the present invention is preferably located at the 5′-end of a sequence, which represents a 5′-UTR, or at the 5′-end of a sequence, which codes for a 5′-UTR. Thus, preferably, a stretch of 3 or more pyrimidine nucleotides is called “TOP motif” in the sense of the present invention if this stretch is located at the 5′-end of a respective sequence, such as the artificial nucleic acid molecule, the 5′-UTR element of the artificial nucleic acid molecule, or the nucleic acid sequence which is derived from the 5′-UTR of a TOP gene as described herein. In other words, a stretch of 3 or more pyrimidine nucleotides, which is not located at the 5′-end of a 5′-UTR or a 5′-UTR element but anywhere within a 5′-UTR or a 5′-UTR element, is preferably not referred to as “TOP motif”.


In particularly preferred embodiments, the 5′-UTR element of the inventive mRNA does not comprise a TOP-motif or a 5′ TOP, as defined above.


In some embodiments, the nucleic acid sequence of the 5′-UTR element, which is derived from a 5′-UTR of a TOP gene, terminates at its 3′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 upstream of the start codon (e.g. A(U/T)G) of the gene or mRNA it is derived from. Thus, the 5′-UTR element does not comprise any part of the protein coding sequence. Thus, preferably, the only amino acid coding part of the inventive mRNA is provided by the coding sequence.


The nucleic acid sequence derived from the 5′-UTR of a TOP gene is preferably derived from a eukaryotic TOP gene, preferably a plant or animal TOP gene, more preferably a chordate TOP gene, even more preferably a vertebrate TOP gene, most preferably a mammalian TOP gene, such as a human TOP gene.


For example, the 5′-UTR element is preferably selected from 5′-UTR elements comprising or consisting of a nucleic acid sequence, which is derived from a nucleic acid sequence selected from the group consisting of SEQ ID Nos. 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, whose disclosure is incorporated herein by reference, from the homologs of SEQ ID Nos. 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from a variant thereof, or preferably from a corresponding RNA sequence. The term “homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700” refers to sequences of other species than Homo sapiens, which are homologous to the sequences according to SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700.


In preferred embodiments, the 5′-UTR element of the inventive mRNA comprises or consists of a nucleic acid sequence, which is derived from a nucleic acid sequence extending from nucleotide position 5 (i.e. the nucleotide that is located at position 5 in the sequence) to the nucleotide position immediately 5′ to the start codon (located at the 3′-end of the sequences), e.g. the nucleotide position immediately 5′ to the ATG sequence, of a nucleic acid sequence selected from SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from the homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700 from a variant thereof, or a corresponding RNA sequence. It is particularly preferred that the 5′-UTR element is derived from a nucleic acid sequence extending from the nucleotide position immediately 3′ to the 5′ TOP to the nucleotide position immediately 5′ to the start codon (located at the 3′-end of the sequences), e.g. the nucleotide position immediately 5′ to the ATG sequence, of a nucleic acid sequence selected from SEQ ID Nos. 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from the homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from a variant thereof, or a corresponding RNA sequence.


In particularly preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence, which is derived from a 5′-UTR of a TOP gene encoding a ribosomal protein or from a variant of a 5′-UTR of a TOP gene encoding a ribosomal protein. For example, the 5′-UTR element comprises or consists of a nucleic acid sequence, which is derived from a 5′-UTR of a nucleic acid sequence according to any of SEQ ID NOs: 67, 170, 193, 244, 259, 554, 650, 675, 700, 721, 913, 1016, 1063, 1120, 1138, and 1284-1360 of the patent application WO2013/143700, a corresponding RNA sequence, a homolog thereof, or a variant thereof as described herein, preferably lacking the 5′ TOP motif. As described above, the sequence extending from position 5 to the nucleotide immediately 5′ to the ATG (which is located at the 3′-end of the sequences) corresponds to the 5′-UTR of said sequences.


In some embodiments, the inventive mRNA thus comprises a 5′-UTR element, which comprises or consists of a nucleic acid sequence, which is derived from the 5′-UTR of a vertebrate TOP gene, such as a mammalian, e.g. a human TOP gene, selected from RPSA, RPS2, RPS3, RPS3A, RPS4, RPSS, RPS6, RPS7, RPS8, RPS9, RPS10, RPS11, RPS12, RPS13, RPS14, RPS15, RPS15A, RPS16, RPS17, RPS18, RPS19, RPS20, RPS21, RPS23, RPS24, RPS25, RPS26, RPS27, RPS27A, RPS28, RPS29, RPS30, RPL3, RPL4, RPLS, RPL6, RPL7, RPL7A, RPL8, RPL9, RPL10, RPL10A, RPL11, RPL12, RPL13, RPL13A, RPL14, RPL15, RPL17, RPL18, RPL18A, RPL19, RPL21, RPL22, RPL23, RPL23A, RPL24, RPL26, RPL27, RPL27A, RPL28, RPL29, RPL30, RPL31, RPL32, RPL34, RPL35, RPL35A, RPL36, RPL36A, RPL37, RPL37A, RPL38, RPL39, RPL40, RPL41, RPLP0, RPLP1, RPLP2, RPLP3, RPLP0, RPLP1, RPLP2, EEF1A1, EEF1B2, EEF1D, EEF1G, EEF2, EIF3E, EIF3F, EIF3H, EIF2S3, EIF3C, EIF3K, EIF3EIP, EIF4A2, PABPC1, HNRNPA1, TPT1, TUBB1, UBA52, NPM1, ATP5G2, GNB2L1, NME2, UQCRB, or from a homolog or variant thereof, wherein preferably the 5′-UTR element does not comprise a TOP-motif or the 5′ TOP of said genes, and wherein optionally the 5′-UTR element starts at its 5′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 downstream of the 5′ terminal oligopyrimidine tract (TOP) and wherein further optionally the 5′-UTR element which is derived from a 5′-UTR of a TOP gene terminates at its 3′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 upstream of the start codon (A(U/T)G) of the gene it is derived from.


In further particularly preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence, which is derived from the 5′-UTR of a ribosomal protein Large 32 gene (RPL32), a ribosomal protein Large 35 gene (RPL35), a ribosomal protein Large 21 gene (RPL21), an ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, an hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), an androgen-induced 1 gene (AIG1), cytochrome c oxidase subunit VIc gene (COX6C), or a N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, preferably from a vertebrate ribosomal protein Large 32 gene (RPL32), a vertebrate ribosomal protein Large 35 gene (RPL35), a vertebrate ribosomal protein Large 21 gene (RPL21), a vertebrate ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a vertebrate hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a vertebrate androgen-induced 1 gene (AIG1), a vertebrate cytochrome c oxidase subunit VIc gene (COX6C), or a vertebrate N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, more preferably from a mammalian ribosomal protein Large 32 gene (RPL32), a ribosomal protein Large 35 gene (RPL35), a ribosomal protein Large 21 gene (RPL21), a mammalian ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a mammalian hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a mammalian androgen-induced 1 gene (AIG1), a mammalian cytochrome c oxidase subunit VIc gene (COX6C), or a mammalian N-acylsphingosine ami-dohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, most preferably from a human ribosomal protein Large 32 gene (RPL32), a human ribosomal protein Large 35 gene (RPL35), a human ribosomal protein Large 21 gene (RPL21), a human ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a human hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a human androgen-induced 1 gene (AIG1), a human cytochrome c oxidase subunit VIc gene (COX6C), or a human N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, wherein preferably the 5′-UTR element does not comprise the 5′ TOP of said gene.


ATP5A1 Derived 5′-UTR:


In some preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence, which is derived from a 5′-UTR of a TOP gene encoding a mitochondrial ATP synthase subunit alpha or from a homolog or variant of a 5′-UTR of a TOP gene encoding a mitochondrial ATP synthase subunit alpha, preferably lacking the 5′ TOP motif.


In this context, the 5′-UTR element preferably comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a mitochondrial ATP synthase subunit alpha gene, preferably from a vertebrate mitochondrial ATP synthase subunit alpha (ATP5A1) gene, more preferably from a mammalian mitochondrial ATP synthase subunit alpha (ATP5A1) gene, most preferably from a human mitochondrial ATP synthase subunit alpha (ATP5A1) gene, or from a variant of the 5′-UTR of a mitochondrial ATP synthase subunit alpha gene, preferably from a vertebrate mitochondrial ATP synthase subunit alpha (ATP5A1) gene, more preferably from a mammalian mitochondrial ATP synthase subunit alpha (ATP5A1) gene, most preferably from a human mitochondrial ATP synthase subunit alpha (ATP5A1) gene, wherein preferably the 5′-UTR element does not comprise the 5′ TOP of said gene.


Accordingly, in a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence, which has an identity of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 3378 (5′-UTR of ATP5A1 lacking the 5′ terminal oligopyrimidine tract: GCGGCTCGGCCATTTTGTCCCAGTCAGTCCGGAGGCTGCGGCTGCAGAAGTACCGCCTGCGGAGTAACTGCAAAG; corresponding to SEQ ID NO: 1414 of the patent application WO2013/143700) or preferably to a corresponding RNA sequence according to SEQ ID NO: 3379, or wherein the at least one 5′-UTR element comprises or consists of a fragment of a nucleic acid sequence which has an identity of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 3378 or more preferably to a corresponding RNA sequence (SEQ ID NO: 3379), wherein, preferably, the fragment is as described above, i.e. being a continuous stretch of nucleotides representing at least 20% etc. of the full-length 5′-UTR. Preferably, the fragment exhibits a length of at least about 20 nucleotides or more, preferably of at least about 30 nucleotides or more, more preferably of at least about 40 nucleotides or more. Preferably, the fragment is a functional fragment as described herein.


L32 Derived 5′-UTR:


In some preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence, which is derived from a 5′-UTR of a TOP gene encoding a ribosomal Large protein (RPL) or from a homolog or variant of a 5′-UTR of a TOP gene encoding a ribosomal Large protein (RPL). For example, the 5′-UTR element may comprise or consist of a nucleic acid sequence, which is derived from a 5′-UTR of a nucleic acid sequence according to any of SEQ ID NOs: 67, 259, 1284-1318, 1344, 1346, 1348-1354, 1357, 1358, 1421 and 1422 of the patent application WO2013/143700, a corresponding RNA sequence, a homolog thereof, or a variant thereof as described herein, preferably lacking the 5′ TOP motif.


In this context, the 5′-UTR element preferably comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a ribosomal protein Large 32 gene, preferably from a vertebrate ribosomal protein Large 32 (L32) gene, more preferably from a mammalian ribosomal protein Large 32 (L32) gene, most preferably from a human ribosomal protein Large 32 (L32) gene, or from a variant of the 5′-UTR of a ribosomal protein Large 32 gene, preferably from a vertebrate ribosomal protein Large 32 (L32) gene, more preferably from a mammalian ribosomal protein Large 32 (L32) gene, most preferably from a human ribosomal protein Large 32 (L32) gene, wherein preferably the 5′-UTR element does not comprise the 5′TOP of said gene.


Accordingly, in some particularly preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence, which has an identity of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 3376 (5′-UTR of human ribosomal protein Large 32 lacking the 5′ terminal oligopyrimidine tract: GGCGCTGCCTACGGAGGTGGCAGCCATCTCCTTCTCGGCATC; corresponding to SEQ ID NO: 1368 of the patent application WO2013/143700) or preferably to a corresponding RNA sequence according to SEQ ID NO: 3377, or wherein the at least one 5′-UTR element comprises or consists of a fragment of a nucleic acid sequence which has an identity of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 3376 or more preferably to a corresponding RNA sequence (SEQ ID NO: 3377), wherein, preferably, the fragment is as described above, i.e. being a continuous stretch of nucleotides representing at least 20% etc. of the full-length 5′-UTR. Preferably, the fragment exhibits a length of at least about 20 nucleotides or more, preferably of at least about 30 nucleotides or more, more preferably of at least about 40 nucleotides or more. Preferably, the fragment is a functional fragment as described herein.


Preferably, the at least one 5′-UTR element and the at least one 3′-UTR element act synergistically to increase protein production from the inventive mRNA.


Histone Stem-Loop


In particularly preferred embodiments, the inventive mRNA comprises a histone stem-loop sequence/structure. Such histone stem-loop sequences are preferably selected from histone stem-loop sequences as disclosed in WO2012/019780, the disclosure of which is incorporated herewith by reference.


A histone stem-loop sequence, suitable to be used within the present invention, is preferably selected from at least one of the following formulae (I) or (II):


Formula (I) (Stem-Loop Sequence without Stem Bordering Elements):




embedded image


Formula (II) (Stem-Loop Sequence with Stem Bordering Elements):




embedded image


wherein:















stem1 or stem2 bordering elements N1-6
is a consecutive sequence of 1 to 6, preferably of 2 to 6, more



preferably of 2 to 5, even more preferably of 3 to 5, most



preferably of 4 to 5 or 5N, wherein each N is independently from



another selected from a nucleotide selected from A, U, T, G and



C, or a nucleotide analogue thereof;


stem1 [N0-2GN3-5]
is reverse complementary or partially reverse complementary



with element stem2, and is a consecutive sequence between of 5



to 7 nucleotides;



wherein N0-2 is a consecutive sequence of 0 to 2, preferably of 0



to 1, more preferably of 1N, wherein each N is independently



from another selected from a nucleotide selected from A, U, T, G



and C or a nucleotide analogue thereof;



wherein N3-5 is a consecutive sequence of 3 to 5, preferably of 4



to 5, more preferably of 4N, wherein each N is independently



from another selected from a nucleotide selected from A, U, T, G



and C or a nucleotide analogue thereof, and



wherein G is guanosine or an analogue thereof, and may be



optionally replaced by a cytidine or an analogue thereof, provided



that its complementary nucleoside cytidine in stem2 is replaced



by guanosine;


loop sequence [N0-4(U/T)N0-4]
is located between elements stem1 and stem2, and is a



consecutive sequence of 3 to 5 nucleotides, more preferably of 4



nucleotides;



wherein each N0-4 is independent from another a consecutive



sequence of 0 to 4, preferably of 1 to 3, more preferably of 1 to 2N,



wherein each N is independently from another selected from a



nucleotide selected from A, U, T, G and C or a nucleotide



analogue thereof; and



wherein U/T represents uridine, or optionally thymidine;


stem2 [N3-5CN0-2]
is reverse complementary or partially reverse complementary



with element stem1, and is a consecutive sequence between of 5



to 7 nucleotides;



wherein N3-5 is a consecutive sequence of 3 to 5, preferably of 4



to 5, more preferably of 4N, wherein each N is independently



from another selected from a nucleotide selected from A, U, T, G



and C or a nucleotide analogue thereof;



wherein N0-2 is a consecutive sequence of 0 to 2, preferably of 0



to 1, more preferably of 1N, wherein each N is independently



from another selected from a nucleotide selected from A, U, T, G



or C or a nucleotide analogue thereof; and



wherein C is cytidine or an analogue thereof, and may be



optionally replaced by a guanosine or an analogue thereof



provided that its complementary nucleoside guanosine in stem1



is replaced by cytidine;










wherein


stem1 and stem2 are capable of base pairing with each other forming a reverse complementary sequence, wherein base pairing may occur between stem1 and stem2, e.g. by Watson-Crick base pairing of nucleotides A and U/T or G and C or by non-Watson-Crick base pairing e.g. wobble base pairing, reverse Watson-Crick base pairing, Hoogsteen base pairing, reverse Hoogsteen base pairing or are capable of base pairing with each other forming a partially reverse complementary sequence, wherein an incomplete base pairing may occur between stem1 and stem2, on the basis that one or more bases in one stem do not have a complementary base in the reverse complementary sequence of the other stem.


According to a further preferred embodiment, the inventive mRNA may comprise at least one histone stem-loop sequence according to at least one of the following specific formulae (Ia) or (IIa):


Formula (Ia) (Stem-Loop Sequence without Stem Bordering Elements):




embedded image


Formula (IIa) (Stem-Loop Sequence with Stem Bordering Elements):




embedded image


wherein:


N, C, G, T and U are as defined above.


According to a further more particularly preferred embodiment, the inventive mRNA comprises at least one histone stem-loop sequence according to at least one of the following specific formulae (Ib) or (IIb):


Formula (Ib) (Stem-Loop Sequence without Stem Bordering Elements):




embedded image


Formula (IIb) (Stem-Loop Sequence with Stem Bordering Elements):




embedded image


wherein:


N, C, G, T and U are as defined above.


A particularly preferred histone stem-loop sequence is the sequence CAAAGGCTCTTTTCAGAGCCACCA (according to SEQ ID NO: 3394) or more preferably the corresponding RNA sequence CAAAGGCUCUUUUCAGAGCCACCA (according to SEQ ID NO: 3395).


Accordingly, in preferred embodiments, the artificial nucleic acid of the invention comprises at least one histone stem-loop as defined herein. Preferably, the at least one histone stem loop comprises a nucleic acid sequence according to SEQ ID NOs: 3394 or 3395, or a fragment or variant thereof.


In particularly preferred embodiments, the artificial nucleic acid, preferably the artificial mRNA of the invention comprises a 3W-terminal sequence element comprising a poly(A)sequence as defined herein and a histone-stem-loop sequence as defined herein, wherein the 3′-terminal sequence element may be selected from SEQ ID NOs: 6412, 6413, 6414 or 6415.


Signal Peptides


According to particularly preferred embodiments, the inventive further encodes a secretory (signal) peptide. The sequence encoding the secretory (signal) peptide is preferably joined to the sequence encoding the antigenic peptide or protein, so that said peptide or protein is expressed as a fusion protein comprising said secretory (signal) peptide.


Such signal peptides are sequences, which typically exhibit a length of about 15 to 30 amino acids and are preferably located at the N-terminus of the encoded peptide, without being limited thereto. Signal peptides as defined herein preferably allow the transport of the antigenic peptide or protein as encoded by the inventive mRNA into a defined cellular compartment, preferably the cell surface, the endoplasmic reticulum (ER) or the endosomal-lysosomal compartment. Examples of secretory signal peptide sequences as defined herein include, without being limited thereto, signal sequences of classical or non-classical MHC-molecules (e.g. signal sequences of MHC I and II molecules, e.g. of the MHC class I molecule HLA-A*0201), signal sequences of cytokines or immunoglobulins, signal sequences of the invariant chain of immunoglobulins or antibodies, signal sequences of Lamp1, Tapasin, Erp57, Calretikulin, Calnexin, PLAT, EPO or albumin and further membrane associated proteins or of proteins associated with the endoplasmic reticulum (ER) or the endosomal-lysosomal compartment.


Particularly preferred signal peptides are those derived from human HLA-A2 (amino acids 1-24), human PLAT (amino acids 1-23, 1-21 or 1-22), human EPO (amino acids 1-27), human ALB (amino acids 1-18), human IgE, human CD5 (amino acids 1-24), human IL2(amino acids 1-20), human CTRB2(amino acids 1-18), human IgG-HC (amino acids 1-19), human Ig-HC (amino acids 1-19), human Ig-LC (amino acids 1-19), Gaussia princeps Luc (amino acids 1-17), mouse Igkappa, NrChit1 (1-26) CILp1.1(1-21) Nepenthes raffiesiana Nep1 (amino acids 1-24), human Azu1 (amino acids 1-19), human CD33(amino acids 1-16), Vibrio cholera CtxB (amino acids 1-19), human CST4 (amino acids 1-20), human Ins-iso1 (amino acids 1-24), human SPARC (amino acids 1-17), or Influenza A SP-H1N1 (Netherlands2009)-HA.


Such signal peptides are preferably used to promote secretion of the encoded antigenic peptide or protein.


According to particularly preferred embodiments, the inventive RNA comprises a coding region encoding a signal peptide as defined herein. Said coding region is preferably joined to or comprised by the coding region encoding the antigenic peptide or protein as defined herein, so that the antigenic protein or peptide is expressed as a fusion protein comprising the encoded signal peptide.


Accordingly, particularly preferred embodiments, the inventive mRNA additionally comprises at least one coding region encoding a signal peptide as defined herein, said signal peptide preferably comprising or consisting of an amino acid sequence as defined by SEQ ID NO: 3399-3423, or a variant or fragment thereof. Such variants or fragments are preferably functional, i.e. exhibit the same desired biological function as the signal peptides they are derived from, and are thus preferably capable of mediating secretion of the fused antigenic protein or peptide.


Transmembrane Domains


According to further particularly preferred embodiments, the inventive mRNA further encodes a transmembrane domain. The sequence encoding the transmembrane domain is preferably joined to the sequence encoding the antigenic peptide or protein, so that said peptide or protein is expressed as a fusion protein comprising said transmembrane domain.


Transmembrane domains (TMDs) consist of amino acid sequences forming three-dimensional protein structures that span the lipid bilayer. TMDs are typically about 20 amino acids in length and can form a single alpha helix, a transmembrane beta barrel, a beta-helix of gramicidin A, or any other structure.


Transmembrane domains as defined herein preferably anchor the fused antigenic protein or peptide in a membrane. Such transmembrane domains may be useful to ensure that the antigenic proteins or peptides are membrane-bound and thus accessible for the immune system.


Particularly preferred transmembrane domains are those derived from Influenza virus, Human immunodeficiency virus 1, equine infectious anemia virus, murine leukemia virus, mouse mammary tumor virus, vesicular stomatitis virus and rabies virus.


According to particularly preferred embodiments, the inventive mRNA comprises a coding region encoding a TMD as defined herein. Said coding region is preferably joined to or comprised by the coding region encoding the antigenic peptide or protein as defined herein, so that the antigenic protein or peptide is expressed as a fusion protein comprising the encoded TMD.


Accordingly, particularly preferred embodiments, the inventive mRNA additionally comprises at least one coding region encoding a TMD as defined herein, said TMD preferably comprising or consisting of an amino acid sequence as defined by SEQ ID NO: 3424-3435, or a variant or fragment thereof. Such variants or fragments are preferably functional, i.e. exhibit the same desired biological function as the TMDs they are derived from, and are thus preferably capable of anchoring the fused antigenic protein or peptide in a membrane. Said membrane can be a cellular membrane or a virus-like particle membrane.


VLP-Forming Domains


According to further particularly preferred embodiments, the inventive mRNA further encodes a VLP-forming domain. The sequence encoding the VLP-forming domains preferably joined to the sequence encoding the antigenic peptide or protein, so that said peptide or protein is expressed as a fusion protein comprising said VLP-forming domain.


VLP-forming domains are capable of mediating the formation of virus-like particles (VLPs), i.e. non-infectious assemblies of viral structural proteins. VLPs preferably contain repetitive, high density displays of viral structural proteins that can elicit strong adaptive immune responses.


VLP-forming domains can therefore be fused to antigenic peptides or proteins as defined herein in order to mediate their assembly into VLPs.


Particularly preferred VLP-forming domains are those derived from Woodchuck hepatitis virus and Alfalfa mosaic virus.


According to particularly preferred embodiments, the inventive RNA comprises a coding region encoding a VLP-forming domain as defined herein. Said coding region is preferably joined to or comprised by the coding region encoding the antigenic peptide or protein as defined herein, so that the antigenic protein or peptide is expressed as a fusion protein comprising the encoded VLP-forming domain.


Accordingly, particularly preferred embodiments, the inventive mRNA additionally comprises at least one coding region encoding a VLP-forming domain as defined herein, said TMD preferably comprising or consisting of an amino acid sequence as defined by SEQ ID NO: 3436 or 3437, or a variant or fragment thereof. Such variants or fragments are preferably functional, i.e. exhibit the same desired biological function as the VLP-forming domains they are derived from, and are thus preferably capable of mediating the assembly of the fused antigenic proteins or peptides into VLPs.


Any of the above modifications may be applied to the mRNA of the present invention, and may be, if suitable or necessary, be combined with each other in any combination, provided, these combinations of modifications do not interfere with each other in said mRNA. A person skilled in the art will be able to take his choice accordingly.


RNA Constructs


The inventive mRNA, which comprises at least one coding sequence as defined herein may preferably comprise a 5′-UTR and/or a 3′-UTR optionally containing at least one histone stem-loop.


The 3′-UTR of the mRNA preferably comprises also a poly(A) and/or a poly(C) sequence as defined herein. The single elements of the 3′-UTR may occur therein in any order from 5′ to 3′ along the sequence of the mRNA.


In addition, further elements as described herein, may also be contained, such as a stabilizing sequence as defined herein (e.g. derived from the UTR of a globin gene), IRES sequences, etc. Each of the elements may also be repeated in the mRNA at least once (particularly in di- or multicistronic constructs), preferably twice or more. As an example, the single elements may be present in the inventive mRNA in the following order:


5′-coding sequence-histone stem-loop-poly(A)/(C) sequence-3′; or


5′-coding sequence-poly(A)/(C) sequence-histone stem-loop-3′; or


5′-coding sequence-histone stem-loop-polyadenylation signal-3′; or


5′-coding sequence-polyadenylation signal-histone stem-loop-3′; or


5′-coding sequence-histone stem-loop-histone stem-loop-poly(A)/(C) sequence-3′; or


5′-coding sequence-histone stem-loop-histone stem-loop-polyadenylation signal-3′; or


5′-coding sequence-stabilizing sequence-poly(A)/(C) sequence-histone stem-loop-3′; or


5′-coding sequence-stabilizing sequence-poly(A)/(C) sequence-poly(A)/(C) sequence-histone stem-loop-3′; etc.


According to further embodiments, the inventive mRNA preferably comprises at least one of the following structural elements: a 5′- and/or 3′-untranslated region element (UTR element), particularly a 5′-UTR element, which preferably comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a TOP gene or from a fragment, homolog or a variant thereof, or a 5′- and/or 3′-UTR element which may preferably be derivable from a gene that provides a stable mRNA or from a homolog, variant or fragment thereof; a histone-stem-loop structure, preferably a histone-stem-loop in its 3′ untranslated region; a 5′-cap structure; a poly-A tail; or a poly(C) sequence.


It may be particularly preferred that if, in addition to an antigenic peptide or protein as defined herein, a further peptide or protein is encoded by the at least one coding sequence as defined herein, the encoded peptide or protein is preferably no histone protein, no reporter protein (e.g. Luciferase, GFP, EGFP, beta-galactosidase, particularly EGFP) and/or no marker or selection protein (e.g. alpha-globin, galactokinase and Xanthine:Guanine phosphoribosyl transferase (GPT)). Thus, in some preferred embodiments, the inventive mRNA does not comprise a reporter gene or a marker gene. Preferably, the mRNA does not encode, for instance, luciferase; green fluorescent protein (GFP) and its variants (such as eGFP, RFP or BFP); alpha-globin; hypoxanthine-guanine phosphoribosyltransferase (HGPRT); beta-galactosidase; galactokinase; alkaline phosphatase; secreted embryonic alkaline phosphatase (SEAP)) or a resistance gene (such as a resistance gene against neomycin, puromycin, hygromycin and zeocin). In preferred embodiments, the inventive mRNA does not encode luciferase. In other embodiments, the inventive mRNA does not encode GFP or a variant thereof.


In preferred embodiments the nucleic acid sequence, in particular, the RNA sequence comprises, preferably in 5′- to 3′-direction, the following elements:

  • a) 5′-cap structure, preferably as defined herein;
  • b) optionally, 5′-UTR element, preferably as defined herein;
  • c) at least one coding sequence, preferably as defined herein;
  • d) a 3′-UTR element, preferably as defined herein;
  • e) optionally, poly(A) sequence, preferably as defined herein;
  • f) optionally, poly(C) sequence, preferably as defined herein;
  • g) optionally, a histone stem-loop, preferably as defined herein; and
  • h) optionally, a 3′-terminal sequence element as defined herein.


According to preferred embodiments, the mRNA comprises, preferably in 5′ to 3′ direction, the following elements:

  • a) a 5′-cap structure, preferably m7GpppN,
  • b) at least one coding region encoding an antigenic protein or peptide as defined herein,
  • c) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,
  • d) a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, and
  • e) a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395.
  • f) optionally, a poly(A) sequence and a histone stem-loop comprising the RNA sequence according to SEQ ID NOs: 6412, 6413, 6414 or 6415


More preferably, the mRNA comprises, preferably in 5′ to 3′ direction, the following elements:

  • a) a 5′-cap structure, preferably m7GpppN,
  • b) at least one coding region encoding an antigenic protein or peptide as defined herein,
  • c) a 3′-UTR element comprising a nucleic acid sequence, which is derived from an alpha-globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, or a homolog, a fragment or a variant thereof,
  • d) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,
  • e) a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, and
  • f) a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395, and
  • g) optionally, a poly(A) sequence and a histone stem-loop comprising the RNA sequence according to SEQ ID NOs: 6412, 6413, 6414 or 6415


More preferably, the mRNA comprises, preferably in 5′ to 3′ direction, the following elements:

  • a) a 5′-cap structure, preferably m7GpppN,
  • b) at least one coding region encoding an antigenic protein or peptide derived from a Lassa virus protein or peptide or a fragment or variant thereof, preferably comprising or consisting of any one of the nucleic acid sequences according to SEQ ID NO: 376-561, 751-936, 1126-1311, 1501-1686, 1876-2061, 2251-2436, 2626-2811 or 3001-3186, or 562-750, 937-1125, 1312-1500, 1687-1875, 2062-2250, 2437-2625, 2812-3000 or 3187-3375, or 3604-3756, 3757-3909, 3910-4062, 4063-4215, 4216-4368, 4369-4521, 4522-4674, 4675-4827, or a fragment or variant thereof.
  • c) a 3′-UTR element comprising a nucleic acid sequence, which is derived from an alpha-globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, or a homolog, a fragment or a variant thereof,
  • d) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,
  • e) a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, and
  • f) a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395, and
  • g) optionally, a poly(A) sequence and a histone stem-loop comprising the RNA sequence according to SEQ ID NOs: 6412, 6413, 6414 or 6415


In further preferred embodiments, the inventive mRNA comprises, preferably in 5′ to 3′ direction, the following elements:

  • a) a 5′-cap structure, preferably m7GpppN,
  • b) a 5′-UTR element, which preferably comprises or consists of a nucleic acid sequence, which is derived from the 5′-UTR of a TOP gene, preferably comprising an RNA sequence corresponding to the nucleic acid sequence according to SEQ ID NO: 3376, or a homolog, a fragment or a variant thereof,
  • c) at least one coding region encoding an antigenic protein or peptide as defined herein,
  • d) a 3′-UTR element comprising a nucleic acid sequence, which is preferably derived from an alpha-globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, or a homolog, a fragment or a variant thereof; and/or a 3′-UTR element comprising a nucleic acid sequence, which is derived from an albumin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3390 or 3392, or a homolog, a fragment or a variant thereof,
  • e) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,
  • f) a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, and
  • g) a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395.


In further preferred embodiments, the inventive mRNA comprises, preferably in 5′ to 3′ direction, the following elements:

  • a) a 5′-cap structure, preferably m7GpppN,
  • b) a 5′-UTR element, which preferably comprises or consists of a nucleic acid sequence, which is derived from the 5′-UTR of a TOP gene, preferably comprising an RNA sequence corresponding to the nucleic acid sequence according to SEQ ID NO: 3376, or a homolog, a fragment or a variant thereof,
  • c) at least one coding region encoding an antigenic protein or peptide derived from a Lassa virus protein or peptide or a fragment or variant thereof, preferably comprising or consisting of any one of the nucleic acid sequences according to SEQ ID NO: 376-561, 751-936, 1126-1311, 1501-1686, 1876-2061, 2251-2436, 2626-2811 or 3001-3186, or 562-750, 937-1125, 1312-1500, 1687-1875, 2062-2250, 2437-2625, 2812-3000 or 3187-3375, or 3604-3756, 3757-3909, 3910-4062, 4063-4215, 4216-4368, 4369-4521, 4522-4674, 4675-4827, or a fragment or variant thereof.
  • d) a 3′-UTR element comprising a nucleic acid sequence, which is preferably derived from an alpha-globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, or a homolog, a fragment or a variant thereof; and/or
    • a 3′-UTR element comprising a nucleic acid sequence, which is derived from an albumin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3390 or 3392, or a homolog, a fragment or a variant thereof,
  • e) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,
  • f) a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, and
  • g) a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395.


Preferred Lassa Virus Constructs of the Invention:


In the following, preferred and particularly suitable Lassa virus mRNA sequences of the invention are provided.


Preferred Lassa virus polypeptide, nucleic acid and mRNA sequences are provided in Table 1-3 (in the columns D, E and F). Therein, each row represents a specific suitable Lassa virus construct of the invention. Accession numbers are provided in the <223> identifier of the respective SEQ ID NOs in the sequence listing. Each column A provides the respective SEQ ID NOs of the protein constructs as provided in the sequence listing. mRNA constructs comprising coding sequences encoding said proteins are provided in column D “SEQ ID NO: mRNA design 1” column E “SEQ ID NO: mRNA design 2” and column F “SEQ ID NO: mRNA design 3”. Additional information regarding each of the sequences provided in Table 1-3 may also be derived from the sequence listing, in particular from the details provided therein under identifier <223>.


Accordingly, it is particularly preferred that the mRNA according to the invention comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 4828-5013, 5014-5202 or 5203-5355 (column 5 D “mRNA design 1”, Table 1-3) or a fragment or variant of any of these sequences.


Accordingly, it is particularly preferred that the mRNA according to the invention comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5356-5541, 5542-5730 or 5731-5883 (column 5 E “mRNA design 2”, Table 1-3) or a fragment or variant of any of these sequences.


Accordingly, it is particularly preferred that the mRNA according to the invention comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5884-6069, 6070-6259 or 6259-6411 (column 7 F “mRNA design 3”, Table 1-3) or a fragment or variant of any of these sequences.


Accordingly, it is particularly preferred that the mRNA according to the invention comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5541, 5729, 5731, 5529, 5356, 5540 or 5013 or a fragment or variant of any of these sequences.


(Pharmaceutical) Composition


In a further aspect, the present invention provides a composition comprising the mRNA of the invention, and at least one pharmaceutically acceptable carrier. The composition according to the invention is preferably provided as a pharmaceutical composition or as a vaccine.


A “vaccine” is typically understood to be a prophylactic or therapeutic material providing at least one epitope of an antigen, preferably an immunogen. “Providing at least on epitope” means, for example, that the vaccine comprises the epitope (or antigen comprising or providing said epitope) or that the vaccine comprises a molecule that, e.g., codes for the epitope or an antigen comprising or providing the epitope. The antigen preferably stimulates the adaptive immune system to provide an adaptive immune response.


The (pharmaceutical) composition or vaccine provided herein may further comprise at least one pharmaceutically acceptable excipient, adjuvant or further component (e.g. additives, auxiliary substances, and the like).


Plurality of mRNAs


In preferred embodiments, the (pharmaceutical) composition or vaccine according to the invention comprises a plurality or more than one of the inventive mRNAs.


In some embodiments, each of the mRNAs comprises a coding region encoding at least one different antigenic peptide or protein derived from proteins of the same lassa virus. In other embodiments, each of the mRNAs comprises a coding region which encodes at least one different antigenic peptide or protein derived from different proteins of the same lassa virus. In some embodiments, each of the mRNAs comprises a coding region which encodes at least one different antigenic peptide or protein derived from different proteins of different lassa viruses (e.g. clade I and II or lineage I and II. In a preferred embodiment, each of the mRNAs comprises a coding region which encodes at least one different antigenic peptide or protein derived from proteins of the different lassa virus clades or lineages (e.g. of clade I, II, III and IV or of lineage I, II, III and IV).


In said embodiments, the antigenic peptides or proteins are preferably derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, and/or nucleoprotein (NP) of a lassa virus, and/or zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof.


In a preferred embodiment, the antigenic peptides or proteins are derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, and nucleoprotein (NP) of a lassa virus, and zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof.


In a preferred embodiment, the antigenic peptides or proteins are derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof and nucleoprotein (NP) of a lassa virus, or a variant or fragment thereof.


In a further preferred embodiment, the antigenic peptides or proteins are derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof and zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof.


Specifically, said nucleoprotein (NP) and/or glycoprotein precursor (GPC), and/or zinc-binding matrix protein (Z) encoded by the plurality of mRNAs may be derived from 2, 3, 4 or more different lassa viruses, preferably derived from 2, 3, 4 or more different lassa clades or lassa lineages, or may be derived from the same lassa virus.


In a further preferred embodiment, the antigenic peptides or proteins are derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, wherein the lassa virus peptides or proteins are derived from at least two different lassa virus strains of clade I, II, III and/or IV or from a lassa virus of lineage I, II, III and/or IV. In a specific embodiment, the at least two lassa virus proteins exhibit considerable sequence dissimilarities. In a further preferred embodiment, the dissimilarities are greater than 11% amino acid sequence divergence in the region of the NP gene. In a second embodiment, the sequence divergence of the mRNA encoding the lassa virus protein is greater than 22% nucleotide sequence divergence.


In a preferred embodiment, the antigenic peptides or proteins are derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, wherein the lassa virus peptides or proteins are derived from four different lassa virus strains of clade I, II, III and IV, wherein the lassa virus strains are selected from Josiah (clade IV), LP (clade I), 803213 (clade II), and GA391 (clade III).


In a further preferred embodiment, each of the mRNAs comprises a coding region encoding at least GPC derived from proteins of different lassa viruses, wherein the lassa virus peptides or proteins are derived from four different lassa virus strains of clade I, II, III and IV, preferably the mRNA comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5541, 5529, 5356, 5540 and 5729.


Accordingly, the (pharmaceutical) composition or vaccine according to the invention may comprise an mRNA as defined herein, wherein said mRNA encodes one specific antigenic peptide or protein as defined herein. In such embodiments, the (pharmaceutical) composition or vaccine preferably comprises an inventive mRNA comprising the at least one coding sequence as defined herein encoding said antigenic peptide or protein.


Alternatively, the (pharmaceutical) composition or vaccine of the present invention may comprise, an mRNA as defined herein, wherein said mRNA encodes at least two, three, four, five, six, seven, eight, nine, ten, eleven or twelve distinct antigenic peptides or proteins (or variants or fragments thereof) as defined herein. In such embodiments, the (pharmaceutical) composition or vaccine preferably comprises several species of the inventive mRNA, wherein each mRNA species encodes a distinct antigenic peptide or protein as defined herein.


In other embodiments, the plurality of mRNAs comprised in the (pharmaceutical) composition or vaccine encode at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) and/or nucleoprotein (NP) and/or zinc-binding matrix protein (Z) of 2, 3, 4, 5, 6, 7, 6, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, or 100 different lassa viruses.


In other embodiments, the mRNA comprised in the (pharmaceutical) composition or vaccine is a bi- or multicistronic mRNA as defined herein, which encodes the at least two, three, four, five, six, seven, eight, nine, ten, eleven or twelve distinct antigenic peptides or proteins as defined herein.


Mixtures between these embodiments are also envisaged, such as compositions comprising more than one mRNA species, wherein at least one mRNA species may be monocistronic, while at least one other mRNA species may be bi- or multicistronic.


Complexation


In preferred embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, of the invention, is provided in a complexed form, i.e. complexed or associated with one or more (poly-)cationic compounds, preferably with (poly-)cationic polymers, (poly-)cationic peptides or proteins, e.g. protamine, (poly-)cationic polysaccharides and/or (poly-)cationic lipids. In this context, the terms “complexed” or “associated” refer to the essentially stable combination of said mRNA with one or more of the aforementioned compounds into larger complexes or assemblies without covalent binding.


Lipids


According to preferred embodiments, the mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is complexed or associated with lipids (in particular cationic and/or neutral lipids) to form one or more liposomes, lipoplexes, lipid nanoparticles, or nanoliposomes.


Therefore, in some embodiments, the at least one mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is provided in the form of a lipid-based formulation, in particular in the form of liposomes, lipoplexes, and/or lipid nanoparticles comprising said mRNA.


Lipid Nanoparticles:


According to some preferred embodiments, the mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is complexed or associated with lipids (in particular cationic and/or neutral lipids) to form one or more lipid nanoparticles.


In the context of the present invention, the term “lipid nanoparticle”, also referred to as “LNP”, is not restricted to any particular morphology, and includes any morphology generated when a cationic lipid and optionally one or more further lipids are combined, e.g. in an aqueous environment and/or in the presence of an RNA. For example, a liposome, a lipid complex, a lipoplex, an emulsion, a micelle, a lipidic nanocapsule, a nanosuspension and the like are within the scope of a lipid nanoparticle (LNP).


Preferably, lipid nanoparticles (LNPs) comprise: (a) at least one mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, (b) a cationic lipid, (c) an aggregation reducing agent (such as polyethylene glycol (PEG) lipid or PEG-modified lipid), (d) optionally a non-cationic lipid (such as a neutral lipid), and (e) optionally, a sterol.


In some embodiments, LNPs comprise, in addition to the at least one mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, (i) at least one cationic lipid; (ii) a neutral lipid; (iii) a sterol, e.g., cholesterol; and (iv) a PEG-lipid, in a molar ratio of about 20-60% cationic lipid: 5-25% neutral lipid: 25-55% sterol; 0.5-15% PEG-lipid.


In some embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, may be formulated in an aminoalcohol lipidoid. Aminoalcohol lipidoids which may be used in the present invention may be prepared by the methods described in U.S. Pat. No. 8,450,298, herein incorporated by reference in its entirety. Ionizable lipids can also be the compounds as disclosed in Tables 1, 2 and 3 and claims 1-24 of International Publication No. WO2017/075531 A1, hereby incorporated by reference in its entirety. In another embodiment, ionizable lipids can also be the compounds as disclosed in International Publication No. WO2015/074085 A1 (i.e. ATX-001 to ATX-032 or the compounds as mentioned in claims 1-26), U.S. Appl. No. 61/905,724 and Ser. No. 15/614,499 or U.S. Pat. Nos. 9,593,077 and 9,567,296 hereby incorporated by reference in their entirety.


(i) Cationic Lipids


LNPs may include any cationic lipid suitable for forming a lipid nanoparticle. Preferably, the cationic lipid carries a net positive charge at about physiological pH.


The cationic lipid may be an amino lipid. As used herein, the term “amino lipid” is meant to include those lipids having one or two fatty acid or fatty alkyl chains and an amino head group (including an alkylamino or dialkylamino group) that may be protonated to form a cationic lipid at physiological pH.


The cationic lipid may be, for example, N,N-dioleyl-N,N-dimethylammonium chloride (DODAC), N,N-distearyl-N,N-dimethylammonium bromide (DDAB), 1,2-dioleoyltrimethyl ammonium propane chloride (DOTAP) (also known as N-(2,3-dioleoyloxy)propyl)-N,N,N-trimethylammonium chloride and 1,2-Dioleyloxy-3-trimethylaminopropane chloride salt), N-(1-(2,3-dioleyloxy)propyl)-N,N,N-trimethylammonium chloride (DOTMA), N,N-dimethyl-2,3-dioleyloxy)propylamine (DODMA), 1,2-DiLinoleyloxy-N,N-dimethylaminopropane (DLinDMA), 1,2-Dilinolenyloxy-N,N-dimethylaminopropane (DLenDMA), 1,2-di-y-linolenyloxy-N,N-dimethylaminopropane (γ-DLenDMA), 1,2-Dilinoleylcarbamoyloxy-3-dimethylaminopropane (DLin-C-DAP), 1,2-Dilinoleyoxy-3-(dimethylamino)acetoxypropane (DLin-DAC), 1,2-Dilinoleyoxy-3-morpholinopropane (DLin-MA), 1,2-Dilinoleoyl-3-dimethylaminopropane (DLinDAP), 1,2-Dilinoleylthio-3-dimethylaminopropane (DLin-S-DMA), 1-Linoleoyl-2-linoleyloxy-3-dimethylaminopropane (DLin-2-DMAP), 1,2-Dilinoleyloxy-3-trimethylaminopropane chloride salt (DLin-TMA.Cl), 1,2-Dilinoleoyl-3-trimethylaminopropane chloride salt (DLin-TAP.Cl), 1,2-Dilinoleyloxy-3-(N-methylpiperazino)propane (DLin-MPZ), or 3-(N,N-Dilinoleylamino)-1,2-propanediol (DLinAP), 3-(N,N-Dioleylamino)-1,2-propanedio (DOAP), 1,2-Dilinoleyloxo-3-(2-N,N-dimethylamino)ethoxypropane (DLin-EG-DM A), 2,2-Dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (DLin-K-DMA) or analogs thereof, (3aR,5s,6aS)-N,N-dimethyl-2,2-di((9Z,12Z)-octadeca-9,12-dienyl)tetrahydro-3aH-cyclopenta[d][1,3]dioxol-5-amine, (6Z,9Z,28Z,31Z)-heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino)butanoate (MC3), 1,1-(2-(4-(2-((2-(bis(2-hydroxydodecyl)amino)ethyl) (2-hydroxydodecyl)amino)ethyl)piperazin-1-yl) ethylazanediyl)didodecan-2-ol (C12-200), 2,2-dilinoleyl-4-(2-dimethylaminoethyl)-[1,3]-dioxolane (DLin-K-C2-DMA), 2,2-dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (DLin-K-DMA), (6Z,9Z,28Z,31Z)-heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino) butanoate (DLin-M-C3-DMA), 3-((6Z,9Z,28Z,31Z)-heptatriaconta-6,9,28,31-tetraen-19-yloxy)-N,N-dimethylpropan-1-amine (MC3 Ether), 4-((6Z,9Z,28Z,31Z)-heptatriaconta-6,9,28,31-tetraen-19-yloxy)-N,N-dimethylbutan-1-amine (MC4 Ether), or any combination of any of the foregoing.


Other cationic lipids include, but are not limited to, N,N-distearyl-N,N-dimethylammonium bromide (DDAB), 3P—(N—(N′,N′-dimethylaminoethane)-carbamoyl)cholesterol (DC-Choi), N-(1-(2,3-dioleyloxy)propyl)-N-2-(sperminecarboxamido)ethyl)-N,N-dimethylammonium trifluoracetate (DOSPA), dioctadecylamidoglycyl carboxyspermine (DOGS), 1,2-dileoyl-sn-3-phosphoethanolamine (DOPE), 1,2-dioleoyl-3-dimethylammonium propane (DODAP), N-(1,2-dimyristyloxyprop-3-yl)-N,N-dimethyl-N-hydroxyethyl ammonium bromide (DMRIE), and 2,2-Dilinoleyl-4-dimethylaminoethyl-[1,3]-dioxolane (XTC). Additionally, commercial preparations of cationic lipids can be used, such as, e.g., LIPOFECTIN (including DOTMA and DOPE, available from GIBCO/BRL), and LIPOFECTAMINE (comprising DOSPA and DOPE, available from GIBCO/BRL).


Other suitable (cationic) lipids are disclosed in WO2009/086558, WO2009/127060, WO2010/048536, WO2010/054406, WO2010/088537, WO2010/129709, WO2011/153493, US2011/0256175, US2012/0128760, US2012/0027803, and U.S. Pat. No. 8,158,601. In that context, the disclosures of WO2009/086558, WO2009/127060, WO2010/048536, WO2010/054406, WO2010/088537, WO2010/129709, WO2011/153493, US2011/0256175, US2012/0128760, US2012/0027803, and U.S. Pat. No. 8,158,601 are incorporated herewith by reference.


In some aspects, the lipid is selected from the group consisting of 98N12-5, C12-200, and ckk-E12.


The cationic lipid may also be an amino lipid. Suitable amino lipids include those having alternative fatty acid groups and other dialkylamino groups, including those in which the alkyl substituents are different (e.g., N-ethyl-N-methylamino-, and N-propyl-N-ethylamino-). In general, amino lipids having less saturated acyl chains are more easily sized, particularly when the complexes must be sized below about 0.3 microns, for purposes of filter sterilization. Amino lipids containing unsaturated fatty acids with carbon chain lengths in the range of C14 to C22 may be used. Other scaffolds can also be used to separate the amino group and the fatty acid or fatty alkyl portion of the amino lipid.


Representative amino lipids include, but are not limited to, 1,2-dilinoleyoxy-3-(dimethylamino)acetoxypropane (DLin-DAC), 1,2-dilinoleyoxy-3morpholinopropane (DLin-MA), 1,2-dilinoleoyl-3-dimethylaminopropane (DLinDAP), 1,2-dilinoleylthio-3-dimethylaminopropane (DLin-S-DMA), 1-linoleoyl-2-linoleyloxy-3dimethylaminopropane (DLin-2-DMAP), 1,2-dilinoleyloxy-3-trimethylaminopropane chloride salt (DLin-TMA.Cl), 1,2-dilinoleoyl-3-trimethylaminopropane chloride salt (DLin-TAP.Cl), 1,2-dilinoleyloxy-3-(N-methylpiperazino)propane (DLin-MPZ), 3-(N,Ndilinoleylamino)-1,2-propanediol (DLinAP), 3-(N,N-dioleylamino)-1,2-propanediol (DOAP), 1,2-dilinoleyloxo-3-(2-N,N-dimethylamino)ethoxypropane (DLin-EG-DMA), and 2,2-dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (DLin-K-DMA), 2,2-dilinoleyl-4-(2-dimethylaminoethyl)-[1,3]-dioxolane (DLin-KC2-DMA); dilinoleyl-methyl-4-dimethylaminobutyrate (DLin-MC3-DMA); MC3 (US20100324120).


In some embodiments, amino or cationic lipids have at least one protonatable or deprotonatable group, such that the lipid is positively charged at a pH at or below physiological pH (e.g. pH 7.4), and neutral at a second pH, preferably at or above physiological pH. It will, of course, be understood that the addition or removal of protons as a function of pH is an equilibrium process, and that the reference to a charged or a neutral lipid refers to the nature of the predominant species and does not require that all of the lipid be present in the charged or neutral form. Lipids that have more than one protonatable or deprotonatable group, or which are zwitterionic, are not excluded from use in the invention.


In some embodiments, the protonatable lipids have a pKa of the protonatable group in the range of about 4 to about 11, e.g., a pKa of about 5 to about 7.


LNPs can include two or more cationic lipids. The cationic lipids can be selected to contribute different advantageous properties. For example, cationic lipids that differ in properties such as amine pKa, chemical stability, half-life in circulation, half-life in tissue, net accumulation in tissue, or toxicity can be used in the LNP. In particular, the cationic lipids can be chosen so that the properties of the mixed-LNP are more desirable than the properties of a single-LNP of individual lipids.


In some embodiments, the cationic lipid is present in a ratio of from about 20 mol % to about 70 or 75 mol % or from about 45 to about 65 mol % or about 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, or about 70 mol % of the total lipid present in the LNP. In further embodiments, the LNPs comprise from about 25% to about 75% on a molar basis of cationic lipid, e.g., from about 20 to about 70%, from about 35 to about 65%, from about 45 to about 65%, about 60%, about 57.5%, about 57.1%, about 50% or about 40% on a molar basis (based upon 100% total moles of lipid in the lipid nanoparticle). In some embodiments, the ratio of cationic lipid to nucleic acid is from about 3 to about 15, such as from about 5 to about 13 or from about 7 to about 11.


The amount of the permanently cationic lipid or lipidoid may be selected taking the amount of the nucleic acid cargo into account. In one embodiment, these amounts are selected such as to result in an N/P ratio of the nanoparticle(s) or of the composition in the range from about 0.1 to about 20. In this context, the N/P ratio is defined as the mole ratio of the nitrogen atoms (“N”) of the basic nitrogen-containing groups of the lipid or lipidoid to the phosphate groups (“P”) of the RNA which is used as cargo. The N/P ratio may be calculated on the basis that, for example, 1 μg RNA typically contains about 3 nmol phosphate residues, provided that the RNA exhibits a statistical distribution of bases. The “N”-value of the lipid or lipidoid may be calculated on the basis of its molecular weight and the relative content of permanently cationic and—if present—cationisable groups.


In certain embodiments, the LNP comprises one or more additional lipids which stabilize the formation of particles during their formation.


(ii) Neutral and Non-Cationic Lipids


The non-cationic lipid can be a neutral lipid, an anionic lipid, or an amphipathic lipid. Neutral lipids, when present, can be any of a number of lipid species which exist either in an uncharged or neutral zwitterionic form at physiological pH. Such lipids include, for example, diacylphosphatidylcholine, diacylphosphatidylethanolamine, ceramide, sphingomyelin, dihydrosphingomyelin, cephalin, and cerebrosides. The selection of neutral lipids for use in the particles described herein is generally guided by consideration of, e.g., LNP size and stability of the LNP in the bloodstream. Preferably, the neutral lipid is a lipid having two acyl groups (e.g., diacylphosphatidylcholine and diacylphosphatidylethanolamine).


In some embodiments, the neutral lipids contain saturated fatty acids with carbon chain lengths in the range of C10 to C20. In other embodiments, neutral lipids with mono- or di-unsaturated fatty acids with carbon chain lengths in the range of C10 to C20 are used. Additionally, neutral lipids having mixtures of saturated and unsaturated fatty acid chains can be used.


Suitable neutral lipids include, but are not limited to, distearoylphosphatidylcholine (DSPC), dioleoylphosphatidylcholine (DOPC), dipalmitoylphosphatidylcholine (DPPC), dioleoylphosphatidylglycerol (DOPG), dipalmitoylphosphatidylglycerol (DPPG), dioleoyl-phosphatidylethanolamine (DOPE), palmitoyloleoylphosphatidylcholine (POPC), palmitoyloleoylphosphatidylethanolamine (POPE), dioleoyl-phosphatidylethanolamine 4-(N-maleimidomethyl)-cyclohexane-1-carboxylate (DOPE-mal), dipalmitoyl phosphatidyl ethanolamine (DPPE), dimyristoylphosphoethanolamine (DMPE), dimyristoyl phosphatidylcholine (DMPC), distearoyl-phosphatidyl-ethanolamine (DSPE), SM, 16-O-monomethyl PE, 16-O-dimethyl PE, 18-1-trans PE, 1-stearoyl-2-oleoyl-phosphatidyethanolamine (SOPE), cholesterol, or a mixture thereof. Anionic lipids suitable for use in LNPs include, but are not limited to, phosphatidylglycerol, cardiolipin, diacylphosphatidylserine, diacylphosphatidic acid, N-dodecanoyl phosphatidylethanoloamine, N-succinyl phosphatidylethanolamine, N-glutaryl phosphatidylethanolamine, lysylphosphatidylglycerol, and other anionic modifying groups joined to neutral lipids.


In one embodiment, the neutral lipid is 1,2-distearoyl-sn-glycero-3phosphocholine (DSPC).


In some embodiments, the LNPs comprise a neutral lipid selected from DSPC, DPPC, DMPC, DOPC, POPC, DOPE and SM. In various embodiments, the molar ratio of the cationic lipid to the neutral lipid ranges from about 2:1 to about 8:1.


Amphipathic lipids refer to any suitable material, wherein the hydrophobic portion of the lipid material orients into a hydrophobic phase, while the hydrophilic portion orients toward the aqueous phase. Such compounds include, but are not limited to, phospholipids, aminolipids, and sphingolipids. Representative phospholipids include sphingomyelin, phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, phosphatidylinositol, phosphatidic acid, palmitoyloleoyl phosphatdylcholine, lysophosphatidylcholine, lysophosphatidylethanolamine, dipalmitoylphosphatidylcholine, dioleoylphosphatidylcholine, distearoylphosphatidylcholine, or dilinoleoylphosphatidylcholine. Other phosphorus-lacking compounds, such as sphingolipids, glycosphingolipid families, diacylglycerols, and beta-acyloxyacids, can also be used.


In some embodiments, the non-cationic lipid is present in a ratio of from about 5 mol % to about 90 mol %, about 5 mol % to about 10 mol %, about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, or about 90 mol % of the total lipid present in the LNP.


In some embodiments, LNPs comprise from about 0% to about 15 or 45% on a molar basis of neutral lipid, e.g., from about 3 to about 12% or from about 5 to about 10%. For instance, LNPs may include about 15%, about 10%, about 7.5%, or about 7.1% of neutral lipid on a molar basis (based upon 100% total moles of lipid in the LNP).


(iii) Sterols


The sterol is preferably cholesterol.


The sterol can be present in a ratio of about 10 mol % to about 60 mol % or about 25 mol % to about 40 mol % of the LNP. In some embodiments, the sterol is present in a ratio of about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, or about 60 mol % of the total lipid present in the LNP. In other embodiments, LNPs comprise from about 5% to about 50% on a molar basis of the sterol, e.g., about 15% to about 45%, about 20% to about 40%, about 48%, about 40%, about 38.5%, about 35%, about 34.4%, about 31.5% or about 31% on a molar basis (based upon 100% total moles of lipid in the LNP).


(iv) Aggregation Reducing Agents


The aggregation reducing agent can be a lipid capable of reducing aggregation.


Examples of such lipids include, but are not limited to, polyethylene glycol (PEG)-modified lipids, monosialoganglioside Gml, and polyamide oligomers (PAO) such as those described in U.S. Pat. No. 6,320,017, which is incorporated by reference in its entirety. Other compounds with uncharged, hydrophilic, steric-barrier moieties, which prevent aggregation during formulation, like PEG, Gml or ATTA, can also be coupled to lipids. ATTA-lipids are described, e.g., in U.S. Pat. No. 6,320,017, and PEG-lipid conjugates are described, e.g., in U.S. Pat. Nos. 5,820,873, 5,534,499, 5,885,613, US20150376115A1 and WO2015/199952, each of which is incorporated by reference in its entirety.


The aggregation reducing agent may be, for example, selected from a polyethyleneglycol (PEG)-lipid including, without limitation, a PEG-diacylglycerol (DAG), a PEG-dialkylglycerol, a PEG-dialkyloxypropyl (DAA), a PEG-phospholipid, a PEG-ceramide (Cer), or a mixture thereof (such as PEG-Cerl4 or PEG-Cer20). The PEG-DAA conjugate may be, for example, a PEG-dilauryloxypropyl (C12), a PEG-dimyristyloxypropyl (C14), a PEG-dipalmityloxypropyl (C16), or a PEG-distearyloxypropyl (C18). Other pegylated-lipids include, but are not limited to, polyethylene glycol-didimyristoyl glycerol (C14-PEG or PEG-C14, where PEG has an average molecular weight of 2000 Da) (PEG-DMG); (R)-2,3-bis(octadecyloxy)propyl-1-(methoxy poly(ethylene glycol)2000)propylcarbamate) (PEG-DSG); PEG-carbamoyl-1,2-dimyristyloxypropylamine, in which PEG has an average molecular weight of 2000 Da (PEG-cDMA); N-Acetylgalactosamine-((R)-2,3-bis(octadecyloxy)propyl-I-(methoxy poly(ethylene glycol)2000)propylcarbamate)) (GalNAc-PEG-DSG); mPEG (mw2000)-diastearoylphosphatidyl-ethanolamine (PEG-DSPE); and polyethylene glycol-dipalmitoylglycerol (PEG-DPG).


In some embodiments, the aggregation reducing agent is PEG-DMG. In other embodiments, the aggregation reducing agent is PEG-c-DMA.


In various embodiments, the molar ratio of the cationic lipid to the PEGylated lipid ranges from about 100:1 to about 25:1.


LNP Composition:


The composition of LNPs may be influenced by, inter alia, the selection of the cationic lipid component, the degree of cationic lipid saturation, the nature of the PEGylation, the ratio of all components and biophysical parameters such as its size. In one example by Semple et al. (Semple et al. Nature Biotech. 2010 28: 172-176; herein incorporated by reference in its entirety), the LNP composition was composed of 57.1% cationic lipid, 7.1% dipalmitoylphosphatidylcholine, 34.3% cholesterol, and 1.4% PEG-c-DMA (Basha et al. Mol Ther. 2011 19:2186-2200; herein incorporated by reference in its entirety).


In some embodiments, LNPs may comprise from about 35 to about 45% cationic lipid, from about 40% to about 50% cationic lipid, from about 50% to about 60% cationic lipid and/or from about 55% to about 65% cationic lipid. In some embodiments, the ratio of lipid to mRNA may range from about 5:1 to about 20: 1, from about 10:1 to about 25:1, from about 15:1 to about 30:1 and/or at least 30:1.


The average molecular weight of the PEG moiety in the PEG-modified lipids can range from about 500 to about 8,000 Daltons (e.g., from about 1,000 to about 4,000 Daltons). In one preferred embodiment, the average molecular weight of the PEG moiety is about 2,000 Daltons.


The concentration of the aggregation reducing agent may range from about 0.1 to about 15 mol %, per 100% total moles of lipid in the LNP. In some embodiments, LNPs include less than about 3, 2, or 1 mole percent of PEG or PEG-modified lipid, based on the total moles of lipid in the LNP. In further embodiments, LNPs comprise from about 0.1% to about 20% of the PEG-modified lipid on a molar basis, e.g., about 0.5 to about 10%, about 0.5 to about 5%, about 10%, about 5%, about 3.5%, about 3%, about 2,5%, about 2%, about 1.5%, about 1%, about 0.5%, or about 0.3% on a molar basis (based on 100% total moles of lipids in the LNP).


Different LNPs having varying molar ratios of cationic lipid, non-cationic (or neutral) lipid, sterol (e.g., cholesterol), and aggregation reducing agent (such as a PEG-modified lipid) on a molar basis (based upon the total moles of lipid in the lipid nanoparticles) as depicted in Table 5 below:









TABLE 5







Lipid-based formulations









Molar Ratio of Lipids



(Based upon 100% total moles of lipid in the lipid nanoparticle)











Formulation

Non-Cationic (or

Aggregation Reducing


No.
Cationic Lipid
Neutral) Lipid
Sterol
Agent (e.g., PEG-lipid)





1
from about 35 to
from about 3 to
from about 15
from about 0.1 to about



about 65%
about 12 or 15%
to about 45%
10% (preferably from






about 0.5 to about 2 or






3%)


2
from about 20 to
from about 5 to
from about 20
from about 0.1 to about



about 70%
about 45%
to about 55%
10% (preferably from






about 0.5 to about 2 or






3%)


3
from about 45 to
from about 5 to
from about 25
from about 0.1 to about



about 65%
about 10%
to about 40%
3%


4
from about 20 to
from about 5 to
from about 25
from about 0.1 to about



about 60%
about 25%
to about 55%
5% (preferably from






about 0.1 to about 3%)


5
about 40%
about 10%
about 40%
about 10%


6
about 35%
about 15%
about 40%
about 10%


7
about 52%
about 13%
about 30%
about 5%


8
about 50%
about 10%
about 38.5%
about 1.5%









In some embodiments, LNPs occur as liposeomes or lipoplexes as described in further detail below.


The total amount of nucleic acid, particularly the RNA in the lipid nanoparticles varies and may be defined depending on the e.g. RNA to total lipid w/w ratio. In one embodiment of the invention the RNA to total lipid ratio is less than 0.06 w/w, preferably between 0.03 w/w and 0.04 w/w.


LNP Size:


In some embodiments, LNPs have a median diameter size of from about 50 nm to about 300 nm, such as from about 50 nm to about 250 nm, for example, from about 50 nm to about 200 nm.


In some embodiments, smaller LNPs may be used. Such particles may comprise a diameter from below 0.1 μm up to 100 nm such as, but not limited to, less than 0.1 μm, less than 1.0 μm, less than 5 μm, less than 10 μm, less than 15 μm, less than 20 μm, less than 25 μm, less than 30 μm, less than 35 μm, less than 40 μm, less than 50 μm, less than 55 μm, less than 60 μm, less than 65 μm, less than 70 μm, less than 75 μm, less than 80 μm, less than 85 μm, less than 90 μm, less than 95 μm, less than 100 μm, less than 125 μm, less than 150 μm, less than 175 μm, less than 200 μm, less than 225 μm, less than 250 μm, less than 275 μm, less than 300 μm, less than 325 μm, less than 350 μm, less than 375 μm, less than 400 μm, less than 425 μm, less than 450 μm, less than 475 μm, less than 500 μm, less than 525 μm, less than 550 μm, less than 575 μm, less than 600 μm, less than 625 μm, less than 650 μm, less than 675 μm, less than 700 μm, less than 725 μm, less than 750 μm, less than 775 μm, less than 800 μm, less than 825 μm, less than 850 μm, less than 875 μm, less than 900 μm, less than 925 μm, less than 950 μm, less than 975 μm, In another embodiment, nucleic acids may be delivered using smaller LNPs which may comprise a diameter from about 1 nm to about 100 nm, from about 1 nm to about 10 nm, about 1 nm to about 20 nm, from about 1 nm to about 30 nm, from about 1 nm to about 40 nm, from about 1 nm to about 50 nm, from about 1 nm to about 60 nm, from about 1 nm to about 70 nm, from about 1 nm to about 80 nm, from about 1 nm to about 90 nm, from about 5 nm to about from 100 nm, from about 5 nm to about 10 nm, about 5 nm to about 20 nm, from about 5 nm to about 30 nm, from about 5 nm to about 40 nm, from about 5 nm to about 50 nm, from about 5 nm to about 60 nm, from about 5 nm to about 70 nm, from about 5 nm to about 80 nm, from about 5 nm to about 90 nm, about 10 nm to about 50 nm, from about 20 nm to about 50 nm, from about 30 nm to about 50 nm, from about 40 nm to about 50 nm, from about 20 nm to about 60 nm, from about 30 nm to about 60 nm, from about 40 nm to about 60 nm, from about 20 nm to about 70 nm, from about 30 nm to about 70 nm, from about 40 nm to about 70 nm, from about 50 nm to about 70 nm, from about 60 nm to about 70 nm, from about 20 nm to about 80 nm, from about 30 nm to about 80 nm, from about 40 nm to about 80 nm, from about 50 nm to about 80 nm, from about 60 nm to about 80 nm, from about 20 nm to about 90 nm, from about 30 nm to about 90 nm, from about 40 nm to about 90 nm, from about 50 nm to about 90 nm, from about 60 nm to about 90 nm and/or from about 70 nm to about 90 nm.


In some embodiments, the LNP may have a diameter greater than 100 nm, greater than 150 nm, greater than 200 nm, greater than 250 nm, greater than 300 nm, greater than 350 nm, greater than 400 nm, greater than 450 nm, greater than 500 nm, greater than 550 nm, greater than 600 nm, greater than 650 nm, greater than 700 nm, greater than 750 nm, greater than 800 nm, greater than 850 nm, greater than 900 nm, greater than 950 nm or greater than 1000 nm.


In other embodiments, LNPs have a single mode particle size distribution (i.e., they are not bi- or poly-modal).


Other Components:


LNPs may further comprise one or more lipids and/or other components in addition to those mentioned above.


Other lipids may be included in the liposome compositions for a variety of purposes, such as to prevent lipid oxidation or to attach ligands onto the liposome surface. Any of a number of lipids may be present in LNPs, including amphipathic, neutral, cationic, and anionic lipids. Such lipids can be used alone or in combination.


Additional components that may be present in a LNP include bilayer stabilizing components such as polyamide oligomers (see, e.g., U.S. Pat. No. 6,320,017, which is incorporated by reference in its entirety), peptides, proteins, and detergents.


Liposomes:


In some embodiments, the inventive mRNAs, optionally comprised by (pharmaceutical) compositions or vaccines are formulated as liposomes.


Cationic lipid-based liposomes are able to complex with negatively charged nucleic acids (e.g. mRNAs) via electrostatic interactions, resulting in complexes that offer biocompatibility, low toxicity, and the possibility of the large-scale production required for in vivo clinical applications. Liposomes can fuse with the plasma membrane for uptake; once inside the cell, the liposomes are processed via the endocytic pathway and the nucleic acid is then released from the endosome/carrier into the cytoplasm. Liposomes have long been perceived as drug delivery vehicles because of their superior biocompatibility, given that liposomes are basically analogues of biological membranes, and can be prepared from both natural and synthetic phospholipids (Int J Nanomedicine. 2014; 9: 1833-1843).


Liposomes typically consist of a lipid bilayer that can be composed of cationic, anionic, or neutral (phospho)lipids and cholesterol, which encloses an aqueous core. Both the lipid bilayer and the aqueous space can incorporate hydrophobic or hydrophilic compounds, respectively. Liposomes may have one or more lipid membranes. Liposomes can be single-layered, referred to as unilamellar, or multi-layered, referred to as multilamellar.


Liposome characteristics and behaviour in vivo can be modified by addition of a hydrophilic polymer coating, e.g. polyethylene glycol (PEG), to the liposome surface to confer steric stabilization. Furthermore, liposomes can be used for specific targeting by attaching ligands (e.g., antibodies, peptides, and carbohydrates) to its surface or to the terminal end of the attached PEG chains (Front Pharmacol. 2015 Dec. 1; 6:286).


Liposomes are typically present as spherical vesicles and can range in size from 20 nm to a few microns.


Liposomes can be of different sizes such as, but not limited to, a multilamellar vesicle (MLV) which may be hundreds of nanometers in diameter and may contain a series of concentric bilayers separated by narrow aqueous compartments, a small unicellular vesicle (SUV) which may be smaller than 50 nm in diameter, and a large unilamellar vesicle (LUV) which may be between 50 and 500 nm in diameter. Liposome design may include, but is not limited to, opsonins or ligands in order to improve the attachment of liposomes to unhealthy tissue or to activate events such as, but not limited to, endocytosis. Liposomes may contain a low or a high pH in order to improve the delivery of the pharmaceutical formulations.


As a non-limiting example, liposomes such as synthetic membrane vesicles may be prepared by the methods, apparatus and devices described in US Patent Publication No. US20130177638, US20130177637, US20130177636, US20130177635, US20130177634, US20130177633, US20130183375, US20130183373 and US20130183372, the contents of each of which are herein incorporated by reference in its entirety. The inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, may be encapsulated by the liposome and/or it may be contained in an aqueous core which may then be encapsulated by the liposome (see International Pub. Nos. WO2012/031046, WO2012/031043, WO2012/030901 and WO2012/006378 and US Patent Publication No. US20130189351, 0520130195969 and 0520130202684; the contents of each of which are herein incorporated by reference in their entirety).


In some embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, may be formulated in liposomes such as, but not limited to, DiLa2 liposomes (Marina Biotech, Bothell, Wash.), SMARTICLES® (Marina Biotech, Bothell, Wash.), neutral DOPC (1,2-dioleoyl-sn-glycero-3-phosphocholine) based liposomes (e.g., siRNA delivery for ovarian cancer (Landen et al. Cancer Biology & Therapy 2006 5(12)1708-1713); herein incorporated by reference in its entirety) and hyaluronan-coated liposomes (Quiet Therapeutics, Israel).


Lipoplexes:


In some embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is formulated in the form of lipoplexes, i.e. cationic lipid bilayers sandwiched between nucleic acid (e.g. mRNA) layers.


Cationic lipids, such as DOTAP, (1,2-dioleoyl-3-trimethylammonium-propane) and DOTMA (N-[1-(2,3-dioleoyloxy)propyl]-N,N,N-trimethyl-ammonium methyl sulfate) can form complexes or lipoplexes with negatively charged nucleic acids to form nanoparticles by electrostatic interaction, providing high in vitro transfection efficiency.


Nanoliposomes


In some embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, is formulated is the form of neutral lipid-based nanoliposomes such as 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC)-based nanoliposomes (Adv Drug Deliv Rev. 2014 February; 66: 110-116).


Emulsions:


In some embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, is provided in the form of an emulsion. In some embodiment, said mRNA is formulated in a cationic oil-in-water emulsion, wherein the emulsion particle comprises an oil core and a cationic lipid which can interact with said mRNA, anchoring the molecule to the emulsion particle (see International Pub. No. WO2012/006380; herein incorporated by reference in its entirety). In some embodiments, said mRNA is formulated in a water-in-oil emulsion comprising a continuous hydrophobic phase in which the hydrophilic phase is dispersed. As a non-limiting example, the emulsion may be made by the methods described in International Publication No. WO2010/87791, the contents of which are herein incorporated by reference in its entirety.


(Poly)Cationic Compounds


According to preferred embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is complexed or associated with a cationic or polycationic compound (“(poly-)cationic compound”) and/or a polymeric carrier.


The term “(poly-)cationic compound” typically refers to a charged molecule, which is positively charged (cation) at a pH value typically from 1 to 9, preferably at a pH value of or below 9 (e.g. from 5 to 9), of or below 8 (e.g. from 5 to 8), of or below 7 (e.g. from 5 to 7), most preferably at a physiological pH, e.g. from 7.3 to 7.4.


Accordingly, a “(poly-)cationic compound” may be any positively charged compound or polymer, preferably a cationic peptide or protein, which is positively charged under physiological conditions, particularly under physiological conditions in vivo. A “(poly-)cationic peptide or protein” may contain at least one positively charged amino acid, or more than one positively charged amino acid, e.g. selected from Arg, His, Lys or Orn. (Poly-)cationic amino acids, peptides and proteins


(Poly-)cationic compounds being particularly preferred agents for complexation or association with the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine include protamine, nucleoline, spermine or spermidine, or other cationic peptides or proteins, such as poly-L-lysine (PLL), poly-arginine, basic polypeptides, cell penetrating peptides (CPPs), including HIV-binding peptides, HIV-1 Tat (HIV), Tat-derived peptides, Penetratin, VP22 derived or analogous peptides, HSV VP22 (Herpes simplex), MAP, KALA or protein transduction domains (PTDs), PpT620, prolin-rich peptides, arginine-rich peptides, lysine-rich peptides, MPG-peptide(s), Pep-1, L-oligomers, Calcitonin peptide(s), Antennapedia-derived peptides (particularly from Drosophila antennapedia), pAntp, pIsl, FGF, Lactoferrin, Transportan, Buforin-2, Bac715-24, SynB, SynB(1), pVEC, hCT-derived peptides, SAP, or histones.


More preferably, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, is complexed with one or more polycations, preferably with protamine or oligofectamine (discussed below), most preferably with protamine. In this context protamine is particularly preferred.


Additionally, preferred (poly-)cationic proteins or peptides may be selected from the following proteins or peptides having the following total formula (III):





(Arg)l;(Lys)m;(His)n;(Orn)o;(Xaa)x,  (formula (III))


wherein l+m+n+o+x=8-15, and l, m, n or o independently of each other may be any number selected from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15, provided that the overall content of Arg, Lys, His and Orn represents at least 50% of all amino acids of the oligopeptide; and Xaa may be any amino acid selected from native (=naturally occurring) or non-native amino acids except of Arg, Lys, His or Orn; and x may be any number selected from 0, 1, 2, 3 or 4, provided, that the overall content of Xaa does not exceed 50% of all amino acids of the oligopeptide. Particularly preferred cationic peptides in this context are e.g. Arg7, Arg8, Arg9, H3R9, R9H3, H3R9H3, YSSR9SSY, (RKH)4, Y(RKH)2R, etc. In this context the disclosure of WO2009/030481 is incorporated herewith by reference.


Preferred cationic or polycationic proteins or peptides may be derived from formula Cys{(Arg)l;(Lys)m;(His)n;(Orn)o;(Xaa)x}Cys or {(Arg)I;(Lys)m;(His)n;(Orn)o;(Xaa)x} of the patent application WO2009/030481 or WO2011/026641, the disclosure of WO2009/030481 and WO2011/026641 relating thereto are incorporated herewith by reference. In a preferred embodiment, the cationic or polycationic proteins or peptides comprises CHHHHHHRRRRHHHHHHC (SEQ ID NO: 3450), CR12C (SEQ ID NO: 3447), CR12 (SEQ ID NO: 3448) or WR12C (SEQ ID NO: 3449).


(Poly-)Cationic Polysaccharides:


Further preferred (poly-)cationic compounds for complexation or association with the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, include (poly-)cationic polysaccharides, e.g. chitosan, polybrene, cationic polymers, e.g. polyethyleneimine (PEI).


(Poly-)Cationic Lipids:


Further preferred (poly-)cationic compounds for complexation or association with the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, include (poly-)cationic lipids, e.g. DOTMA: [1-(2,3-sioleyloxy)propyl)]-N,N,N-trimethylammonium chloride, DMRIE, di-C14-amidine, DOTIM, SAINT, DC-Chol, BGTC, CTAP, DOPC, DODAP, DOPE: Dioleyl phosphatidylethanol-amine, DOSPA, DODAB, DOIC, DMEPC, DOGS: Dioctadecylamidoglicylspermin, DIMRI: Dimyristo-oxypropyl dimethyl hydroxyethyl ammonium bromide, DOTAP: dioleoyloxy-3-(trimethylammonio)propane, DC-6-14: 0,0-ditetradecanoyl-N-(α-trimethylammonioacetyl)diethanolamine chloride, CLIP1: rac-[(2,3-dioctadecyloxypropyl)(2-hydroxyethyl)]-dimethylammonium chloride, CLIP6: rac-[2(2,3-dihexadecyloxypropyl-oxymethyloxy)ethyl]trimethylammonium, CLIP9: rac-[2(2,3-dihexadecyloxypropyl-oxysuccinyloxy)ethyl]-trimethylammonium, oligofectamine, or (poly-)cationic polymers, e.g. modified polyaminoacids, such as beta-aminoacid-polymers or reversed polyamides, etc., modified polyethylenes, such as PVP (poly(N-ethyl-4-vinylpyridinium bromide)), etc., modified acrylates, such as pDMAEMA (poly(dimethylaminoethyl methylacrylate)), etc., modified amidoamines such as pAMAM (poly(amidoamine)), etc., modified polybetaaminoester (PBAE), such as diamine end-modified 1,4 butanediol diacrylate-co-5-amino-1-pentanol polymers, etc., dendrimers, such as polypropylamine dendrimers or pAMAM based dendrimers, etc., polyimine(s), such as PEI: poly(ethyleneimine), poly(propyleneimine), etc., polyallylamine, sugar backbone based polymers, such as cyclodextrin based polymers, dextran based polymers, chitosan, etc., silan backbone based polymers, such as PMOXA-PDMS copolymers, etc., blockpolymers consisting of a combination of one or more cationic blocks (e.g. selected from a cationic polymer as mentioned above) and of one or more hydrophilic or hydrophobic blocks (e.g. polyethyleneglycole); etc.


Polymeric Carriers:


According to further preferred embodiments, inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, is complexed or associated with a polymeric carrier.


A “polymeric carrier” used according to the invention might be a polymeric carrier formed by disulfide-crosslinked cationic components. The disulfide-crosslinked cationic components may be the same or different from each other. The polymeric carrier can also contain further components.


It is also particularly preferred that the polymeric carrier used according to the present invention comprises mixtures of cationic peptides, proteins or polymers and optionally further components as defined herein, which are crosslinked by disulfide bonds as described herein. In this context, the disclosure of WO2012/013326 is incorporated herewith by reference.


In this context, the cationic components, which form basis for the polymeric carrier by disulfide-crosslinkage, are typically selected from any suitable (poly-)cationic peptide, protein or polymer suitable for this purpose, particular any (poly-)cationic peptide, protein or polymer capable of complexing, and thereby preferably condensing, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein. The (poly-)cationic peptide, protein or polymer, is preferably a linear molecule, however, branched (poly-)cationic peptides, proteins or polymers may also be used.


Every disulfide-crosslinking (poly-)cationic protein, peptide or polymer of the polymeric carrier, which may be used to complex the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, contains at least one—SH moiety, most preferably at least one cysteine residue or any further chemical group exhibiting an —SH moiety, capable of forming a disulfide linkage upon condensation with at least one further (poly-)cationic protein, peptide or polymer as cationic component of the polymeric carrier as mentioned herein.


As defined above, the polymeric carrier, which may be used to complex the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, may be formed by disulfide-crosslinked cationic (or polycationic) components. Preferably, such (poly-)cationic peptides or proteins or polymers of the polymeric carrier, which comprise or are additionally modified to comprise at least one—SH moiety, are selected from, proteins, peptides and polymers as defined herein.


In some embodiments, the polymeric carrier may be selected from a polymeric carrier molecule according to generic formula (IV):





L-P1—S—[S—P2—S]n—S—P3-L  formula (IV)


wherein,


P1 and P3 are different or identical to each other and represent a linear or branched hydrophilic polymer chain, each P1 and P3 exhibiting at least one—SH-moiety, capable to form a disulfide linkage upon condensation with component P2, or alternatively with (AA), (AA)x, or [(AA)x]z if such components are used as a linker between P1 and P2 or P3 and P2) and/or with further components (e.g. (AA), (AA)x, [(AA)x]z or L), the linear or branched hydrophilic polymer chain selected independent from each other from polyethylene glycol (PEG), poly-N-(2-hydroxypropyl)methacrylamide, poly-2-(methacryloyloxy)ethyl phosphorylcholines, poly(hydroxyalkyl L-asparagine), poly(2-(methacryloyloxy)ethyl phosphorylcholine), hydroxyethylstarch or poly(hydroxyalkyl L-glutamine), wherein the hydrophilic polymer chain exhibits a molecular weight of about 1 kDa to about 100 kDa, preferably of about 2 kDa to about 25 kDa; or more preferably of about 2 kDa to about 10 kDa, e.g. about 5 kDa to about 25 kDa or 5 kDa to about 10 kDa;


P2 is a (poly-)cationic peptide or protein, e.g. as defined above for the polymeric carrier formed by disulfide-crosslinked cationic components, and preferably having a length of about 3 to about 100 amino acids, more preferably having a length of about 3 to about 50 amino acids, even more preferably having a length of about 3 to about 25 amino acids, e.g. a length of about 3 to 10, 5 to 15, 10 to 20 or 15 to 25 amino acids, more preferably a length of about 5 to about 20 and even more preferably a length of about 10 to about 20; or


is a (poly-)cationic polymer, e.g. as defined above for the polymeric carrier formed by disulfide-crosslinked cationic components, typically having a molecular weight of about 0.5 kDa to about 30 kDa, including a molecular weight of about 1 kDa to about 20 kDa, even more preferably of about 1.5 kDa to about 10 kDa, or having a molecular weight of about 0.5 kDa to about 100 kDa, including a molecular weight of about 10 kDa to about 50 kDa, even more preferably of about 10 kDa to about 30 kDa;


each P2 exhibiting at least two—SH-moieties, capable to form a disulfide linkage upon condensation with further components P2 or component(s) P1 and/or P3 or alternatively with further components (e.g. (AA), (AA)x, or [(AA)x]z);


—S—S— is a (reversible) disulfide bond (the brackets are omitted for better readability), wherein S preferably represents sulphur or a —SH carrying moiety, which has formed a (reversible) disulfide bond. The (reversible) disulfide bond is preferably formed by condensation of —SH-moieties of either components P1 and P2, P2 and P2, or P2 and P3, or optionally of further components as defined herein (e.g. L, (AA), (AA)x, [(AA)x]z, etc); The —SH-moiety may be part of the structure of these components or added by a modification as defined below;


L is an optional ligand, which may be present or not, and may be selected independent from the other from RGD, Transferrin, Folate, a signal peptide or signal sequence, a localization signal or sequence, a nuclear localization signal or sequence (NLS), an antibody, a cell penetrating peptide, (e.g. TAT or KALA), a ligand of a receptor (e.g. cytokines, hormones, growth factors etc), small molecules (e.g. carbohydrates like mannose or galactose or synthetic ligands), small molecule agonists, inhibitors or antagonists of receptors (e.g. RGD peptidomimetic analogues), or any further protein as defined herein, etc.;


n is an integer, typically selected from a range of about 1 to 50, preferably from a range of about 1, 2 or 3 to 30, more preferably from a range of about 1, 2, 3, 4, or 5 to 25, or a range of about 1, 2, 3, 4, or 5 to 20, or a range of about 1, 2, 3, 4, or 5 to 15, or a range of about 1, 2, 3, 4, or 5 to 10, including e.g. a range of about 4 to 9, 4 to 10, 3 to 20, 4 to 20, 5 to 20, or 10 to 20, or a range of about 3 to 15, 4 to 15, 5 to 15, or 10 to 15, or a range of about 6 to 11 or 7 to 10. Most preferably, n is in a range of about 1, 2, 3, 4, or 5 to 10, more preferably in a range of about 1, 2, 3, or 4 to 9, in a range of about 1, 2, 3, or 4 to 8, or in a range of about 1, 2, or 3 to 7.


In this context, the disclosure of WO2011/026641 is incorporated herewith by reference. Each of hydrophilic polymers P1 and P3 typically exhibits at least one—SH-moiety, wherein the at least one—SH-moiety is capable to form a disulfide linkage upon reaction with component P2 or with component (AA) or (AA)x, if used as linker between P1 and P2 or P3 and P2 as defined below and optionally with a further component, e.g. L and/or (AA) or (AA)x, e.g. if two or more—SH-moieties are contained. The following subformulae “P1—S—S—P2” and “P2—S—S-P3” within generic formula (IV) above (the brackets are omitted for better readability), wherein any of S, P1 and P3 are as defined herein, typically represent a situation, wherein one-SH-moiety of hydrophilic polymers P1 and P3 was condensed with one—SH-moiety of component P2 of generic formula (IV) above, wherein both sulphurs of these —SH-moieties form a disulfide bond —S—S— as defined herein in formula (IV). These —SH-moieties are typically provided by each of the hydrophilic polymers P1 and P3, e.g. via an internal cysteine or any further (modified) amino acid or compound which carries a —SH moiety. Accordingly, the subformulae “P1—S—S—P2” and “P2—S—S—P3” may also be written as “P1-Cys-Cys-P2” and “P2-Cys-Cys-P3”, if the —SH— moiety is provided by a cysteine, wherein the term “Cys-Cys” represents two cysteines coupled via a disulfide bond, not via a peptide bond. In this case, the term “—S—S—” in these formulae may also be written as “—S-Cys”, as “-Cys-S” or as “-Cys-Cys-”. In this context, the term “-Cys-Cys-” does not represent a peptide bond but a linkage of two cysteines via their —SH-moieties to form a disulfide bond. Accordingly, the term “-Cys-Cys-” also may be understood generally as “-(Cys-S)—(S-Cys)-”, wherein in this specific case S indicates the sulphur of the —SH-moiety of cysteine. Likewise, the terms “—S-Cys” and “—Cys-S” indicate a disulfide bond between a —SH containing moiety and a cysteine, which may also be written as “—S—(S-Cys)” and “-(Cys-S)—S”. Alternatively, the hydrophilic polymers P1 and P3 may be modified with a —SH moiety, preferably via a chemical reaction with a compound carrying a —SH moiety, such that each of the hydrophilic polymers P1 and P3 carries at least one such —SH moiety. Such a compound carrying a —SH moiety may be e.g. an (additional) cysteine or any further (modified) amino acid, which carries a —SH moiety. Such a compound may also be any non-amino compound or moiety, which contains or allows to introduce a —SH moiety into hydrophilic polymers P1 and P3 as defined herein. Such non-amino compounds may be attached to the hydrophilic polymers P1 and P3 of formula (IV) of the polymeric carrier according to the present invention via chemical reactions or binding of compounds, e.g. by binding of a 3-thio propionic acid or thioimolane, by amide formation (e.g. carboxylic acids, sulphonic acids, amines, etc), by Michael addition (e.g maleinimide moieties, α,β-unsatured carbonyls, etc), by click chemistry (e.g. azides or alkines), by alkene/alkine methatesis (e.g. alkenes or alkines), imine or hydrozone formation (aldehydes or ketons, hydrazins, hydroxylamins, amines), complexation reactions (avidin, biotin, protein G) or components which allow Sn-type substitution reactions (e.g halogenalkans, thiols, alcohols, amines, hydrazines, hydrazides, sulphonic acid esters, oxyphosphonium salts) or other chemical moieties which can be utilized in the attachment of further components. A particularly preferred PEG derivate in this context is alpha-Methoxy-omega-mercapto poly(ethylene glycol). In each case, the SH-moiety, e.g. of a cysteine or of any further (modified) amino acid or compound, may be present at the terminal ends or internally at any position of hydrophilic polymers P1 and P3. As defined herein, each of hydrophilic polymers P1 and P3 typically exhibits at least one—SH-moiety preferably at one terminal end, but may also contain two or even more—SH-moieties, which may be used to additionally attach further components as defined herein, preferably further functional peptides or proteins e.g. a ligand, an amino acid component (AA) or (AA)x, antibodies, cell penetrating peptides or enhancer peptides (e.g. TAT, KALA), etc.


In a particularly preferred embodiment, the polymeric carrier is a peptide polymer, preferably a polyethylene glycol/peptide polymer comprising HO-PEG5000-S—(S—CHHHHHHRRRRHHHHHHC—S-)7-S-PEG5000-OH (peptide component: SEQ ID NO: 3450) and a lipid component, preferably a lipidoid component, more preferably lipidoid 3-C12-OH.


The Lipidoid 3-C12-OH




embedded image


(as shown above) may be obtained by acylation of tris(2-aminoethyl)amine with an activated lauric (C12) acid derivative, followed by reduction of the amide. Alternatively, it may be prepared by reductive amination with the corresponding aldehyde. Lipidoid 3-C12-OH is prepared by addition of the terminal C12 alkyl epoxide with the same oligoamine according to Love et al., pp. 1864-1869, PNAS, vol. 107 (2010), no. 5 (cf. compound C12 and compound 110 in FIG. 1 of Love et al.). In preferred embodiments, the peptide polymer comprising lipidoid 3-C12-OH as specified above is used to complex the artificial nucleic acid of the invention, in particular RNA, to form complexes having an N/P ratio from about 0.1 to about 20, or from about 0.2 to about 15, or from about 2 to about 15, or from about 2 to about 12, wherein the N/P ratio is defined as the mole ratio of the nitrogen atoms of the basic groups of the cationic peptide or polymer to the phosphate groups of the artificial nucleic acid.


In another embodiment, the polymeric carrier comprises a lipidoid compound according to formula Va




embedded image


wherein

    • RA is independently selected for each occurrence an unsubstituted, cyclic or acyclic, branched or unbranched C1-20 aliphatic group; a substituted or unsubstituted, cyclic or acyclic, branched or unbranched C1-20 heteroaliphatic group; a substituted or unsubstituted aryl; a substituted or unsubstituted heteroaryl;




embedded image


wherein at least one RA is




embedded image




    • R5 is independently selected for each occurrence of from an unsubstituted, cyclic or acyclic, branched or unbranched C8-16 aliphatic; a substituted or unsubstituted aryl; or a substituted or unsubstituted heteroaryl;

    • each occurrence of x is an integer from 1 to 10;

    • each occurrence of y is an integer from 1 to 10;


      or a pharmaceutically acceptable salt thereof.





In that context, the disclosure of the PCT patent application PCT/EP2017/064059 is herewith incorporated by reference.


In other embodiments, the composition, which is preferably a (pharmaceutical) composition comprises at least one artificial nucleic acid as described herein, wherein the at least one artificial nucleic acid is complexed or associated with polymeric carriers and, optionally, with at least one lipid component as described in the PCT applications PCT/EP2017/064065, PCT/EP2017/064058. In this context, the disclosures of PCT/EP2017/064065, and PCT/EP2017/064058 is herewith incorporated by reference.


Weight Ratio and N/P Ratio


In preferred embodiments of the invention, the inventive mRNA is associated with or complexed with a (poly-) cationic compound or a polymeric carrier, optionally in a weight ratio selected from a range of about 6:1 (w/w) to about 0.25:1 (w/w), more preferably from about 5:1 (w/w) to about 0.5:1 (w/w), even more preferably of about 4:1 (w/w) to about 1:1 (w/w) or of about 3:1 (w/w) to about 1:1 (w/w), and most preferably a ratio of about 3:1 (w/w) to about 2:1 (w/w) of said RNA to (poly-)cationic compound and/or polymeric carrier; or optionally in a nitrogen/phosphate (N/P) ratio of said RNA to (poly-)cationic compound and/or polymeric carrier in the range of about 0.1-10, preferably in a range of about 0.3-4 or 0.3-1, and most preferably in a range of about 0.5-1 or 0.7-1, and even most preferably in a range of about 0.3-0.9 or 0.5-0.9. More preferably, the N/P ratio of the at least one said RNA to the one or more polycations is in the range of about 0.1 to 10, including a range of about 0.3 to 4, of about 0.5 to 2, of about 0.7 to 2 and of about 0.7 to 1.5.


The mRNA, optionally comprised by the (pharmaceutical) composition or vaccine, can also be associated with a vehicle, transfection or complexation agent for increasing the transfection efficiency and/or the immunostimulatory properties of said mRNA.


The (pharmaceutical) composition or vaccine may comprise at least one mRNA which is complexed with one or more (poly-)cationic compounds and/or polymeric carriers as defined above, and at least one “free” mRNA, wherein the at least one complexed RNA is preferably identical to the at least one “free” RNA.


In this context, it is particularly preferred that the inventive (pharmaceutical) composition or vaccine comprises the mRNA that is complexed at least partially with a (poly-)cationic compound and/or a polymeric carrier, preferably cationic proteins or peptides. In this context, the disclosure of WO2010/037539 and WO2012/113513 is incorporated herewith by reference. “Partially” means that only a part of said mRNA is complexed with a (poly-)cationic compound and/or polymeric carrier, while the rest of said mRNA is present in uncomplexed form (“free”).


Preferably, the molar ratio of the complexed mRNA to the free mRNA is selected from a molar ratio of about 0.001:1 to about 1:0.001, including a ratio of about 1:1. More preferably the ratio of complexed mRNA to free mRNA is selected from a range of about 5:1 (w/w) to about 1:10 (w/w), more preferably from a range of about 4:1 (w/w) to about 1:8 (w/w), even more preferably from a range of about 3:1 (w/w) to about 1:5 (w/w) or 1:3 (w/w), and most preferably the ratio of complexed mRNA to free mRNA is selected from a ratio of about 1:1 (w/w).


The complexed mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, is preferably prepared according to a first step by complexing said mRNA with a (poly-)cationic compound and/or with a polymeric carrier, preferably as defined herein, in a specific ratio to form a stable complex. In this context, it is highly preferable, that no free (poly-)cationic compound or polymeric carrier or only a negligibly small amount thereof remains in the fraction of the complexed mRNA after complexing said mRNA. Accordingly, the ratio of the mRNA and the (poly-)cationic compound and/or the polymeric carrier in the fraction of the complexed RNA is typically selected in a range so that the mRNA is entirely complexed and no free (poly-)cationic compound or polymeric carrier or only a negligibly small amount thereof remains in said fraction.


Preferably, the ratio of said mRNA as defined herein to the (poly-)cationic compound and/or the polymeric carrier, preferably as defined herein, is selected from a range of about 6:1 (w/w) to about 0.25:1 (w/w), more preferably from about 5:1 (w/w) to about 0.5:1 (w/w), even more preferably of about 4:1 (w/w) to about 1:1 (w/w) or of about 3:1 (w/w) to about 1:1 (w/w), and most preferably a ratio of about 3:1 (w/w) to about 2:1 (w/w).


Alternatively, the ratio of said mRNA to the (poly-)cationic compound and/or the polymeric carrier may also be calculated on the basis of the nitrogen/phosphate ratio (N/P-ratio) of the entire complex. In the context of the present invention, an N/P-ratio is preferably in the range of about 0.1-10, preferably in a range of about 0.3-4 and most preferably in a range of about 0.5-2 or 0.7-2 regarding the ratio of mRNA: (poly-)cationic compound and/or polymeric carrier, preferably as defined herein, in the complex, and most preferably in a range of about 0.7-1.5, 0.5-1 or 0.7-1, and even most preferably in a range of about 0.3-0.9 or 0.5-0.9, preferably provided that the (poly-)cationic compound in the complex is a (poly-)cationic protein or peptide and/or the polymeric carrier as defined above.


In other embodiments, the inventive mRNA, optionally comprised by the (pharmaceutical) composition or vaccine as defined herein, may be used in free or naked form without being associated with any further vehicle, transfection or complexation agent.


It has to be understood and recognized, that according to the present invention, the (pharmaceutical) composition or vaccine as defined herein may comprise at least one free mRNA as defined herein, and/or at least one complexed mRNA as defined herein, wherein every agent disclosed herein may be used for complexation.


Adjuvants


According to further embodiments, the (pharmaceutical) composition or vaccine of the invention comprises an adjuvant, which is preferably added in order to enhance the immunostimulatory properties of said (pharmaceutical) composition or vaccine.


An adjuvant or an adjuvant component in the broadest sense is typically a pharmacological and/or immunological agent that may modify, e.g. enhance, the effect of other agents, e.g. therapeutic agents or vaccines. In this context, an adjuvant may be understood as any compound, which is suitable to support administration and delivery of the composition according to the invention.


Furthermore, such an adjuvant may, without being bound thereto, initiate or increase an immune response of the innate immune system, i.e. a non-specific immune response. “Adjuvants” typically do not elicit an adaptive immune response. Insofar, “adjuvants” do not qualify as antigens. In other words, when administered, the inventive (pharmaceutical) composition or vaccine typically initiates an adaptive immune response due to an antigenic peptide or protein, which is encoded by the at least one coding sequence of the mRNA contained in said (pharmaceutical) composition or vaccine. Additionally, an adjuvant present in the (pharmaceutical) composition or vaccine according to the invention may generate an (supportive) innate immune response.


Such an adjuvant may be selected from any adjuvant known to a skilled person and suitable for the present case, i.e. supporting the induction of an immune response in a mammal. Preferably, the adjuvant may be selected from the group consisting of, without being limited thereto, TDM, MDP, muramyl dipeptide, pluronics, alum solution, aluminium hydroxide, ADJUMER™ (polyphosphazene); aluminium phosphate gel; glucans from algae; algammulin; aluminium hydroxide gel (alum); highly protein-adsorbing aluminium hydroxide gel; low viscosity aluminium hydroxide gel; AF or SPT (emulsion of squalane (5%), Tween 80 (0.2%), Pluronic L121 (1.25%), phosphate-buffered saline, pH 7.4); AVRIDINE™ (propanediamine); BAY R1005™ ((N-(2-deoxy-2-L-leucylamino-b-D-glucopyranosyl)-N-octadecyl-dodecanoyl-amide hydroacetate); CALCITRIOL™ (1-alpha,25-dihydroxy-vitamin D3); calcium phosphate gel; CAP™ (calcium phosphate nanoparticles); cholera holotoxin, cholera-toxin-A1-protein-A-D-fragment fusion protein, sub-unit B of the cholera toxin; CRL 1005 (block copolymer P1205); cytokine-containing liposomes; DDA (dimethyldioctadecylammonium bromide); DHEA (dehydroepiandrosterone); DMPC (dimyristoylphosphatidylcholine); DMPG (dimyristoylphosphatidylglycerol); DOC/alum complex (deoxycholic acid sodium salt); Freund's complete adjuvant; Freund's incomplete adjuvant; gamma inulin; Gerbu adjuvant (mixture of: i)N-acetylglucosaminyl-(P1-4)-N-acetylmuramyl-L-alanyl-D-glutamine (GMDP), ii) dimethyldioctadecylammonium chloride (DDA), iii) zinc-L-proline salt complex (ZnPro-8); GM-CSF); GMDP (N-acetylglucosaminyl-(b1-4)-N-acetylmuramyl-L-alanyl-D-isoglutamine); imiquimod (1-(2-methypropyl)-1H-imidazo[4,5-c]quinoline-4-amine); ImmTher™ (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-glycerol dipalmitate); DRVs (immunoliposomes prepared from dehydration-rehydration vesicles); interferon-gamma; interleukin-ibeta; interleukin-2; interleukin-7; interleukin-12; ISCOMS™; ISCOPREP 7.0.3.™; liposomes; LOXORIBINE™ (7-allyl-8-oxoguanosine); LT oral adjuvant (E. coli labile enterotoxin-protoxin); microspheres and microparticles of any composition; MF59™; (squalene-water emulsion); MONTANIDE ISA 51™ (purified incomplete Freund's adjuvant); MONTANIDE ISA 720™ (metabolisable oil adjuvant); MPL™ (3-Q-desacyl-4′-monophosphoryl lipid A); MTP-PE and MTP-PE liposomes ((N-acetyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1,2-dipalmitoyl-sn-glycero-3-(hydroxyphosphoryloxy))-ethylamide, monosodium salt); MURAMETIDE™ (Nac-Mur-L-Ala-D-Gln-OCH3); MURAPALMITINE™ and D-MURAPALMITINE™ (Nac-Mur-L-Thr-D-isoGln-sn-glyceroldipalmitoyl); NAGO (neuraminidase-galactose oxidase); nanospheres or nanoparticles of any composition; NISVs (non-ionic surfactant vesicles); PLEURAN™ (β-glucan); PLGA, PGA and PLA (homo- and co-polymers of lactic acid and glycolic acid; microspheres/nanospheres); PLURONIC L121™; PMMA (polymethyl methacrylate); PODDS™ (proteinoid microspheres); polyethylene carbamate derivatives; poly-rA: poly-rU (polyadenylic acid-polyuridylic acid complex); polysorbate 80 (Tween 80); protein cochleates (Avanti Polar Lipids, Inc., Alabaster, Ala.); STIMULON™ (QS-21); Quil-A (Quil-A saponin); S-28463 (4-amino-otec-dimethyl-2-ethoxymethyl-1H-imidazo[4,5 c]quinoline-1-ethanol); SAF-1™ (“Syntex adjuvant formulation”); Sendai proteoliposomes and Sendai-containing lipid matrices; Span-85 (sorbitan trioleate); Specol (emulsion of Marcol 52, Span 85 and Tween 85); squalene or Robane® (2,6,10,15,19,23-hexamethyltetracosan and 2,6,10,15,19,23-hexamethyl-2,6,10,14,18,22-tetracosahexane); stearyltyrosine (octadecyltyrosine hydrochloride); Theramid® (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-dipalmitoxypropylamide); Theronyl-MDP (Termurtide™ or [thr 1]-MDP; N-acetylmuramyl-L-threonyl-D-isoglutamine); Ty particles (Ty-VLPs or virus-like particles); Walter-Reed liposomes (liposomes containing lipid A adsorbed on aluminium hydroxide), and lipopeptides, including Pam3Cys, in particular aluminium salts, such as Adju-phos, Alhydrogel, Rehydragel; emulsions, including CFA, SAF, IFA, MF59, Provax, TiterMax, Montanide, Vaxfectin; copolymers, including Optivax (CRL1005), L121, Poloaxmer4010), etc.; liposomes, including Stealth, cochleates, including BIORAL; plant derived adjuvants, including QS21, Quil A, Iscomatrix, ISCOM; adjuvants suitable for costimulation including Tomatine, biopolymers, including PLG, PMM, Inulin; microbe derived adjuvants, including Romurtide, DETOX, MPL, CWS, Mannose, CpG nucleic acid sequences, CpG7909, ligands of human TLR 1-10, ligands of murine TLR 1-13, ISS-1018, IC31, Imidazoquinolines, Ampligen, Ribi529, IMOxine, IRIVs, VLPs, cholera toxin, heat-labile toxin, Pam3Cys, Flagellin, GPI anchor, LNFPIII/Lewis X, antimicrobial peptides, UC-1V150, RSV fusion protein, cdiGMP; and adjuvants suitable as antagonists including CGRP neuropeptide.


Particularly preferred, an adjuvant may be selected from adjuvants, which support induction of a Th1-immune response or maturation of naïve T-cells, such as GM-CSF, IL-12, IFNγ, any immunostimulatory nucleic acid as defined above, preferably an immunostimulatory RNA, CpG DNA, etc.


(Poly-)Cationic Compounds:


Suitable adjuvants may also be selected from (poly-)cationic compounds as described herein for complexation of the inventive mRNA. Associating or complexing the inventive mRNA comprised by the (pharmaceutical) composition or vaccine with (poly-)cationic compounds as defined herein preferably provides adjuvant properties and confers a stabilizing effect to the said mRNA.


In particular, preferred (poly-)cationic compounds are selected from (poly-)cationic peptides or proteins, including protamine, nucleoline, spermin or spermidine, or other cationic peptides or proteins, such as poly-L-lysine (PLL), poly-arginine, basic polypeptides, cell penetrating peptides (CPPs), including HIV-binding peptides, Tat, HIV-1 Tat (HIV), Tat-derived peptides, Penetratin, VP22 derived or analog peptides, HSV VP22 (Herpes simplex), MAP, KALA or protein transduction domains (PTDs, PpT620, prolin-rich peptides, arginine-rich peptides, lysine-rich peptides, MPG-peptide(s), Pep-1, L-oligomers, Calcitonin peptide(s), Antennapedia-derived peptides (particularly from Drosophila antennapedia), pAntp, pIsl, FGF, Lactoferrin, Transportan, Buforin-2, Bac715-24, SynB, SynB(1), pVEC, hCT-derived peptides, SAP, protamine, spermine, spermidine, or histones. Further preferred (poly-)cationic compounds may include cationic polysaccharides, for example chitosan, polybrene, cationic polymers, e.g. polyethyleneimine (PEI), cationic lipids, e.g. DOTMA: [1-(2,3-sioleyloxy)propyl)]-N,N,N-trimethylammonium chloride, DMRIE, di-C14-amidine, DOTIM, SAINT, DC-Chol, BGTC, CTAP, DOPC, DODAP, DOPE: Dioleyl phosphatidylethanol-amine, DOSPA, DODAB, DOIC, DMEPC, DOGS: Dioctadecylamidoglicylspermin, DIMRI: Dimyristo-oxypropyl dimethyl hydroxyethyl ammonium bromide, DOTAP: dioleoyloxy-3-(trimethylammonio)propane, DC-6-14: O,O-ditetradecanoyl-N-(α-trimethylammonioacetyl)diethanolamine chloride, CLIP1: rac-[(2,3-dioctadecyloxypropyl)(2-hydroxyethyl)]-dimethylammonium chloride, CLIP6: rac-[2(2,3-dihexadecyloxypropyl-oxymethyloxy)ethyl]-trimethylammonium, CLIP9: rac-[2(2,3-dihexadecyloxypropyl-oxysuccinyloxy)ethyl]-trimethylammonium, oligofectamine, or (poly-)cationic polymers, e.g. modified polyaminoacids, such as β-aminoacid-polymers or reversed polyamides, etc., modified polyethylenes, such as PVP (poly(N-ethyl-4-vinylpyridinium bromide)), etc., modified acrylates, such as pDMAEMA (poly(dimethylaminoethyl methylacrylate)), etc., modified Amidoamines such as pAMAM (poly(amidoamine)), etc., modified polybetaaminoester (PBAE), such as diamine end modified 1,4 butanediol diacrylate-co-5-amino-1-pentanol polymers, etc., dendrimers, such as polypropylamine dendrimers or pAMAM based dendrimers, etc., polyimine(s), such as PEI: poly(ethyleneimine), poly(propyleneimine), etc., polyallylamine, sugar backbone based polymers, such as cyclodextrin based polymers, dextran based polymers, Chitosan, etc., silan backbone based polymers, such as PMOXA-PDMS copolymers, etc., blockpolymers consisting of a combination of one or more cationic blocks (e.g. selected of a cationic polymer as mentioned above) and of one or more hydrophilic- or hydrophobic blocks (e.g. polyethyleneglycole); etc.


Additionally, preferred (poly-)cationic proteins or peptides, which can be used as an adjuvant by complexing the inventive mRNA comprised by the (pharmaceutical) composition or vaccine according to the invention, may be selected from following proteins or peptides having the following total formula (III):





(Arg)l;(Lys)m;(His)n;(Orn)o;(Xaa)x


wherein l+m+n+o+x=8-15, and l, m, n or o independently of each other may be any number selected from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15, provided that the overall content of Arg, Lys, His and Orn represents at least 50% of all amino acids of the oligopeptide; and Xaa may be any amino acid selected from native (=naturally occurring) or non-native amino acids except of Arg, Lys, His or Orn; and x may be any number selected from 0, 1, 2, 3 or 4, provided, that the overall content of Xaa does not exceed 50% of all amino acids of the oligopeptide. Particularly preferred oligoarginines in this context are e.g. Arg7, Arg8, Arg9, Arg7, H3R9, R9H3, H3R9H3, YSSR9SSY, (RKH)4, Y(RKH)2R, etc.


The ratio of the inventive mRNA to the (poly-)cationic compound in the adjuvant component may be calculated on the basis of the nitrogen/phosphate ratio (N/P-ratio) of the entire complex, i.e. the ratio of positively charged (nitrogen) atoms of the (poly-)cationic compound to the negatively charged phosphate atoms of said mRNA. For example, lag of mRNA typically contains about 3 nmol phosphate residues, provided said RNA exhibits a statistical distribution of bases. Additionally, 1 μg of peptide typically contains about x nmol nitrogen residues, dependent on the molecular weight and the number of basic amino acids. When exemplarily calculated for (Arg)9 (molecular weight 1424 g/mol, 9 nitrogen atoms), 1 μg (Arg)9 contains about 700 pmol (Arg)9 and thus 700×9=6300 pmol basic amino acids=6.3 nmol nitrogen atoms. For a mass ratio of about 1:1 RNA/(Arg)9 an N/P ratio of about 2 can be calculated. When exemplarily calculated for protamine (molecular weight about 4250 g/mol, 21 nitrogen atoms, when protamine from salmon is used) with a mass ratio of about 2:1 with 2 μg RNA, 6 nmol phosphate are to be calculated for the RNA; 1 μg protamine contains about 235 pmol protamine molecules and thus 235×21=4935 pmol basic nitrogen atoms=4.9 nmol nitrogen atoms. For a mass ratio of about 2:1 RNA/protamine an N/P ratio of about 0.81 can be calculated. For a mass ratio of about 8:1 RNA/protamine an N/P ratio of about 0.2 can be calculated. In the context of the present invention, an N/P-ratio is preferably in the range of about 0.1-10, preferably in a range of about 0.3-4 and most preferably in a range of about 0.5-2 or 0.7-2 regarding the ratio of RNA:peptide in the complex, and most preferably in the range of about 0.7-1.5.


Preparation:


In preferred embodiments, the (pharmaceutical) composition or vaccine of the present invention is obtained in two separate steps in order to obtain both, an efficient immunostimulatory effect and efficient translation of the mRNA comprised by said (pharmaceutical) composition or vaccine.


In a first step, an mRNA according to the invention is complexed with a (poly-)cationic compound in a specific ratio to form a stable complex (“complexed (RNA”). In this context, it is important, that no free (poly-)cationic compound or only a neglibly small amount remains in the fraction of the complexed RNA. Accordingly, the ratio of the mRNA and the (poly-)cationic compound is typically selected in a range that the mRNA is entirely complexed and no free (poly-)cationic compound or only a neglectably small amount remains in the fraction of said complexed mRNA. Preferably the ratio of the mRNA to the (poly-)cationic compound is selected from a range of about 6:1 (w/w) to about 0.25:1 (w/w), more preferably from about 5:1 (w/w) to about 0.5:1 (w/w), even more preferably of about 4:1 (w/w) to about 1:1 (w/w) or of about 3:1 (w/w) to about 1:1 (w/w), and most preferably a ratio of about 3:1 (w/w) to about 2:1 (w/w).


According to preferred embodiments, in a second step, an mRNA is added to the fraction of the complexed mRNA in order to form the (immunostimulatory) composition of the invention. Therein, said added mRNA is present as free RNA, preferably as free mRNA, which is not complexed by other compounds. Prior to addition, the free RNA is not complexed and will preferably not undergo any detectable or significant complexation reaction upon the addition to the complexed mRNA. This is due to the strong binding of the (poly-)cationic compound to the complexed mRNA. In other words, when the free mRNA is added to the complexed mRNA, preferably no free or substantially no free (poly-)cationic compound is present, which could form a complex with said free mRNA. Accordingly, the free mRNA of the inventive (pharmaceutical) composition or vaccine can efficiently be transcribed in vivo. Therein, the free mRNA, may occur as a mono-, di-, or multicistronic (m)RNA, i.e. an mRNA which carries the coding sequences of one or more antigenic peptides or proteins. Such coding sequences in di-, or even multicistronic mRNA may be separated by at least one IRES sequence, e.g. as defined herein.


In particularly preferred embodiments, the free mRNA comprised by the inventive (pharmaceutical) composition or vaccine, may be identical or different to the complexed mRNA, depending on the specific requirements of therapy. Even more preferably, the free mRNA, which is comprised by the inventive (pharmaceutical) composition or vaccine, is identical to the complexed mRNA, in other words, the (pharmaceutical) composition or vaccine comprises the mRNA in both free and complexed form.


In particularly preferred embodiments, the inventive (pharmaceutical) composition or vaccine thus comprises the mRNA as defined herein, wherein said mRNA is present in the said (pharmaceutical) composition or vaccine partially as free RNA and partially as complexed RNA. Preferably, the mRNA as defined herein, preferably an mRNA, is complexed as described above and the same mRNA is then added in the form of free RNA, wherein preferably the compound, which is used for complexing the mRNA is not present in free form at the moment of addition of the free mRNA.


The ratio of the complexed mRNA and the free mRNA may be selected depending on the specific requirements of a particular therapy. Typically, the ratio of the complexed mRNA and the free mRNA is selected such that a significant stimulation of the innate immune system is elicited due to the presence of the complexed mRNA. In parallel, the ratio is selected such that a significant amount of the free mRNA can be provided in vivo leading to an efficient translation and concentration of the expressed antigenic peptide or protein in vivo. Preferably the ratio of the complexed mRNA to free mRNA in the inventive (pharmaceutical) composition or vaccine is selected from a range of about 5:1 (w/w) to about 1:10 (w/w), more preferably from a range of about 4:1 (w/w) to about 1:8 (w/w), even more preferably from a range of about 3:1 (w/w) to about 1:5 (w/w) or 1:3 (w/w), and most preferably about 1:1 (w/w).


Additionally or alternatively, the ratio of the complexed mRNA and the free mRNA may be calculated on the basis of the nitrogen/phosphate ratio (N/P-ratio) of the entire RNA complex. In the context of the present invention, an N/P-ratio is preferably in the range of about 0.1-10, preferably in a range of about 0.3-4 and most preferably in a range of about 0.5-2 or 0.7-2 regarding the ratio of RNA:peptide in the complex, and most preferably in the range of about 0.7-1.5.


Additionally or alternatively, the ratio of the complexed mRNA and the free mRNA may also be selected on the basis of the molar ratio of both RNAs to each other. Typically, the molar ratio of the complexed mRNA to the free mRNA may be selected such, that the molar ratio suffices the above (w/w) and/or N/P-definitions. More preferably, the molar ratio of the complexed mRNA to the free mRNA may be selected e.g. from a molar ratio of about 0.001:1, 0.01:1, 0.1:1, 0.2:1, 0.3:1, 0.4:1, 0.5:1, 0.6:1, 0.7:1, 0.8:1, 0.9:1, 1:1, 1:0.9, 1:0.8, 1:0.7, 1:0.6, 1:0.5, 1:0.4, 1:0.3, 1:0.2, 1:0.1, 1:0.01, 1:0.001, etc. or from any range formed by any two of the above values, e.g. a range selected from about 0.001:1 to 1:0.001, including a range of about 0.01:1 to 1:0.001, 0.1:1 to 1:0.001, 0.2:1 to 1:0.001, 0.3:1 to 1:0.001, 0.4:1 to 1:0.001, 0.5:1 to 1:0.001, 0.6:1 to 1:0.001, 0.7:1 to 1:0.001, 0.8:1 to 1:0.001, 0.9:1 to 1:0.001, 1:1 to 1:0.001, 1:0.9 to 1:0.001, 1:0.8 to 1:0.001, 1:0.7 to 1:0.001, 1:0.6 to 1:0.001, 1:0.5 to 1:0.001, 1:0.4 to 1:0.001, 1:0.3 to 1:0.001, 1:0.2 to 1:0.001, 1:0.1 to 1:0.001, 1:0.01 to 1:0.001, or a range of about 0.01:1 to 1:0.01, 0.1:1 to 1:0.01, 0.2:1 to 1:0.01, 0.3:1 to 1:0.01, 0.4:1 to 1:0.01, 0.5:1 to 1:0.01, 0.6:1 to 1:0.01, 0.7:1 to 1:0.01, 0.8:1 to 1:0.01, 0.9:1 to 1:0.01, 1:1 to 1:0.01, 1:0.9 to 1:0.01, 1:0.8 to 1:0.01, 1:0.7 to 1:0.01, 1:0.6 to 1:0.01, 1:0.5 to 1:0.01, 1:0.4 to 1:0.01, 1:0.3 to 1:0.01, 1:0.2 to 1:0.01, 1:0.1 to 1:0.01, 1:0.01 to 1:0.01, or including a range of about 0.001:1 to 1:0.01, 0.001:1 to 1:0.1, 0.001:1 to 1:0.2, 0.001:1 to 1:0.3, 0.001:1 to 1:0.4, 0.001:1 to 1:0.5, 0.001:1 to 1:0.6, 0.001:1 to 1:0.7, 0.001:1 to 1:0.8, 0.001:1 to 1:0.9, 0.001:1 to 1:1, 0.001 to 0.9:1, 0.001 to 0.8:1, 0.001 to 0.7:1, 0.001 to 0.6:1, 0.001 to 0.5:1, 0.001 to 0.4:1, 0.001 to 0.3:1, 0.001 to 0.2:1, 0.001 to 0.1:1, or a range of about 0.01:1 to 1:0.01, 0.01:1 to 1:0.1, 0.01:1 to 1:0.2, 0.01:1 to 1:0.3, 0.01:1 to 1:0.4, 0.01:1 to 1:0.5, 0.01:1 to 1:0.6, 0.01:1 to 1:0.7, 0.01:1 to 1:0.8, 0.01:1 to 1:0.9, 0.01:1 to 1:1, 0.001 to 0.9:1, 0.001 to 0.8:1, 0.001 to 0.7:1, 0.001 to 0.6:1, 0.001 to 0.5:1, 0.001 to 0.4:1, 0.001 to 0.3:1, 0.001 to 0.2:1, 0.001 to 0.1:1, etc.


Even more preferably, the molar ratio of the complexed mRNA to the free mRNA may be selected e.g. from a range of about 0.01:1 to 1:0.01. Most preferably, the molar ratio of the complexed mRNA to the free mRNA may be selected e.g. from a molar ratio of about 1:1. Any of the above definitions with regard to (w/w) and/or N/P ratio may also apply.


Adjuvant Nucleic Acids:


Suitable adjuvants may furthermore be selected from nucleic acids having the formula (Va):





GlXmGn


wherein:

    • G is guanosine, uracil or an analogue of guanosine or uracil;
    • X is guanosine, uracil, adenosine, thymidine, cytosine or an analogue of the above-mentioned nucleotides;
    • l is an integer from 1 to 40, wherein when l=1 G is guanosine or an analogue thereof, when l>1 at least 50% of the nucleotides are guanosine or an analogue thereof;
    • m is an integer and is at least 3; wherein when m=3 X is uracil or an analogue thereof, when m>3 at least 3 successive uracils or analogues of uracil occur;
    • n is an integer from 1 to 40, wherein when n=1 G is guanosine or an analogue thereof, when n>1 at least 50% of the nucleotides are guanosine or an analogue thereof,


      or formula (Vb):





(NuGlXmGnNv)a,


wherein:

    • G is guanosine (guanine), uridine (uracil) or an analogue of guanosine (guanine) or uridine (uracil), preferably guanosine (guanine) or an analogue thereof;
    • X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine), or an analogue of these nucleotides (nucleosides), preferably uridine (uracil) or an analogue thereof;
    • N is a nucleic acid sequence having a length of about 4 to 50, preferably of about 4 to 40, more preferably of about 4 to 30 or 4 to 20 nucleic acids, each N independently being selected from guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of these nucleotides (nucleosides);
    • a is an integer from 1 to 20, preferably from 1 to 15, most preferably from 1 to 10;
    • l is an integer from 1 to 40, wherein when l=1, G is guanosine (guanine) or an analogue thereof, when l>1, at least 50% of these nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof;
    • m is an integer and is at least 3; wherein when m=3, X is uridine (uracil) or an analogue thereof,
    • and when m>3, at least 3 successive uridines (uracils) or analogues of uridine (uracil) occur;
    • n is an integer from 1 to 40, wherein when n=1, G is guanosine (guanine) or an analogue thereof, when n>1, at least 50% of these nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof;
    • u,v may be independently from each other an integer from 0 to 50, preferably wherein when u=0, 2≥1, or when v=0, u≥1; wherein the nucleic acid molecule of formula (Vb) has a length of at least 50 nucleotides, preferably of at least 100 nucleotides, more preferably of at least 150 nucleotides, even more preferably of at least 200 nucleotides and most preferably of at least 250 nucleotides.


Other suitable adjuvants may furthermore be selected from nucleic acids having the formula (VI):





ClXmCn


wherein:

    • C is cytosine, uracil or an analogue of cytosine or uracil;
    • X is guanosine, uracil, adenosine, thymidine, cytosine or an analogue of the above-mentioned nucleotides;
    • l is an integer from 1 to 40, wherein when l=1 C is cytosine or an analogue thereof, when l>1 at least 50% of the nucleotides are cytosine or an analogue thereof;
    • m is an integer and is at least 3; wherein when m=3 X is uracil or an analogue thereof, when m>3 at least 3 successive uracils or analogues of uracil occur;
    • n is an integer from 1 to 40, wherein when n=1 C is cytosine or an analogue thereof, when n>1 at least 50% of the nucleotides are cytosine or an analogue thereof.


In this context the disclosure of WO2008/014979 and WO2009/095226 is also incorporated herein by reference.


Further Agents


In further preferred embodiments it is also possible that the inventive (pharmaceutical) composition or vaccine contains, in addition to the inventive mRNA, further agents or components optionally selected from the group comprising: further antigens (e.g. in the form of a peptide or protein) or further nucleic acids encoding an antigenic peptide or protein; further immunotherapeutic agents; further auxiliary substances; or any further compound, which is known to be immunostimulating due to its binding affinity (as ligands) to human Toll-like receptors; and/or an adjuvant nucleic acid, preferably an immunostimulatory RNA (isRNA).


Auxiliary Substances:


The inventive (pharmaceutical) composition or vaccine may thus additionally contain one or more auxiliary substances in order to increase its immunogenicity or immunostimulatory capacity, if desired. Such auxiliary substances preferably act together with the inventive mRNA in a synergistic manner.


In general, it is possible to use as auxiliary substance any agent that influences the immune system in the manner of a “danger signal” (LPS, GP96, etc.) or cytokines, such as GM-CFS, which allow an immune response to be enhanced and/or influenced in a targeted manner. Particularly preferred auxiliary substances are cytokines, such as monokines, lymphokines, interleukins or chemokines, that further promote the innate immune response, such as IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL-20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32, IL-33, IFN-alpha, IFN-beta, IFN-gamma, GM-CSF, G-CSF, M-CSF, LT-beta or TNF-alpha, growth factors, such as hGH.


The inventive (pharmaceutical) composition or vaccine may also additionally contain any further compound, which is known to be immune-stimulating due to its binding affinity (as ligands) to human Toll-like receptors TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, or due to its binding affinity (as ligands) to murine Toll-like receptors TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, TLR11, TLR12 or TLR13.


Another class of compounds, which may be added to an inventive (pharmaceutical) composition or vaccine may be CpG nucleic acids, in particular CpG-RNA or CpG-DNA. A CpG-RNA or CpG-DNA can be a single-stranded CpG-DNA (ss CpG-DNA), a double-stranded CpG-DNA (dsDNA), a single-stranded CpG-RNA (ss CpG-RNA) or a double-stranded CpG-RNA (ds CpG-RNA). The CpG nucleic acid is preferably in the form of CpG-RNA, more preferably in the form of single-stranded CpG-RNA (ss CpG-RNA). The CpG nucleic acid preferably contains at least one or more (mitogenic) cytosine/guanine dinucleotide sequence(s) (CpG motif(s)). According to a first preferred alternative, at least one CpG motif contained in these sequences, that is to say the C (cytosine) and the G (guanine) of the CpG motif, is unmethylated. All further cytosines or guanines optionally contained in these sequences can be either methylated or unmethylated. According to a further preferred alternative, however, the C (cytosine) and the G (guanine) of the CpG motif can also be present in methylated form.


Another class of compounds, which may be added to an inventive (pharmaceutical) composition or vaccine may be immunostimulatory RNAs, i.e. RNAs that are able to induce an innate immune response. isRNAs usually do not comprise open reading frame and thus does not provide an epitope or antigen or immunogen but elicit an immune response e.g. by binding to a specific kind of Toll-like-receptor (TLR) or other suitable receptors. However, of course also (m)RNAs having an open reading frame and coding for a peptide/protein may induce an innate immune response and, thus, may be immunostimulatory RNAs.


Pharmaceutically Acceptable Carriers and Excipients


Preferably, the (pharmaceutical) composition or vaccine according to the invention comprises at least one pharmaceutically acceptable excipient, in particular at least one pharmaceutically acceptable carrier. The term “pharmaceutically acceptable” refers to a compound or agent that is compatible with the one or more active agent(s) (here: the inventive mRNA) and does not interfere with and/or substantially reduce their pharmaceutical activities. Pharmaceutically acceptable carriers preferably have sufficiently high purity and sufficiently low toxicity to make them suitable for administration to a subject to be treated.


Pharmaceutically acceptable excipients can exhibit different functional roles and include, without limitation, diluents, fillers, bulking agents, carriers, disintegrants, binders, lubricants, glidants, coatings, solvents and co-solvents, buffering agents, preservatives, adjuvants, anti-oxidants, wetting agents, anti-foaming agents, thickening agents, sweetening agents, flavouring agents and humectants.


For (pharmaceutical) compositions or vaccines in liquid form, useful pharmaceutically acceptable excipients in general include solvents, diluents or carriers such as (pyrogen-free) water, (isotonic) saline solutions such phosphate or citrate buffered saline, fixed oils, vegetable oils, such as, for example, groundnut oil, cottonseed oil, sesame oil, olive oil, corn oil, ethanol, polyols (for example, glycerol, propylene glycol, polyetheylene glycol, and the like); lecithin; surfactants; preservatives such as benzyl alcohol, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like; isotonic agents such as sugars, polyalcohols such as manitol, sorbitol, or sodium chloride; aluminum monostearate or gelatin; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. Buffers may be hypertonic, isotonic or hypotonic with reference to the specific reference medium, i.e. the buffer may have a higher, identical or lower salt content with reference to the specific reference medium, wherein preferably such concentrations of the aforementioned salts may be used, which do not lead to damage of cells due to osmosis or other concentration effects. Reference media are e.g. liquids occurring in “in vivo” methods, such as blood, lymph, cytosolic liquids, or other body liquids, or e.g. liquids, which may be used as reference media in “in vitro” methods, such as common buffers or liquids. Such common buffers or liquids are known to a skilled person. Ringer-Lactate solution is particularly preferred as a liquid basis.


Carriers


Suitable pharmaceutically acceptable carriers are typically chosen based on the formulation of the (pharmaceutical) composition or vaccine.


Liquid (pharmaceutical) compositions or vaccines administered via injection and in particular via i.v. injection should be sterile and stable under the conditions of manufacture and storage. Such compositions are typically formulated as parenterally acceptable aqueous solutions that are pyrogen-free, have suitable pH, are isotonic and maintain stability of the active ingredient(s).


Particularly useful pharmaceutically acceptable carriers for liquid (pharmaceutical) compositions or vaccines according to the invention include water, typically pyrogen-free water; isotonic saline or buffered (aqueous) solutions, e.g phosphate, citrate etc. buffered solutions. Particularly for injection of the inventive (pharmaceutical) compositions or vaccines, water or preferably a buffer, more preferably an aqueous buffer, may be used, containing a sodium salt, preferably at least 50 mM of a sodium salt, a calcium salt, preferably at least 0.01 mM of a calcium salt, and optionally a potassium salt, preferably at least 3 mM of a potassium salt.


According to preferred embodiments, the sodium, calcium and, optionally, potassium salts may occur in the form of their halogenides, e.g. chlorides, iodides, or bromides, in the form of their hydroxides, carbonates, hydrogen carbonates, or sulfates, etc. Without being limited thereto, examples of sodium salts include e.g. NaCl, NaI, NaBr, Na2CO3, NaHCO3, Na2SO4, examples of the optional potassium salts include e.g. KCl, KI, KBr, K2CO3, KHCO3, K2SO4, and examples of calcium salts include e.g. CaCl2, CaI2, CaBr2, CaCO3, CaSO4, Ca(OH)2. Furthermore, organic anions of the aforementioned cations may be contained in the buffer.


According to more preferred embodiments, the buffer suitable for injection purposes as defined above, may contain salts selected from sodium chloride (NaCl), calcium chloride (CaCl2) and optionally potassium chloride (KCl), wherein further anions may be present additional to the chlorides. CaCl2 can also be replaced by another salt like KCl. Typically, the salts in the injection buffer are present in a concentration of at least 50 mM sodium chloride (NaCl), at least 3 mM potassium chloride (KCl) and at least 0.01 mM calcium chloride (CaCl2). The injection buffer may be hypertonic, isotonic or hypotonic with reference to the specific reference medium, i.e. the buffer may have a higher, identical or lower salt content with reference to the specific reference medium, wherein preferably such concentrations of the afore mentioned salts may be used, which do not lead to damage of cells due to osmosis or other concentration effects. Reference media are e.g. in “in vivo” methods occurring liquids such as blood, lymph, cytosolic liquids, or other body liquids, or e.g. liquids, which may be used as reference media in “in vitro” methods, such as common buffers or liquids. Such common buffers or liquids are known to a skilled person. Ringer-Lactate solution is particularly preferred as a liquid basis.


For (pharmaceutical) compositions in (semi-)solid form, useful pharmaceutically acceptable excipients include binders such as microcrystalline cellulose, gum tragacanth or gelatin; starch or lactose; sugars, such as, for example, lactose, glucose and sucrose; starches, such as, for example, corn starch or potato starch; cellulose and its derivatives, such as, for example, sodium carboxymethylcellulose, ethylcellulose, cellulose acetate; disintegrants such as alginic acid; lubricants such as magnesium stearate; glidants such as stearic acid, magnesium stearate; calcium sulphate, colloidal silicon dioxide and the like; sweetening agents such as sucrose or saccharin; and/or flavouring agents such as peppermint, methyl salicylate, or orange flavouring.


Formulation


(Pharmaceutical) compositions for topical administration can be formulated as creams, ointments, gels, pastes or powders. (Pharmaceutical) compositions for oral administration can be formulated as tablets, capsules, liquids, powders or in a sustained release format.


According to preferred embodiments, the inventive mRNA, (pharmaceutical) composition or vaccine is administered parenterally, in particular via intradermal or intramuscular injection. Accordingly, mRNAs, (pharmaceutical) compositions or vaccines of the invention are preferably formulated for parenteral administration (in particular injection) and are thus typically provided in liquid form (e.g. lipid based or saline based) or lyophilized form. Parenteral formulations are typically stored in vials, IV bags, ampoules, cartridges, or prefilled syringes and can be administered as injections, inhalants, or aerosols, with injections being preferred. Preferred parenteral formulations for injection include sterile solutions of water, physiological saline or mixtures thereof, with a physiological pH of about 7.4


Lyophilized Formulations:


In preferred embodiments, the mRNA, (pharmaceutical) composition or vaccine of the invention is provided in lyophilized form. Preferably, the lyophilized mRNA, (pharmaceutical) composition or vaccine is reconstituted in a suitable buffer, advantageously based on an aqueous carrier, prior to administration, e.g. Ringer-Lactate solution, which is preferred, Ringer solution, a phosphate buffer solution. In some embodiments, the (pharmaceutical) composition or vaccine according to the invention contains at least two, three, four, five, six or more mRNAs, preferably mRNAs which are provided separately in lyophilized form (optionally together with at least one further additive) and which are preferably reconstituted separately in a suitable buffer (such as Ringer-Lactate solution) prior to their use so as to allow individual administration of each of said mRNAs.


Liquid Formulations:


In further preferred embodiments, the mRNA, (pharmaceutical) composition or vaccine is provided in the form of a saline or a lipid-based formulation. Lipid-based formulations may be selected from, but not limited to, liposomes, lipoplexes, nanoliposomes and lipid nanoparticles which are described above in the section headed “Complexation”.


Dose


The (pharmaceutical) composition or vaccine typically comprises a safe and effective amount the inventive mRNA.


As used herein, “safe and effective amount” means an amount of the active agent(s) that is sufficient to significantly induce a positive modification of the disease to be treated. At the same time, however, a “safe and effective amount” is small enough to avoid serious side-effects, that is to say to permit a sensible relationship between advantage and risk.


In the context of the present invention, the expression “safe and effective amount” preferably means an amount of the active agent(s) that is suitable for stimulating the adaptive immune system in such a manner that no excessive or damaging immune reactions are achieved but, preferably, also no such immune reactions below a measurable level.


A “safe and effective amount” of any of the active agent(s) will furthermore vary in connection with the particular condition to be treated and also with the age and physical condition of the patient to be treated, the severity of the condition, the duration of the treatment, the nature of the accompanying therapy, of the particular pharmaceutically acceptable carrier used, and similar factors, within the knowledge and experience of the accompanying doctor.


Specifically, a “safe and effective amount” of the inventive mRNA may furthermore be selected depending on the type of mRNA, e.g. monocistronic, bi- or even multicistronic RNA, since a bi- or even multicistronic RNA may lead to a significantly higher expression of the encoded antigen(s) than the use of an equal amount of a monocistronic RNA.


Kit


In a further aspect, the present invention relates to a kit or kit-of-parts comprising as its components the mRNA, (pharmaceutical) composition or vaccine according to the invention, and optionally technical instructions with information on the administration and dosage of said components.


Optionally, the kit-of-parts may comprise at least one further agent as defined herein in the context of the pharmaceutical composition, antimicrobial agents, RNase inhibitors, solubilizing agents or the like.


The kit-of-parts may be a kit of two or more parts and typically comprises the components in suitable containers. For example, each container may be in the form of vials, bottles, squeeze bottles, jars, sealed sleeves, envelopes or pouches, tubes or blister packages or any other suitable form provided the container is configured so as to prevent premature mixing of components. Each of the different components may be provided separately, or some of the different components may be provided together (i.e. in the same container). A container may also be a compartment or a chamber within a vial, a tube, a jar, or an envelope, or a sleeve, or a blister package or a bottle, provided that the contents of one compartment are not able to associate physically with the contents of another compartment prior to their deliberate mixing by a pharmacist or physician.


Medical Use and Treatment


The inventive mRNA, the (pharmaceutical composition), vaccine or kit-of-parts defined herein may be used for human and also for veterinary medical purposes, preferably for human medical purposes.


According to a further aspect, the invention thus relates to the inventive mRNA, (pharmaceutical composition), vaccine or kit-of-parts for use as a medicament. The inventive mRNA, (pharmaceutical composition), vaccine or kit-of-parts are particularly useful for treatment and/or prophylaxis of diseases of lassa infections or disorders related thereto.


The term “treatment” or “treating” of a disease includes preventing or protecting against the disease (that is, causing the clinical symptoms not to develop); inhibiting the disease (i.e., arresting or suppressing the development of clinical symptoms; and/or relieving the disease (i.e., causing the regression of clinical symptoms). As will be appreciated, it is not always possible to distinguish between “preventing” and “suppressing” a disease or disorder since the ultimate inductive event or events may be unknown or latent. Accordingly, the term “prophylaxis” will be understood to constitute a type of “treatment” that encompasses both “preventing” and “suppressing.” The term “treatment” thus includes “prophylaxis”.


The term “subject”, “patient” or “individual” as used herein generally includes humans and non-human animals and preferably mammals (e.g., non-human primates, including marmosets, tamarins, spider monkeys, owl monkeys, vervet monkeys, squirrel monkeys, and baboons, macaques, chimpanzees, orangutans, gorillas; cows; horses; sheep; pigs; chicken; cats; dogs; mice; rat; rabbits; guinea pigs; etc.), including chimeric and transgenic animals and disease models. In the context of the present invention, the term “subject” preferably refers a non-human primate or a human, most preferably a human.


Accordingly, the present invention further provides methods of treating or preventing a lassa infection, by providing an mRNA, (pharmaceutical) composition, vaccine or kit-of-parts as defined herein, and applying or administering said mRNA, (pharmaceutical) composition, vaccine or kit-of-parts to a subject in need thereof, preferably in a pharmaceutically and/or therapeutically effective amount. Administration to the tissue or to the subject is preferably accomplished by subcutaneous, intramuscular or intradermal injection, preferably by intramuscular or intradermal injection, more preferably by intradermal injection. Preferably, injection is carried out by using conventional needle injection or jet injection, preferably by using jet injection.


The invention also relates to the use of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts, preferably for modulating, preferably for inducing or enhancing, an immune response in a subject, more preferably for the treatment or prophylaxis of a lassa infection as defined herein.


Administration Routes


The inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts can be administered, for example, systemically or locally.


Routes for systemic administration in general include, for example, transdermal, oral, parenteral routes, including subcutaneous, intravenous, intramuscular, intraarterial, intradermal and intraperitoneal injections and/or intranasal administration routes.


Routes for local administration in general include, for example, topical administration routes but also intradermal, transdermal, subcutaneous, or intramuscular injections or intralesional, intracranial, intrapulmonal, intracardial, and sublingual injections.


It is further conceivable to use different administration routes for different components of the inventive (pharmaceutical) composition, vaccine or kit-of-parts, for instance in case said (pharmaceutical) composition, vaccine or kit-of-parts comprises several mRNA species.


According to preferred embodiments, the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts is administered by a parenteral route, preferably via intradermal, subcutaneous, or intramuscular routes. Preferably, said mRNA, (pharmaceutical) composition, vaccine or kit-of-parts is administered by injection, e.g. subcutaneous, intramuscular or intradermal injection, which may be needle-free and/or needle injection. Accordingly, in preferred embodiments, the medical use and/or method of treatment according to the present invention involves administration of said mRNA, (pharmaceutical) composition, vaccine or kit-of-parts by subcutaneous, intramuscular or intradermal injection, preferably by intramuscular or intradermal injection, more preferably by intradermal injection. Such injection may be carried out by using conventional needle injection or jet injection, preferably by using jet injection.


Administration Regimen


The inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts may be administered to a subject in need thereof several times a day, daily, every other day, weekly, or monthly; and may be administered sequentially or simultaneously.


In case the (pharmaceutical) composition, vaccine or kit-of-parts comprises several components (e.g. different mRNA species), said components may be administered to a subject in need thereof via different administration routes as defined above.


Such components may be administered simultaneously (i.e. at the same time via the same or different administrations routes) or separately (i.e. sequentially at different time points and/or via different administrations routes). Such a sequential administration scheme is also referred to as “time-staggered” administration. A time-staggered administration of the several components of the inventive (pharmaceutical) composition, vaccine or kit-of-parts may ensure that the separate mechanisms elicited by said components do not negatively influence each other. Time-staggered administration may for instance mean that one mRNA species is administrated e.g. prior, concurrent or subsequent to different mRNA species.


Dosage


The inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts are preferably administered to the subject in need thereof in a “pharmaceutically effective” amount.


A “pharmaceutically effective amount” in the context of the invention is typically understood as an amount that is sufficient to induce a desired pharmaceutical effect, such as an immune response. Preferably, the administered amount of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts is also “therapeutically effective”, i.e. sufficient for the alleviation of the symptoms of the disease or condition being treated and/or for prophylaxis of the symptoms of the disease or condition being prevented. In other words, a “therapeutically effective amount” means an amount of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts that is sufficient to significantly induce a positive modification of a disease or disorder, i.e. an amount of the active ingredient that elicits the biological or medicinal response in a tissue, system, animal or human that is being sought.


Therapeutic efficacy and toxicity of inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and can be expressed as the ratio LD50/ED50. mRNAs, (pharmaceutical) composition, vaccine or kit-of-parts which exhibit large therapeutic indices are generally preferred. The data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED50 with little or no toxicity.


The term also includes an amount of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts sufficient to reduce the progression of the disease, for instance to reduce or inhibit lassa virus replication. At the same time, however, a “therapeutically effective amount” is preferably small enough to avoid serious side-effects, i.e. to permit a sensible relationship between advantage and risk. The determination of these limits typically lies within the scope of sensible medical judgment. A “therapeutically effective amount” of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts will furthermore vary in connection with the particular disease or condition to be treated, characteristics of the patient (including age, physical condition, body weight, sex and diet), concurrent treatments, pharmacokinetic properties of the active agent(s), treatment regime and the desired effect (amelioration vs. complete remission), etc.


For instance, therapeutically effective doses of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts described herein may range from about 0.001 mg to 10 mg, preferably from about 0.01 mg to 5 mg, more preferably from about 0.1 mg to 2 mg per dosage unit or from about 0.01 nmol to 1 mmol per dosage unit, in particular from 1 nmol to 1 mmol per dosage unit, preferably from 1 pmol to 1 mmol per dosage unit. It is also envisaged that the therapeutically effective dose of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts may range (per kg body weight) from about 0.01 mg/kg to 10 g/kg, preferably from about 0.05 mg/kg to 5 g/kg, more preferably from about 0.1 mg/kg to 2.5 g/kg.


The suitable amount of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts according to the invention to be administered can be determined by routine experiments, e.g. by using animal models. Such models include, without implying any limitation, rabbit, sheep, mouse, rat, dog and non-human primate models.


Combination Therapy


The inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts may also be used in combination therapy. Any other therapy useful for treating or preventing lassa virus infections may be combined with the uses and methods disclosed herein. Such therapies include, inter alia, the administration of Ribavirin.


Administration of the inventive mRNA, (pharmaceutical) composition, vaccine or kit-of-parts may be accomplished prior to, simultaneously and/or subsequently to administering another therapeutic or subjecting the patient to another therapy that is useful for treating or preventing lassa virus infections (such as Ribavirin administration).





BRIEF DECRYPTION OF THE DRAWINGS


FIG. 1 shows that mRNA constructs encoding Lassa virus glycoprotein precursor (GPC) of different lassa virus clades (clade I, II, III and IV) were expressed and processed in vitro. Further details are provided in Example 3.





EXAMPLES

The Examples shown in the following are merely illustrative and shall describe the present invention in a further way. These Examples shall not be construed to limit the present invention thereto.


Example 1: Preparation of mRNA Constructs for In Vitro and In Vivo Experiments

1.1. Preparation of DNA and mRNA Constructs:


For the present Examples, DNA sequences encoding Lassa virus proteins are prepared and used for subsequent RNA in vitro transcription reactions. The generated RNA constructs (RNA sequences) are provided in the sequence listing (Lassa virus sequences: SEQ ID NOs: 5541, 5729, 5731, 5529, 5356, 5540, and 5913) and in Table 6 with the encoded proteins and virus origin and virus clade indicated.









TABLE 6







mRNA constructs used in the present expamles














Virus



SEQ ID
mRNA


RNA ID
strain
Virus clade
Construct description
SEQ ID NO RNA
Protein
design





R6023/
Josiah
clade IV
LASV(Josiah)-GPC(GC)
SEQ ID NO:
SEQ ID NO:
design 2


R6630



5541
186


R6024
Josiah
clade IV
LASV(Josiah)-NP(GC)
SEQ ID NO:
SEQ ID NO:
design 2






5729
374


R6025
Josiah
clade IV
LASV(Josiah)-Z(GC)
SEQ ID NO:
SEQ ID NO:
design 2






5731
3451


R6026/
LP
clade I
LASV(LP)-GPC(GC)
SEQ ID NO:
SEQ ID NO:
design 2


R6631



5529
174


R6027/
803213
clade II
LASV(803213)-
SEQ ID NO:
SEQ ID NO: 1
design 2


R6632


GPC(GC)
5356


R6028/
GA391
clade III
LASV(GA391)-
SEQ ID NO:
SEQ ID NO:
design 2


R6633


GPC(GC)
5540
185


R6717
Josiah
clade IV
LASV(Josiah)-GPC(GC)
SEQ ID NO:
SEQ ID NO:
design 1






5013
186









DNA sequences are prepared by modifying the wild type encoding DNA sequences by introducing a GC-optimized sequence for stabilization, using an in silico algorithms that increase the GC content of the respective coding sequence. Moreover, sequences are introduced into a pUC19 derived vector and modified to comprise stabilizing sequences derived from alpha-globin-3′-UTR, a stretch of 30 cytosines, a histone-stem-loop structure, and a stretch of 64 adenosines at the 3′-terminal end (poly-A-tail) (indicated as “mRNA design 1” in Table 1-3 and 6). Other sequences were introduced into a pUC19 derived vector to comprise stabilizing sequences derived from 32L4 5′-UTR ribosomal 5′ TOP UTR and 3′-UTR derived from albumin 7, a stretch of 30 cytosines, a histone-stem-loop structure, and a stretch of 64 adenosines at the 3′-terminal end (poly-A-tail) (indicated as “mRNA design 2” in Table 1-3 and 6). Generated mRNA constructs are provided in Table 6


The obtained plasmid DNA constructs are transformed and propagated in bacteria (Escherichia coli) using common protocols known in the art.


1.2. RNA In Vitro Transcription on Linearized pDNA:


The DNA plasmids prepared according to paragraph 1 are enzymatically linearized using EcoRI and transcribed in vitro using DNA dependent T7 RNA polymerase in the presence of a nucleotide mixture and cap analog (m7GpppG) under suitable buffer conditions. RNA production is performed under current good manufacturing practice according to WO2016/180430. The obtained mRNAs are purified using PureMessenger® (CureVac, Tübingen, Germany; WO2008/077592) and used for in vitro and in vivo experiments. The generated mRNA constructs are indicated as “mRNA design 1” and as “mRNA design 2” in Table 1-3.


1.3. RNA In Vitro Transcription on PCR Amplified DNA Templates:


DNA plasmids prepared according to paragraph 1, or synthic DNA constructs are used for PCR-amplification. The generated PCR templates are used for subsequent RNA in vitro transcription using DNA dependent T7 RNA polymerase in the presence of a nucleotide mixture and cap analog (m7GpppG) under suitable buffer conditions. The obtained mRNA constructs are purified using PureMessenger® (CureVac, Tubingen, Germany; WO2008/077592) and used for in vitro and in vivo experiments. The generated mRNA constructs are indicated as “mRNA design 3” in Table 1-3.


Example 2: Expression of Lassa Virus Proteins in HeLa Cells and Analysis by FACS

To determine in vitro protein expression of the constructs, HeLa cells are transiently transfected with mRNA encoding Lassa virus antigens and stained using suitable customized anti-LASV antibodies (raised in rabbits), counterstained with a FITC-coupled secondary antibody.


HeLa cells are seeded in a 6-well plate at a density of 400,000 cells/well in cell culture medium (RPMI, 10% FCS, 1% L-Glutamine, 1% Pen/Strep), 24 h prior to transfection. HeLa cells are transfected with 1 and 2 μg unformulated mRNA using Lipofectamine 2000 (Invitrogen). The mRNA constructs according to Example 1 are used in the experiment, including a negative control encoding an irrelevant protein.


24 hours post transfection, HeLa cells are stained with suitable anti anti-LASV antibodies (raised in rabbits; 1:200) and anti-rabbits FITC labelled secondary antibody (1:500) and subsequently analyzed by flow cytometry (FACS) on a BD FACS Canto II using the FACS Diva software. Quantitative analysis of the fluorescent FITC signal is performed using the FlowJo software package (Tree Star, Inc.).


Example 3: Expression of Lassa Virus Proteins Using Western Blot

The results of the present Example show that mRNA encoding Lassa Virus GPC protein are expressed and processed in Hela cells after transfection.


For the analysis of Lassa virus protein expression, HeLa cells are transfected with 1 μg and 2 μg unformulated mRNA using Lipofectamine as the transfection agent. Supernatants, harvested 24 hours post transfection, are filtered through a 0.2 μm filter. Clarified supernatants are applied on top of 1 ml 20% sucrose cushion (in PBS) and centrifuged at 14000 rcf (relative centrifugal force) for 2 hours at 4° C. Protein content is analyzed by Western Blot using suitable customized polyclonal anti-LASV antibodies (raised in rabbits) (1:1000) as primary antibody in combination with secondary anti rabbit antibody coupled to IRDye 800CW (Licor Biosciences). The presence of αβ-tubulin is also analyzed as control for cellular contamination (αβ-tubulin; Cell Signalling Technology; 1:1000 diluted) in combination with secondary anti mouse antibody coupled to IRDye 680RD (Licor Biosciences).


For the analysis LASV proteins in cell lysates, HeLa cells were transfected with 2 μg unformulated mRNAs (generated according to Example 1, see Table 6) using Lipofectamine as the transfection agent 24 hours post transfection, HeLa cells are detached by trypsin-free/EDTA buffer, harvested, and cell lysates are prepared. Cell lysates are subjected to SDS-PAGE under denaturating and reducing conditions followed by western blot detection. Western Blot analysis is performed using suitable customized polyclonal rabbit anti-LASV GP antibodies (diluted 1:200) as primary antibody in combination with secondary anti anti-rabbit antibody coupled to IRDye 800CW (Licor Biosciences). The presence of αβ-tubulin was analyzed (αβ-tubulin; Cell Signalling Technology; 1:1000 diluted) in combination with secondary anti rabbit antibody coupled to IRDye 680RD (Licor Biosciences). Inactivated Lassa virus (Josiah) was used as positive control for the western blot. The outline of the experiment is shown in Table 7. The result of the experiment is shown in FIG. 1.









TABLE 7







Expression analysis experiment (Example 3):















Lassa virus


Lane
SEQ ID No
R#
encoded antigen
clade





1
SEQ ID NO: 5541
R6023
LASV(Josiah)-GP
clade IV


2
SEQ ID NO: 5529
R6026
LASV(LP)-GP
clade I


3
SEQ ID NO: 5356
R6027
LASV(803213)-GP
clade II


4
SEQ ID NO: 5540
R6028
LASV(GA391)-GP
clade III


5
SEQ ID NO: 5729
R6024
LASV(Josiah)-NP
clade IV


6


WFI


7


Inactivated
clade IV





LASV Josiah









Results:


As shown in FIG. 1, mRNA constructs encoding lassa virus glycoprotein precursors (GPC) were expressed in vitro. Moreover, the expressed encoding lassa virus glycoprotein precursors were properly processed, thus cotranslationally cleaved by signal peptidases. The subunit protein GP2 was detected as expected.


The results exemplify that the inventive mRNA encoding Lassa virus GPC protein is translated in cells and that alternative mRNA constructs according to the invention may also be translated in cells, which is a prerequisite for an mRNA-based vaccine.


Example 4: Vaccination of Mice with mRNA Encoding Lassa Virus Proteins

4.1. Preparation of Protamine Complexed mRNA (“Vaccine Composition 1”):


Lassa virus mRNA constructs are complexed with protamine prior to use in in vivo vaccination experiments. The mRNA complexation consists of a mixture of 50% free mRNA and 50% mRNA complexed with protamine at a weight ratio of 2:1. First, mRNA is complexed with protamine by addition of protamine-Ringer's lactate solution to mRNA. After incubation for 10 minutes, when the complexes are stably generated, free mRNA is added, and the final concentration of the vaccine is adjusted with Ringer's lactate solution.


4.2. Immunization:


Female BALB/c mice are injected intradermally (i.d.) with mRNA vaccine compositions with doses, application routes and vaccination schedules as indicated in Table 8. As a negative control, one group of mice is vaccinated with buffer (ringer lactate). All animals are vaccinated on day 0, 21 and 42. Blood samples are collected on day 21, 35, and 49 for the determination of antibody titers. Splenocytes are isolated on day 49 for T-cell analysis.









TABLE 8







Vaccination regimen (Example 4):













Number
Vaccine





Group
of mice
composition
SEQ ID NO
Dose
Route/Volume





1
6
R6630
SEQ ID NO:
80 μg
i.d. 2 × 50 μl




(Lassa GPC
5541




Josiah strain)


2
6
R6631
SEQ ID NO:
80 μg
i.d. 2 × 50 μl




(Lassa GPC
5529




LP strain)


3
6
RiLa buffer











4.3. Detection of Specific Humoral Immune Responses:


Hela cells are transfected with 2 μg of either R6630 or R6631 mRNA using lipofectamine. The cells are harvested 20 h post transfection, and seeded at 1×105 per well into an 96 well plate. The cells are incubated with sera of the vaccinated mice (diluted 1:50) followed by a FITC-conjugated anti-mouse IgG antibody. Cells are acquired on BD FACS Canto II using DIVA software and analyzed by FlowJo.


4.4. Determination of Anti Lassa Virus Protein Antibodies by ELISA:


ELISA is performed using recombinant Lassa Glycoproteins for coating. Coated plates are incubated using respective serum dilutions, and binding of specific antibodies to the Lassa virus antigens are detected using biotinylated isotype specific anti-mouse antibodies followed by streptavidin-HRP (horse radish peroxidase) with ABTS as substrate. Endpoint titers of antibodies directed against the Lassa virus antigens are measured by ELISA on day 49 after three vaccinations.


4.5. Intracellular Cytokine Staining:


Splenocytes from vaccinated mice are isolated according to a standard protocol known in the art. Briefly, isolated spleens are grinded through a cell strainer and washed in PBS/1% FBS followed by red blood cell lysis. After an extensive washing step with PBS/1% FBS splenocytes are seeded into 96-well plates (2×106 cells per well). The cells are stimulated with recombinant LASV virus in the presence of 2.5 μg/ml of an anti-CD28 antibody (BD Biosciences) and α-CD107a-PE-Cy7 (1:100) for 24 hours at 37° C. After stimulation, cells are washed and stained for intracellular cytokines using the Cytofix/Cytoperm reagent (BD Biosciences) according to the manufacturer's instructions. The following antibodies are used for staining: anti-CD8-APC-H7 (1:100), anti-CD4-BD-Horizon V450 (1:200), anti-CD3-Thy1.2-FITC (1:200) and incubated with Fcγ-block diluted 1:100. Aqua Dye is used to distinguish live/dead cells (Invitrogen). Cells are acquired using a Canto II flow cytometer (Beckton Dickinson). Flow cytometry data is analyzed using FlowJo software package (Tree Star, Inc.)


Example 5: Preparation of Lassa Virus Vaccine Compositions

For further in vivo vaccination experiments, different compositions of Lassa virus mRNA vaccine are prepared using constructs obtained in Example 1. One composition comprises protamine-complexed mRNA, one composition comprises mRNA that is formulated without protamine (“naked”), one composition comprises mRNA that is encapsulated in lipid nanoparticles (LNPs), and one composition comprises polymer-lipidoid complexed mRNA.


5.1. Preparation of Protamine Complexed mRNA (“Vaccine Composition 1”):


Lassa virus mRNA constructs are complexed as described in Example 4.


5.2. Preparation of “Naked” mRNA (“Vaccine Composition 2”):


Lassa virus mRNA constructs are formulated without protamine. The final concentration of the vaccine is adjusted with Ringer's lactate solution.


5.3. Preparation of LNP Encapsulated mRNA (“Vaccine Composition 3”):


A lipid nanoparticle (LNP)-encapsulated mRNA mixture is prepared using an ionizable amino lipid (cationic lipid), phospholipid, cholesterol and a PEGylated lipid. LNPs are prepared as follows. Cationic lipid, DSPC, cholesterol and PEG-lipid are solubilized in ethanol. Briefly, mRNA mixture is diluted to a total concentration of 0.05 mg/mL in 50 mM citrate buffer, pH 4. Syringe pumps are used to mix the ethanolic lipid solution with the mRNA mixture at a ratio of about 1:6 to 1:2 (vol/vol). The ethanol is then removed and the external buffer replaced with PBS by dialysis. Finally, the lipid nanoparticles are filtered through a 0.2 μm pore sterile filter. Lipid nanoparticle particle diameter size is determined by quasi-elastic light scattering using a Malvern Zetasizer Nano (Malvern, UK).


5.4. Preparation of Polymer-Lipidoid Complexed mRNA (“Vaccine Composition 4”):


20 mg peptide (CHHHHHHRRRRHHHHHHC—NH2; SEQ ID NO: 3450) TFA salt was dissolved in 2 mL borate buffer pH 8.5 and stirred at room temperature for approximately 18 h. Then, 12.6 mg PEG-SH 5000 (Sunbright) dissolved in N-methylpyrrolidone was added to the peptide solution and filled up to 3 mL with borate buffer pH 8.5. After 18 h incubation at room temperature, the reaction mixture was purified and concentrated by centricon procedure (MWCO 10 kDa), washed against water, and lyophilized. The obtained lyophilisate was dissolved in ELGA water and the concentration of the polymer was adjusted to 10 mg/mL. The obtained polyethylene glycol/peptide polymers (HO-PEG 5000-S—(S—CHHHHHHRRRRHHHHHHC—S-)7-S-PEG 5000-OH—peptide component: SEQ ID NO: 3450) were used for further formulation and are hereinafter referred to as PB83.


Preparation of 3-C12-OH Lipidoid:


First, lipidoid 3-C12 was obtained by acylation of tris(2-aminoethyl)amine with an activated lauric (C12) acid derivative, followed by reduction of the amide. Alternatively, it may be prepared by reductive amination with the corresponding aldehyde. Lipidoid 3-C12-OH was prepared by addition of the terminal C12 alkyl epoxide with the same oligoamine according to Love et al., pp. 1864-1869, PNAS, vol. 107 (2010), no. 5.


Preparation of Compositions with Nanoparticles of Polymer-Lipidoid Complexed mRNA:


First, ringer lactate buffer (RiLa; alternatively e.g. saline (NaCl) or PBS buffer may be used), respective amounts of lipidoid, and respective amounts of a polymer (PB83) were mixed to prepare compositions comprising a lipidoid and a peptide or polymer. Then, the carrier compositions were used to assemble nanoparticles with the mRNA by mixing the mRNA with respective amounts of polymer-lipidoid carrier and allowing an incubation period of 10 minutes at room temperature such as to enable the formation of a complex between the lipidoid, polymer and mRNA. In order to characterize the integrity of the obtained polymer-lipidoid complexed mRNA particles, RNA agarose gel shift assays were performed. In addition, size measurements were performed (gel shift assay, Zetasizer) to evaluate whether the obtained nanoparticles have a uniform size profile.


Example 6: Vaccination of Mice and Evaluation of Lassa Virus Specific Immune Response

6.1. Immunization:


Female BALB/c mice are injected intradermally (i.d.) and intramuscularly (i.m.) with respective mRNA vaccine compositions (prepared according to Example 5) with doses, application routes and vaccination schedules as indicated in Table 9. As a negative control, one group of mice is vaccinated with buffer (ringer lactate). All animals are vaccinated on day 1, 21 and 35. Blood samples are collected on day 21, 35, and 63 for the determination of binding and neutralizing antibody titers (see below). Splenocytes were isolated on day 49 for T-cell analysis.









TABLE 9







Vaccination regimen - Lassa virus experiment (Example 6)












Number

Route/
Vaccination


Group
of mice
Vaccine composition
Volume
Schedule (day)





1
10
40 μg Lassa virus RNA
i.d.
0/21/35




Composition 1
2 × 25 μl


2
10
40 μg Lassa virus RNA
i.m.
0/21/35




Composition 1
2 × 25 μl


3
10
20 μg Lassa virus RNA
i.d.
0/21/35




Composition 2
2 × 25 μl


4
10
20 μg Lassa virus RNA
i.m.
0/21/35




Composition 2
2 × 25 μl


5
10
10 μg Lassa virus RNA
i.d.
0/21/35




Composition 3
2 × 25 μl


6
10
10 μg Lassa virus RNA
i.m.
0/21/35




Composition 3
2 × 25 μl


7
10
40 μg Lassa virus RNA
i.d.
0/21/35




Composition 4
2 × 25 μl


8
10
40 μg Lassa virus RNA
i.m.
0/21/35




Composition 4
2 × 25 μl


9
10
100% RiLa
i. d.
0/21/35




Control
2 × 25 μl









6.2. Determination of Anti Lassa Virus Protein Antibodies by ELISA:


ELISA is performed using inactivated Lassa virus infected cell lysate or recombinant Lassa Glycoproteins for coating. Coated plates are incubated using respective serum dilutions, and binding of specific antibodies to the Lassa virus antigens are detected using biotinylated isotype specific anti-mouse antibodies followed by streptavidin-HRP (horse radish peroxidase) with ABTS as substrate. Endpoint titers of antibodies directed against the Lassa virus antigens are measured by ELISA on day 63 after three vaccinations.


6.3. Intracellular Cytokine Staining:


Splenocytes from vaccinated mice are isolated according to a standard protocol known in the art. Briefly, isolated spleens are grinded through a cell strainer and washed in PBS/1% FBS followed by red blood cell lysis. After an extensive washing step with PBS/1% FBS splenocytes are seeded into 96-well plates (2×106 cells per well). The cells are stimulated with recombinant LASV virus in the presence of 2.5 μg/ml of an anti-CD28 antibody (BD Biosciences) and α-CD107a-PE-Cy7 (1:100) for 24 hours at 37° C. After stimulation, cells are washed and stained for intracellular cytokines using the Cytofix/Cytoperm reagent (BD Biosciences) according to the manufacturer's instructions. The following antibodies are used for staining: anti-CD8-APC-H7 (1:100), anti-CD4-BD-Horizon V450 (1:200), anti-CD3-Thy1.2-FITC (1:200) and incubated with Fcγ-block diluted 1:100. Aqua Dye is used to distinguish live/dead cells (Invitrogen). Cells are acquired using a Canto II flow cytometer (Beckton Dickinson). Flow cytometry data is analyzed using FlowJo software package (Tree Star, Inc.)


6.4. Lassa Virus Plaque Reduction Neutralization Test (PRNT50):


Sera are analyzed by a plaque reduction neutralization test (PRNT50), performed as commonly known in the art. Briefly, obtained serum samples of vaccinated mice are incubated with Lassa virus. That mixture is used to infect cultured cells, and the reduction in the number of plaques is determined.


Example 7: Vaccination of Mice and Evaluation of Lassa Virus Specific Immune Response

7.1. Immunization:


Female BALB/c mice are injected intramuscularly (i.m.) with LNP encapsulated mRNA R6717 coding for GPC (LASV Josiah, SEQ ID NO: 5013, see Table 6, Example 1, “mRNA design 1” and“vaccine composition 3” prepared according to Example 5) with doses, application routes and vaccination schedules as indicated in Table 10. As a negative control, one group of mice is vaccinated with buffer (ringer lactate). All animals are vaccinated on day 1 and 21. Blood samples are collected on day 21 and 35 for the determination of binding and neutralizing antibody titers (see below). Splenocytes were isolated on day 35 for T-cell analysis.









TABLE 10







Vaccination regimen - Lassa virus experiment (Example 7)












Number

Route/
Vaccination


Group
of mice
Vaccine composition
Volume
Schedule (day)





1
6
5 μg Lassa virus RNA
i.d.
0/21




GPC (Josiah; R6717)
2 × 25 μl




vaccine composition 3


2
6
2 μg Lassa virus RNA
i.m.
0/21




GPC (Josiah; R6717)
2 × 25 μl




vaccine composition 3


3
6
1 μg Lassa virus RNA
i.d.
0/21




GPC (Josiah; R6717)
2 × 25 μl




vaccine composition 3


4
6
100% RiLa
i.m.
0/21




Control
2 × 25 μl









7.2. Determination of Anti Lassa Virus Protein Antibodies by ELISA:


ELISA is performed using inactivated Lassa virus infected cell lysate for coating. Coated plates are incubated using respective serum dilutions, and binding of specific antibodies to the Lassa virus antigens are detected using biotinylated isotype specific anti-mouse antibodies followed by streptavidin-HRP (horse radish peroxidase) with ABTS as substrate. Endpoint titers of antibodies directed against the Lassa virus antigens are measured by ELISA on day 35 after two vaccinations.


7.3. Intracellular Cytokine Staining:


Splenocytes from vaccinated mice are isolated according to a standard protocol known in the art. Briefly, isolated spleens are grinded through a cell strainer and washed in PBS/1% FBS followed by red blood cell lysis. After an extensive washing step with PBS/1% FBS splenocytes are seeded into 96-well plates (2×106 cells per well). The cells are stimulated with recombinant LASV virus in the presence of 2.5 μg/ml of an anti-CD28 antibody (BD Biosciences) and α-CD107a-PE-Cy7 (1:100) for 24 hours at 37° C. After stimulation, cells are washed and stained for intracellular cytokines using the Cytofix/Cytoperm reagent (BD Biosciences) according to the manufacturer's instructions. The following antibodies are used for staining: anti-CD8-APC-H7 (1:100), anti-CD4-BD-Horizon V450 (1:200), anti-CD3-Thy1.2-FITC (1:200) and incubated with Fcγ-block diluted 1:100. Aqua Dye is used to distinguish live/dead cells (Invitrogen). Cells are acquired using a Canto II flow cytometer (Beckton Dickinson). Flow cytometry data is analyzed using FlowJo software package (Tree Star, Inc.)


7.4. Lassa Virus Plaque Reduction Neutralization Test (PRNT50):


Sera are analyzed by a plaque reduction neutralization test (PRNT50), performed as commonly known in the art. Briefly, obtained serum samples of vaccinated mice are incubated with Lassa virus. That mixture is used to infect cultured cells, and the reduction in the number of plaques is determined.


Example 8: Clinical Development of a Lassa Virus mRNA Vaccine Composition

To demonstrate safety and efficiency of the Lassa virus mRNA vaccine composition, a clinical trial (phase I) is initiated.


In the clinical trial, a cohort of human volunteers is intradermally or intramuscularly injected for at least two times.


In order to assess the safety profile of the vaccine compositions according to the invention, subjects are monitored after administration (vital signs, vaccination site tolerability assessments, hematologic analysis). The efficacy of the immunization is analyzed by determination of virus neutralizing titers (VNT) in sera from vaccinated subjects. Blood samples are collected on day 0 as baseline and after completed vaccination. Sera are analyzed for virus neutralizing antibodies.

Claims
  • 1. mRNA comprising at least one coding region, said coding region encoding at least one antigenic peptide or protein derived from a protein of a lassa virus, or a variant or fragment thereof.
  • 2. The mRNA according to claim 1 usable as a vaccine.
  • 3. The mRNA according to claim 1 or 2, wherein the lassa virus is selected from lassa virus clade I, II, III and/or IV or from lassa virus lineage I, II, III and/or IV.
  • 4. The mRNA according to any one of claims 1 to 3, wherein said coding region encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, and/or at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus, or a variant or fragment thereof, and/or at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof, or a combination thereof.
  • 5. The mRNA according to any one of claims 1 to 4, wherein the mRNA is an artificial mRNA.
  • 6. The mRNA according to any one of claims 1 to 5, wherein the mRNA is a modified mRNA, preferably a stabilized mRNA.
  • 7. The mRNA according to any one of claims 1 to 6, wherein the at least one coding region encodes at least one full-length glycoprotein precursor (GPC) of a lassa virus, or a variant or fragment thereof, and/or at least one full-length nucleoprotein (NP) of a lassa virus, and/or at least one full-length zinc-binding matrix protein (Z) of a lassa virus, or a variant or fragment thereof.
  • 8. The mRNA according to any one of claims 1 to 7, wherein the mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus comprising or consisting of an amino acid sequence as defined by SEQ ID Nos: 1-186, or a variant or fragment of any of these sequences.
  • 9. The mRNA according to claim 8, wherein the coding region comprises an RNA sequence as defined by any of SEQ ID NOs: 376-561, or a variant or fragment of any of these sequences.
  • 10. The mRNA according to claim 8 or 9, wherein said the coding region comprises an RNA sequence selected from RNA sequences as defined by any of SEQ ID NOs: 751-936, 1126-1311, 1501-1686, 1876-2061, 2251-2436, 2626-2811 or 3001-3186, or a variant or fragment of any of these sequences.
  • 11. The mRNA according to any one of claims 1 to 10, wherein the mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from nucleoprotein (NP) of a lassa virus comprising or consisting of an amino acid sequence as defined by any one of SEQ ID NOs. 187-375 or a variant or fragment of any of these sequences.
  • 12. The mRNA according to claim 11, wherein the coding region comprises an RNA sequence as defined by any of SEQ ID NOs: 562-750, or a variant or fragment of any of these sequences.
  • 13. The mRNA according to claim 10 or 12, wherein said the coding region comprises an RNA sequence selected from RNA sequences as defined by any of SEQ ID NOs: 937-1125, 1312-1500, 1687-1875, 2062-2250, 2437-2625, 2812-3000 or 3187-3375, or a variant or fragment of any of these sequences.
  • 14. The mRNA according to any one of claims 1 to 13, wherein the mRNA comprises at least one coding region encoding at least one antigenic peptide or protein derived from zinc-binding matrix protein (Z) of a lassa virus comprising or consisting of an amino acid sequence as defined by any one of SEQ ID NOs. 3451-3603 or a variant or fragment of any of these sequences.
  • 15. The mRNA according to claim 14, wherein the coding region comprises an RNA sequence as defined by any of SEQ ID NOs: 3604-3756, or a variant or fragment of any of these sequences.
  • 16. The mRNA according to claim 14 or 15, wherein said the coding region comprises an RNA sequence selected from RNA sequences as defined by any of SEQ ID NOs: 3757-3909, 3910-4062, 4063-4215, 4216-4368, 4369-4521, 4522-4674, or 4675-4827, or a variant or fragment of any of these sequences.
  • 17. The mRNA according to any one of claims 1 to 16, wherein the G/C content of the coding region of the mRNA is increased compared to the G/C content of the corresponding coding sequence of the wild type mRNA, or wherein the C content of the coding region of the mRNA is increased compared to the C content of the corresponding coding sequence of the wild type mRNA, or wherein the codon usage in the coding region of the mRNA is adapted to the human codon usage, or wherein the codon adaptation index (CAI) is increased or maximised in the coding region of the mRNA, wherein the encoded amino acid sequence of the mRNA is preferably not being modified compared to the encoded amino acid sequence of the wild type mRNA.
  • 18. The mRNA according to any of claims 1 to 17 comprising additionally a) a 5′-cap structure,b) a poly(A) sequence,c) and optionally a poly (C) sequence.
  • 19. The mRNA according to claim 18, wherein the poly(A) sequence comprises a sequence of about 25 to about 400 adenosine nucleotides, preferably a sequence of about 50 to about 400 adenosine nucleotides, more preferably a sequence of about 50 to about 300 adenosine nucleotides, even more preferably a sequence of about 50 to about 250 adenosine nucleotides, most preferably a sequence of about 60 to about 250 adenosine nucleotides.
  • 20. The mRNA according to any of claims 1 to 19 comprising additionally at least one histone stem-loop.
  • 21. The mRNA according to claim 20, wherein the at least one histone stem-loop comprises a nucleic acid sequence according to the following formulae (I) or (II): formula (I) (stem-loop sequence without stem bordering elements):
  • 22. The mRNA according to claim 20 or 21, wherein the at least one histone stem-loop comprises a nucleic acid sequence according to the following formulae (Ia) or (IIa): formula (Ia) (stem-loop sequence without stem bordering elements):
  • 23. The mRNA according to any one of claims 20 to 22, wherein the at least one histone stem loop comprises a nucleic acid sequence according to SEQ ID NO: 3394 and most preferably a RNA sequence according to SEQ ID NO: 3395.
  • 24. The mRNA according to any one of claims 1 to 23, wherein the mRNA comprises a poly(A) sequence, preferably comprising 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides, and/or a poly(C) sequence, preferably comprising 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides.
  • 25. The mRNA according to any one of claims 1 to 24, wherein the mRNA comprises, preferably in 5′ to 3′ direction, the following elements: a.) a 5′-cap structure, preferably m7GpppN,b.) at least one coding region encoding at least one antigenic peptide or protein derived from a protein of a lassa virus or a variant or fragment thereof,c.) a poly(A) tail, preferably consisting of 10 to 200, 10 to 100, 40 to 80 or 50 to 70 adenosine nucleotides,d.) optionally a poly(C) tail, preferably consisting of 10 to 200, 10 to 100, 20 to 70, 20 to 60 or 10 to 40 cytosine nucleotides, ande.) optionally a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395.
  • 26. The mRNA according to any one of claims 1 to 25 comprising additionally a 3′-UTR element.
  • 27. The mRNA according to claim 26, wherein the at least one 3′-UTR element comprises or consists of a sequence element according to SEQ ID NO: 6412, 6413, 6314 or 6415 or a homolog, a fragment or a variant thereof.
  • 28. The mRNA according to claims 1 to 27, wherein the RNA comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5884-6069, 6070-6259 or 6259-6411 (column 7 F “mRNA design 3”, Table 1-3) or a fragment or variant of any of these sequences.
  • 29. The mRNA according to claim 26 or 27, wherein the at least one 3′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of a gene providing a stable mRNA or from a homolog, a fragment or a variant thereof.
  • 30. The mRNA according to claim 29, wherein the 3′-UTR element comprises or consists of a nucleic acid sequence derived from a 3′-UTR of a gene selected from the group consisting of an albumin gene, an alpha-globin gene, a beta-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen alpha gene, or from a homolog, a fragment or a variant thereof.
  • 31. The mRNA according to any one of claims 29 to 30, wherein the 3′-UTR element comprises a nucleic acid sequence derived from a 3′-UTR of an alpha-globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, a homolog, a fragment, or a variant thereof.
  • 32. The mRNA according to any of claims 1 to 31, wherein the mRNA comprises, preferably in 5′- to 3′-direction: a.) a 5′-cap structure, preferably m7GpppN;b.) at least one coding region encoding at least one antigenic peptide or protein derived from a protein of a lassa virus or a variant or fragment thereof as defined in any one of the preceding claims,c.) a 3′-UTR element comprising or consisting of a nucleic acid sequence which is derived from an alpha globin gene, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3386, a homolog, a fragment or a variant thereof;d.) optionally, a poly(A) sequence, preferably comprising 64 adenosines;e.) optionally, a poly(C) sequence, preferably comprising 30 cytosines; andf.) optionally, a histone stem-loop, preferably comprising the RNA sequence according to SEQ ID NO: 3395.
  • 33. The mRNA according to claims 1 to 32, wherein the RNA comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 4828-5013, 5014-5202 or 5203-5355 (column 5 D “mRNA design 1”, Table 1-3) or a fragment or variant of any of these sequences.
  • 34. The mRNA according to claim 26, wherein the at least one 3′-UTR element comprises a nucleic acid sequence, which is derived from the 3′-UTR of a vertebrate albumin gene or from a variant thereof, preferably from the 3′-UTR of a mammalian albumin gene or from a variant thereof, more preferably from the 3′-UTR of a human albumin gene or from a variant thereof, even more preferably from the 3′ UTR of the human albumin gene according to GenebankGenebank Accession number NM_000477.5, or from a variant or fragment thereof.
  • 35. The mRNA according to claim 32, wherein the 3-′UTR element is derived from a nucleic acid sequence according to SEQ ID NO: 3390 or 3392, preferably from a corresponding RNA sequence, or a homolog, a fragment or a variant thereof.
  • 36. The mRNA according to any one of claims 1 to 35, wherein the mRNA sequence comprises a 5′-UTR element.
  • 37. The mRNA according to claim 36, wherein the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a TOP gene preferably from a corresponding RNA sequence, a homolog, a fragment, or a variant thereof, preferably lacking the 5′TOP motif.
  • 38. The mRNA according to claim 37, wherein the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a TOP gene encoding a ribosomal protein, preferably from a corresponding RNA sequence or from a homolog, a fragment or a variant thereof, preferably lacking the 5′TOP motif.
  • 39. The mRNA according to claim 38, wherein the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a TOP gene encoding a ribosomal Large protein (RPL) or from a homolog, a variant or fragment thereof, preferably lacking the 5′TOP motif and more preferably comprising or consisting of a corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3376.
  • 40. The mRNA according to claim 39, wherein the 5′-UTR element which is derived from a 5′-UTR of a TOP gene comprises or consists of a corresponding RNA sequence of a nucleic acid sequence according to SEQ ID NO: 3377.
  • 41. The mRNA according to any one of claims 1 to 26 and 34 to 40, wherein the mRNA comprises, preferably in 5′- to 3′-direction: a.) a 5′-cap structure, preferably m7GpppN;b.) a 5′-UTR element which comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a TOP gene, preferably comprising or consisting of the corresponding RNA sequence of a nucleic acid sequence according to SEQ ID NO: 3376 or 3378, a homolog, a fragment or a variant thereof;c.) at least one coding region encoding at least one antigenic peptide or protein derived from a protein of a lassa virus or a variant or fragment thereof as defined in any one of the preceding claims,d.) a 3′-UTR element comprising or consisting of a nucleic acid sequence which is derived from a gene providing a stable mRNA, preferably comprising or consisting of the corresponding RNA sequence of a nucleic acid sequence according to SEQ ID NO: 3390 or 3392, a homolog, a fragment or a variant thereof;e.) a poly(A) sequence preferably comprising 64 adenosines;f.) optionally a poly(C) sequence, preferably comprising 30 cytosines; andg.) optionally a histone-stem-loop, preferably comprising the corresponding RNA sequence of the nucleic acid sequence according to SEQ ID NO: 3395.
  • 42. The mRNA according to any one of claims 1 to 26 and 34 to 41, wherein the RNA comprises or consists of an RNA sequence which is identical or at least 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 5356-5541, 5542-5730 or 5731-5883 (column 5 E “mRNA design 2”, Table 1-3) or a fragment or variant of any of these sequences.
  • 43. Composition comprising at least one mRNA according to any one of claims 1 to 42 and a pharmaceutically acceptable carrier.
  • 44. The composition according to claim 43, wherein the at least one mRNA is complexed with one or more cationic or polycationic compounds, preferably with cationic or polycationic polymers, cationic or polycationic peptides or proteins, e.g. protamine, cationic or polycationic polysaccharides and/or cationic or polycationic lipids.
  • 45. The composition according to claim 44, wherein the N/P ratio of the at least one mRNA to the one or more cationic or polycationic compounds is in the range of about 0.1 to 20, including a range of about 0.3 to 4, of about 0.5 to 2, of about 0.7 to 2 and of about 0.7 to 1.5.
  • 46. The composition according to claim 44 or 45, wherein the at least one mRNA is complexed with one or more cationic or polycationic compounds in a weight ratio selected from a range of about 6:1 (w/w) to about 0.25:1 (w/w), more preferably from about 5:1 (w/w) to about 0.5:1 (w/w), even more preferably of about 4:1 (w/w) to about 1:1 (w:w) or of about 3:1 (w/w) to about 1:1 (w/w), and most preferably a ratio of about 3:1 (w/w) to about 2:1 (w/w) of mRNA to cationic or polycationic compound and/or with a polymeric carrier; or optionally in a nitrogen/phosphate ratio of mRNA to cationic or polycationic compound and/or polymeric carrier in the range of about 0.1-10, preferably in a range of about 0.3-4 or 0.3-1, and most preferably in a range of about 0.5-1 or 0.7-1, and even most preferably in a range of about 0.3-0.9 or 0.5-0.9.
  • 47. The composition according to any one of claims 43 to 46 comprising the at least one mRNA, which is complexed with one or more cationic or polycationic compounds, and at least one free (m)RNA.
  • 48. The composition according to claim 47, wherein the at least one complexed mRNA is identical to the at least one free (m)RNA.
  • 49. The composition according to claim 47 or 48, wherein the molar ratio of the complexed mRNA to the free (m)RNA is selected from a molar ratio of about 0.001:1 to about 1:0.001, including a ratio of about 1:1.
  • 50. The composition according to any one of claims 47 to 49, wherein the ratio of the complexed mRNA to the free (m)RNA is selected from a range of about 5:1 (w/w) to about 1:10 (w/w), more preferably from a range of about 4:1 (w/w) to about 1:8 (w/w), even more preferably from a range of about 3:1 (w/w) to about 1:5 (w/w) or 1:3 (w/w), and most preferably the ratio of the complexed mRNA to the free (m)RNA is selected from a ratio of 1:1 (w/w).
  • 51. The composition according to any one of claims 43 to 50, wherein the mRNA is complexed or associated with one or more lipids, thereby forming liposomes, lipid nanoparticles and/or lipoplexes.
  • 52. The composition according to any one of claims 4336 to 51, wherein the composition comprises at least one adjuvant.
  • 53. The composition according to claim 52, wherein the adjuvant is selected from the group consisting of: cationic or polycationic compounds, comprising cationic or polycationic peptides or proteins, including protamine, nucleoline, spermin or spermidine, poly-L-lysine (PLL), poly-arginine, basic polypeptides, cell penetrating peptides (CPPs), including HIV-binding peptides, Tat, HIV-1 Tat (HIV), Tat-derived peptides, Penetratin, VP22 derived or analog peptides, HSV VP22 (Herpes simplex), MAP, KALA or protein transduction domains (PTDs, PpT620, proline-rich peptides, arginine-rich peptides, lysine-rich peptides, MPG-peptide(s), Pep-1, L-oligomers, Calcitonin peptide(s), Antennapedia-derived peptides (particularly from Drosophila antennapedia), pAntp, pIsl, FGF, Lactoferrin, Transportan, Buforin-2, Bac715-24, SynB, SynB(1), pVEC, hCT-derived peptides, SAP, protamine, spermine, spermidine, or histones, cationic polysaccharides, including chitosan, polybrene, cationic polymers, including polyethyleneimine (PEI), cationic lipids, including DOTMA: [1-(2,3-sioleyloxy)propyl)]-N,N,N-trimethylammonium chloride, DMRIE, di-C14-amidine, DOTIM, SAINT, DC-Chol, BGTC, CTAP, DOPC, DODAP, DOPE: Dioleyl phosphatidylethanol-amine, DOSPA, DODAB, DOIC, DMEPC, DOGS: Dioctadecylamidoglicylspermin, DIMRI: Dimyristo-oxypropyl dimethyl hydroxyethyl ammonium bromide, DOTAP: dioleoyloxy-3-(trimethylammonio)propane, DC-6-14: O,O-ditetradecanoyl-N-(α-trimethylammonioacetyl)diethanolamine chloride, CLIP1: rac-[(2,3-dioctadecyloxypropyl)(2-hydroxyethyl)]-dimethylammonium chloride, CLIP6: rac-[2(2,3-dihexadecyloxypropyl-oxymethyloxy)ethyl]-trimethylammonium, CLIP9: rac-[2(2,3-dihexadecyloxypropyl-oxysuccinyloxy)ethyl]-trimethylammonium, oligofectamine, or cationic or polycationic polymers, including modified polyaminoacids, including β-aminoacid-polymers or reversed polyamides, modified polyethylenes, including PVP (poly(N-ethyl-4-vinylpyridinium bromide)), modified acrylates, including pDMAEMA (poly(dimethylaminoethyl methylacrylate)), modified Amidoamines including pAMAM (poly(amidoamine)), modified polybetaaminoester (PBAE), including diamine end modified 1,4 butanediol diacrylate-co-5-amino-1-pentanol polymers, dendrimers, including polypropylamine dendrimers or pAMAM based dendrimers, polyimine(s), including PEI: poly(ethyleneimine), poly(propyleneimine), polyallylamine, sugar backbone based polymers, including cyclodextrin based polymers, dextran based polymers, Chitosan, etc., silan backbone based polymers, such as PMOXA-PDMS copolymers, etc., block polymers consisting of a combination of one or more cationic blocks selected from a cationic polymer as mentioned before, and of one or more hydrophilic- or hydrophobic blocks (e.g polyethyleneglycole); orcationic or polycationic proteins or peptides, selected from the following proteins or peptides having the following total formula (III): (Arg)l;(Lys)m;(His)n;(Orn)o;(Xaa)x, wherein l+m+n+o+x=8-15, and l, m, n or o independently of each other may be any number selected from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15, provided that the overall content of Arg, Lys, His and Orn represents at least 50% of all amino acids of the oligopeptide; and Xaa may be any amino acid selected from native (=naturally occurring) or non-native amino acids except from Arg, Lys, His or Orn; and x may be any number selected from 0, 1, 2, 3 or 4, provided, that the overall content of Xaa does not exceed 50% of all amino acids of the oligopeptide; ornucleic acids having the formula (V): GlXmGn, wherein: G is guanosine (guanine), uridine (uracil) or an analogue of guanosine (guanine) or uridine (uracil); X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of the above-mentioned nucleotides (nucleosides); l is an integer from 1 to 40, wherein, when l=1 G is guanosine (guanine) or an analogue thereof, when l>1 at least 50% of the nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof; m is an integer and is at least 3; wherein when m=3 X is uridine (uracil) or an analogue thereof, when m>3 at least 3 successive uridines (uracils) or analogues of uridine (uracil) occur; n is an integer from 1 to 40, wherein when n=1 G is guanosine (guanine) or an analogue thereof, when n>1 at least 50% of the nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof;ornucleic acids having the formula (VI): ClXmCn, wherein: C is cytidine (cytosine), uridine (uracil) or an analogue of cytidine (cytosine) or uridine (uracil); X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of the above-mentioned nucleotides (nucleosides); I is an integer from 1 to 40, wherein when l=1 C is cytidine (cytosine) or an analogue thereof, when l>1 at least 50% of the nucleotides (nucleosides) are cytidine (cytosine) or an analogue thereof; m is an integer and is at least 3; wherein when m=3 X is uridine (uracil) or an analogue thereof, when m>3 at least 3 successive uridine (uracils) or analogues of uridine (uracil) occur; n is an integer from 1 to 40, wherein when n=1 C is cytidine (cytosine) or an analogue thereof, when n>1 at least 50% of the nucleotides (nucleosides) are cytidine (cytosine) or an analogue thereof;oradjuvants selected from the group consisting of:TDM, MDP, muramyl dipeptide, pluronics, alum solution, aluminium hydroxide, ADJUMER™ (polyphosphazene); aluminium phosphate gel; glucans from algae; algammulin; aluminium hydroxide gel (alum); highly protein-adsorbing aluminium hydroxide gel; low viscosity aluminium hydroxide gel; AF or SPT (emulsion of squalane (5%), Tween 80 (0.2%), Pluronic L121 (1.25%), phosphate-buffered saline, pH 7.4); AVRIDINE™ (propanediamine); BAY R1005™ ((N-(2-deoxy-2-L-leucylamino-b-D-glucopyranosyl)-N-octadecyl-dodecanoyl-amide hydroacetate); CALCITRIOL™ (1-alpha,25-dihydroxy-vitamin D3); calcium phosphate gel; CAP™ (calcium phosphate nanoparticles); cholera holotoxin, cholera-toxin-A1-protein-A-D-fragment fusion protein, sub-unit B of the cholera toxin; CRL 1005 (block copolymer P1205); cytokine-containing liposomes; DDA (dimethyldioctadecylammonium bromide); DHEA (dehydroepiandrosterone); DMPC (dimyristoylphosphatidylcholine); DMPG (dimyristoylphosphatidylglycerol); DOC/alum complex (deoxycholic acid sodium salt); Freund's complete adjuvant; Freund's incomplete adjuvant; gamma inulin; Gerbu adjuvant (mixture of: i)N-acetylglucosaminyl-(P1-4)-N-acetylmuramyl-L-alanyl-D-glutamine (GMDP), ii) dimethyldioctadecylammonium chloride (DDA), iii) zinc-L-proline salt complex (ZnPro-8); GM-CSF); GMDP (N-acetylglucosaminyl-(b1-4)-N-acetylmuramyl-L-alanyl-D-isoglutamine); imiquimod (1-(2-methypropyl)-1H-imidazo[4,5-c]quinoline-4-amine); ImmTher™ (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-glycerol dipalmitate); DRVs (immunoliposomes prepared from dehydration-rehydration vesicles); interferon-gamma; interleukin-ibeta; interleukin-2; interleukin-7; interleukin-12; ISCOMS™; ISCOPREP 7.0.3.™; liposomes; LOXORIBINE™ (7-allyl-8-oxoguanosine); LT oral adjuvant (E. coli labile enterotoxin-protoxin); microspheres and microparticles of any composition; MF59™; (squalene-water emulsion); MONTANIDE ISA 51™ (purified incomplete Freund's adjuvant); MONTANIDE ISA 720™ (metabolisable oil adjuvant); MPL™ (3-Q-desacyl-4′-monophosphoryl lipid A); MTP-PE and MTP-PE liposomes ((N-acetyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1,2-dipalmitoyl-sn-glycero-3-(hydroxyphosphoryloxy))-ethylamide, monosodium salt); MURAMETIDE™ (Nac-Mur-L-Ala-D-Gln-OCH3); MURAPALMITINE™ and D-MURAPALMITINE™ (Nac-Mur-L-Thr-D-isoGIn-sn-glyceroldipalmitoyl); NAGO (neuraminidase-galactose oxidase); nanospheres or nanoparticles of any composition; NISVs (non-ionic surfactant vesicles); PLEURAN™ (β-glucan); PLGA, PGA and PLA (homo- and co-polymers of lactic acid and glycolic acid; microspheres/nanospheres); PLURONIC L121™; PMMA (polymethyl methacrylate); PODDS™ (proteinoid microspheres); polyethylene carbamate derivatives; poly-rA: poly-rU (polyadenylic acid-polyuridylic acid complex); polysorbate 80 (Tween 80); protein cochleates (Avanti Polar Lipids, Inc., Alabaster, Ala.); STIMULON™ (QS-21); Quil-A (Quil-A saponin); S-28463 (4-amino-otec-dimethyl-2-ethoxymethyl-1H-imidazo[4,5 c]quinoline-1-ethanol); SAF-1™ (“Syntex adjuvant formulation”); Sendai proteoliposomes and Sendai-containing lipid matrices; Span-85 (sorbitan trioleate); Specol (emulsion of Marcol 52, Span 85 and Tween 85); squalene or Robane® (2,6,10,15,19,23-hexamethyltetracosan and 2,6,10,15,19,23-hexamethyl-2,6,10,14,18,22-tetracosahexane); stearyltyrosine (octadecyltyrosine hydrochloride); Theramid® (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-dipalmitoxypropylamide); Theronyl-MDP (Termurtide™ or [thr 1]-MDP; N-acetylmuramyl-L-threonyl-D-isoglutamine); Ty particles (Ty-VLPs or virus-like particles); Walter-Reed liposomes (liposomes containing lipid A adsorbed on aluminium hydroxide), and lipopeptides, including Pam3Cys, in particular aluminium salts, such as Adju-phos, Alhydrogel, Rehydragel; emulsions, including CFA, SAF, IFA, MF59, Provax, TiterMax, Montanide, Vaxfectin; copolymers, including Optivax (CRL1005), L121, Poloaxmer4010), etc.; liposomes, including Stealth, cochleates, including BIORAL; plant derived adjuvants, including QS21, Quil A, Iscomatrix, ISCOM; adjuvants suitable for costimulation including Tomatine, biopolymers, including PLG, PMM, Inulin; microbe derived adjuvants, including Romurtide, DETOX, MPL, CWS, Mannose, CpG nucleic acid sequences, CpG7909, ligands of human TLR 1-10, ligands of murine TLR 1-13, ISS-1018, IC31, Imidazoquinolines, Ampligen, Ribi529, IMOxine, IRIVs, VLPs, cholera toxin, heat-labile toxin, Pam3Cys, Flagellin, GPI anchor, LNFPIII/Lewis X, antimicrobial peptides, UC-1V150, RSV fusion protein, cdiGMP; and adjuvants suitable as antagonists including CGRP neuropeptide.
  • 54. The composition according to any one of claims 43 to 53, wherein the composition comprises a plurality or more than one of the mRNAs each defined in any one of claims 1 to 42.
  • 55. The composition according to claim 54, wherein each of the mRNAs comprises a coding region encoding at least one different antigenic peptide or protein derived from proteins of the same lassa virus.
  • 56. The composition according to claim 55, wherein each of the mRNAs comprises a coding region which encodes at least one different antigenic peptide or protein derived from different proteins of the same lassa virus.
  • 57. The composition according to claim 55, wherein each of the mRNAs comprises a coding region which encodes at least one different antigenic peptide or protein derived from different proteins of different lassa viruses.
  • 58. The composition according to any one of claims 54 to 57, wherein each of the mRNAs comprises a coding region which encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) or nucleoprotein (NP) or zinc-binding matrix protein (Z) of a lassa virus or a variant or fragment thereof.
  • 59. The composition according to any one of claims 54 to 57, wherein each of the mRNAs comprises a coding region which encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) and nucleoprotein (NP) of a lassa virus or a variant or fragment thereof.
  • 60. The composition according to any one of claims 54 to 57, wherein each of the mRNAs comprises a coding region which encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) and zinc-binding matrix protein (Z) of a lassa virus or a variant or fragment thereof.
  • 61. The composition according to any one of claims 54 to 57, wherein each of the mRNAs comprises a coding region which encodes at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) and nucleoprotein (NP) and zinc-binding matrix protein (Z) of a lassa virus or a variant or fragment thereof.
  • 62. The composition according to claims 54 to 61, wherein at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) and/or nucleoprotein (NP) and/or zinc-binding matrix protein (Z) of 2, 3, 4, 5, 6, 7, 6, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, or 100 different lassa viruses are encoded by the plurality of mRNAs.
  • 63. The composition according to claims 57 to 62, wherein the different lassa viruses are belonging to different lassa virus clades or different lassa virus lineages, preferably to the lassa virus clades I, II, III and IV or to the lassa virus lineages I, II, III and IV.
  • 64. The composition according to claim 62 or 63, wherein the composition comprises at least one mRNA encoding at least one antigenic peptide or protein derived from glycoprotein precursor (GPC) of a lassa virus and at least one mRNAs encoding at least one antigenic peptide or protein derived from nucleoprotein (NP) of the same lassa virus.
  • 65. Vaccine comprising the mRNA according to any one of claims 1 to 42 and a pharmaceutically acceptable carrier, or comprising the composition according to any one of claims 43 to 64.
  • 66. The vaccine according to claim 65, wherein the mRNA according to any one of claims 1 to 42 or the composition according to any one of claims 43 to 65 elicits an adaptive immune response.
  • 67. The vaccine according to claim 65 or 66 comprising an adjuvant.
  • 68. Kit or kit of parts comprising the components of the mRNA according to any one of claims 1 to 42, the composition according to any one of claims 43 to 64, the vaccine according to any one of claims 65 to 67 and optionally technical instructions with information on the administration and dosage of the components.
  • 69. mRNA as defined according to any one of claims 1 to 42, the composition according to any one of claims 43 to 64, the vaccine according to any one of claims 65 to 67, the kit or kit of parts according to claim 68 for use as a medicament.
  • 70. mRNA as defined according to any one of claims 1 to 42, the composition according to any one of claims 43 to 64, the vaccine according to any one of claims 65 to 67, the kit or kit of parts according to claim 68 for use in the treatment or prophylaxis of lassa infections or disorders related thereto.
  • 71. mRNA, composition, vaccine or kit or kit of parts for use according to claim 69 or 70, wherein the mRNA, the composition, the vaccine or the kit or kit of parts is administered by subcutaneous, intramuscular or intradermal injection, preferably by intramuscular or intradermal injection, more preferably by intradermal injection.
  • 72. mRNA, composition, vaccine and kit or kit of parts for use according to claim 71, wherein the injection is carried out by using conventional needle injection or jet injection, preferably by using jet injection.
  • 73. A method of treatment or prophylaxis of lassa infections comprising the steps: a) providing the mRNA as defined according to any one of claims 1 to 42, the composition according to any one of claims 43 to 65, the vaccine according to any one of claims 65 to 67, the kit or kit of parts according to claim 68;b) applying or administering the mRNA, the composition, the vaccine or the kit or kit of parts to a tissue or an organism.
  • 74. The method according to claim 73, wherein the mRNA, the composition, the vaccine or the kit or kit of parts is administered to the tissue or to the organism by subcutaneous, intramuscular or intradermal injection, preferably by intramuscular or intradermal injection, more preferably by intradermal injection.
  • 75. The method according to claim 74, wherein the injection is carried out by using conventional needle injection or jet injection, preferably by using jet injection.
Priority Claims (1)
Number Date Country Kind
PCT/EP2016/082605 Dec 2016 EP regional
PCT Information
Filing Document Filing Date Country Kind
PCT/EP2017/084562 12/22/2017 WO 00